JP4036665B2

JP4036665B2 - Image encoding method and image decoding method

Info

Publication number: JP4036665B2
Application number: JP2002075296A
Authority: JP
Inventors: 典男伊藤; 伸也長谷川; 寛草尾; 裕之堅田; 友子青野
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1997-09-19
Filing date: 2002-03-19
Publication date: 2008-01-23
Anticipated expiration: 2018-06-17
Also published as: JP2002359745A

Description

【０００１】
【発明の属する技術分野】
本発明は、ディジタル画像処理の分野に属し、画像データを高能率に符号化する画像符号化方法及びこの画像符号化方法で符号化された符号化データを復号する画像復号方法に関するものである。
【０００２】
【従来の技術】
自然画像をデジタルデータに変換してコンピュータ処理するための画像フォーマットとして、フラッシュ・ピックスフォーマット（ＦｌａｓｈＰｉｘＦｏｒｍａｔＳｐｅｃｉｆｉｃａｔｉｏｎＶｅｒｓｉｏｎ１．０）が提案されている。
【０００３】
このフォーマットでは、表示・印刷装置の能力やユーザーの要求に応じて必要な解像度のデータを素早く取り出すために、複数の解像度のデータを同時に保持している。また、画像の拡大縮小や編集の際に画像データ内の必要な部分だけを処理することで負荷を軽減できるよう、画像をタイル単位に分割して保持している。
【０００４】
フラッシュ・ピックスフォーマットに従って画像を符号化する符号化装置について、図３２を用いて説明する。図３２（ａ）は画像の縮小及びタイル分割を示す図であり、図３２（ｂ）は符号化装置の一例を示すブロック図である。
【０００５】
フラッシュ・ピックスでは最初に図３２（ａ）の画像１〜４に示す１／１〜１／８サイズの画像を生成し、各画像１〜４に対してそれぞれタイル分割及び圧縮を行うという点に特徴がある。
【０００６】
まず、図３２（ａ）の画像１を図３２（ｂ）の符号化装置で符号化する場合について説明する。ここで、図３２（ａ）の画像１〜４の破線はタイルの境界を表わしている。
【０００７】
原画像は、タイル分割部３２０１で６４画素×６４画素から成るタイルに分割され、続いてＪＰＥＧ圧縮部３２０２でタイル毎に圧縮処理される。各タイル毎の符号化データはタイル分割部３２０１からのタイル分割情報と合わせて符号化データ統合部３２０３で一つに統合され、符号化データ１が出力される。
【０００８】
次に、図３２（ａ）の画像２について説明する。原画像が１／２縮小部３２０４で縦横とも１／２に縮小された後、同様にタイル分割部３２０５、ＪＰＥＧ圧縮部３２０６、符号化データ統合部３２０７を経て、符号化データ２となる。
【０００９】
図３２（ａ）の縮小画像群（画像２〜４）を生成する縮小処理は、縮小画像全体が１タイル内に収まる大きさになるまで繰り返される。図３２（ａ）の例では、画像３のサイズは、１つのタイルに収まっておらず、さらに１／２縮小処理が行われ、１つのタイル内におさまる画像４のサイズが得られたところで縮小処理を終了する。
【００１０】
画像３の符号化データは１／２縮小部３２０８、タイル分割部３２０９、ＪＰＥＧ圧縮部３２１０、符号化データ統合部３２１１により生成され、画像４の符号化データは１／２縮小部３２１２、タイル分割部３２１３、ＪＰＥＧ圧縮部３２１４、符号化データ統合部３２１５により生成される。
【００１１】
この方式では、１／１サイズ画像の符号化データとは別に、縮小した別解像度の画像についてもそれぞれ符号化データを保持するために、符号化データ量が約１．４倍に増大してしまう点、符号化時には、各解像度で圧縮処理を行うため処理量が大きい点が問題となる。
【００１２】
一方、フラッシュ・ピックスとは別に、ウェーブレット（Ｗａｖｅｌｅｔ）変換による画像圧縮方式があり、この方式では原画像のサイズに対して圧縮を行った一つの符号化データから異なる解像度の画像データを容易に復号することができるので、複数解像度に対応することによる符号化データ量の増大の問題は発生しない。
【００１３】
すなわち、前述のフラッシュ・ピックスで符号化データ量が１．４倍となったのに対し、１倍の符号化データ量で複数解像度を復号する要求に答えることができる。
【００１４】
ウェーブレット変換圧縮では、図３３の基本ブロック図に示す処理が行われる。原画像はウェーブレット変換部３３０１でウェーブレット変換されたサブバンド分割データとなり、量子化部３３０２で量子化され、エントロピー符号化部３３０３でエントロピー符号化された後、符号化データとなる。
【００１５】
図３３中のウェーブレット変換部３３０１をより詳細に示したブロック図を図３４に、ウェーブレット変換による画像変換を図３５に示す。これらは３回の２次元サブバンド分割を行った場合の例である。
【００１６】
図３５（ａ）の原画像は、図３４の水平方向のローパスフィルタ３４０１と水平方向のハイパスフィルタ３４０２とにより、２つの水平方向サブバンドに分割され、各々１／２サブサンプリング部３４０７、３４０８によって１／２に間引かれる。
【００１７】
分割された２つの水平方向サブバンドは、それぞれ垂直方向についてもローパスフィルタ３４０３、３４０５とハイパスフィルタ３４０４、３４０６とによるサブバンド分割と、１／２サブサンプリング部３４０９〜３４１２によるサブサンプリングとが行われ、この時点で４つのサブバンドに変換される。
【００１８】
このうち、水平方向高域、垂直方向高域のサブバンド（図３４のヌ）、水平方向高域、垂直方向低域のサブバンド（図３４のリ）、水平方向低域、垂直方向高域のサブバンド（図３４のチ）は、それぞれ図３５（ｂ）のチ、リ、ヌに示すウェーブレット変換係数となる。
【００１９】
残りの水平方向、垂直方向とも低域のサブバンド３４１３についてのみ、再帰的にサブバンド分割を繰り返していく。
【００２０】
この再起的なサブバンド分割は、水平方向ローパスフィルタ３４１４、３４２６、水平方向ハイパスフィルタ３４１５、３４２７、垂直方向ローパスフィルタ３４１６、３４１８、３４２８、３４３０、垂直方向ハイパスフィルタ３４１７、３４１９、３４２９、３４３１、及び１／２サブサンプリング部３４２０〜３４２５、３４３２〜３４３７によってなされる。
【００２１】
尚、図３４のイ〜トのサブバンドは、図３５（ｂ）のイ〜トに対応する。
【００２２】
このようにして得られた図３５（ｂ）のウェーブレット変換係数を、サブバンド毎に図３３の量子化部３３０２で量子化し、さらに同図のエントロピー符号化部３３０３でエントロピー符号化して符号化データを得る。尚、エントロピー符号化部３３０３ではハフマン符号化や算術符号化を用いることができる。
【００２３】
一方、ウェーブレット変換の復号は、図３６に示すように、符号化データをエントロピー復号部３６０１でエントロピー復号し、逆量子化部３６０２で逆量子化した後、逆ウェーブレット変換部３６０３でサブバンド合成して復号画像を得る。
【００２４】
ウェーブレット変換を用いた符号化の特徴として、図３５（ｂ）に示すように、解像度に応じた階層構造を持つ点があり、このため復号の際に符号化データの一部、若しくは全体を用いて、異なる解像度の画像を容易に復号することができる。
【００２５】
すなわち、図３５（ｂ）のイ，ロ，ハ，ニのサブバンドを復号すれば、原画像の１／４の画像を復号することができ、これに加えてホ，ヘ，トを復号すれば、１／２の画像を復号することができ、全てのサブバンドを復号すれば、１／１サイズの画像を復号することができる。
【００２６】
ここで、図３４におけるＨ−ＬＰ、Ｈ−ＨＰ、Ｖ−ＬＰ、Ｖ−ＨＰフィルタの動作について、図３７を用いて説明する。なお、図３７（ｂ）は図３７（ａ）の円で囲った部分を拡大したものである。
【００２７】
図３７（ａ）の原画像に対してウェーブレット変換を行うために、原画像右上端近くの画素３７０１に対するタップ数９ビットの水平方向フィルタの出力を求める場合、フィルタの演算対象は３７０２に示した領域になる。
【００２８】
しかしこの場合、フィルタ演算対象３７０２の一部は原画像の外にはみ出しており、この部分には画素データが存在しない。垂直フィルタについても同様の問題が生じる。
【００２９】
このように、変換対象画像の周辺部では、フィルタのタップ数に応じて画像外部のデータも必要となる。さらにサブバンド分割を繰り返すと、フィルタがはみ出す領域は広くなる。
【００３０】
この問題は、一般にはある規則に従って画像を端部で折り返す等の方法で処理される。
【００３１】
【発明が解決しようとする課題】
フラッシュ・ピックスのように、複数の解像度の画像に対する符号化データを別々に持つ場合、拡大・縮小などの画像データ処理時の負荷を軽減することができるが、符号化データサイズが約１．４倍に増大する欠点がある。
【００３２】
一方、ウェーブレット変換符号化を用いると、原画像のサイズに対して圧縮を行った一つの符号化データのみから複数の解像度データを容易に復号できるため、符号化データサイズは増大しない。
【００３３】
しかしながら、フラッシュ・ピックスで用いられている、画像をタイルに分割しタイル単位に符号化する方式（特定の画像領域が画像処理の対象となる場合に、必要な画像タイルのみを画像処理の対象とすることで処理にかかる負荷を軽減できる）をウェーブレット変換符号化方式に適用した場合、ウェーブレット変換に使用するフィルタがタイル境界からはみ出すために、問題が生ずる。
【００３４】
すなわち、フラッシュ・ピックスのようなＪＰＥＧ符号化を利用するものは、符号化処理がタイル内で閉じているためにタイル単位の符号化が容易であったのに対し、ウェーブレット変換符号化では処理がタイルの周囲にはみ出るため、タイル単位での符号化処理・管理が困難になるという問題があった。
【００３５】
さらに、従来のウェーブレット変換符号化では、図３３のウェーブレット変換部３３０１の出力、すなわち図３５（ｂ）のウェーブレット変換係数を全て保持するメモリが必要であり、この際ウェーブレット変換係数は原画像と同一の解像度を有するため、メモリ必要量が大きくなるという問題があった。この問題は高解像度の画像を扱う場合により顕著となる。
【００３６】
本発明はかかる課題に鑑みてなされたものであり、複数の解像度の復号及びタイルによる管理をウェーブレット変換を用いて実現することにより、高機能、高効率の符号化を小規模なハードウェア構成で可能とするものである。
【００３７】
【課題を解決するための手段】
本発明の画像符号化方法は、画像データをＮ画素×Ｍ画素のタイルに分割し、各タイルに対応する符号化対象データとして、タイル内のＮ画素×Ｍ画素を出力するステップと、前記各タイルに対応する符号化対象データの周囲に所定のデータを外挿してサブバンド分割を行い、各タイルをそれぞれ独立にウェーブレット符号化するステップと、任意のタイルを独立して、前記サブバンド単位で復号するための管理情報を生成するステップと、前記管理情報を前記符号化情報に付加して、ビットストリームを生成するステップとを具備し、前記管理情報は、各タイルの符号化情報のビットストリーム上での位置を示す情報と、各サブバンドを管理・識別する情報とを含み、前記符号化情報とは独立した位置にまとめて配置されることを特徴とする。
【００３８】
本発明の画像復号方法は、画像データをタイルに分割し、各タイルをそれぞれ独立にウェーブレット符号化した符号化情報と、符号化情報を管理するための管理情報とからなるビットストリームを入力とし、必要とするタイル及び解像度に応じた復号画像を復号する画像復号方法であって、前記管理情報は、各タイルもしくは解像度に対応する符号化情報のサイズを含み、前記符号化情報とは独立した位置にまとめて配置され、復号するタイルあるいは解像度に対応する符号化情報の格納位置を前記サイズをもとに解析するステップと、前記格納位置をもとに、ウェーブレット復号を行うステップと、前記ウェーブレット復号されたタイル単位の復号画像を連結するステップとを備えた、画像復号方法である。
【００３９】
【発明の実施の形態】
以下、本発明の実施の形態を詳細に説明する。図１は本発明の実施形態１の画像符号化方法を説明するためのブロック図である。
【００４０】
図２（ａ）に示すような原画像の画像データは、まずタイル分割部１０１で予め決められたＮ画素×Ｍ画素のタイルに分割される。分割された画像を図２（ｂ）に示す。タイル分割部１０１では、各タイルに対応するデータとしてタイル内のＮ画素×Ｍ画素の画像を出力する。
【００４１】
分割されたタイルのうち、図２（ｂ）のタイルｉについて、その後の処理を説明する。タイルｉの画像データを、ウェーブレット変換部１０２でサブバンド分割する。
【００４２】
ここで、タイルの周辺近くをサブバンド分割処理する際には、タイル周囲のデータを外挿する。すなわち、図３７（ｂ）に示したように、ウェーブレット変換に用いるフィルタの演算対象範囲３７０２がタイル外にはみ出す場合、タイルの外側のデータが必要となるため、ウェーブレット変換部１０２では、データを外挿してサブバンド分割する。
【００４３】
外挿方法としては、例えば図２（ｃ）に示すように、タイル内の画像を折り返して鏡像を生成する手法を用いる。続いて、量子化部１０３でウェーブレット変換係数を量子化し、エントロピー符号化部１０４でエントロピー符号化して、タイルｉの符号化データを得る。
【００４４】
エントロピー符号化には、ハフマン符号化や算術符号化を用いることができる。このウェーブレット変換部１０２、量子化部１０３、エントロピー符号化部１０４をまとめてウェーブレット変換符号化部１０５と呼ぶ。
【００４５】
一方、管理情報生成部１０６は、タイル分割部１０１から得られた各タイルの空間的な位置に関するタイル分割情報と、ウェーブレット変換符号化部１０５から得られた各サブバンドの情報とを用いて、タイル及びサブバンドを管理・識別するための管理情報を生成する。この管理情報は、符号化データ統合部１０７で利用される。
【００４６】
符号化データ統合部１０７は、管理情報生成部１０６より出力される管理情報を使用して、エントロピー符号化部１０４より出力される符号化情報を整理・統合し、かつ管理情報をビットストリーム中に付加して、最終的な符号化データを作成する。
【００４７】
ここで、符号化データをサブバンド及びタイルに従って管理するのは、画像を復号する際に、図３２（ａ）に示した例のような異なった解像度の画像や、画像中の特定のタイルのみを復号することを可能にするためである。
【００４８】
このように作成された符号化データのビットストリームの一例を図３に示す。ビットストリームは、ビットストリーム全体の情報を管理するヘッダーと、各タイル毎の符号化情報とから構成され、各タイル毎の符号化情報は、タイル毎の情報を管理するタイルヘッダーと、画像タイルを前記ウェーブレット変換符号化部１０５で符号化したタイル毎の符号化情報とから構成される。
【００４９】
タイルヘッダーには、各サブバンドに対応するビット位置の情報が記述されており、ここを参照することで必要なサブバンドに対応するビット列がどこにあるかを知ることができる。
【００５０】
勿論、本発明によるビットストリームの構成は、図３に示すものに限定されるものではない。例えば、図３と同じ構成である図４（ａ）に示したものに対し、図４（ｂ）のように各タイルのサブバンド情報を別々に分離した後、これを並び換え、それぞれのサブバンド情報にタイルヘッダを付加して独立したタイルとする構成としても良い。このようにすると、縮小画像のタイルだけにアクセスすることで、縮小された全体画像を素早く再現することが可能となる。
【００５１】
次に、本発明の実施形態２の画像符号化方法について説明する。ここで、実施形態２の画像符号化方法の構成は、図１とともに上述した実施形態１のブロック図と同じであり、タイル分割部１０１の動作のみが異なっている。このため、以下ではこのタイル分割部１０１の動作について、図５を用いて説明する。
【００５２】
実施形態１のタイル分割部１０１では、Ｎ×Ｍ画素のタイルに原画像を分割した後、特定のタイルをウェーブレット変換部１０２に出力する際に、タイル内部の画像データのみを出力として切り出していたが、実施形態２におけるタイル分割部１０１は、原画像に適当な窓関数を乗じることでデータを切り出して出力するものを用いる。
【００５３】
例えば、図５のタイルｉｊを切り出す場合、原画像データに対して水平方向に窓関数ＦＸｉ、続いて垂直方向に窓関数ＦＹｊを乗じた結果を、タイル分割部１０１の出力とする。尚、ｉは水平方向のタイル番号、ｊは垂直方法のタイル番号である。
【００５４】
これにより、図５中の斜線部の画像に、窓関数に応じた重みを乗じた結果が、タイル分割部１０１の出力となる。ここで窓関数としては、全区間を通じた総和が１となるようなものを用いる。
【００５５】
すなわち、
ΣＦＸｉ（ｘ）＝１（０≦ｘ≦ｗ）
ΣＦＹｊ（ｙ）＝１（０≦ｙ≦ｈ）
を満たす窓関数を用いる。
【００５６】
ただし、ｗは原画像の幅、ｈは原画像の高さを表し、ｘ、ｙ軸は原画像の左上角を原点Ｏとし、それぞれ右向き、下向きに取られているものとする。
【００５７】
また、ＦＸｉ（ｘ）の総和はｉに対して、ＦＸｊ（Ｙ）の総和はｊに対して取られているものとする。図５のＦＸｉ−１、ＦＸｉ、ＦＹ１、ＦＹｊ、ＦＹｊ＋１は、このような条件を満たす関数の一部を表したのもである。
【００５８】
この窓関数によるデータ切り出しの結果、タイル分割部１０１の出力には、タイルｉｊ内部の画素だけでなく、周囲の画素も窓関数の値に応じた重みで符号化対象データの中に含まれることになる。
【００５９】
次に、上述した実施形態１の画像符号化方法で符号化されたデータを復号する画像復号方法について、本発明の実施形態３として説明する。図６は実施形態３の画像復号方法を説明するためのブロック図である。
【００６０】
入力となる符号化データは、実施形態１で説明した画像符号化方法で符号化されたものである。管理情報分離部４０１は符号化データの中からタイル分割に関する管理情報・サブバンドに関する管理情報を分離して取り出す。
【００６１】
取り出された管理情報に基づき、符号化データ抽出部４０２ではユーザの要求に応じて、符号化情報中の必要となるタイル及びサブバンドの符号化情報部分を判定し抽出する。尚、図３に示したビットストリームの例では、管理情報はヘッダー及びタイルヘッダーにある。
【００６２】
抽出された符号化情報は、エントロピー復号部４０３でエントロピー復号され、逆量子化部４０４で逆量子化され、復号対象のタイルに対応するウェーブレット変換係数が得られる。
【００６３】
ウェーブレット変換係数は、逆ウェーブレット変換部４０５で逆ウェーブレット変換され、対象タイルの復号画像が得られる。このエントロピー復号部４０３、逆量子化部４０４、逆ウェーブレット変換部４０５をまとめてウェーブレット変換復号部４０６と呼ぶ。
【００６４】
さらに、タイル連結部４０７で、管理情報生成部４０１からのタイル分割情報に基づき、復号されたタイル群を連結して、所望の領域・解像度の復号画像を得る。
【００６５】
図３に示したビットストリームの例を用いて説明すると、低い解像度の全体画像（全タイル）を復号する場合、各タイルヘッダーのサブバンド情報を参照しながら、低解像度のサブバンドに相当する符号化データ部分である１−ａ、２−ａ、…、ｉ−ａ、…を、タイル毎に順次ウェーブレット変換復号部４０６でウェーブレット変換復号する。
【００６６】
そして、得られた低解像度のタイルをタイル連結部４０７で連結すれば、低解像度の全体画像を得ることができる。
【００６７】
また、低解像度復号画像から、ある特定のタイルｉを拡大して、最高解像度で表示したい場合、タイルｉに相当する符号化情報である第ｉタイル符号化情報全体を復号すれば良い。
【００６８】
すなわち、既に抽出済みの符号化情報ｉ−ａに加えてｉ−ｂを抽出し、ｉ−ａとあわせて復号すれば、所望の復号画像が得られる。勿論、全部の符号化情報（全てのタイル、全てのサブバンド）を復号すれば、高解像度でかつ全ての領域の復号画像を得ることができる。
【００６９】
以上のように、ユーザの要求に応じて任意の解像度、任意のタイルの画像を容易に復号することができる。
【００７０】
次に、本発明の実施形態４の画像復号方法について説明する。入力となる符号化データは、実施形態２で説明した画像符号化方法により符号化されたものである。ここで、実施形態４の画像復号方法の構成は、図６とともに上述した実施形態３と同じであり、タイル連結部４０７の動作のみが異なっている。このため、以下ではこのタイル分割部４０７の動作について、図７を用いて説明する。
【００７１】
実施形態２の画像符号化方法では、各タイルの符号化対象画素がタイルの周辺画素を含むため、ウェーブレット変換復号部４０６で復号されたタイルの復号データの大きさは、タイルの大きさよりも大きくなる。
【００７２】
図７においては、タイルは２画素×２画素で構成され、またタイルの復号データの大きさは４画素×４画素である。この場合、タイルｉｊの復号データは図７の斜線部となり、隣接するタイルと１画素の幅だけ重なり合う。
【００７３】
タイル連結部４０７では、タイルの連結の際に、復号データが重なり合う位置については、復号データを足しあわせて画素値を求める。例えば、図７の画素ａについては、
ａ（ｉ−１，ｊ−１）＋ａ（ｉ，ｊ−１）＋ａ（ｉ−１，ｊ）＋ａ（ｉ，ｊ）
によって、画素値を計算する。
【００７４】
ここで、ａ（ｉ，ｊ）は画素ａの位置におけるタイルｉｊの復号データを表すものとする。
【００７５】
次に、本発明の実施形態５の画像符号化方法について説明する。図８は実施形態５の画像符号化方法を説明するためのブロック図である。
【００７６】
実施形態５の画像符号化方法が、図１とともに上述した実施形態１の画像符号化方法と異なっている点は、タイルをウェーブレット変換符号化する際に、タイル周囲を無条件に外挿するのではなく、対象タイルの周囲の別のタイルが存在していればそれを利用する点である。
【００７７】
実施形態１の場合と同様、図９（ａ）に示すように、タイル分割部５０１で分割された原画像のうち、タイルｉについてその後の処理を説明する。タイルｉの画像データをウェーブレット変換部５０３で変換するにあたり、ウェーブレット変換に使用するフィルタがタイルｉからはみ出る領域に周囲の画素が存在する場合は、その画素のデータも用いてタイルｉをウェーブレット変換する。
【００７８】
すなわち、図９（ａ）のタイルｉをウェーブレット変換するために、まず図９（ａ）のタイルｉの周囲のタイル、イ〜チの中から、図９（ｂ）中に斜線で示したウェーブレット変換に必要な周囲画素領域をタイルｉに付加した後、タイルｉのウェーブレット変換を行う。
【００７９】
この付加処理を行うのが周囲画素追加部５０２で、タイル分割部５０１から得られるタイル分割情報に基づき、符号化対象のタイルの周囲に別タイルが存在するか否かを判断し、タイルが存在する場合に必要な画素を付加する。
【００８０】
上記の例において、周辺画素追加部５０２は周囲の全てのタイルを追加してタイル画像データを出力するため、これが入力されるウェーブレット変換部５０３では、タイル単体の画像を処理する実施形態１におけるウェーブレット変換部１０２に比べて大きな画像を変換する必要がある。
【００８１】
変換画像が大きくなると、これを使用した機器は大きな作業領域が必要となり、コストアップと動作速度低下につながる。そこで、前記変換画像をより小さくするような別モードは有効であり、これを次に示す。
【００８２】
これは、図９（ｃ），（ｄ）に示すように、周辺画素追加部５０２で追加する領域をｘ方向もしくはｙ方向に制限し、ウェーブレット変換部５０３へ入力するタイル画像データを小さくするものである。
【００８３】
例えば、図９（ｃ）の場合では、符号化対象のタイルの上下に別タイルが存在する場合に必要な画素を付加する。符号化対象のタイルの左右については、タイル内の画像を折り返して鏡像を生成する手法を用いる。また、図９（ｄ）の場合は、図９（ｃ）の場合と上下、左右が逆になる。
【００８４】
ウェーブレット変換を行う手法としては、図９（ｂ），（ｃ），（ｄ）のいずれか一つだけを用いてサブバンド分割を繰り返す手法、あるいはサブバンド毎に図９（ｂ），（ｃ），（ｄ）の画素追加方法を切替える手法がある。
【００８５】
尚、このウェーブレット変換部５０３の出力として必要となるのは、符号化対象タイルｉのウェーブレット変換係数のみであり、周囲画素追加部５０２で追加された画素はタイルｉ内部の画素のウェーブレット変換係数を算出するためにのみ利用される。
【００８６】
続いて、量子化部５０４で量子化を行い、エントロピー符号化部５０５でエントロピー符号化を行って、タイルｉの符号化情報を得る。このウェーブレット変換部５０３、量子化部５０４、エントロピー符号化部５０５をまとめてウェーブレット変換符号化部５０６と呼ぶ。
【００８７】
一方、管理情報生成部５０７は、タイル分割部５０１から得られた各タイルの空間的な位置に関するタイル分割情報と、ウェーブレット変換符号化部５０６から得られた各サブバンドの情報とを用いて、タイル及びサブバンドを管理・識別するための管理情報を生成する。この管理情報は、符号化データ統合部５０８で利用される。
【００８８】
符号化データ統合部５０８は、管理情報生成部５０７より出力される管理情報を使用して、エントロピー符号化部５０５より出力される符号化情報を整理・統合し、かつ管理情報をビットストリーム中に付加して、例えば図３に示した一例のように、最終的な符号化データを作成する。
【００８９】
さらに、本発明の実施形態６の画像符号化方法について説明する。実施形態６の画像符号化方法の構成は、図８とともに上述した実施形態５と同じであり、周囲画素追加部５０２の動作のみが異なっている。このため、以下ではこの周囲画素追加部５０２の動作について、図１０を用いて説明する。
【００９０】
図１０におけるタイルｉの処理を例として説明する。実施形態５として説明した周囲画素追加部５０２では、タイルｉが入力となった場合に、タイルｉ内の画素のウェーブレット変換係数算出に必要となる画素、すなわちフィルタがはみ出す範囲の画素を全てタイルｉに付加していた。この範囲を図１０中に斜線で示した周辺画素範囲とする。
【００９１】
しかし、一般にタイルｉから大きく離れた画素がタイルｉ内のウェーブレット変換係数に及ぼす影響はかなり小さいため、本実施形態では、付加すべき周辺画素に適当な重みづけ関数を乗じた結果を、タイルｉに付加することにより、付加する画素数を減らし演算量を削減する。
【００９２】
重みづけ関数には、タイルｉに近い部分では１、離れるに従って０に近づくような関数を使用する。図１０に示す重みづけ関数はその一例である。図１０の例では、重みづけ関数を乗じた結果、実際に付加される画素は網点を施した有効画素部分だけであり、その外部はウェーブレット変換に必要な画素ではあるが０とみなされ付加されない。
【００９３】
尚、重みづけ関数としては、図１０に示したもののほか、タイルｉからの距離がある基準内であれば１、それより離れていれば０となるような階段関数も使用することができる。
【００９４】
次に、本発明の実施形態７の画像符号化方法について説明する。図１１は実施形態７の画像符号化方法を説明するためのブロック図である。
【００９５】
実施形態７の画像符号化方法が、図１とともに上述した実施形態１及び図８とともに上述した実施形態５の画像符号化方法と異なっている点は、原画像をタイル化する前に、原画像全体に対してウェーブレット変換部７０１でウェーブレット変換を行い、その後でウェーブレット変換部７０１の出力であるウェーブレット変換係数をタイル単位に並び替えてタイルを構成する点である。
【００９６】
図１１において、原画像はタイル化される前にウェーブレット変換部７０１でウェーブレット変換される。次に、タイル構成部７０２で、空間上で同一のタイルに対応しているウェーブレット変換係数を集めてタイルを構成する並べ替えを行う。
【００９７】
ウェーブレット変換部７０１でウェーブレット変換されて得られたサブバンドの例を図１２（ａ）に示す。この場合、図１２（ａ）の中で最も低い周波数のサブバンド中の係数ｂ０は、他のサブバンド中の係数部分ｂ１，ｂ２，ｂ３，ｂ４，ｂ５，ｂ６，ｂ７，ｂ８，ｂ９と空間的に対応関係にある。
【００９８】
ここで、ｂ１〜ｂ３は１×１、ｂ４〜ｂ６は２×２、ｂ７〜ｂ９は４×４個の係数で構成されている。これらｂ０〜ｂ９をそれぞれのサブバンドから抜き出してきて、図１２（ｂ）に示す形に構成したものを１つのタイルとして、その他のウェーブレット変換係数についても全てタイル単位に並べ替えることにより、実施形態５で原画像をタイルに分割してからウェーブレット変換した場合と同様の結果が得られる。
【００９９】
尚、ｂ０は一つの係数である必要はなく、ｋ個×ｌ個の係数で構成される係数のブロックであっても構わない。この場合、ｂ１〜ｂ３はｋ×ｌ、ｂ４〜ｂ６は２ｋ×２ｌ、ｂ７〜ｂ９は４ｋ×４ｌ個の係数で構成されることになる。
【０１００】
タイル構成部７０２から出力されるタイル化されたウェーブレット変換係数は、量子化部７０３で量子化され、エントロピー符号化部７０４でエントロピー符号化されて符号化情報となる。
【０１０１】
一方、管理情報生成部７０６は、タイル構成部７０２から得られた各タイルの空間的な位置に関するタイル分割情報と、ウェーブレット変換符号化部７０５から得られた各サブバンドの情報とを用いて、タイル及びサブバンドを管理・識別するための管理情報を生成する。この管理情報は、符号化データ統合部７０７で利用される。
【０１０２】
符号化データ統合部７０７は、管理情報生成部７０６より出力される管理情報を使用して、エントロピー符号化部７０４より出力される符号化情報を整理・統合し、かつ管理情報をビットストリーム中に付加して、例えば図３に示した一例のように、最終的な符号化データを作成する。
【０１０３】
尚、タイル構成部７０２は、量子化部７０３の前段に配置しているが、これに限定されるものではなく、例えば量子化部７０３の後段に配置しても良い。
【０１０４】
また、上述した実施形態５乃至７のいずれかの画像符号化方法により符号化されたデータを復号する画像復号方法について、本発明の実施形態８として説明する。図１３は実施形態８の画像復号方法を説明するためのブロック図である。入力となる符号化データは、実施形態５乃至７のいずれかの画像符号化方法で符号化された符号化データである。
【０１０５】
図１３において、符号化データの中から、管理情報分離部９０１でタイル分割に関する管理情報・サブバンドに関する管理情報を分離して取り出し、取り出された管理情報に基づき、符号化データ抽出部９０２でユーザの要求に応じて、符号化情報中の必要となる符号化情報部分を判定し抽出する。すなわち、必要なタイル及び解像度に対応する符号化データを抽出する。
【０１０６】
抽出された符号化情報は、タイルを単位としてエントロピー復号部９０３でエントロピー復号され、逆量子化部９０４で逆量子化され、復号に必要なタイルに対応するウェーブレット変換係数が得られる。
【０１０７】
ウェーブレット変換係数は、逆ウェーブレット変換部９０５で逆ウェーブレット変換され、周囲の画素のデータを含んだ復号画像が得られる。このエントロピー復号部９０３、逆量子化部９０４、逆ウェーブレット変換部９０５をまとめてウェーブレット変換復号部９０６と呼ぶ。
【０１０８】
さらに、タイル統合部９０７で、管理情報分離部９０１からの管理情報に基づいて、復号されたタイル群を統合する。ここでは、各タイルの復号画像で空間的に重なる部分は重畳させて全体の復号画像を得る。
【０１０９】
すなわち、図５とともに上述した実施形態２では、タイルの周辺画素を含めてウェーブレット変換している。また、実施形態５の画像符号化方法においては、図９（ｂ）に示すように、ウェーブレット変換時にタイルの周辺画素を用いており、同様に図１０とともに上述した実施形態６でも、周囲の画素を用いている。
【０１１０】
また、実施形態７の画像符号化方法では、タイルの周辺画素を用いる処理は明示されていないが、原画像全体をウェーブレット変換した際に、原理的に実施形態５と等価な処理がなされている。
【０１１１】
このため、図１３のウェーブレット変換復号部９０６でウェーブレット変換復号した際に、周辺画素のデータが発生し、タイル統合部９０７では復号したタイルの周辺画素を隣接タイルに重畳させることになる。重畳には画素間の加算を用いる。
【０１１２】
次に、本発明の実施形態９の画像復号方法について説明する。これは、実施形態８の画像復号方法と同じく、実施形態５乃至７のいずれかの画像符号化方法で符号化された符号化データを入力とする画像復号方法である。図１４は実施形態９の画像復号方法を説明するためのブロック図である。
【０１１３】
図１４において、符号化データの中から、管理情報分離部１００１でタイル分割に関する管理情報・サブバンドに関する管理情報を分離して取り出し、取り出された管理情報をに基づき、符号化データ抽出部１００２でユーザの要求に応じて、符号化情報中の必要となる符号化データ部分を判定し抽出する。すなわち、必要なタイル及び解像度に相当する符号化情報を抽出する。
【０１１４】
抽出された符号化情報は、タイルを単位としてエントロピー復号部１００３でエントロピー復号され、逆量子化部１００４で逆量子化され、復号に必要なタイルに対応するウェーブレット変換係数が得られる。ここで、ウェーブレット変換係数並べ換え部１００５でウェーブレット変換係数をタイル化する前の状態に並べ換える。
【０１１５】
すなわち、図１２（ｂ）に示すタイル単位に分割されているウェーブレット変換係数を、図１２（ａ）に示す状態に並べ換える。全てのタイルの処理が完了した時点で、図１２（ａ）のウェーブレット変換係数全体が得られる。
【０１１６】
並べ換えられたウェーブレット変換係数は、一回の逆ウェーブレット変換で復号することができるため、ウェーブレット変換係数を逆ウェーブレット変換部１００６で逆ウェーブレット変換すれば、全体の復号画像を得ることができる。
【０１１７】
このエントロピー復号部１００３、逆量子化部１００４、逆ウェーブレット変換部１００６をまとめてウェーブレット変換復号部１００７と呼ぶ。尚、ウェーブレット変換係数並べ換え部１００５は、逆量子化部１００４の後段に配置しているが、これに限定されるものではなく、例えば逆量子化部１００４の前段に配置しても良い。
【０１１８】
次に、本発明の実施形態１０の画像符号化方法について説明する。図１５（ｅ）は実施形態１、実施形態２、実施形態５、実施形態６の画像符号化方法におけるウェーブレット変換部（図１の１０２、図８の５０３）に対応する部分を示したブロック図である。
【０１１９】
図１５（ｅ）のメモリ１１０２は、ウェーブレット変換部１１０１でサブバンド分割されたウェーブレット変換係数を格納するためのものである。この際、メモリ１１０２には、現在ウェーブレット変換部１１０１で処理中のタイルに対応するウェーブレット変換係数のみを格納し、タイルのウェーブレット変換が終了したら、データを次の工程である量子化部（図１の１０３、図８の５０４）に引き渡す。
【０１２０】
従って、メモリ１１０２に格納すべきデータ量は、画像全体に対応するものではなく、１タイルをウェーブレット変換するのに必要なデータ量に抑えることができる。
【０１２１】
すなわち、タイル化を行わないウェーブレット変換では、図１５（ａ）に示すように、変換対象が画像全体となり、ウェーブレット変換部１１０１の出力である図１５（ｂ）のウェーブレット変換係数の全てをメモリに格納する必要があったのに対し、例えば図１５（ｃ）に示すように、タイル化を行うことによって、図１５（ｄ）に対応するウェーブレット変換係数が格納できるメモリのみを用意すればよいことになり、必要メモリ量の大幅な削減が可能となる。
【０１２２】
画像復号方法でも同様な効果が期待できる。本発明の実施形態１１の画像復号方法について説明する。図１６（ｅ）は実施形態３、実施形態４、実施形態８として上述した画像復号方法のうち、逆ウェーブレット変換部（図６の４０５、図１３の９０５）に対応する部分を示したブロック図である。
【０１２３】
図１６（ｅ）のメモリ１２０１には、まず一つのタイルを復号するのに必要なウェーブレット変換係数が格納され、逆ウェーブレット変換部１２０２でサブバンド合成が行われる。
【０１２４】
従って、復号対象画像を図１６（ｂ）とした場合、タイル化しないウェーブレット変換では、メモリに格納すべきデータ量が、図１６（ａ）に示す全てのウェーブレット変換係数であるのに対し、図１６（ｄ）に示すように、タイル分割された画像を復号する場合は、本実施形態のメモリ１２０１に格納すべきデータ量は、図１６（ｃ）に対応するウェーブレット変換係数ですみ、必要なメモリ量が大幅に削減される。
【０１２５】
以上、説明してきた本発明のいずれの実施形態においても、符号化におけるウェーブレット変換時に複数のサブバンド分割フィルタを用いて、適応的に切り替えることによって構成することができる。
【０１２６】
ここで、サブバンド分割フィルタとは、上述の従来例として説明したサブバンド分割に用いるローパスフィルタおよびハイパスフィルタである。ウェーブレット変換ではサブバンド分割が繰り返されるが、この時各サブバンド分割で用いるフィルタにはタップ数や係数値によって種々の種類がある。
【０１２７】
従って、各サブバンド分割で適切なフィルタを用いれば、ウェーブレット変換係数で必要となる符号化対象画像の周辺画素の必要量を、サブバンド毎に制御できることになり、処理量と画質とのバランスをとった最適なウェーブレット変換を行うことができる。
【０１２８】
このような画像符号化方法に対応した画像復号方法では、ウェーブレット変換時に用いたサブバンド分割フィルタに対応するサブバンド合成フィルタを用い、各サブバンド合成でフィルタを切り替えながら逆ウェーブレット変換が行われる。
【０１２９】
次に、本発明の実施形態１２の画像符号化方法について説明する。本実施形態においては、入力された画像は予め定められた複数の符号化方式のうちの１つの方式で符号化することができるものである。
【０１３０】
図１７は実施形態１２の画像符号化方法の一例を説明するためのブロック図であり、本実施例においては、実施形態１の方式と実施形態７の方式とを切替えて符号化するものである。
【０１３１】
図１７において、タイルウェーブレット符号化部１６０１は、入力画像をタイル単位にウェーブレット符号化し、符号化情報を出力する。また、該タイルウェーブレット符号化部１６０１は、タイル分割情報、サブバンド情報およびフラグ情報を出力する。
【０１３２】
管理情報生成部１６０３は、該タイル分割情報、該サブバンド情報、該フラグ情報を入力とし、これらを組合せて管理情報を生成、出力する。符号化データ結合部１０７では、該符号化情報と管理情報とを足し合わせた符号化データを出力する。
【０１３３】
タイルウェーブレット符号化部１６０１において、入力された原画像はタイル分割部１０１で分割され、分割画像が第１スイッチ１６０４の端子０に入力される。また、第１スイッチ１６０１の端子１には原画像がそのまま入力される。これらの出力の一方が、第１スイッチ１６０４を介してウェーブレット符号化部１６０７に入力される。
【０１３４】
ウェーブレット符号化部１６０７は、入力された画像に対してウェーブレット符号化する。第１のウェーブレット変換部１６０８の出力は、第２スイッチ１６０５を介して直接量子化部１０３に入力されるか、さらにタイル構成部７０２を介して量子化部１０３に入力される。
【０１３５】
尚、上記第１のウェーブレット変換部１６０８の動作は、図１とともに上述した実施形態１におけるウェーブレット変換部１０２と同じであるため、その説明は省略する。
【０１３６】
そして、フラグ発生部１６０２にて実施形態１の符号化方式か実施形態７の符号化方式のどちらを使用するかを表すフラグを出力し、同時に第１スイッチ１６０４、第２スイッチ１６０５、第３スイッチ１６０６を制御する。
【０１３７】
各スイッチ１６０４，１６０５，１６０６が端子０に結合されれば、実施形態１の方式で符号化したのと同等の処理を行い、端子１に結合されれば実施形態７の方式で符号化したのと同等の処理を行う。
【０１３８】
尚、タイル構成部７０２の動作は、図１１とともに上述した実施形態７のものと同じであるので、その説明は省略する。
【０１３９】
以上のように、本実施例によれば、タイル単位に符号化を行うことができ、また、画像毎に処理の簡単な実施形態１の方式で符号化するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態７の方式で符号化するかを、選択的に切替えることができる。
【０１４０】
また、図１８は実施形態１２の画像符号化方法の別の一例を説明するためのブロック図であり、本実施例においては、実施形態１の方式と実施形態５の方式とを切替えて符号化することができるものである。
【０１４１】
本実施例の画像符号化方法は、図１８に示すように、図１７において実施形態７に関わるタイル構成部７０２を削除し、実施形態５に関わる周辺画素追加部５０２と第２のウェーブレット符号化部１７０５とを追加し、さらにこれらを切替えるためのスイッチが変更されている。同図のタイルウェーブレット符号化部１７０１及びウェーブレット符号化部１７０２以外の動作は、図１７のものと同じなので、その説明は省略する。
【０１４２】
ウェーブレット符号化部１７０２は、入力された画像のウェーブレット符号化を行い、符号化情報を出力する。入力は２種類あり、一方は第１のウェーブレット変換部１６０８に接続され、他方は第２のウェーブレット変換部１７０５に接続されている。
【０１４３】
画像が第１のウェーブレット変換部１６０８に入力された場合、ウェーブレット変換部１７０２はウェーブレット符号化部１６０７と同じ動作をする。一方、画像が第２のウェーブレット変換部１７０５に入力された場合は、該第２のウェーブレット変換部１７０５の処理がウェーブレット変換部５０３と同じであるため、ウェーブレット符号化部１７０２はウェーブレット符号化部５０６と同じ動作をする。
【０１４４】
タイルウェーブレット符号化部１７０１において、入力された画像はタイル単位に分割され第１スイッチ１７０３に入力される。他方では、該分割された画像にその周辺の画像が足し合わされ、第２スイッチ１７０４に入力される。フラグ発生部１７０６は、ウェーブレット符号化部１７０２にて第１のウェーブレット変換部１６０８を使用するか、第２のウェーブレット変換部１７０５を利用するかを選択し、これを示すフラグを出力する。
【０１４５】
同時に、第１スイッチ１７０３もしくは第２スイッチ１７０４の一方のみをオンするような制御を行う。すなわち、第１スイッチ１７０３がオンの場合は、分割された画像は第１のウェーブレット変換部１６０８に入力され、実施形態１の方式で符号化したのと同等の処理を行う。第２スイッチ１７０４がオンの場合は、分割された画像とその周辺の画像とが第２のウェーブレット変換部１７０５に入力され、実施形態５の方式で符号化したのと同等の処理を行う。
【０１４６】
これによって、タイル単位に符号化を行うことができ、また、画像毎に処理の簡単な実施形態１の方式で符号化するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態５の方式で符号化するかを、選択的に切替えて符号化することができる。
【０１４７】
さらに、図１９は実施形態１２の画像符号化方法の別の一例を説明するためのブロック図であり、本実施例においては、実施形態１の方式、実施形態５の方式、及び実施形態７の方式を切替えて符号化することができるものである。
【０１４８】
本実施例の画像符号化方法は、図１９に示すように、図１８において実施形態７に関わるタイル構成部７０２が追加され、またこれらを切替えるためのスイッチが変更されている。同図のタイルウェーブレット符号化部１８０１及びウェーブレット符号化部１８０７以外の動作は、図１７のものと同じなので、その説明は省略する。
【０１４９】
ウェーブレット符号化部１８０７は、入力された画像のウェーブレット符号化を行い、符号化情報を出力する。第１のウェーブレット変換部１６０８の出力は第３スイッチ１８０５を介して直接量子化部１０３に入力されるか、さらにタイル構成部７０２を介して量子化部１０３に入力される。第２のウェーブレット変換部１７０５の出力は直接量子化部１０３に入力される。
【０１５０】
タイルウェーブレット符号化部１８０１において、入力された画像は直接第１スイッチ１８０３の端子０に入力されるか、タイルに分割された後第１スイッチ１８０３の端子１に入力されるか、あるいは該分割されたタイルにその周辺の画素が足し合わされた画像が第１スイッチ１８０３の端子２に入力される。
【０１５１】
これらの画像が、第２スイッチ１８０４を介して第１のウェーブレット変換部１６０８もしくは第２のウェーブレット変換部１７０５に入力され、量子化部１０３およびエントロピー符号化部１０４を経て、符号化情報として出力される。
【０１５２】
フラグ発生部１８０２は、第１スイッチ１８０３、第２スイッチ１８０４、第３スイッチ１８０５、第４スイッチ１８０６を制御し、０、１、２の３つのモードを切替える。各スイッチ１８０３、１８０４、１８０５、１８０６の端子に示す番号は、このモード番号を示す。
【０１５３】
例えば、第１スイッチ１８０３が端子０に接続されると、残りのスイッチ１８０４、１８０５、１８０６も端子０に接続される。このため、各スイッチ１８０３、１８０４、１８０５、１８０６が端子０に接続された場合は、実施形態７の方式で符号化したのと同等の処理を行う。
【０１５４】
また、各スイッチ１８０３、１８０４、１８０５、１８０６が端子１に接続された場合は、実施形態１の方式で符号化したのと同等の処理を行い、第１スイッチ１８０３、第２スイッチ１８０４、第４スイッチ１８０６が端子２に接続された場合には、実施形態５の方式で符号化したのと同等の処理を行う。
【０１５５】
これによって、タイル単位に符号化を行うことができ、また、画像毎に処理の簡単な実施形態１の方式で符号化するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態５もしくは実施形態７の方式で符号化するかを、選択的に切替えて符号化することができる。
【０１５６】
次に、本発明の実施形態１３の画像復号方法について説明する。これは、実施形態１２として上述した画像符号化方法で符号化されたデータを復号する画像復号方法である。本実施形態においては、入力される符号化データは予め定められた複数の復号方式の中から一つを選んで復号される。
【０１５７】
図２０は実施形態１３の画像復号方法の一例を説明するためのブロック図であり、本実施例の画像復号方法においては、実施形態１の方式と実施形態７の方式とを切替えて符号化した符号化データを復号することができるものである。
【０１５８】
図２０において、管理情報分離部４０１にて分離された符号化情報と管理情報とが、それぞれタイルウェーブレット復号部１９０１に入力される。タイルウェーブレット復号部１９０１は、該符号化情報と管理情報とを用いて、タイル単位に復号を行い、復号画像を出力する。
【０１５９】
該符号化情報は、ウェーブレット復号部１９０２に入力され、ウェーブレット復号される。該ウェーブレット復号部１９０２で復号された画像は、第２スイッチ１９０４を介して直接出力されるか、さらにタイル連結部４０７を介して出力される。
【０１６０】
ウェーブレット復号部１９０２において、逆量子化部４０４の出力は第１スイッチ１９０３を介して、直接第１の逆ウェーブレット変換部１９０６に入力されるか、さらにウェーブレット係数並べ換え部１００５を介して、該第１の逆ウェーブレット変換部に入力される。
【０１６１】
尚、上記第１の逆ウェーブレット変換部１９０６の動作は、図６とともに上述した実施形態３における逆ウェーブレット変換部４０５と同じであるため、その説明は省略する。
【０１６２】
フラグ抽出部１９０５では、管理情報から第１スイッチ１９０３と第２スイッチ１９０４とを制御するフラグを抽出する。各スイッチ１９０３、１９０４が端子０に接続された場合は、実施形態３の画像復号方法と同じ動作を行い、端子１に接続された場合は、実施形態９の画像復号方法と同じ動作を行う。
【０１６３】
尚、タイル構成部４０７の動作は、図６とともに上述した実施形態３のものと同じであるので、その説明は省略する。
【０１６４】
以上のように、本実施例によれば、タイル単位に復号することができ、また、画像毎に処理の簡単な実施形態３の方式で復号するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態９の方式で復号するかを、選択的に切替えることができる。
【０１６５】
また、図２１は実施形態１３の画像復号方法の別の一例を説明するためのブロック図であり、本実施例の画像復号方法において、実施形態１の方式と実施形態５の方式とを切替えて符号化した符号化データを復号することができるものである。
【０１６６】
図２１において、タイルウェーブレット復号部２００１及びウェーブレット復号部２００２以外の部分の動作は、図２０のものと同じなので、その説明は省略する。
【０１６７】
ウェーブレット復号部２００２は、入力される符号化情報をウェーブレット復号する。この時、逆量子化部４０４の出力は、第１スイッチ２００４を介して、第１の逆ウェーブレット変換部１９０６か、第２の逆ウェーブレット変換部２００３に入力される。
【０１６８】
該逆第１のウェーブレット変換部１９０６の出力は、タイル連結部４０７へ入力され、第２のウェーブレット変換部２００３の出力は、タイル統合部９０７ヘ入力される。
【０１６９】
尚、上記第２の逆ウェーブレット変換部２００３の動作は、図１３とともに上述した実施形態８における逆ウェーブレット変換部９０５と同じであるため、その説明は省略する。
【０１７０】
タイルウェーブレット復号部２００１において、ウェーブレット復号部２００２で入力される符号化情報をウェーブレット復号し、該ウェーブレット復号部２００２の出力は、タイル連結部４０７もしくはタイル統合部９０７のいずれかに連結され、復号画像が再生される。
【０１７１】
一方、フラグ抽出部２００５では、入力された管理情報からフラグを抽出し、該抽出されたフラグにより第１スイッチ２００４が切り替わる。第１スイッチ２００４が端子０に接続された場合、実施形態３の画像復号方法と同じ動作を行い、端子１に接続された場合は、実施形態８の画像復号方法と同じ動作をする。
【０１７２】
これによって、タイル単位に復号することができ、また、画像毎に処理の簡単な実施形態３の方式で復号するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態８の方式で復号するかを、選択的に切替えることができる。
【０１７３】
さらに、図２２は実施形態１３の画像復号方法の別の一例を説明するためのブロック図であり、本実施例の画像復号方法においては、実施形態１の方式、実施形態５の方式、及び実施形態７の方式を切替えて符号化した符号化データを復号することができるものである。
【０１７４】
本実施例の画像復号方法は、図２２に示すように、図２１において、ウェーブレット係数並べ換え部１００５が追加され、またこれらを切替えるスイッチが変更されている。同図において、タイルウェーブレット復号部２１０１及びウェーブレット復号部２１０２以外の部分の動作は、図２０のものと同じなので、その説明は省略する。
【０１７５】
ウェーブレット復号部２１０２は、入力される符号化情報をウェーブレット復号する。この時、逆量子化部４０４の出力は、第１スイッチ２１０３の端子０を介して、第１の逆ウェーブレット変換部１９０６に直接入力されるか、第１スイッチ２１０３の端子１とウェーブレット係数並べ換え部１００５とを介して、第１の逆ウェーブレット変換部１９０６に入力されるか、第１スイッチ２１０３の端子２を介して、第２の逆ウェーブレット変換部２００３に入力される。
【０１７６】
該第１の逆ウェーブレット変換部１９０６の出力は、第２スイッチ２１０４を介して、タイル連結部４０７へ入力されるか、直接復号画像が出力される。第２の逆ウェーブレット変換部２００３の出力は、タイル統合部９０７ヘ入力される。その他の部分の動作は、ウェーブレット復号部２００２と同じなので、その説明は省略する。
【０１７７】
タイルウェーブレット復号部２１０１において、フラグ抽出部２１０５は管理情報からフラグを抽出する。該抽出されたフラグ情報により、第１スイッチ２１０３、第２スイッチ２１０４が制御される。また、残りの管理情報は、タイル連結部４０７とタイル統合部９０７とに入力される。
【０１７８】
各スイッチ２１０３、２１０４が端子０に接続された場合、実施形態３の画像復号方法と同じ動作を行い、端子１に接続された場合、実施形態９の画像復号方法と同じ動作を行い、第１スイッチ２１０３が端子２に接続された場合は、第２スイッチ２１０４の接続先に関わらず、実施形態８の画像復号方法と同じ動作を行う。
【０１７９】
これによって、タイル単位に復号することができ、また、画像毎に処理の簡単な実施形態３の方式で符号化するか、処理は若干複雑になるが、タイル境界にひずみの発生しない実施形態８もしくは実施形態９の方式で復号するかを、選択的に切替えることができる。
【０１８０】
次に、本発明の実施形態１４の画像符号化方法について説明する。本実施形態においては、タイルを管理するための管理情報にタイルを区別する情報を追加し、目的のタイルの符号化情報を高速に復号できるようにするものである。
【０１８１】
図２３は実施形態１４の画像符号化方法の一例を説明するためのブロック図である。図２３において、入力された原画像は、タイルウェーブレット符号化部２２０１でタイル単位に符号化され、管理のための情報（例えば、タイル分割情報、フラグ情報、サブバンド情報）と符号化情報とが生成される。
【０１８２】
ＩＤ生成部２２０２では、各タイルを区別するためのＩＤ情報が生成される。管理情報生成部２２０３は、該管理のための情報と該ＩＤ情報とを足し合わせて、管理情報を生成する。符号化データ結合部２２０４は、該符号化情報と管理情報とを結合し、さらに各タイルの先頭にタイルの先頭を示すスタートコードを加えて、符号化データを生成する。
【０１８３】
符号化データのフォーマットの一例としては、図２４（ａ）に示すように、各タイルの情報がそのタイルのスタートコードと管理情報（タイルヘッダー）と符号化情報とから構成される。タイルウェーブレット符号化部２２０１は、実施形態１、実施形態２、実施形態５、実施形態６、実施形態７、実施形態１０、実施形態１２、実施形態１４における画像符号化方法を使用することができる。
【０１８４】
ここで、原画像を分割したタイルを区別するため、左上から順に１、２．．．とＩＤ情報を割り当てれば、タイルは任意の順序で符号化でき、また符号化の後に順序を入れ換えることも可能となる。もし、タイルの符号化する順序が予め決められていれば、ＩＤ生成部２２０２を省略することができる。
【０１８５】
それぞれのタイルは、スタートコードから始まるため、これを目印に各タイルがどこにあるのかを識別することができる。この代わりに、そのタイルのデータ量（符号化情報とタイルヘッダーとを合わせたもの）を用いた場合も、各タイルがどこにあるのかを識別することができる。
【０１８６】
また、図２５は実施形態１４の画像符号化方法の別の例を説明するためのブロック図であり、図２３に示した画像符号化方法にタイルのサイズ計算を行うデータ量計測部２３０１を付加したもので、このデータ量計測部２３０１及び管理情報生成部２３０２以外の部分の動作説明は省略する。
【０１８７】
図２５において、データ量計測部２３０１は、タイル毎に符号化されたデータ量を計測して、これを出力する。管理情報生成部２３０２は、管理のための情報、ＩＤ情報、及びタイルのデータ量を足し合わせて、管理情報を生成する。
【０１８８】
符号化データのフォーマットの一例としては、図２４（ｂ）に示すように、各タイルの先頭に該タイルの符号化情報のデータ量が配置され、続いて他の管理情報（タイルヘッダー）と符号化情報とが続く。尚、タイルのデータ量は、必ずしも各タイルの先頭に配置する必要はなく、例えば先頭にまとめることもできる。
【０１８９】
さらに、図２６は実施形態１４の画像符号化方法の別の例を説明するためのブロック図であり、図２５に示した画像符号化方法に符号化データ並べ変え部２４０１を追加したもので、他の部分の動作説明は省略する。
【０１９０】
図２６において、符号化データ並べ換え部２４０１は、符号化データ結合部２２０４で作成された符号化データから、各タイルのデータ量を抜き出し、これらを符号化データの先頭に配置してから、残りを順番に並べて符号化データを出力する。
【０１９１】
符号化データのフォーマットの一例としては、図２４（ｃ）に示すように、先頭に配置された全てのタイルのデータ量を足し合わせることで、容易に目的のタイルの位置を計算することができる。
【０１９２】
また、図２７に示す構成でも同様の効果をあげることができる。図２７は実施形態１４の画像符号化方法の別の例を説明するためのブロック図であり、図２５に示した画像符号化方法に符号化データ蓄積バッファ２５０１及び管理情報蓄積バッファ２５０２を追加したもので、この符号化データ蓄積バッファ２５０１、管理情報蓄積バッファ２５０２、及び符号化データ結合部２５０３以外の動作説明は省略する。
【０１９３】
図２７において、タイルウェーブレット符号化部２２０１より出力される符号化情報は、符号化データ蓄積バッファ２５０１で一旦蓄積される。管理情報蓄積バッファ２５０２は、管理情報生成部２３０２で生成された各タイルの管理情報を蓄積し、該管理情報からタイルのデータ量を抜き出してから、これを符号化データ結合部２５０３に出力し、次いで残りの管理情報を出力する。
【０１９４】
符号化データ結合部２５０３では、該入力された全タイルのデータ量を最初に出力し、残りの管理情報及び符号化情報を結合して出力する。
【０１９５】
以上のように、本実施形態によれば、符号化データの中から復号するタイルの符号化情報を高速に検索し、復号することが可能となる。
【０１９６】
次に、本発明の実施形態１５の画像復号方法について説明する。図２８は実施形態１５の画像復号方法を説明するためのブロック図であり、本実施形態は、上述した実施形態１４の画像符号化方法で符号化されたデータを復号する画像復号方法である。
【０１９７】
図２８において、復号タイル決定部２６０３は、ユーザの指示により復号するタイルのＩＤを決定する。管理情報分離部２６０６は、符号化データから各タイルの先頭を示すスタートコードを検索し、タイルに関する管理情報と符号化情報とを分離する。
【０１９８】
データ読み飛ばし制御部２６０２は、上記管理情報に基づいて、これから復号するタイルのタイルＩＤが該決定されたタイルＩＤかどうかを判定し、これが該タイルＩＤならば、第１スイッチ２６０５及び第２スイッチ２６０４をオンにする。こうして、タイルウェーブレット復号部２６０１は、特定のタイルのみを復号することが可能となる。
【０１９９】
タイルの管理情報にそのタイルのデータ量が記述されている場合は、管理情報分離部２６０６は各タイルの先頭を検索する必要はなく、記述されているデータ量分だけ読み飛ばせば良い。尚、タイルウェーブレット復号部２６０１は、実施形態３、実施形態４、実施形態８、実施形態９、実施形態１１、実施形態１３、実施形態１５の画像復号方法を使用することができる。
【０２００】
以上のように、本実施形態によれば、全ての符号化データを復号せずに、タイルの先頭の管理情報のみを復号することで、目的のタイルを素早く復号することができる。
【０２０１】
次に、本発明の実施形態１６の画像符号化方法について説明する。本実施形態においては、タイルを管理するための管理情報に周辺のタイルの情報も追加し、周辺のタイルの符号化情報も高速に復号できるようにするものである。
【０２０２】
図２９（ａ）は実施形態１６の画像符号化方法の一例を説明するためのブロック図である。本実施例の画像符号化方法は、図２３に示した実施形態１４に周辺タイルＩＤ決定部２８０１を追加したものであり、また、管理情報生成部２８０２の動作が異なっている。このため、周辺タイルＩＤ決定部２８０１及び管理情報生成部２８０２以外の部分の説明は省略する。
【０２０３】
尚、タイルウェーブレット符号化部２８０１は、実施形態５、実施形態６、実施形態７、実施形態１０、実施形態１２、実施形態１４の画像符号化方法を使用することができる。
【０２０４】
図２９（ａ）において、周辺タイルＩＤ決定部２８０１は、タイル分割情報、フラグ情報、サブバンド情報、ＩＤ生成部２２０２で生成されたタイルＩＤから復号時に必要な周辺のタイルＩＤを決定する。管理情報作成部２８０２は、タイル分割情報、フラグ情報、サブバンド情報、タイルＩＤに該周辺のタイルＩＤを足し合わせた管理情報を生成する。
【０２０５】
尚、周辺タイルＩＤ決定部２８０１にて決定される複数のタイルＩＤは、符号化に必要な全てのタイルＩＤである必要はなく、例えば図２９（ｂ）に示すように、符号化するタイルの左上、左下に位置するタイルのタイルＩＤに限定しても良い。
【０２０６】
符号化データのフォーマットの一例としては、図２４（ａ）において管理情報（タイルヘッダー）がタイルＩＤと周辺タイルのＩＤとを含む構成が考えられる。
【０２０７】
また、図３０は実施形態１６の画像符号化方法の別の例を説明するためのブロック図であり、管理情報に周辺タイルの位置情報も含めることによって、復号時にタイル化された符号化情報の検索を高速化しようとするものである。本実施例の画像符号化方法は、図２７に示した実施形態１４から管理情報蓄積バッファ２５０２を削除し、データ量格納部２９０１、相対位置計算部２９０２、情報蓄積バッファ２９０４を追加したものである。
【０２０８】
このデータ量格納部２９０１、相対位置計算部２９０２、情報蓄積バッファ２９０４、及び管理情報生成部２９０３、ＩＤ生成部２９０５以外の動作は、上述のものと同様であるので、その説明は省略する。
【０２０９】
図３０において、タイルウェーブレット符号化部２２０１から出力される符号化情報は、全て符号化データ蓄積バッファ２５０１に蓄積され、また該タイルウェーブレット符号化部２２０１から出力されるタイル分割情報、フラグ情報、サブバンド情報の各情報は、全て情報蓄積バッファ２９０４に蓄積される。データ量計測部２３０１で出力された各タイルの符号化情報のデータ量は、全てデータ量格納部２９０１に格納される。
【０２１０】
ＩＤ生成部２９０５は、各タイルを区別するためのＩＤ情報を出力し、情報蓄積バッファ２９０４、データ量格納部２９０１、及び符号化データ蓄積バッファ２５０１が蓄積している情報を、タイル単位に出力するよう制御する。データ量格納部２９０１は、入力されたタイルＩＤに基づいて、そのタイルのデータ量を管理情報生成部２９０３に出力し、該タイルＩＤを持つタイルとその周辺タイルの相対位置を計算するのに必要なタイルのデータ量を相対位置計算部２９０２へ出力する。
【０２１１】
相対位置計算部２９０２では、入力された各タイルのデータ量を用いて、符号化するタイルに対する周辺タイルの符号化情報の存在する相対位置を計算し、その結果を出力する。管理情報生成部２９０３は、入力されるタイルＩＤ情報、タイル分割情報、フラグ情報、サブバンド情報、タイルデータ量、該周辺タイルの相対位置などから管理情報を生成し、符号化データ結合部２５０３ヘ出力する。
【０２１２】
このように、全ての符号化データを復号せずに、タイルの先頭の管理情報のみを復号することで、目的のタイルと復号に必要な周辺のタイルを素早く復号できるような符号化データを生成することが可能となる。
【０２１３】
次に、本発明の実施形態１９の画像復号方法について説明する。図３１は実施形態１９の画像復号方法を説明するためのブロック図であり、本実施形態は、上述した実施形態１８の画像符号化方法で符号化されたデータを復号する画像復号方法である。
【０２１４】
本実施形態は、図２８に示した実施形態１５にバッファ３００１を追加したもので、このバッファ３００１及びデータ読み飛ばし制御部３００２以外の動作は、図２８のものと同じであるため、その説明は省略する。
【０２１５】
図３１において、入力された符号化データは、一時バッファ３００１に格納され、順次出力される。データ読み飛ばし制御部３００２は、入力された管理情報に基づいて、これから復号するタイルのＩＤを抽出し、これが該決定されたタイルＩＤもしくは周辺タイルのタイルＩＤならば、第１スイッチ２６０５及び第２スイッチ２６０４をオンにする。
【０２１６】
上記管理情報が復号に必要な周辺タイルのタイルＩＤを含んでいるならば、バッファ３００１から該周辺タイルの符号化情報を出力するよう制御する。こうして、タイルウェーブレット復号部２６０１は、特定のタイルとその周辺とを復号することができる。
【０２１７】
ここで、管理情報に含まれる復号された周辺タイルＩＤが周辺のタイル数より小さい予め決められた個数（例えば、図２９（ｂ）の網点で示したタイル）である場合、復号に必要な他の位置のタイルＩＤ（図２９（ｂ）の白いタイル）は、上記復号された周辺タイルＩＤより決定される。
【０２１８】
尚、タイルウェーブレット復号部２６０１は、実施形態８、実施形態９、実施形態１１、実施形態１３、実施形態１５の画像復号方法を使用することができる。
【０２１９】
これによって、全ての符号化データを復号せずに、タイルの先頭の管理情報のみを復号することで、目的のタイルと復号に必要な周辺のタイルとを素早く復号することが可能となる。
【０２２０】
以上のとおり、本実施形態の画像符号化方法及び画像復号方法を用いれば、符号化データ量を増大させることなしに、ユーザの要求に応じた解像度の復号画像を容易に復号することが可能となる。これは、ＪＰＥＧを用いるフラッシュ・ピックスが複数の解像度に対応するために、符号化データ量が１．４倍に増大するのに比して大きな利点である。
【０２２１】
また、画像をタイルに分割して特定領域のみの復号を可能とする際に、ウェーブレット変換による符号化は、タイル内に閉じた処理が原理的に困難であり、タイル分割処理に不向きであったのに対し、本発明ではウェーブレット変換を用いながら、タイル単位での符号化・復号処理を可能にしている。
【０２２２】
すなわち、画像をタイル単位に符号化することによって、画像の一部を復号したい場合に、画像全体を復号しなくとも、その領域を含むタイルを復号すれば良いため、ランダムアクセス機能を向上させることができる。
【０２２３】
【発明の効果】
本発明の画像符号化方法及び画像復号方法によれば、画像を復号する際に異なる解像度で復号したり、画像中の特定のタイルのみを復号することができる。また、低解像度の画像を復号したい時は、低い解像度のサブバンド情報のみにアクセスすることで、縮小された全体画像をすばやく再現することができる。
【０２２４】
さらに、各タイルに対応する符号化情報の格納位置を示す情報を、符号化情報とは独立した位置に置いて管理することによって、容易に目的のタイルの位置を計算することができる。また、全ての符号化情報の復号を行わなくても、符号化情報とは独立して配置された管理情報を用いることで、目的のタイルをすばやく復号することができる。
【図面の簡単な説明】
【図１】本発明の実施形態１の画像符号化方法を説明するためのブロック図である。
【図２】本発明の実施形態１の画像符号化方法を説明する説明図である。
【図３】本発明の実施形態１の画像符号化方法におけるビットストリームの一例を示す説明図である。
【図４】本発明の実施形態１の画像符号化方法におけるビットストリームの別の例を示す説明図である。
【図５】本発明の実施形態２の画像符号化方法を説明する説明図である。
【図６】本発明の実施形態３の画像復号方法を説明するためのブロック図である。
【図７】本発明の実施形態４の画像復号方法を説明する説明図である。
【図８】本発明の実施形態５の画像符号化方法を説明するためのブロック図である。
【図９】本発明の実施形態５の画像符号化方法を説明する説明図である。
【図１０】本発明の実施形態６の画像符号化方法を説明する説明図である。
【図１１】本発明の実施形態７の画像符号化方法を説明するためのブロック図である。
【図１２】本発明の実施形態７の画像符号化方法を説明する説明図である。
【図１３】本発明の実施形態８の画像復号方法を説明するためのブロック図である。
【図１４】本発明の実施形態９の画像復号方法を説明するためのブロック図である。
【図１５】本発明の実施形態１０の画像符号化方法を説明するためのブロック図、及びその動作を説明する説明図である。
【図１６】本発明の実施形態１１の画像復号方法を説明するためのブロック図、及びその動作を説明する説明図である。
【図１７】本発明の実施形態１２の画像符号化方法の一例を説明するためのブロック図である。
【図１８】本発明の実施形態１２の画像符号化方法の別の例を説明するためのブロック図である。
【図１９】本発明の実施形態１２の画像符号化方法の別の例を説明するためのブロック図である。
【図２０】本発明の実施形態１３の画像復号方法の一例を説明するためのブロック図である。
【図２１】本発明の実施形態１３の画像復号方法の別の例を説明するためのブロック図である。
【図２２】本発明の実施形態１３の画像復号方法の別の例を説明するためのブロック図である。
【図２３】本発明の実施形態１４の画像符号化方法の一例を説明するためのブロック図である。
【図２４】本発明の実施形態１４の画像符号化方法におけるビットストリームの一例を説明するための説明図である。
【図２５】本発明の実施形態１４の画像符号化方法の別の例を説明するためのブロック図である。
【図２６】本発明の実施形態１４の画像符号化方法の別の例を説明するためのブロック図である。
【図２７】本発明の実施形態１４の画像符号化方法の別の例を説明するためのブロック図である。
【図２８】本発明の実施形態１５の画像復号方法を説明するためのブロック図である。
【図２９】本発明の実施形態１６の画像符号化方法の一例を説明するためのブロック図、及びその動作を説明する説明図である。
【図３０】本発明の実施形態１６の画像符号化方法の別の例を説明するためのブロック図である。
【図３１】本発明の実施形態１７の画像復号方法を説明するためのブロック図である。
【図３２】従来の技術を説明するためのブロック図、及びその動作を説明する説明図である。
【図３３】従来の技術を示すブロック図である。
【図３４】従来の技術を示すブロック図である。
【図３５】従来の技術を説明する説明図である。
【図３６】従来の技術を示すブロック図である。
【図３７】従来の技術を説明する説明図である。
【符号の説明】
１０１…タイル分割部、１０２…ウェーブレット変換部、１０３…量子化部、１０４…エントロピー符号化部、１０５…ウェーブレット変換符号化部、１０６…管理情報生成部、１０７…符号化データ統合部、４０１…管理情報分離部、４０２…符号化データ抽出部、４０３…エントロピー符号化部、４０４…逆量子化部、４０５…逆ウェーブレット変換部、４０６…ウェーブレット変換復号部、４０７…タイル連結部、５０１…タイル分割部、５０２…周囲画素追加部、５０３…ウェーブレット変換部、５０４…量子化部、５０５…エントロピー符号化部、５０６…ウェーブレット変換符号化部、５０７…管理情報生成部、５０８…符号化データ統合部、７０１…ウェーブレット変換部、７０２…タイル構成部、７０３…量子化部、７０４…エントロピー符号化部、７０５…ウェーブレット変換符号化部、７０６…管理情報生成部、７０７…符号化データ統合部、９０１…管理情報分離部、９０２…符号化データ抽出部、９０３…エントロピー復号部、９０４…逆量子化部、９０５…逆ウェーブレット変換部、９０６…ウェーブレット変換復号部、９０７…タイル統合部、１００１…管理情報分離部、１００２…符号化データ抽出部、１００３…エントロピー復号部、１００４…逆量子化部、１００５…ウェーブレット変換係数並べ換え部、１００６…逆ウェーブレット変換部、１００７…ウェーブレット変換復号部、１１０１…ウェーブレット変換復号部、１１０２…メモリ、１２０１…メモリ、１２０２…逆ウェーブレット変換部、１６０１、１７０１、１８０１、２１０１、２２０１…タイルウェーブレット符号化部、１６０２、１７０６、１８０２、１９０５、２００５、２１０５…フラグ発生部、１６０３、２２０３、２３０２、２８０２、２９０３…管理情報生成部、１６０４、１７０３、１８０３、１９０３、２００４、２１０３、２６０５第１スイッチ、１６０５、１７０４、１８０４、１９０４、２１０４、２６０４…第２スイッチ、１６０６、１８０５…第３スイッチ、１６０７、１７０２、１８０７…ウェーブレット符号化部、１６０８…第１のウェーブレット符号化部、１７０５…第２のウェーブレット符号化部、１８０６…第４スイッチ、２２０４、２５０３…符号化データ結合部、１９０１、２００１、２６０１…タイルウェーブレット復号部、１９０２、２００２、２１０２…ウェーブレット復号部、１９０６…第１の逆ウェーブレット変換部、２００３…第２の逆ウェーブレット変換部、２２０２、２９０５…ＩＤ作成部、２３０１…データ量計測部、２４０１…符号化データ並べ変え部、２５０１…符号化データ蓄積バッファ、２５０２…管理情報蓄積バッファ、２６０２、３００２…データ読み飛ばし制御部、２６０３…復号タイル決定部、２８０１…周辺タイルＩＤ決定部、２９０１…データ量格納部、２９０２…相対位置計算部、３００１…バッファ、２６０６…管理情報分離部、２９０４…情報蓄積バッファ、３２０１、３２０５、３２０９、３２１３…タイル分割部、３２０４、３２０８、３２１２…１／２縮小部、３２０２、３２０６、３２１０、３２１４…ＪＰＥＧ圧縮部、３２０３、３２０７、３２１１、３２１５…符号化データ統合部、３３０１…ウェーブレット変換部、３３０２…量子化部、３３０３…エントロピー復号部、３３０４…ウェーブレット変換符号化部、３４０１、３４１４、３４２６…水平方向ローパスフィルタ、３４０２、３４１５、３４２７…水平方向ハイパスフィルタ、３４０３、３４０５、３４１６、３４３４、３４２８、３４３０…垂直方向ロー、パスフィルタ、３４０４、３４０６、３４１７、３４１９、３４２９、３４３１…垂直方向ハイパスフィルタ、３４０７〜３４１２、３４２０〜３４２５、３４３２〜３４３７…１／２サブサンプリング部、３６１３…水平方向低域・垂直方向低域のサブバンド、３６０１…エントロピー復号部、３６０２…逆量子化部、３６０３…逆ウェーブレット変換部、３６０４…ウェーブレット変換復号部、３７０１…フィルタ適用画素、３７０２…フィルタ演算対象範囲。 [0001]
BACKGROUND OF THE INVENTION
The present invention belongs to the field of digital image processing, and relates to an image encoding method for encoding image data with high efficiency and an image decoding method for decoding encoded data encoded by this image encoding method.
[0002]
[Prior art]
As an image format for converting natural images into digital data and performing computer processing, a flash pix format (FlashPix Format Specification Version 1.0) has been proposed.
[0003]
In this format, data of a plurality of resolutions are simultaneously held in order to quickly extract data of a necessary resolution according to the capability of the display / printing apparatus and the user's request. In addition, the image is divided and held in units of tiles so that the load can be reduced by processing only necessary portions in the image data when the image is enlarged or reduced or edited.
[0004]
An encoding apparatus for encoding an image in accordance with the flash pix format will be described with reference to FIG. FIG. 32A is a diagram illustrating image reduction and tile division, and FIG. 32B is a block diagram illustrating an example of an encoding device.
[0005]
In the flash picks, first, 1/1 to 1/8 size images shown in images 1 to 4 in FIG. 32A are generated, and tile division and compression are performed on each image 1 to 4, respectively. There are features.
[0006]
First, the case where the image 1 of FIG. 32A is encoded by the encoding device of FIG. 32B will be described. Here, broken lines in images 1 to 4 in FIG. 32A represent tile boundaries.
[0007]
The original image is divided into tiles each having 64 pixels × 64 pixels by the tile dividing unit 3201, and then compressed for each tile by the JPEG compression unit 3202. The encoded data for each tile is integrated into one by the encoded data integration unit 3203 together with the tile division information from the tile division unit 3201, and the encoded data 1 is output.
[0008]
Next, the image 2 in FIG. 32A will be described. After the original image is reduced to 1/2 in both vertical and horizontal directions by the 1/2 reduction unit 3204, similarly, the encoded data 2 is obtained through the tile division unit 3205, the JPEG compression unit 3206, and the encoded data integration unit 3207.
[0009]
The reduction processing for generating the reduced image group (images 2 to 4) in FIG. 32A is repeated until the size of the entire reduced image is within one tile. In the example of FIG. 32A, the size of the image 3 does not fit in one tile, and is further reduced by ½ reduction processing when the size of the image 4 that fits in one tile is obtained. The process ends.
[0010]
The encoded data of image 3 is generated by a 1/2 reduction unit 3208, a tile division unit 3209, a JPEG compression unit 3210, and an encoded data integration unit 3211. The encoded data of image 4 is generated by a 1/2 reduction unit 3212, tile division. Generated by the unit 3213, the JPEG compression unit 3214, and the encoded data integration unit 3215.
[0011]
In this method, in addition to the encoded data of the 1/1 size image, the encoded data is also held for each of the reduced resolution images, so that the encoded data amount increases by about 1.4 times. On the other hand, at the time of encoding, a problem arises in that the amount of processing is large because compression processing is performed at each resolution.
[0012]
On the other hand, apart from flash pix, there is an image compression method by wavelet transform. In this method, image data with different resolution can be easily decoded from one encoded data that is compressed for the size of the original image. Therefore, there is no problem of an increase in the amount of encoded data due to support for multiple resolutions.
[0013]
That is, while the encoded data amount has increased by 1.4 times with the above-described flash pix, it is possible to respond to a request for decoding a plurality of resolutions with an encoded data amount of 1 ×.
[0014]
In the wavelet transform compression, the processing shown in the basic block diagram of FIG. 33 is performed. The original image becomes subband division data wavelet transformed by the wavelet transform unit 3301, quantized by the quantization unit 3302, entropy coded by the entropy coding unit 3303, and becomes coded data.
[0015]
FIG. 34 is a block diagram showing the wavelet transformation unit 3301 in FIG. 33 in more detail, and FIG. 35 shows image transformation by wavelet transformation. These are examples when three-dimensional two-dimensional subband division is performed.
[0016]
The original image in FIG. 35A is divided into two horizontal subbands by a horizontal low-pass filter 3401 and a horizontal high-pass filter 3402 in FIG. 34, and is divided by ½ subsampling units 3407 and 3408, respectively. Decimated by half.
[0017]
The divided two horizontal sub-bands are also sub-band divided by the low-pass filters 3403 and 3405 and the high-pass filters 3404 and 3406 and sub-sampling by the 1/2 sub-sampling units 3409 to 3412 in the vertical direction. At this point, it is converted into four subbands.
[0018]
Of these, the horizontal high band, the vertical high band subband (Fig. 34), the horizontal high band, the vertical low band (Fig. 34), the horizontal low band, the vertical high band. The subbands (H in FIG. 34) are wavelet transform coefficients indicated by H, R, and N in FIG. 35B, respectively.
[0019]
Subband division is repeated recursively only for the remaining low-frequency subband 3413 in both the horizontal and vertical directions.
[0020]
This recursive subband splitting includes horizontal low-pass filters 3414, 3426, horizontal high-pass filters 3415, 3427, vertical low-pass filters 3416, 3418, 3428, 3430, vertical high-pass filters 3417, 3419, 3429, 3431, and This is done by 1/2 sub-sampling units 3420-3425, 3432-3437.
[0021]
Note that the sub-bands (i) to (e) in FIG. 34 correspond to (i) to (b) in FIG.
[0022]
The wavelet transform coefficient of FIG. 35B obtained in this way is quantized by the quantization unit 3302 of FIG. 33 for each subband, and further entropy-encoded by the entropy encoding unit 3303 of FIG. Get. The entropy encoding unit 3303 can use Huffman encoding or arithmetic encoding.
[0023]
On the other hand, as shown in FIG. 36, wavelet transform decoding is performed by entropy decoding encoded data by an entropy decoding unit 3601, dequantizing by an inverse quantization unit 3602, and then subband synthesis by an inverse wavelet transform unit 3603. To obtain a decoded image.
[0024]
As a feature of encoding using wavelet transform, as shown in FIG. 35 (b), there is a point having a hierarchical structure corresponding to the resolution. For this reason, part or all of the encoded data is used for decoding. Thus, images with different resolutions can be easily decoded.
[0025]
That is, if the subbands a, b, c, and d in FIG. 35B are decoded, a quarter of the original image can be decoded, and in addition to this, h, f, and g can be decoded. For example, a 1/2 image can be decoded, and if all subbands are decoded, a 1/1 size image can be decoded.
[0026]
Here, the operation of the H-LP, H-HP, V-LP, and V-HP filters in FIG. 34 will be described with reference to FIG. FIG. 37 (b) is an enlarged view of a portion surrounded by a circle in FIG. 37 (a).
[0027]
In order to perform wavelet transform on the original image of FIG. 37A, when obtaining the output of a horizontal filter with a tap number of 9 bits for the pixel 3701 near the upper right end of the original image, the calculation target of the filter is indicated by 3702 Become an area.
[0028]
However, in this case, a part of the filter calculation target 3702 protrudes outside the original image, and no pixel data exists in this part. Similar problems arise with vertical filters.
[0029]
As described above, in the peripheral portion of the conversion target image, data outside the image is required according to the number of taps of the filter. If the subband division is repeated further, the region where the filter protrudes becomes wider.
[0030]
This problem is generally handled by a method such as folding an image at an end according to a certain rule.
[0031]
[Problems to be solved by the invention]
In the case of having separately encoded data for images of a plurality of resolutions such as flash pix, the load at the time of image data processing such as enlargement / reduction can be reduced, but the encoded data size is about 1.4. There is a disadvantage that doubles.
[0032]
On the other hand, when wavelet transform coding is used, a plurality of resolution data can be easily decoded from only one piece of coded data obtained by compressing the size of the original image, so that the coded data size does not increase.
[0033]
However, the method used in flash picks to divide an image into tiles and encode them in tile units (when a specific image area is the target of image processing, only the necessary image tiles are subject to image processing. If this is applied to the wavelet transform coding method, the filter used for the wavelet transform protrudes from the tile boundary.
[0034]
In other words, those using JPEG encoding such as Flash Picks are easy to encode in units of tiles because the encoding process is closed in the tile, whereas in wavelet transform encoding, the processing is easy. Since it protrudes around the tile, there is a problem that encoding processing and management in tile units becomes difficult.
[0035]
Furthermore, the conventional wavelet transform coding requires a memory that holds all the output of the wavelet transform unit 3301 in FIG. 33, that is, the wavelet transform coefficients in FIG. 35B. At this time, the wavelet transform coefficients are the same as the original image. Therefore, there is a problem that the required amount of memory becomes large. This problem becomes more prominent when dealing with high-resolution images.
[0036]
The present invention has been made in view of such a problem, and realizes high-function, high-efficiency coding with a small-scale hardware configuration by realizing the decoding of a plurality of resolutions and management by tile using wavelet transform. It is possible.
[0037]
[Means for Solving the Problems]
  The image encoding method of the present invention divides image data into tiles of N pixels × M pixels, and outputs N pixels × M pixels in the tile as encoding target data corresponding to each tile; The subband division is performed by extrapolating predetermined data around the encoding target data corresponding to the tile, and each tile is independently wavelet-encoded. Generating management information for decoding; and adding the management information to the encoded information to generate a bitstream, wherein the management information is a bitstream of encoded information of each tile. Information that indicates the position above and information that manages and identifies each subband.Therefore, they are arranged together at a position independent of the encoded information.It is characterized by that.
[0038]
  Image decoding of the present inventionMethodIs the image dataTheEach tile, and each tile is independentNiEncoded information obtained by wavelet encoding;Manage encoding informationA bitstream consisting of management information forAs an input, the decoded image corresponding to the required tile and resolutionAn image decoding method for decoding, comprising:The management information includes the size of the encoding information corresponding to each tile or resolution, and is collectively arranged at a position independent of the encoding information, and the storage position of the encoding information corresponding to the tile or resolution to be decoded is described above. Analyzing based on size, and based on the storage location,Performing wavelet decoding and said wavelet decodingTataIlunitConcatenated decoded imagesDoStep andAn image decoding method provided.
[0039]
DETAILED DESCRIPTION OF THE INVENTION
  Hereinafter, embodiments of the present invention will be described in detail. FIG. 1 shows an image encoding according to the first embodiment of the present invention.How to explainIt is a block diagram.
[0040]
The image data of the original image as shown in FIG. 2A is first divided into tiles of N pixels × M pixels determined in advance by the tile dividing unit 101. The divided image is shown in FIG. The tile dividing unit 101 outputs an image of N pixels × M pixels in the tile as data corresponding to each tile.
[0041]
Of the divided tiles, the subsequent processing will be described for the tile i in FIG. The wavelet transform unit 102 divides the image data of the tile i into subbands.
[0042]
Here, when subband division processing is performed near the periphery of the tile, data around the tile is extrapolated. That is, as shown in FIG. 37 (b), when the calculation target range 3702 of the filter used for the wavelet transformation protrudes outside the tile, data outside the tile is required. Insert and sub-band.
[0043]
As an extrapolation method, for example, as shown in FIG. 2C, a method of generating a mirror image by folding an image in a tile is used. Subsequently, the quantization unit 103 quantizes the wavelet transform coefficient, and the entropy encoding unit 104 performs entropy encoding to obtain encoded data of the tile i.
[0044]
For entropy coding, Huffman coding or arithmetic coding can be used. The wavelet transform unit 102, the quantization unit 103, and the entropy coding unit 104 are collectively referred to as a wavelet transform coding unit 105.
[0045]
On the other hand, the management information generation unit 106 uses the tile division information regarding the spatial position of each tile obtained from the tile division unit 101 and the information on each subband obtained from the wavelet transform coding unit 105, Management information for managing and identifying tiles and subbands is generated. This management information is used by the encoded data integration unit 107.
[0046]
The encoded data integration unit 107 uses the management information output from the management information generation unit 106 to organize and integrate the encoded information output from the entropy encoding unit 104, and to manage the management information in the bitstream. In addition, final encoded data is created.
[0047]
Here, encoded data is managed according to subbands and tiles when decoding an image, only images with different resolutions such as the example shown in FIG. 32A, or specific tiles in the image. This makes it possible to decrypt
[0048]
An example of the bit stream of the encoded data created in this way is shown in FIG. The bitstream is composed of a header that manages information of the entire bitstream and encoding information for each tile. The encoding information for each tile includes a tile header that manages information for each tile, and an image tile. It is composed of encoded information for each tile encoded by the wavelet transform encoding unit 105.
[0049]
In the tile header, information on bit positions corresponding to each subband is described. By referring to this, it is possible to know where a bit string corresponding to a necessary subband is located.
[0050]
Of course, the configuration of the bitstream according to the present invention is not limited to that shown in FIG. For example, the subband information of each tile is separated separately as shown in FIG. 4 (b) with respect to the one shown in FIG. 4 (a) having the same configuration as FIG. The tile information may be added to the band information to form an independent tile. In this way, it is possible to quickly reproduce the reduced overall image by accessing only the tiles of the reduced image.
[0051]
  Next, image coding according to Embodiment 2 of the present inventionMethodWill be described. Here, the image coding of the second embodimentMethodThe configuration is the same as the block diagram of the first embodiment described above with reference to FIG. 1, and only the operation of the tile dividing unit 101 is different. Therefore, hereinafter, the operation of the tile dividing unit 101 will be described with reference to FIG.
[0052]
In the tile dividing unit 101 of the first embodiment, after dividing an original image into tiles of N × M pixels, when outputting a specific tile to the wavelet transform unit 102, only the image data inside the tile is cut out as an output. However, the tile dividing unit 101 according to the second embodiment uses a unit that cuts out and outputs data by multiplying the original image by an appropriate window function.
[0053]
For example, when the tile ij in FIG. 5 is cut out, the result of multiplying the original image data by the window function FXi in the horizontal direction and then the window function FYj in the vertical direction is set as the output of the tile dividing unit 101. Note that i is a tile number in the horizontal direction, and j is a tile number in the vertical method.
[0054]
As a result, the result of multiplying the hatched image in FIG. 5 by the weight according to the window function is the output of the tile dividing unit 101. Here, as the window function, a window function having a total sum of 1 through all sections is used.
[0055]
That is,
ΣFXi (x) = 1 (0 ≦ x ≦ w)
ΣFYj (y) = 1 (0 ≦ y ≦ h)
Use a window function that satisfies
[0056]
However, w represents the width of the original image, h represents the height of the original image, and the x and y axes are assumed to have the upper left corner of the original image as the origin O, and are taken rightward and downward, respectively.
[0057]
Further, it is assumed that the sum of FXi (x) is taken for i and the sum of FXj (Y) is taken for j. FXi−1, FXi, FY1, FYj, and FYj + 1 in FIG. 5 represent a part of functions that satisfy such conditions.
[0058]
As a result of the data extraction by the window function, the output of the tile dividing unit 101 includes not only the pixels inside the tile ij but also surrounding pixels in the encoding target data with weights according to the value of the window function. become.
[0059]
  Next, the image coding of the first embodiment described aboveMethodImage decoding to decode data encoded withMethodWill be described as Embodiment 3 of the present invention. FIG. 6 shows image decoding according to the third embodiment.How to explainIt is a block diagram.
[0060]
  The input encoded data is the image encoding described in the first embodiment.MethodIt is encoded with. A management information separation unit 401 separates and extracts management information related to tile division and management information related to subbands from the encoded data.
[0061]
Based on the extracted management information, the encoded data extraction unit 402 determines and extracts encoded information portions of necessary tiles and subbands in the encoded information in response to a user request. In the example of the bit stream shown in FIG. 3, the management information is in the header and the tile header.
[0062]
The extracted encoded information is entropy-decoded by the entropy decoding unit 403 and dequantized by the inverse quantization unit 404 to obtain wavelet transform coefficients corresponding to the decoding target tile.
[0063]
The wavelet transform coefficient is subjected to inverse wavelet transform by the inverse wavelet transform unit 405, and a decoded image of the target tile is obtained. The entropy decoding unit 403, the inverse quantization unit 404, and the inverse wavelet transform unit 405 are collectively referred to as a wavelet transform decoding unit 406.
[0064]
Further, the tile connecting unit 407 connects the decoded tile groups based on the tile division information from the management information generating unit 401 to obtain a decoded image having a desired region / resolution.
[0065]
Referring to the example of the bitstream shown in FIG. 3, when decoding an entire image (all tiles) with a low resolution, a code corresponding to a subband with a low resolution while referring to the subband information of each tile header. .., Ia,..., Which are converted data parts, are wavelet transform decoded sequentially by the wavelet transform decoding unit 406 for each tile.
[0066]
Then, if the obtained low resolution tiles are connected by the tile connecting unit 407, an entire image with a low resolution can be obtained.
[0067]
In addition, when it is desired to enlarge a specific tile i from the low-resolution decoded image and display it at the highest resolution, the entire i-th tile encoded information that is encoded information corresponding to the tile i may be decoded.
[0068]
That is, if i-b is extracted in addition to the already extracted encoded information ia and decoded together with ia, a desired decoded image can be obtained. Of course, if all the encoded information (all tiles, all subbands) is decoded, it is possible to obtain decoded images of all regions with high resolution.
[0069]
As described above, an image of an arbitrary resolution and an arbitrary tile can be easily decoded according to a user request.
[0070]
  Next, image decoding according to the fourth embodiment of the present invention.MethodWill be described. The input encoded data is the image encoding described in the second embodiment.MethodIt is encoded by. Here, the image decoding of Embodiment 4MethodThe configuration is the same as that of the third embodiment described above with reference to FIG. 6, and only the operation of the tile connecting unit 407 is different. Therefore, hereinafter, the operation of the tile dividing unit 407 will be described with reference to FIG.
[0071]
  Image coding according to Embodiment 2MethodThen, since the pixel to be encoded of each tile includes the peripheral pixels of the tile, the size of the decoded data of the tile decoded by the wavelet transform decoding unit 406 is larger than the size of the tile.
[0072]
In FIG. 7, the tile is composed of 2 pixels × 2 pixels, and the size of the decoded data of the tile is 4 pixels × 4 pixels. In this case, the decoded data of the tile ij is a hatched portion in FIG. 7 and overlaps with the adjacent tile by the width of one pixel.
[0073]
The tile connecting unit 407 obtains a pixel value by adding the decoded data at the position where the decoded data overlaps when the tiles are connected. For example, for pixel a in FIG.
a (i-1, j-1) + a (i, j-1) + a (i-1, j) + a (i, j)
To calculate the pixel value.
[0074]
Here, a (i, j) represents the decoded data of the tile ij at the position of the pixel a.
[0075]
  Next, image coding according to Embodiment 5 of the present inventionMethodWill be described. FIG. 8 shows image encoding according to the fifth embodiment.How to explainIt is a block diagram.
[0076]
  Image coding of embodiment 5MethodIs the image coding of the first embodiment described above with reference to FIG.MethodThe difference is that when a tile is wavelet transform encoded, it is not extrapolated around the tile unconditionally, but another tile around the target tile is used if it exists. is there.
[0077]
As in the case of the first embodiment, as shown in FIG. 9A, the subsequent processing is described for the tile i in the original image divided by the tile dividing unit 501. When the image data of the tile i is converted by the wavelet transform unit 503, if a surrounding pixel exists in a region where the filter used for the wavelet transform protrudes from the tile i, the tile i is also wavelet transformed using the data of the pixel. .
[0078]
That is, in order to perform wavelet transform on the tile i in FIG. 9A, first, the wavelet indicated by hatching in FIG. 9B is selected from the tiles around the tile i in FIG. 9A. After the surrounding pixel area necessary for the conversion is added to the tile i, the wavelet conversion of the tile i is performed.
[0079]
This additional processing is performed by the surrounding pixel adding unit 502. Based on the tile division information obtained from the tile dividing unit 501, it is determined whether another tile exists around the encoding target tile, and the tile exists. In this case, necessary pixels are added.
[0080]
In the above example, the peripheral pixel adding unit 502 adds all the surrounding tiles and outputs tile image data. Therefore, the wavelet transform unit 503 to which the peripheral pixel adding unit 502 is input receives the wavelet according to the first embodiment that processes an image of a single tile. It is necessary to convert a large image as compared with the conversion unit 102.
[0081]
When the converted image becomes large, a device using the converted image requires a large work area, leading to an increase in cost and a decrease in operation speed. Therefore, another mode in which the converted image is made smaller is effective and will be described below.
[0082]
As shown in FIGS. 9C and 9D, the area added by the peripheral pixel adding unit 502 is limited to the x direction or the y direction, and the tile image data input to the wavelet transform unit 503 is reduced. It is.
[0083]
For example, in the case of FIG. 9C, necessary pixels are added when different tiles exist above and below the encoding target tile. For the left and right sides of the tile to be encoded, a method of generating a mirror image by folding the image in the tile is used. In the case of FIG. 9D, the top and bottom and the left and right are reversed from the case of FIG. 9C.
[0084]
As a method of performing wavelet transform, a method of repeating subband division using only one of FIGS. 9B, 9C, and 9D, or FIGS. 9B and 9C for each subband. ) And (d) are methods for switching the pixel addition method.
[0085]
Note that only the wavelet transform coefficient of the encoding target tile i is required as an output of the wavelet transform unit 503, and the pixels added by the surrounding pixel adding unit 502 are the wavelet transform coefficients of the pixels inside the tile i. Used only to calculate.
[0086]
Subsequently, quantization is performed by the quantization unit 504, and entropy coding is performed by the entropy coding unit 505, thereby obtaining coding information of the tile i. The wavelet transform unit 503, the quantization unit 504, and the entropy coding unit 505 are collectively referred to as a wavelet transform coding unit 506.
[0087]
On the other hand, the management information generation unit 507 uses the tile division information regarding the spatial position of each tile obtained from the tile division unit 501 and the information on each subband obtained from the wavelet transform coding unit 506, Management information for managing and identifying tiles and subbands is generated. This management information is used by the encoded data integration unit 508.
[0088]
The encoded data integration unit 508 uses the management information output from the management information generation unit 507 to organize and integrate the encoded information output from the entropy encoding unit 505, and to manage the management information in the bitstream. In addition, final encoded data is created as in the example shown in FIG. 3, for example.
[0089]
  Furthermore, the image coding according to the sixth embodiment of the present inventionMethodWill be described. Image coding according to Embodiment 6MethodThis configuration is the same as that of the fifth embodiment described above with reference to FIG. 8, and only the operation of the surrounding pixel adding unit 502 is different. Therefore, hereinafter, the operation of the surrounding pixel adding unit 502 will be described with reference to FIG.
[0090]
The processing of tile i in FIG. 10 will be described as an example. In the surrounding pixel adding unit 502 described as the fifth embodiment, when the tile i is input, all the pixels that are necessary for calculating the wavelet transform coefficient of the pixel in the tile i, that is, the pixels in the range where the filter protrudes are included in the tile i. It was added to. This range is a peripheral pixel range indicated by hatching in FIG.
[0091]
However, since the influence of a pixel far away from the tile i on the wavelet transform coefficient in the tile i is generally small, in the present embodiment, the result obtained by multiplying the peripheral pixel to be added by an appropriate weighting function is obtained. By adding to, the number of pixels to be added is reduced and the amount of calculation is reduced.
[0092]
As the weighting function, a function is used such that the portion close to the tile i is 1 and the distance is close to 0. The weighting function shown in FIG. 10 is an example. In the example of FIG. 10, as a result of multiplication by the weighting function, the pixels actually added are only effective pixel portions with halftone dots, and the outside is a pixel necessary for wavelet transform but is regarded as 0 and added. Not.
[0093]
As the weighting function, besides the one shown in FIG. 10, a step function that can be 1 if the distance from the tile i is within a certain standard and 0 if it is further away can be used.
[0094]
  Next, image coding according to Embodiment 7 of the present inventionMethodWill be described. FIG. 11 shows the image encoding according to the seventh embodiment.How to explainIt is a block diagram.
[0095]
  Image coding according to Embodiment 7MethodIs the image coding of the first embodiment described above with reference to FIG. 1 and the fifth embodiment described above with reference to FIG.MethodThe difference is that the wavelet transform unit 701 performs wavelet transform on the entire original image before tiling the original image, and then arranges the wavelet transform coefficients, which are the output of the wavelet transform unit 701, in tile units. It is a point which constitutes a tile instead.
[0096]
In FIG. 11, the original image is wavelet transformed by a wavelet transform unit 701 before being tiled. Next, the tile configuration unit 702 performs sorting to collect the wavelet transform coefficients corresponding to the same tile in space and configure the tiles.
[0097]
FIG. 12A shows an example of subbands obtained by wavelet transform by the wavelet transform unit 701. FIG. In this case, the coefficient b0 in the subband of the lowest frequency in FIG. 12A is the space between the coefficient parts b1, b2, b3, b4, b5, b6, b7, b8, b9 in the other subbands. In correspondence.
[0098]
Here, b1 to b3 are composed of 1 × 1, b4 to b6 are composed of 2 × 2, and b7 to b9 are composed of 4 × 4 coefficients. An embodiment in which these b0 to b9 are extracted from the respective sub-bands, and are configured in the form shown in FIG. 12B as one tile, and all other wavelet transform coefficients are rearranged in units of tiles. The result similar to that obtained when the original image is divided into tiles in step 5 and then wavelet transformed is obtained.
[0099]
Note that b0 does not have to be one coefficient, and may be a block of coefficients composed of k × 1 coefficients. In this case, b1 to b3 are composed of k × l, b4 to b6 are composed of 2k × 2l, and b7 to b9 are composed of 4k × 4l coefficients.
[0100]
The tiled wavelet transform coefficient output from the tile construction unit 702 is quantized by the quantization unit 703 and entropy-encoded by the entropy encoding unit 704 to become encoded information.
[0101]
On the other hand, the management information generation unit 706 uses the tile division information regarding the spatial position of each tile obtained from the tile configuration unit 702 and the information on each subband obtained from the wavelet transform coding unit 705. Management information for managing and identifying tiles and subbands is generated. This management information is used by the encoded data integration unit 707.
[0102]
The encoded data integration unit 707 uses the management information output from the management information generation unit 706 to organize and integrate the encoded information output from the entropy encoding unit 704, and the management information in the bitstream In addition, final encoded data is created as in the example shown in FIG. 3, for example.
[0103]
Note that the tile configuration unit 702 is arranged before the quantization unit 703, but the present invention is not limited to this. For example, the tile configuration unit 702 may be arranged after the quantization unit 703.
[0104]
  In addition, the image encoding according to any one of the fifth to seventh embodiments described above.MethodImage decoding to decode data encoded byMethodWill be described as Embodiment 8 of the present invention. FIG. 13 shows image decoding according to the eighth embodiment.How to explainIt is a block diagram. The encoded data to be input is the image encoding according to any one of the fifth to seventh embodiments.MethodIt is the encoded data encoded by.
[0105]
In FIG. 13, management information separation unit 901 separates management information related to tile division and management information related to subbands from the encoded data, and based on the extracted management information, the encoded data extraction unit 902 extracts the user information. In response to the request, a necessary encoded information part in the encoded information is determined and extracted. That is, encoded data corresponding to a necessary tile and resolution is extracted.
[0106]
The extracted encoded information is entropy-decoded by the entropy decoding unit 903 in units of tiles, dequantized by the inverse quantization unit 904, and wavelet transform coefficients corresponding to tiles necessary for decoding are obtained.
[0107]
The wavelet transform coefficient is subjected to inverse wavelet transform by the inverse wavelet transform unit 905, and a decoded image including data of surrounding pixels is obtained. The entropy decoding unit 903, the inverse quantization unit 904, and the inverse wavelet transform unit 905 are collectively referred to as a wavelet transform decoding unit 906.
[0108]
Further, the tile integration unit 907 integrates the decoded tile group based on the management information from the management information separation unit 901. Here, the spatially overlapped portion of the decoded image of each tile is superimposed to obtain the entire decoded image.
[0109]
  That is, in the second embodiment described above with reference to FIG. 5, wavelet transform is performed including the peripheral pixels of the tile. Also, the image coding according to the fifth embodimentMethodAs shown in FIG. 9B, the peripheral pixels of the tile are used at the time of wavelet transform. Similarly, the peripheral pixels are also used in the sixth embodiment described above with reference to FIG.
[0110]
  Also, the image coding according to the seventh embodimentMethodHowever, the process using the peripheral pixels of the tile is not clearly described, but when the entire original image is wavelet transformed, the process equivalent to the fifth embodiment is performed in principle.
[0111]
For this reason, when wavelet transform decoding is performed by the wavelet transform decoding unit 906 in FIG. 13, peripheral pixel data is generated, and the tile integration unit 907 superimposes the peripheral pixels of the decoded tile on adjacent tiles. For the superposition, addition between pixels is used.
[0112]
  Next, image decoding according to Embodiment 9 of the present inventionMethodWill be described. This is the image decoding of the eighth embodiment.MethodSimilarly to the image coding according to any one of Embodiments 5 to 7MethodDecoding using encoded data encoded withMethodIt is. FIG. 14 shows image decoding according to the ninth embodiment.How to explainIt is a block diagram.
[0113]
In FIG. 14, the management information separation unit 1001 separates and extracts management information related to tile division and management information related to subbands from the encoded data, and the encoded data extraction unit 1002 extracts the management information based on the extracted management information. In accordance with a user request, a necessary encoded data portion in the encoded information is determined and extracted. That is, encoding information corresponding to a necessary tile and resolution is extracted.
[0114]
The extracted encoding information is entropy-decoded by the entropy decoding unit 1003 in units of tiles, and dequantized by the inverse quantization unit 1004, and wavelet transform coefficients corresponding to tiles necessary for decoding are obtained. Here, the wavelet transform coefficient rearrangement unit 1005 rearranges the wavelet transform coefficients to the state before tiling.
[0115]
That is, the wavelet transform coefficients divided into tile units shown in FIG. 12B are rearranged into the state shown in FIG. When all the tiles have been processed, the entire wavelet transform coefficient shown in FIG. 12A is obtained.
[0116]
Since the rearranged wavelet transform coefficients can be decoded by a single inverse wavelet transform, if the wavelet transform coefficients are inversely wavelet transformed by the inverse wavelet transform unit 1006, the entire decoded image can be obtained.
[0117]
The entropy decoding unit 1003, the inverse quantization unit 1004, and the inverse wavelet transform unit 1006 are collectively referred to as a wavelet transform decoding unit 1007. The wavelet transform coefficient rearranging unit 1005 is arranged at the subsequent stage of the inverse quantization unit 1004, but is not limited to this, and may be disposed, for example, before the inverse quantization unit 1004.
[0118]
  Next, image coding according to Embodiment 10 of the present inventionMethodWill be described. FIG. 15 (e) shows the image coding of the first embodiment, the second embodiment, the fifth embodiment, and the sixth embodiment.Method10 is a block diagram showing a portion corresponding to a wavelet transform unit (102 in FIG. 1, 503 in FIG. 8).
[0119]
A memory 1102 in FIG. 15E is for storing the wavelet transform coefficients that are sub-band divided by the wavelet transform unit 1101. At this time, only the wavelet transform coefficient corresponding to the tile currently being processed by the wavelet transform unit 1101 is stored in the memory 1102, and when the wavelet transform of the tile is completed, the data is converted into a quantization unit (FIG. 1) as the next step. 103 and 504) of FIG.
[0120]
Therefore, the amount of data to be stored in the memory 1102 does not correspond to the entire image, and can be suppressed to the amount of data necessary to wavelet transform one tile.
[0121]
That is, in the wavelet transform without tiling, as shown in FIG. 15A, the conversion target is the entire image, and all the wavelet transform coefficients of FIG. 15B, which is the output of the wavelet transform unit 1101, are stored in the memory. For example, as shown in FIG. 15C, it is necessary to prepare only a memory that can store the wavelet transform coefficients corresponding to FIG. 15D by performing tiling as shown in FIG. Thus, the required memory amount can be greatly reduced.
[0122]
  Image decodingMethodBut the same effect can be expected. Image decoding according to the eleventh embodiment of the present inventionMethodWill be described. FIG. 16E illustrates the image decoding described above as the third, fourth, and eighth embodiments.MethodIt is the block diagram which showed the part corresponding to an inverse wavelet transformation part (405 of FIG. 6, 905 of FIG. 13) among these.
[0123]
First, wavelet transform coefficients necessary to decode one tile are stored in the memory 1201 in FIG. 16E, and subband synthesis is performed by the inverse wavelet transform unit 1202.
[0124]
Therefore, when the decoding target image is shown in FIG. 16B, in the wavelet transform that is not tiled, the amount of data to be stored in the memory is all the wavelet transform coefficients shown in FIG. As shown in FIG. 16D, when decoding a tiled image, the amount of data to be stored in the memory 1201 of this embodiment is the wavelet transform coefficient corresponding to FIG. The amount of memory is greatly reduced.
[0125]
As described above, any of the embodiments of the present invention described above can be configured by adaptively switching using a plurality of subband division filters at the time of wavelet transform in encoding.
[0126]
Here, the sub-band division filter is a low-pass filter and a high-pass filter used for the sub-band division described as the conventional example. In the wavelet transform, subband division is repeated. At this time, there are various types of filters used in each subband division depending on the number of taps and coefficient values.
[0127]
Therefore, if an appropriate filter is used for each subband division, the necessary amount of peripheral pixels of the image to be encoded required by the wavelet transform coefficient can be controlled for each subband, and the balance between the processing amount and the image quality can be controlled. The optimal wavelet transform can be performed.
[0128]
  Such image codingMethodDecoding corresponding toMethodThen, using a subband synthesis filter corresponding to the subband division filter used at the time of wavelet transformation, the inverse wavelet transformation is performed while switching the filters in each subband synthesis.
[0129]
  Next, image coding according to the twelfth embodiment of the present inventionMethodWill be described. In the present embodiment, the input image can be encoded by one of a plurality of predetermined encoding methods.
[0130]
  FIG. 17 shows image encoding according to the twelfth embodiment.MethodExampleTo explainIt is a block diagram, and in the present embodiment, encoding is performed by switching between the system of the first embodiment and the system of the seventh embodiment.
[0131]
In FIG. 17, a tile wavelet encoding unit 1601 performs wavelet encoding on an input image in units of tiles, and outputs encoding information. Further, the tile wavelet encoding unit 1601 outputs tile division information, subband information, and flag information.
[0132]
The management information generation unit 1603 receives the tile division information, the subband information, and the flag information as inputs, and generates and outputs management information by combining them. The encoded data combining unit 107 outputs encoded data obtained by adding the encoded information and management information.
[0133]
In the tile wavelet encoding unit 1601, the input original image is divided by the tile dividing unit 101, and the divided image is input to the terminal 0 of the first switch 1604. Further, the original image is input as it is to the terminal 1 of the first switch 1601. One of these outputs is input to the wavelet encoding unit 1607 via the first switch 1604.
[0134]
The wavelet encoding unit 1607 performs wavelet encoding on the input image. The output of the first wavelet transform unit 1608 is directly input to the quantization unit 103 via the second switch 1605 or is further input to the quantization unit 103 via the tile configuration unit 702.
[0135]
The operation of the first wavelet transform unit 1608 is the same as that of the wavelet transform unit 102 in the first embodiment described above with reference to FIG.
[0136]
Then, the flag generation unit 1602 outputs a flag indicating whether the encoding system of the first embodiment or the encoding system of the seventh embodiment is used, and at the same time, the first switch 1604, the second switch 1605, and the third switch 1606 is controlled.
[0137]
If each switch 1604, 1605, 1606 is coupled to the terminal 0, the same processing as that of the encoding in the first embodiment is performed, and if it is coupled to the terminal 1, the encoding in the seventh embodiment is performed. Performs the same processing as
[0138]
The operation of the tile configuring unit 702 is the same as that of the seventh embodiment described above with reference to FIG.
[0139]
As described above, according to the present embodiment, encoding can be performed in units of tiles, and encoding is performed for each image by the method of the first embodiment that is easy to process, or the processing is slightly complicated. It is possible to selectively switch whether encoding is performed according to the method of the seventh embodiment in which no distortion occurs at the tile boundary.
[0140]
  FIG. 18 shows image encoding according to the twelfth embodiment.MethodAnother example ofTo explainFIG. 4 is a block diagram, and in the present example, encoding can be performed by switching between the system of the first embodiment and the system of the fifth embodiment.
[0141]
  Image coding of this embodimentMethod18, the tile configuration unit 702 related to the seventh embodiment in FIG. 17 is deleted, the peripheral pixel adding unit 502 and the second wavelet encoding unit 1705 related to the fifth embodiment are added, and The switch for switching between these has been changed. Operations other than the tile wavelet encoding unit 1701 and the wavelet encoding unit 1702 in FIG.
[0142]
The wavelet encoding unit 1702 performs wavelet encoding on the input image and outputs encoding information. There are two types of inputs, one connected to the first wavelet transform unit 1608 and the other connected to the second wavelet transform unit 1705.
[0143]
When an image is input to the first wavelet transform unit 1608, the wavelet transform unit 1702 performs the same operation as the wavelet encoding unit 1607. On the other hand, when the image is input to the second wavelet transform unit 1705, the processing of the second wavelet transform unit 1705 is the same as that of the wavelet transform unit 503, and therefore the wavelet encoding unit 1702 is the wavelet encoding unit 506. Behaves the same as
[0144]
In the tile wavelet encoding unit 1701, the input image is divided into tile units and input to the first switch 1703. On the other hand, the peripheral image is added to the divided image and input to the second switch 1704. The flag generation unit 1706 selects whether the wavelet encoding unit 1702 uses the first wavelet transform unit 1608 or the second wavelet transform unit 1705, and outputs a flag indicating this.
[0145]
At the same time, control is performed such that only one of the first switch 1703 and the second switch 1704 is turned on. That is, when the first switch 1703 is on, the divided image is input to the first wavelet transform unit 1608, and processing equivalent to that encoded by the method of the first embodiment is performed. When the second switch 1704 is on, the divided image and the surrounding image are input to the second wavelet transform unit 1705, and processing equivalent to that performed by the method of the fifth embodiment is performed.
[0146]
As a result, encoding can be performed on a tile-by-tile basis, and encoding can be performed for each image by the method of the first embodiment that is easy to process, or the processing is slightly complicated, but no distortion occurs at the tile boundary. It is possible to selectively switch the encoding according to the method of the fifth mode.
[0147]
  Further, FIG. 19 shows the image encoding of the twelfth embodiment.MethodAnother example ofTo explainFIG. 4 is a block diagram, and in the present example, encoding can be performed by switching between the scheme of the first embodiment, the scheme of the fifth embodiment, and the scheme of the seventh embodiment.
[0148]
  Image coding of this embodimentMethodAs shown in FIG. 19, a tile configuration unit 702 according to the seventh embodiment is added in FIG. 18, and a switch for switching these is changed. Operations other than the tile wavelet encoding unit 1801 and the wavelet encoding unit 1807 in FIG.
[0149]
A wavelet encoding unit 1807 performs wavelet encoding of the input image and outputs encoding information. The output of the first wavelet transform unit 1608 is directly input to the quantization unit 103 via the third switch 1805 or is further input to the quantization unit 103 via the tile configuration unit 702. The output of the second wavelet transform unit 1705 is directly input to the quantization unit 103.
[0150]
In the tile wavelet encoding unit 1801, the input image is directly input to the terminal 0 of the first switch 1803, or is input to the terminal 1 of the first switch 1803 after being divided into tiles, or the divided image is divided. An image in which the surrounding pixels are added to the tile is input to the terminal 2 of the first switch 1803.
[0151]
These images are input to the first wavelet transform unit 1608 or the second wavelet transform unit 1705 via the second switch 1804, and output as encoded information via the quantizer 103 and the entropy encoder 104. The
[0152]
The flag generating unit 1802 controls the first switch 1803, the second switch 1804, the third switch 1805, and the fourth switch 1806, and switches the three modes 0, 1, and 2. The numbers shown on the terminals of the respective switches 1803, 1804, 1805, and 1806 indicate the mode numbers.
[0153]
For example, when the first switch 1803 is connected to the terminal 0, the remaining switches 1804, 1805, and 1806 are also connected to the terminal 0. For this reason, when each of the switches 1803, 1804, 1805, and 1806 is connected to the terminal 0, processing equivalent to that encoded by the method of the seventh embodiment is performed.
[0154]
When each of the switches 1803, 1804, 1805, and 1806 is connected to the terminal 1, the same processing as that performed in the method of the first embodiment is performed, and the first switch 1803, the second switch 1804, the fourth switch When the switch 1806 is connected to the terminal 2, the same processing as that performed by the method of the fifth embodiment is performed.
[0155]
As a result, encoding can be performed on a tile-by-tile basis, and encoding can be performed for each image by the method of the first embodiment that is easy to process, or the processing is slightly complicated, but no distortion occurs at the tile boundary. It is possible to selectively switch whether encoding is performed according to the scheme of the fifth or seventh embodiment.
[0156]
  Next, image decoding according to the thirteenth embodiment of the present invention.MethodWill be described. This is the image encoding described above as the twelfth embodiment.MethodImage decoding to decode data encoded withMethodIt is. In the present embodiment, input encoded data is decoded by selecting one of a plurality of predetermined decoding methods.
[0157]
  FIG. 20 illustrates image decoding according to the thirteenth embodiment.MethodExampleTo explainIt is a block diagram and the image decoding of a present ExampleMethodIn the method, encoded data encoded by switching between the method of the first embodiment and the method of the seventh embodiment can be decoded.
[0158]
In FIG. 20, the encoded information and the management information separated by the management information separation unit 401 are input to the tile wavelet decoding unit 1901, respectively. The tile wavelet decoding unit 1901 performs decoding in tile units using the encoded information and management information, and outputs a decoded image.
[0159]
The encoded information is input to the wavelet decoding unit 1902 and subjected to wavelet decoding. The image decoded by the wavelet decoding unit 1902 is directly output via the second switch 1904 or further output via the tile connecting unit 407.
[0160]
In the wavelet decoding unit 1902, the output of the inverse quantization unit 404 is input directly to the first inverse wavelet transform unit 1906 via the first switch 1903 or further through the wavelet coefficient rearrangement unit 1005. To the inverse wavelet transform unit.
[0161]
The operation of the first inverse wavelet transform unit 1906 is the same as that of the inverse wavelet transform unit 405 in the third embodiment described above with reference to FIG.
[0162]
  The flag extraction unit 1905 extracts a flag for controlling the first switch 1903 and the second switch 1904 from the management information. When the switches 1903 and 1904 are connected to the terminal 0, the image decoding according to the third embodiment is performed.MethodWhen the same operation is performed and the terminal 1 is connected, the image decoding according to the ninth embodiment is performed.MethodPerforms the same operation as.
[0163]
The operation of the tile configuring unit 407 is the same as that of the third embodiment described above with reference to FIG.
[0164]
As described above, according to the present embodiment, decoding can be performed in units of tiles, and decoding can be performed for each image by the method of the third embodiment in which processing is simple or processing is slightly complicated. It is possible to selectively switch whether decoding is performed according to the method of the ninth embodiment in which no distortion occurs.
[0165]
  FIG. 21 shows image decoding according to the thirteenth embodiment.MethodAnother example ofTo explainIt is a block diagram and the image decoding of a present ExampleMethodThe encoded data encoded by switching between the method of the first embodiment and the method of the fifth embodiment can be decoded.
[0166]
In FIG. 21, the operations of the parts other than the tile wavelet decoding unit 2001 and the wavelet decoding unit 2002 are the same as those in FIG.
[0167]
The wavelet decoding unit 2002 performs wavelet decoding on the input encoded information. At this time, the output of the inverse quantization unit 404 is input to the first inverse wavelet transform unit 1906 or the second inverse wavelet transform unit 2003 via the first switch 2004.
[0168]
The output of the inverse first wavelet transform unit 1906 is input to the tile connection unit 407, and the output of the second wavelet transform unit 2003 is input to the tile integration unit 907.
[0169]
The operation of the second inverse wavelet transform unit 2003 is the same as that of the inverse wavelet transform unit 905 in the eighth embodiment described above with reference to FIG.
[0170]
The tile wavelet decoding unit 2001 performs wavelet decoding on the encoded information input by the wavelet decoding unit 2002, and the output of the wavelet decoding unit 2002 is connected to either the tile connection unit 407 or the tile integration unit 907, and the decoded image Is played.
[0171]
  On the other hand, the flag extraction unit 2005 extracts a flag from the input management information, and the first switch 2004 is switched by the extracted flag. When the first switch 2004 is connected to the terminal 0, the image decoding according to the third embodiment is performed.MethodWhen the same operation is performed and the terminal 1 is connected, the image decoding according to the eighth embodiment is performed.MethodBehaves the same as
[0172]
As a result, decoding can be performed in units of tiles, and decoding can be performed for each image by the method of the third embodiment in which processing is simple, or the processing is slightly complicated, but distortion in the tile boundary does not occur. It is possible to selectively switch whether the decoding is performed by the method.
[0173]
  Furthermore, FIG. 22 shows image decoding according to the thirteenth embodiment.MethodAnother example ofTo explainIt is a block diagram and the image decoding of a present ExampleMethodIn the method, encoded data encoded by switching the method of the first embodiment, the method of the fifth embodiment, and the method of the seventh embodiment can be decoded.
[0174]
  Image decoding of this embodimentMethodAs shown in FIG. 22, in FIG. 21, a wavelet coefficient rearranging unit 1005 is added, and a switch for switching them is changed. In the figure, the operations of the parts other than the tile wavelet decoding unit 2101 and the wavelet decoding unit 2102 are the same as those in FIG.
[0175]
The wavelet decoding unit 2102 performs wavelet decoding on the input encoded information. At this time, the output of the inverse quantization unit 404 is directly input to the first inverse wavelet transform unit 1906 via the terminal 0 of the first switch 2103, or the terminal 1 of the first switch 2103 and the wavelet coefficient rearrangement unit The first inverse wavelet transform unit 1906 or the second inverse wavelet transform unit 2003 via the terminal 2 of the first switch 2103.
[0176]
The output of the first inverse wavelet transform unit 1906 is input to the tile connecting unit 407 via the second switch 2104 or a decoded image is directly output. The output of the second inverse wavelet transform unit 2003 is input to the tile integration unit 907. Since the operation of other parts is the same as that of the wavelet decoding unit 2002, the description thereof is omitted.
[0177]
In the tile wavelet decoding unit 2101, the flag extraction unit 2105 extracts a flag from the management information. The first switch 2103 and the second switch 2104 are controlled by the extracted flag information. The remaining management information is input to the tile connecting unit 407 and the tile integrating unit 907.
[0178]
  When the switches 2103 and 2104 are connected to the terminal 0, the image decoding according to the third embodiment is performed.MethodWhen the same operation is performed and the terminal 1 is connected, the image decoding according to the ninth embodiment is performed.MethodWhen the first switch 2103 is connected to the terminal 2, the image decoding according to the eighth embodiment is performed regardless of the connection destination of the second switch 2104.MethodPerforms the same operation as.
[0179]
In this way, decoding can be performed in units of tiles, and encoding is performed for each image by the method of the third embodiment, which is easy to process, or the processing is slightly complicated, but distortion does not occur at the tile boundary. Alternatively, it is possible to selectively switch whether the decoding is performed according to the method of the ninth embodiment.
[0180]
  Next, image coding according to Embodiment 14 of the present inventionMethodWill be described. In the present embodiment, information for distinguishing tiles is added to management information for managing tiles so that encoding information of a target tile can be decoded at high speed.
[0181]
  FIG. 23 shows image coding according to the fourteenth embodiment.MethodExampleTo explainIt is a block diagram. In FIG. 23, an input original image is encoded in tile units by a tile wavelet encoding unit 2201, and information for management (for example, tile division information, flag information, subband information) and encoding information are included. Generated.
[0182]
The ID generation unit 2202 generates ID information for distinguishing each tile. The management information generation unit 2203 adds management information and the ID information to generate management information. The encoded data combining unit 2204 combines the encoded information and the management information, and further adds a start code indicating the head of the tile to the head of each tile to generate encoded data.
[0183]
  As an example of the format of the encoded data, as shown in FIG. 24A, the information of each tile is composed of the start code of the tile, management information (tile header), and encoded information. The tile wavelet encoding unit 2201 performs image encoding in the first embodiment, the second embodiment, the fifth embodiment, the sixth embodiment, the seventh embodiment, the tenth embodiment, the twelfth embodiment, and the fourteenth embodiment.MethodCan be used.
[0184]
Here, in order to distinguish the tiles obtained by dividing the original image, 1, 2,. . . And ID information are assigned, tiles can be encoded in an arbitrary order, and the order can be changed after encoding. If the order of encoding the tiles is determined in advance, the ID generation unit 2202 can be omitted.
[0185]
Since each tile starts with a start code, it is possible to identify where each tile is located using this as a mark. Instead, when the data amount of the tile (a combination of the encoding information and the tile header) is used, it is possible to identify where each tile is located.
[0186]
  FIG. 25 shows image encoding according to the fourteenth embodiment.MethodAnother example ofTo explainFIG. 24 is a block diagram showing the image encoding shown in FIG.MethodIn addition, a data amount measuring unit 2301 for calculating the size of the tile is added, and description of operations other than the data amount measuring unit 2301 and the management information generating unit 2302 is omitted.
[0187]
In FIG. 25, the data amount measuring unit 2301 measures the amount of data encoded for each tile and outputs this. The management information generation unit 2302 generates management information by adding the information for management, ID information, and the amount of tile data.
[0188]
As an example of the format of the encoded data, as shown in FIG. 24B, the data amount of the encoded information of the tile is arranged at the head of each tile, and subsequently, the other management information (tile header) and the code Followed by conversion information. Note that the data amount of tiles does not necessarily have to be arranged at the head of each tile, and can be collected at the head, for example.
[0189]
  Further, FIG. 26 shows the image encoding of the fourteenth embodiment.MethodAnother example ofTo explainFIG. 26 is a block diagram showing the image encoding shown in FIG.MethodThe encoded data rearranging unit 2401 is added to the above, and the description of the operation of other parts is omitted.
[0190]
In FIG. 26, the encoded data rearranging unit 2401 extracts the data amount of each tile from the encoded data created by the encoded data combining unit 2204, arranges these at the head of the encoded data, and then performs the rest. Arrange in order and output encoded data.
[0191]
As an example of the format of the encoded data, as shown in FIG. 24C, the position of the target tile can be easily calculated by adding the data amounts of all the tiles arranged at the head. .
[0192]
  The same effect can be obtained with the configuration shown in FIG. FIG. 27 shows image encoding according to the fourteenth embodiment.MethodAnother example ofTo explainFIG. 26 is a block diagram showing the image encoding shown in FIG.MethodIn addition, an encoded data storage buffer 2501 and a management information storage buffer 2502 are added, and description of operations other than the encoded data storage buffer 2501, the management information storage buffer 2502, and the encoded data combining unit 2503 is omitted.
[0193]
In FIG. 27, the encoded information output from the tile wavelet encoding unit 2201 is temporarily stored in the encoded data storage buffer 2501. The management information accumulation buffer 2502 accumulates the management information of each tile generated by the management information generation unit 2302, extracts the tile data amount from the management information, and outputs this to the encoded data combination unit 2503. Next, the remaining management information is output.
[0194]
The encoded data combining unit 2503 first outputs the data amount of all the input tiles, and combines and outputs the remaining management information and encoded information.
[0195]
As described above, according to the present embodiment, it is possible to search and decode the encoded information of the tile to be decoded from the encoded data at high speed.
[0196]
  Next, image decoding according to the fifteenth embodiment of the present invention.MethodWill be described. FIG. 28 shows image decoding according to the fifteenth embodiment.MethodTheTo explainIt is a block diagram, and the present embodiment is the image coding of the fourteenth embodiment described aboveMethodImage decoding to decode data encoded withMethodIt is.
[0197]
In FIG. 28, the decoding tile determination unit 2603 determines the tile ID to be decoded according to the user's instruction. The management information separation unit 2606 retrieves a start code indicating the head of each tile from the encoded data, and separates the management information and the encoded information regarding the tile.
[0198]
Based on the management information, the data skip control unit 2602 determines whether the tile ID of a tile to be decoded is the determined tile ID. If this tile ID is the tile ID, the first switch 2605 and the second switch Turn on 2604. In this way, the tile wavelet decoding unit 2601 can decode only a specific tile.
[0199]
  When the data amount of the tile is described in the tile management information, the management information separation unit 2606 does not need to search the head of each tile, and it is only necessary to skip the amount of data described. The tile wavelet decoding unit 2601 performs image decoding according to the third, fourth, eighth, ninth, eleventh, eleventh, thirteenth, and fifteenth embodiments.MethodCan be used.
[0200]
As described above, according to the present embodiment, it is possible to quickly decode a target tile by decoding only the management information at the head of the tile without decoding all the encoded data.
[0201]
  Next, image coding according to Embodiment 16 of the present inventionMethodWill be described. In the present embodiment, peripheral tile information is also added to management information for managing tiles, and encoded information of peripheral tiles can be decoded at high speed.
[0202]
  FIG. 29A shows the image encoding according to the sixteenth embodiment.MethodExampleTo explainIt is a block diagram. Image coding of this embodimentMethodFIG. 23 is obtained by adding a peripheral tile ID determination unit 2801 to the fourteenth embodiment shown in FIG. 23, and the operation of the management information generation unit 2802 is different. For this reason, description of parts other than the peripheral tile ID determination unit 2801 and the management information generation unit 2802 is omitted.
[0203]
  Note that the tile wavelet encoding unit 2801 performs image encoding according to the fifth, sixth, seventh, tenth, twelfth, and fourteenth embodiments.MethodCan be used.
[0204]
In FIG. 29A, the peripheral tile ID determination unit 2801 determines peripheral tile IDs necessary for decoding from tile division information, flag information, subband information, and tile IDs generated by the ID generation unit 2202. The management information creation unit 2802 generates management information in which tile division information, flag information, subband information, and tile ID are added to the surrounding tile ID.
[0205]
Note that the plurality of tile IDs determined by the peripheral tile ID determination unit 2801 do not have to be all tile IDs necessary for encoding. For example, as shown in FIG. You may limit to tile ID of the tile located in the upper left and lower left.
[0206]
As an example of the format of the encoded data, a configuration in which the management information (tile header) includes a tile ID and peripheral tile IDs in FIG.
[0207]
  FIG. 30 shows the image encoding according to the sixteenth embodiment.MethodAnother example ofTo explainIt is a block diagram, and seeks to speed up the search for encoded information tiled at the time of decoding by including position information of neighboring tiles in management information. Image coding of this embodimentMethodIn FIG. 27, the management information storage buffer 2502 is deleted from the fourteenth embodiment shown in FIG. 27, and a data amount storage unit 2901, a relative position calculation unit 2902, and an information storage buffer 2904 are added.
[0208]
Since the operations other than the data amount storage unit 2901, the relative position calculation unit 2902, the information accumulation buffer 2904, the management information generation unit 2903, and the ID generation unit 2905 are the same as those described above, description thereof will be omitted.
[0209]
In FIG. 30, all the encoded information output from the tile wavelet encoding unit 2201 is stored in the encoded data storage buffer 2501, and the tile division information, flag information, and sub information output from the tile wavelet encoding unit 2201 are stored. All pieces of band information are stored in the information storage buffer 2904. The data amount of the encoded information of each tile output from the data amount measuring unit 2301 is all stored in the data amount storage unit 2901.
[0210]
The ID generation unit 2905 outputs ID information for distinguishing each tile, and outputs information stored in the information storage buffer 2904, the data amount storage unit 2901, and the encoded data storage buffer 2501 in units of tiles. Control as follows. The data amount storage unit 2901 outputs the data amount of the tile to the management information generation unit 2903 based on the input tile ID, and is necessary for calculating the relative position of the tile having the tile ID and the surrounding tile. The data amount of the tile is output to the relative position calculation unit 2902.
[0211]
The relative position calculation unit 2902 calculates the relative position where the encoding information of the neighboring tiles exists with respect to the tile to be encoded, using the input data amount of each tile, and outputs the result. The management information generation unit 2903 generates management information from the input tile ID information, tile division information, flag information, subband information, tile data amount, relative position of the surrounding tiles, etc. Output.
[0212]
In this way, by decoding only the management information at the head of the tile without decoding all the encoded data, the encoded data can be generated so that the target tile and surrounding tiles necessary for decoding can be quickly decoded. It becomes possible to do.
[0213]
  Next, the image decoding according to the nineteenth embodiment of the present invention.MethodWill be described. FIG. 31 shows image decoding according to the nineteenth embodiment.MethodTheTo explainIt is a block diagram, and the present embodiment is the image coding according to the eighteenth embodiment described above.MethodImage decoding to decode data encoded withMethodIt is.
[0214]
In this embodiment, a buffer 3001 is added to the fifteenth embodiment shown in FIG. 28, and the operations other than the buffer 3001 and the data read skip controller 3002 are the same as those in FIG. Omitted.
[0215]
In FIG. 31, input encoded data is stored in a temporary buffer 3001 and sequentially output. The data skip control unit 3002 extracts the ID of the tile to be decoded based on the input management information, and if this is the determined tile ID or the tile ID of the surrounding tile, the first switch 2605 and the second switch Switch 2604 is turned on.
[0216]
If the management information includes tile IDs of peripheral tiles necessary for decoding, control is performed so that encoded information of the peripheral tiles is output from the buffer 3001. In this way, the tile wavelet decoding unit 2601 can decode a specific tile and its surroundings.
[0217]
Here, when the decoded peripheral tile ID included in the management information is a predetermined number smaller than the number of peripheral tiles (for example, tiles indicated by halftone dots in FIG. 29B), it is necessary for decoding. Tile IDs at other positions (white tiles in FIG. 29B) are determined from the decoded peripheral tile IDs.
[0218]
  The tile wavelet decoding unit 2601 performs image decoding according to the eighth, ninth, eleventh, thirteenth, and fifteenth embodiments.MethodCan be used.
[0219]
Thus, by decoding only the management information at the head of the tile without decoding all the encoded data, it becomes possible to quickly decode the target tile and surrounding tiles necessary for decoding.
[0220]
  As described above, the image encoding of this embodimentMethodAnd image decodingMethodBy using, it becomes possible to easily decode a decoded image having a resolution according to a user's request without increasing the amount of encoded data. This is a significant advantage compared to an increase in the amount of encoded data by 1.4 times because the flash pix using JPEG supports multiple resolutions.
[0221]
In addition, when dividing an image into tiles and enabling decoding of only a specific area, the encoding by wavelet transform is difficult in principle to be closed in the tile, and is unsuitable for tile division processing. On the other hand, in the present invention, encoding / decoding processing in units of tiles is possible using wavelet transform.
[0222]
In other words, by encoding the image in units of tiles, if it is desired to decode a part of the image, it is only necessary to decode the tile including that area without decoding the entire image, thereby improving the random access function. Can do.
[0223]
【The invention's effect】
According to the image encoding method and the image decoding method of the present invention, when decoding an image, it is possible to decode at different resolutions, or to decode only a specific tile in the image. Further, when it is desired to decode a low resolution image, only the low resolution subband information can be accessed to quickly reproduce the reduced overall image.
[0224]
Furthermore, by managing information indicating the storage position of the encoded information corresponding to each tile at a position independent of the encoded information, the position of the target tile can be easily calculated. Moreover, even if it does not decode all encoding information, the target tile can be quickly decoded by using the management information arranged independently of the encoding information.
[Brief description of the drawings]
FIG. 1 shows image coding according to the first embodiment of the present invention.MethodTheTo explainIt is a block diagram.
[Fig. 2] Image coding according to the first embodiment of the present invention.MethodIt is explanatory drawing explaining these.
[Fig. 3] Image coding according to the first embodiment of the present invention.MethodIt is explanatory drawing which shows an example of the bit stream in.
FIG. 4 is an image encoding according to the first embodiment of the present invention.MethodIt is explanatory drawing which shows another example of the bit stream in.
FIG. 5 is an image coding according to the second embodiment of the present invention.MethodIt is explanatory drawing explaining these.
[Fig. 6] Image decoding according to Embodiment 3 of the present invention.MethodTheTo explainIt is a block diagram.
[Fig. 7] Image decoding according to the fourth embodiment of the present invention.MethodIt is explanatory drawing explaining these.
[Fig. 8] Image coding according to Embodiment 5 of the present invention.MethodTheTo explainIt is a block diagram.
[Fig. 9] Image coding according to Embodiment 5 of the present invention.MethodIt is explanatory drawing explaining these.
[Fig. 10] Image coding according to Embodiment 6 of the present invention.MethodIt is explanatory drawing explaining these.
[Fig. 11] Image coding according to Embodiment 7 of the present invention.MethodTheTo explainIt is a block diagram.
FIG. 12 is an image encoding according to a seventh embodiment of the present invention.MethodIt is explanatory drawing explaining these.
FIG. 13 is an image decoding according to the eighth embodiment of the present invention.MethodTheTo explainIt is a block diagram.
FIG. 14 is an image decoding according to the ninth embodiment of the present invention.MethodTheTo explainIt is a block diagram.
FIG. 15 is an image encoding according to the tenth embodiment of the present invention.MethodTheTo explainIt is a block diagram and explanatory drawing explaining the operation | movement.
FIG. 16 is an image decoding according to the eleventh embodiment of the present invention.MethodTheTo explainIt is a block diagram and explanatory drawing explaining the operation | movement.
FIG. 17 is an image encoding according to a twelfth embodiment of the present invention.MethodExampleTo explainIt is a block diagram.
FIG. 18 is an image encoding according to a twelfth embodiment of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 19 is an image encoding according to a twelfth embodiment of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 20 is an image decoding according to the thirteenth embodiment of the present invention.MethodExampleTo explainIt is a block diagram.
FIG. 21 shows image decoding according to the thirteenth embodiment of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 22 is an image decoding according to the thirteenth embodiment of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 23: Image coding according to the fourteenth embodiment of the present inventionMethodExampleTo explainIt is a block diagram.
[Fig. 24] Image coding according to Embodiment 14 of the present invention.MethodExample of bitstream inTo explainIt is explanatory drawing.
FIG. 25: Image coding according to the fourteenth embodiment of the present inventionMethodAnother example ofTo explainIt is a block diagram.
[Fig. 26] Image coding according to Embodiment 14 of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 27: Image coding according to the fourteenth embodiment of the present inventionMethodAnother example ofTo explainIt is a block diagram.
FIG. 28 shows image decoding according to the fifteenth embodiment of the present invention.MethodTheTo explainIt is a block diagram.
FIG. 29 is an image encoding according to the sixteenth embodiment of the present invention.MethodExampleTo explainIt is a block diagram and explanatory drawing explaining the operation | movement.
FIG. 30 is an image encoding according to the sixteenth embodiment of the present invention.MethodAnother example ofTo explainIt is a block diagram.
FIG. 31 is an image decoding according to the seventeenth embodiment of the present invention.MethodTheTo explainIt is a block diagram.
FIG. 32 shows conventional technology.To explainIt is a block diagram and explanatory drawing explaining the operation | movement.
FIG. 33 is a block diagram showing a conventional technique.
FIG. 34 is a block diagram showing a conventional technique.
FIG. 35 is an explanatory diagram for explaining a conventional technique.
FIG. 36 is a block diagram showing a conventional technique.
FIG. 37 is an explanatory diagram for explaining a conventional technique.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 101 ... Tile division part, 102 ... Wavelet transformation part, 103 ... Quantization part, 104 ... Entropy coding part, 105 ... Wavelet transformation coding part, 106 ... Management information generation part, 107 ... Encoded data integration part, 401 ... Management information separation unit, 402 ... encoded data extraction unit, 403 ... entropy coding unit, 404 ... inverse quantization unit, 405 ... inverse wavelet transform unit, 406 ... wavelet transform decoding unit, 407 ... tile concatenation unit, 501 ... tile Dividing unit, 502 ... Ambient pixel adding unit, 503 ... Wavelet transform unit, 504 ... Quantizing unit, 505 ... Entropy coding unit, 506 ... Wavelet transform coding unit, 507 ... Management information generating unit, 508 ... Integrated encoded data , 701... Wavelet transform unit, 702... Tile configuration unit, 703. Entropy encoding unit, 705 ... wavelet transform encoding unit, 706 ... management information generation unit, 707 ... encoded data integration unit, 901 ... management information separation unit, 902 ... encoded data extraction unit, 903 ... entropy decoding unit, 904 ... dequantization unit, 905 ... inverse wavelet transform unit, 906 ... wavelet transform decoding unit, 907 ... tile integration unit, 1001 ... management information separation unit, 1002 ... encoded data extraction unit, 1003 ... entropy decoding unit, 1004 ... inverse Quantization unit, 1005... Wavelet transform coefficient rearrangement unit, 1006... Inverse wavelet transform unit, 1007... Wavelet transform decoding unit, 1101... Wavelet transform decoding unit, 1102. 1701, 1801, 2101 2201 ... Tile wavelet encoding unit, 1602, 1706, 1802, 1905, 2005, 2105 ... Flag generation unit, 1603, 2203, 2302, 2802, 2903 ... Management information generation unit, 1604, 1703, 1803, 1903, 2004, 2103 , 2605 1st switch, 1605, 1704, 1804, 1904, 2104, 2604 ... 2nd switch, 1606, 1805 ... 3rd switch, 1607, 1702, 1807 ... wavelet encoding unit, 1608 ... first wavelet encoding unit 1705 ... 2nd wavelet encoding unit, 1806 ... 4th switch, 2204, 2503 ... encoded data combining unit, 1901, 2001, 2601 ... tile wavelet decoding unit, 1902, 2002, 2102 ... wavelet ..., 1906... 1st inverse wavelet transform unit, 2003... 2nd inverse wavelet transform unit, 2202 and 2905... ID creation unit, 2301... Data amount measurement unit, 2401. ... encoded data storage buffer, 2502 ... management information storage buffer, 2602, 3002 ... data skip control unit, 2603 ... decoding tile determination unit, 2801 ... peripheral tile ID determination unit, 2901 ... data amount storage unit, 2902 ... relative position Calculation unit, 3001 ... buffer, 2606 ... management information separation unit, 2904 ... information storage buffer, 3201, 3205, 3209, 3213 ... tile division unit, 3204, 3208, 3212 ... 1/2 reduction unit, 3202, 3206, 3210, 3214 ... JPEG compression unit, 3203, 3207, 3211, 215: Encoded data integration unit, 3301 ... Wavelet transform unit, 3302 ... Quantization unit, 3303 ... Entropy decoding unit, 3304 ... Wavelet transform coding unit, 3401, 3414, 3426 ... Horizontal low-pass filter, 3402, 3415, 3427 ... horizontal high-pass filters, 3403, 3405, 3416, 3434, 3428, 3430 ... vertical low-pass filters, 3404, 3406, 3417, 3419, 3429, 3431 ... vertical high-pass filters, 3407 to 3412, 3420 to 3425, 3432 to 3437: 1/2 sub-sampling unit, 3613: sub-band of horizontal low band / vertical low band, 3601 ... entropy decoding unit, 3602 ... inverse quantization unit, 3603 ... inverse wavelet transform unit, 3604 Wavelet transform decoding unit, 3701 ... filter application pixel, 3702 ... filter calculation target range.

Claims

Dividing the image data into tiles of N pixels × M pixels and outputting N pixels × M pixels in the tile as encoding target data corresponding to each tile;
Extrapolating predetermined data around the encoding target data corresponding to each tile to perform subband division, and independently performing wavelet encoding on each tile;
Generating management information for independently decoding arbitrary tiles in units of subbands;
Adding the management information to the encoded information to generate a bitstream;
Arrangement wherein the management information includes information indicating a position in the bit stream of the coded information for each tile, seen including a management-information identifying each subband, are summarized in a position independent from said coded information picture coding method characterized in that it is.

Dividing the image data into tiles, tiles and resolution of each tile type and c Eburetto coded coding information independently, a bit stream composed of a management information for managing the encoded information requires An image decoding method for decoding a decoded image according to
The management information includes the size of encoding information corresponding to each tile or resolution, and is collectively arranged at a position independent of the encoding information,
Analyzing the storage position of the encoded information corresponding to the tile or resolution to be decoded based on the size;
Performing wavelet decoding based on the storage location ;
And a step of connecting the decoded image of the wavelet decoded tile units,
Image decoding method.

The image decoding method according to claim 2, wherein
The step of analyzing the storage position of the encoding information corresponding to the tile to be decoded or the resolution based on the size is an image obtained by adding the storage position of the encoding information corresponding to the tile to be decoded or the resolution to the size Decryption method.