JP2006319556A

JP2006319556A - Image decoding device

Info

Publication number: JP2006319556A
Application number: JP2005138759A
Authority: JP
Inventors: Ikuro Ueno; 幾朗上野; Masayuki Yoshida; 雅之吉田; Ryuta Suzuki; 隆太鈴木
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2005-05-11
Filing date: 2005-05-11
Publication date: 2006-11-24
Anticipated expiration: 2025-05-11
Also published as: JP4579049B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image decoding device capable of extracting and decoding the code data of the code amount appropriate to browsing by taking both of the image quality for a reproduced image and a processing time required for decoding into consideration. <P>SOLUTION: The image decoding device comprises a code stream storage means 210 for accumulating code streams, a code stream extraction means 201 for extracting a predetermined code data from the code streams, a code stream decoding means 220 for decoding code streams extracted by the code stream extraction means, a code amount detection means 205 for detecting the code amount giving a specified layer or the image quality, a decoding speed measurement means 208 for measuring a time required for decoding and a decoding speed, a decoding speed storage means 207 for storing the decoding speed measured by the decoding speed measurement means, and an extracted packet determining means 206 for determining a code data to be extracted from code streams based on the code amount detected by the code amount detection means and the decoding speed stored at the decoding speed storage means. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

この発明は、静止画像の階層的な符号データを対象とした画像復号装置に関するものである。 The present invention relates to an image decoding apparatus for hierarchical code data of still images.

現在、インターネットを中心に静止画像符号化アルゴリズムＪＰＥＧが広く普及しているが、一方で次世代の符号化方式としてさらなる性能改善、機能付加の要求を背景として標準化された、ＪＰＥＧ２０００が次第に使われ始めてきている。以下に、標準書（非特許文献１参照）に従ってＪＰＥＧ２０００符号化アルゴリズムの基本方式の概略を説明する。 Currently, the still image coding algorithm JPEG is widespread mainly on the Internet. On the other hand, JPEG2000, which has been standardized against the background of the demand for further performance improvement and function addition as the next generation coding method, is gradually being used. It is coming. In the following, an outline of the basic scheme of the JPEG2000 encoding algorithm will be described according to the standard (see Non-Patent Document 1).

まず、入力される画像信号は、ウェーブレット変換部で２次元のウェーブレット変換が施され、複数のサブバンドに帯域分割される。ここで、２次元のウェーブレット変換は１次元のウェーブレット変換の組み合わせとして実現される。つまり、垂直方向の一次元ウェーブレット変換を列毎に順次行う処理と水平方向の一次元ウェーブレット変換をライン毎に順次行う処理である。また、１次元のウェーブレット変換は、所定の特性を持つローパスフィルタとハイパスフィルタおよびダウンサンプラから構成されるものである。 First, an input image signal is subjected to two-dimensional wavelet transform by a wavelet transform unit, and is divided into a plurality of subbands. Here, the two-dimensional wavelet transform is realized as a combination of the one-dimensional wavelet transform. That is, a process of sequentially performing vertical one-dimensional wavelet transform for each column and a process of sequentially performing horizontal one-dimensional wavelet transform for each line. The one-dimensional wavelet transform is composed of a low-pass filter, a high-pass filter, and a down sampler having predetermined characteristics.

こうして生成された２次元のウェーブレット変換係数は、低域成分をＬ、高域成分をＨとし、水平方向の変換を１文字目、垂直副走査方向の変換を２文字目で表現することで、ＬＬ、ＨＬ、ＬＨ、ＨＨと表現される。また、これらの帯域分割された成分はサブバンドと呼ばれている。ここで、水平、垂直方向の低域成分（ＬＬ成分）は再帰的にウェーブレッ変換が施される。 The two-dimensional wavelet transform coefficient generated in this way is expressed by expressing the low-frequency component as L, the high-frequency component as H, the horizontal conversion by the first character, and the vertical sub-scanning conversion by the second character. Expressed as LL, HL, LH, HH. These band-divided components are called subbands. Here, the horizontal and vertical low-frequency components (LL components) are recursively subjected to wavelet transform.

次に、各サブバンドのウェーブレット変換係数は、サブバンド毎に設定された量子化ステップサイズにより量子化される。 Next, the wavelet transform coefficient of each subband is quantized by the quantization step size set for each subband.

次に、各サブバンドの量子化後のウェーブレット変換係数をコードブロックと呼ばれる固定サイズの領域に分割した後、多値データからなるコードブロックを２値のビットプレーン表現に変換し、各ビットプレーンを３通りの符号化パス（Significant Propagation Decoding Pass, Magnitude Refinement Pass, Cleanup Pass）に分割する。 Next, after the wavelet transform coefficients after quantization of each subband are divided into fixed-size areas called code blocks, the code block consisting of multi-value data is converted into a binary bit plane representation, and each bit plane is converted to Dividing into three types of encoding passes (Significant Propagation Decoding Pass, Magnitude Refinement Pass, and Cleanup Pass).

３つの符号化パスから出力される２値信号は、それぞれの符号化パス毎にコンテクストモデリングが行われ、エントロピー符号化が行われる。 The binary signal output from the three coding passes is subjected to context modeling for each coding pass and subjected to entropy coding.

また、エントロピー符号化処理と並行して、各コードブロックに於いて符号化パス毎の符号量と歪情報を計算する。 In parallel with the entropy encoding process, the code amount and distortion information for each encoding pass are calculated in each code block.

最後に、ラグランジェの乗数法を用いて画質劣化（符号化歪）を最小にしながら、目標とする符号サイズ以下に符号量を調整するレート制御が一般的に適用される。この方法では、各コードブロックｉにおける切り捨てポイント（それより下位の符号化パスを切り捨てる）をｎｉ、各切捨てポイントまでの符号量をＲ（ｉ，ｎｉ）、符号化歪をＤ（ｉ，ｎｉ）とした時、ラグランジェの乗数法を使い、次式
Σ（Ｒ（ｉ，ｎｉ）＋λＤ（ｉ，ｎｉ））
を最小にする切捨てポイントによって生ずる画面全体での総符号量Ｒが目標符号量Ｒｍａｘの範囲内であることを満足するまで変数λを調整する。ここで、符号化歪とは、ある符号化パスまでを復号した時に発生する再生画像の平均二乗誤差とする。 Finally, rate control that adjusts the code amount to a target code size or less while minimizing image quality degradation (encoding distortion) using the Lagrange multiplier method is generally applied. In this method, the truncation point in each code block i (which truncates the lower encoding pass) is ni, the code amount up to each truncation point is R (i, ni), and the encoding distortion is D (i, ni). When using Lagrange's multiplier method, the following formula Σ (R (i, ni) + λD (i, ni))
The variable λ is adjusted until the total code amount R in the entire screen generated by the cut-off point that minimizes is satisfied within the range of the target code amount Rmax. Here, the coding distortion is a mean square error of a reproduced image generated when decoding up to a certain coding pass.

変数λが求まったら、その変数λに対応する切捨てポイントまでの符号データを全てのコードブロックから集めて、更に各コードブロックにおける符号化パス数を付加情報として付け加え、最終的な符号データを構成する。こうして、目標符号量Ｒｍａｘのもとで符号化歪みを最小とする符号データを生成することができる。 When the variable λ is obtained, the code data up to the truncation point corresponding to the variable λ is collected from all the code blocks, and the number of coding passes in each code block is added as additional information to form the final code data. . In this way, code data that minimizes coding distortion can be generated under the target code amount Rmax.

符号化パスを複数のレイヤに分割してコードストリームを構成することも可能である。これには、目標符号量Ｄｍａｘを小さな値から大きな値まで複数回設定して、前記手順により切り捨てポイントを複数設定する。次に、切り捨てポイント間の符号データをレイヤとして分割し、同一解像度レベル、同一コンポーネント、同一レイヤの符号データをパケットとしてまとめる。各パケットにはそのバイト数などを格納したパケットヘッダを付加する。上位ビットプレーンのパケットを下位ビットプレーンのパケットよりもコードストリーム内で先に配置することにより、重要なパケットほどコードストリーム内で先に配置される構成が実現できる。復号時には、コードストリームの先頭から所望のバイト数を満たすパケットを抽出すれば、所望の符号量のもとで符号化歪が最小となるコードストリームが得られる。 It is also possible to configure the code stream by dividing the coding pass into a plurality of layers. For this purpose, the target code amount Dmax is set a plurality of times from a small value to a large value, and a plurality of truncation points are set by the above procedure. Next, the code data between the cut-off points is divided into layers, and the code data of the same resolution level, the same component, and the same layer are collected as a packet. A packet header storing the number of bytes is added to each packet. By arranging higher-order bit-plane packets earlier in the code stream than lower-bit-plane packets, it is possible to realize a configuration in which more important packets are placed earlier in the code stream. At the time of decoding, if a packet satisfying a desired number of bytes is extracted from the head of the code stream, a code stream with a minimum coding distortion can be obtained under a desired code amount.

以上のようなＪＰＥＧ２０００国際標準規格は、ＩＳＯやＩＴＵ−Ｔ等の標準化機関を通して入手することができる。また、ＪＰＥＧ２０００の最新情報については、http:／／www.jpeg.orgを参照することにより入手することができる。 The JPEG2000 international standard as described above can be obtained through a standardization organization such as ISO or ITU-T. The latest information on JPEG2000 can be obtained by referring to http://www.jpeg.org.

ＩＳＯ／ＩＴＵ１５４４４-１:２０００ISO / ITU 15444-1: 2000

上述した符号データ抽出方法では、所望の符号量の符号データを抽出し、復号することが可能だが、処理にどれだけの時間が掛かるかがわからない、また、表示される再生画像がどの程度の画質のものかがわからないといった問題があった。 In the code data extraction method described above, it is possible to extract and decode code data of a desired code amount, but it is not known how much time it takes to process, and how much image quality the reproduced image to be displayed is. There was a problem that I did not know what.

この発明は前記のような問題点を解決するためになされたもので、復号側で指定される所望の画質あるいは処理時間を考慮して抽出符号量を決定することにより、復号側のユーザが表示までに必要以上長く待つことや、再生画質が極端に悪くなることを防ぎ、画像の快適な閲覧を提供することを目的とする。 The present invention has been made to solve the above-described problems, and a decoding-side user can display by determining an extraction code amount in consideration of a desired image quality or processing time designated on the decoding side. An object of the present invention is to provide a comfortable browsing of images by preventing waiting for an unnecessarily long time or an extremely poor reproduction image quality.

この発明に係る画像復号装置は、複数のレイヤに分割して構成されたコードストリームを復号する画像復号装置であって、コードストリームを蓄積するコードストリーム蓄積手段と、前記コードストリーム蓄積手段に蓄積されたコードストリームから所定の符号データを抽出するコードストリーム抽出手段と、前記コードストリーム抽出手段により抽出されたコードストリームを復号するコードストリーム復号手段と、指定されたレイヤあるいは画質を与える符号量を検出する符号量検出手段と、復号に要する時間と復号速度を計測する復号速度計測手段と、前記復号速度計測手段により計測された復号速度を記憶する復号速度記憶手段と、前記符号量検出手段により検出された符号量及び前記復号速度記憶手段に記憶された復号速度に基づいて前記コードストリームから抽出する符号データを決定する抽出パケット決定手段とを備えたものである。 An image decoding apparatus according to the present invention is an image decoding apparatus that decodes a code stream configured by being divided into a plurality of layers, the code stream storing means for storing the code stream, and stored in the code stream storing means. A code stream extraction unit for extracting predetermined code data from the code stream; a code stream decoding unit for decoding the code stream extracted by the code stream extraction unit; and a code amount that provides a specified layer or image quality. Code amount detection means, decoding speed measurement means for measuring the time and decoding speed required for decoding, decoding speed storage means for storing the decoding speed measured by the decoding speed measurement means, and detection by the code amount detection means Code amount and the decoding speed stored in the decoding speed storage means. Those with an extraction packet determining means for determining the encoded data to be extracted from the code stream Te.

また、他の発明に係る画像復号装置は、複数のレイヤに分割して構成されたコードストリームを復号する画像復号装置であって、コードストリームを蓄積するコードストリーム蓄積手段と、前記コードストリーム蓄積手段に蓄積されたコードストリームから所定の符号データを抽出するコードストリーム抽出手段と、指定されたレイヤあるいは指定された画質を与える符号量とコードストリームを伝送するコードストリーム・符号量情報伝送手段と、指定されたレイヤあるいは画質を与える符号量を検出する符号量検出手段と、指定されたレイヤあるいは指定された画質を与える符号量とコードストリームを受信するコードストリーム・符号量情報受信手段と、抽出されたコードストリームを復号するコードストリーム復号手段と、復号に要する時間を計測する復号速度計測手段と、復号速度を記憶する復号速度記憶手段と、伝送速度を計測する伝送速度計測手段と、伝送速度を記憶する伝送速度記憶手段と、前記コードストリームから抽出する符号データを決定する抽出パケット決定手段とを備えたものである。 An image decoding apparatus according to another invention is an image decoding apparatus for decoding a code stream configured by being divided into a plurality of layers, the code stream storing means for storing the code stream, and the code stream storing means Code stream extraction means for extracting predetermined code data from the code stream stored in the code stream, a code amount for giving a specified layer or a specified image quality and a code stream / code quantity information transmission means for transmitting the code stream, and designation Code amount detection means for detecting a code amount that gives a specified layer or image quality, a code amount that gives a specified layer or a specified image quality, and a code stream / code amount information receiving means for receiving a code stream; Code stream decoding means for decoding the code stream and necessary for decoding A decoding speed measuring means for measuring the transmission time; a decoding speed storing means for storing the decoding speed; a transmission speed measuring means for measuring the transmission speed; a transmission speed storing means for storing the transmission speed; Extraction packet determination means for determining code data is provided.

この発明によれば、再生画像の画質と復号に要する処理時間の両方を考慮することにより、閲覧に適切な符号量の符号データを抽出して復号することができる。 According to the present invention, it is possible to extract and decode code data having a code amount appropriate for browsing by taking into consideration both the image quality of a reproduced image and the processing time required for decoding.

実施の形態１．
本実施の形態１による画像復号装置では、復号装置内部の記憶装置に格納されたコードストリームからユーザが静止画像を閲覧するのに適当な符号量の符号データを抽出して、復号する例を示す。符号化・復号方式としては、ＪＰＥＧ２０００を使って説明する。なお、符号化はリアルタイムで行うのではなく、予め実行しておき、コードストリームを蓄積しておくことを想定している。 Embodiment 1 FIG.
In the image decoding apparatus according to the first embodiment, an example in which code data having a code amount appropriate for a user to view a still image is extracted from a code stream stored in a storage device inside the decoding apparatus and decoded is shown. . The encoding / decoding method will be described using JPEG2000. It is assumed that encoding is not performed in real time, but is executed in advance and a code stream is accumulated.

まず、コードストリームの構成を説明するために、図１に示す画像符号化装置の構成を示すブロック図を使ってコードストリームを生成する符号化手順を説明する。
図１に示す画像符号化装置は、入力画像信号を２次元のウェーブレット変換を再帰的に行うウェーブレット変換手段１０１と、ウェーブレット変換手段１０１によって生成されたウェーブレット変換係数を予め設定された量子化ステップサイズで量子化処理する量子化手段１０２と、量子化されたウェーブレット変換係数をビットプレーンに分解し２値算術符号化するエントロピー符号化手段１０３と、エントロピー符号化された符号データを所定のフォーマットに変換してコードストリームを生成するフォーマッティング手段１０４と、各コードブロックにおいてある符号化パスの符号化が終了する度に、その符号化パスまでを復号した場合に発生する符号化歪Ｄを算出するレート歪特性算出手段１０５と、コードストリームを蓄積するコードストリーム蓄積手段１１０とを備えている。 First, in order to describe the configuration of the code stream, an encoding procedure for generating a code stream will be described using the block diagram showing the configuration of the image encoding device shown in FIG.
The image encoding apparatus shown in FIG. 1 includes a wavelet transform unit 101 that recursively performs two-dimensional wavelet transform on an input image signal, and a wavelet transform coefficient generated by the wavelet transform unit 101 with a preset quantization step size. Quantization means 102 for performing quantization processing, entropy coding means 103 for decomposing quantized wavelet transform coefficients into bit planes and binary arithmetic coding, and converting entropy-coded code data into a predetermined format And a rate distortion that calculates a coding distortion D that occurs when decoding up to that coding pass every time a coding pass in each code block is finished. The characteristic calculation means 105 and a code for storing the code stream And a de stream storage means 110.

図には示していない画像入力装置（例えば、イメージスキャナやデジタルカメラ、もしくはネットワークや記憶媒体等）から入力された画像信号は、ウェーブレット変換手段１０１で１次元のウェーブレット変換が垂直方向、水平方向の両方向に対して２次元的に施され、サブバンドに帯域分割される。ここで、１次元のウェーブレット変換は、低域通過フィルタと高域通過フィルタのフィルタバンクによって構成される。 An image signal input from an image input device (for example, an image scanner, a digital camera, a network, a storage medium, or the like) not shown in the figure is converted into a one-dimensional wavelet transform by the wavelet transform unit 101 in the vertical direction and the horizontal direction. It is applied two-dimensionally in both directions and is divided into subbands. Here, the one-dimensional wavelet transform is constituted by a filter bank of a low-pass filter and a high-pass filter.

図２は、２次元のウェーブレット変換を２回再帰的に施した例である。図中、先頭の数字は分解レベルを表しており、続くＬまたはＨの２つの英字は、水平方向、垂直方向のフィルタの種類を表している。Ｌは低域通過フィルタを、Ｈは高域通過フィルタを施した結果を表している。また、「再帰的に」ウェーブレット変換を２回施すと言うことは、まず第１回目のウェーブレット変換により、１ＬＬ，１ＨＬ，１ＬＨ，１ＨＨが生成されると、その１ＬＬに対して２回目のウェーブレット変換を施し、２ＬＬ，２ＨＬ，２ＬＨ，２ＨＨを生成することを意味している。 FIG. 2 is an example in which a two-dimensional wavelet transform is recursively performed twice. In the figure, the first number represents the decomposition level, and the following two alphabetic characters L or H represent the types of filters in the horizontal and vertical directions. L represents a low-pass filter, and H represents the result of applying a high-pass filter. In addition, “recursively” performing wavelet transform twice means that when 1LL, 1HL, 1LH, and 1HH are generated by the first wavelet transform, the second wavelet transform is performed on the 1LL. To generate 2LL, 2HL, 2LH, and 2HH.

量子化処理手段１０２では、サブバンド毎に設定された量子化ステップサイズによりウェーブレット変換係数を量子化する。 The quantization processing means 102 quantizes the wavelet transform coefficient with the quantization step size set for each subband.

エントロピー符号化手段１０３では、各サブバンドのウェーブレット変換係数をコードブロックと呼ばれる固定サイズの矩形領域に分割した後、多値データからなるそれぞれのコードブロックを２値のビットプレーンに変換する。通常このコードブロックの大きさは、６４ｘ６４、３２ｘ３２などのサイズに設定される。 The entropy encoding unit 103 divides the wavelet transform coefficients of each subband into fixed-size rectangular areas called code blocks, and then converts each code block made up of multi-value data into a binary bit plane. Usually, the size of this code block is set to a size of 64 × 64, 32 × 32 or the like.

ここで、図３を用いてビットプレーンの分解について詳しく説明する。図３において、（ａ）は４ｘ４のコードブロックの一例を表している。これらのデータに対して、正負を表す１ビットの信号と絶対値の表現に変換し、それらのデータを縦方向に２進表現した結果を各行単位に並べたものが（ｂ）となる。次に、（ｂ）に対して同一のビット番号のビットを集めたものが（ｃ）となる。ここで、最下位ビット（ＬＳＢ：Least Significant Bit）を第０ビット、第上位ビット（ＭＳＢ：Most Significant Bit）を第３ビットとした時、第０ビットで集めたものを第０ビットプレーン、第１ビットで集めたものを第１ビットプレーン、第２ビットで集めたものを第２ビットプレーン、第３ビットで集めたものを第３ビットプレーンとしている。これ以外にも正負を表すビットの集まりとして符号ビットプレーンを作成する。 Here, the decomposition of the bit plane will be described in detail with reference to FIG. In FIG. 3, (a) represents an example of a 4 × 4 code block. These data are converted into a 1-bit signal representing positive and negative values and an absolute value representation, and the result of binary representation of the data in the vertical direction is arranged in units of rows (b). Next, (c) is a collection of bits having the same bit number with respect to (b). Here, when the least significant bit (LSB) is the 0th bit and the most significant bit (MSB) is the 3rd bit, the 0th bit plane is a collection of the 0th bit. A collection of 1 bits is a first bit plane, a collection of 2 bits is a second bit plane, and a collection of 3 bits is a third bit plane. In addition to this, a sign bit plane is created as a collection of bits representing positive and negative.

エントロピー符号化手段１０３では、ビットプレーン内の各ビットを、そのコンテクストに応じて、３通りの符号化パス、シグニフィカントプロパゲーションデコーディングパス（Significant Propagation Decoding Pass）、マグニチュードリファインメントパス（Magnitude Refinement Pass）、クリーンナップパス（Cleanup Pass）に分割する。 In the entropy encoding means 103, each bit in the bit plane is divided into three types of encoding pass, significant propagation decoding pass (Significant Propagation Decoding Pass), and magnitude refinement pass (Magnitude Refinement Pass) according to the context. And divide it into Cleanup Pass.

次に、それぞれの符号化パス毎に算術符号化するためのコンテクストモデリングを行う。但し、ＭＳＢプレーンから数えて全て０となるビットプレーンはコンテクストモデリング、符号化は行わず、全て０のビットプレーンの数をヘッダに書くだけとする。そして、最初に１が出現したビットプレーンについては全てのビットがクリーンナップパスに分類されるが、その他のビットプレーンについては前述したように３種類の符号化パスに分類される。 Next, context modeling for arithmetic coding is performed for each coding pass. However, the bit planes that are all 0s from the MSB plane are not subjected to context modeling and encoding, and only the number of all 0 bit planes is written in the header. For the bit plane in which 1 appears first, all bits are classified as a cleanup pass, while the other bit planes are classified into three types of encoding passes as described above.

図４は、コードブロックのビットプレーン数が６で有効なビットプレーン数が４の場合の例を示している。コンテストモデリングが終了すると、算術符号によるエントロピー符号化が行われる。エントロピー符号化は全てのビットプレーンに対して実行され、生成された符号データは、フォーマッティング手段１０４に一旦保存される。 FIG. 4 shows an example in which the number of bit planes in the code block is 6 and the number of effective bit planes is 4. When the contest modeling is completed, entropy coding using arithmetic codes is performed. Entropy coding is performed on all bit planes, and the generated code data is temporarily stored in the formatting means 104.

全てのコードブロックのエントロピ符号化を終了したら、各コードブロックの符号化パスをレイヤと呼ばれる単位に分割する。例えば再生画像のＳＮＲ向上への寄与度の高い符号化パスほど上位レイヤとして、符号化パスをいくつかレイヤに分割して、同一解像度、同一コンポーネント、空間位置の近いコードブロックの同一レイヤの符号データをまとめて、パケットと呼ばれる単位を構成する。 When entropy encoding of all code blocks is completed, the encoding pass of each code block is divided into units called layers. For example, the higher the contribution level to the SNR improvement of the reproduced image, the higher the upper layer, the coding pass is divided into several layers, and the code data of the same layer of the code block of the same resolution, the same component, and the close spatial position Are combined to form a unit called a packet.

図５にコードストリームの概念図を示す。ここでは説明を簡単にするために、コンポーネント数１、空間的な位置によるパケットの分割は行わない場合、つまりレイヤと解像度レベルだけからパケットを分類し、上位レイヤのパケットほどコードストリーム内で先に配置する例を示す。ウェーブレット変換係数の２ＬＬが解像度レベル０、２ＨＬ、２ＬＨ、２ＨＨが解像度レベル１、１ＨＬ、１ＬＨ、１ＨＨが解像度レベル２である。この例では、各解像度レベルを５つのレイヤに分割しているので、全１５個のパケットＫｒ,ｓ（解像度レベルｒ＝０〜２、レイヤｓ＝０〜４）が定義される。 FIG. 5 shows a conceptual diagram of the code stream. Here, for the sake of simplicity, in the case where the number of components is one and the packet is not divided by the spatial position, that is, the packets are classified based on only the layer and the resolution level. An example of arrangement will be shown. Wavelet transform coefficients 2LL are resolution levels 0, 2HL, 2LH, 2HH are resolution levels 1, 1HL, 1LH, and 1HH are resolution levels 2. In this example, since each resolution level is divided into five layers, a total of 15 packets Kr, s (resolution level r = 0-2, layer s = 0-4) are defined.

次に、各パケットにそのパケットのバイト数などの属性を格納したパケットヘッダを付加した上で所定の順序に並べる。コードストリームの先頭には、画像のサイズ、ウェーブレット変換レベル数などの情報を格納したメインヘッダを、最後にはコードストリームの終わりを示すＥＯＣマーカと呼ばれるコードを付けて、ひとつのコードストリームとなる。 Next, a packet header storing attributes such as the number of bytes of the packet is added to each packet, and the packets are arranged in a predetermined order. A code stream called an EOC marker indicating the end of the code stream is added to the main header storing information such as the image size and the number of wavelet transform levels at the head of the code stream, thereby forming one code stream.

次に、レイヤの決定方法を示す。レート歪特性算出手段１０５では、各コードブロックにおいてある符号化パスの符号化が終了する度に、その符号化パスまでを復号した場合に発生する符号化歪Ｄを算出する。なお、本実施の形態では、符号化歪をある符号化パスまでの復号した時の再生画像の平均二乗誤差として定義する。また、各コードブロックにおいてある符号化パスの符号化が終了する度にその符号化パスまでの累積符号量Ｒをカウントする。 Next, a layer determination method will be described. The rate distortion characteristic calculation means 105 calculates the coding distortion D that occurs when decoding up to the coding pass every time the coding pass in each code block is finished. In this embodiment, encoding distortion is defined as a mean square error of a reproduced image when decoding up to a certain encoding pass. Further, every time encoding of a certain coding pass is completed in each code block, the accumulated code amount R up to that coding pass is counted.

すべてのコードブロックの符号化が終了し、全てのコードブロックで符号化歪と符号量の関係が計算できたら、目標総歪量Ｄｍａｘ以下で符号量が最小となる符号化パスを全てのコードブロックに対して見つける処理を開始する。この最適化問題はラグランジェの未定乗数法を用いることで実現できる。具体的には、あるコードブロックｉにおける符号化パスｎｉまでの符号量をＲ（ｉ，ｎｉ）、符号化歪をＤ（ｉ，ｎｉ）とした時、
Σ（Ｄ（ｉ，ｎｉ）＋λＲ（ｉ，ｎｉ））
が最小となるポイントを見つけ、その時の総歪量Ｄｓｕｍが目標歪量Ｄｍａｘとなるようなλを求めることになる。これは、図６に示すように、符号量Ｒと符号化歪Ｄのグラフで、傾き−λが接線となるようなポイントを見つけ、そのポイントの符号化歪の総和ＤｓｕｍがＤｍａｘとなるまでλを調整することと同じである。 When the encoding of all the code blocks is completed and the relationship between the encoding distortion and the code amount can be calculated in all the code blocks, all the code blocks are passed through the encoding path having the minimum code amount below the target total distortion amount Dmax. The process of finding for is started. This optimization problem can be realized by using Lagrange's undetermined multiplier method. Specifically, when the code amount up to the encoding pass ni in a certain code block i is R (i, ni) and the encoding distortion is D (i, ni),
Σ (D (i, ni) + λR (i, ni))
Is found, and λ is obtained so that the total distortion amount Dsum at that time becomes the target distortion amount Dmax. As shown in FIG. 6, in the graph of the code amount R and the encoding distortion D, a point where the slope −λ is a tangent is found, and the sum Dsum of the encoding distortion at that point becomes Dmax. Is the same as adjusting

次に、λ調整方法の具体的な例を、図７を用いて説明する。まず、傾き−λの下限値を与えるλｍａｘ、上限値を与えるλｍｉｎを定義し（図７のステップ７０１）、λ＝（λｍｉｎ＋λｍａｘ）／２となるλを計算する（図７のステップ７０２）。コードブロックｉにおいてＤ（ｉ，ｎｉ）＋λＲ（ｉ，ｎｉ）が最小となる符号化パスｎｉを求め、その符号量をＤ（ｉ，ｎｉ）とする。全てのコードブロック対してΣＤ（ｉ，ｎｉ）＝Ｄｓｕｍを計算し（図７のステップ７０３〜７０７）、
Ｄｓｕｍ≧Ｄｍａｘであれば λｍａｘ＝λ
Ｄｓｕｍ＜Ｄｍａｘであれば λｍｉｎ＝λ
とする（図７のステップ７０８〜７１０）。 Next, a specific example of the λ adjustment method will be described with reference to FIG. First, λmax giving the lower limit value of the slope −λ and λmin giving the upper limit value are defined (step 701 in FIG. 7), and λ that satisfies λ = (λmin + λmax) / 2 is calculated (step 702 in FIG. 7). An encoding pass ni that minimizes D (i, ni) + λR (i, ni) in the code block i is obtained, and its code amount is set to D (i, ni). ΣD (i, ni) = Dsum is calculated for all code blocks (steps 703 to 707 in FIG. 7),
If Dsum ≧ Dmax, λmax = λ
If Dsum <Dmax, λmin = λ
(Steps 708 to 710 in FIG. 7).

新しいλｍｉｎ、λｍａｘが求められたら、先と同じように、λ＝（λｍｉｎ＋λｍａｘ）／２の計算を行い、以下同様に繰り返す。以上の処理をλｍａｘ−λｍｉｎ≦ε（εは正の定数）となるまで繰り返す（図７のステップ７０２〜７０３）。こうして、最終的に求められたλに対する符号化パスｎｉが求めるべき符号化パスとなる。 When new λmin and λmax are obtained, λ = (λmin + λmax) / 2 is calculated in the same manner as before, and the same is repeated thereafter. The above processing is repeated until λmax−λmin ≦ ε (ε is a positive constant) (steps 702 to 703 in FIG. 7). In this way, the encoding pass ni for the finally obtained λ becomes the encoding pass to be obtained.

フォーマッティング手段１０４では、各コードブロックにおいて符号化パスｎｉより上位の符号化パスの符号データを集めることにより所望の符号化歪Ｄｍａｘ以下で総符号量が最小となるレイヤを作成することができる。 The formatting means 104 can create a layer having a minimum total code amount below a desired encoding distortion Dmax by collecting code data of an encoding pass higher than the encoding pass ni in each code block.

ここで、最初のレイヤではＤｍａｘを大きな値として前記の処理を実行し、次第に小さな値を設定して、各レイヤの境界となる符号化パスを算出する。例えば、表１に示すように、Ｄｍａｘを設定すれば、客観的画質評価尺度として広く使用されているＰＳＮＲ（＝１０×ｌｏｇ（２５５×２５５／Ｄｍａｘ））のレイヤ構造が、レイヤが上がるごとに高くなるよう形成される。 Here, in the first layer, the above-described processing is executed with Dmax being a large value, and gradually decreasing values are calculated to calculate a coding pass that becomes a boundary between the layers. For example, as shown in Table 1, when Dmax is set, the layer structure of PSNR (= 10 × log (255 × 255 / Dmax)) widely used as an objective image quality evaluation scale is It is formed to be higher.

次に、本実施の形態１に係る画像復号装置について図８を用いて説明する。この画像復号装置では、画像復号装置内部に蓄積されたコードストリームから、再生画像の画質と復号に要する処理時間の両方を考慮して適切な符号量の符号データを部分的に抽出して復号する。各レイヤで達成する符号化歪は予め復号側でもわかっているものとする。 Next, the image decoding apparatus according to the first embodiment will be described with reference to FIG. In this image decoding device, code data having an appropriate code amount is partially extracted from the code stream stored in the image decoding device in consideration of both the image quality of the reproduced image and the processing time required for decoding, and decoded. . It is assumed that the coding distortion achieved in each layer is known in advance on the decoding side.

図８に示されるように、実施の形態１に係る画像復号装置は、コードストリーム蓄積手段２１０から表示するのに必要なコードストリームを部分的に抽出するコードストリーム抽出手段２０１と、コードストリームを復号して再生画像を生成するコードストリーム復号手段２２０と、コードストリームの所望のパケットのバイト数を検出する符号量検出手段２０５と、復号すべきパケットを決定する抽出パケット決定手段２０６と、復号速度を記憶しておく復号速度記憶手段２０７と、エントロピー復号、逆量子化、ウェーブレット逆変換に要する処理時間と処理した符号量から復号速度を算出する復号速度計測手段２０８とを備えている。 As shown in FIG. 8, the image decoding apparatus according to Embodiment 1 decodes a code stream extraction unit 201 that partially extracts a code stream necessary for display from the code stream storage unit 210 and the code stream. A code stream decoding unit 220 for generating a reproduced image, a code amount detecting unit 205 for detecting the number of bytes of a desired packet of the code stream, an extracted packet determining unit 206 for determining a packet to be decoded, and a decoding speed. Decoding speed storage means 207 for storing, and decoding speed measuring means 208 for calculating the decoding speed from the processing time required for entropy decoding, inverse quantization, and inverse wavelet transform and the amount of processed code are provided.

コードストリーム復号手段２２０は、コードストリームを２値算術復号してウェーブレット変換係数量子化値を再生するエントロピー復号手段２０２と、ウェーブレット変換係数量子化値を予め設定された量子化ステップサイズで逆量子化処理する逆量子化手段２０３と、ウェーブレット変換係数を逆ウェーブレット変換し、再生画像データを生成するウェーブレット逆変換手段２０４とからなる。 The code stream decoding unit 220 includes an entropy decoding unit 202 that performs binary arithmetic decoding on the code stream to reproduce the wavelet transform coefficient quantized value, and dequantizes the wavelet transform coefficient quantized value with a preset quantization step size. It comprises an inverse quantization means 203 for processing, and a wavelet inverse transform means 204 that performs inverse wavelet transform of wavelet transform coefficients to generate reproduced image data.

以下、コードストリーム抽出方法を中心に図９のフローチャートを用いて動作を説明する。コードストリームの抽出方法には、画質優先モードと処理時間優先モードの２種類の復号モードがあり、ユーザが事前に指定しておく。本実施の形態１では、時間優先モードを説明する。 Hereinafter, the operation will be described with reference to the flowchart of FIG. 9 focusing on the code stream extraction method. There are two types of codestream extraction methods, an image quality priority mode and a processing time priority mode, which are designated in advance by the user. In the first embodiment, the time priority mode will be described.

最初に、コードストリーム抽出手段２０１により標準処理時間Ｔを設定する（図９のステップ９０２）。例えば、３秒など、ユーザが画像を指定してから表示までに要する標準的な時間である。次に、画質を設定するパラメータである標準復号レイヤ数Ｎを設定する（図９のステップ９０３）。Ｎ＝２とすれば、２レイヤが復号される標準的なレイヤ数となり、表１のコードストリームでは、Ｄｍａｘ＝６５（ＰＳＮＲ＝３０）により決定されたレイヤまでが復号されることになる。 First, the standard processing time T is set by the code stream extraction means 201 (step 902 in FIG. 9). For example, it is a standard time required from the time the user designates an image to the display, such as 3 seconds. Next, the standard decoding layer number N, which is a parameter for setting the image quality, is set (step 903 in FIG. 9). If N = 2, the standard number of layers to be decoded is two layers. In the code stream of Table 1, the layers determined by Dmax = 65 (PSNR = 30) are decoded.

次に、抽出パケット決定手段２０６により、復号速度記憶手段２０７から予測復号速度ｐ［bytes／sec］を読み出す（図９のステップ９０４）。この値は、画像が復号されるたびに更新されていくが、最初の画像に対しては、初期値としては適当な値を予め設定しておく。次に、符号量検出手段２０５によりコードストリーム蓄積手段２１０に格納された復号対象となるコードストリームのＮレイヤの符号量ｂ［bytes］を検出する（図９のステップ９０５）。各パケットの符号量は、コードストリームのパケットヘッダに書き込まれている値を読みとることにより得られる。例えば、図５のコードストリームでレイヤ０の符号量は、パケットＫ_０,０、Ｋ_０,１、Ｋ_０,２のパケットヘッダに書き込まれている各パケットの符号量を読みとることにより得られる。 Next, the extracted packet determination unit 206 reads the predicted decoding rate p [bytes / sec] from the decoding rate storage unit 207 (step 904 in FIG. 9). This value is updated each time an image is decoded, but an appropriate value is set in advance as an initial value for the first image. Next, the code amount detection unit 205 detects the code amount b [bytes] of the N layer of the code stream stored in the code stream storage unit 210 (step 905 in FIG. 9). The code amount of each packet is obtained by reading the value written in the packet header of the code stream. For example, the code amount of layer 0 in the code stream of FIG. 5 is obtained by reading the code amount of each packet written in the packet header of the packets K _0,0 , K _0,1 , K _0,2 .

次に、抽出パケット決定手段２０６により、処理に要する時間の予測値である、予測処理時間ｔ＝ｂ／ｐ［sec］を算出する（図９のステップ９０６）。ここで、予測処理時間ｔと標準処理時間Ｔを比較する（図９のステップ９０７）。もし、予測処理時間ｔが標準処理時間Ｔに比べて短ければ、Ｎレイヤ分をコードストリームから抽出し、復号する（図９のステップ９０９、９１０）。逆に、予測処理時間ｔが標準処理時間Ｔに比べて長ければ、標準処理時間で復号できる符号量Ｔｐ［byte］だけを抽出し、復号する（図９のステップ９０８、９１０）。なお、本実施の形態１では、符号を抽出する際、抽出する符号データ境界をパケット境界とする、つまり、ある符号量の符号データを抽出する場合には、実際にはその符号量を超える最も近いパケット境界までを抽出するものとする。例えば、図５のコードストリームで、先頭からパケットＫ_０,２の途中までが抽出するバイト数Ｔｐ［byte］だった場合、抽出するパケットは、Ｋ０,０、Ｋ_０,１、Ｋ_０,２となる。 Next, the extracted packet determining means 206 calculates a predicted processing time t = b / p [sec], which is a predicted value of the time required for processing (step 906 in FIG. 9). Here, the predicted processing time t is compared with the standard processing time T (step 907 in FIG. 9). If the predicted processing time t is shorter than the standard processing time T, N layers are extracted from the code stream and decoded (steps 909 and 910 in FIG. 9). Conversely, if the predicted processing time t is longer than the standard processing time T, only the code amount Tp [byte] that can be decoded by the standard processing time is extracted and decoded (steps 908 and 910 in FIG. 9). In Embodiment 1, when a code is extracted, the code data boundary to be extracted is set as a packet boundary. That is, when code data of a certain code amount is extracted, the code amount that actually exceeds the code amount is the most. It is assumed that the data is extracted up to the nearest packet boundary. For example, in the code stream of FIG. 5, when the number of bytes Tp [byte] to be extracted from the beginning to the middle of the packet K _0,2 is the extracted packet, K0,0, K _0,1 , K _0,2 It becomes.

復号速度計測手段２０８では、エントロピー復号手段がその処理の開始時点に出力する復号開始信号とウェーブレット逆変換手段がその処理の終了時点に出力する復号終了信号を受け取り、それらの受信時間の差から復号に要する時間Δｔを算出する（図９のステップ９１１）。そして、抽出したコードストリームの符号量と処理時間Δｔから、当該コードストリームに対する復号速度ｐ［byte／sec］を算出し（図９のステップ９１２）、復号速度記憶手段２０７に書き込む。次の画像の復号時には、現在のコードストリームに対する復号速度がその予測値として適用されることになる。 The decoding speed measuring means 208 receives a decoding start signal output by the entropy decoding means at the start time of the process and a decoding end signal output by the wavelet inverse transform means at the end time of the process, and decodes from the difference between the reception times. Is calculated (step 911 in FIG. 9). Then, the decoding speed p [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the processing time Δt (step 912 in FIG. 9) and written in the decoding speed storage means 207. At the time of decoding the next image, the decoding speed for the current code stream is applied as the predicted value.

以上が一つのコードストリームに対する処理手順であり、さらに次に復号する画像が指定されれば、次の画像に対して予測復号速度読み込み（図９のステップ９０４）から処理を実行する。 The above is the processing procedure for one code stream, and if an image to be decoded next is designated, the process is executed for the next image after reading the predicted decoding speed (step 904 in FIG. 9).

なお、本実施の形態１では、符号化歪が所定の値となるようなレイヤ分割を説明したが、必ずしも符号化歪でなくてもよく、例えば主観的評価尺度（MOS）が所定の値となる符号化パスまでをレイヤとして分割しても構わない。また、抽出する画質の尺度としてレイヤ数を指定する方法を示したが、必ずしもレイヤ数でなくともよく、例えば平均２乗誤差、ＰＳＮＲなどでもよい。 In the first embodiment, the layer division has been described such that the coding distortion has a predetermined value. However, the layering is not necessarily the coding distortion. For example, the subjective evaluation scale (MOS) is a predetermined value. It is possible to divide up to an encoding pass as a layer. In addition, although the method of specifying the number of layers as a measure of the image quality to be extracted has been shown, the number of layers is not necessarily limited, and for example, a mean square error, PSNR, or the like may be used.

もし、単に標準復号レイヤ数だけ符号データを復号するのであれば、場合によっては復号時間が大幅に長くなってしまうことが生じる。本実施の形態１ではそのような場合、標準処理時間で復号が可能な範囲に符号量を制限することにより、復号時間が短縮されるという効果が得られる。 If code data is simply decoded by the number of standard decoding layers, the decoding time may be significantly increased in some cases. In the first embodiment, in such a case, an effect that the decoding time is shortened can be obtained by limiting the code amount to a range that can be decoded in the standard processing time.

一方、単に標準処理時間を要する符号量だけを復号するのであれば、場合によっては必要以上の画質が得られる符号データまでを復号してしまうことがある。本実施の形態１ではそのような場合、標準復号レイヤ数の符号データだけを復号することにより、復号時間が短縮されるという効果が得られる。 On the other hand, if only the amount of code that requires the standard processing time is decoded, in some cases, even code data that can obtain an image quality higher than necessary may be decoded. In the first embodiment, in such a case, the decoding time is shortened by decoding only the code data of the standard decoding layer number.

以上のように、本実施の形態１では、再生画像の画質と復号に要する処理時間の両方を考慮することにより、閲覧に適切な符号量の符号データを抽出して復号することが可能となる。 As described above, in the first embodiment, it is possible to extract and decode code data having a code amount appropriate for browsing by considering both the image quality of a reproduced image and the processing time required for decoding. .

実施の形態２．
前記実施の形態１では、復号処理時間が標準処理時間以内となることを最優先に符号データを抽出するモードである、時間優先モードを説明したが、この実施の形態２では、符号歪が所望の値以下となること（所望の画質を達成すること）を最優先に符号データを抽出する画質優先モードの例を説明する。コードストリームの生成方法、コードストリームの構成、復号装置のブロック図は、実施の形態１と同じなので説明を省略し、復号処理におけるコードストリーム抽出の動作を図１０のフローチャートを使って説明する。 Embodiment 2. FIG.
In the first embodiment, the time priority mode, which is a mode in which code data is extracted with the highest priority that the decoding processing time is within the standard processing time, has been described. In the second embodiment, code distortion is desired. An example of an image quality priority mode in which code data is extracted with the highest priority being set to be equal to or less than the value (achieving a desired image quality) will be described. Since the code stream generation method, the code stream configuration, and the block diagram of the decoding apparatus are the same as those in the first embodiment, the description thereof will be omitted, and the code stream extraction operation in the decoding process will be described with reference to the flowchart of FIG.

最初に、コードストリーム抽出手段２０１により標準処理時間Ｔを設定する（図１０のステップ１００２）。例えば、３秒など、ユーザが画像を指定してから表示までに要する標準的な時間である。次に、画質を設定するパラメータである標準復号レイヤ数Ｎを設定する（図１０のステップ１００３）。Ｎ＝２とすれば、２レイヤが復号される標準的なレイヤ数となり、表１のコードストリームでは、Ｄｍａｘ＝６５（ＰＳＮＲ＝３０）により決定されたレイヤまでが復号されることになる。 First, the standard processing time T is set by the code stream extraction means 201 (step 1002 in FIG. 10). For example, it is a standard time required from the time the user designates an image to the display, such as 3 seconds. Next, the standard decoding layer number N, which is a parameter for setting the image quality, is set (step 1003 in FIG. 10). If N = 2, the standard number of layers to be decoded is two layers. In the code stream of Table 1, the layers determined by Dmax = 65 (PSNR = 30) are decoded.

次に、コードストリーム抽出手段２０１により、復号速度記憶手段２０７から予測復号速度ｐ［bytes／sec］を読み出す（図１０のステップ１００４）。この値は、画像が復号されるたびに更新されていくが、最初の画像に対しては、初期値としては適当な値を予め設定しておく。次に、符号量検出手段２０５によりコードストリーム蓄積手段２１０に格納された復号対象となるコードストリームのＮレイヤの符号量ｂ［bytes］を検出する（図１０のステップ１００５）。例えば、図５のコードストリームでレイヤ０の符号量を算出するには、パケットＫ_０,０、Ｋ_０,１、Ｋ_０,２のパケットヘッダに書き込まれている各パケットの符号量を読みとることにより得られる。 Next, the code stream extraction unit 201 reads the predicted decoding rate p [bytes / sec] from the decoding rate storage unit 207 (step 1004 in FIG. 10). This value is updated each time an image is decoded, but an appropriate value is set in advance as an initial value for the first image. Next, the code amount detection unit 205 detects the code amount b [bytes] of the N layer of the code stream stored in the code stream storage unit 210 (step 1005 in FIG. 10). For example, in order to calculate the code amount of layer 0 in the code stream of FIG. 5, the code amount of each packet written in the packet header of the packets K _0,0 , K _0,1 , K _0,2 is read. Is obtained.

次に、抽出パケット決定手段２０６により、処理に要する時間の予測値である、予測処理時間ｔ＝ｂ／ｐ［sec］を算出する（図１０のステップ１００６）。ここで、予測処理時間ｔと標準処理時間Ｔを比較する（図１０のステップ１００７）。もし、予測処理時間ｔが標準処理時間Ｔに比べて短ければ、標準処理時間で復号できる符号量Ｔｐ［byte］だけを抽出し、復号する（図１０のステップ１００８、１０１０）。逆に、予測処理時間ｔが標準処理時間Ｔに比べて長ければ、Ｎレイヤ分を抽出し、復号する（図１０のステップ１００９、１０１０）。なお、本実施の形態２では、符号を抽出する際、抽出する符号データ境界をパケット境界とする。ある符号量の符号データを抽出する場合には、実際にはその符号量を超える最も近いパケット境界までを抽出するものとする。 Next, the extracted packet determining means 206 calculates a predicted processing time t = b / p [sec], which is a predicted value of the time required for processing (step 1006 in FIG. 10). Here, the predicted processing time t is compared with the standard processing time T (step 1007 in FIG. 10). If the predicted processing time t is shorter than the standard processing time T, only the code amount Tp [byte] that can be decoded by the standard processing time is extracted and decoded (steps 1008 and 1010 in FIG. 10). Conversely, if the predicted processing time t is longer than the standard processing time T, N layers are extracted and decoded (steps 1009 and 1010 in FIG. 10). In the second embodiment, when a code is extracted, a code data boundary to be extracted is set as a packet boundary. In the case of extracting code data of a certain code amount, it is actually assumed that the nearest packet boundary exceeding the code amount is extracted.

復号速度計測手段２０８では、エントロピー復号手段がその処理の開始時点に出力する復号開始信号とウェーブレット逆変換手段がその処理の終了時点に出力する復号終了信号を受け取り、それらの受信時間の差から復号に要する時間Δｔを算出する（図１０のステップ１０１１）。そして、抽出したコードストリームの符号量と処理時間Δｔから、当該コードストリームに対する復号速度ｐ［byte／sec］を算出し（図１０のステップ１０１２）、復号速度記憶手段２０７に書き込む。次の画像の復号時には、現在のコードストリームに対する復号速度がその予測値として適用されることになる。 The decoding speed measuring means 208 receives a decoding start signal output by the entropy decoding means at the start time of the process and a decoding end signal output by the wavelet inverse transform means at the end time of the process, and decodes from the difference between the reception times. Is calculated (step 1011 in FIG. 10). Then, the decoding speed p [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the processing time Δt (step 1012 in FIG. 10), and written in the decoding speed storage means 207. At the time of decoding the next image, the decoding speed for the current code stream is applied as the predicted value.

以上が一つのコードストリームに対する処理手順であり、さらに次に復号する画像が指定されれば、次の画像に対して予測復号速度読み込み（図１０のステップ１００４）から処理を実行する。 The above is the processing procedure for one code stream, and if the next image to be decoded is designated, the processing is executed from the predicted decoding speed reading (step 1004 in FIG. 10) for the next image.

なお、本実施の形態２では、符号化歪が所定の値となるようなレイヤ分割を説明したが、必ずしも符号化歪でなくてもよく、例えば主観的評価尺度（MOS）が所定の値となる符号化パスまでをレイヤとして分割しても構わない。また、抽出する画質の尺度としてレイヤ数を指定する方法を示したが、必ずしもレイヤ数でなくともよく、例えば平均２乗誤差、ＰＳＮＲなどでもよい。 In the second embodiment, the layer division is described such that the coding distortion becomes a predetermined value. However, the layer division is not necessarily the coding distortion. For example, the subjective evaluation scale (MOS) is a predetermined value. It is possible to divide up to an encoding pass as a layer. In addition, although the method of specifying the number of layers as a measure of the image quality to be extracted has been shown, the number of layers is not necessarily limited, and for example, a mean square error, PSNR, or the like may be used.

もし、必ず標準復号レイヤ数だけを復号するのであれば、復号時間が標準処理時間に比べ大幅に短くなり、標準処理時間に相当する符号データによる復号画像に比べて低い画質の画像しか再生できないことが生じる。本実施の形態２では、そのような場合、標準処理時間で復号が可能だけの符号データを抽出することにより、標準処理時間の範囲でより高い画質の画像が再生されるという効果が得られる。 If only the number of standard decoding layers is always decoded, the decoding time will be significantly shorter than the standard processing time, and only low-quality images can be played back compared to the decoded image with code data corresponding to the standard processing time. Occurs. In the second embodiment, in such a case, by extracting code data that can be decoded in the standard processing time, an effect of reproducing an image with higher image quality within the standard processing time can be obtained.

一方、必ず標準処理時間を要する符号量だけを復号するのであれば、場合によっては標準復号レイヤから得られる画質を達成できない符号データまでしか復号しないことがある。本実施の形態ではそのような場合、標準レイヤ数の符号データだけを復号することにより、より高い画質の画像が再生されるという効果が得られる。 On the other hand, if only the code amount that requires the standard processing time is to be decoded, only code data that cannot achieve the image quality obtained from the standard decoding layer may be decoded. In this embodiment, in such a case, an effect that an image with higher image quality is reproduced is obtained by decoding only the code data of the standard number of layers.

以上のように、本実施の形態２では、再生画像の画質と復号に要する処理時間の両方を考慮することにより、閲覧に適切な符号量の符号データを抽出して復号することが可能となる。 As described above, in the second embodiment, it is possible to extract and decode code data having a code amount appropriate for viewing by considering both the image quality of the reproduced image and the processing time required for decoding. .

実施の形態３．
前記実施の形態１および２では、コードストリームが復号装置内部にあることを前提としていたが、この実施の形態３では、コードストリームが復号装置とネットワークでつながれたコードストリーム蓄積装置内部に格納されている場合について説明する。なお、符号化はリアルタイムで行うのではなく、予め実行しておき、コードストリームを蓄積しておくことを想定している。コードストリーム生成の処理は、図１に示した実施の形態１と同様なので説明を省略する。 Embodiment 3 FIG.
In the first and second embodiments, it is assumed that the code stream is in the decoding apparatus. In the third embodiment, the code stream is stored in the code stream storage apparatus connected to the decoding apparatus and the network. The case will be described. It is assumed that encoding is not performed in real time, but is executed in advance and a code stream is accumulated. The code stream generation process is the same as that of the first embodiment shown in FIG.

図１１は、この発明の実施の形態３に係る画像復号装置の構成を示すブロック図であり、このブロック図を用いてコードストリームの抽出から復号まで動作を説明する。図１１に示されるように、実施の形態２に係る画像復号装置は、コードストリームを蓄積するコードストリーム蓄積手段１１０１と、コードストリーム蓄積手段１１０１から必要な符号データを部分的に抽出するコードストリーム抽出手段１１１０と、指定されたレイヤの符号量情報と、抽出されたコードストリームを伝送するコードストリーム・符号量情報伝送手段１１１１と、指定されたレイヤの符号量を検出する符号量検出手段１１１２と、指定されたレイヤの符号量情報および抽出されたコードストリームを受信するコードストリーム・符号量情報受信手段１１２０と、コードストリームを復号して再生画像を生成するコードストリーム復号手段１１３０と、復号すべきパケットを決定する抽出パケット決定手段１１２５と、復号速度を記憶しておく復号速度記憶手段１１２６と、エントロピー復号、逆量子化、ウェーブレット逆変換に要する処理時間と処理した符号量から復号速度を算出する復号速度計測手段１１２７と、伝送速度を記憶しておく伝送速度記憶手段１１２８と、コードストリームの伝送速度を計測する伝送速度計測手段１１２９とを備えている。 FIG. 11 is a block diagram showing the configuration of the image decoding apparatus according to Embodiment 3 of the present invention, and the operation from code stream extraction to decoding will be described using this block diagram. As shown in FIG. 11, the image decoding apparatus according to Embodiment 2 includes a code stream storage unit 1101 that stores a code stream, and a code stream extraction that partially extracts necessary code data from the code stream storage unit 1101. Means 1110, code amount information of the designated layer, code stream / code amount information transmission means 1111 for transmitting the extracted code stream, code amount detection means 1112 for detecting the code amount of the designated layer, Code stream / code amount information receiving means 1120 for receiving the code amount information of the specified layer and the extracted code stream, code stream decoding means 1130 for decoding the code stream and generating a reproduced image, and a packet to be decoded Extraction packet determining means 1125 for determining the decoding speed, and decoding speed Decoding speed storage means 1126 for storing the decoding speed, decoding speed measuring means 1127 for calculating the decoding speed from the processing time required for entropy decoding, inverse quantization, and inverse wavelet transform and the amount of code processed, and the transmission speed Transmission rate storage means 1128 to be placed, and transmission rate measurement means 1129 for measuring the transmission rate of the code stream.

コードストリーム復号手段１１３０は、コードストリームを２値算術復号してウェーブレット変換係数量子化値を再生するエントロピー復号手段１１２１と、ウェーブレット変換係数量子化値を予め設定された量子化ステップサイズで逆量子化処理する逆量子化手段１１２２と、ウェーブレット変換係数を逆ウェーブレット変換し、再生画像データを生成するウェーブレット逆変換手段１１２３とからなる。 The code stream decoding unit 1130 includes an entropy decoding unit 1121 that performs binary arithmetic decoding of the code stream to reproduce the wavelet transform coefficient quantized value, and dequantizes the wavelet transform coefficient quantized value with a preset quantization step size. An inverse quantization unit 1122 for processing and a wavelet inverse transform unit 1123 for generating a reproduction image data by performing an inverse wavelet transform on the wavelet transform coefficient.

以下、コードストリーム抽出方法を中心に図１２のフローチャートを用いて動作を説明する。コードストリームの抽出方法には、画質優先モードと処理時間優先モードの２種類の復号モードがあり、ユーザが事前に指定しておく。本実施の形態３では、時間優先モードを説明する。 Hereinafter, the operation will be described with reference to the flowchart of FIG. 12 focusing on the code stream extraction method. There are two types of codestream extraction methods, an image quality priority mode and a processing time priority mode, which are designated in advance by the user. In the third embodiment, the time priority mode will be described.

最初に、コードストリーム抽出手段１１１０により標準処理時間Ｔを設定する（図１２のステップ１２０２）。例えば、３秒など、ユーザが画像を指定してから表示までに要する標準的な時間である。次に、画質を設定するパラメータである標準復号レイヤ数Ｎを設定する（図１２のステップ１２０３）。Ｎ＝２とすれば、２レイヤが復号される標準的なレイヤ数となり、表１のコードストリームでは、Ｄｍａｘ＝６５（ＰＳＮＲ＝３０）により決定されたレイヤまでが復号されることになる。 First, the standard processing time T is set by the code stream extraction means 1110 (step 1202 in FIG. 12). For example, it is a standard time required from the time the user designates an image to the display, such as 3 seconds. Next, the standard decoding layer number N, which is a parameter for setting the image quality, is set (step 1203 in FIG. 12). If N = 2, the standard number of layers to be decoded is two layers. In the code stream of Table 1, the layers determined by Dmax = 65 (PSNR = 30) are decoded.

抽出パケット決定手段１１２５により、復号速度記憶手段１１２６から予測復号速度ｐ１［bytes／sec］を読み出す（図１２のステップ１２０４）。同様に、伝送速度記憶手段１１２８から予測伝送速度ｐ２［bytes／sec］を読み出す（図１２のステップ１２０５）。これらの値は、画像が復号されるたびに更新されていくが、最初の画像に対しては、初期値として適当な値を予め設定しておく。 The extracted packet determination unit 1125 reads the predicted decoding rate p1 [bytes / sec] from the decoding rate storage unit 1126 (step 1204 in FIG. 12). Similarly, the predicted transmission rate p2 [bytes / sec] is read from the transmission rate storage means 1128 (step 1205 in FIG. 12). These values are updated every time the image is decoded, but appropriate values are set in advance as initial values for the first image.

コードストリーム・符号量情報受信手段１１２０は、ネットワークにより接続されたコードストリーム・符号量情報送信手段１１１１に標準レイヤ数の符号量情報を要求する。さらに、コードストリーム・符号量情報送信手段１１１１は、符号量抽出手段１１１２に符号量を取得するよう要求を出して、符号量抽出手段１１１２は、コードストリーム蓄積手段１１０１に格納してあるコードストリームから標準レイヤ数の符号量を読みとる（図１２のステップ１２０６）。各パケットの符号量は、コードストリームのパケットヘッダに書き込まれている各パケットヘッダの符号量を読みとることにより得られる。 The code stream / code amount information receiving unit 1120 requests code amount information of the number of standard layers from the code stream / code amount information transmitting unit 1111 connected by the network. Further, the code stream / code amount information transmitting unit 1111 issues a request to the code amount extracting unit 1112 to acquire the code amount, and the code amount extracting unit 1112 uses the code stream stored in the code stream storage unit 1101. The code amount of the standard layer number is read (step 1206 in FIG. 12). The code amount of each packet is obtained by reading the code amount of each packet header written in the packet header of the code stream.

符号量抽出手段１１１２は、符号量情報をコードストリーム・符号量情報送信手段１１１１に出力し、さらに、コードストリーム・符号量情報送信手段１１１１からコードストリーム・符号量情報受信手段１１２０にネットワーク経由で符号量情報が伝送される（図１２のステップ１２０７）。受信された符号量情報は、抽出パケット決定手段１１２５に送られる。 The code amount extraction unit 1112 outputs the code amount information to the code stream / code amount information transmission unit 1111, and further encodes the code stream / code amount information transmission unit 1111 to the code stream / code amount information reception unit 1120 via the network. Quantity information is transmitted (step 1207 in FIG. 12). The received code amount information is sent to the extracted packet determining means 1125.

抽出パケット決定手段１１２５では、標準レイヤ数分の符号量ｂ、予測復号速度ｐ１，予測伝送速度ｐ２を用いて、処理に要する時間の予測値、予測処理時間ｔ＝ｂ／ｐ１＋ｂ／ｐ２［sec］を算出する（図１２のステップ１２０８）。そして、予測処理時間ｔと標準処理時間Ｔを比較する（図１２のステップ１２０９）。 The extracted packet determination means 1125 uses the code amount b for the number of standard layers, the predicted decoding speed p1, and the predicted transmission speed p2, and the predicted value of the time required for processing, the predicted processing time t = b / p1 + b / p2 [sec]. Is calculated (step 1208 in FIG. 12). Then, the predicted processing time t is compared with the standard processing time T (step 1209 in FIG. 12).

もし、予測処理時間ｔが標準処理時間Ｔに比べて短ければ、Ｎレイヤ分をコードストリームから抽出し、受信する（図１２のステップ１２１１）。受信に際しては、まず、コードストリーム・符号量情報受信手段１１２０は、コードストリーム・符号量情報送信手段１１１１にＮレイヤ分のパケットを要求する。コードストリーム・符号量情報送信手段１１１１は、コードストリーム抽出手段１１１０に指定のパケットを抽出するよう要求を出して、コードストリームを受け取る。コードストリーム・符号量情報送信手段１１１１は、ネットワーク経由で抽出されたコードストリームを送信し、コードストリーム・符号量情報受信手段１１２０が受信する。 If the predicted processing time t is shorter than the standard processing time T, N layers are extracted from the code stream and received (step 1211 in FIG. 12). When receiving, first, the code stream / code amount information receiving unit 1120 requests the code stream / code amount information transmitting unit 1111 for packets of N layers. The code stream / code amount information transmission unit 1111 issues a request to the code stream extraction unit 1110 to extract a designated packet, and receives the code stream. The code stream / code amount information transmitting unit 1111 transmits the code stream extracted via the network, and the code stream / code amount information receiving unit 1120 receives the code stream.

逆に、予測処理時間ｔが標準処理時間Ｔに比べて長ければ、標準処理時間で復号できる符号量Ｔ・ｐ１・ｐ２（ｐ１＋ｐ２）［byte］に相当するパケットを抽出し、受信する（図１２のステップ１２１０）。なお、本実施の形態では、符号を抽出する際、抽出する符号データ境界をパケット境界とする。ある符号量の符号データを抽出する場合には、実際にはその符号量を超える最も近いパケット境界までを抽出するものとする。 On the contrary, if the predicted processing time t is longer than the standard processing time T, a packet corresponding to the code amount T · p1 · p2 (p1 + p2) [byte] that can be decoded in the standard processing time is extracted and received (FIG. 12). Step 1210). In the present embodiment, when a code is extracted, a code data boundary to be extracted is set as a packet boundary. In the case of extracting code data of a certain code amount, it is actually assumed that the nearest packet boundary exceeding the code amount is extracted.

受信されたコードストリームは、エントロピー復号手段１１２１，逆量子化手段１１２２，ウェーブレット逆変換手段１１２３の順で処理され再生画像データが出力される（図１２のステップ１２１２）。 The received code stream is processed in the order of entropy decoding means 1121, inverse quantization means 1122, and wavelet inverse transform means 1123, and reproduced image data is output (step 1212 in FIG. 12).

復号速度計測手段１１２６では、エントロピー復号手段がその処理の開始時点に出力する処理開始信号とウェーブレット逆変換手段がその処理の終了時点に出力する処理開始信号を受け取り、それらの受信時間の差から復号に要する時間Δｔ１を算出する（図１２のステップ１２１３）。そして、抽出したコードストリームの符号量と復号時間Δｔ１から、当該コードストリームに対する復号速度ｐ１［byte／sec］を算出し、復号速度記憶手段１１２６に書き込む（図１２のステップ１２１４）。 The decoding speed measuring means 1126 receives a processing start signal output from the entropy decoding means at the start time of the process and a processing start signal output from the wavelet inverse transform means at the end time of the process, and decodes from the difference between the reception times. Is calculated (step 1213 in FIG. 12). Then, the decoding speed p1 [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the decoding time Δt1, and written to the decoding speed storage means 1126 (step 1214 in FIG. 12).

伝送速度計測手段１１２９では、コードストリーム・符号量情報受信手段１１２０がコードストリームの受信開始時点に出力する受信開始信号と受信終了時点に出力する受信終了信号を受け取り、それらの時間差から受信に要する時間Δｔ２を算出する（図１２のステップ１２１５）。そして、抽出したコードストリームの符号量と伝送時間Δｔ２から、当該コードストリームに対する伝送速度ｐ２［byte／sec］を算出し、復号速度記憶手段に書き込む（図１２のステップ１２１６）。次の画像の復号時には、現在のコードストリームに対する復号速度、伝送速度が予測値として適用されることになる。 In the transmission rate measuring means 1129, the code stream / code amount information receiving means 1120 receives the reception start signal output at the reception start time of the code stream and the reception end signal output at the reception end time, and the time required for reception from the time difference between them. Δt2 is calculated (step 1215 in FIG. 12). Then, the transmission rate p2 [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the transmission time Δt2, and written to the decoding rate storage means (step 1216 in FIG. 12). At the time of decoding the next image, the decoding speed and transmission speed for the current code stream are applied as predicted values.

以上が一つのコードストリームに対する処理手順であり、さらに次に復号する画像が指定されれば、次の画像に対して予測復号速度読み込み（図１２のステップ１２０４）から処理を実行する。 The above is the processing procedure for one code stream. If an image to be decoded next is specified, the process is executed for the next image after reading the predicted decoding speed (step 1204 in FIG. 12).

なお、本実施の形態３では、符号化歪が所定の値となるようなレイヤ分割を説明したが、必ずしも符号化歪でなくてもよく、例えば主観的評価尺度（MOS）が所定の値となる符号化パスまでをレイヤとして分割しても構わない。また、抽出する画質の尺度としてレイヤ数を指定する方法を示したが、必ずしもレイヤ数でなくともよく、例えば平均２乗誤差、ＰＳＮＲなどでもよい。 In the third embodiment, the layer division has been described in which the coding distortion has a predetermined value. However, the layering is not necessarily the coding distortion. For example, the subjective evaluation scale (MOS) is a predetermined value. It is possible to divide up to an encoding pass as a layer. In addition, although the method of specifying the number of layers as a measure of the image quality to be extracted has been shown, the number of layers is not necessarily limited, and for example, a mean square error, PSNR, or the like may be used.

また、実施の形態３では、符号量情報は、復号する画像を指定するたびに伝送する方法を示したが、予め復号側に伝送して記憶しておいても構わない。 In the third embodiment, the code amount information is transmitted every time an image to be decoded is designated. However, the code amount information may be transmitted and stored in advance on the decoding side.

もし、単に標準復号レイヤ数だけ符号データを復号するのであれば、場合によっては復号時間が大幅に長くなってしまうことが生じる。本実施の形態３ではそのような場合、標準処理時間で復号が可能な範囲に符号量を制限することにより、復号時間が短縮されるという効果が得られる。 If code data is simply decoded by the number of standard decoding layers, the decoding time may be significantly increased in some cases. In the third embodiment, in such a case, an effect that the decoding time is shortened can be obtained by limiting the code amount to a range that can be decoded in the standard processing time.

一方、単に標準処理時間を要する符号量だけを復号するのであれば、場合によっては必要以上の画質が得られる符号データまでを復号してしまうことがある。本実施の形態３ではそのような場合、標準復号レイヤ数の符号データだけを復号することにより、復号時間が短縮されるという効果が得られる。 On the other hand, if only the amount of code that requires the standard processing time is decoded, in some cases, even code data that can obtain an image quality higher than necessary may be decoded. In the third embodiment, in such a case, the decoding time is shortened by decoding only the code data of the standard decoding layer number.

以上のように、本実施の形態３では、再生画像の画質と復号に要する処理時間の両方を考慮することにより、閲覧に適切な符号量の符号データを抽出して復号することが可能となる。 As described above, in the third embodiment, it is possible to extract and decode code data having a code amount appropriate for browsing by considering both the image quality of a reproduced image and the processing time required for decoding. .

実施の形態４．
前記実施の形態３では、復号処理時間が標準処理時間以内となることを最優先に符号データを抽出するモードである、時間優先モードを説明したが、この実施の形態４では、符号歪が所望の値以下となること（所望の画質を達成すること）を最優先に符号データを抽出する画質優先モードの例を説明する。コードストリームの生成方法、コードストリームの構成、復号装置のブロック図は実施の形態３と同じなので説明を省略し、復号処理におけるコードストリーム抽出の動作を図１３のフローチャートを使って説明する。 Embodiment 4 FIG.
In the third embodiment, the time priority mode, which is a mode in which code data is extracted with the highest priority when the decoding processing time is within the standard processing time, has been described. In the fourth embodiment, code distortion is desired. An example of an image quality priority mode in which code data is extracted with the highest priority being set to be equal to or less than the value (achieving a desired image quality) will be described. Since the code stream generation method, the code stream configuration, and the block diagram of the decoding apparatus are the same as those in the third embodiment, description thereof will be omitted and the code stream extraction operation in the decoding process will be described with reference to the flowchart of FIG.

最初に、標準処理時間Ｔを設定する（図１３のステップ１３０２）。例えば、３秒など、ユーザが画像を指定してから表示までに要する標準的な時間である。次に、画質を設定するパラメータである標準復号レイヤ数Ｎを設定する（図１３のステップ１３０３）。Ｎ＝２とすれば、２レイヤが復号される標準的なレイヤ数となり、表１のコードストリームでは、Ｄｍａｘ＝６５により決定されたレイヤまでが復号されることになる。 First, the standard processing time T is set (step 1302 in FIG. 13). For example, it is a standard time required from the time the user designates an image to the display, such as 3 seconds. Next, the standard decoding layer number N, which is a parameter for setting the image quality, is set (step 1303 in FIG. 13). If N = 2, the standard number of layers to be decoded is two layers. In the code stream of Table 1, up to the layer determined by Dmax = 65 is decoded.

復号速度記憶手段１２２６から予測復号速度ｐ１［bytes／sec］を読み出す（図１３のステップ１３０４）。同様に、伝送速度記憶手段１１２８から予測伝送速度ｐ２［bytes／sec］を読み出す（図１３のステップ１３０５）。これらの値は、画像が復号されるたびに更新されていくが、最初の画像に対しては、初期値として適当な値を予め設定しておく。 The predicted decoding speed p1 [bytes / sec] is read from the decoding speed storage means 1226 (step 1304 in FIG. 13). Similarly, the predicted transmission rate p2 [bytes / sec] is read from the transmission rate storage means 1128 (step 1305 in FIG. 13). These values are updated every time the image is decoded, but appropriate values are set in advance as initial values for the first image.

コードストリーム・符号量情報受信手段１１２０は、ネットワークにより接続されたコードストリーム・符号量情報送信手段１１１１に標準レイヤ数の符号量情報を要求する。さらに、コードストリーム・符号量情報送信手段１１１１は、符号量抽出手段１１１２に符号量を取得するよう要求を出して、符号量抽出手段１１１２はコードストリーム蓄積手段１１０１に格納してあるコードストリームから標準レイヤ数の符号量を読みとる（図１３のステップ１３０６）。各パケットの符号量は、コードストリームのパケットヘッダに書き込まれている各パケットヘッダの符号量を読みとることにより得られる。 The code stream / code amount information receiving unit 1120 requests code amount information of the number of standard layers from the code stream / code amount information transmitting unit 1111 connected by the network. Further, the code stream / code amount information transmitting unit 1111 issues a request to the code amount extracting unit 1112 to acquire the code amount, and the code amount extracting unit 1112 uses the code stream stored in the code stream accumulating unit 1101 as a standard. The code amount of the number of layers is read (step 1306 in FIG. 13). The code amount of each packet is obtained by reading the code amount of each packet header written in the packet header of the code stream.

符号量抽出手段１１１２は、符号量情報をコードストリーム・符号量情報送信手段１１１１に出力し、さらに、コードストリーム・符号量情報送信手段１１１１からコードストリーム・符号量情報受信手段１１２０にネットワーク経由で符号量情報が伝送される（図１３のステップ１３０７）。受信された符号量情報は、抽出パケット決定手段１１２５に送られる。 The code amount extraction unit 1112 outputs the code amount information to the code stream / code amount information transmission unit 1111, and further encodes the code stream / code amount information transmission unit 1111 to the code stream / code amount information reception unit 1120 via the network. Quantity information is transmitted (step 1307 in FIG. 13). The received code amount information is sent to the extracted packet determining means 1125.

抽出パケット決定手段１１２５では、標準レイヤ数分の符号量ｂ、予測復号速度ｐ１，予測伝送速度ｐ２を用いて、処理に要する時間の予測値、予測処理時間ｔ＝ｂ／ｐ１＋ｂ／ｐ２［sec］を算出する（図１３のステップ１３０８）。そして、予測処理時間ｔと標準処理時間Ｔを比較する（図１３のステップ１３０９）。 The extracted packet determination unit 1125 uses the code amount b, the predicted decoding speed p1, and the predicted transmission speed p2 for the number of standard layers, and the predicted value of the time required for processing, the predicted processing time t = b / p1 + b / p2 [sec]. Is calculated (step 1308 in FIG. 13). Then, the predicted processing time t and the standard processing time T are compared (step 1309 in FIG. 13).

もし、予測処理時間ｔが標準処理時間Ｔに比べて長ければ、Ｎレイヤ分をコードストリームから抽出し、受信する（図１３のステップ１３１１）。受信に際しては、まず、コードストリーム・符号量情報受信手段１１２０は、コードストリーム・符号量情報送信手段１１１１にＮレイヤ分のパケットを要求する。コードストリーム・符号量情報送信手段１１１１は、コードストリーム抽出手段１１１０に指定のパケットを抽出するよう要求を出して、コードストリームを受け取る。コードストリーム・符号量情報送信手段１１１１は、ネットワーク経由で抽出されたコードストリームを送信し、コードストリーム・符号量情報受信手段１１２０が受信する。 If the predicted processing time t is longer than the standard processing time T, N layers are extracted from the code stream and received (step 1311 in FIG. 13). When receiving, first, the code stream / code amount information receiving unit 1120 requests the code stream / code amount information transmitting unit 1111 for packets of N layers. The code stream / code amount information transmission unit 1111 issues a request to the code stream extraction unit 1110 to extract a designated packet, and receives the code stream. The code stream / code amount information transmitting unit 1111 transmits the code stream extracted via the network, and the code stream / code amount information receiving unit 1120 receives the code stream.

逆に、予測処理時間ｔが標準処理時間Ｔに比べて短ければ、標準処理時間で復号できる符号量Ｔ・ｐ１・ｐ２（ｐ１＋ｐ２）［byte］に相当するパケットを抽出し、受信する（図１３のステップ１３１０）。なお、本実施の形態では、符号を抽出する際、抽出する符号データ境界をパケット境界とする。ある符号量の符号データを抽出する場合には、実際にはその符号量を超える最も近いパケット境界までを抽出するものとする。 Conversely, if the predicted processing time t is shorter than the standard processing time T, a packet corresponding to the code amount T · p1 · p2 (p1 + p2) [byte] that can be decoded in the standard processing time is extracted and received (FIG. 13). Step 1310). In the present embodiment, when a code is extracted, a code data boundary to be extracted is set as a packet boundary. In the case of extracting code data of a certain code amount, it is actually assumed that the nearest packet boundary exceeding the code amount is extracted.

受信されたコードストリームは、エントロピー復号手段１１２１，逆量子化手段１１２２，ウェーブレット逆変換手段１１２３の順で処理され再生画像データが出力される（図１３のステップ１３１２）。 The received code stream is processed in the order of entropy decoding means 1121, inverse quantization means 1122, and wavelet inverse transform means 1123, and reproduced image data is output (step 1312 in FIG. 13).

復号速度計測手段１１２６では、エントロピー復号手段がその処理の開始時点に出力する処理開始信号とウェーブレット逆変換手段がその処理の終了時点に出力する処理開始信号を受け取り、それらの受信時間の差から復号に要する時間Δｔ１を算出する（図１３のステップ１３１３）。そして、抽出したコードストリームの符号量と復号時間Δｔ１から、当該コードストリームに対する復号速度ｐ１［byte／sec］を算出し、復号速度記憶手段１１２６に書き込む（図１３のステップ１３１４）。 The decoding speed measuring means 1126 receives a processing start signal output from the entropy decoding means at the start time of the process and a processing start signal output from the wavelet inverse transform means at the end time of the process, and decodes from the difference between the reception times. Is calculated (step 1313 in FIG. 13). Then, the decoding speed p1 [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the decoding time Δt1, and written to the decoding speed storage means 1126 (step 1314 in FIG. 13).

伝送速度計測手段１１２９では、コードストリーム・符号量情報受信手段１１２０がコードストリームの受信開始時点に出力する受信開始信号と受信終了時点に出力する受信終了信号を受け取り、それらの時間差から受信に要する時間Δｔ２を算出する（図１３のステップ１３１５）。そして、抽出したコードストリームの符号量と伝送時間Δｔ２から、当該コードストリームに対する伝送速度ｐ２［byte／sec］を算出し、復号速度記憶手段に書き込む（図１３のステップ１３１６）。次の画像の復号時には、現在のコードストリームに対する復号速度、伝送速度が予測値として適用されることになる。 In the transmission rate measuring means 1129, the code stream / code amount information receiving means 1120 receives the reception start signal output at the reception start time of the code stream and the reception end signal output at the reception end time, and the time required for reception from the time difference between them. Δt2 is calculated (step 1315 in FIG. 13). Then, the transmission rate p2 [byte / sec] for the code stream is calculated from the code amount of the extracted code stream and the transmission time Δt2, and written in the decoding rate storage means (step 1316 in FIG. 13). At the time of decoding the next image, the decoding speed and transmission speed for the current code stream are applied as predicted values.

以上が一つのコードストリームに対する処理手順であり、さらに次に復号する画像が指定されれば、次の画像に対して予測復号速度読み込み（図１３のステップ１３０４）から処理を実行する。 The above is the processing procedure for one code stream, and if the next image to be decoded is designated, the processing is executed from the predicted decoding speed reading (step 1304 in FIG. 13) for the next image.

なお、本実施の形態４では、符号化歪が所定の値となるようなレイヤ分割を説明したが、必ずしも符号化歪でなくてもよく、例えば主観的評価尺度（MOS）が所定の値となる符号化パスまでをレイヤとして分割しても構わない。また、抽出する画質の尺度としてレイヤ数を指定する方法を示したが、必ずしもレイヤ数でなくともよく、例えば平均２乗誤差、ＰＳＮＲなどでもよい。 In the fourth embodiment, the layer division has been described in which the coding distortion has a predetermined value. However, the layering is not necessarily the coding distortion. For example, the subjective evaluation scale (MOS) is a predetermined value. It is possible to divide up to an encoding pass as a layer. In addition, although the method of specifying the number of layers as a measure of the image quality to be extracted has been shown, the number of layers is not necessarily limited, and for example, a mean square error, PSNR, or the like may be used.

また、実施の形態４では、符号量情報は、復号する画像を指定するたびに伝送する方法を示したが、予め復号側に伝送して記憶しておいても構わない。 In the fourth embodiment, the code amount information is transmitted every time an image to be decoded is designated. However, the code amount information may be transmitted and stored in advance on the decoding side.

もし、標準復号レイヤ数だけ復号するのであれば、復号時間が標準処理時間に比べ大幅に短くなり、標準処理時間に相当する符号データによる復号画像に比べて低い画質の画像しか再生できないことが生じる。本実施の形態４ではそのような場合、標準処理時間で復号が可能だけの符号データを抽出することにより、標準処理時間の範囲でより高い画質の画像が再生されるという効果が得られる。 If decoding is performed for the number of standard decoding layers, the decoding time is significantly shorter than the standard processing time, and only low-quality images can be reproduced compared to a decoded image using code data corresponding to the standard processing time. . In the fourth embodiment, in such a case, by extracting code data that can be decoded in the standard processing time, an effect of reproducing an image with higher image quality within the standard processing time can be obtained.

一方、必ず標準処理時間を要する符号量だけを復号するのであれば、場合によっては標準復号レイヤから得られる画質を達成できない符号データまでしか復号しないことがある。本実施の形態４ではそのような場合、標準レイヤ数の符号データだけを復号することにより、より高い画質の画像が再生されるという効果が得られる。 On the other hand, if only the code amount that requires the standard processing time is to be decoded, only code data that cannot achieve the image quality obtained from the standard decoding layer may be decoded. In the fourth embodiment, in such a case, an effect that a higher quality image is reproduced can be obtained by decoding only the code data of the standard number of layers.

以上のように、本実施の形態４では、再生画像の画質と復号に要する処理時間の両方を考慮することにより、閲覧に適切な符号量の符号データを抽出して復号することが可能となる。 As described above, in the fourth embodiment, it is possible to extract and decode code data having a code amount suitable for browsing by considering both the image quality of a reproduced image and the processing time required for decoding. .

画像符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of an image coding apparatus. 分解レベル２までウェーブレット変換をした時のサブバンドを示す図である。It is a figure which shows a subband when wavelet transformation is carried out to decomposition level 2. ビットプレーンを説明する図である。It is a figure explaining a bit plane. ビットプレーンから符号化パスへの分解を説明する図である。It is a figure explaining decomposition | disassembly from a bit plane to an encoding pass. コードストリームの概念図である。It is a conceptual diagram of a code stream. レイヤ境界の決定方法を示す図である。It is a figure which shows the determination method of a layer boundary. λの調整方法を示すフローチャートである。It is a flowchart which shows the adjustment method of (lambda). この発明の実施の形態１及び２に係る画像復号装置の構成を示すブロック図である。It is a block diagram which shows the structure of the image decoding apparatus which concerns on Embodiment 1 and 2 of this invention. この発明の実施の形態１におけるコードストリーム抽出処理のフローチャートである。It is a flowchart of the code stream extraction process in Embodiment 1 of this invention. この発明の実施の形態２におけるコードストリーム抽出処理のフローチャートである。It is a flowchart of the code stream extraction process in Embodiment 2 of this invention. この発明の実施の形態３に係る画像復号装置の構成を示すブロック図である。It is a block diagram which shows the structure of the image decoding apparatus which concerns on Embodiment 3 of this invention. この発明の実施の形態３におけるコードストリーム抽出処理のフローチャートである。It is a flowchart of the code stream extraction process in Embodiment 3 of this invention. この発明の実施の形態４におけるコードストリーム抽出処理のフローチャートである。It is a flowchart of the code stream extraction process in Embodiment 4 of this invention.

Explanation of symbols

２０１コードストリーム抽出手段、２０２エントロピー復号手段、２０３逆量子化手段、２０４ウェーブレット逆変換手段、２０５符号量検出手段、２０６抽出パケット決定手段、２０７復号速度記憶手段、２０８復号速度計測手段、２１０コードストリーム蓄積手段、２２０コードストリーム復号手段、１１０１コードストリーム蓄積手段、１１１０コードストリーム抽出手段、１１１１コードストリーム・符号量情報伝送手段、１１１２符号量検出手段、１１２０コードストリーム・符号量情報受信手段、１１２１エントロピー復号手段、１１２２逆量子化手段、１１２３ウェーブレット逆変換手段、１１２５抽出パケット決定手段、１１２６復号速度記憶手段、１１２７復号速度計測手段、１１２８伝送速度記憶手段、１１２９伝送速度計測手段、１１３０コードストリーム復号手段。 201 code stream extraction means, 202 entropy decoding means, 203 inverse quantization means, 204 wavelet inverse transform means, 205 code amount detection means, 206 extracted packet determination means, 207 decoding speed storage means, 208 decoding speed measurement means, 210 code stream Storage means, 220 Code stream decoding means, 1101 Code stream storage means, 1110 Code stream extraction means, 1111 Code stream / code amount information transmission means, 1112 Code amount detection means, 1120 Code stream / code amount information reception means, 1121 Entropy decoding Means, 1122 inverse quantization means, 1123 wavelet inverse transform means, 1125 extraction packet determination means, 1126 decoding speed storage means, 1127 decoding speed measurement means, 1128 transmission Degree storage unit, 1129 transmission rate measuring means 1130 code stream decoding unit.

Claims

An image decoding device that decodes a codestream configured by being divided into a plurality of layers,
Code stream storage means for storing the code stream;
Code stream extraction means for extracting predetermined code data from the code stream stored in the code stream storage means;
Code stream decoding means for decoding the code stream extracted by the code stream extraction means;
Code amount detection means for detecting a code amount that gives a specified layer or image quality;
Decoding speed measuring means for measuring the time required for decoding and the decoding speed;
Decoding speed storage means for storing the decoding speed measured by the decoding speed measuring means;
An image decoding apparatus comprising: an extracted packet determining unit that determines code data to be extracted from the code stream based on a code amount detected by the code amount detecting unit and a decoding rate stored in the decoding rate storage unit.

The image decoding device according to claim 1,
The extracted packet determination means includes
Calculate the prediction processing time from the decoding speed for the code stream processed in the past and the code amount that gives the specified layer or image quality,
If the calculated prediction processing time is shorter than the specified processing time, the code data of the code amount that gives the specified layer or image quality is extracted,
An image decoding apparatus, wherein if the calculated prediction processing time is longer than the designated processing time, code data having a code amount that requires the designated processing time is extracted.

The image decoding device according to claim 1,
The extracted packet determination means includes
Calculate the prediction processing time from the decoding speed for the code stream processed in the past and the code amount that gives the set layer or image quality,
If the calculated prediction processing time is longer than the specified processing time, the code data of the code amount that gives the specified layer or image quality is extracted,
If the calculated processing time is shorter than the designated processing time, code data having a code amount that requires the designated processing time is extracted.

An image decoding device that decodes a codestream configured by being divided into a plurality of layers,
Code stream storage means for storing the code stream;
Code stream extraction means for extracting predetermined code data from the code stream stored in the code stream storage means;
A code stream / code amount information transmission means for transmitting a code amount and a code stream that gives a designated layer or a designated image quality;
Code amount detection means for detecting a code amount that gives a specified layer or image quality;
A code stream / code quantity information receiving means for receiving a code stream and a code stream that gives a designated layer or a designated image quality;
Code stream decoding means for decoding the extracted code stream;
Decoding speed measuring means for measuring the time required for decoding;
Decoding speed storage means for storing the decoding speed;
A transmission speed measuring means for measuring the transmission speed;
Transmission rate storage means for storing the transmission rate;
An image decoding apparatus comprising: extracted packet determining means for determining code data to be extracted from the code stream.

The image decoding device according to claim 4, wherein
The extracted packet determination means includes
Calculate the processing time from the processing speed for the code stream processed in the past, the transmission speed and the code amount that gives the specified layer or image quality,
If the calculated processing time is shorter than the specified processing time, extract the code data of the code amount that gives the specified layer or image quality,
If the calculated processing time is longer than the designated processing time, code data having a code amount that requires the designated processing time is extracted.

The image decoding device according to claim 4, wherein
The extracted packet determination means includes
Calculate the processing time from the processing speed for the code stream processed in the past, the transmission speed and the code amount that gives the specified layer or image quality,
If the calculated processing time is longer than the specified processing time, extract the code data of the code amount that gives the specified layer or image quality,
If the calculated processing time is shorter than the designated processing time, code data having a code amount that requires the designated processing time is extracted.