JP2004343710A

JP2004343710A - Device and method for decoding moving image

Info

Publication number: JP2004343710A
Application number: JP2004073404A
Authority: JP
Inventors: Hiroshi Kajiwara; 浩梶原
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2003-04-24
Filing date: 2004-03-15
Publication date: 2004-12-02

Abstract

<P>PROBLEM TO BE SOLVED: To obtain an excellent reproduction image quality with almost no visual disturbance by efficiently decoding all of or a part of moving image encoding data corresponding to the processing capability of a moving image decoder. <P>SOLUTION: The moving image decoder is so constituted that each frame of the moving image data is disassembled to a plurality of subbands and the moving image encoding data, which are generated by bit-plane-encoding the coefficient of the subband for each prescribed unit, are decoded. It is provided with a decoding processing time measurement part (105), which acquires the information for determining the magnitude of the difference between the times required for the decoding processing of the moving image encoding data at the prescribed unit, an undecoding bit plane decision part (107) for deciding the bit plane to be undecoded on the basis of the acquired information, a bit plane decoding part, which restores the coefficient of the subband from the encoding data other than the bit plane to be undecoded at the prescribed unit, and an inverse discrete wavelet converting part (104) which composites the restored coefficient of the subband and generates the frame data. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、各フレームが独立に符号化された動画像データから、その全部または一部を復号して再生画像を得る動画像復号装置、及び方法、及びこの方法を実現するプログラム、及びこのプログラムを記憶する記憶媒体に関するものである。 The present invention relates to a moving image decoding apparatus and method for obtaining a reproduced image by decoding all or a part of moving image data in which each frame is independently encoded, a program realizing the method, and a program And a storage medium for storing.

一般に、動画像データの符号化方式は、フレーム間の相関を利用するものとしないものとに大別することができる。それぞれの方式には長所及び短所が存在し、どちらの方式が適しているかということは使用するアプリケーションに依存する。例えば、ＭｏｔｉｏｎＪＰＥＧは、動画像データの各フレームを一枚の静止画像としてとらえて独立に符号化する方式であり、フレーム間の相関を用いない符号化方式の一例である。フレーム毎に独立に符号化することによって、動画像の分割、連結、部分的な書き換えなどの動画編集が容易であることや、復号側の処理能力に応じて復号フレーム数を選択して復号することが可能であるという利点がある。 In general, moving image data encoding methods can be broadly classified into those that use correlation between frames and those that do not. Each method has advantages and disadvantages, and which method is suitable depends on the application used. For example, Motion JPEG is a method in which each frame of moving image data is treated as one still image and independently encoded, and is an example of an encoding method that does not use correlation between frames. Independent encoding for each frame facilitates moving image editing such as division, concatenation, and partial rewriting of moving images, and selects and decodes the number of decoding frames according to the processing capacity of the decoding side. There is an advantage that it is possible.

近年、動画像データをフレーム毎に独立に符号化する符号化方式において、各フレームをウェーブレット変換とビットプレーン符号化とを組み合わせて符号化する方式が注目を集めている。このような動画像符号化方式には、ウェーブレット変換におけるサブバンド分解の仕組みを利用して空間解像度を段階的に変えた復号が可能であること、また、復号ビットプレーン数を変えることにより、復号画素精度を段階的に変更することが可能である等の大きな特徴がある。 2. Description of the Related Art In recent years, in an encoding method for independently encoding moving image data for each frame, a method of encoding each frame by combining wavelet transform and bit plane encoding has attracted attention. Such a moving picture coding method can perform decoding by changing the spatial resolution step by step using the mechanism of subband decomposition in the wavelet transform, and can also perform decoding by changing the number of decoding bit planes. There is a great feature that the pixel accuracy can be changed step by step.

ＩＳＯ／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１で標準化作業が進められている画像符号化方式であるＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４）もウェーブレット変換とビットプレーン符号化との組み合わせにより構成されている。同標準のＰａｒｔ３では、ＭｏｔｉｏｎＪＰＥＧ２０００の名称で、動画像の各フレームの符号化にＪＰＥＧ２０００を適用した場合のファイルフォーマットを規定している。 JPEG2000 (ISO / IEC 15444), which is an image coding system whose standardization is being advanced in ISO / IEC JTC1 / SC29 / WG1, is also configured by a combination of wavelet transform and bit plane coding. Part 3 of the standard defines a file format when JPEG2000 is applied to encoding of each frame of a moving image under the name of Motion JPEG2000.

ＭｏｔｉｏｎＪＰＥＧ２０００に代表されるこのような動画像符号化方式は、前述のように復号解像度、復号画素精度の柔軟性といった利点がある一方で、ビットプレーン符号化による符号化・復号処理の負荷が高いという欠点がある。特に、専用の動画像記録装置で記録した映像をパーソナルコンピュータで再生する場合に、コンピュータの性能によっては、全データを実時間で復号・表示することはできないという問題が起こる。 Such a moving picture coding method represented by Motion JPEG2000 has advantages such as flexibility of decoding resolution and decoding pixel precision as described above, but has a high load of coding / decoding processing by bit plane coding. There is a disadvantage that. In particular, when a video recorded by a dedicated moving picture recording device is reproduced by a personal computer, there is a problem that all data cannot be decoded and displayed in real time depending on the performance of the computer.

このような問題に対して、フレームを復号するのに所望の復号処理時間を定めて符号化処理単位に復号処理時間を割り振り、割り振られた復号処理時間内でビットプレーン単位に復号する方法が開示されている（例えば、特許文献１参照）。 With respect to such a problem, there is disclosed a method of determining a desired decoding processing time for decoding a frame, allocating a decoding processing time to an encoding processing unit, and decoding in bit plane units within the allocated decoding processing time. (For example, see Patent Document 1).

特開平１１−２８８３０７号公報JP-A-11-288307

しかしながら、特許文献１に開示されるような所定の時間で復号処理を打ち切る動画像復号装置では、フレーム毎に復号処理打ち切りのポイント（復号ビットプレーン数）が変化しやすいため、動画像として再生した場合に歪みの形状の時間変化を生じ、これがフリッカー（ちらつき）として視覚上の妨害要因となるという問題がある。 However, in a moving picture decoding apparatus that terminates the decoding processing at a predetermined time as disclosed in Patent Document 1, the point at which the decoding processing is terminated (the number of decoding bit planes) is likely to change for each frame, and thus the moving picture is reproduced as a moving picture. In such a case, there is a problem that a temporal change in the shape of the distortion occurs, which causes visual disturbance as flicker.

本発明は上記問題点を鑑みてなされたものであり、動画像符号化データの全部または一部を、動画像復号装置の処理能力に応じて効率良く復号し、視覚的な妨害の少ない良好な再生画質を得ることができる動画像復号装置並びに方法を提供することを目的とする。 The present invention has been made in view of the above-described problems, and efficiently or efficiently decodes all or a part of encoded video data in accordance with the processing capability of a video decoding device. It is an object of the present invention to provide a moving image decoding apparatus and method capable of obtaining a reproduction image quality.

上記目的を達成するために、動画像データの各フレームを複数のサブバンドに分解し、サブバンドの係数を所定単位毎に上位のビットから下位のビットへとビットプレーン、またはサブビットプレーン単位で符号化して生成された動画像符号化データを復号する、本発明の動画像復号装置は、前記所定単位の動画像符号化データの復号処理に割り当てられた時間と実際の復号処理にかかった時間の差の大小を判定するための情報を取得する復号処理時間情報取得手段と、前記復号処理時間情報取得手段により得られる情報に基づいて、復号しないビットプレーン、またはサブビットプレーンを決定する非復号ビットプレーン決定手段と、前記非復号ビットプレーン決定手段により決定されたビットプレーン、またはサブビットプレーン以外の符号化データから、サブバンドの係数を前記所定単位で復元するビットプレーン復号手段と、前記ビットプレーン復号手段により得られた前記複数のサブバンドの係数を合成し、フレームデータを生成するサブバンド合成手段とを備えることを特徴とする。 In order to achieve the above object, each frame of the moving image data is decomposed into a plurality of sub-bands, and the sub-band coefficients are converted from a higher-order bit to a lower-order bit for each predetermined unit in a bit plane or a sub bit plane unit. The video decoding apparatus of the present invention, which decodes the encoded video data generated by encoding, comprises a time allocated to a decoding process of the predetermined unit of video encoded data and a time required for an actual decoding process. Decoding processing time information obtaining means for obtaining information for determining the magnitude of the difference between the non-decoding and non-decoding bit planes or sub-bit planes to be determined based on the information obtained by the decoding processing time information obtaining means. A bit plane determined by the non-decoded bit plane determining means, Bit plane decoding means for restoring sub band coefficients in predetermined units from encoded data, and sub band synthesis for synthesizing the plurality of sub band coefficients obtained by the bit plane decoding means to generate frame data Means.

また、動画像データの各フレームを複数のサブバンドに分解し、サブバンドの係数を所定単位毎に上位のビットから下位のビットへとビットプレーン、またはサブビットプレーン単位で符号化して生成された動画像符号化データを復号する、本発明の動画像復号方法は、前記所定単位の動画像符号化データの復号処理に割り当てられた時間と実際の復号処理にかかった時間の差の大小を判定するための情報を取得する復号処理時間情報取得工程と、前記復号処理時間情報取得工程により得られる情報に基づいて、復号しないビットプレーン、またはサブビットプレーンを決定する非復号ビットプレーン決定工程と、前記非復号ビットプレーン決定工程により決定されたビットプレーン、またはサブビットプレーン以外の符号化データから、サブバンドの係数を前記所定単位で復元するビットプレーン復号工程と、前記ビットプレーン復号工程により得られた前記複数のサブバンドの係数を合成し、フレームデータを生成するサブバンド合成工程とを備えることを特徴とする。 Further, each frame of the moving image data is decomposed into a plurality of sub-bands, and the coefficients of the sub-bands are generated by encoding the bits of the sub-band from the upper bits to the lower bits for each predetermined unit in bit plane or sub-bit plane units. The moving picture decoding method of the present invention for decoding moving picture encoded data determines a magnitude of a difference between a time allocated to decoding processing of the predetermined unit of moving picture encoded data and a time required for actual decoding processing. Decoding processing time information obtaining step of obtaining information to perform, based on the information obtained by the decoding processing time information obtaining step, a non-decoding bit plane, or a non-decoding bit plane determining step of determining a sub bit plane, From encoded data other than the bit plane or sub-bit plane determined in the non-decoding bit plane determining step, A bit plane decoding step of restoring band coefficients in the predetermined unit; and a subband combining step of combining the coefficients of the plurality of subbands obtained in the bitplane decoding step to generate frame data. Features.

本発明によれば、動画像符号化データの全部または一部を、動画像復号装置の処理能力に応じて効率良く復号し、視覚的な妨害の少ない良好な再生画質を得ることができる。 According to the present invention, it is possible to efficiently decode all or a part of encoded video data in accordance with the processing capability of the video decoding device and obtain good reproduction image quality with little visual interference.

以下、添付図面を参照して本発明の好適な実施の形態を詳細に説明する。ただし、本実施の形態において例示される構成部品の寸法、材質、形状、それらの相対配置などは、本発明が適用される装置の構成や各種条件により適宜変更されるべきものであり、本発明がそれらの例示に限定されるものではない。 Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. However, the dimensions, materials, shapes, relative arrangements, and the like of the components exemplified in the present embodiment should be appropriately changed depending on the configuration of the apparatus to which the present invention is applied and various conditions. Is not limited to those examples.

＜動画像符号化データの概略＞
まず、本実施の形態における動画像復号装置で復号する対象となる動画像符号化データについて説明する。 <Outline of encoded video data>
First, moving picture coded data to be decoded by the moving picture decoding apparatus according to the present embodiment will be described.

図１は、本実施の形態の動画像復号装置で復号する対象となる動画像符号化データを生成する動画像符号化装置２００の構成を示す図である。動画像符号化装置２００ではウェーブレット変換とビットプレーン符号化とを組み合わせた符号化方式により、動画像を構成する各フレームを独立に符号化するものである。図１に示すように、動画像データ入力部２０１、離散ウェーブレット変換部２０２、係数量子化部２０３、ビットプレーン符号化部２０４、符号列形成部２０５及び２次記憶装置２０６、信号線２０７を備えている。 FIG. 1 is a diagram illustrating a configuration of a moving picture encoding apparatus 200 that generates encoded moving picture data to be decoded by the moving picture decoding apparatus according to the present embodiment. In the moving picture coding apparatus 200, each frame constituting a moving picture is independently coded by a coding method combining wavelet transform and bit plane coding. As shown in FIG. 1, a moving image data input unit 201, a discrete wavelet transform unit 202, a coefficient quantization unit 203, a bit plane encoding unit 204, a code string forming unit 205, a secondary storage device 206, and a signal line 207 are provided. ing.

次に、図１に示す動画像符号化装置２００の各構成要素の動作を説明しながら、動画像符号化装置２００における符号化処理の流れについて説明する。尚、ここでは、１秒あたり３０フレームのレートで、１画素の輝度値が８ビットのモノクロ動画像データが４秒分（合計１２０フレーム）、動画像符号化装置２００に取り込まれ、符号化されるものとして説明する。すなわち、動画像符号化装置２００では、動画像データ入力部２０１から入力される１秒あたり３０フレームの動画像データをフレーム単位に符号化し、最終的に２次記憶装置２０６に符号化データを格納するものである。 Next, the flow of the encoding process in the video encoding device 200 will be described while describing the operation of each component of the video encoding device 200 shown in FIG. Here, at a rate of 30 frames per second, monochrome moving image data of which the luminance value of one pixel is 8 bits is taken into the moving image encoding device 200 for 4 seconds (total of 120 frames) and encoded. It will be described as what is. That is, in the moving picture coding apparatus 200, the moving picture data of 30 frames per second inputted from the moving picture data input unit 201 is coded in frame units, and the coded data is finally stored in the secondary storage device 206. Is what you do.

まず、動画像データ入力部２０１から１秒あたり３０フレームのレートで、４秒分の動画像データが入力される。動画像データ入力部２０１は、例えばディジタルカメラ等の撮像部分であって、ＣＣＤ等の撮像デバイスとガンマ補正、シェーディング補正等の各種画像調整回路とによって実現することが可能である。動画像データ入力部２０１は、入力された動画像データを１フレームずつ離散ウェーブレット変換部２０２に送る。尚、以降の説明において、便宜上各フレームデータには、入力された順に１から番号を与えて、例えばフレーム１、フレーム２、…というような番号で各フレームを識別するようにする。また、各フレームにおける水平方向の画素位置（座標）をｘ、垂直方向の画素位置をｙとし、画素位置（ｘ，ｙ）の画素値をＰ（ｘ，ｙ）で表す。 First, moving image data for 4 seconds is input from the moving image data input unit 201 at a rate of 30 frames per second. The moving image data input unit 201 is, for example, an imaging unit such as a digital camera, and can be realized by an imaging device such as a CCD and various image adjustment circuits such as gamma correction and shading correction. The moving image data input unit 201 sends the input moving image data to the discrete wavelet transform unit 202 frame by frame. In the following description, for convenience, each frame data is given a number from 1 in the order of input, and each frame data is identified by a number such as, for example, frame 1, frame 2,. In each frame, the horizontal pixel position (coordinate) is x, the vertical pixel position is y, and the pixel value at the pixel position (x, y) is represented by P (x, y).

動画像データ入力部２０１から入力した１フレームの画像データは、離散ウェーブレット変換部２０２でそれぞれ不図示の内部バッファに適宜格納され、２次元離散ウェーブレット変換が行われる。２次元離散ウェーブレット変換は、１次元の離散ウェーブレット変換を水平及び垂直方向それぞれに適用することにより実現するものである。図２は、２次元離散ウェーブレット変換によって処理される符号化対象画像のサブバンドを説明するための概念図である。 One frame of image data input from the moving image data input unit 201 is appropriately stored in an internal buffer (not shown) by the discrete wavelet transform unit 202, and two-dimensional discrete wavelet transform is performed. The two-dimensional discrete wavelet transform is realized by applying a one-dimensional discrete wavelet transform to each of the horizontal and vertical directions. FIG. 2 is a conceptual diagram for explaining subbands of an encoding target image processed by two-dimensional discrete wavelet transform.

すなわち、図２（ａ）に示されるような符号化対象画像に対して、まず垂直方向に１次元離散ウェーブレット変換を適用し、図２（ｂ）に示すように低周波サブバンドＬと高周波サブバンドＨとに分解する。次に、それぞれのサブバンドに対して水平方向の１次元離散ウェーブレット変換を適用することにより、図２（ｃ）に示すようなＬＬ、ＨＬ、ＬＨ、ＨＨの４つのサブバンドに分解する。 That is, a one-dimensional discrete wavelet transform is first applied in the vertical direction to the image to be encoded as shown in FIG. 2A, and the low-frequency sub-band L and the high-frequency sub-band L are applied as shown in FIG. Decomposes into band H. Next, by applying a one-dimensional horizontal discrete wavelet transform to each subband, the subbands are decomposed into four subbands LL, HL, LH, and HH as shown in FIG. 2C.

動画像符号化装置２００の離散ウェーブレット変換部２０２では、上述した２次元離散ウェーブレット変換により得られたサブバンドＬＬに対して、さらに繰り返し２次元離散ウェーブレット変換を適用する。これによって、符号化対象画像をＬＬ、ＬＨ１、ＨＬ１、ＨＨ１、ＬＨ２、ＨＬ２、ＨＨ２の７つのサブバンドに分解することができる。図３は、２回の２次元離散ウェーブレット変換によって得られる７つのサブバンドを示す図である。 The discrete wavelet transform unit 202 of the video encoding device 200 further repeatedly applies a two-dimensional discrete wavelet transform to the sub-band LL obtained by the above-described two-dimensional discrete wavelet transform. As a result, the encoding target image can be decomposed into seven subbands LL, LH1, HL1, HH1, LH2, HL2, and HH2. FIG. 3 is a diagram illustrating seven sub-bands obtained by two two-dimensional discrete wavelet transforms.

尚、動画像符号化装置２００では、各サブバンド内の係数をＣ（Ｓｂ，ｘ，ｙ）と表す。ここで、Ｓｂはサブバンドの種類を表し、ＬＬ、ＬＨ１、ＨＬ１、ＨＨ１、ＬＨ２、ＨＬ２、ＨＨ２のいずれかである。また、（ｘ，ｙ）は各サブバンド内の左上隅の係数位置を（０，０）としたときの水平方向及び垂直方向の係数位置（座標）を表す。 In the moving picture coding apparatus 200, the coefficient in each subband is represented as C (Sb, x, y). Here, Sb represents the type of subband, and is one of LL, LH1, HL1, HH1, LH2, HL2, and HH2. (X, y) represents coefficient positions (coordinates) in the horizontal and vertical directions when the coefficient position at the upper left corner in each subband is (0, 0).

動画像符号化装置２００は、離散ウェーブレット変換部２０２におけるＮ個の１次元信号ｓ（ｎ）（但し、ｎは０〜Ｎ−１の整数）に対する１次元離散ウェーブレット変換として２つの方法を備える。一つは式（１）、（２）に示す整数型５×３フィルタによる変換であり、もう一つは式（３）、（４）に示す実数型５×３フィルタによる変換である。 The moving image coding apparatus 200 includes two methods as a one-dimensional discrete wavelet transform for N one-dimensional signals s (n) (where n is an integer of 0 to N−1) in the discrete wavelet transform unit 202. One is conversion using an integer type 5 × 3 filter shown in equations (1) and (2), and the other is conversion using a real number type 5 × 3 filter shown in equations (3) and (4).

ｈ（ｎ）＝ｓ（２ｎ＋１）
−ｆｌｏｏｒ｛（ｓ（２ｎ）＋ｓ（２ｎ＋２））／２｝ …（１） h (n) = s (2n + 1)
−floor {(s (2n) + s (2n + 2)) / 2} (1)

ｌ（ｎ）＝ｓ（２ｎ）
＋ｆｌｏｏｒ｛（ｈ（ｎ−１）＋ｈ（ｎ）＋２）／４｝ …（２） l (n) = s (2n)
+ Floor {(h (n-1) + h (n) +2) / 4} (2)

ｈ（ｎ）＝ｓ（２ｎ＋１）−（ｓ（２ｎ）＋ｓ（２ｎ＋２））／２ …（３） h (n) = s (2n + 1)-(s (2n) + s (2n + 2)) / 2 (3)

ｌ（ｎ）＝ｓ（２ｎ）＋（ｈ（ｎ−１）＋ｈ（ｎ））／４ …（４） l (n) = s (2n) + (h (n-1) + h (n)) / 4 (4)

但し、ｈ（ｎ）は高周波サブバンドの係数、ｌ（ｎ）は低周波サブバンドの係数、ｆｌｏｏｒ｛Ｒ｝は実数Ｒを超えない最大の整数値を表す。尚、式（１）、（２）及び式（３）、（４）の計算において必要となる１次元信号ｓ（ｎ）の両端（ｎ＜０及びｎ＞Ｎ−１）におけるｓ（ｎ）は、公知の方法により１次元信号ｓ（ｎ）（ｎ＝０〜Ｎ−１）の値から求めておく。 Here, h (n) is a coefficient of the high frequency sub-band, l (n) is a coefficient of the low frequency sub-band, and floor {R} is a maximum integer value not exceeding the real number R. In addition, s (n) at both ends (n <0 and n> N-1) of the one-dimensional signal s (n) required in the calculation of the equations (1) and (2) and the equations (3) and (4). Is obtained from the value of the one-dimensional signal s (n) (n = 0 to N-1) by a known method.

整数型５×３フィルタと実数型５×３フィルタのいずれを適用するかは、信号線２０７を介して装置外部から入力されるフィルタ選択信号によって、フレーム単位に指定することができる。例えば、信号線２０７から入力されるフィルタ選択信号が「０」である場合、着目するフレームを整数型５×３フィルタによって分解し、フィルタ選択信号が「１」である場合、着目するフレームを実数型５×３フィルタによって分解するといった具合である。 Which of the integer type 5 × 3 filter and the real type 5 × 3 filter is to be applied can be designated in frame units by a filter selection signal input from outside the device via the signal line 207. For example, when the filter selection signal input from the signal line 207 is “0”, the frame of interest is decomposed by an integer 5 × 3 filter, and when the filter selection signal is “1”, the frame of interest is a real number. Decomposition is performed by a type 5 × 3 filter.

係数量子化部２０３では、離散ウェーブレット変換部２０２で生成された各サブバンドの係数Ｃ（Ｓｂ，ｘ，ｙ）を、各サブバンド毎に定めた量子化ステップｄｅｌｔａ（Ｓｂ）を用いて量子化する。ここで、量子化された係数値をＱ（Ｓｂ，ｘ，ｙ）と表す場合、係数量子化部２０３で行われる量子化処理は式（５）により表すことができる。 The coefficient quantization unit 203 quantizes the coefficient C (Sb, x, y) of each subband generated by the discrete wavelet transform unit 202 using a quantization step delta (Sb) determined for each subband. I do. Here, when the quantized coefficient value is represented as Q (Sb, x, y), the quantization process performed by the coefficient quantization unit 203 can be represented by Expression (5).

Ｑ（Ｓｂ，ｘ，ｙ）＝ｓｉｇｎ｛Ｃ（Ｓｂ，ｘ，ｙ）｝
×ｆｌｏｏｒ｛｜Ｃ（Ｓｂ，ｘ，ｙ）｜／ｄｅｌｔａ（Ｓｂ）｝…（５） Q (Sb, x, y) = sign {C (Sb, x, y)}
× floor {| C (Sb, x, y) | / delta (Sb)} (5)

ここで、ｓｉｇｎ｛Ｉ｝は整数Ｉの正負符号を表す関数であり、Ｉが正の場合は１を、負の場合は−１を返す。また、ｆｌｏｏｒ｛Ｒ｝は実数Ｒを超えない最大の整数値を表す。但し、上述の量子化処理は離散ウェーブレット変換部２０２において実数型５×３フィルタが選択され、使用された場合にのみ適用されるものであり、信号線２０７から入力されるフィルタ選択信号により整数型５×３フィルタが選択されている場合には係数Ｃ（Ｓｂ，ｘ，ｙ）を量子化された係数値として出力する。即ちこの場合、Ｑ（Ｓｂ，ｘ，ｙ）＝Ｃ（Ｓｂ，ｘ，ｙ）となる。 Here, sign {I} is a function representing the sign of the integer I, and returns 1 if I is positive and -1 if I is negative. Floor {R} represents the maximum integer value not exceeding the real number R. However, the above-described quantization processing is applied only when a real type 5 × 3 filter is selected and used in the discrete wavelet transform unit 202, and is applied to an integer type by a filter selection signal input from a signal line 207. When the 5 × 3 filter is selected, the coefficient C (Sb, x, y) is output as a quantized coefficient value. That is, in this case, Q (Sb, x, y) = C (Sb, x, y).

ビットプレーン符号化部２０４は、係数量子化部２０３において量子化された係数値Ｑ（Ｓｂ，ｘ，ｙ）を符号化する。尚、各サブバンドの係数をブロック分割し、別々に符号化することによりランダムアクセスを容易にする方法など、符号化手法として様々な手法が提案されているが、ここでは説明を簡単にするためにサブバンド単位で符号化することとする。 The bit plane encoding unit 204 encodes the coefficient value Q (Sb, x, y) quantized by the coefficient quantization unit 203. Note that various methods have been proposed as encoding methods, such as a method of facilitating random access by dividing the coefficients of each subband into blocks and separately encoding the coefficients, but in order to simplify the description, To be encoded in subband units.

各サブバンドの量子化された係数値Ｑ（Ｓｂ，ｘ，ｙ）（以降、単に「係数値」と称す。）の符号化は、サブバンド内の係数値Ｑ（Ｓｂ，ｘ，ｙ）の絶対値を自然２進数で表現し、上位の桁から下位の桁へとビットプレーン方向を優先して二値算術符号化することにより行われる。各サブバンドの係数値Ｑ（Ｓｂ，ｘ，ｙ）を自然２進表記した場合の下からｎ桁目のビットをＱｎ（ｘ，ｙ）と表記して説明する。尚、２進数の桁を表す変数ｎをビットプレーン番号と呼ぶこととし、ビットプレーン番号ｎはＬＳＢ（最下位ビット）を０桁目とする。
図４は、ビットプレーン符号化部２０４でサブバンドＳｂを符号化する処理手順を説明するためのフローチャートである。図４に示すように、まず、符号化対象となるサブバンドＳｂ内の係数の絶対値を調べ、その最大値Ｍａｂｓ（Ｓｂ）を求める（ステップＳ６０１）。次に、サブバンドＳｂ内の係数の絶対値を表すためにＭａｂｓ（Ｓｂ）を２進数で表現する場合に必要となる桁数Ｎ_ＢＰ（Ｓｂ）を式（６）を用いて求める（ステップＳ６０２）。 Coding of the quantized coefficient value Q (Sb, x, y) of each subband (hereinafter, simply referred to as “coefficient value”) is performed by encoding the coefficient value Q (Sb, x, y) in the subband. This is performed by expressing the absolute value in a natural binary number and performing binary arithmetic coding with priority on the bit plane direction from the upper digit to the lower digit. A description will be given by describing the n-th bit from the bottom in the case where the coefficient value Q (Sb, x, y) of each subband is expressed in natural binary notation as Qn (x, y). Note that a variable n representing a digit of a binary number is referred to as a bit plane number, and the bit plane number n has an LSB (least significant bit) as the 0th digit.
FIG. 4 is a flowchart illustrating a processing procedure for encoding subband Sb in bit plane encoding section 204. As shown in FIG. 4, first, the absolute value of the coefficient in the subband Sb to be encoded is checked, and the maximum value Mabs (Sb) is obtained (step S601). Next, the number of digits N _BP (Sb) required when Mabs (Sb) is represented by a binary number in order to represent the absolute value of the coefficient in the sub-band Sb is obtained using Expression (6) (Step S602). ).

Ｎ_ＢＰ（Ｓｂ）＝ｃｅｉｌ｛ｌｏｇ２（Ｍａｂｓ（Ｓｂ）＋１）｝ …（６） N _BP (Sb) = ceil {log2 (Mabs (Sb) +1)} (6)

但し、ｃｅｉｌ｛Ｒ｝は実数Ｒに等しい、又はそれ以上の最小の整数値を表すものとする。 Note that ceil {R} represents the smallest integer value equal to or greater than the real number R.

次に、ビットプレーン番号ｎに有効桁数Ｎ_ＢＰ（Ｓｂ）を代入する（ステップＳ６０３）。そして、ビットプレーン番号ｎから１を引いてｎ−１を求めてｎに代入する（ステップＳ６０４）。 Next, the number of significant digits N _BP (Sb) is substituted for the bit plane number n (step S603). Then, 1 is subtracted from the bit plane number n to obtain n-1 and substitute for n (step S604).

さらに、ｎ桁目のビットプレーンを二値算術符号を用いて符号化する（ステップＳ６０５）。ビットプレーン内の各ビットを符号化する際には、符号化済みの情報からいくつかの状態（コンテクスト）に分類し、それぞれ異なる出現確率予測モデルで符号化する。動画像符号化装置２００においては、使用する算術符号としてＭＱ−Ｃｏｄｅｒを用いる。このＭＱ−Ｃｏｄｅｒを用いて、ある状態（コンテクスト）で発生した二値シンボルを符号化する手順、或いは、算術符号化処理のための初期化手順、終端手順については、静止画像の国際標準ＩＳＯ／ＩＥＣ１５４４４−１勧告等に詳細に説明されているのでここでは説明を省略する。 Further, the n-th bit plane is encoded using a binary arithmetic code (step S605). When each bit in the bit plane is encoded, it is classified into several states (contexts) from the encoded information, and is encoded using different appearance probability prediction models. The moving picture coding apparatus 200 uses MQ-Coder as an arithmetic code to be used. A procedure for encoding a binary symbol generated in a certain state (context) using the MQ-Coder, or an initialization procedure and an end procedure for arithmetic encoding processing, are described in International Standard ISO / ISO for Still Images. Since it is described in detail in the IEC15444-1 recommendation and the like, the description is omitted here.

また、動画像符号化装置２００では、各ビットプレーンの符号化の開始時に算術符号化器を初期化し、終了時に算術符号化器の終端処理を行うものとする。また、個々の係数の最初に符号化される「１」の直後に、その係数の正負符号を０、１で表し、算術符号化する。ここでは、正の場合は０、負の場合は１とする。例えば、係数が−５で、この係数の属するサブバンドＳｂの有効桁数Ｎ_ＢＰ（Ｓｂ）が６の場合、係数の絶対値は２進数０００１０１で表され、各ビットプレーンの符号化により上位桁から下位桁へと符号化される。そして、２番目のビットプレーンの符号化時（この場合、上から４桁目）に最初の「１」が符号化され、この直後に正負符号「１」を算術符号化する。 Also, in the moving picture coding apparatus 200, the arithmetic coder is initialized at the start of coding of each bit plane, and termination processing of the arithmetic coder is performed at the end. Immediately after "1" which is coded at the beginning of each coefficient, the sign of the coefficient is represented by 0 or 1, and arithmetically coded. Here, the value is 0 for a positive value and 1 for a negative value. For example, when the coefficient is −5 and the number of significant digits N _BP (Sb) of the subband Sb to which the coefficient belongs is 6, the absolute value of the coefficient is represented by a binary number 00001, and the upper digit is obtained by encoding each bit plane. To the lower digit. Then, at the time of encoding the second bit plane (in this case, the fourth digit from the top), the first “1” is encoded, and immediately after that, the sign “1” is arithmetically encoded.

次に、ビットプレーン番号ｎが０であるか否かを判定する（ステップＳ６０６）。その結果、ｎが０、すなわちステップＳ６０５においてＬＳＢプレーンの符号化を行った場合（ＹＥＳ）、サブバンドＳｂの符号化処理を終了する。また、それ以外の場合（ＮＯ）、ステップＳ６０４に処理を移す。 Next, it is determined whether the bit plane number n is 0 (step S606). As a result, if n is 0, that is, if encoding of the LSB plane has been performed in step S605 (YES), the encoding process of the subband Sb ends. Otherwise (NO), the process moves to step S604.

上述した処理によって、サブバンドＳｂの全係数を符号化することができ、各ビットプレーンｎに対応する符号列ＣＳ（Ｓｂ，ｎ）を生成する。生成した符号列ＣＳ（Ｓｂ，ｎ）は、符号列形成部２０５に送られ、符号列形成部２０５内の不図示のバッファに一時的に格納される。 By the above-described processing, all the coefficients of the sub-band Sb can be encoded, and a code string CS (Sb, n) corresponding to each bit plane n is generated. The generated code sequence CS (Sb, n) is sent to the code sequence forming unit 205, and is temporarily stored in a buffer (not shown) in the code sequence forming unit 205.

符号列形成部２０５では、ビットプレーン符号化部２０４により全サブバンドの係数の符号化が終了して全符号列が内部バッファに格納されると、所定の順序で内部バッファに格納された符号列を読み出す。そして、必要な付加情報を挿入して、１枚のフレームに対応する符号列を形成し、２次記憶装置２０６へと出力する。 In the code sequence forming unit 205, when the encoding of the coefficients of all the subbands is completed by the bit plane coding unit 204 and all the code sequences are stored in the internal buffer, the code sequences stored in the internal buffer in a predetermined order Read out. Then, necessary additional information is inserted to form a code string corresponding to one frame and output to the secondary storage device 206.

符号列形成部２０５で生成される最終的な符号列は、ヘッダと、レベル０、レベル１及びレベル２の３つに階層化された符号化データとにより構成される。ヘッダには画像の水平方向、垂直方向の画素数や、２次元離散ウェーブレット変換の適用回数、選択されたフィルタを指定する情報、各サブバンドの量子化ステップｄｅｌｔａ（Ｓｂ）など、復号に必要となる付加情報が格納される。 The final code string generated by the code string forming unit 205 includes a header and coded data hierarchized into three levels: level 0, level 1, and level 2. The header includes information such as the number of pixels in the horizontal and vertical directions of the image, the number of applications of the two-dimensional discrete wavelet transform, information specifying the selected filter, and the quantization step delta (Sb) of each subband. Is stored.

レベル０の符号化データは、サブバンドＬＬの係数を符号化して得られるＣＳ（ＬＬ，Ｎ_ＢＰ（ＬＬ）−１）からＣＳ（ＬＬ，０）の符号列で構成される。また、レベル１は、ＬＨ１、ＨＬ１、ＨＨ１の各サブバンドの係数を符号化して得られる符号列ＣＳ（ＬＨ１，Ｎ_ＢＰ（ＬＨ１）−１）からＣＳ（ＬＨ１，０）、ＣＳ（ＨＬ１，Ｎ_ＢＰ（ＨＬ１）−１）からＣＳ（ＨＬ１，０）、及びＣＳ（ＨＨ１，Ｎ_ＢＰ（ＨＨ１）−１）からＣＳ（ＨＨ１，０）で構成される。さらに、レベル２は、ＬＨ２、ＨＬ２、ＨＨ２の各サブバンドの係数を符号化して得られる符号列ＣＳ（ＬＨ２，Ｎ_ＢＰ（ＬＨ２）−１）からＣＳ（ＬＨ２，０）、ＣＳ（ＨＬ２，Ｎ_ＢＰ（ＨＬ２）−１）からＣＳ（ＨＬ２，０）、及びＣＳ（ＨＨ２，Ｎ_ＢＰ（ＨＨ２）−１）からＣＳ（ＨＨ２，０）で構成される。 The coded data of level 0 is composed of a code string of CS (LL, _NBP (LL) -1) to CS (LL, 0) obtained by coding the coefficients of the subband LL. The level 1 is, LH1, HL1, encodes the coefficients of each subband HH1 code string obtained by _CS from _{(LH1, N BP (LH1)} -1) CS (LH1,0), CS (HL1, N _BP (HL1) -1) from CS (HL1,0), and _{CS (HH1, N BP (HH1} ) consists -1) with CS (HH1,0). Furthermore, level 2, LH2, HL2, encodes the coefficients of each subband HH2 code string obtained by _CS from _{(LH2, N BP (LH2)} -1) CS (LH2,0), CS (HL2, N _BP (HL2) -1) from CS (HL2,0), and _{CS (HH2, N BP (HH2} ) consists -1) with CS (HH2,0).

図５は、符号列形成部２０５において生成される１枚のフレームデータに対応する符号列の細部構造を示す図である。 FIG. 5 is a diagram illustrating a detailed structure of a code string corresponding to one frame data generated by the code string forming unit 205.

図５に示すように構成された符号列は、復号側でヘッダとレベル０の符号化データを復号することにより元の１／４の解像度の復元画像を得ることができる。また、レベル１の符号化データを加えて復号することにより元の１／２の解像度の復元画像を得ることができる。さらに、レベル２の符号化データまで加えて復号した場合には、元の解像度の復元画像を得ることができるというように、徐々に解像度を上げて画像を復号することができる。 In the code string configured as shown in FIG. 5, a decoded image having the original quarter resolution can be obtained by decoding the header and the coded data of level 0 on the decoding side. Further, by adding and decoding the level 1 encoded data, a restored image having the original 1/2 resolution can be obtained. Furthermore, when decoding is performed in addition to the encoded data of level 2, it is possible to decode the image by gradually increasing the resolution so that a restored image of the original resolution can be obtained.

一方、各レベルのビットプレーン符号化データの、上位のいくつかのビットプレーンのみを復号した場合には荒い復号画像を、下位のビットプレーンへと復号対象を増やしていった場合には、徐々に精度を上げて各サブバンドの変換係数を復元することができ、復号画質を向上させることが可能となる。 On the other hand, when only some upper bit planes of the bit plane encoded data of each level are decoded, a coarse decoded image is gradually decoded. The transform coefficients of each subband can be restored with higher accuracy, and the decoded image quality can be improved.

２次記憶装置２０６は、例えば、ハードディスクやメモリといった記憶装置であり、符号列形成部２０５で生成された符号列を内部に格納する。２次記憶装置２０６では符号列形成部２０５から出力される各フレームの符号列が連結され、動画像データの符号化データが構成される。図６は２次記憶装置２０６に格納される動画像符号化データの構造を示すものである。先頭のヘッダには動画像としての付加情報、例えば、フレーム数や、再生フレームレートなどが格納される。 The secondary storage device 206 is, for example, a storage device such as a hard disk or a memory, and stores therein the code string generated by the code string forming unit 205. In the secondary storage device 206, the code strings of the respective frames output from the code string forming unit 205 are connected to form encoded data of moving image data. FIG. 6 shows the structure of the encoded video data stored in the secondary storage device 206. In the first header, additional information as a moving image, for example, the number of frames, a reproduction frame rate, and the like are stored.

＜第１の実施形態＞
図７は、本発明の第１の実施形態に係る動画像復号装置１００の構成を示すブロック図である。前述の動画像符号化装置２００と共通するブロックについては同じ参照番号を用いる。図７に示すように、第１の実施形態に係る動画像復号装置１００は、２次記憶装置２０６、符号列読み出し部１０１、ビットプレーン復号部１０２、逆離散ウェーブレット変換部１０４、動画像データ出力部１０５、復号処理時間計測部１０６、非復号ビットプレーン決定部１０７とを備える。 <First embodiment>
FIG. 7 is a block diagram illustrating a configuration of the video decoding device 100 according to the first embodiment of the present invention. The same reference numerals are used for blocks common to the above-described video encoding device 200. As shown in FIG. 7, the moving image decoding apparatus 100 according to the first embodiment includes a secondary storage device 206, a code string reading unit 101, a bit plane decoding unit 102, an inverse discrete wavelet transform unit 104, and a moving image data output. A decoding unit 105, a decoding processing time measuring unit 106, and a non-decoding bit plane determining unit 107.

以下、図７を参照して、第１の実施形態の動画像復号装置１００の動作手順について説明する。 Hereinafter, an operation procedure of the video decoding device 100 according to the first embodiment will be described with reference to FIG.

第１の実施形態の動画像復号装置１００で復号対象とする動画像符号化データは、前述の動画像符号化装置２００により生成した符号化データである。また、動画像符号化データの生成にあたっては、全てのフレームで整数型５×３フィルタを使用したものとする。すなわち、前述した動画像符号化装置２００の信号線２０７から整数型５×３フィルタを選択する信号を入力して動画像データの符号化を行う。 The encoded video data to be decoded by the video decoding device 100 of the first embodiment is the encoded data generated by the video encoding device 200 described above. In addition, when generating encoded video data, it is assumed that an integer type 5 × 3 filter is used for all frames. That is, a signal for selecting an integer type 5 × 3 filter is input from the signal line 207 of the moving picture coding apparatus 200 described above to perform coding of moving picture data.

動画像符号化データの復号は、符号化データ中のフレーム単位で行われる。そこで、符号列読み出し部１０１は、二次記憶装置２０６に格納されている符号化データから着目するフレームの符号化データを読み出して不図示の内部バッファに格納する。符号化データのフレーム単位の読み出しは、フレーム１、フレーム２というように順番に行われる。 The decoding of the encoded moving image data is performed in units of frames in the encoded data. Therefore, the code string reading unit 101 reads the encoded data of the frame of interest from the encoded data stored in the secondary storage device 206, and stores the encoded data in an internal buffer (not shown). Reading of encoded data in units of frames is performed in order, such as frame 1 and frame 2.

ビットプレーン復号部１０２は、符号列読み出し部１０１の内部バッファに格納される符号化データをサブバンド順に読み出して、量子化された変換係数データＱ（Ｓｂ，ｘ，ｙ）を復号する。ビットプレーン復号部１０２における処理は、図１に示されるビットプレーン符号化部２０４と対を成すものである。 The bit plane decoding unit 102 reads the encoded data stored in the internal buffer of the code string reading unit 101 in the order of subbands, and decodes the quantized transform coefficient data Q (Sb, x, y). The processing in the bit plane decoding unit 102 forms a pair with the bit plane encoding unit 204 shown in FIG.

すなわち、ビットプレーン符号化部２０４では、上位のビットプレーンから下位のビットプレーンへと係数の絶対値の各ビットを所定のコンテクストにより二値算術符号化する。これに対し、ビットプレーン復号部１０２では、同様に上位のビットプレーンから下位のビットプレーンへと符号化時と同じコンテクストにより二値算術符号化データの復号を行い、係数の各ビットを復元する。また、係数の正負符号については符号化時と同じタイミングで、同じコンテクストを用いて算術符号の復号を行うようにする。 That is, the bit plane coding unit 204 performs binary arithmetic coding of each bit of the absolute value of the coefficient from the upper bit plane to the lower bit plane in a predetermined context. On the other hand, the bit plane decoding unit 102 similarly decodes the binary arithmetic coded data from the upper bit plane to the lower bit plane in the same context as when encoding, and restores each bit of the coefficient. Further, with respect to the sign of the coefficient, the arithmetic code is decoded using the same context at the same timing as the encoding.

但し、このとき非復号ビットプレーン決定部１０７からは、各サブバンドについて復号しない下位ビットプレーン数ＮＤ（Ｓｂ）が指示され、ビットプレーン復号部１０２では指示される枚数の下位ビットプレーンについては復号処理を行わない。例えば、サブバンドＨＨ２の係数を復号する場合に、非復号ビットプレーン決定部１０７から与えられるサブバンドＨＨ２の非復号ビットプレーン数ＮＤ（ＨＨ２）が２である場合、符号列読み出し部１０１から読み出されるサブバンドＨＨ２の係数の符号化データのＣＳ（ＨＨ２，Ｎ_ＢＰ（ＨＨ２）−１）からＣＳ（ＨＨ２，２）までを復号してサブバンドの係数を復元し、ＣＳ（ＨＨ２，１）およびＣＳ（ＨＨ２，０）の２枚のビットプレーンについては復号しない。 However, at this time, the non-decoding bit plane determining unit 107 indicates the number of lower bit planes ND (Sb) not to be decoded for each subband, and the bit plane decoding unit 102 performs decoding processing on the specified number of lower bit planes. Do not do. For example, when decoding the coefficient of the sub-band HH2, when the number ND (HH2) of the non-decoded bit planes of the sub-band HH2 given from the non-decoded bit plane determining unit 107 is 2, the coefficient is read from the code string reading unit 101. restore the coefficients of subband decoded from the encoded data of the coefficients of the subband _{HH2 CS (HH2, N BP (} HH2) -1) to CS (HH2,2), CS (HH2,1 ) and CS The two bit planes (HH2, 0) are not decoded.

逆離散ウェーブレット変換部１０４では、図１における離散ウェーブレット変換部２０２でのウェーブレット変換処理の逆変換を行い、フレームのデータを復元する。本第１の実施形態の動画像復号装置１００では、全てのフレームで整数型５×３フィルタを使用して生成される動画像符号化データを復号対象とするので、上述した式（１）、（２）に対応する逆変換を行う。 The inverse discrete wavelet transform unit 104 performs an inverse transform of the wavelet transform process in the discrete wavelet transform unit 202 in FIG. 1 to restore frame data. In the video decoding device 100 according to the first embodiment, since the video coded data generated by using the integer type 5 × 3 filter in all frames is to be decoded, the above equation (1) is used. The inverse transformation corresponding to (2) is performed.

動画像の再生表示を行う場合には、各フレームデータは所定の時間に表示される。一方、逆離散ウェーブレット変換１０４からの出力は、復号処理に掛かる時間により左右されるので、表示すべき時間と同期するものではない。このため、復号したフレームデータをバッファに格納して表示時間との調整が必要になる。例えば、動画像データ出力部１０５をネットワーク回線のインターフェースにより実現して、表示時間との調整のためのバッファ格納処理を本第１の実施の形態の動画像復号装置１００の外部で行うようにしても良いし、動画像データ出力部１０５の内部で行うようにしても良い。 When playing back and displaying a moving image, each frame data is displayed at a predetermined time. On the other hand, the output from the inverse discrete wavelet transform 104 is not synchronized with the time to be displayed because it depends on the time required for the decoding process. For this reason, it is necessary to store the decoded frame data in a buffer and adjust the display time. For example, the moving image data output unit 105 is realized by an interface of a network line, and a buffer storage process for adjusting the display time is performed outside the moving image decoding device 100 according to the first embodiment. Alternatively, the processing may be performed inside the moving image data output unit 105.

動画像データ出力部１０５の具体例として、動画像データ出力部１０５にディスプレイを接続し、動画像表示を行う場合について説明する。図１６は動画像データ出力部１０５の一形態と、ディスプレイへの接続を示す図である。同図において１６０１は複数のフレームデータを格納することが可能なバッファ、１６０２はディスプレイインターフェース、１６０３はディスプレイである。バッファ１６０１は逆離散ウェーブレット変換部１０４から可変の時間間隔で出力される復元画像データを、順番に格納する。復元画像データ格納の際には格納する復元画像データのフレーム番号と格納するアドレスを保持し、後から順番に取り出せるようにしておく。ディスプレイインターフェース１６０２は動画像データのフレームレートに応じて一定の時間間隔（例えば秒あたり３０フレームの動画像であるならば１／３０秒おき）でバッファ１６０１から復元画像データを順番に取り出し、ディスプレイ１６０３へと表示する。取り出したフレームデータはバッファ１６０１から消去する。このように、バッファ１６０１は、逆離散ウェーブレット変換部１０４からの復元画像データの入力と、ディスプレイインターフェースによるデータ取り出しの時間間隔の違いを調整する役割を果たす。 As a specific example of the moving image data output unit 105, a case where a display is connected to the moving image data output unit 105 to display moving images will be described. FIG. 16 is a diagram showing one mode of the moving image data output unit 105 and connection to a display. In the figure, reference numeral 1601 denotes a buffer capable of storing a plurality of frame data, 1602 denotes a display interface, and 1603 denotes a display. The buffer 1601 sequentially stores restored image data output from the inverse discrete wavelet transform unit 104 at variable time intervals. When the restored image data is stored, the frame number of the restored image data to be stored and the storage address are held so that they can be sequentially taken out later. The display interface 1602 sequentially takes out the restored image data from the buffer 1601 at a fixed time interval (for example, every 1/30 second if the moving image is 30 frames per second) according to the frame rate of the moving image data, and displays it on the display 1603. To be displayed. The extracted frame data is deleted from the buffer 1601. As described above, the buffer 1601 plays a role of adjusting the difference in the time interval between the input of the restored image data from the inverse discrete wavelet transform unit 104 and the data extraction by the display interface.

本実施の形態の動画像復号装置では、後述する処理によって、１フレームの復号処理時間の平均が目標復号時間Ｔとなるように制御を行う。しかしながら、各フレームのデータ復元に掛かる時間にはばらつきがあるため、バッファ１６０１に格納されるフレームデータの枚数が変化する。そのため、表示時間になってもフレームデータがバッファに準備されていない状態にならないように、動画復号の開始から所定時間後に再生表示を開始したり、あるいは所定のフレーム枚数がバッファに蓄積されてから再生開始するなど、動画再生開始のタイミングを調整する。また、バッファ容量に制約のある場合であって、バッファに所定枚数以上のフレームデータが格納されている場合には、フレームデータの復号処理を休止するよう制御することも必要となる。 The moving picture decoding apparatus according to the present embodiment performs control so that the average of the decoding processing time of one frame becomes the target decoding time T by the processing described later. However, since the time required for data restoration of each frame varies, the number of frame data stored in the buffer 1601 changes. Therefore, in order to prevent the frame data from being unprepared in the buffer even when the display time has elapsed, reproduction display is started after a predetermined time from the start of video decoding, or after a predetermined number of frames are accumulated in the buffer. Adjust the timing of video playback start, such as starting playback. In addition, when the buffer capacity is limited, and when a predetermined number or more of frame data is stored in the buffer, it is necessary to control to stop the decoding process of the frame data.

そして、動画像データ出力部１０５は、逆離散ウェーブレット変換部１０４から出力される復元画像データを装置外部へと出力する。動画像データ出力部１０５は、例えばネットワーク回線や表示デバイスへのインタフェース等によって実現することが可能である。 Then, the moving image data output unit 105 outputs the restored image data output from the inverse discrete wavelet transform unit 104 to the outside of the device. The moving image data output unit 105 can be realized by, for example, an interface to a network line or a display device.

復号処理時間計測部１０６は、各フレームについて、符号列読み出し部１０１によるフレーム符号化データの読み出し開始から動画像データ出力部１０５による復元されたフレームデータの出力までにかかる時間Ｄｔを測定し、非復号ビットプレーン決定部１０７へと出力する。ただし、図１６に例示したように、動画像データ出力部１０５が内部にフレームデータを格納するバッファを備え、装置外部への出力を一定間隔に調整する場合には、時間Ｄｔはバッファ１６０１へのフレームデータ格納までの時間とする。 The decoding processing time measuring unit 106 measures the time Dt required for each frame from the start of reading the encoded frame data by the code string reading unit 101 to the output of the restored frame data by the moving image data output unit 105. Output to decoding bit plane determining section 107. However, as illustrated in FIG. 16, when the moving image data output unit 105 includes a buffer for storing the frame data therein and adjusts the output to the outside of the apparatus at a constant interval, the time Dt is set to the buffer 1601. This is the time until the frame data is stored.

非復号ビットプレーン決定部１０７は、復号処理時間計測部１０６から出力される１フレームの復号処理時間を元に、各サブバンドの非復号ビットプレーンを決定する。非復号ビットプレーン決定部１０７はその内部に、非復号ビットプレーン数決定のインデックス値となる変数Ｑ（以降、「Ｑファクタ」と呼ぶ。）と、それぞれのＱファクタにおいて各サブバンドの非復号ビットプレーン数を示したテーブルと、目標復号処理時間Ｔ、時間差分ΔＴを保持する。図８にＱファクタと各サブバンドの非復号ビットプレーン数の対応を表すテーブルの例を示す。 Non-decoding bit plane determining section 107 determines a non-decoding bit plane for each subband based on the decoding processing time of one frame output from decoding processing time measuring section 106. The non-decoding bit plane determining unit 107 includes therein a variable Q (hereinafter, referred to as a “Q factor”) serving as an index value for determining the number of non-decoding bit planes, and a non-decoding bit of each subband in each Q factor. It holds a table indicating the number of planes, a target decoding processing time T, and a time difference ΔT. FIG. 8 shows an example of a table representing the correspondence between the Q factor and the number of non-decoded bit planes of each subband.

図９は、動画像復号装置１００による動画像符号化データの復号処理の流れを示すフローチャートである。図９に示すように、まず、動画像符号化データの復号開始時点、即ち、フレーム１の符号化データの復号開始前にＱファクタ、時間差分ΔＴを０にリセットする（ステップＳ７０１）。 FIG. 9 is a flowchart showing the flow of the decoding processing of the encoded video data by the video decoding apparatus 100. As shown in FIG. 9, first, the Q factor and the time difference ΔT are reset to 0 at the start of the decoding of the encoded video data, that is, before the decoding of the encoded data of the frame 1 starts (step S701).

次に、非復号ビットプレーン決定部１０７で、Ｑファクタに基づいて各サブバンドの非復号ビットプレーン数をテーブルから読み出し、ビットプレーン復号部１０２へ設定する（ステップＳ７０２）。 Next, the non-decoding bit plane determining unit 107 reads out the number of non-decoding bit planes of each subband from the table based on the Q factor, and sets the number in the bit plane decoding unit 102 (step S702).

続いて、符号列読み出し部１０１から逆離散ウェーブレット変換部１０４の処理により、１フレームの復号が行われ、動画像データ出力部１０５にフレームデータが出力される（ステップＳ７０３）。 Subsequently, one frame is decoded by the processing of the inverse discrete wavelet transform unit 104 from the code string reading unit 101, and the frame data is output to the moving image data output unit 105 (step S703).

復号処理時間計測部１０６はステップＳ７０３で行われた１フレームの復号処理にかかった時間Ｄｔを計測し、非復号ビットプレーン決定部１０７へと渡す（ステップＳ７０４）。 The decoding processing time measurement unit 106 measures the time Dt required for the decoding processing of one frame performed in step S703, and passes it to the non-decoding bit plane determination unit 107 (step S704).

非復号ビットプレーン決定部１０７は、１フレームの目標復号時間Ｔと実際にかかった復号処理時間Ｄｔの差分を求め、保持している時間差分ΔＴに加算する（ステップＳ７０５）。 The non-decoding bit plane determining unit 107 obtains the difference between the target decoding time T of one frame and the actually applied decoding processing time Dt, and adds the difference to the held time difference ΔT (step S705).

次に、ΔＴの値に応じてＱファクタを更新する（ステップＳ７０６）。ΔＴがあらかじめ設定した所定の閾値Ｕｑ（Ｕｑ＞０）よりも大きければＱから１を減じ、値を小さくする。ΔＴが所定の閾値よりも大きくなるのは目標の時間の総和に対して実際にかかった復号時間の総和が小さい場合であり、復号画質を向上させるためにＱの値を小さくすることにより非復号ビットプレーン数を減らす。また、反対にΔＴがあらかじめ設定した所定の閾値Ｌｑ（Ｌｑ＜０）よりも小さければＱに１を加えて、値を大きくする。ΔＴが所定の閾値Ｌｑよりも小さくなるのは目標の時間の総和に対して実際にかかった復号時間の総和が大きい場合であり、１フレームの復号時間を短縮するために値を大きくすることにより非復号ビットプレーン数を増やす。但しＱの値の範囲は０から９までとし、上述の更新処理により０より小さくなった場合には０、９より大きくなった場合には９とする。なお、ΔＴが閾値ＬｑとＵｑの範囲内ある場合には（Ｌｑ≦ΔＴ≦Ｕｑ）、目標の時間の総和に対して実際にかかった復号時間の総和が丁度良い範囲内にあるので、Ｑの値は変更せずにそのままにしておく。 Next, the Q factor is updated according to the value of ΔT (step S706). If ΔT is larger than a predetermined threshold Uq (Uq> 0), 1 is subtracted from Q to reduce the value. ΔT becomes larger than the predetermined threshold value when the total sum of the decoding time actually taken with respect to the total sum of the target time is small, and the non-decoding is performed by reducing the value of Q in order to improve the decoding image quality. Reduce the number of bit planes. Conversely, if ΔT is smaller than a predetermined threshold Lq (Lq <0), 1 is added to Q to increase the value. ΔT becomes smaller than the predetermined threshold Lq when the total sum of the decoding times actually taken with respect to the total sum of the target times is large, and by increasing the value to shorten the decoding time of one frame. Increase the number of non-decoded bit planes. However, the range of the value of Q is from 0 to 9, and is 0 when it is smaller than 0 and 9 when it is larger than 9 by the above-described update processing. If ΔT is within the range between the thresholds Lq and Uq (Lq ≦ ΔT ≦ Uq), the sum of the decoding times actually taken with respect to the sum of the target times is just within a good range. Leave the value unchanged.

復号処理を行ったフレームが最後のフレームであるか否かを判定し、最後のフレームでない場合（ＮＯ）にはステップＳ７０２に処理を移し、次のフレームの復号を行い、最後のフレームである場合（ＹＥＳ）は動画像符号化データの復号処理を終了する（ステップＳ７０７）。 It is determined whether or not the decoded frame is the last frame. If it is not the last frame (NO), the process proceeds to step S702, where the next frame is decoded and the last frame is decoded. (YES) ends the decoding processing of the encoded video data (step S707).

上述したように、１フレームの復号処理にかかる時間と目標復号時間の差分の累積値から、非復号ビットプレーン数のインデックス値であるＱファクタを定め、Ｑファクタに応じてサブバンド毎の非復号ビットプレーン数を変えることで、再生画像の視覚上の不具合をできるだけ抑制して、復号処理時間を制御することができる。 As described above, the Q factor that is the index value of the number of non-decoding bit planes is determined from the accumulated value of the difference between the time required for decoding one frame and the target decoding time, and the non-decoding for each subband is determined according to the Q factor. By changing the number of bit planes, it is possible to control the decoding processing time while minimizing visual defects in the reproduced image.

＜第２の実施形態＞
第１の実施形態では、復号対象となる動画像符号化データは全て整数型５×３フィルタによりサブバンド分解を行うことを前提として説明した。本第２の実施形態の動画像復号装置では、実数型５×３フィルタを使用してサブバンド分解を行った動画像符号化データを復号対象とする場合について説明する。即ち、前述した動画像符号化装置２００において信号線２０７から実数型５×３フィルタを選択する信号を入力して、動画像の各フレームを符号化する。符号化する際に各サブバンドの量子化ステップｄｅｌｔａ（Ｓｂ）を全フレームで同一とする。 <Second embodiment>
In the first embodiment, the description has been made on the assumption that all the moving image encoded data to be decoded are subjected to subband decomposition by an integer type 5 × 3 filter. In the moving picture decoding apparatus according to the second embodiment, a case will be described in which moving picture coded data subjected to subband decomposition using a real number type 5 × 3 filter is to be decoded. That is, a signal for selecting the real number type 5 × 3 filter is input from the signal line 207 to the moving picture coding apparatus 200 described above, and each frame of the moving picture is coded. When encoding, the quantization step delta (Sb) of each subband is set to be the same for all frames.

図１０は、本発明の第２の実施形態に係る動画像復号装置３００の構成を示すブロック図である。前述の動画像符号化装置２００、および第１の実施形態の動画像復号装置１００と共通するブロックについては同じ参照番号を用いる。図１０に示すように、第２の実施形態に係る動画像復号装置３００は、２次記憶装置２０６、符号列読み出し部９０４、ビットプレーン復号部１０２、係数逆量子化部９０１、逆離散ウェーブレット変換部９０２、動画像データ出力部１０５、復号処理時間計測部１０６、非復号ビットプレーン決定部９０３とを備える。 FIG. 10 is a block diagram illustrating a configuration of a video decoding device 300 according to the second embodiment of the present invention. The same reference numerals are used for blocks common to the above-described video encoding device 200 and the video decoding device 100 of the first embodiment. As shown in FIG. 10, the video decoding device 300 according to the second embodiment includes a secondary storage device 206, a code string reading unit 904, a bit plane decoding unit 102, a coefficient inverse quantization unit 901, an inverse discrete wavelet transform. A video data output unit 105; a decoding processing time measuring unit 106; and a non-decoded bit plane determining unit 903.

以下、同図を用いて、本第２の実施形態の動画像復号装置３００の動作手順について説明する。 Hereinafter, the operation procedure of the video decoding device 300 according to the second embodiment will be described with reference to FIG.

まず、第１の実施形態の符号列読み出し部１０１と同様にして符号列読み出し部９０４により、二次記憶装置２０６に格納されている動画像符号化データから着目するフレームの符号化データを読み出して不図示の内部バッファに格納する。このとき、読み出したフレームの符号化データのヘッダから各サブバンドＳｂの量子化ステップｄｅｌｔａ（Ｓｂ）を読み出し、同じく不図示の内部バッファに格納しておく。 First, the coded data of the target frame is read from the coded video data stored in the secondary storage device 206 by the coded sequence reading unit 904 in the same manner as the coded sequence reading unit 101 of the first embodiment. It is stored in an internal buffer (not shown). At this time, the quantization step delta (Sb) of each sub-band Sb is read from the header of the coded data of the read frame and stored in an internal buffer (not shown).

ビットプレーン復号部１０２は、第１の実施形態と同様にして符号列読み出し部９０４の内部バッファに格納される符号化データから、量子化された変換係数データＱ（Ｓｂ，ｘ，ｙ）を復号する。本第２の実施形態の動画像復号装置３００においても、非復号ビットプレーン決定部９０３から指示されるＮＤ（Ｓｂ）枚の下位ビットプレーンについては復号処理を行わない。 The bit plane decoding unit 102 decodes the quantized transform coefficient data Q (Sb, x, y) from the encoded data stored in the internal buffer of the code sequence reading unit 904 in the same manner as in the first embodiment. I do. Also in the video decoding device 300 of the second embodiment, the decoding process is not performed on ND (Sb) lower-order bit planes specified by the non-decoding bit plane determining unit 903.

係数逆量子化部９０１では、各サブバンド毎に定めた量子化ステップｄｅｌｔａ（Ｓｂ）とビットプレーン復号部１０２で復号された量子化された係数値をＱ（Ｓｂ，ｘ，ｙ）とから、各サブバンドの係数Ｃ（Ｓｂ，ｘ，ｙ）を復元する。 The coefficient inverse quantization unit 901 calculates the quantization step delta (Sb) determined for each sub-band and the quantized coefficient value decoded by the bit plane decoding unit 102 from Q (Sb, x, y). The coefficient C (Sb, x, y) of each subband is restored.

逆離散ウェーブレット変換部９０２では、図１における離散ウェーブレット変換部２０２でのウェーブレット変換処理の逆変換を行い、フレームのデータを復元する。本第２の実施形態の動画像復号装置３００では、全てのフレームで実数型５×３フィルタを使用して生成される動画像符号化データを復号対象とするので、上述した式（３）、（４）に対応する逆変換を行う。 The inverse discrete wavelet transform unit 902 performs inverse transform of the wavelet transform process in the discrete wavelet transform unit 202 in FIG. 1 to restore frame data. In the moving picture decoding apparatus 300 of the second embodiment, the moving picture coded data generated by using the real number type 5 × 3 filter is to be decoded in every frame. The inverse transformation corresponding to (4) is performed.

そして、動画像データ出力部１０５は、逆離散ウェーブレット変換部９０２から出力される復元画像データを装置外部へと出力する。 Then, the moving image data output unit 105 outputs the restored image data output from the inverse discrete wavelet transform unit 902 to the outside of the device.

復号処理時間計測部１０６は、第１の実施形態と同様に、各フレームについて、符号列読み出し部９０４によるフレーム符号化データの読み出し開始から動画像データ出力部１０５による復元されたフレームデータの出力までにかかる時間Ｄｔを測定し、非復号ビットプレーン決定部９０３へと出力する。 As in the first embodiment, the decoding processing time measuring unit 106 performs, for each frame, from the start of reading the encoded frame data by the code string reading unit 904 to the output of the restored frame data by the moving image data output unit 105. Is measured and output to the non-decoding bit plane determining unit 903.

非復号ビットプレーン決定部９０３は、復号処理時間計測部１０６から出力される１フレームの復号処理時間を元に、各サブバンドの非復号ビットプレーンを決定する。非復号ビットプレーン決定部９０３はその内部に、各サブバンドの非復号ビットプレーン数ＮＤ（Ｓｂ）を示したテーブルと、目標復号処理時間Ｔ、時間差分ΔＴ、サブバンドインデックスＳＩを保持する。図１１にサブバンドＳｂ毎の非復号ビットプレーン数ＮＤ（Ｓｂ）を保持するテーブルの例を示す。 The non-decoding bit plane determining unit 903 determines a non-decoding bit plane for each subband based on the decoding processing time of one frame output from the decoding processing time measuring unit 106. The non-decoding bit plane determining unit 903 holds therein a table indicating the number ND (Sb) of non-decoding bit planes of each subband, a target decoding processing time T, a time difference ΔT, and a subband index SI. FIG. 11 shows an example of a table that holds the number ND (Sb) of non-decoded bit planes for each subband Sb.

図１２は、本画像復号装置３００による動画像符号化データの復号処理の流れを示すフローチャートである。図１２に示すように、まず、動画像符号化データの復号開始時点、即ち、フレーム１の符号化データの復号開始前にサブバンドインデックスＳＩ、時間差分ΔＴを０にリセットする（ステップＳ１１０１）。 FIG. 12 is a flowchart illustrating a flow of a decoding process of moving image encoded data by the image decoding device 300. As shown in FIG. 12, first, the subband index SI and the time difference ΔT are reset to 0 at the start of the decoding of the encoded video data, that is, before the decoding of the encoded data of the frame 1 is started (step S1101).

次に、非復号ビットプレーン決定部９０３に保持するサブバンド毎の非復号ビットプレーン数ＮＤ（Ｓｂ）を全て０に初期化する（ステップＳ１１０２）。 Next, the number of non-decoded bit planes ND (Sb) for each subband held in the non-decoded bit plane determination unit 903 is initialized to 0 (step S1102).

次に、非復号ビットプレーン決定部９０３に格納された非復号ビットプレーン数ＮＤ（Ｓｂ）を読み出し、ビットプレーン復号部１０２へ設定する（ステップＳ１１０３）。 Next, the number of non-decoded bit planes ND (Sb) stored in the non-decoded bit plane determining unit 903 is read and set to the bit plane decoding unit 102 (step S1103).

続いて、符号列読み出し部９０４から逆離散ウェーブレット変換部９０２の処理により、１フレームの復号が行われ、動画像データ出力部１０５にフレームデータが出力される（ステップＳ１１０４）。 Subsequently, one frame is decoded by the processing of the inverse discrete wavelet transform unit 902 from the code string reading unit 904, and the frame data is output to the moving image data output unit 105 (step S1104).

復号処理時間計測部１０６はステップＳ１１０４で行われた１フレームの復号処理にかかった時間Ｄｔを計測し、非復号ビットプレーン決定部９０３へと渡す（ステップＳ１１０５）。 The decoding processing time measuring unit 106 measures the time Dt required for the decoding processing of one frame performed in step S1104, and passes it to the non-decoded bit plane determining unit 903 (step S1105).

非復号ビットプレーン決定部９０３は、１フレームの目標復号時間Ｔと実際にかかった復号処理時間Ｄｔの差分を求め、保持している時間差分ΔＴに加算する（ステップＳ１１０６）。 The non-decoding bit plane determining unit 903 obtains the difference between the target decoding time T of one frame and the decoding processing time Dt actually taken, and adds the difference to the held time difference ΔT (step S1106).

次に、ΔＴの値に応じて非復号ビットプレーン数ＮＤ（Ｓｂ）を保持するテーブル、および、サブバンドインデックスＳＩを更新する（ステップＳ１１０７）。 Next, the table holding the number ND (Sb) of non-decoded bit planes and the subband index SI are updated according to the value of ΔT (step S1107).

図１３はステップＳ１１０７で行われる処理の流れを示すフローチャートである。まず、ΔＴがあらかじめ設定した所定の閾値Ｕｑ（Ｕｑ＞０）よりも大きいか否かを判断する（ステップＳ１２０１）。大きい場合（ＹＥＳ）にはサブバンドインデックスＳＩから１を減じる（ステップＳ１２０２）。そして、ＳＩが−１かどうかを判断し（ステップＳ１２０３）、−１である場合にはＳＩを６に設定する（ステップＳ１２０４）。次に、サブバンドインデックスＳＩに対応するサブバンドＳ（ＳＩ）の非復号ビットプレーン数ＮＤ（Ｓ（ＳＩ））から１を引く（ステップＳ１２０５）。ΔＴが所定の閾値よりも大きくなるのは目標の時間の総和に対して実際にかかった復号時間の総和が小さい場合であるので、非復号ビットプレーン数を減らすことで復号画質を向上させる。サブバンドインデックスＳＩとサブバンドの対応は図１４の通りである。例えばＳＩが２であれば、対応するサブバンドはＨＬ２であり、ＮＤ（ＨＬ２）の値から１を引くといった具合である。そして、ＮＤ（Ｓ（ＳＩ））と０を比較し（ステップＳ１２０６）、ＮＤ（Ｓ（ＳＩ））が０より小さい値となった場合にはＮＤ（Ｓ（ＳＩ））を０とし（ステップＳ１２０７）、ＳＩを０に戻す（ステップＳ１２１３）。 FIG. 13 is a flowchart showing the flow of the process performed in step S1107. First, it is determined whether ΔT is larger than a predetermined threshold Uq (Uq> 0) set in advance (step S1201). If it is larger (YES), 1 is subtracted from the subband index SI (step S1202). Then, it is determined whether or not SI is -1 (step S1203). If it is -1, SI is set to 6 (step S1204). Next, 1 is subtracted from the number ND (S (SI)) of non-decoded bit planes of the subband S (SI) corresponding to the subband index SI (step S1205). Since ΔT becomes larger than the predetermined threshold value when the total sum of the decoding time actually taken with respect to the total sum of the target time is small, the decoding image quality is improved by reducing the number of non-decoding bit planes. The correspondence between the subband index SI and the subband is as shown in FIG. For example, if SI is 2, the corresponding subband is HL2, and 1 is subtracted from the value of ND (HL2). Then, ND (S (SI)) is compared with 0 (step S1206). If ND (S (SI)) is smaller than 0, ND (S (SI)) is set to 0 (step S1207). ), Return SI to 0 (step S1213).

一方、ステップＳ１２０１の比較の結果、ΔＴ≦Ｕｑである場合（ＮＯ）、ΔＴをあらかじめ設定した所定の閾値Ｌｑ（Ｌｑ＜０）と比較し（ステップＳ１２０８）、ΔＴ＞Ｌｑである場合（ＮＯ）には処理を終了する。ΔＴ≦Ｌｑの場合（ＹＥＳ）、ＮＤ（Ｓ（ＳＩ））に１を加える（ステップＳ１２０９）。ΔＴが所定の閾値よりも小さくなるのは目標の時間の総和に対して実際にかかった復号時間の総和が長い場合であるので、非復号ビットプレーン数を増やすことで、１フレームの復号時間を短縮する。続いてＳＩにも１を加え（ステップＳ１２１０）、ＳＩを７と比較して（ステップＳ１２１１）、ＳＩが７ならば（ＹＥＳ）ＳＩを０に設定する（ステップＳ１２１２）。 On the other hand, if ΔT ≦ Uq as a result of the comparison in step S1201 (NO), ΔT is compared with a predetermined threshold Lq (Lq <0) set in advance (step S1208), and if ΔT> Lq (NO). Ends the processing. If ΔT ≦ Lq (YES), 1 is added to ND (S (SI)) (step S1209). Since ΔT becomes smaller than the predetermined threshold value when the total sum of the decoding times actually taken with respect to the total sum of the target times is long, the decoding time of one frame can be reduced by increasing the number of non-decoding bit planes. Shorten. Subsequently, 1 is also added to SI (step S1210), and SI is compared with 7 (step S1211). If SI is 7 (YES), SI is set to 0 (step S1212).

以上の処理により、ΔＴが所定の値より大きい場合、または所定の値よりも小さい場合に、ひとつのサブバンドの非復号ビットプレーン数ＮＤ（Ｓｂ）を１レベル変化させる。 By the above processing, when ΔT is larger than the predetermined value or smaller than the predetermined value, the number ND (Sb) of non-decoded bit planes of one subband is changed by one level.

図１２の処理に戻り、復号処理を行ったフレームが最後のフレームであるか否かを判定し（ステップＳ１１０８）、最後のフレームでない場合（ＮＯ）にはステップＳ１１０３に処理を移し、次のフレームの復号を行い、最後のフレームである場合（ＹＥＳ）は動画像符号化データの復号処理を終了する。 Returning to the processing of FIG. 12, it is determined whether or not the frame subjected to the decoding processing is the last frame (step S1108). If it is not the last frame (NO), the processing moves to step S1103, and the next frame , And if it is the last frame (YES), the decoding process of the encoded video data ends.

上述したように、１フレームの復号処理にかかる時間と目標復号時間の差分の累積値から、サブバンド毎の非復号ビットプレーン数を変えることで、再生画像の視覚上の不具合できるだけ抑制して、復号処理時間を制御することができる。 As described above, by changing the number of non-decoding bitplanes for each subband from the accumulated value of the difference between the time required for the decoding process of one frame and the target decoding time, visual defects of the reproduced image are suppressed as much as possible. The decoding processing time can be controlled.

＜第３の実施形態＞
第１、第２の実施形態の動画像復号装置では、ビットプレーンを単位として非復号部分を定めたが、着目するビットの周囲の符号化済みの部分に基づき、ビットプレーン内の各ビットをカテゴリ分けして複数のパス（サブビットプレーン）に分解し、パスを単位として非復号部分を定めることもできる。以下、パスを単位として非復号部分を定める実施形態について説明する。 <Third embodiment>
In the video decoding devices of the first and second embodiments, the non-decoding part is determined in units of bit planes, but each bit in the bit plane is classified into a category based on a coded part around a bit of interest. It is also possible to divide the data into a plurality of paths (sub-bit planes) and determine a non-decoding part in units of paths. Hereinafter, an embodiment will be described in which a non-decoding portion is determined in units of paths.

本第３の実施形態の動画像復号装置の復号対象となる動画像符号化データを生成する過程は、基本的には前述した図１に示した動画像符号化装置２００の処理と同様であるが、ビットプレーン符号化部２０４におけるビットプレーンの符号化の方法が異なっており、前述のように１つのビットプレーンを複数のパスに分けて符号化する。説明を簡略化するため、ここでは具体的なパスへ分割方法は記さないが、ＩＳＯ／ＩＥＣ１５４４４−１勧告書に記載されるＪＰＥＧ２０００におけるビットプレーン符号化方法と同様の方法により符号化する。ＪＰＥＧ２０００では、最上位のビットプレーンを除き、３つのパスに分解して符号化が行われる。従って、あるサブバンドＳｂの有効ビット数がＮ_ＢＰ（Ｓｂ）である場合、（Ｎ_ＢＰ（Ｓｂ）−１）×３＋１のパスによって符号化される。それぞれのパスにより生成される符号をＣＳＰ（Ｓｂ，ｎ）と記す。ここでｎはパスの番号であり、最初のパスの番号を（Ｎ_ＢＰ（Ｓｂ）−１）×３）とし、最後のパスの番号を０とする。 The process of generating the encoded video data to be decoded by the video decoding device of the third embodiment is basically the same as the process of the video encoding device 200 shown in FIG. 1 described above. However, the bit plane encoding unit 204 uses a different bit plane encoding method. One bit plane is divided into a plurality of passes and encoded as described above. For simplicity of description, a specific method of dividing into paths is not described here, but encoding is performed by a method similar to the bit plane encoding method in JPEG2000 described in the ISO / IEC15444-1 recommendation. In JPEG2000, encoding is performed by decomposing into three passes, except for the most significant bit plane. Therefore, the number of effective bits of a certain subband Sb be a _N BP _(Sb), it is encoded by the (N BP (Sb) -1) × 3 + 1 paths. The code generated by each pass is referred to as CSP (Sb, n). Here, n is the number of the path, the first path number is ( _NBP (Sb) -1) × 3), and the last path number is 0.

符号列形成部２０５により、ビットプレーン単位の符号化データを並べて符号列を形成したのと同様に、パスの符号化データを並べて符号列を形成する。図１５はこのようにして生成された、本第３の実施形態の動画像復号装置で復号対象となる動画像符号化データの構造の例である。図５に示した第１、第２の実施形態の符号化対象と比較すると、符号化データを構成する要素がビットプレーン符号化データＣＳ（Ｓｂ，ｎ）からパスの符号化データＣＳＰ（Ｓｂ，ｎ）に置き換わった点が異なっている。ｎはビットプレーン、または、パスの番号である。さらに、第１、第２の実施形態で復号対象とする動画像符号化データはビットプレーン符号化部２０４の出力するビットプレーン符号化データを全て含んだが、ここでは、符号列形成部２０５により符号化データの廃棄が行われる例を示している。サブバンドＨＨ１，ＬＨ２，ＨＬ２については最後のパスの符号化データを廃棄し、ＨＨ２については最後の２つのパスの符号化データを廃棄している。 In the same manner as the code string is formed by arranging the coded data in bit plane units by the code string forming unit 205, the coded data of the paths is arranged to form a code string. FIG. 15 shows an example of the structure of the encoded moving image data to be decoded by the moving image decoding device of the third embodiment generated in this manner. As compared with the encoding targets of the first and second embodiments shown in FIG. 5, the elements constituting the encoded data are the bit encoded data CS (Sb, n) and the encoded data CSP (Sb, n). n is a bit plane or path number. Furthermore, the moving picture coded data to be decoded in the first and second embodiments includes all the bit plane coded data output from the bit plane coding unit 204. An example is shown in which discarded encrypted data is performed. For the subbands HH1, LH2, and HL2, the encoded data of the last pass is discarded, and for HH2, the encoded data of the last two passes are discarded.

なお、本第３の実施形態の動画像復号装置の構成は、図７に示す第１の実施形態の動画像復号装置１００の構成と同じであり、ビットプレーン復号部１０２と非復号ビットプレーン決定部１０７の動作が異なるのみである。 Note that the configuration of the video decoding device of the third embodiment is the same as the configuration of the video decoding device 100 of the first embodiment shown in FIG. Only the operation of the unit 107 is different.

ビットプレーン復号部１０２では、前述の動画像符号化装置２００のビットプレーン符号化と対をなす復号処理により、パスの符号化データＣＳＰ（Ｓｂ，ｎ）を復号して各パスのビットを取り出し、サブバンドの係数を復元する。このとき、第１の実施形態の動画像復号装置１００では非復号ビットプレーン決定部１０７から復号しない下位ビットプレーン数ＮＤ（Ｓｂ）が指示されたが、本第３の実施形態では復号しない下位のパスの数ＮＤＰ（Ｓｂ）が指示され、ビットプレーン復号部１０２では下位ＮＤＰ（Ｓｂ）のパスの符号化データは復号しない。即ち、ＣＳＰ（Ｓｂ，ＮＤＰ（Ｓｂ）−１）からＣＳＰ（Ｓｂ，０）の復号を行わない。 The bit plane decoding unit 102 decodes the coded data CSP (Sb, n) of the path and extracts the bits of each path by a decoding process paired with the bit plane coding of the moving image coding device 200 described above. Restore the subband coefficients. At this time, in the moving picture decoding apparatus 100 according to the first embodiment, the non-decoding bit plane determining unit 107 instructs the number of lower bit planes ND (Sb) not to be decoded. The number of paths NDP (Sb) is specified, and the bit plane decoding unit 102 does not decode the encoded data of the lower NDP (Sb) path. That is, CSP (Sb, 0) is not decoded from CSP (Sb, NDP (Sb) -1).

非復号ビットプレーン決定部１０７は、第１の実施形態で非復号ビットプレーン数ＮＤ（Ｓｂ）をビットプレーン復号部１０２に指示したのと同様の処理により、非復号パス数ＮＤＰ（Ｓｂ）を指示する。 The non-decoding bit plane determining unit 107 specifies the number of non-decoding paths NDP (Sb) by the same processing as instructing the bit plane decoding unit 102 of the number of non-decoding bit planes ND (Sb) in the first embodiment. I do.

以上のように、本第３の実施形態の動画像復号装置１００では、ビットプレーンよりも細かい符号化単位で非復号部分を設定できるため、より細かな復号画質の調整、および復号時間の調整が可能となる。 As described above, in the video decoding device 100 of the third embodiment, since the non-decoding part can be set in a coding unit finer than the bit plane, finer adjustment of the decoding image quality and decoding time can be performed. It becomes possible.

＜第４の実施形態＞
上記第１から第３の実施形態においては、各フレームの目標復号時間Ｔと実際の復号処理時間Ｄｔとの差分を累積した値ΔＴに基づいて復号範囲を調整していた。第１の実施形態の説明で動画像データ出力部１０５の具体例として説明したように、動画像データ出力部１０５をバッファとディスプレイインターフェースにより構成し、ディスプレイを接続して動画像表示を行うような場合には、実際に復号処理時間を計測せずともバッファに格納されているフレームデータの枚数を監視することで同等の処理を行うことができる。以下、このような実施の形態について説明する。 <Fourth embodiment>
In the first to third embodiments, the decoding range is adjusted based on the value ΔT obtained by accumulating the difference between the target decoding time T of each frame and the actual decoding processing time Dt. As described as a specific example of the moving image data output unit 105 in the description of the first embodiment, the moving image data output unit 105 is configured by a buffer and a display interface, and a moving image is displayed by connecting a display. In this case, the same processing can be performed by monitoring the number of frame data stored in the buffer without actually measuring the decoding processing time. Hereinafter, such an embodiment will be described.

本第４の実施形態の動画像復号装置の復号対象となる動画像符号化データを生成する過程は、前述した図１に示す動画像符号化装置２００の処理と同様である。 The process of generating the encoded video data to be decoded by the video decoding device of the fourth embodiment is the same as the process of the video encoding device 200 shown in FIG. 1 described above.

図１７は、本発明の第４の実施形態に係る動画像復号装置４００の構成を示すブロック図である。動画像符号化装置２００、および第１の実施形態の動画像復号装置１００と共通するブロックについては同じ参照番号を用いる。図１７に示すように、第４の実施形態に係る動画像復号装置４００は、２次記憶装置２０６、符号列読み出し部１０１、ビットプレーン復号部１０２、逆離散ウェーブレット変換部１０４、バッファ１６０１、ディスプレイインターフェース１６０２、非復号ビットプレーン決定部１７０２、バッファ状態監視部１７０１とを備え、ディスプレイ１６０３に接続されている。同図の通り、本発明の第４の実施形態の動画像復号装置４００は第１の実施形態の動画像復号装置１００の動画像データ出力部１０５を図１６のようにバッファ１６０１とディスプレイインターフェース１６０２で構成し、復号処理時間計測部１０６をバッファ状態監視部１７０１に、非復号ビットプレーン決定部１０７を非復号ビットプレーン決定部１７０２に置き換えている。 FIG. 17 is a block diagram illustrating a configuration of a video decoding device 400 according to the fourth embodiment of the present invention. The same reference numerals are used for blocks common to the moving picture coding apparatus 200 and the moving picture decoding apparatus 100 of the first embodiment. As shown in FIG. 17, a moving image decoding device 400 according to the fourth embodiment includes a secondary storage device 206, a code string reading unit 101, a bit plane decoding unit 102, an inverse discrete wavelet transform unit 104, a buffer 1601, and a display. An interface 1602, a non-decoded bit plane determining unit 1702, and a buffer status monitoring unit 1701 are provided, and are connected to the display 1603. As shown in the figure, the moving picture data output unit 105 of the moving picture decoding apparatus 100 according to the fourth embodiment of the present invention comprises a buffer 1601 and a display interface 1602 as shown in FIG. The decoding processing time measuring unit 106 is replaced with a buffer state monitoring unit 1701 and the non-decoded bit plane determining unit 107 is replaced with a non-decoded bit plane determining unit 1702.

以下、同図を用いて、本第４の実施形態の動画像復号装置４００の動作手順について説明する。符号列読み出し部１０１、ビットプレーン復号部１０２、逆離散ウェーブレット変換部１０４の動作は第１の実施形態の動画像復号装置１００での動作と同様である。また、バッファ１６０１、ディスプレイインターフェース１６０２の基本動作についても同じく第１の実施の形態の説明にて動画像データ出力部１０５をバッファとディスプレイインターフェースにより構成する具体例として説明した通りである。但し、本第４の実施形態の動画像復号装置４００では所定の時間に本来表示すべきデータがバッファ１６０１に準備できていない状態を避けるために、復号開始後、時間ｍＴ経過してから復号フレームデータの表示を開始することとする。Ｔは上述の実施の形態と同じく、１フレームの目標復号処理時間であり、ｍは任意の正の整数である。即ち、ディスプレイインターフェース１６０２は復号処理の開始から時間ｍＴ経過の後、バッファ１６０１から一定の時間間隔にて復元フレームデータを順番に取り出してディスプレイへの表示を行う。１フレームの復号データをバッファ１６０１に準備する時間として、復号処理時間以外の要素は無視できるものとして考えるならば、秒あたり３０フレームの動画像の場合、目標復号時間Ｔは１／３０秒であり、また、バッファ１６０１から復元フレームデータを取り出す間隔もまた１／３０秒である。 Hereinafter, the operation procedure of the video decoding device 400 according to the fourth embodiment will be described with reference to FIG. The operations of the code string reading unit 101, the bit plane decoding unit 102, and the inverse discrete wavelet transform unit 104 are the same as the operations of the video decoding device 100 according to the first embodiment. The basic operations of the buffer 1601 and the display interface 1602 are also the same as those described in the first embodiment as a specific example in which the moving image data output unit 105 is configured by the buffer and the display interface. However, in the moving picture decoding apparatus 400 according to the fourth embodiment, in order to avoid a state in which data to be originally displayed at a predetermined time is not prepared in the buffer 1601, a decoding frame is set after a lapse of time mT from the start of decoding. Display of data is started. T is the target decoding processing time for one frame, as in the above embodiment, and m is any positive integer. That is, after a lapse of time mT from the start of the decoding process, the display interface 1602 sequentially retrieves the restored frame data from the buffer 1601 at fixed time intervals and displays the data on the display. Assuming that the time for preparing one frame of decoded data in the buffer 1601 is negligible, other than the decoding processing time, the target decoding time T is 1/30 second for a moving image of 30 frames per second. The interval at which the restored frame data is extracted from the buffer 1601 is also 1/30 second.

図１８は復号開始からの時系列で本動画像復号装置４００の動作を表すものである。図１８に示すように、復号開始時点（時間０）から時間ｍＴ経過した時点において最初のフレームが表示され、時間Ｔ間隔でフレームが次々と表示される。上述の第１から第３の実施形態にて復号範囲を制御する指標として用いた時間差分ΔＴは、時間ｎＴから着目するあるフレームｎが復号された時間を減じたものであり、この値が上限値Ｕｑより大きい場合には復号範囲を広げて復号画質を上げ、逆にこの値が下限値Ｌｑより小さい場合には復号範囲を狭くして復号画質を落として１フレーム当りの復号処理時間を短くするよう制御していたが、本第４の実施形態の動画像復号装置ではバッファ１６０１に格納される復号フレームデータ数に着目して同様の制御を行う。 FIG. 18 shows the operation of the main video decoding apparatus 400 in a time series from the start of decoding. As shown in FIG. 18, the first frame is displayed when a time mT has elapsed from the decoding start time (time 0), and the frames are displayed one after another at time T intervals. The time difference ΔT used as an index for controlling the decoding range in the above-described first to third embodiments is obtained by subtracting the time when a certain frame n of interest is decoded from the time nT, and this value is an upper limit. If the value is larger than the value Uq, the decoding range is widened to increase the decoding quality. Conversely, if the value is smaller than the lower limit Lq, the decoding range is narrowed to lower the decoding quality and shorten the decoding processing time per frame. However, the video decoding apparatus according to the fourth embodiment performs similar control by focusing on the number of decoded frame data stored in the buffer 1601.

図１８において、復号開始からフレームｎの復号終了時点の時間をｄとして（ｎ−１）Ｔ＜ｄ＜ｎＴである場合、即ち、０＜ΔＴ＜Ｔである場合、フレームｎの復号終了時点でバッファ１６０１に格納されている復号フレーム数はｍ枚である。ΔＴ＞Ｔとなる場合、フレームｎ復号終了時点でバッファ１６０１にはｍ＋１枚以上の復号フレームデータが格納される。反対に、ｎＴ＜ｄ＜（ｎ＋１）Ｔである場合、即ち−Ｔ＜ΔＴ＜０である場合、フレームｎ復号終了時点でバッファ１６０１に格納されている復号フレーム数はｍ−１枚である。ΔＴ＜−Ｔとなる場合には、ｍ−２枚以下となる。フレームｎの復号終了時点でバッファ１６０１に格納される復号フレーム数がｍ＋１枚以上であれば復号範囲を広げて復号画質を向上させ、逆に、ｍ−２枚以下であれば復号画質を落として１フレーム当りの復号処理時間を短くするように制御すれば、Ｕｑ＝Ｔ、Ｌｑ＝−ＴとしてΔＴに基づいて復号範囲を制御するのと同様の動作をさせることができる。なお、逆離散ウェーブレット変換部１０４によるバッファ１６０１への復号フレームデータ格納と、ディスプレイインターフェース１６０２による復号フレームデータの読み出しは同一時刻には発生しないものとする。 In FIG. 18, when the time from the start of decoding to the end of decoding of frame n is d, (n-1) T <d <nT, that is, 0 <ΔT <T, The number of decoded frames stored in the buffer 1601 is m. When ΔT> T, at the end of frame n decoding, m + 1 or more decoded frame data is stored in the buffer 1601. Conversely, when nT <d <(n + 1) T, that is, when −T <ΔT <0, the number of decoded frames stored in the buffer 1601 at the end of frame n decoding is m−1. If ΔT <−T, m−2 or less. If the number of decoded frames stored in the buffer 1601 at the end of the decoding of the frame n is m + 1 or more, the decoding range is widened to improve the decoding quality, and conversely, if m-2 or less, the decoding quality is reduced. If control is performed to shorten the decoding processing time per frame, the same operation as controlling the decoding range based on ΔT with Uq = T and Lq = −T can be performed. Note that storage of decoded frame data in the buffer 1601 by the inverse discrete wavelet transform unit 104 and reading of decoded frame data by the display interface 1602 do not occur at the same time.

バッファ状態監視部１７０１は、各フレームの復号終了時点ごとにバッファ１６０１に格納されている復号フレームの枚数ｐを取得し、非復号ビットプレーン決定部１７０２へと出力する。 The buffer state monitoring unit 1701 obtains the number p of decoded frames stored in the buffer 1601 at each decoding end time of each frame, and outputs the number to the non-decoded bit plane determining unit 1702.

非復号ビットプレーン決定部１７０２では、バッファ状態監視部１７０１から出力される格納フレーム数ｐを元に、各サブバンドの非復号ビットプレーンを決定する。非復号ビットプレーン決定部１７０２はその内部に、非復号ビットプレーン数決定のインデックス値となる変数Ｑ（以下、「Ｑファクタ」と呼ぶ。）と、それぞれのＱファクタにおいて各サブバンドの非復号ビットプレーン数を示したテーブルと、復号開始から表示開始までの遅延時間を決定するパラメータｍを保持する。図８はＱファクタと各サブバンドの非復号ビットプレーン数の対応を表すテーブルの例を示す。 The non-decoding bit plane determining unit 1702 determines a non-decoding bit plane for each subband based on the number of stored frames p output from the buffer status monitoring unit 1701. The non-decoding bit plane determining unit 1702 includes therein a variable Q (hereinafter, referred to as a “Q factor”) serving as an index value for determining the number of non-decoding bit planes, and a non-decoding bit of each subband in each Q factor. A table indicating the number of planes and a parameter m for determining a delay time from the start of decoding to the start of display are held. FIG. 8 shows an example of a table indicating the correspondence between the Q factor and the number of non-decoded bit planes of each subband.

図１９は、動画像復号装置４００による動画像符号化データの復号処理の流れを示すフローチャートである。図１９に示すように、まず、動画像符号化データの復号開始時点、即ち、フレーム１の符号化データの復号開始前にＱファクタ、時間差分ΔＴを０にリセットする（ステップＳ１９０１）。 FIG. 19 is a flowchart showing a flow of a decoding process of encoded video data by the video decoding device 400. As shown in FIG. 19, first, the Q factor and the time difference ΔT are reset to 0 at the start of decoding of the encoded video data, that is, before the decoding of the encoded data of frame 1 is started (step S1901).

次に、非復号ビットプレーン決定部１７０２で、Ｑファクタに基づいて各サブバンドの非復号ビットプレーン数をテーブルから読み出し、ビットプレーン復号部１０２へ設定する（ステップＳ１９０２）。 Next, the non-decoding bit plane determining unit 1702 reads the number of non-decoding bit planes of each subband from the table based on the Q factor, and sets the number in the bit plane decoding unit 102 (step S1902).

続いて、符号列読み出し部１０１から逆離散ウェーブレット変換部１０４の処理により１フレームの復号が行われ、バッファ１６０１にフレームデータが格納される（ステップＳ１９０３）。 Subsequently, one frame is decoded by the processing of the inverse discrete wavelet transform unit 104 from the code string reading unit 101, and the frame data is stored in the buffer 1601 (step S1903).

バッファ監視部１７０１はバッファ１６０１に格納されているフレームデータの枚数ｐを取得し、非復号ビットプレーン決定部１７０２へと渡す（ステップＳ１９０４）。 The buffer monitoring unit 1701 obtains the number p of frame data stored in the buffer 1601 and passes it to the non-decoded bit plane determining unit 1702 (step S1904).

復号処理の開始から時間ｍＴが経過しているか否か、即ち、復号フレームデータの表示を開始しているか否かを判断し（ステップＳ１９０５）、表示開始前である場合（ＮＯ）、ステップＳ１９０２に戻って次のフレームの処理を行う。表示開始済である場合（ＹＥＳ）には、ｐの値に応じてＱファクタを更新する（ステップＳ１９０６）。ｐがｍよりも大きければ（即ちｐ＞ｍならば）Ｑから１を減じ、値を小さくする。ｐ＞ｍとなるのは目標の時間の総和に対して実際にかかった復号時間の総和が小さい場合であり、復号画質を向上させるためにＱの値を小さくすることにより非復号ビットプレーン数を減らす。また、反対にｐがｍ−１より小さければ（即ち、ｐ＜ｍ−１ならば）Ｑに１を加えて、値を大きくする。ｐ＜ｍ−１となるのは目標の時間の総和に対して実際にかかった復号時間の総和が大きい場合であり、１フレームの復号時間を短縮するために値を大きくすることにより非復号ビットプレーン数を増やす。但しＱの値の範囲は０から９までとし、上述の更新処理により０より小さくなった場合には０、９より大きくなった場合には９とする。なお、ｐ＝ｍであれば、目標の時間の総和に対して実際にかかった復号時間の総和が丁度良い範囲内にあるので、Ｑの値は変更せずにそのままにしておく。 It is determined whether or not the time mT has elapsed from the start of the decoding process, that is, whether or not the display of the decoded frame data has been started (step S1905). Then, the process returns to the next frame. If the display has been started (YES), the Q factor is updated according to the value of p (step S1906). If p is greater than m (that is, if p> m), 1 is subtracted from Q to reduce the value. The case where p> m is satisfied when the total sum of the decoding time actually taken with respect to the total sum of the target time is small. cut back. Conversely, if p is smaller than m-1 (that is, if p <m-1), 1 is added to Q to increase the value. The case where p <m-1 is satisfied is when the total sum of the decoding time actually taken with respect to the total sum of the target time is large, and by increasing the value to shorten the decoding time of one frame, the non-decoded bits are reduced. Increase the number of planes. However, the range of the value of Q is from 0 to 9, and is 0 when the value is smaller than 0 and 9 when the value is larger than 9 by the above-described update processing. If p = m, the sum of the decoding times actually taken with respect to the sum of the target times is within a good range, so the value of Q is left unchanged.

次のステップＳ１９０７で復号処理を行ったフレームが最後のフレームであるか否かを判定し、最後のフレームでない場合（ＮＯ）にはステップＳ７０２に戻って次のフレームの復号を行い、最後のフレームである場合（ＹＥＳ）は動画像符号化データの復号処理を終了する。 In the next step S1907, it is determined whether or not the decoded frame is the last frame. If it is not the last frame (NO), the process returns to step S702 to decode the next frame, and the last frame is decoded. Is satisfied (YES), the decoding process of the encoded video data ends.

復号処理が終了してもディスプレイインターフェース１６０２はバッファ１６０１に格納されるフレームデータの取り出しを時間ｍＴ分継続する。 Even when the decoding process is completed, the display interface 1602 continues extracting frame data stored in the buffer 1601 for the time mT.

以上のように、本第４の実施形態の動画像復号装置４００では、１フレームの復号処理にかかる時間と目標復号時間の差分の累積値から復号範囲を調整することと同等な手法として、バッファに保持された復号フレーム数によって復号範囲を調整した。このような構成にすることで、個々のフレームの処理時間の計測が困難である場合にも、バッファに保持する復号フレーム数に応じて非復号ビットプレーン数を変えることで、再生画像の視覚上の不具合できるだけ抑制して、復号処理時間を制御することができる。 As described above, the moving picture decoding apparatus 400 according to the fourth embodiment employs a buffer as a technique equivalent to adjusting the decoding range from the cumulative value of the difference between the time required for decoding one frame and the target decoding time. The decoding range was adjusted according to the number of decoding frames held in. With such a configuration, even when it is difficult to measure the processing time of each frame, by changing the number of non-decoded bitplanes according to the number of decoded frames held in the buffer, the reproduced image can be visually recognized. Can be controlled as much as possible, and the decoding processing time can be controlled.

＜他の実施形態＞
本発明は、上述した実施形態に限定されるものではない。例えば、上述した第１から第３の実施形態においては、サブバンドを単位にビットプレーン符号化を行ったが、サブバンドをブロックに分割し、ブロック毎にビットプレーン符号化を行ってもよい。また、一つのビットプレーンを複数のパスで符号化するようにしても構わない。 <Other embodiments>
The present invention is not limited to the embodiments described above. For example, in the above-described first to third embodiments, bit-plane encoding is performed in units of sub-bands. However, sub-bands may be divided into blocks, and bit-plane encoding may be performed for each block. Further, one bit plane may be encoded by a plurality of passes.

また、二値算術符号化の方法としてMQ-Coderを用いる例について述べたが、上述の実施形態に限定されるものではなく、例えば、QM-Coder等、MQ-Coder以外の算術符号化方法を適用しても構わないし、マルチコンテクストの情報源を符号化するに適する方式であればその他の２値符号化方式を適用しても構わない。 Further, although an example using MQ-Coder as a method of binary arithmetic coding has been described, the present invention is not limited to the above-described embodiment, and for example, an arithmetic coding method other than MQ-Coder, such as QM-Coder, may be used. It may be applied, or any other binary encoding method may be applied as long as it is a method suitable for encoding a multi-context information source.

また、サブバンド分解のためのフィルタは上述の実施形態に限定されるものではなく、実数型９×７フィルタなど、その他のフィルタを使用しても構わない。さらに、その適用回数についても上述の実施形態に限定されるものではない。上述の実施の形態では水平方向、垂直方向に同回数の１次元離散ウェーブレット変換を施したが、同一回数でなくてもよい。 Further, the filter for subband decomposition is not limited to the above-described embodiment, and another filter such as a real number type 9 × 7 filter may be used. Further, the number of times of application is not limited to the above embodiment. In the above-described embodiment, the same number of one-dimensional discrete wavelet transforms are performed in the horizontal direction and the vertical direction.

さらに、動画像符号化データの構造についても上述の実施の形態に限定されるものではなく、符号列の順序、付加情報の格納形態など、変えても構わない。例えば、本発明はフレームデータの符号化方式としてＩＳＯ／ＩＥＣ１５４４４−１に定めるＪＰＥＧ２０００を用いる場合に好適なものであり、ＪＰＥＧ２０００の規格に記される符号化データ、あるいは同規格のＰａｒｔ３に規定するＭｏｔｉｏｎＪＰＥＧ２０００の符号化データとしても良い。 Further, the structure of the encoded video data is not limited to the above-described embodiment, and the order of the code string, the storage format of the additional information, and the like may be changed. For example, the present invention is suitable for the case where JPEG2000 defined in ISO / IEC15444-1 is used as a frame data encoding method, and encoded data described in the JPEG2000 standard or Motion defined in Part 3 of the standard. It may be JPEG2000 encoded data.

また、復号処理時間の計測についても上述の実施の形態に限定されるものではなく、例えば、ウェーブレット変換などの処理はおおむね一定の処理時間と推定し、ビットプレーン復号にかかる時間のみを計測するようにしても構わないし、複数フレーム単位に処理時間を計測し、非復号部分を制御しても構わない。 Also, the measurement of the decoding processing time is not limited to the above-described embodiment. For example, processing such as wavelet transform is estimated to be a substantially constant processing time, and only the time required for bit plane decoding is measured. Alternatively, the processing time may be measured in units of a plurality of frames, and the non-decoding part may be controlled.

更に、本発明は、複数の機器（例えば、ホストコンピュータ、インタフェース機器、リーダ、プリンタ等）から構成されるシステムに適用しても、一つの機器からなる装置（例えば、複写機、ファクシミリ装置等）に適用してもよい。 Further, even if the present invention is applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, a printer, and the like), an apparatus including one device (for example, a copier, a facsimile device, and the like) May be applied.

また、本発明の目的は、前述した実施形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体（または記録媒体）を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはＣＰＵやＭＰＵ）が記憶媒体に格納されたプログラムコードを読み出し実行することによっても、達成されることは言うまでもない。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。また、コンピュータが読み出したプログラムコードを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼働しているオペレーティングシステム（ＯＳ）などが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。ここでプログラムコードを記憶する記憶媒体としては、例えば、フレキシブルディスク、ハードディスク、ＲＯＭ、ＲＡＭ、磁気テープ、不揮発性のメモリカード、ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＤＶＤ、光ディスク、光磁気ディスク、ＭＯなどが考えられる。 Further, an object of the present invention is to supply a storage medium (or a recording medium) in which a program code of software for realizing the functions of the above-described embodiments is recorded to a system or an apparatus, and to provide a computer (or a CPU or Needless to say, the present invention can also be achieved by an MPU) reading and executing a program code stored in a storage medium. In this case, the program code itself read from the storage medium realizes the function of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. When the computer executes the readout program code, not only the functions of the above-described embodiments are realized, but also an operating system (OS) running on the computer based on the instruction of the program code. It goes without saying that a part or all of the actual processing is performed and the functions of the above-described embodiments are realized by the processing. Here, examples of the storage medium for storing the program code include a flexible disk, a hard disk, a ROM, a RAM, a magnetic tape, a nonvolatile memory card, a CD-ROM, a CD-R, a DVD, an optical disk, a magneto-optical disk, and an MO. Can be considered.

さらに、記憶媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張カードやコンピュータに接続された機能拡張ユニットに備わるメモリに書込まれた後、そのプログラムコードの指示に基づき、その機能拡張カードや機能拡張ユニットに備わるＣＰＵなどが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Further, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the function is executed based on the instruction of the program code. It goes without saying that the CPU included in the expansion card or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

本発明を上記記憶媒体に適用する場合、その記憶媒体には、先に説明したフローチャートに対応するプログラムコードが格納されることになる。 When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the flowcharts described above.

本発明の実施の形態における動画像符号化装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a video encoding device according to an embodiment of the present invention. ２次元離散ウェーブレット変換によって処理される符号化対象画像のサブバンドを説明するための図である。FIG. 3 is a diagram for explaining subbands of an encoding target image processed by two-dimensional discrete wavelet transform. ２回の２次元離散ウェーブレット変換によって得られる７つのサブバンドを示す図である。FIG. 11 is a diagram illustrating seven subbands obtained by two two-dimensional discrete wavelet transforms. ビットプレーン符号化部でサブバンドＳｂを符号化する処理手順を説明するためのフローチャートである。35 is a flowchart for describing a processing procedure of encoding a subband Sb in a bit plane encoding unit. 符号列形成部において生成される１フレーム分の動画像符号化データに対応する符号列の細部構造を示す図である。FIG. 3 is a diagram illustrating a detailed structure of a code string corresponding to one frame of encoded moving image data generated by the code string forming unit. 二次記憶装置に格納される各フレームの符号列の一例を示す図である。FIG. 3 is a diagram illustrating an example of a code string of each frame stored in a secondary storage device. 本発明の第１、第３の実施形態に係る動画像復号装置の構成を示すブロック図である。It is a block diagram showing the composition of the video decoding device concerning a 1st, 3rd embodiment of the present invention. 本発明の第１の実施形態に係るＱファクタと各サブバンドの非復号ビットプレーンＮＤ（Ｓｂ）、まは、Ｑファクタと各サブバンドの非復号パス数ＮＤＰ（Ｓｂ）の関係を示す図である。FIG. 7 is a diagram illustrating a relationship between a Q factor and a non-decoding bit plane ND (Sb) of each subband or a Q factor and the number of non-decoding paths NDP (Sb) of each subband according to the first embodiment of the present invention. is there. 本発明の第１、第３の実施形態に係る動画像復号装置の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of the video decoding device which concerns on the 1st, 3rd embodiment of this invention. 本発明の第２の実施形態に係る動画像復号装置の構成を示すブロック図である。It is a block diagram showing the composition of the video decoding device concerning a 2nd embodiment of the present invention. 本発明の第２の実施形態に係る非復号ビットプレーン決定部に保持されるテーブルの例を示す図である。FIG. 14 is a diagram illustrating an example of a table held in a non-decoding bit plane determination unit according to the second embodiment of the present invention. 本発明の第２の実施形態に係る動画像復号装置の処理の流れを示すフローチャートである。It is a flow chart which shows a flow of processing of a video decoding device concerning a 2nd embodiment of the present invention. ステップＳ１１０７におけるＮＤ（Ｓｂ）テーブル、ＳＩの更新処理の流れを示すフローチャートである。It is a flowchart which shows the flow of update processing of ND (Sb) table and SI in step S1107. 本発明の第２の実施形態に係るサブバンドインデックスＳＩとサブバンドの対応を示す図である。FIG. 13 is a diagram illustrating a correspondence between a subband index SI and a subband according to the second embodiment of the present invention. 本発明の第３の実施形態に係る動画像復号装置で復号対象とする１フレーム分の動画像符号化データの構造を示す図である。FIG. 15 is a diagram illustrating a structure of encoded video data for one frame to be decoded by the video decoding device according to the third embodiment of the present invention. 動画像データ出力部の具体的構成例を示すブロック図である。FIG. 3 is a block diagram illustrating a specific configuration example of a moving image data output unit. 本発明の第４の実施形態に係る動画像復号装置の構成を示すブロック図である。It is a block diagram showing the composition of the video decoding device concerning a 4th embodiment of the present invention. 本発明の第４の実施形態に係る動画像復号装置の時系列での動作を示すタイムチャートである。15 is a time chart illustrating a time-series operation of the video decoding device according to the fourth embodiment of the present invention. 本発明の第４の実施形態に係る動画像復号装置の処理の流れを示すフローチャートである。It is a flow chart which shows a flow of processing of a video decoding device concerning a 4th embodiment of the present invention.

Explanation of reference numerals

１００動画像復号装置
１０１符号列読み出し部
１０２ビットプレーン復号部
１０４逆離散ウェーブレット変換部
１０５動画像データ出力部
１０６復号処理時間計測部
１０７非復号ビットプレーン決定部
２００動画像復号装置
２０１動画像データ入力部
２０２離散ウェーブレット変換部
２０３係数量子化部
２０４ビットプレーン符号化部
２０５符号列形成部
２０６２次記憶装置
２０７信号線
３００動画像復号装置
４００動画像復号装置
９０１係数逆量子化部
９０２逆離散ウェーブレット変換部
９０３非復号ビットプレーン決定部
９０４符号列読み出し部
１６０１バッファ
１６０２ディスプレイインターフェース
１６０３ディスプレイ
１７０１バッファ状態監視部
１７０２非復号ビットプレーン決定部 REFERENCE SIGNS LIST 100 moving image decoding device 101 code string reading unit 102 bit plane decoding unit 104 inverse discrete wavelet transform unit 105 moving image data output unit 106 decoding processing time measuring unit 107 non-decoding bit plane determining unit 200 moving image decoding device 201 moving image data input Unit 202 discrete wavelet transform unit 203 coefficient quantization unit 204 bit plane encoding unit 205 code string forming unit 206 secondary storage device 207 signal line 300 video decoding device 400 video decoding device 901 coefficient inverse quantization unit 902 inverse discrete wavelet Conversion section 903 Non-decoding bit plane determining section 904 Code string reading section 1601 Buffer 1602 Display interface 1603 Display 1701 Buffer state monitoring section 1702 Non-decoding bit plane determining section

Claims

Each frame of the moving image data is decomposed into a plurality of sub-bands, and the moving image generated by encoding the coefficients of the sub-bands from a higher-order bit to a lower-order bit for each predetermined unit in bit plane or sub-bit plane units A moving image decoding device that decodes encoded data,
Decoding processing time information obtaining means for obtaining information for determining the magnitude of the difference between the time allocated to the decoding processing of the video encoded data of the predetermined unit and the time required for the actual decoding processing,
Based on the information obtained by the decoding processing time information obtaining means, a non-decoding bit plane, or a non-decoding bit plane determining means for determining a sub-bit plane,
Bit plane determined by the non-decoding bit plane determining means, or bit plane decoding means for restoring sub-band coefficients in the predetermined unit from encoded data other than sub-bit planes,
A subband synthesizing unit that synthesizes coefficients of the plurality of subbands obtained by the bit plane decoding unit and generates frame data.

The decoding processing time information obtaining means obtains the decoding processing time required for the decoding processing of the encoded video data, and the non-decoding bit plane determining means obtains the decoding processing time based on the decoding processing time obtained from the decoding processing time information obtaining means. The moving picture decoding apparatus according to claim 1, wherein a bit plane or a sub bit plane not to be decoded is determined.

Further provided is a decoded frame data storage means for storing the decoded frame data,
The decoding processing time information obtaining means obtains the number of frames stored in the decoded frame data storage means, and the non-decoding bit plane determining means does not perform decoding based on the number of frames obtained from the decoding processing time information obtaining means. The moving picture decoding apparatus according to claim 1, wherein a bit plane or a sub-bit plane is determined.

The non-decoding bit plane determining unit holds a parameter indicating image quality, adjusts the parameter based on information obtained by the decoding processing time information obtaining unit, and sets a non-decoding bit plane or sub bit of each subband by the parameter. The moving picture decoding apparatus according to claim 1, wherein a plane is determined.

The non-decoding bit plane determining means holds a table for storing the number of bit planes not to be decoded for each sub-band, or the number of sub-bit planes, and is stored in the table in accordance with information obtained by the decoding processing time information obtaining means. 4. The moving picture decoding apparatus according to claim 1, wherein the number of bit planes or sub bit planes not decoded is increased or decreased.

The non-decoding bit plane determining means calculates a difference between a time allocated to the predetermined unit of moving image encoded data decoding processing and a decoding processing time obtained by the decoding processing information obtaining means, and calculates the calculated difference. 3. The moving picture decoding apparatus according to claim 2, wherein a bit plane or a sub-bit plane not to be decoded is determined based on a value obtained by accumulating.

The sub-band decomposition for generating the moving image encoded data is performed by a two-dimensional discrete wavelet transform, and the sub-band synthesizing unit synthesizes the frame data using a two-dimensional inverse discrete wavelet transform. The moving picture decoding device according to claim 1.

The moving picture decoding apparatus according to any one of claims 1 to 7, wherein the predetermined unit is a frame or a block obtained by dividing the frame into a plurality.

Each frame of the moving image data is decomposed into a plurality of sub-bands, and the moving image generated by encoding the coefficients of the sub-bands from a higher-order bit to a lower-order bit for each predetermined unit in bit plane or sub-bit plane units A moving image decoding method for decoding encoded data,
A decoding processing time information obtaining step of obtaining information for determining the magnitude of a difference between the time allocated to the decoding processing of the moving image encoded data of the predetermined unit and the time required for the actual decoding processing,
Based on the information obtained by the decoding processing time information obtaining step, a non-decoding bit plane, or a non-decoding bit plane determining step of determining a sub-bit plane,
A bit plane determined by the non-decoding bit plane determining step, or a bit plane decoding step of restoring subband coefficients in the predetermined unit from encoded data other than the sub bit plane,
A subband synthesizing step of synthesizing coefficients of the plurality of subbands obtained in the bitplane decoding step to generate frame data.

In the decoding processing time information obtaining step, a decoding processing time required for decoding the moving picture encoded data is obtained, and in the non-decoding bit plane determining step, based on the decoding processing time obtained in the decoding processing time information obtaining step. The moving picture decoding method according to claim 9, wherein a bit plane or a sub-bit plane not to be decoded is determined.

Further comprising a decoded frame data storing step for storing the decoded frame data,
In the decoding processing time information obtaining step, the number of frames stored in the decoded frame data storing step is obtained, and in the non-decoding bit plane determining step, decoding is not performed based on the number of frames obtained in the decoding processing time information obtaining step. The moving picture decoding method according to claim 9, wherein a bit plane or a sub-bit plane is determined.

In the non-decoding bit plane determining step, a parameter indicating image quality is managed, and the parameter is adjusted based on the information obtained in the decoding processing time information obtaining step. The moving picture decoding method according to claim 9, wherein a bit plane is determined.

In the non-decoding bit plane determining step, a table for storing the number of bit planes or sub bit planes that are not decoded for each subband is managed, and stored in the table according to the information obtained in the decoding processing time information obtaining step. 12. The moving picture decoding method according to claim 9, wherein the number of bit planes or sub-bit planes not decoded is increased or decreased.

In the non-decoding bit plane determining step, the difference between the time allocated to the decoding processing of the predetermined unit of moving image encoded data and the decoding processing time obtained in the decoding processing time information obtaining step is calculated and calculated. The moving picture decoding method according to claim 10, wherein a bit plane or a sub-bit plane not to be decoded is determined based on a value obtained by accumulating the differences.

The sub-band decomposition for generating the moving image encoded data is performed by two-dimensional discrete wavelet transform, and the sub-band synthesizing step synthesizes frame data using two-dimensional inverse discrete wavelet transform. The moving picture decoding method according to claim 9.

16. The video decoding method according to claim 9, wherein the predetermined unit is a frame or a block obtained by dividing the frame into a plurality.

9. A program executable by an information processing apparatus, the information processing apparatus executing the program functioning as the moving picture decoding apparatus according to claim 1.

A program executable by an information processing apparatus, comprising a program code for implementing the moving picture decoding method according to claim 9.

A storage medium readable by an information processing apparatus, storing the program according to claim 17.