JP2005159859A

JP2005159859A - Image encoding method

Info

Publication number: JP2005159859A
Application number: JP2003397535A
Authority: JP
Inventors: Masahiro Kawarada; 昌大瓦田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2003-11-27
Filing date: 2003-11-27
Publication date: 2005-06-16

Abstract

<P>PROBLEM TO BE SOLVED: To propose an image encoding method capable of obtaining a still image with high resolution in an arbitrary motion picture frame, preventing the motion picture from getting deteriorated, and reducing an amount of data to be recorded compared with conventional ones. <P>SOLUTION: A still image with high resolution is recorded for each frame. Hierarchical encoding is applied to the still image with high resolution to divide image data into sub bands. Reversible compression is applied to sub bands constituting the motion picture frame, and irreversible encoding is applied to the remaining sub bands. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、動画とそれより高解像度の静止画が混在して含まれている画像データを入力しフレーム内圧縮する画像符号化装置にあって、入力された画像に離散ウェーブレット変換を施す画像符号化方法に関する。 The present invention relates to an image encoding device that inputs image data including a mixture of a moving image and a still image having a higher resolution than the moving image, and compresses the input image within a frame, and an image code for performing discrete wavelet transform on the input image It relates to the conversion method.

近年、動画と静止画の両方を撮影する需要が高まっている。たとえば、動画の撮影を行うビデオカメラに、デジタルカメラのような静止画の撮影機能を持つ機種が登場している。このビデオカメラは、動画の撮影中において静止画用のレリーズが押下された場合、動画の１フレームを切り出して、切り出した静止画を動画とは異なる記録媒体に記録する。こういった静止画は、プリンタで印刷する用途があるため、通常、動画よりも高解像度で記録される。 In recent years, there is an increasing demand for shooting both moving images and still images. For example, a video camera that shoots a moving image has appeared that has a still image shooting function such as a digital camera. This video camera cuts out one frame of a moving image and records the cut out still image on a recording medium different from the moving image when a release for still image is pressed during shooting of the moving image. Since such still images have a purpose of printing with a printer, they are usually recorded at a higher resolution than a moving image.

一方、撮影された動画のデータと静止画のデータを別々のデータファイルで管理することは煩雑である。逆に言えば、動画データと静止画データとを１つのデータファイルで管理した方が、利便性が良い。従来は、動画のデータと静止画のデータを１つのデータファイルとして管理するために、動画データのみ管理し、その動画の１フレームから静止画を生成することが行われていた。 On the other hand, it is cumbersome to manage the captured moving image data and still image data in separate data files. In other words, it is more convenient to manage moving image data and still image data in one data file. Conventionally, in order to manage moving image data and still image data as one data file, only moving image data is managed and a still image is generated from one frame of the moving image.

ここで、記録される動画は、後々生じる静止画の切り出しを鑑みて、フレーム毎にまとまって記録されることが望ましい。記録される画像を圧縮符号化する場合、フレーム毎に圧縮処理を行うと、圧縮されたデータを１フレーム毎に切り出して復号することが容易にできる。 Here, it is desirable that the moving images to be recorded are recorded for each frame in consideration of the subsequent cutout of still images. When compressing and encoding an image to be recorded, if compression processing is performed for each frame, the compressed data can be easily cut out and decoded for each frame.

静止画あるいは動画の１フレームを圧縮符号化する画像符号化方式として、ＪＰＥＧ２０００がＩＳＯ／ＩＥＣで規格化されている（ＩＳＯ／ＩＥＣ１５４４４−１：非特許文献1参照。）。ＪＰＥＧ２０００は、解像度に関する階層符号化を行うため、高解像度の画像データと低解像度の画像データをまとめて符号化できる。またさらに、画像に可逆圧縮と非可逆圧縮のいずれかを共通のフォーマット内で選択できる。図７は可逆圧縮を行う場合におけるＪＰＥＧ２０００の符号化の処理手順を示したブロック図である。以下、ＪＰＥＧ２０００の可逆圧縮の処理について詳細に説明する。 As an image encoding method for compressing and encoding one frame of a still image or a moving image, JPEG2000 is standardized by ISO / IEC (see ISO / IEC 154444-1: Non-Patent Document 1). Since JPEG2000 performs hierarchical encoding related to resolution, high-resolution image data and low-resolution image data can be encoded together. Furthermore, either a reversible compression or an irreversible compression can be selected for the image within a common format. FIG. 7 is a block diagram showing a JPEG 2000 encoding process procedure when lossless compression is performed. The JPEG 2000 lossless compression process will be described in detail below.

ＪＰＥＧ２０００は、はじめに、入力画像の各コンポーネントに含まれる画素の値にＤＣレベルシフトという処理を行って、符号付きの整数に変換する。シフトする量は、画素値のビット精度から算出される。 JPEG2000 first converts a pixel value included in each component of an input image into a signed integer by performing a process called DC level shift. The shift amount is calculated from the bit accuracy of the pixel value.

ＤＣレベルシフトされた画像データは、コンポーネント変換を施され、輝度成分と色差成分に変換される。 The DC level shifted image data is subjected to component conversion and converted into a luminance component and a color difference component.

コンポーネント変換された画像データは、コンポーネント毎に２次元のウェーブレット変換を施される。ＪＰＥＧ２０００のウェーブレット変換には、リフティング演算が採用されている。リフティング演算とは、１次元に連なるデータにおいて両隣のデータの和に係数を掛けたものを中心の係数に加える演算を繰り返すことで、フィルタリングとサブサンプリングを同時に行う。ＪＰＥＧ２０００の規格に定められている可逆フィルタのリフティング演算（順変換）は、下記の漸化式で表される。 The component-converted image data is subjected to two-dimensional wavelet transform for each component. The JPEG2000 wavelet transform employs a lifting operation. In the lifting operation, filtering and sub-sampling are simultaneously performed by repeating an operation of adding the coefficient obtained by multiplying the sum of the adjacent data in the one-dimensional data to the coefficient at the center. The lifting calculation (forward conversion) of the reversible filter defined in the JPEG2000 standard is expressed by the following recurrence formula.

式１に示した漸化式は、整数の加減算、シフト算、Ｆｌｏｏｒ関数のみで構成されているため、小数演算を行わず、出力される係数は整数となる。よって、演算前の値と演算後の値との関係は可逆性を有する。リフティング演算の結果、画素の座標が偶数の場合は低周波成分、画素の座標が奇数の場合は高周波成分が表れる。ウェーブレット変換は、上記の漸近式に示されるリフティング演算を１次元方向に繰り返し行い、画像データを低周波成分と高周波成分とに分割する。ＪＰＥＧ２０００は、この演算を左端から右端に向かう横方向の画素ラインに順次適用し、さらに上端から下端に向かう縦方向の画素ライン毎に順次適用することで、画像データを２次元のサブバンドに分割する。図８は、ウェーブレット変換によって縦４画素横４画素の画像がサブバンドに分割される様子を示した模式図である。以下、図８を用いてウェーブレット変換の処理を説明する。 Since the recurrence formula shown in Formula 1 is composed only of integer addition / subtraction, shift calculation, and floor function, decimal calculation is not performed, and the output coefficient is an integer. Therefore, the relationship between the value before calculation and the value after calculation has reversibility. As a result of the lifting operation, a low frequency component appears when the pixel coordinates are even, and a high frequency component appears when the pixel coordinates are odd. In the wavelet transform, the lifting operation shown in the asymptotic formula is repeatedly performed in a one-dimensional direction to divide the image data into a low frequency component and a high frequency component. JPEG2000 divides the image data into two-dimensional subbands by sequentially applying this calculation to the pixel lines in the horizontal direction from the left end to the right end, and further sequentially for each pixel line in the vertical direction from the upper end to the lower end. To do. FIG. 8 is a schematic diagram showing a state in which an image of 4 pixels in the vertical direction and 4 pixels in the horizontal direction is divided into subbands by wavelet transform. Hereinafter, the wavelet transform process will be described with reference to FIG.

図８において、図中のＬ、Ｈは係数に含まれる周波数成分を表しており、Ｌは低周波成分、Ｈは高周波成分を表している。また、カッコ内に示された数字は、元画像における画素の座標を表しており、左側の値が上端から下端に向かう縦方向の座標、右側の値が左端から右端に向かう横方向の座標を表している。ウェーブレット変換は、はじめに上端から下端に向かう縦の画素ライン毎に左端から右端まで１次元のリフティング演算が行われる。このリフティング演算の結果を図８（ａ）に示す。リフティング演算の後、演算結果をＬ、Ｈの成分ごとにまとめる並び替えを行う。係数を並び替えた結果を図８（ｂ）に示す。続いて、左端から右端に向かう横の画素ライン毎に上端から下端まで１次元のリフティング演算を行う。この結果を図８（ｃ）に示す。続いて、縦の画素ライン毎の演算と同様に、演算結果をＬ、Ｈの成分ごとにまとめる並び替えを行う。この結果を図８（ｄ）に示す。 In FIG. 8, L and H in the figure represent frequency components included in the coefficients, L represents a low frequency component, and H represents a high frequency component. The numbers shown in parentheses represent the coordinates of the pixels in the original image. The left-hand value is the vertical coordinate from the upper end to the lower end, and the right-hand value is the horizontal coordinate from the left end to the right end. Represents. In the wavelet transform, a one-dimensional lifting operation is first performed from the left end to the right end for each vertical pixel line from the upper end to the lower end. The result of this lifting operation is shown in FIG. After the lifting calculation, rearrangement is performed to combine the calculation results for each of the L and H components. The result of rearranging the coefficients is shown in FIG. Subsequently, a one-dimensional lifting operation is performed from the upper end to the lower end for each horizontal pixel line from the left end to the right end. The result is shown in FIG. Subsequently, as in the calculation for each vertical pixel line, the calculation result is rearranged for each of the L and H components. The result is shown in FIG.

ウェーブレット変換によって生成されたサブバンド係数は、サブバンド毎に量子化ステップを設定して量子化できる。可逆圧縮の場合、全てのサブバンドの係数は、値１の量子化ステップで量子化される。このとき、整数となったサブバンド係数を値１の量子化ステップで量子化するので、量子化された係数は量子化前の係数と変わらない。よって、可逆変換の量子化処理は、事実上量子化しないことを意味する。 The subband coefficients generated by the wavelet transform can be quantized by setting a quantization step for each subband. In the case of lossless compression, all subband coefficients are quantized in a quantization step of value 1. At this time, since the subband coefficients that have become integers are quantized in the quantization step of value 1, the quantized coefficients are not different from the coefficients before quantization. Therefore, the reversible transformation quantization process means that the quantization is practically not performed.

量子化されたサブバンド係数は、さらにコードブロックに分割され、コードブロック単位に係数ビットモデリングを施される。まず、コードブロックについて、図９を用いて説明する。なお、図９では、１枚の画像を構成する４つのサブバンドに含まれるコードブロックについて説明している。 The quantized subband coefficients are further divided into code blocks, and coefficient bit modeling is performed for each code block. First, the code block will be described with reference to FIG. In FIG. 9, code blocks included in four subbands constituting one image are described.

コードブロックは、図９で示された４つのサブバンド（図中のＬＬサブバンド、ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の各左上端を原点に、サブバンド係数を横幅：２^ｘｃｂ、縦幅：２^ｙｃｂのブロックに分割する。ここでｘｃｂ、ｙｃｂは、２〜１０の整数で、ｘｃｂ＋ｙｃｂ≦１２を満たす任意の値である。 The code block has four subbands shown in FIG. 9 (LL subband, HL subband, LH subband, and HH subband) as the origin, and the subband coefficient is the horizontal width: 2 ^xcb The vertical width is divided into 2 ^ycb blocks. Here, xcb and ycb are integers of 2 to 10, and are arbitrary values satisfying xcb + ycb ≦ 12.

次に、係数ビットモデリングの処理について説明する。サブバンド係数を含む各コードブロックは、各係数を符号と絶対値に分けて処理され、その絶対値に含まれるビットについてビットプレーンごとに所定の順序で処理されていく。 Next, the coefficient bit modeling process will be described. Each code block including subband coefficients is processed by dividing each coefficient into a code and an absolute value, and the bits included in the absolute value are processed in a predetermined order for each bit plane.

図１０は、縦８係数横８係数のコードブロックが、各係数の正負の符号と、各係数の絶対値におけるＭＳＢからＬＳＢにかけての８つのビットプレーンとに分割されている様態を示している。係数ビットモデリングでは、ＭＳＢからＬＳＢにかけての全てのビットプレーンについて、プレーンごとに処理が進められる。 FIG. 10 shows a state in which a code block of 8 vertical coefficients and 8 horizontal coefficients is divided into positive and negative signs of each coefficient and 8 bit planes from MSB to LSB in the absolute value of each coefficient. In coefficient bit modeling, processing is advanced for each plane for all bit planes from the MSB to the LSB.

係数ビットモデリングでは、ビットプレーンに含まれる各ビットを、３つの処理パスを用いて、所定の順序で走査する。 In coefficient bit modeling, each bit included in a bit plane is scanned in a predetermined order using three processing passes.

係数ビットモデリングにおける３つの処理パスで処理されたビットは、全て算術符号化される。全てのビットが算術符号化されるため、係数ビットモデリング〜算術符号化にかけての処理は、データの損失が無く、可逆性が保たれる。 All the bits processed in the three processing passes in the coefficient bit modeling are arithmetically encoded. Since all bits are arithmetically encoded, the processing from coefficient bit modeling to arithmetic encoding has no loss of data and maintains reversibility.

算術符号化された符号は、符号の順序を整列して、解像度成分ごとに集められたり、画像内の小領域ごとに集められたり、あるいは画像のＭＳＢ〜ＬＳＢを複数のレイヤーに分割して個々に集められる。 Arithmetic-coded codes are collected for each resolution component by aligning the order of the codes, collected for each small area in the image, or divided into a plurality of layers by dividing the MSB to LSB of the image into multiple layers. To be collected.

以上のように、ＪＰＥＧ２０００の可逆圧縮は、一連の処理に用いられる演算の係数を整数のみに限定し、さらに事実上量子化を行わないようにすることで、処理の可逆性を保っている。 As described above, the lossless compression of JPEG2000 maintains the reversibility of processing by limiting the coefficients of operations used for a series of processing to only integers and further virtually not performing quantization.

また、ＪＰＥＧ２０００が規格化された後、ＭｏｔｉｏｎＪＰＥＧ２０００（ＪＰＥＧ２０００Ｐａｒｔ３）と呼ばれる連続画像の符号化方法がＩＳＯ／ＩＥＣにて規格化された（ＩＳＯ／ＩＥＣ１５４４４−３：非特許文献２参照。）。ＭｏｔｉｏｎＪＰＥＧ２０００は、ＪＰＥＧ２０００Ｐａｒｔ１の方式で圧縮符号化された個々の画像データに、連続画像の再生に必要な各種属性情報を付加して記録する規格である。
ＩＳＯ／ＩＥＣ−１５４４４−１：ＪＰＥＧ２０００規格書ｐａｒｔ−１ＩＳＯ／ＩＥＣ−１５４４４−３：ＪＰＥＧ２０００規格書ｐａｒｔ−３ After JPEG2000 was standardized, a continuous image encoding method called Motion JPEG2000 (JPEG2000 Part3) was standardized by ISO / IEC (see ISO / IEC 15444-3: Non-Patent Document 2). Motion JPEG2000 is a standard for recording by adding various attribute information necessary for continuous image reproduction to individual image data compressed and encoded by the JPEG2000 Part1 method.
ISO / IEC-15444-1: JPEG2000 standard part-1 ISO / IEC-15444-3: JPEG2000 standard part-3

従来、高解像度の静止画と低解像度の動画の両方を記録する場合、静止画と動画を別々の記録媒体あるいは記録領域に記録されていた。これでは、１ヶ所の撮影で動画とともに静止画を撮影しても、１つの記録媒体の中にまとまった１つのファイルとして記録されず、管理しづらかった。 Conventionally, when recording both a high-resolution still image and a low-resolution moving image, the still image and the moving image are recorded on different recording media or recording areas. In this case, even if a still image is shot together with a moving image by shooting at one place, it is not recorded as a single file on one recording medium, and is difficult to manage.

また、静止画と動画の両方を記録する場合、撮影される静止画は、撮影時に指定したタイミングのフレームのみを記録していた。このため、撮影された動画の再生時に、任意の動画フレームにおける高解像度の静止画を得ることができなかった。 Further, when recording both a still image and a moving image, only the frame at the timing specified at the time of shooting is recorded as the still image to be shot. For this reason, a high-resolution still image in an arbitrary moving image frame cannot be obtained when the captured moving image is played back.

撮影された動画の再生時に、任意の動画フレームにおける高解像度の静止画を得るためには、全てのフレームについて高解像度の静止画を記録しておき、低解像度の動画フレームについては、高解像度の静止画における低周波成分から抽出する方法がとられていた。 To obtain a high-resolution still image in any video frame when playing back the captured video, record a high-resolution still image for all frames, and record a high-resolution still image for a low-resolution video frame. A method of extracting from low frequency components in a still image has been taken.

しかし、全てのフレームに対して高解像度の静止画を記録すると、記録するデータ量が大きい。そこで従来、高解像度成分の静止画を圧縮して記録していた。ここで、高解像度成分の静止画を可逆圧縮すると、画質劣化は生じないが記録するデータ量はなお大きく、実用的でなかった。また、高解像度の静止画を非可逆圧縮すると、静止画に画質劣化が生じるだけでなく、静止画の低周波成分として抽出される動画フレームに画質劣化が生じていた。 However, if high-resolution still images are recorded for all frames, the amount of data to be recorded is large. Therefore, conventionally, still images with high resolution components have been compressed and recorded. Here, when a high-resolution component still image is reversibly compressed, the image quality does not deteriorate, but the amount of data to be recorded is still large and impractical. Further, when irreversible compression is performed on a high-resolution still image, not only image quality deterioration occurs in the still image but also image quality deterioration occurs in a moving image frame extracted as a low-frequency component of the still image.

本発明は、上記問題点を鑑みてなされたものであり、任意の動画フレームにおいて高解像度の静止画を得ることができ、かつ動画に劣化が生じず、記録するデータ量を従来と比べて少なくすることのできる画像符号化方法を提供することを目的としている。 The present invention has been made in view of the above-described problems. A high-resolution still image can be obtained in an arbitrary moving image frame, the moving image is not deteriorated, and the amount of data to be recorded is smaller than that in the past. An object of the present invention is to provide an image encoding method that can be used.

本発明は上記課題を解決するために、フレーム毎に高解像度の静止画を記録していき、高解像度の静止画に階層符号化を施して画像データをサブバンドに分割し、動画フレームを構成するサブバンドに可逆圧縮を施して、残るサブバンドに非可逆符号化を施すことを特徴とする。 In order to solve the above problems, the present invention records a high-resolution still image for each frame, applies hierarchical encoding to the high-resolution still image, divides the image data into subbands, and configures a moving image frame It is characterized in that lossless compression is performed on the subbands to be processed and lossy encoding is performed on the remaining subbands.

上記の構成を改めて以下（１）〜（７）に整理して示す。 The above configuration is newly organized and shown in the following (1) to (7).

（１）画像データを入力してフレーム内圧縮し符号化する画像符号化方法において、
入力された画像データを可逆演算でサブバンド分割し、
生成されたサブバンドのうち、一定以下の低周波成分を含むサブバンドに可逆圧縮を施し、
残るサブバンドに非可逆圧縮を施すことを特徴とする画像符号化方法。 (1) In an image encoding method for inputting image data and compressing and encoding it within a frame,
The input image data is subband divided by reversible calculation,
Of the generated subbands, apply reversible compression to subbands containing low frequency components below a certain level.
An image encoding method, characterized in that lossy compression is performed on the remaining subbands.

（２）上記（１）において、上記可逆演算は、整数のデータに整数の乗算係数とシフト演算及び丸め処理を用いることで、演算有効桁の打ち切り誤差を防止することを特徴とする画像符号化方法。 (2) In the above (1), the reversible calculation uses integer multiplication coefficients, shift calculation, and rounding processing for integer data, thereby preventing an error in truncation of significant calculation digits. Method.

（３）上記（２）において、生成されたサブバンドを値１の量子化ステップで量子化し、さらに、上記非可逆圧縮の処理は、前記残るサブバンドの係数ビットについて、一定の下位ビットを打ち切ることを特徴とする画像符号化方法。 (3) In the above (2), the generated subband is quantized in a quantization step of value 1, and the lossy compression process cuts off certain lower bits of the remaining subband coefficient bits. An image encoding method characterized by the above.

（４）上記（３）において、上記画像符号化方法は、ＪＰＥＧ２０００方式であることを特徴とする画像符号化方法。 (4) The image encoding method according to (3), wherein the image encoding method is a JPEG2000 system.

（５）上記（２）において、上記可逆圧縮の処理は、上記一定以下の低周波成分を含むサブバンドについて、値１の量子化ステップで量子化し、上記非可逆圧縮の処理は、上記残るサブバンドについて、１を超える量子化ステップで量子化することを特徴とする画像符号化方法。 (5) In the above (2), the lossless compression process quantizes the subband including the low frequency component below the predetermined value in the quantization step of value 1, and the lossy compression process includes the remaining subbands. An image encoding method, wherein a band is quantized with more than one quantization step.

（６）上記（５）において、上記量子化ステップを、画像データの符号化で出力されるデータファイルのファイルヘッダーに記録することを特徴とする画像符号化方法。 (6) The image encoding method according to (5), wherein the quantization step is recorded in a file header of a data file output by encoding image data.

（７）上記（６）において、上記画像符号化方法は、ＪＰＥＧ２０００方式であることを特徴とする画像符号化方法。 (7) The image encoding method according to (6), wherein the image encoding method is a JPEG2000 system.

本発明によれば、高解像度の静止画と低解像度の動画の両方を記録する場合、フレーム毎に高解像度の静止画を記録していき、高解像度の静止画に階層符号化を施して画像データをサブバンドに分割し、動画フレームを構成するサブバンドに可逆圧縮を施して、残るサブバンドに非可逆符号化を施すことで、任意の動画フレームにおいて高解像度の静止画を得ることができ、かつ動画に劣化が生じず、静止画全体を可逆圧縮する場合より記録するデータ量を少なくすることができる。また、本発明の実施形態で記録された画像データは、ＪＰＥＧ２０００方式に準拠した全ての復号器で復号することができる。 According to the present invention, when recording both a high-resolution still image and a low-resolution moving image, the high-resolution still image is recorded for each frame, and the high-resolution still image is subjected to hierarchical encoding. By dividing the data into subbands, performing lossless compression on the subbands that make up the video frame, and irreversible encoding on the remaining subbands, a high-resolution still image can be obtained in any video frame. In addition, the moving image is not deteriorated and the amount of data to be recorded can be reduced as compared with the case where the entire still image is reversibly compressed. Further, the image data recorded in the embodiment of the present invention can be decoded by all decoders compliant with the JPEG2000 system.

以下、本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail.

本発明の第１の実施形態に係る画像符号化方法は、ＪＰＥＧ２０００方式に準拠して、入力された画像にウェーブレット変換を施すことでサブバンドに分割し、各サブバンドを量子化して、係数ビットモデリングを行い、係数ビットモデリングの際に動画フレームの復号に不要となるサブバンドの下位ビットを切り捨てて、残るビットを算術符号化するものである。 The image encoding method according to the first embodiment of the present invention divides into subbands by applying wavelet transform to an input image in accordance with the JPEG2000 system, quantizes each subband, and generates coefficient bits. Modeling is performed, and the lower bits of the subband that are not required for decoding of the moving image frame are discarded during coefficient bit modeling, and the remaining bits are arithmetically encoded.

図１は、本発明の第１の実施形態に係る処理手順を説明するためのブロック図である。なお、本実施形態では、動画フレームの画像サイズに対して縦横各４倍の画像サイズである高解像度の静止画を入力すると仮定した説明を行う。以下、高解像度の静止画データが入力された場合における本発明の第２の実施形態に係る処理手順について説明する。 FIG. 1 is a block diagram for explaining a processing procedure according to the first embodiment of the present invention. In the present embodiment, description will be made on the assumption that a high-resolution still image having an image size that is four times each of the vertical and horizontal directions with respect to the image size of the moving image frame is input. The processing procedure according to the second embodiment of the present invention when high-resolution still image data is input will be described below.

入力された画像データは、はじめにＪＰＥＧ２０００方式のＤＣレベルシフトとコンポーネント変換を施される。 The input image data is first subjected to JPEG2000 DC level shift and component conversion.

コンポーネント変換された画像データは、ＪＰＥＧ２０００の５×３フィルタで２次元のウェーブレット変換を施されて、複数のサブバンドに分割される。図２は、画像データが２次元のウェーブレット変換により複数のサブバンドに分割された様態を示す模式図である。図２より、入力された高解像度の静止画が複数のサブバンドに分割されて、低解像度の動画フレームに必要な２ＬＬサブバンドと、高解像度の静止画にのみ必要な１ＨＬ、１ＬＨ、１ＨＨ、２ＨＬ、２ＬＨ、２ＨＨの各サブバンドとに分けられる。 The component-converted image data is subjected to two-dimensional wavelet transform using a JPEG2000 5 × 3 filter, and is divided into a plurality of subbands. FIG. 2 is a schematic diagram showing a state in which image data is divided into a plurality of subbands by two-dimensional wavelet transform. As shown in FIG. 2, the input high-resolution still image is divided into a plurality of sub-bands, and the 2LL sub-band necessary for the low-resolution moving image frame and the 1HL, 1LH, 1HH, It is divided into 2HL, 2LH and 2HH subbands.

生成された全てのサブバンドは、本実施形態においては値１の量子化ステップで量子化される。なお、この処理は、事実上量子化しないことを意味している。 All the generated subbands are quantized in a quantization step of value 1 in this embodiment. Note that this processing effectively means that no quantization is performed.

量子化されたサブバンド係数は、コードブロックに分割され、コードブロック単位にＪＰＥＧ２０００方式の係数ビットモデリングを施される。 The quantized subband coefficients are divided into code blocks, and JPEG2000-based coefficient bit modeling is performed on a code block basis.

係数ビットモデリングされた係数ビットのうち、動画フレームの再生に必要でないサブバンドに含まれていた係数ビットについて、許容する画質劣化レベルに応じて下位ビットを切り捨てる。特定のコードブロックに含まれる下位ビットを切り捨てることで、特定のサブバンドに非可逆圧縮を施し、データ量を削減できる。 Of the coefficient bits that have been coefficient bit-modeled, the lower-order bits of the coefficient bits included in the subbands that are not necessary for the reproduction of the moving image frame are discarded according to the allowable image quality degradation level. By truncating the lower bits included in a specific code block, lossy compression can be performed on a specific subband, and the amount of data can be reduced.

図３は、画像全体の係数ビットのうち、下位ビットの打ち切りを行う係数ビットを含むサブバンドと、下位ビットの打ち切りを行わない係数ビットを含むサブバンドとを図示した模式図である。図３において、動画フレームの再生に必要となる２ＬＬサブバンドに含まれていた係数ビットには下位ビットの打ち切りを行わず、他のサブバンド（１ＨＬ、１ＬＨ、１ＨＨ、２ＨＬ、２ＬＨ、２ＨＨの各サブバンド）に含まれていた係数ビットには下位ビットの打ち切りを行うことを示している。 FIG. 3 is a schematic diagram illustrating a subband including a coefficient bit for performing lower bit truncation and a subband including a coefficient bit for not performing lower bit truncation among coefficient bits of the entire image. In FIG. 3, the coefficient bits included in the 2LL subbands necessary for the reproduction of the moving image frame are not truncated, and the other subbands (1HL, 1LH, 1HH, 2HL, 2LH, 2HH) This indicates that the lower order bits are cut off for the coefficient bits included in the (subband).

残るビットには、ＪＰＥＧ２０００方式の算術符号化が施されて、出力される。 The remaining bits are subjected to JPEG2000 arithmetic coding and output.

高解像度の静止画を動画のフレーム数だけ入力し、それぞれの画像に上述した画像処理を行うことで、可逆圧縮された低解像度の動画フレームを記録しつつ、非可逆圧縮された高解像度の静止画を記録することができ、高解像度の静止画全体を可逆圧縮することと比べて少ないデータ量で記録することができる。 By inputting the number of high-resolution still images for the number of frames of the moving image and performing the above-described image processing on each image, the low-resolution moving image frames that are losslessly compressed are recorded and the high-resolution still images that are irreversibly compressed are recorded. The image can be recorded, and the entire high-resolution still image can be recorded with a smaller amount of data compared to reversible compression.

本発明の第２の実施形態に係る画像符号化方法は、入力された画像にウェーブレット変換を施すことでサブバンドに分割し、動画フレームの復号に必要となるサブバンドに値１の量子化ステップで量子化し、他のサブバンドに１以外の値の量子化ステップで量子化し、算術符号化するものである。図４は、本発明の第２の実施形態に係る処理手順を説明するためのブロック図である。なお、本実施形態では、動画フレームの画像サイズに対して縦横各４倍の画像サイズである高解像度の静止画を入力すると仮定した説明を行う。以下、高解像度の静止画が入力された場合における本発明の第２の実施形態に係る処理手順について説明する。 The image encoding method according to the second embodiment of the present invention performs a wavelet transform on an input image to divide it into subbands, and a quantization step with a value of 1 in the subbands necessary for decoding a moving image frame Is quantized by a quantization step of a value other than 1 in other subbands, and arithmetically encoded. FIG. 4 is a block diagram for explaining a processing procedure according to the second embodiment of the present invention. In the present embodiment, description will be made on the assumption that a high-resolution still image having an image size that is four times each of the vertical and horizontal directions with respect to the image size of the moving image frame is input. The processing procedure according to the second embodiment of the present invention when a high-resolution still image is input will be described below.

コンポーネント変換された画像データは、ＪＰＥＧ２０００の５×３フィルタで２次元のウェーブレット変換を施されて、複数のサブバンドに分割される。分割されたサブバンドは、第１の実施形態で説明した、図２に示すサブバンドの様態と同様である。 The component-converted image data is subjected to two-dimensional wavelet transform using a JPEG2000 5 × 3 filter, and is divided into a plurality of subbands. The divided subbands are the same as the subband modes illustrated in FIG. 2 described in the first embodiment.

本実施形態では、生成されたサブバンドのうち、動画フレームの再生に必要となるサブバンドは値１の量子化ステップで量子化し、他のサブバンドに１以上の値の量子化ステップで量子化する。サブバンド毎に設定する量子化ステップの一例を図５に示す。また、この量子化の様態を図６に示す。図６において、動画フレームの再生に必要となる２ＬＬサブバンドに含まれていた係数には、値１の量子化ステップで量子化し、他のサブバンド（１ＨＬ、１ＬＨ、１ＨＨ、２ＨＬ、２ＬＨ、２ＨＨの各サブバンド）に含まれていた係数には、１を超える値の量子化ステップで量子化することを示している。量子化に用いた量子化ステップは、ファイルヘッダーに記録される。 In the present embodiment, of the generated subbands, the subbands necessary for playback of the moving image frame are quantized by a quantization step having a value of 1, and the other subbands are quantized by a quantization step having a value of 1 or more. To do. An example of the quantization step set for each subband is shown in FIG. Further, this quantization mode is shown in FIG. In FIG. 6, the coefficients included in the 2LL subbands necessary for the reproduction of the moving image frame are quantized in the quantization step of value 1, and the other subbands (1HL, 1LH, 1HH, 2HL, 2LH, 2HH) are quantized. The coefficient included in each subband of FIG. 4 indicates that quantization is performed in a quantization step having a value exceeding 1. The quantization step used for quantization is recorded in the file header.

量子化されたサブバンド係数は、コードブロックに分割され、コードブロック単位にＪＰＥＧ２０００方式の係数ビットモデリングと算術符号化を施されて、出力される。なお、本実施形態では、動画フレームの再生に用いる２ＬＬ以外のサブバンドに対して下位ビットの打ち切りを行うか否かは任意である。 The quantized subband coefficients are divided into code blocks, subjected to JPEG2000 coefficient bit modeling and arithmetic coding for each code block, and output. In the present embodiment, it is arbitrary whether or not to perform lower bit truncation for subbands other than 2LL used for reproduction of moving image frames.

本発明の第１の実施形態に係る画像符号化方法の処理手順を示すブロック図The block diagram which shows the process sequence of the image coding method which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係る２次元のウェーブレット変換により複数のサブバンドに分割された様態を示す模式図The schematic diagram which shows the aspect divided | segmented into the several subband by the two-dimensional wavelet transformation which concerns on the 1st Embodiment of this invention 下位ビットの打ち切りを行う係数ビットを含むサブバンドと、下位ビットの打ち切りを行わない係数ビットを含むサブバンドとを図示した、本発明の第１の実施形態に係る模式図Schematic diagram according to the first embodiment of the present invention, illustrating a subband including coefficient bits that perform truncation of lower bits and a subband that includes coefficient bits that do not perform truncation of lower bits. 本発明の第２の実施形態に係る画像符号化方法の処理手順を示すブロック図The block diagram which shows the process sequence of the image coding method which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係るサブバンド毎に設定する量子化ステップの一例を表で示す図The figure which shows an example of the quantization step set for every subband which concerns on the 2nd Embodiment of this invention with a table | surface 本発明の第２の実施形態に係る各サブバンドの量子化を説明するための模式図Schematic diagram for explaining quantization of each subband according to the second embodiment of the present invention. 可逆圧縮を行う場合におけるＪＰＥＧ２０００の符号化の処理手順を示したブロック図The block diagram which showed the processing procedure of the encoding of JPEG2000 in the case of performing lossless compression ウェーブレット変換によって縦４画素横４画素の画像がサブバンドに分割される様子を示した模式図Schematic diagram showing how an image of 4 pixels vertically and 4 pixels horizontally is divided into subbands by wavelet transform サブバンド係数がコードブロックに分割される様子を示した模式図Schematic showing how subband coefficients are divided into code blocks コードブロックに含まれる係数のビットがどのように処理されていくかを説明するための模式図Schematic diagram for explaining how the bits of the coefficients included in the code block are processed

Claims

In an image encoding method for inputting image data and compressing and encoding it in a frame,
The input image data is subband divided by reversible calculation,
Of the generated subbands, apply reversible compression to subbands containing low frequency components below a certain level.
An image encoding method, characterized in that lossy compression is performed on the remaining subbands.

2. The image encoding method according to claim 1, wherein the lossless calculation uses an integer multiplication coefficient, a shift calculation, and a rounding process for integer data, thereby preventing a truncation error of a calculation effective digit.

3. The generated subband is quantized by a quantization step of value 1, and the lossy compression process is characterized by truncating certain lower bits of the remaining subband coefficient bits. An image encoding method to be performed.

4. The image encoding method according to claim 3, wherein the image encoding method is a JPEG2000 system.

3. The lossless compression process according to claim 2, wherein the lossless compression process is performed by quantizing a subband including a low frequency component equal to or less than a certain value in a quantization step of 1, and the lossy compression process is performed on the remaining subband by 1 An image encoding method characterized by performing quantization with a quantization step exceeding 1.

6. The image encoding method according to claim 5, wherein the quantization step is recorded in a file header of a data file output by encoding image data.

7. The image encoding method according to claim 6, wherein the image encoding method is a JPEG2000 system.