JP4449400B2

JP4449400B2 - Image encoding apparatus and method, program, and recording medium

Info

Publication number: JP4449400B2
Application number: JP2003351392A
Authority: JP
Inventors: 隆浩福原; 青司木村; 淳哉荒木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-10-25
Filing date: 2003-10-09
Publication date: 2010-04-14
Anticipated expiration: 2023-10-09
Also published as: JP2004166254A

Description

本発明は、例えばＪＰＥＧ−２０００方式のように、ウェーブレット変換とエントロピー符号化とにより画像を圧縮する画像符号化装置及びその方法、並びに画像符号化処理をコンピュータに実行させるプログラム及びそのプログラムが記録された記録媒体に関する。 The present invention records, for example, an image encoding apparatus and method for compressing an image by wavelet transform and entropy encoding as in the JPEG-2000 system, a program for causing a computer to execute image encoding processing, and the program. The present invention relates to a recording medium.

従来の代表的な画像圧縮方式として、ＩＳＯ（International Standards Organization）によって標準化されたＪＰＥＧ（Joint Photographic Experts Group）方式がある。これは、離散コサイン変換（DCT:Descrete Cosine Transform）を用い、比較的高いビットが割り当てられる場合には、良好な符号化画像及び復号画像を供することが知られている。しかし、ある程度以上に符号化ビット数を少なくすると、ＤＣＴ特有のブロック歪みが顕著になり、主観的に劣化が目立つようになる。 As a conventional typical image compression method, there is a JPEG (Joint Photographic Experts Group) method standardized by ISO (International Standards Organization). This is known to use a discrete cosine transform (DCT) and provide a good encoded image and decoded image when relatively high bits are assigned. However, if the number of encoded bits is reduced to a certain degree, block distortion peculiar to DCT becomes remarkable, and deterioration becomes conspicuous subjectively.

一方、近年では画像をフィルタバンクと呼ばれるハイパス・フィルタとローパス・フィルタとを組み合わせたフィルタによって複数の帯域に分割し、各帯域毎に符号化を行う方式の研究が盛んになっている。その中でも、ウェーブレット変換符号化は、ＤＣＴのように高圧縮でブロック歪みが顕著になるという欠点がないことから、ＤＣＴに代わる新たな技術として有力視されている。 On the other hand, in recent years, research on a method in which an image is divided into a plurality of bands by a filter combining a high-pass filter and a low-pass filter called a filter bank and coding is performed for each band has been actively conducted. Among them, wavelet transform coding is regarded as a promising new technology to replace DCT because it does not have the disadvantage that block distortion becomes remarkable due to high compression unlike DCT.

２００１年１月に国際標準化が完了したＪＰＥＧ−２０００方式は、このウェーブレット変換に高能率なエントロピー符号化（ビットプレーン単位のビット・モデリングと算術符号化）を組み合わせた方式を採用しており、ＪＰＥＧに比べて符号化効率の大きな改善を実現している。 The JPEG-2000 system, whose international standardization was completed in January 2001, employs a system that combines this wavelet transform with highly efficient entropy coding (bit modeling and arithmetic coding in units of bit planes). Compared to this, the coding efficiency is greatly improved.

これらの国際規格ではデコーダ側の規格のみが定められており、エンコーダ側は自由に設計することができる。その反面、目標の圧縮率を実現する効果的なレート制御手法についての規格が存在しないため、ノウハウの確立が何よりも重要になる。 In these international standards, only the standard on the decoder side is defined, and the encoder side can be designed freely. On the other hand, since there is no standard for an effective rate control method that achieves the target compression rate, the establishment of know-how is most important.

特にＪＰＥＧ方式では、このレート制御が困難であり、目標値を得るまでに複数回の符号化を施す必要も多々あった。しかしながら、これは処理時間の増大に繋がるため、ＪＰＥＧ−２０００方式では、１度の符号化で目標の符号量を得ることが望まれている。 In particular, in the JPEG method, this rate control is difficult, and it is often necessary to perform encoding a plurality of times before obtaining the target value. However, since this leads to an increase in processing time, in the JPEG-2000 system, it is desired to obtain a target code amount by one encoding.

ここで、ＪＰＥＧ−２０００方式では、ＲＤ（Rate-Distortion）特性を利用したレート制御手法が一般的に用いられているが、このレート制御手法は、汎用性があるものの非常に計算負荷が高いという欠点があった。 Here, in the JPEG-2000 system, a rate control method using RD (Rate-Distortion) characteristics is generally used. However, although this rate control method is versatile, it has a very high computational load. There were drawbacks.

そこで本件出願人は、以下の特許文献１において、一旦生成した符号化コードストリームを後尾から切り捨てることでレート制御を行う技術を提案している。この技術によれば、目標符号量に正確に合わせた制御が可能になる。 Therefore, the present applicant has proposed a technique for performing rate control by truncating a once generated encoded code stream from the tail in Patent Document 1 below. According to this technique, it is possible to perform control in accordance with the target code amount accurately.

特開２００２−１６５０９８号公報JP 2002-165098 A

しかしながら、この特許文献１の技術では、画質に与える影響が大きい符号ブロックが符号化コードストリームの後尾にあった場合には、それらが切り捨てられることで画質劣化が生じる虞があるため、さらに効果的なレート制御手法が望まれている。 However, in the technique of Patent Document 1, when there are code blocks that have a large influence on image quality at the end of the encoded code stream, they may be discarded, resulting in image quality deterioration, which is more effective. A rate control technique is desired.

また、ＪＰＥＧ−２０００方式では、一般に負荷が高いエントロピー符号化の処理の軽減手法についての規格も存在しないため、ノウハウの確立が何よりも重要になる。 In addition, in the JPEG-2000 system, there is no standard for a method for reducing entropy coding processing, which is generally a heavy load, and thus establishment of know-how is most important.

さらに、ＪＰＥＧ−２０００方式は、静止画の規格であり、動画像では頻繁に存在するインタレース画像に対する十分な検討がなされていない。したがって、静止画用の技術をそのまま動画像に応用すると、画質劣化が目立ってしまうという欠点が露呈する。 Furthermore, the JPEG-2000 system is a standard for still images, and sufficient consideration has not been made for interlaced images that frequently exist in moving images. Therefore, if the technology for still images is applied to a moving image as it is, the disadvantage that image quality deterioration becomes conspicuous is exposed.

本発明は、このような従来の実情に鑑みて提案されたものであり、例えばＪＰＥＧ−２０００方式の画像符号化装置において、算術符号化の処理負荷を軽減すると共に、画質劣化を抑えて効果的にレート制御を行う画像符号化装置及びその方法、並びに画像符号化処理をコンピュータに実行させるプログラム及びそのプログラムが記録されたコンピュータ読み取り可能な記録媒体を提供することを目的とする。 The present invention has been proposed in view of such conventional circumstances. For example, in a JPEG-2000 image encoding apparatus, the processing load of arithmetic encoding is reduced and image quality deterioration is effectively suppressed. Another object of the present invention is to provide an image encoding apparatus and method for performing rate control, a program for causing a computer to execute image encoding processing, and a computer-readable recording medium on which the program is recorded.

上述した目的を達成するために、本発明に係る画像符号化装置は、入力画像に対して、低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成し、低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング手段と、上記フィルタリング手段によって生成されたサブバンドを分割し、所定の大きさの符号ブロックを生成する符号ブロック生成手段と、上記符号ブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成手段と、符号化対象となる符号化対象数を予測して符号化対象数情報を生成する符号化対象予測手段と、上記符号化対象予測手段によって生成された上記符号化対象のうち、各符号ブロックの最上位ビット側から上記符号化対象数情報で与えられる符号化対象についてのみ、符号化を行う符号化手段とを備え、上記符号化対象予測手段は、上記入力画像のフレーム内の全ての符号ブロックについてゼロビットプレーンを除く有効ビットプレーン数を計数し、当該計数結果に基づいて所定のテーブルを参照することで、フレーム毎に上記符号化対象となる上記符号化対象数を求める。 In order to achieve the above-described object, an image coding apparatus according to the present invention generates a subband by applying a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction, and generates a low-band component. and facilities to filtering means hierarchically filtering processing on subbands divides the sub-band generated by said filtering means, a code block generating means for generating a code block of a predetermined size, the code block A bit plane generating means for generating a bit plane from the most significant bit to the least significant bit as a unit, an encoding target prediction means for predicting the encoding target number to be encoded and generating encoding target number information , among the encoding target generated by the encoding target prediction unit, the encoding target number information from the most significant bit side of each code block For given coded only encode a row Cormorant encoding means, the encoding target prediction unit, the number of effective bit-planes except the zero bit planes for all code blocks in a frame of the input image By counting and referring to a predetermined table based on the counting result, the number of encoding objects to be encoded is obtained for each frame.

また、上述した目的を達成するために、本発明に係る画像符号化方法は、入力画像に対して、低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成し、低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング工程と、上記フィルタリング工程にて生成されたサブバンドを分割し、所定の大きさの符号ブロックを生成する符号ブロック生成工程と、上記符号ブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成工程と、符号化対象となる符号化対象数を予測して符号化対象数情報を生成する符号化対象予測工程と、上記符号化対象予測工程にて生成された上記符号化対象のうち、各符号ブロックの最上位ビット側から上記符号化対象数情報で与えられる上記符号化対象数についてのみ、符号化を行う符号化工程とを有し、上記符号化対象予測工程では、上記入力画像のフレーム内の全ての符号ブロックについてゼロビットプレーンを除く有効ビットプレーン数を計数し、当該計数結果に基づいて所定のテーブルを参照することで、フレーム毎に上記符号化対象となる上記符号化対象数を求める。 In order to achieve the above-described object, the image coding method according to the present invention applies a low-pass filter and a high-pass filter to an input image in the vertical direction and the horizontal direction to generate subbands, A filtering process for hierarchically filtering the subbands of the band components, a code block generating process for generating a code block of a predetermined size by dividing the subband generated in the filtering process, and A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit for each code block, and an encoding target prediction step for generating the encoding target number information by predicting the encoding target number to be encoded And among the encoding targets generated in the encoding target prediction step, the encoding target number information from the most significant bit side of each code block Erareru the encoding for the subject count only possess an encoding step of performing encoding, in the coded prediction step, the effective bit planes excluding the zero bit planes for all code blocks in a frame of the input image By counting the number and referring to a predetermined table based on the counting result, the number of encoding objects to be encoded is obtained for each frame.

また、上述した目的を達成するために、本発明に係る画像符号化装置は、入力画像に対して、低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成し、低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング手段と、上記フィルタリング手段によって生成されたサブバンドを分割し、所定の大きさの符号ブロックを生成する符号ブロック生成手段と、上記符号ブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成手段と、符号化対象となる符号化対象数を予測して符号化対象数情報を生成する符号化対象予測手段と、上記符号化対象予測手段によって生成された上記符号化対象のうち、各符号ブロックの最上位ビット側から上記符号化対象数情報で与えられる符号化対象についてのみ、符号化を行う符号化手段とを備え、上記符号化対象予測手段は、上記入力画像のサブバンド内の全ての符号ブロックについてゼロビットプレーンを除く有効ビットプレーン数を計数し、当該計数結果に基づいて所定のテーブルを参照することで、サブバンド毎に符号化対象となる上記符号化対象数を求める。In order to achieve the above-described object, the image coding apparatus according to the present invention generates a subband by applying a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction to generate a subband. Filtering means for hierarchically filtering the subbands of the band components; code block generating means for dividing the subband generated by the filtering means to generate a code block of a predetermined size; and the code A bit plane generating means for generating a bit plane from the most significant bit to the least significant bit in block units, and an encoding target prediction means for predicting the number of encoding targets to be encoded and generating encoding target number information Of the encoding targets generated by the encoding target prediction means, the encoding targets from the most significant bit side of each code block An encoding unit that performs encoding only for an encoding target given by information, and the encoding target prediction unit includes effective bit planes excluding zero bit planes for all code blocks in a subband of the input image. By counting the number and referring to a predetermined table based on the counting result, the number of encoding targets to be encoded is obtained for each subband.

また、上述した目的を達成するために、本発明に係る画像符号化方法は、入力画像に対して、低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成し、低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング工程と、上記フィルタリング工程にて生成されたサブバンドを分割し、所定の大きさの符号ブロックを生成する符号ブロック生成工程と、上記符号ブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成工程と、符号化対象となる符号化対象数を予測して符号化対象数情報を生成する符号化対象予測工程と、上記符号化対象予測工程にて生成された上記符号化対象のうち、各符号ブロックの最上位ビット側から上記符号化対象数情報で与えられる上記符号化対象数についてのみ、符号化を行う符号化工程とを有し、上記符号化対象予測工程では、上記入力画像のサブバンド内の全ての符号ブロックについてゼロビットプレーンを除く有効ビットプレーン数を計数し、当該計数結果に基づいて所定のテーブルを参照することで、サブバンド毎に符号化対象となる上記符号化対象数を求める。In order to achieve the above-described object, the image coding method according to the present invention applies a low-pass filter and a high-pass filter to an input image in the vertical direction and the horizontal direction to generate subbands, A filtering process for hierarchically filtering the subbands of the band components, a code block generating process for generating a code block of a predetermined size by dividing the subband generated in the filtering process, and A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit for each code block, and an encoding target prediction step for generating the encoding target number information by predicting the encoding target number to be encoded And the encoding target number information from the most significant bit side of each code block among the encoding targets generated in the encoding target prediction step. The encoding step for encoding only the number of encoding targets obtained, and in the encoding target prediction step, the effective bits excluding zero bit planes for all the code blocks in the subband of the input image By counting the number of planes and referring to a predetermined table based on the counting result, the number of encoding targets to be encoded is obtained for each subband.

また、本発明に係るプログラムは、上述した画像符号化処理をコンピュータに実行させるものであり、本発明に係る記録媒体は、そのようなプログラムが記録されたコンピュータ読み取り可能なものである。 A program according to the present invention causes a computer to execute the above-described image encoding process, and a recording medium according to the present invention is a computer-readable medium on which such a program is recorded.

本発明に係る画像符号化装置及びその方法によれば、符号化対象となる符号化対象数を予測して符号化対象数情報を生成し、符号化対象のうち、各符号ブロックの最上位ビット側から符号化対象数情報で与えられる符号化対象についてのみ符号化を行うことにより、符号化の処理負荷を軽減することができる。 According to the image encoding device and the method thereof according to the present invention, the encoding target number information is generated by predicting the encoding target number to be encoded, and the most significant bit of each code block among the encoding targets By performing the encoding only on the encoding target given by the encoding target number information from the side, the processing load of the encoding can be reduced.

また、本発明に係るプログラム及び記録媒体によれば、上述した画像符号化処理をソフトウェアにより実現することができる。 Moreover, according to the program and recording medium which concern on this invention, the image coding process mentioned above is realizable with software.

以下、本発明を適用した具体的な実施の形態について、図面を参照しながら詳細に説明する。この実施の形態は、本発明を、ＪＰＥＧ−２０００方式により入力画像を圧縮符号化する画像符号化装置及びその方法に適用したものである。以下では、先ず、画像符号化装置の全体構成及びその動作について説明し、次いで、この画像符号化装置において本発明が適用された要部を説明する。 Hereinafter, specific embodiments to which the present invention is applied will be described in detail with reference to the drawings. In this embodiment, the present invention is applied to an image encoding apparatus and method for compressing and encoding an input image by the JPEG-2000 system. In the following, first, the overall configuration and operation of the image encoding device will be described, and then the main part to which the present invention is applied in this image encoding device will be described.

（１）画像符号化装置の構成及び動作
本実施の形態における画像符号化装置の概略構成を図１に示す。図１に示すように、画像符号化装置１は、ＤＣレベルシフト部１０と、ウェーブレット変換部１１と、量子化部１２と、符号ブロック化部１３と、ビットプレーン分解部１４と、符号化対象予測部１５と、ビットモデリング部１６と、算術符号化部１７と、レート制御部１９と、ヘッダ生成部２０と、パケット生成部２１とから構成されている。ここで、ビットモデリング部１６と算術符号化部１７とにより、ＥＢＣＯＴ（Embedded Coding with Optimized Truncation）部１８が構成される。 (1) Configuration and Operation of Image Encoding Device FIG. 1 shows a schematic configuration of the image encoding device in the present embodiment. As shown in FIG. 1, the image encoding device 1 includes a DC level shift unit 10, a wavelet transform unit 11, a quantization unit 12, a code block unit 13, a bit plane decomposition unit 14, and an encoding target. The prediction unit 15, the bit modeling unit 16, the arithmetic coding unit 17, the rate control unit 19, the header generation unit 20, and the packet generation unit 21 are configured. Here, the bit modeling unit 16 and the arithmetic coding unit 17 constitute an EBCOT (Embedded Coding with Optimized Truncation) unit 18.

ＤＣレベルシフト部１０は、後段のウェーブレット変換部１１におけるウェーブレット変換を効率的に行い圧縮率を向上させるために、原信号のレベルシフトを行う。原理的には、ＲＧＢ信号は、正の値（符号なしの整数値）を持つため、原信号のダイナミックレンジを半分にするレベルシフトを行うことで、圧縮効率を向上させることができる。これに対して、ＹＣｂＣｒ信号におけるＣｂやＣｒといった色差信号は、正負両方の整数値を持つため、レベルシフトは行われない。 The DC level shift unit 10 performs level shift of the original signal in order to efficiently perform the wavelet transform in the wavelet transform unit 11 in the subsequent stage and improve the compression rate. In principle, since the RGB signal has a positive value (an unsigned integer value), the compression efficiency can be improved by performing a level shift that halves the dynamic range of the original signal. On the other hand, since the color difference signals such as Cb and Cr in the YCbCr signal have both positive and negative integer values, level shift is not performed.

ウェーブレット変換部１１は、通常、低域フィルタと高域フィルタとから構成されるフィルタバンクによって実現される。なお、デジタルフィルタは、通常複数タップ長のインパルス応答（フィルタ係数）を持っているため、フィルタリングが行えるだけの入力画像を予めバッファリングしておく必要があるが、簡単のため、図１では図示を省略する。 The wavelet transform unit 11 is usually realized by a filter bank including a low-pass filter and a high-pass filter. Since a digital filter usually has an impulse response (filter coefficient) having a multi-tap length, it is necessary to buffer an input image that can be filtered in advance. Is omitted.

ＤＣレベルシフト部１０は、フィルタリングに必要な最低限の画像信号Ｄ１０を入力し、上述のようにレベルシフトを行う。そして、ウェーブレット変換部１１は、ＤＣレベルシフト後の画像信号Ｄ１１に対して、ウェーブレット変換を行うフィルタリング処理を行い、ウェーブレット変換係数Ｄ１２を生成する。 The DC level shift unit 10 receives the minimum image signal D10 necessary for filtering and performs level shift as described above. Then, the wavelet transform unit 11 performs a filtering process for performing wavelet transform on the image signal D11 after the DC level shift, and generates a wavelet transform coefficient D12.

このウェーブレット変換では、通常図２に示すように低域成分が繰り返し変換されるが、これは画像のエネルギの多くが低域成分に集中しているためである。このことは、図３（Ａ）に示す分割レベル＝１から図３（Ｂ）に示す分割レベル＝３のように、分割レベルを進めていくに従って、同図のようにサブバンドが形成されていくことからも分かる。ここで、図２におけるウェーブレット変換のレベル数は３であり、この結果計１０個のサブバンドが形成されている。ここで、図２においてＬ，Ｈはそれぞれ低域，高域を表し、Ｌ，Ｈの前の数字は分割レベルを表す。すなわち、例えば１ＬＨは、水平方向が低域で垂直方向が高域である分割レベル＝１のサブバンドを表す。 In this wavelet transform, the low frequency components are usually repeatedly transformed as shown in FIG. 2, because most of the energy of the image is concentrated on the low frequency components. This is because subbands are formed as shown in the figure as the division level is advanced from division level = 1 shown in FIG. 3A to division level = 3 shown in FIG. 3B. It can be seen from going. Here, the number of wavelet transform levels in FIG. 2 is 3, and as a result, a total of 10 subbands are formed. Here, L and H in FIG. 2 represent a low frequency and a high frequency, respectively, and the numbers before L and H represent a division level. That is, for example, 1LH represents a subband of division level = 1 in which the horizontal direction is a low frequency and the vertical direction is a high frequency.

量子化部１２は、ウェーブレット変換部１１から供給されたウェーブレット変換係数Ｄ１２に対して非可逆圧縮を施す。量子化手段としては、ウェーブレット変換係数Ｄ１２を量子化ステップサイズで除算するスカラ量子化を用いることができる。ここで、ＪＰＥＧ−２０００方式の規格上、上述の非可逆圧縮を行う場合で、非可逆の９×７ウェーブレット変換フィルタを用いる場合には、自動的にスカラ量子化を併用することが決められている。一方、可逆の５×３ウェーブレット変換フィルタを用いる場合には、量子化が行われず、後述するレート制御部１９において符号量制御が行われる。したがって、図１の量子化部１２が動作するのは、実際には非可逆の９×７ウェーブレット変換フィルタを用いた場合である。以下、この非可逆の９×７ウェーブレット変換フィルタを用いる場合を想定して説明を進める。 The quantization unit 12 performs irreversible compression on the wavelet transform coefficient D12 supplied from the wavelet transform unit 11. As the quantization means, scalar quantization that divides the wavelet transform coefficient D12 by the quantization step size can be used. Here, according to the JPEG-2000 standard, when the above-mentioned lossy compression is performed and the lossy 9 × 7 wavelet transform filter is used, it is decided to automatically use scalar quantization together. Yes. On the other hand, when a reversible 5 × 3 wavelet transform filter is used, quantization is not performed and code amount control is performed in a rate control unit 19 described later. Therefore, the quantization unit 12 in FIG. 1 operates when an irreversible 9 × 7 wavelet transform filter is actually used. Hereinafter, the description will be made assuming that this irreversible 9 × 7 wavelet transform filter is used.

符号ブロック化部１３は、量子化部１２で生成された量子化係数Ｄ１３を、エントロピー符号化の処理単位である所定の大きさの符号ブロックに分割する。ここで、サブバンド中の符号ブロックの位置関係を図４に示す。通常、例えば６４×６４程度のサイズの符号ブロックが、分割後の全てのサブバンド中に生成される。したがって、図２において最も分割レベルが小さい３ＨＨのサブバンドの大きさが６４０×３２０であった場合には、６４×６４の符号ブロックは、水平方向に１０個、垂直方向に５個、合計５０個存在することになる。符号ブロック化部１３は、符号ブロック毎の量子化係数Ｄ１４をビットプレーン分解部１４に供給し、後段の符号化処理は、これらの符号ブロック毎に行われる。 The code blocking unit 13 divides the quantized coefficient D13 generated by the quantizing unit 12 into code blocks having a predetermined size, which is a processing unit of entropy coding. Here, the positional relationship of the code blocks in the subband is shown in FIG. Usually, for example, a code block having a size of about 64 × 64 is generated in all subbands after division. Therefore, when the size of the 3HH subband having the smallest division level in FIG. 2 is 640 × 320, the number of 64 × 64 code blocks is 10 in the horizontal direction and 5 in the vertical direction, for a total of 50. There will be. The code block unit 13 supplies the quantization coefficient D14 for each code block to the bit plane decomposition unit 14, and the subsequent encoding process is performed for each code block.

ビットプレーン分解部１４は、符号ブロック毎の量子化係数Ｄ１４をビットプレーンに分解する。このビットプレーンの概念について図５を用いて説明する。図５（Ａ）は、縦４個、横４個の計１６個の係数からなる量子化係数を仮定したものである。この１６個の係数のうち絶対値が最大のものは１３であり、２進数表現では１１０１となる。したがって、係数の絶対値のビットプレーンは、図５（Ｂ）に示すような４つのビットプレーンから構成される。なお、各ビットプレーンの要素は、全て０又は１の数をとる。一方、量子化係数の符号は、−６が唯一負の値であり、それ以外は０又は正の値である。したがって、符号のビットプレーンは、図５（Ｃ）に示すようになる。ビットプレーン分解部１４は、このビットブレーンに分解された量子化係数Ｄ１５を符号化対象予測部１５に供給する。 The bit plane decomposition unit 14 decomposes the quantization coefficient D14 for each code block into bit planes. The concept of this bit plane will be described with reference to FIG. FIG. 5A assumes a quantized coefficient composed of a total of 16 coefficients of 4 vertical and 4 horizontal. Among the 16 coefficients, the coefficient having the maximum absolute value is 13, which is 1101 in binary representation. Therefore, the bit plane of the absolute value of the coefficient is composed of four bit planes as shown in FIG. Note that all the elements of each bit plane take a number of 0 or 1. On the other hand, as for the sign of the quantized coefficient, −6 is the only negative value, and other values are 0 or a positive value. Therefore, the bit plane of the code is as shown in FIG. The bit plane decomposition unit 14 supplies the quantization coefficient D15 decomposed into the bit brains to the encoding target prediction unit 15.

符号化対象予測部１５は、ビットプレーン分解部１４においてビットブレーンに分解された量子化係数Ｄ１５に基づいて、最終的に符号化されるビットプレーン数又は符号化パス数をＥＢＣＯＴ部１８で符号化する前に事前予測し、予測されたビットプレーンを抽出して、抽出したビットプレーン毎の係数ビットＤ１６をビットモデリング部１６に供給する。なお、この符号化対象予測部１５における予測処理の詳細については後述する。 The encoding target prediction unit 15 uses the EBCOT unit 18 to encode the number of bit planes or the number of encoding passes to be finally encoded based on the quantization coefficient D15 decomposed into bit brains by the bit plane decomposition unit 14. Pre-prediction is performed, the predicted bit plane is extracted, and the coefficient bit D 16 for each extracted bit plane is supplied to the bit modeling unit 16. Details of the prediction process in the encoding target prediction unit 15 will be described later.

ビットモデリング部１６は、符号化対象予測部１５から供給されたビットプレーン毎の係数ビットＤ１６に対して、以下のようにして係数ビットモデリングを行い、係数ビット毎のコンテキストＤ１７を算術符号化部１７に供給する。そして、算術符号化部１７は、この係数ビット毎のコンテキストＤ１７に対して算術符号化を施し、得られた算術符号Ｄ１８をレート制御部１９に供給する。ここで、本実施の形態では、特にＪＰＥＧ−２０００規格で定められたＥＢＣＯＴと呼ばれるエントロピー符号化を例に取りながら説明する。このＥＢＣＯＴについては、例えば、文献「IS0/IEC 15444-1, Information technology-JPEG 2000, Part 1:Core coding system」等に詳細に記載されている。なお、上述したように、ビットモデリング部１６と算術符号化部１７とにより、ＥＢＣＯＴ部１８が構成される。 The bit modeling unit 16 performs coefficient bit modeling on the coefficient bits D16 for each bit plane supplied from the encoding target prediction unit 15 as follows, and the context D17 for each coefficient bit is converted to the arithmetic coding unit 17. To supply. The arithmetic coding unit 17 performs arithmetic coding on the context D17 for each coefficient bit, and supplies the obtained arithmetic code D18 to the rate control unit 19. In this embodiment, entropy coding called EBCOT defined by the JPEG-2000 standard will be described as an example. The EBCOT is described in detail in, for example, the document “IS0 / IEC 15444-1, Information technology-JPEG 2000, Part 1: Core coding system”. As described above, the EBCOT unit 18 is configured by the bit modeling unit 16 and the arithmetic coding unit 17.

ＥＢＣＯＴは、所定の大きさのブロック毎にそのブロック内の係数ビットの統計量を測定しながら符号化する手段であり、符号ブロック単位に量子化係数をエントロピー符号化する。符号ブロックは、最上位ビット（MSB）から最下位ビット（LSB）方向にビットプレーン毎に独立して符号化される。また、符号ブロックの縦横のサイズは、４から２５６までの２の冪乗で、通常は３２×３２、６４×６４、１２８×３２等の大きさが使用される。量子化係数は、ｎビットの符号付き２進数で表されており、ｂｉｔ０からｂｉｔ(ｎ−２)がＬＳＢからＭＳＢまでのそれぞれのビットを表す。なお、残りの１ビットは符号である。符号ブロックの符号化は、ＭＳＢ側のビットプレーンから順番に、以下の(a)〜(c)に示す３種類の符号化パスによって行われる。
(a) Significance Propagation Pass
(b) Magnitude Refinement Pass
(c) Clean Up Pass EBCOT is a means for encoding for each block of a predetermined size while measuring the statistical amount of coefficient bits in the block, and entropy-encodes the quantized coefficient in units of code blocks. The code block is encoded independently for each bit plane in the direction from the most significant bit (MSB) to the least significant bit (LSB). The vertical and horizontal sizes of the code block are powers of 2 from 4 to 256, and sizes such as 32 × 32, 64 × 64, and 128 × 32 are usually used. The quantization coefficient is represented by an n-bit signed binary number, and bit0 to bit (n−2) represent respective bits from LSB to MSB. The remaining 1 bit is a code. The coding of the code block is performed by the three kinds of coding passes shown in the following (a) to (c) in order from the bit plane on the MSB side.
(a) Significance Propagation Pass
(b) Magnitude Refinement Pass
(c) Clean Up Pass

３つの符号化パスの用いられる順序を図６に示す。図６に示すように、先ずビットプレーン（ｎ−２）（MSB）がClean Up Pass（以下、適宜ＣＵパスという。）によって符号化される。続いて、順次ＬＳＢ側に向かい、各ビットプレーンが、Significance Propagation Pass（以下、適宜ＳＰパスという。）、Magnitude Refinement Pass（以下、適宜ＭＲパスという。）、Clean Up Pass の順序で用いられて符号化される。 The order in which the three coding passes are used is shown in FIG. As shown in FIG. 6, first, the bit plane (n-2) (MSB) is encoded by a Clean Up Pass (hereinafter referred to as a CU pass as appropriate). Subsequently, each bit plane is sequentially used in the order of Significance Propagation Pass (hereinafter referred to as SP pass), Magnitude Refinement Pass (hereinafter referred to as MR pass), and Clean Up Pass. It becomes.

但し、実際にはＭＳＢ側から何番目のビットプレーンで初めて１が出てくるかをヘッダに書き、ゼロ係数から構成されるビットプレーン（ゼロビットプレーン）は符号化しない。この順序で３種類の符号化パスを繰返し用いて符号化し、任意のビットプレーンの任意の符号化パスまでで符号化を打ち切ることにより、符号量と画質のトレードオフを取る、すなわちレート制御を行うことができる。 In practice, however, the first bit plane from the MSB side where 1 appears for the first time is written in the header, and the bit plane composed of zero coefficients (zero bit plane) is not encoded. In this order, encoding is performed by repeatedly using three types of encoding passes, and the encoding is terminated up to an arbitrary encoding pass of an arbitrary bit plane, thereby taking a trade-off between code amount and image quality, that is, rate control is performed. be able to.

ここで、係数ビットの走査（スキャニング）について図７を用いて説明する。符号ブロックは、高さ４個の係数ビット毎にストライプ（stripe）に分けられる。ストライプの幅は、符号ブロックの幅に等しい。スキャン順とは１個の符号ブロック内の全ての係数ビットを辿る順番であり、符号ブロック中では上のストライプから下のストライプへの順序、各ストライプ中では左の列から右の列への順序、各列中では上から下への順序でスキャニングされる。なお、各符号化パスにおいて符号ブロック中の全ての係数ビットがこのスキャン順で処理される。 Here, scanning (scanning) of coefficient bits will be described with reference to FIG. The code block is divided into stripes every four coefficient bits in height. The width of the stripe is equal to the width of the code block. The scan order is the order in which all the coefficient bits in one code block are traced. In the code block, the order is from the upper stripe to the lower stripe, and in each stripe, the order is from the left column to the right column. In each row, scanning is performed from top to bottom. In each coding pass, all coefficient bits in the code block are processed in this scan order.

以下、上述した３つの符号化パスについて説明する。なお、この３つの符号化パスについては、何れも上述した文献「IS0/IEC 15444-1, Information technology-JPEG 2000, Part 1:Core coding system」に記載されている。 Hereinafter, the three coding passes described above will be described. Note that these three coding passes are all described in the above-mentioned document “IS0 / IEC 15444-1, Information technology-JPEG 2000, Part 1: Core coding system”.

(a) Significance Propagation Pass
あるビットプレーンを符号化するＳＰパスでは、８近傍の少なくとも１つの係数が有意（significant）であるようなnon-significantな係数ビットが算術符号化される。その符号化した係数ビットの値が１である場合には、符号の正負が続けて算術符号化される。 (a) Significance Propagation Pass
In the SP pass for encoding a bit plane, non-significant coefficient bits in which at least one coefficient in the vicinity of 8 is significant are arithmetically encoded. When the value of the encoded coefficient bit is 1, the sign of the sign is continued and arithmetic coding is performed.

ここでsignificanceとは、各係数ビットに対して符号化器が持つ状態である。significanceの初期値は、non-significantを表す「０」であり、その係数で１が符号化されたときにsignificantを表す「１」に変化し、以降常に「１」であり続ける。したがって、significanceとは、有効桁の情報を既に符号化したか否かを示すフラグともいえる。あるビットプレーンでＳＰパスが発生すれば、以降のビットプレーンではＳＰパスは発生しない。 Here, the significance is a state that the encoder has for each coefficient bit. The initial value of significance is “0” representing non-significant, changes to “1” representing significant when 1 is encoded with the coefficient, and continues to be always “1” thereafter. Therefore, the significance can also be said to be a flag indicating whether or not valid digit information has already been encoded. If an SP path is generated in a certain bit plane, no SP path is generated in the subsequent bit planes.

(b) Magnitude Refinement Pass
ビットプレーンを符号化するＭＲパスでは、ビットプレーンを符号化するＳＰパスで符号化していないsignificantな係数ビットが算術符号化される。 (b) Magnitude Refinement Pass
In the MR pass that encodes the bit plane, significant coefficient bits that are not encoded in the SP pass that encodes the bit plane are arithmetically encoded.

(c) Clean Up Pass
ビットプレーンを符号化するＣＵパスでは、ビットプレーンを符号化するＳＰパスで符号化していないnon-significantな係数ビットが算術符号化される。その符号化した係数ビットの値が１である場合には、符号の正負が続けて算術符号化される。 (c) Clean Up Pass
In the CU pass that encodes the bit plane, non-significant coefficient bits that are not encoded in the SP pass that encodes the bit plane are arithmetically encoded. When the value of the encoded coefficient bit is 1, the sign of the sign is continued and arithmetic coding is performed.

なお、以上の３つの符号化パスでの算術符号化では、ケースに応じてＺＣ（Zero Coding）、ＲＬＣ（Run-Length Coding）、ＳＣ（Sign Coding）、ＭＲ（Magnitude Refinement）が使い分けられて係数のコンテキストが選択される。そして、ＭＱ符号化と呼ばれる算術符号によって選択されたコンテキストが符号化される。このＭＱ符号化は、ＪＢＩＧ２で規定された学習型の２値算術符号である。ＭＱ符号化については、例えば、文献「ISO/IEC FDIS 14492, “Lossy/Lossless Coding of Bi-level Images”, March 2000」等に記載されている。ＪＰＥＧ−２０００では、全ての符号化パスで合計１９種類のコンテキストがある。 In arithmetic coding in the above three coding passes, ZC (Zero Coding), RLC (Run-Length Coding), SC (Sign Coding), and MR (Magnitude Refinement) are used properly according to the case. Context is selected. Then, the selected context is encoded by an arithmetic code called MQ encoding. This MQ coding is a learning type binary arithmetic code defined by JBIG2. The MQ coding is described in, for example, the document “ISO / IEC FDIS 14492,“ Lossy / Lossless Coding of Bi-level Images ”, March 2000”. In JPEG-2000, there are a total of 19 contexts in all coding passes.

以上のようにして、ビットモデリング部１６は、ビットプレーン毎の係数ビットＤ１６を３つの符号化パスで処理し、係数ビット毎のコンテキストＤ１７を生成する。そして、算術符号化部１７は、この係数ビット毎のコンテキストＤ１７に対して算術符号化を施す。 As described above, the bit modeling unit 16 processes the coefficient bit D16 for each bit plane in three coding passes, and generates a context D17 for each coefficient bit. The arithmetic encoding unit 17 performs arithmetic encoding on the context D17 for each coefficient bit.

レート制御部１９は、少なくとも一部の符号化パスの処理を行った後で、算術符号化部１７から供給された算術符号Ｄ１８の符号量をカウントし、目標の符号量に達した時点で、それより後の算術符号Ｄ１８を切り捨てる。このように、符号量をオーバーする直前で切り捨てることにより、確実に目標の符号量に抑えることができる。レート制御部１９は、この符号量制御完了後の算術符号Ｄ１９を、ヘッダ生成部２０及びパケット生成部２１に供給する。なお、レート制御部１９におけるレート制御処理の詳細については後述する。 The rate control unit 19 counts the code amount of the arithmetic code D18 supplied from the arithmetic coding unit 17 after performing at least a part of the coding pass processing, and when the target code amount is reached, The arithmetic code D18 after that is discarded. Thus, by truncating immediately before the code amount is exceeded, the target code amount can be reliably suppressed. The rate control unit 19 supplies the arithmetic code D19 after completion of the code amount control to the header generation unit 20 and the packet generation unit 21. Details of the rate control process in the rate control unit 19 will be described later.

ヘッダ生成部２０は、レート制御部１９から供給された符号量制御完了後の算術符号Ｄ１９に基づいて、符号ブロック内での付加情報、例えば符号ブロック内の符号化パスの個数や圧縮コードストリームのデータ長等をヘッダＤ２０として生成し、このヘッダＤ２０をパケット生成部２１に供給する。 Based on the arithmetic code D19 after completion of the code amount control supplied from the rate control unit 19, the header generation unit 20 adds additional information in the code block, for example, the number of coding paths in the code block and the compression code stream. The data length or the like is generated as a header D20, and this header D20 is supplied to the packet generator 21.

パケット生成部２１は、符号量制御完了後の算術符号Ｄ１９とヘッダＤ２０とを合わせてパケットＤ２１を生成し、符号化コードストリームとして出力する。この際、パケット生成部２１は、図８に示すように同一解像度レベルから個々のパケットを生成する。なお、図８から分かるように、最低域であるパケット−１は、ＬＬ成分のみを含み、それ以外のパケット−２乃至パケット−４は、ＬＨ成分，ＨＬ成分及びＨＨ成分を含む。 The packet generator 21 generates a packet D21 by combining the arithmetic code D19 after completion of the code amount control and the header D20, and outputs the packet D21 as an encoded code stream. At this time, the packet generator 21 generates individual packets from the same resolution level as shown in FIG. As can be seen from FIG. 8, packet-1 which is the lowest band includes only the LL component, and other packets-2 to packet-4 include the LH component, the HL component, and the HH component.

以上のように、本実施の形態における画像符号化装置１は、ウェーブレット変換及びエントロピー符号化を用いて入力画像を高効率に圧縮符号化し、パケット化して符号化コードストリームとして出力することができる。 As described above, the image encoding device 1 according to the present embodiment can compress and encode an input image with high efficiency using wavelet transform and entropy encoding, packetize it, and output it as an encoded code stream.

（２）画像符号化装置における適用部分
（２−１）符号化対象予測部の構成及び動作
ところで、一般にＪＰＥＧ−２０００方式により入力画像を圧縮符号化する画像符号化装置では、ＥＢＣＯＴにおけるエントロピー符号化の処理負荷が非常に高い。その一方で、目標の圧縮率又はビットレートで符号化する際には、ＥＢＣＯＴで符号化した後に実際には使用されないものが存在し、これらは結果的に無駄になる。 (2) Application Part in Image Encoding Device (2-1) Configuration and Operation of Encoding Target Prediction Unit By the way, in an image encoding device that generally compresses and encodes an input image by the JPEG-2000 method, entropy encoding in EBCOT The processing load is very high. On the other hand, when encoding at a target compression rate or bit rate, there are some that are not actually used after encoding with EBCOT, and as a result, they are wasted.

そこで、本実施の形態における符号化対象予測部１５は、最終的に使用される蓋然性の高いビットプレーンを事前予測し、このビットプレーンのみを抽出して上述したＥＢＣＯＴ部１８に供給する。具体的には、１フレーム分又は１サブバンド分の有効ビットプレーン数を計数し、この有効ビットプレーン数を発生する符号量についての指標として用いて、最終的に符号化されるビットプレーン数を予測する。 Therefore, the encoding target prediction unit 15 according to the present embodiment predicts a bit plane having a high probability of being finally used, extracts only this bit plane, and supplies the extracted bit plane to the EBCOT unit 18 described above. Specifically, the number of effective bit planes for one frame or one subband is counted, and this number of effective bit planes is used as an index for the amount of code for generating the number of effective bit planes. Predict.

この符号化対象予測部１５の内部構成の一例を図９に示す。図９に示すように、符号化対象予測部１５は、上述した有効ビットプレーン数を計数する有効ビットプレーン計数部３０と、ビットプレーン数テーブルを参照して、この有効ビットプレーン数から符号化対象となるビットプレーン数を予測するテーブル参照部３１と、ビットプレーン数テーブルを記憶する記憶部３２と、符号化対象となるビットプレーン数に基づいて、後述するように符号化対象ビットプレーンを抽出する符号化対象ビットプレーン抽出部３３とから構成されている。なお、この記憶部３２としては、読み取り専用の不揮発性記憶媒体であるＲＯＭ（Read Only Memory）を用いることができる。 An example of the internal configuration of the encoding target prediction unit 15 is shown in FIG. As illustrated in FIG. 9, the encoding target prediction unit 15 refers to the effective bit plane counting unit 30 that counts the number of effective bit planes and the bit plane number table, and calculates the encoding target from the number of effective bit planes. Based on the table reference unit 31 that predicts the number of bit planes to be stored, the storage unit 32 that stores the bit plane number table, and the number of bit planes to be encoded, the encoding target bit plane is extracted as described later. The encoding target bit plane extraction unit 33 is configured. As the storage unit 32, a read only memory (ROM) that is a read-only nonvolatile storage medium can be used.

有効ビットプレーン計数部３０は、符号ブロック毎のビットプレーンＤ３０を入力し、ゼロ係数から構成されるビットプレーン（ゼロビットプレーン）を除く有効ビットプレーン数を計数する。さらに、有効ビットプレーン計数部３０は、フレーム内又はサブバンド内の全ての符号ブロックの有効ビットプレーン数を加算し、算出された有効ビットプレーン数の総和Ｄ３１をテーブル参照部３１に供給する。 The valid bit plane counting unit 30 receives the bit plane D30 for each code block, and counts the number of valid bit planes excluding the bit plane composed of zero coefficients (zero bit plane). Further, the effective bit plane counting unit 30 adds the effective bit plane numbers of all code blocks in the frame or subband, and supplies the calculated total sum D 31 of effective bit planes to the table reference unit 31.

テーブル参照部３１は、記憶部３２に記憶されているビットプレーン数テーブルを参照して、有効ビットプレーン計数部３０から供給された有効ビットプレーン数の総和Ｄ３１から符号化対象ビットプレーン数Ｄ３２を読み出し、読み出した符号化対象ビットプレーン数Ｄ３３を符号化対象ビットプレーン抽出部３３に供給する。 The table reference unit 31 refers to the bit plane number table stored in the storage unit 32, and reads out the encoding target bit plane number D32 from the sum D31 of the effective bit plane numbers supplied from the effective bit plane counting unit 30. The read encoding target bit plane number D33 is supplied to the encoding target bit plane extraction unit 33.

ここで、ビットプレーン数テーブルは、有効ビットプレーン数の値域とその値域における符号化対象ビットプレーン数とが対応付けられたものである。 Here, the bit plane number table is a table in which the range of the number of effective bit planes is associated with the number of bit planes to be encoded in the range.

例えば、フレーム毎の有効ビットプレーン数の総和を用いる場合には、図１０に示すように、有効ビットプレーン数の値域に対して符号化対象ビットプレーン数が対応したものとなる。この場合、１フレーム内の符号化ブロックにおける有効ビットプレーン数の総和が「６００」であれば、このビットプレーン数テーブルを参照することで、即座に符号化対象ビットプレーン数として「５」という数字が得られる。 For example, when the total number of effective bit planes for each frame is used, as shown in FIG. 10, the number of encoding target bit planes corresponds to the range of the effective bit plane number. In this case, if the sum of the number of effective bit planes in the coding block in one frame is “600”, the number “5” is immediately set as the number of bit planes to be encoded by referring to this bit plane number table. Is obtained.

一方、サブバンド毎の有効ビットプレーン数の総和を用いる場合には、図１１に示すように、各サブバンド内の有効ビットプレーン数の値域に対して符号化対象ビットプレーン数が対応したものとなる。ここで、図１１は、図１２に示すように５回ウェーブレット変換・分割を行う場合のテーブル例を示したものである。この場合、分割レベル＝３でＬＨのサブバンド内の有効ビットプレーン数の総和が「１００」であれば、このビットプレーン数テーブルを参照することで、即座に符号化対象ビットプレーン数として「３」という数字が得られる。他のサブバンドについても同様である。なお、図１０のビットプレーン数テーブルよりも図１１のビットプレーン数テーブルの方が、きめ細かい制御が可能であり、より高精度に有効ビットプレーン数を予測することができる。 On the other hand, when the sum of the number of effective bit planes for each subband is used, the number of encoding target bitplanes corresponds to the range of the number of effective bitplanes in each subband, as shown in FIG. Become. Here, FIG. 11 shows an example of a table when the wavelet transform / division is performed five times as shown in FIG. In this case, if the sum of the number of effective bit planes in the LH subband is “100” at the division level = 3, the number of bit planes to be encoded is “3” immediately by referring to this bit plane number table. "Is obtained. The same applies to the other subbands. Note that the bit plane number table in FIG. 11 can perform finer control than the bit plane number table in FIG. 10, and the number of effective bit planes can be predicted with higher accuracy.

図９に戻って、符号化対象ビットプレーン抽出部３３は、テーブル参照部３１から供給された符号化対象ビットプレーン数Ｄ３３に基づいて、符号ブロック毎のビットプレーンＤ３０から符号化対象ビットプレーンＤ３４のみを抽出する。具体的には、各符号ブロック毎に、最上位ビット（MSB）側から符号化対象ビットプレーン数分のビットプレーンを抽出する。そして符号化対象ビットプレーン抽出部３３は、抽出した符号化対象ビットプレーンＤ３４を、図１に示したビットモデリング部１６に供給する。 Returning to FIG. 9, the encoding target bit plane extraction unit 33, based on the encoding target bit plane number D33 supplied from the table reference unit 31, only the encoding target bit plane D34 from the bit plane D30 for each code block. To extract. Specifically, bit planes corresponding to the number of encoding target bit planes are extracted from the most significant bit (MSB) side for each code block. Then, the encoding target bit plane extraction unit 33 supplies the extracted encoding target bit plane D34 to the bit modeling unit 16 illustrated in FIG.

この抽出された符号化対象ビットプレーンＤ３４のみが後段のＥＢＣＯＴ部１８で実際に符号化されるため、抽出された符号化対象ビットプレーンＤ３４の数が元の符号ブロック毎のビットプレーンＤ３０の数よりも少なければ、その分だけ処理負荷を軽減し、符号化に費やす時間を短縮することができる。 Since only the extracted encoding target bit plane D34 is actually encoded by the EBCOT unit 18 at the subsequent stage, the number of extracted encoding target bit planes D34 is greater than the number of bit planes D30 for each original code block. If less, the processing load can be reduced by that amount, and the time spent for encoding can be shortened.

また、本実施の形態における符号化対象予測部１５は、予め有効ビットプレーン数と符号化対象ビットプレーン数とを対応付けたビットプレーン数テーブルを参照するのみであるため、このテーブルを記憶するだけのメモリ容量で済み、極めて処理負荷が小さい。 In addition, since the encoding target prediction unit 15 in the present embodiment only refers to the bit plane number table in which the number of effective bit planes and the number of encoding target bit planes are associated in advance, only this table is stored. The memory capacity is sufficient, and the processing load is extremely small.

以上の処理を、図１３を用いて、ビットプレーンの観点から具体的に説明する。この図１３は、Ｙ（輝度）、Ｕ，Ｖ（色差）の３つのコンポーネントに対して各サブバンド内の符号ブロック（CB）毎のビットプレーンを図示したものである。例えば、右端のＶ−５ＨＨは、色差成分Ｖの５ＨＨ、すなわち水平方向及び垂直方向が高域である分割レベル＝５のサブバンド中に存在する符号ブロック（CB）毎のビットプレーンを示す。また、図１３において、空白領域はゼロビットプレーンを示し、斜線領域は符号化コードストリームに最終的に含まれるビットプレーンを示し、点領域は後段におけるレート制御の結果、使用されないビットプレーンを示す。なお、斜線領域と点領域とを合わせたビットプレーン数が、有効ビットプレーン数に相当する。 The above processing will be specifically described from the viewpoint of the bit plane with reference to FIG. FIG. 13 illustrates a bit plane for each code block (CB) in each subband for three components of Y (luminance), U, and V (color difference). For example, V-5HH at the right end indicates a bit plane for each code block (CB) existing in 5HH of the color difference component V, that is, in a subband of division level = 5 in which the horizontal direction and the vertical direction are high frequencies. In FIG. 13, a blank area indicates a zero bit plane, a hatched area indicates a bit plane that is finally included in the encoded code stream, and a dot area indicates a bit plane that is not used as a result of rate control in the subsequent stage. Note that the number of bit planes combining the shaded area and the dot area corresponds to the number of effective bit planes.

符号化対象予測部１５においては、上述したように、フレーム毎又はサブバンド毎の全符号ブロックにおける有効ビットプレーン数の総和から、符号化対象ビットプレーン数が求められる。そして、各符号ブロック（CB）毎に、最上位ビット（MSB）側から符号化対象ビットプレーン数分のビットプレーンが抽出される。 As described above, the encoding target prediction unit 15 determines the number of encoding target bit planes from the total number of effective bit planes in all code blocks for each frame or subband. For each code block (CB), bit planes corresponding to the number of bit planes to be encoded are extracted from the most significant bit (MSB) side.

点領域中にある境界線は、符号化対象予測部１５において抽出されるビットプレーンと抽出されないビットプレーンとの境界を示している。すなわち、この境界線よりも最下位ビット（LSB）側に存在しているビットプレーンは、符号化対象予測部１５において抽出されない。 A boundary line in the point area indicates a boundary between a bit plane extracted by the encoding target prediction unit 15 and a bit plane not extracted. That is, the bit plane existing on the least significant bit (LSB) side of this boundary line is not extracted by the encoding target prediction unit 15.

図１４は、図１３のＹ−０ＬＬ、すなわち輝度成分Ｙの最低域のサブバンドに着目した図である。図１４中で黒く塗った領域は、結果的にＥＢＣＯＴ部１８での符号化を省略することのできるビットプレーンである。 FIG. 14 is a diagram paying attention to Y-0LL of FIG. 13, that is, the lowest band of the luminance component Y. In FIG. 14, the black area is a bit plane that can be omitted in the EBCOT unit 18 as a result.

なお、この例では、各コンポーネント毎に同一のサブバンド内の全ての符号ブロックについて符号化対象ビットプレーン数を一定としたが、これに限定されるものではなく、コンポーネント毎、或いは符号ブロック毎に符号化対象ビットプレーン数を可変に設定することもできる。但し、この場合には、コンポーネント毎、或いは符号ブロック毎に符号化対象ビットプレーン数が設定されたビットプレーン数テーブルが必要となる。 In this example, the number of bit planes to be encoded is constant for all the code blocks in the same subband for each component, but the present invention is not limited to this, and for each component or code block It is also possible to variably set the number of encoding target bit planes. However, in this case, a bit plane number table in which the number of encoding target bit planes is set for each component or code block is required.

また、上述の説明では、ビットプレーン数テーブルを参照して、有効ビットプレーン数から符号化対象ビットプレーン数を予測するものとしたが、有効ビットプレーン数から符号化対象符号化パス数を予測するようにしても構わない。 In the above description, the number of encoding target bitplanes is predicted from the number of effective bitplanes by referring to the bitplane number table, but the number of encoding target encoding passes is predicted from the number of effective bitplanes. It doesn't matter if you do.

この場合における符号化対象予測部１５の内部構成を図１５に示す。図１５に示すように、符号化対象予測部１５は、上述した有効ビットプレーン数を計数する有効ビットプレーン計数部４０と、ビットプレーン数テーブルを参照して、この有効ビットプレーン数から符号化対象となる符号化パス数を求めるテーブル参照部４１と、符号化パス数テーブルを記憶する記憶部４２とから構成されている。 FIG. 15 shows an internal configuration of the encoding target prediction unit 15 in this case. As shown in FIG. 15, the encoding target prediction unit 15 refers to the effective bit plane counting unit 40 that counts the number of effective bit planes and the bit plane number table described above, and determines the encoding target from the number of effective bit planes. The table reference unit 41 obtains the number of encoding passes and the storage unit 42 stores the encoding pass number table.

有効ビットプレーン計数部４０は、符号ブロック毎のビットプレーンＤ４０を入力して有効ビットプレーン数を計数し、フレーム内又はサブバンド内の全ての符号ブロックの有効ビットプレーン数を加算した有効ビットプレーン数の総和Ｄ４１をテーブル参照部４１に供給する。 The effective bit plane counting unit 40 inputs the bit plane D40 for each code block, counts the number of effective bit planes, and adds the number of effective bit planes of all code blocks in the frame or subband. Is supplied to the table reference unit 41.

テーブル参照部４１は、記憶部４２に記憶されている符号化パス数テーブルを参照して、有効ビットプレーン計数部４０から供給された有効ビットプレーン数の総和Ｄ４１から符号化対象符号化パス数Ｄ４２を読み出し、読み出した符号化対象符号化パス数Ｄ４３を図１に示したビットモデリング部１６に供給する。 The table reference unit 41 refers to the encoding pass number table stored in the storage unit 42 and calculates the encoding target encoding pass number D42 from the sum D41 of the effective bit plane numbers supplied from the effective bit plane counting unit 40. And the read encoding target encoding pass number D43 is supplied to the bit modeling unit 16 shown in FIG.

ここで、符号化パス数テーブルは、有効ビットプレーン数の値域とその値域における符号化対象符号化パス数とが対応付けられたものである。 Here, the encoding pass number table associates the range of the number of effective bit planes with the number of encoding target encoding passes in the range.

例えば、フレーム毎の有効ビットプレーン数の総和を用いる場合には、図１６に示すように、有効ビットプレーン数の値域に対して符号化対象符号化パス数が対応したものとなる。なお、図１０と比較して分かる通り、１ビットプレーンでは最大で３個の符号化パスが発生する場合があるため、それを考慮して図１０の符号化対象ビットプレーン数よりも大きな数字に設定されている。 For example, when the total number of effective bit planes for each frame is used, the number of encoding target encoding passes corresponds to the range of the number of effective bit planes as shown in FIG. As can be seen from the comparison with FIG. 10, since a maximum of three coding passes may occur in one bit plane, the number is larger than the number of bit planes to be coded in FIG. It is set.

後段のＥＢＣＯＴ部１８では、各符号ブロック毎に、最上位ビット（MSB）側のビットプレーンから符号化対象符号化パス数分の符号化パスのみが実際に符号化されるため、符号化対象符号化パス数Ｄ４３が元の符号ブロック毎のビットプレーンＤ４０に発生する符号化パスの数よりも少なければ、その分だけ処理負荷を軽減し、符号化に費やす時間を短縮することができる。 Since the EBCOT unit 18 at the subsequent stage actually encodes only the number of encoding passes corresponding to the number of encoding target encoding passes from the bit plane on the most significant bit (MSB) side for each code block, If the number of encoding passes D43 is smaller than the number of encoding passes generated in the bit plane D40 for each original code block, the processing load can be reduced correspondingly, and the time spent for encoding can be reduced.

（２−２）レート制御部の構成及び動作
図１において算術符号化部１７から供給された算術符号Ｄ１８は、レート制御部１９を経由しないでそのまま後段部に行くと、目標の圧縮率又はビットレートでない符号化コードストリームが排出される蓋然性が高い。したがって、レート制御部１９で最終的なレート制御が行われる必要がある。 (2-2) Configuration and Operation of Rate Control Unit When the arithmetic code D18 supplied from the arithmetic coding unit 17 in FIG. 1 goes directly to the subsequent stage without passing through the rate control unit 19, the target compression rate or bit There is a high probability that non-rate coded code streams will be discharged. Accordingly, the rate control unit 19 needs to perform final rate control.

ここで、本実施の形態におけるレート制御部１９は、優先順位の高い算術符号から順に選択してその符号量を加算し、目標の符号量に達した時点で停止する。 Here, the rate control unit 19 in the present embodiment selects arithmetic codes in descending order of priority, adds the code amounts, and stops when the target code amount is reached.

このレート制御部１９の内部構成の一例を図１７に示す。図１７に示すように、レート制御部１９は、符号量計算制御部５０と、符号量加算制御部５１とにより構成されている。 An example of the internal configuration of the rate control unit 19 is shown in FIG. As shown in FIG. 17, the rate control unit 19 includes a code amount calculation control unit 50 and a code amount addition control unit 51.

符号量計算制御部５０は、算術符号化部１７（図１）から供給された算術符号Ｄ５０のビットプレーン毎の符号量Ｄ５１を計算し、符号量加算制御部５１に供給する。符号量加算制御部５１は、後述する優先順位に従って、１ビットプレーンずつ符号量Ｄ５１を加算する。加算されたビットプレーン情報Ｄ５２は、再び符号量計算制御部５０に供給され、それまでに加算された全てのビットプレーンの符号量と目標符号量Ｄ５４とが比較される。そして、目標符号量Ｄ５４に達した時点でこのループ制御を終了し、最終的な符号量制御後の算術符号Ｄ５３を、図１に示したヘッダ生成部２０及びパケット生成部２１に供給する。 The code amount calculation control unit 50 calculates the code amount D51 for each bit plane of the arithmetic code D50 supplied from the arithmetic encoding unit 17 (FIG. 1), and supplies the code amount D51 to the code amount addition control unit 51. The code amount addition control unit 51 adds the code amount D51 for each bit plane according to the priority order described later. The added bit plane information D52 is supplied again to the code amount calculation control unit 50, and the code amounts of all the bit planes added so far are compared with the target code amount D54. Then, when the target code amount D54 is reached, this loop control is terminated, and the arithmetic code D53 after the final code amount control is supplied to the header generation unit 20 and the packet generation unit 21 shown in FIG.

ここで、ビットプレーン毎の符号量Ｄ５１を加算する優先順位は、以下の通りである。すなわち、全サブバンドの最上位ビット（MSB）のビットプレーンのうち、サブバンド間で最もビット位置が高いものから最下位ビット（LSB）のビットプレーンの順に選択する。 Here, the priority order of adding the code amount D51 for each bit plane is as follows. That is, among the most significant bit (MSB) bit planes of all the subbands, the bit plane having the highest bit position between the subbands is selected in order from the least significant bit (LSB) bit plane.

また、各サブバンドの同じビット位置のビットプレーンについては、最低域のサブバンドから最高域のサブバンドの順に選択する。例えば図２に示したように３回ウェーブレット変換・分割を行った場合、図１８に示すように、０ＬＬ、１ＨＬ、１ＬＨ、１ＨＨ、２ＨＬ、２ＬＨ、２ＨＨ、３ＨＬ、３ＬＨ、３ＨＨの順に選択する。これは、画像の重要な部分が高域よりも低域に集まっているためである。 In addition, bit planes at the same bit position in each subband are selected in the order of the lowest subband to the highest subband. For example, when wavelet transform / division is performed three times as shown in FIG. 2, selection is made in the order of 0LL, 1HL, 1LH, 1HH, 2HL, 2LH, 2HH, 3HL, 3LH, and 3HH as shown in FIG. This is because important parts of the image are gathered in the low range rather than the high range.

また、Ｙ（輝度）、Ｕ，Ｖ（色差）の３つのコンポーネントの同じビット位置のビットプレーンについては、例えば、Ｙ、Ｕ、Ｖの順に選択する。これは、一般に色差情報よりも輝度情報に対して人間の視覚特性が敏感なためである。なお、同じ色差情報であるＵ及びＶの重要度は入力画像に依存するため、優先順位を適宜可変にすることが好ましい。 Further, bit planes at the same bit positions of the three components Y (luminance), U, and V (color difference) are selected in the order of Y, U, and V, for example. This is because human visual characteristics are generally more sensitive to luminance information than color difference information. Since the importance of U and V, which are the same color difference information, depends on the input image, it is preferable to change the priority appropriately.

このビットプレーンの加算処理手順を図１９のフローチャートに示す。先ずステップＳ１において、ＥＢＣＯＴ部１８で符号化された全符号化パスの情報と各ビットプレーンの符号量とを保持する。 This bit plane addition processing procedure is shown in the flowchart of FIG. First, in step S1, information on all coding passes encoded by the EBCOT unit 18 and the code amount of each bit plane are held.

次にステップＳ２において、加算符号量Ｙを０に初期化し、続くステップＳ３において、ゼロビットプレーンを含む、サブバンド間で最もビット位置の高い最初のビットプレーンを選択する。ここで、同じビット位置に複数のビットプレーンが存在する場合には、上述の通り、サブバンドについては最低域のサブバンドから最高域のサブバンドの順に、コンポーネントについては例えばＹ、Ｕ、Ｖの順に選択する。 Next, in step S2, the added code amount Y is initialized to 0, and in the subsequent step S3, the first bit plane having the highest bit position between subbands including the zero bit plane is selected. Here, when there are a plurality of bit planes at the same bit position, as described above, for the subbands, for example, Y, U, and V for components in the order of the lowest band to the highest band. Select in order.

続いてステップＳ４において、選択したビットプレーンの符号量Ｔ[Ns,Nc,C,B]をＹに加算する。ここで、Ｎｓ，Ｎｃ，Ｃ，Ｂは、それぞれサブバンド番号、コンポーネント番号、符号ブロック番号、ビットプレーン番号を示す。 In step S4, the code amount T [Ns, Nc, C, B] of the selected bit plane is added to Y. Here, Ns, Nc, C, and B indicate a subband number, a component number, a code block number, and a bit plane number, respectively.

ステップＳ５では、加算符号量Ｙが目標符号量以上であるか否かが判別される。加算符号量Ｙが目標符号量以上である場合（Yes）には加算処理を終了し、符号量Ｙが目標符号量未満である場合（No）にはステップＳ６に進む。 In step S5, it is determined whether or not the added code amount Y is greater than or equal to the target code amount. When the added code amount Y is equal to or greater than the target code amount (Yes), the addition process is terminated, and when the code amount Y is less than the target code amount (No), the process proceeds to step S6.

ステップＳ６では、同じビット位置のビットプレーンが存在するか否かが判別される。同じビット位置のビットプレーンが存在する場合（Yes）にはステップＳ７に進み、次のビットプレーンを選択してステップＳ４に戻る。一方、同じビット位置のビットプレーンが存在しない場合（No）にはステップＳ８に進む。 In step S6, it is determined whether there is a bit plane at the same bit position. If there is a bit plane at the same bit position (Yes), the process proceeds to step S7, the next bit plane is selected, and the process returns to step S4. On the other hand, if there is no bit plane at the same bit position (No), the process proceeds to step S8.

ステップＳ８では、ビット位置が最も低いか否か、すなわち最下位ビット（LSB）か否かが判別される。ビット位置が最も低い場合（Yes）には加算処理を終了し、そうでない場合（No）にはステップＳ９において最下位ビット（LSB）側の次のビット位置の最初のビットプレーンを選択してステップＳ４に戻る。 In step S8, it is determined whether or not the bit position is the lowest, that is, the least significant bit (LSB). If the bit position is the lowest (Yes), the addition process is terminated. If not (No), the first bit plane of the next bit position on the least significant bit (LSB) side is selected in Step S9 and the step is performed. Return to S4.

以上の処理を、図２０を用いて、ビットプレーンの観点から具体的に説明する。この図２０は、Ｙ（輝度）、Ｕ，Ｖ（色差）の３つのコンポーネントに対して各サブバンド内の符号化ブロック（CB）毎のビットプレーンを図示している。図１３と同様に、空白領域はゼロビットプレーンを示し、斜線領域は符号化コードストリームに最終的に含まれるビットプレーンを示し、点領域はレート制御の結果、使用されないビットプレーンを示す。 The above processing will be specifically described from the viewpoint of the bit plane with reference to FIG. FIG. 20 illustrates a bit plane for each coding block (CB) in each subband for three components of Y (luminance), U, and V (color difference). As in FIG. 13, the blank area indicates a zero bit plane, the hatched area indicates a bit plane that is finally included in the encoded code stream, and the dot area indicates a bit plane that is not used as a result of rate control.

図２０に示すように、レート制御部１９においては、ゼロビットプレーンを含む最もビット位置の高いビットプレーンから最もビット位置の低いビットプレーン、すなわち最下位ビット（LSB）のビットプレーンの順に、サブバンド、コンポーネントを渡りながら選択され、同じビット位置ではサブバンドについては最低域のサブバンドから最高域のサブバンドの順に、コンポーネントについては例えばＹ、Ｕ、Ｖの順にビットプレーンが選択される。具体的には、図２０において、Ｙ−０ＬＬ、Ｕ−０ＬＬ及びＶ−０ＬＬのビット位置がサブバンド間で最も高いため、Ｙ−０ＬＬ、Ｕ−０ＬＬ、Ｖ−０ＬＬの順に最上位ビット（MSB）側からビットプレーンが選択される。 As shown in FIG. 20, in the rate control unit 19, subbands are arranged in the order from the bit plane having the highest bit position including the zero bit plane to the bit plane having the lowest bit position, that is, the bit plane of the least significant bit (LSB). In the same bit position, bit planes are selected in the order of the lowest band to the highest band, and the components in the order of Y, U, V, for example. Specifically, in FIG. 20, since the bit positions of Y-0LL, U-0LL, and V-0LL are the highest among subbands, the most significant bit (MSB) in the order of Y-0LL, U-0LL, and V-0LL. The bit plane is selected from the () side.

なお、この例では、Ｙ−０ＬＬ〜Ｙ−５ＨＨ、Ｕ−０ＬＬ〜Ｕ−５ＨＨ、Ｖ−０ＬＬ〜Ｖ−５ＨＨと、サブバンド毎の選択をコンポーネント毎の選択に優先させているが、これに限定されるものではなく、コンポーネント毎の選択をサブバンド毎の選択に優先させるようにしても構わない。 In this example, Y-0LL to Y-5HH, U-0LL to U-5HH, V-0LL to V-5HH, and the selection for each subband are given priority over the selection for each component. The selection is not limited, and selection for each component may be given priority over selection for each subband.

このように、本実施の形態におけるレート制御部１９によれば、最終的に選択されずに切り捨てられるビットプレーン数が、１フレーム内の全ての符号ブロックに対して最下位ビット（LSB）側から数えて最大でも１ビットプレーンしか相違しないため、サブバンド間の画質差がなくなり、全体的に高画質な画像が得られる。 As described above, according to the rate control unit 19 in the present embodiment, the number of bit planes to be discarded without being finally selected is the least significant bit (LSB) side for all code blocks in one frame. Since only one bit plane is different at the maximum, there is no difference in image quality between subbands, and an overall high-quality image can be obtained.

（２−３）量子化部の動作
上述したように、量子化部１２は、ウェーブレット変換部１１から供給されたウェーブレット変換係数Ｄ１２に対して、量子化ステップサイズで除算するスカラ量子化により非可逆圧縮を施す。 (2-3) Operation of Quantization Unit As described above, the quantization unit 12 is irreversible by scalar quantization that divides the wavelet transform coefficient D12 supplied from the wavelet transform unit 11 by the quantization step size. Apply compression.

ところで、ＪＰＥＧ−２０００規格で規定されている量子化は、以下の式（１）に示すように、ある変換係数ａ_ｂ（ｘ，ｙ）（但し、ｘは水平方向の位置を示し、ｙは垂直方向の位置を示す）を、サブバンドｂの量子化ステップサイズΔＷ_ｂで除算することによって、量子化係数Ｑ_ｂ（ｘ，ｙ）を算出するものである。 By the way, the quantization defined in the JPEG-2000 standard is a conversion coefficient a _b (x, y) (where x indicates the position in the horizontal direction and y is The quantization coefficient Q _b (x, y) is calculated by dividing (vertical position) by the quantization step size ΔW _b of the subband b.

式（１）におけるΔＷ_ｂは、以下の式（２）に従って算出することができる。ここで、ΔＢ−Ｓｔｅｐは、全サブバンドで共通の基本ステップサイズを示し、Ｌ２_ｂは、サブバンドｂの合成フィルタ基底波形のＬ２ノルムを示し、ΔＮ_ｂは、サブバンドｂの正規化量子化ステップサイズを示す。 ΔW _b in equation (1) can be calculated according to equation (2) below. Here, ΔB-Step indicates a basic step size common to all subbands, L2 _b indicates the L2 norm of the combined filter base waveform of subband _b , and ΔN _b indicates normalized quantization of subband b. Indicates the step size.

次に、ΔＷ_ｂが算出されてから、以下の式（３）に従って指数ε_ｂ及び仮数μ_ｂを求める。ここで、Ｒ_ｂはサブバンドｂにおけるダイナミックレンジである。これらの値ε_ｂ、μ_ｂが実際に最終的な符号化コードストリームに含まれることになる。 Next, after ΔW _b is calculated, an index ε _b and a mantissa μ _b are obtained according to the following equation (3). Here, R _b is the dynamic range in subband b. These values ε _b and μ _b are actually included in the final encoded code stream.

ここで、本実施の形態における量子化部１２は、量子化ステップサイズΔＷ_ｂを求める際に、上述した式（２）を用いるのではなく、サブバンドｂ毎の重み係数ＶＷ_ｂを用いて、以下の式（４）に従って算出する。 Here, the quantization unit 12 according to the present embodiment uses the weighting coefficient VW _b for each subband b instead of using the above-described equation (2) when obtaining the quantization step size ΔW _b . It calculates according to the following formula | equation (4).

この重み係数ＶＷ_ｂは、サブバンド毎、或いはＹ（輝度）、Ｃｂ，Ｃｒ（色差）といったコンポーネント毎に設定することができ、重み係数テーブルとして記憶される。この重み係数の値が大きい場合には、上述した式（３）からΔＷ_ｂが小さくなり、式（１）から量子化係数Ｑ_ｂ（ｘ，ｙ）が大きくなるため、画像の重要な部分が集中している低域ほど値を大きくすることが好ましい。また、色差情報（Cb，Cr）よりも輝度情報（Y）に対して人間の視覚特性が敏感であるため、輝度情報（Y）の方の値を大きくすることが好ましい。 The weighting factor VW _b are each subband, or Y (luminance), Cb, can be set for each component such as Cr (color difference) is stored as the weighting coefficient table. When the value of the weighting factor is large, ΔW _b is reduced from the above-described equation (3), and the quantization coefficient Q _b (x, y) is increased from the equation (1). It is preferable to increase the value as the low frequencies are concentrated. Further, since the human visual characteristic is more sensitive to the luminance information (Y) than the color difference information (Cb, Cr), it is preferable to increase the value of the luminance information (Y).

図１２のように５回ウェーブレット変換・分割を行う場合の重み係数テーブルの一例を図２１に示す。図２１に示すように、分割レベルが小さい低域ほど値が大きく、また、色差情報（Cb，Cr）よりも輝度情報（Y）の方が値が大きくなっている。 FIG. 21 shows an example of a weighting coefficient table in the case where the wavelet transformation / division is performed five times as shown in FIG. As shown in FIG. 21, the value is larger as the division level is lower, and the luminance information (Y) is larger than the color difference information (Cb, Cr).

後段の符号ブロック化部１３では、量子化部１２で生成された量子化係数Ｄ１３が所定の大きさの符号ブロックに分割され、ビットプレーン分解部１４では、符号ブロック毎の量子化係数Ｄ１４がビットプレーンに分解されるため、量子化係数の値が大きいほど、ビットプレーン数が多くなる。したがって、上述したように、レート制御部１９において、全ての符号ブロックで最もビット位置の高い算術符号から最下位ビットの算術符号の順に、ビットプレーン毎又は符号化パス毎の算術符号を選択する場合には、より重要な情報を優先的に符号化コードストリームに含めることができ、結果として高画質な符号化画像を提供することができる。 In the code block generation unit 13 at the subsequent stage, the quantization coefficient D13 generated by the quantization unit 12 is divided into code blocks of a predetermined size, and in the bit plane decomposition unit 14, the quantization coefficient D14 for each code block is a bit. Since it is decomposed into planes, the number of bit planes increases as the value of the quantization coefficient increases. Therefore, as described above, when the rate control unit 19 selects the arithmetic code for each bit plane or for each coding pass in the order from the arithmetic code having the highest bit position to the arithmetic code of the least significant bit in all code blocks. Therefore, more important information can be preferentially included in the encoded code stream, and as a result, a high-quality encoded image can be provided.

（３）その他
上述したＪＰＥＧ−２０００規格は、静止画の規格であり、動画像では頻繁に存在するインタレース画像に対する十分な検討がなされていない。したがって、静止画用の技術をそのまま動画像に応用すると、劣化が目立ってしまうという欠点が露呈する。以下、具体的に説明する。 (3) Others The above-mentioned JPEG-2000 standard is a standard for still images, and sufficient consideration has not been made for interlaced images that frequently exist in moving images. Therefore, if the technology for still images is applied to a moving image as it is, the disadvantage that the deterioration becomes conspicuous is exposed. This will be specifically described below.

あるインタレースの動画シーンの１フレームを図２２に示す。これをＰＣモニタ等のプログレッシブ方式のモニタで見ると、動きのある部分にインタレースの影響が強く出て、横方向に縞模様が見える。 One frame of a certain interlaced moving image scene is shown in FIG. When this is viewed on a progressive monitor such as a PC monitor, the influence of interlacing is strong on the moving part, and a striped pattern appears in the horizontal direction.

すなわち、図２３（Ａ）に示すように、丸い物体が画面中を右方向に移動する場合、インタレース画面では問題ないが、プログレッシブ画面では図２３（Ｂ）のように横方向に縞模様が見えてしまう。これはインタレース画面が奇数フィールドと偶数フィールドとで飛び越し走査を用いていることに起因している。 That is, as shown in FIG. 23A, when a round object moves in the right direction on the screen, there is no problem in the interlaced screen, but in the progressive screen, a stripe pattern is formed in the horizontal direction as shown in FIG. I can see it. This is because the interlaced screen uses interlaced scanning in odd and even fields.

図２２に示した画像を５回ウェーブレット変換・分割した後の各サブバンドの様子を図２４に示す。図２４から分かるように、明らかに５ＬＨ成分、すなわち水平方向が低域で垂直方向が高域である分割レベル＝５のサブバンドの係数のエネルギが他のサブバンドに比べて大きくなっている。 FIG. 24 shows the state of each subband after the image shown in FIG. 22 is wavelet transformed and divided five times. As can be seen from FIG. 24, the energy of the subband coefficients of 5 LH components, that is, the division level = 5 in which the horizontal direction is the low frequency and the vertical direction is the high frequency is larger than the other subbands.

したがって、５ＬＨのサブバンド内の符号化ブロックをそのまま符号化すれば、インタレース成分を重視した符号化が実現できることになる。 Therefore, if an encoded block in the 5LH subband is encoded as it is, encoding with an emphasis on the interlace component can be realized.

一方、プログレシブ画像表示装置に出力する場合には、５ＬＨのサブバンド内の符号ブロックを符号化しなければよい。具体的には、５ＬＨのサブバンド内の全ての符号ブロックのビットプレーン又は符号化パス数を０にすればよい。なお、分割レベルが最高域のＬＨ成分に限らず、全ての分割レベルでＬＨ成分の符号ブロックのビットプレーン又は符号化パス数を０にするようにしても構わない。 On the other hand, in the case of outputting to a progressive image display device, a code block in a 5 LH subband may not be encoded. Specifically, the number of bit planes or the number of coding passes of all code blocks in the 5LH subband may be set to zero. The division level is not limited to the LH component in the highest range, and the bit plane or the number of coding passes of the code block of the LH component may be set to 0 at all division levels.

例えば、図１１に示したように、符号化対象予測部１５においてサブバンド毎の全ての符号ブロックの有効ビットプレーン数の総和から符号化対象ビットプレーン数を予測する場合には、図２５に示すように、５ＬＨ成分に対応する符号化対象ビットプレーン数を０に設定しておけばよい。 For example, as shown in FIG. 11, when the encoding target prediction unit 15 predicts the number of encoding target bit planes from the total number of effective bit planes of all the code blocks for each subband, as shown in FIG. Thus, the number of encoding target bit planes corresponding to the 5LH component may be set to 0.

なお、本発明は上述した実施の形態のみに限定されるものではなく、本発明の要旨を逸脱しない範囲において種々の変更が可能であることは勿論である。 It should be noted that the present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the present invention.

例えば、上述の実施の形態では、ハードウェアの構成として説明したが、これに限定されるものではなく、任意の処理を、ＣＰＵ（Central Processing Unit）にコンピュータプログラムを実行させることにより実現することも可能である。この場合、コンピュータプログラムは、記録媒体に記録して提供することも可能であり、また、インターネットその他の伝送媒体を介して伝送することにより提供することも可能である。 For example, in the above-described embodiment, the hardware configuration has been described. However, the present invention is not limited to this, and arbitrary processing may be realized by causing a CPU (Central Processing Unit) to execute a computer program. Is possible. In this case, the computer program can be provided by being recorded on a recording medium, or can be provided by being transmitted via the Internet or another transmission medium.

上述した本発明によれば、例えばＪＰＥＧ−２０００方式の画像符号化装置において、算術符号化の処理負荷を軽減すると共に、画質劣化を抑えて効果的にレート制御を行うことができる。 According to the above-described present invention, for example, in a JPEG-2000 image coding apparatus, it is possible to reduce the processing load of arithmetic coding and to effectively perform rate control while suppressing image quality deterioration.

本実施の形態における画像符号化装置の概略構成を説明する図である。It is a figure explaining schematic structure of the image coding apparatus in this Embodiment. 分割レベル＝３までウェーブレット変換・分割した場合のサブバンドを説明する図である。It is a figure explaining the subband at the time of performing wavelet transformation and a division | segmentation to division | segmentation level = 3. 実際の画像をウェーブレット変換・分割した場合のサブバンドを説明する図である。It is a figure explaining the subband at the time of carrying out wavelet transform and a division | segmentation of an actual image. 符号ブロックとサブバンドとの関係を説明する図である。It is a figure explaining the relationship between a code block and a subband. ビットプレーンを説明する図であり、同図（Ａ）は、計１６個の係数から成る量子化係数を示し、同図（Ｂ）は、係数の絶対値のビットプレーンを示し、同図（Ｃ）は、符号のビットプレーンを示す。FIG. 4A is a diagram for explaining a bit plane. FIG. 1A shows a quantized coefficient consisting of a total of 16 coefficients, FIG. 2B shows a bit plane of an absolute value of the coefficient, and FIG. ) Indicates a bit plane of the code. 符号ブロック内の符号化パスの処理手順を説明する図である。It is a figure explaining the process sequence of the encoding pass in a code block. 符号ブロック内の係数のスキャン順序を説明する図である。It is a figure explaining the scan order of the coefficient in a code block. 同画像符号化装置で生成されるパケットを説明する図である。It is a figure explaining the packet produced | generated with the image coding apparatus. 同画像符号化装置における符号化対象予測部の内部構成の一例を説明する図である。It is a figure explaining an example of an internal structure of the encoding object prediction part in the image coding apparatus. フレーム毎の有効ビットプレーン数の総和を利用する場合のビットプレーン数テーブルの一例を示す図である。It is a figure which shows an example of the bit plane number table in the case of utilizing the sum total of the number of effective bit planes for every frame. サブバンド毎の有効ビットプレーン数の総和を利用する場合のビットプレーン数テーブルの一例を示す図である。It is a figure which shows an example of the bit plane number table in the case of utilizing the sum total of the number of effective bit planes for every subband. 分割レベル＝５までウェーブレット変換・分割した場合のサブバンドを説明する図である。It is a figure explaining the subband at the time of performing wavelet transformation and a division | segmentation to division | segmentation level = 5. 同符号化対象予測部における処理をビットプレーンの観点から具体的に説明する図である。It is a figure which demonstrates the process in the encoding object prediction part concretely from a bit-plane viewpoint. 輝度成分Ｙの最低域のサブバンドに着目した図である。It is a figure which paid its attention to the subband of the lowest region of the luminance component Y. 符号化対象符号化パス数を予測する場合の符号化対象予測部の内部構成の一例を説明する図である。It is a figure explaining an example of an internal structure of the encoding object prediction part in the case of estimating the encoding object encoding pass number. サブバンド毎の有効ビットプレーン数の総和を利用する場合の符号化パス数テーブルの一例を示す図である。It is a figure which shows an example of the encoding pass number table in the case of utilizing the sum total of the number of effective bit planes for every subband. 同画像符号化装置におけるレート制御部の内部構成の一例を説明する図である。It is a figure explaining an example of an internal structure of the rate control part in the image coding apparatus. サブバンド毎の選択順序を説明する図である。It is a figure explaining the selection order for every subband. 同レート制御部におけるビットプレーン毎の符号量の加算処理手順を説明するフローチャートである。It is a flowchart explaining the addition processing procedure of the code amount for every bit plane in the same rate control part. 同レート制御部における処理をビットプレーンの観点から具体的に説明する図である。It is a figure which demonstrates the process in the rate control part concretely from a bit-plane viewpoint. サブバンド毎に重み付けして量子化する場合の重み係数テーブルの一例を示す図である。It is a figure which shows an example of the weighting coefficient table in the case of weighting and quantizing for every subband. インタレース動画像の一例を説明する図である。It is a figure explaining an example of an interlace moving image. インタレース画像における被写体画像の見え方を説明する図であり、同図（Ａ）は、物体が画面中を右方向に移動する場合を示し、同図（Ｂ）は、プログレッシブ画面における当該物体の見え方を示す。FIG. 4A is a diagram for explaining how a subject image appears in an interlaced image. FIG. 3A shows a case where an object moves in the right direction on the screen, and FIG. 3B shows the object on the progressive screen. Show how it looks. インタレース画像をウェーブレット変換・分割した場合のサブバンドを説明する図である。It is a figure explaining the subband at the time of carrying out wavelet transform and a division | segmentation of an interlaced image. 入力画像がインタレース画像であり、サブバンド毎の有効ビットプレーン数の総和を利用する場合のビットプレーン数テーブルの一例を示す図である。It is a figure which shows an example of the bit plane number table in case an input image is an interlace image and uses the sum total of the number of effective bit planes for each subband.

Explanation of symbols

１画像符号化装置、１０ＤＣレベルシフト部、１１ウェーブレット変換部、１２量子化部、１３符号ブロック化部、１４ビットプレーン分解部、１５符号化対象予測部、１６ビットモデリング部、１７算術符号化部、１８ＥＢＣＯＴ部、１９レート制御部、２０ヘッダ生成部、２１パケット生成部、３０有効ビットプレーン計数部、３１テーブル参照部、３２記憶部、３３符号化対象ビットプレーン抽出部、４０有効ビットプレーン計数部、４１テーブル参照部、４２記憶部、５０符号量計算制御部、５１符号量加算制御部 DESCRIPTION OF SYMBOLS 1 Image coding apparatus, 10 DC level shift part, 11 Wavelet transformation part, 12 Quantization part, 13 Code block part, 14 Bit plane decomposition part, 15 Encoding object prediction part, 16 bit modeling part, 17 Arithmetic coding Unit, 18 EBCOT unit, 19 rate control unit, 20 header generation unit, 21 packet generation unit, 30 effective bit plane counting unit, 31 table reference unit, 32 storage unit, 33 encoding target bit plane extraction unit, 40 effective bit plane Counting unit, 41 Table reference unit, 42 Storage unit, 50 Code amount calculation control unit, 51 Code amount addition control unit

Claims

Filtering means for generating a subband by applying a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction, and performing a hierarchical filtering process on the subband of the low-frequency component;
Code block generation means for dividing the subband generated by the filtering means and generating a code block of a predetermined size;
Bit plane generating means for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
Encoding target prediction means for predicting the encoding target number to be encoded and generating encoding target number information;
Among the encoding targets generated by the encoding target prediction means, an encoding means for performing encoding only for the encoding target given by the encoding target number information from the most significant bit side of each code block; Prepared,
The encoding target prediction means counts the number of effective bit planes excluding zero bit planes for all code blocks in the frame of the input image, and refers to a predetermined table on the basis of the counting result for each frame. An image encoding device that obtains the number of encoding objects to be encoded.

2. The image encoding device according to claim 1, wherein the number of encoding targets is the number of bit planes or the number of encoding passes to be encoded.

2. The image encoding device according to claim 1, wherein the predetermined table is a table in which a range of effective bit planes and the number of encoding targets to be encoded are associated with each other.

The code amount of the arithmetic code is added for each bit plane in the order of the arithmetic code having the highest bit position to the arithmetic code of the least significant bit in all the code blocks of the input image, and exceeds a predetermined target code amount. The image coding apparatus according to claim 1, further comprising code amount control means for stopping addition of the code amount of the arithmetic code.

5. The image encoding device according to claim 4, wherein the code amount control means adds the code amount of the arithmetic code in the order of the lowest band to the highest band for the arithmetic code at the same bit position.

5. The image coding apparatus according to claim 4, wherein the code amount control means adds the code amount of the arithmetic code in the order of the luminance information component to the color difference information component for the arithmetic code at the same bit position.

Further comprising: a quantization unit that quantizes the transform coefficient in the subband generated by the filtering unit by dividing the transform coefficient by a quantization step size that is weighted using a weighting factor set for each subband. Item 2. The image encoding device according to Item 1.

The image coding apparatus according to claim 7, wherein the weighting factor is set such that the quantization step size becomes smaller as a subband of a low frequency component.

8. The image encoding apparatus according to claim 7, wherein the weighting coefficient is set so that the quantization step size is smaller in the luminance information component than in the color difference information component.

Filtering means for generating a subband by applying a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction, and performing a hierarchical filtering process on the subband of the low-frequency component;
Code block generation means for dividing the subband generated by the filtering means and generating a code block of a predetermined size;
Bit plane generating means for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
Encoding target prediction means for predicting the encoding target number to be encoded and generating encoding target number information;
Among the encoding targets generated by the encoding target prediction means, an encoding means for performing encoding only for the encoding target given by the encoding target number information from the most significant bit side of each code block; Prepared,
The encoding target prediction unit counts the number of effective bit planes excluding zero bit planes for all code blocks in the subband of the input image, and refers to a predetermined table based on the counting result, thereby An image encoding apparatus that obtains the number of encoding targets for each band.

The image coding apparatus according to claim 10 , wherein the predetermined table is a table in which a range of effective bit planes and the number of coding targets to be coded are associated with each subband.

If the input image is an interlaced image, the horizontal direction according to claim 11, wherein the division level in the low range is vertical high frequency is the coded number corresponding to the subband of the highest region is set to 0 Image encoding device.

If the input image is an interlaced image, the horizontal direction according to claim 11, wherein said coded number corresponding to the subbands of all the division level is high band in the vertical direction in the low range is set to 0 the image Encoding device.

A filtering step of applying a low-pass filter and a high-pass filter to the input image in a vertical direction and a horizontal direction to generate subbands, and hierarchically filtering the subbands of the low-frequency components;
A code block generation step of dividing the subband generated in the filtering step and generating a code block of a predetermined size;
A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
An encoding target prediction step for predicting the encoding target number to be encoded and generating encoding target number information;
Encoding that performs encoding only for the number of encoding targets given by the number of encoding targets information from the most significant bit side of each code block among the encoding targets generated in the encoding target prediction step A process,
In the encoding target prediction step, the number of effective bit planes excluding zero bit planes is counted for all code blocks in the frame of the input image, and a predetermined table is referred to based on the counting result, so that An image encoding method for obtaining the number of encoding objects to be encoded.

A filtering step of applying a low-pass filter and a high-pass filter to the input image in a vertical direction and a horizontal direction to generate subbands, and hierarchically filtering the subbands of the low-frequency components;
A code block generation step of dividing the subband generated in the filtering step and generating a code block of a predetermined size;
A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
An encoding target prediction step for predicting the encoding target number to be encoded and generating encoding target number information;
Encoding that performs encoding only for the number of encoding targets given by the number of encoding targets information from the most significant bit side of each code block among the encoding targets generated in the encoding target prediction step A process,
In the encoding target prediction step, the number of effective bit planes excluding zero bit planes is counted for all code blocks in the subband of the input image, and a predetermined table is referred to based on the counting result, An image encoding method for obtaining the number of encoding objects to be encoded for each band.

In a program for causing a computer to execute a predetermined process,
A filtering step of applying a low-pass filter and a high-pass filter to the input image in the vertical direction and the horizontal direction to generate subbands and hierarchically filtering the subbands of the low-frequency components;
A code block generation step of dividing the subband generated in the filtering step and generating a code block of a predetermined size;
A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
An encoding target prediction step for predicting the encoding target number to be encoded and generating encoding target number information;
An encoding step for encoding only the number of encoding targets given by the encoding target number information from the most significant bit side of each code block among the encoding targets generated in the encoding target prediction step And
In the encoding target prediction step, the number of effective bit planes excluding zero bit planes is counted for all code blocks in the frame of the input image, and a predetermined table is referred to based on the counting result, so that A program for obtaining the number of encoding objects to be encoded.

In a computer-readable recording medium on which a program for causing a computer to execute predetermined processing is recorded,
A filtering step of applying a low-pass filter and a high-pass filter to the input image in a vertical direction and a horizontal direction to generate subbands, and hierarchically filtering the subbands of the low-frequency components;
A code block generation step of dividing the subband generated in the filtering step and generating a code block of a predetermined size;
A bit plane generating step for generating a bit plane from the most significant bit to the least significant bit in the code block unit;
An encoding target prediction step for predicting the encoding target number to be encoded and generating encoding target number information;
An encoding step of encoding only the number of encoding targets given by the encoding target number information from the most significant bit side of each code block among the encoding targets generated in the encoding target prediction step And
In the encoding target prediction step, the number of effective bit planes excluding zero bit planes is counted for all code blocks in the frame of the input image, and a predetermined table is referred to based on the counting result, so that A recording medium on which is recorded a program for determining the number of encoding targets to be encoded.