JP2005167655A

JP2005167655A - Conversion coding method and conversion decoding method

Info

Publication number: JP2005167655A
Application number: JP2003404073A
Authority: JP
Inventors: Toshiyuki Kondo; 敏志近藤; Toshiro Sasai; 寿郎笹井
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-12-03
Filing date: 2003-12-03
Publication date: 2005-06-23

Abstract

<P>PROBLEM TO BE SOLVED: To more improve the coding efficiency for coding after switching over DCT and MP in block units. <P>SOLUTION: The method is composed of a means 101 for dividing image signals composed of a plurality of regions into image blocks with inputs of the image signals and region information, a first and second conversion means 102, 103 for converting the image blocks into code series, a means 104 for selecting either of code series generated by the first and second conversion means 102, 103, and a means 105 for combining the code series selected by the selecting means 104 to generate an output code series. The first conversion means 102 executes a conversion process after determining a conversion base function, based on the region information, and the second conversion means 103 executes a conversion process after determining the conversion base function, irrespective of the region information. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、画像信号を符号化または復号化する際に用いる変換符号化方法および変換復号化に関するものである。 The present invention relates to a transform encoding method and transform decoding used when encoding or decoding an image signal.

ＪＰＥＧ方式、ＭＰＥＧビデオ方式に代表される従来の画像符号化方式においては、画面を予め定められた単位に分割し、その分割単位で符号化を行う。例えば、ＭＰＥＧ−１、ＭＰＥＧ−２方式においては、マクロブロックと呼ばれる水平１６画素、垂直１６画素の単位で動き補償を行う。そして、動き補償後の残差信号に対しては、画面中の各マクロブロックに含まれるブロックと呼ばれる水平８画素、垂直８画素の単位で離散コサイン変換（以下ＤＣＴと略す）を行い、変換係数に量子化等の処理を行うことにより、最終的な符号列を得る。 In a conventional image coding system represented by the JPEG system and the MPEG video system, a screen is divided into predetermined units, and encoding is performed in the divided units. For example, in the MPEG-1 and MPEG-2 systems, motion compensation is performed in units of 16 horizontal pixels and 16 vertical pixels called macroblocks. The residual signal after motion compensation is subjected to discrete cosine transform (hereinafter abbreviated as DCT) in units of 8 horizontal pixels and 8 vertical pixels called blocks included in each macroblock in the screen, and conversion coefficients are obtained. By performing processing such as quantization, a final code string is obtained.

これに対して近年、低ビットレート用の変換符号化方式として、マッチングパースート（Matching Pursuit、以下ＭＰと略す）方式が提案されている（非特許文献１）。非特許文献１で提案されているＭＰ方式では、動き補償後の残差信号を符号化する際に、予め定められた多数の変換基底（非直交変換基底）を用いて、画面内の任意の位置に一つの変換基底を置く処理を繰り返すことにより符号化する方法である。 On the other hand, in recent years, a matching pursuit (hereinafter abbreviated as MP) method has been proposed as a transform coding method for a low bit rate (Non-patent Document 1). In the MP system proposed in Non-Patent Document 1, when encoding a residual signal after motion compensation, a predetermined number of transform bases (non-orthogonal transform bases) are used, This is a method of encoding by repeating the process of placing one transformation base at a position.

ＤＣＴとＭＰとを比較すると、ＤＣＴは高ビットレートでの符号化効率が高く、ＭＰは低ビットレートでの符号化効率が高くなる。そこでこの両者の特性を活かすために、ブロック単位で、ＤＣＴとＭＰを切り替える方式が提案されている（非特許文献２）。
R.Neff et. al., "Very Low Bit-Rate Video Coding Based on Matching Persuits", IEEE Trans. on Circuits and Systems for Video Technology, Vol.７, No.１, pp. １５８-１７１, Feb. １９９７. S.Valente, "Video coding ecology using DCT and block-restricted matching pursuit," International Conference on Image Processing, ２００３. Comparing DCT and MP, DCT has high encoding efficiency at a high bit rate, and MP has high encoding efficiency at a low bit rate. Therefore, in order to make use of the characteristics of both, a method of switching between DCT and MP in block units has been proposed (Non-Patent Document 2).
R. Neff et. Al., "Very Low Bit-Rate Video Coding Based on Matching Persuits", IEEE Trans. On Circuits and Systems for Video Technology, Vol. 7, No. 1, pp. 158-171, Feb. 1997 . S. Valente, "Video coding ecology using DCT and block-restricted matching pursuit," International Conference on Image Processing, 2003.

しかしながら、上記従来の方法では、画像をすべて矩形ブロックに分割して変換処理を行う。ＤＣＴを用いた場合、一般に異なる領域（異なる物体や異なる動き）の情報がＤＣＴブロック内に存在すると、高周波数成分が増加することから、符号化効率が低下することが知られている。
本発明は、上記従来の課題を解決するものであり、矩形ブロックに分割して、ブロック単位でＤＣＴとＭＰを切り替えて符号化を行う際に、さらに符号化効率を向上させることができる変換符号化方法および変換復号化方法を提供することを目的とする。 However, in the above conventional method, the conversion process is performed by dividing the entire image into rectangular blocks. When DCT is used, it is generally known that when information on different regions (different objects and different motions) exists in a DCT block, high frequency components increase, and thus encoding efficiency decreases.
The present invention solves the above-described conventional problems, and transform codes that can further improve the coding efficiency when divided into rectangular blocks and switched between DCT and MP for each block. It is an object to provide a conversion method and a conversion decoding method.

この課題を解決するために、第１の発明は、Ｍ個の領域からなる画像信号と前記領域を示す領域情報とを入力とし、前記画像信号を画像ブロックに分割する分割ステップと、前記画像ブロックを符号列に変換するＮ個（Ｎ≧２）の変換ステップと、前記Ｎ個の変換ステップにより生成されたＮ個の符号列から、１つの符号列を選択する選択ステップと、前記選択ステップにより選択された符号列を合成し、出力符号列を生成する符号列生成ステップと、を含み、前記Ｎ個の変換ステップは、前記領域情報に基づいて変換基底関数を決定して変換処理を行う第１の変換ステップと、前記領域情報に関係なく変換基底関数を決定して変換処理を行う第２の変換ステップと、を含むことを特徴する変換符号化方法である。 In order to solve this problem, according to a first aspect of the present invention, an image signal composed of M regions and region information indicating the region are input, and a dividing step of dividing the image signal into image blocks; N (N ≧ 2) conversion steps for converting the code string into a code sequence, a selection step for selecting one code sequence from the N code sequences generated by the N conversion steps, and the selection step A code sequence generation step of synthesizing the selected code sequences and generating an output code sequence, wherein the N number of conversion steps determine a conversion basis function based on the region information and perform a conversion process 1 is a transform coding method characterized by including a transform step of 1 and a second transform step of performing transform processing by determining a transform basis function regardless of the region information.

第２の発明は、Ｍ個の領域からなる画像信号をＮ個（Ｎ≧２）の変換方法を用いて符号化し生成した符号列と、前記領域を示す領域情報とを入力とし、前記符号列中に記述された前記変換方法を示す情報に基づいて、前記符号列を分割する符号列分割ステップと、前記符号列分割ステップにより分割された符号列に対して、逆変換を施し復号化画像を生成するＮ個の逆変換ステップと、前記逆変換ステップにより生成された復号化画像を合成する合成ステップとを含み、前記Ｎ個の逆変換ステップは、前記領域情報に基づいて変換基底関数を決定して逆変換処理を行う第１の逆変換ステップと、前記領域情報に関係なく変換基底関数を決定して逆変換処理を行う第２の逆変換ステップと、を含むことを特徴する変換復号化方法である。 According to a second aspect of the present invention, a code string generated by encoding an image signal composed of M areas using N (N ≧ 2) conversion methods and area information indicating the area are input, and the code string is input. A code string dividing step for dividing the code string based on the information indicating the conversion method described therein, and reverse decoding the code string divided by the code string dividing step to obtain a decoded image N inverse transformation steps to be generated and a synthesis step for synthesizing the decoded image generated by the inverse transformation step, wherein the N inverse transformation steps determine transformation basis functions based on the region information And a first inverse transform step for performing an inverse transform process and a second inverse transform step for determining a transform basis function regardless of the region information and performing an inverse transform process. Is the method.

以上の様に、本発明の変換符号化方法、変換復号化方法においては、領域情報を有する画像信号を符号化する際に、ブロック単位に分割し、２つの変換方法で符号化を行う。第１の変換方法は、ブロック内の領域毎に直交変換を行って符号化を行い、第２の変換方法は、ブロック内を領域に分割せずに、双直交変換を行って符号化（ＭＰ符号化）を行う。そして、ブロック毎に符号化効率が高くなる方を選択し、その符号列を出力する。一般に、符号化レートが高くなると直交変換の符号化効率が高く、また符号化レートが低くなると双直交変換の符号化効率が高くなる。また、動き補償後の残差信号に変換符号化を施す場合、残差成分が多い場合には直交変換の方が符号化効率が高く、残差成分が少ない場合には双直交変換の方が符号化効率が高くなる。そのため、上記の処理を行うことにより、低符号化レートから高符号化レートまで、幅広い符号化レートで高い符号化効率を達成することができ、ブロックの特性に応じて符号化効率の高い変換方法を選択することができる。また、直交変換においては、異なる領域の画像が一つのブロック内に入ると符号化効率が落ちることが知られているが、同じブロックに属する領域毎に直交変換を施すことにより、直交変換時の符号化効率を高くすることができる。このように本発明の変換符号化方法、変換復号化方法はその実用的価値が高い。 As described above, in the transform coding method and transform decoding method of the present invention, when an image signal having region information is encoded, it is divided into blocks and encoded by two transform methods. The first transform method performs encoding by performing orthogonal transform for each region in the block, and the second transform method performs encoding by performing bi-orthogonal transform without dividing the block into regions (MP Encoding). Then, the block having the higher coding efficiency is selected for each block, and the code string is output. In general, the encoding efficiency of orthogonal transform increases when the encoding rate increases, and the encoding efficiency of bi-orthogonal transform increases when the encoding rate decreases. Also, when transform coding is applied to the residual signal after motion compensation, when there are many residual components, orthogonal transform has higher coding efficiency, and when there are few residual components, bi-orthogonal transform is more efficient. Encoding efficiency is increased. Therefore, by performing the above processing, a high encoding efficiency can be achieved at a wide range of encoding rates from a low encoding rate to a high encoding rate, and a conversion method having a high encoding efficiency according to block characteristics. Can be selected. In addition, in orthogonal transform, it is known that coding efficiency decreases when images of different regions fall within one block. However, by performing orthogonal transform for each region belonging to the same block, Encoding efficiency can be increased. Thus, the transform encoding method and transform decoding method of the present invention have high practical value.

以下、本発明の実施の形態について、図１から図１３を用いて説明する。
（実施の形態１）
図１は、本発明の変換符号化方法を用いた動画像符号化装置１００のブロック図であり、分割手段１０１、第１の変換手段１０２、第２の変換手段１０３、選択手段１０４、符号列生成手段１０５から構成される。 Hereinafter, embodiments of the present invention will be described with reference to FIGS.
(Embodiment 1)
FIG. 1 is a block diagram of a moving picture coding apparatus 100 using the transform coding method of the present invention, in which a dividing unit 101, a first transform unit 102, a second transform unit 103, a selection unit 104, a code string The generating unit 105 is configured.

画像信号と領域情報とは分割手段１０１に入力される。ここで、画像信号は通常の画像信号、または動き補償後の残差信号である。画像信号と領域情報の例を図２（ａ）、図２（ｂ）に示す。領域情報は、画像信号を物体毎の領域や、動境界で分割した領域等を示す情報である。ここでは、図２（ｂ）に示すように、領域２０１〜領域２０４の４つの領域に分割されているものとする。 The image signal and the area information are input to the dividing unit 101. Here, the image signal is a normal image signal or a residual signal after motion compensation. Examples of image signals and area information are shown in FIGS. 2 (a) and 2 (b). The area information is information indicating an area for each object, an area obtained by dividing the image signal at a moving boundary, and the like. Here, as shown in FIG. 2B, it is assumed that the area is divided into four areas 201 to 204.

分割手段１０１では、所定の大きさで、入力された画像信号を分割する。ここでは画像信号をブロック形状に分割するとし、水平Ｎ画素×垂直Ｎ画素のブロックに分割を行う。例えばＮとしては４、８、１６等の値を取ることが出来る。分割手段１０１は、各ブロックの画像信号Ｐを第１の変換手段１０２と第２の変換手段１０３とに出力する。また、各ブロックに対する領域情報Ｒを第１の変換手段１０２に対して出力する。 The dividing unit 101 divides the input image signal with a predetermined size. Here, the image signal is divided into block shapes, and is divided into blocks of horizontal N pixels × vertical N pixels. For example, N can take values of 4, 8, 16, and the like. The dividing unit 101 outputs the image signal P of each block to the first conversion unit 102 and the second conversion unit 103. Further, the area information R for each block is output to the first conversion means 102.

第１の変換手段１０２での処理について説明する。
図３は、第１の変換手段１０２の構成を示すブロック図である。第１の変換手段１０２は、垂直方向変換手段３０１、水平方向変換手段３０２、垂直方向変換基底生成手段３０３、水平方向変換基底生成手段３０４、水平方向逆変換手段３０５、垂直方向逆変換手段３０６、可変長符号化手段３０７、量子化手段３０８、逆量子化手段３０９とから構成される。 Processing in the first conversion unit 102 will be described.
FIG. 3 is a block diagram showing the configuration of the first conversion means 102. The first conversion means 102 includes a vertical direction conversion means 301, a horizontal direction conversion means 302, a vertical direction conversion base generation means 303, a horizontal direction conversion base generation means 304, a horizontal direction reverse conversion means 305, a vertical direction reverse conversion means 306, The variable length encoding unit 307, the quantization unit 308, and the inverse quantization unit 309 are configured.

図２（ｃ）は、図２（ｂ）の領域２０３の近辺を拡大した図であり、領域２０３の近辺は波線で示すようにブロックに分割されているとする。今、図２（ｃ）に示すブロック２１０が入力されるとする。また、この場合の領域情報を図４（ａ）に示す。図４（ａ）において、各正方形は画素を示しており、水平８画素、垂直８画素のブロックである事を示している。また、白の部分が領域２０１を、黒の部分が領域２０３を示している。 FIG. 2C is an enlarged view of the vicinity of the area 203 in FIG. 2B, and it is assumed that the vicinity of the area 203 is divided into blocks as indicated by a wavy line. Assume that the block 210 shown in FIG. 2C is input. Also, the area information in this case is shown in FIG. In FIG. 4A, each square indicates a pixel, which indicates a block of 8 horizontal pixels and 8 vertical pixels. Further, the white portion indicates the region 201 and the black portion indicates the region 203.

第１の変換手段１０２では、ブロック２１０のうち、領域２０１の画素と領域２０３の画素とを別に符号化する。ここでは領域２０３の画素の符号化方法について説明する。
垂直方向変換基底生成手段３０３では、ブロック２１０内の領域２０３の画素に対して、垂直方向の変換係数を求めるための変換基底を生成する。この場合、列毎に領域２０３に属する画素数を計数し、その結果に基づいて変換基底を生成する。例えば、列４０１については、領域２０３の画素は２個であるので、（数１）においてＭ＝２として変換基底を決定する。また、列４０２については、領域２０３の画素は７個であるので、（数１）においてＭ＝７として変換基底を決定する。ここで（数１）において、ｖは垂直方向の周波数成分であり、ｙはブロック内の垂直方向の空間座標を示す。 In the first conversion means 102, the pixels in the region 201 and the pixels in the region 203 in the block 210 are encoded separately. Here, a method for encoding the pixels in the region 203 will be described.
The vertical direction conversion base generation unit 303 generates a conversion base for obtaining a vertical conversion coefficient for the pixels in the area 203 in the block 210. In this case, the number of pixels belonging to the region 203 is counted for each column, and a conversion base is generated based on the result. For example, for column 401, since there are two pixels in region 203, the conversion base is determined with M = 2 in (Equation 1). For column 402, since there are seven pixels in region 203, the conversion base is determined as M = 7 in (Equation 1). Here, in (Equation 1), v is a frequency component in the vertical direction, and y is a spatial coordinate in the vertical direction in the block.

垂直方向変換手段３０１は、ブロック内の領域情報Ｒに基づいて、符号化対象画素（領域２０３の画素）を垂直方向にブロック端まで移動させる。この場合、図４（ｂ）が符号化対象画素を垂直方向にブロック端まで移動したブロックとなる。そして、垂直方向変換基底生成手段３０３から入力された垂直方向の変換基底を用いて、列毎に垂直方向の変換を行い、変換係数を生成し、それを水平方向変換手段３０２に対して出力する。 Based on the region information R in the block, the vertical direction conversion unit 301 moves the encoding target pixel (the pixel in the region 203) in the vertical direction to the block end. In this case, FIG. 4B shows a block in which the encoding target pixel is moved to the block end in the vertical direction. Then, the vertical conversion base input from the vertical direction conversion base generation unit 303 is used to perform vertical conversion for each column to generate a conversion coefficient, which is output to the horizontal direction conversion unit 302. .

水平方向変換基底生成手段３０４では、水平方向の変換係数を求めるための変換基底を決定する。この場合、領域情報Ｒに基づいて、図４（ｂ）の状態で行毎に領域２０３に属する垂直方向変換係数の数を計数し、その結果に基づいて行毎の変換基底を生成する。例えば、行４０３については、領域２０３の係数は５個であるので、（数２）においてＭ＝５として変換基底を決定する。また、列４０４については、領域２０３の係数は１個であるので、（数２）においてＭ＝１として変換基底を決定する。（数２）において、ｕは水平方向の周波数成分であり、ｘはブロック内の水平方向の空間座標を示す。 The horizontal direction conversion base generation unit 304 determines a conversion base for obtaining a horizontal direction conversion coefficient. In this case, based on the region information R, the number of vertical direction conversion coefficients belonging to the region 203 is counted for each row in the state of FIG. 4B, and a conversion basis for each row is generated based on the result. For example, for the row 403, since there are five coefficients in the region 203, the conversion base is determined with M = 5 in (Equation 2). For column 404, since there is only one coefficient in region 203, the conversion base is determined with M = 1 in (Equation 2). In (Expression 2), u is a frequency component in the horizontal direction, and x indicates a spatial coordinate in the horizontal direction in the block.

水平方向変換手段３０２は、垂直方向に変換した変換係数ブロックに対して、領域情報Ｒに基づいて、水平方向にブロック端まで移動させる。この場合、図４（ｃ）が垂直方向に変換された変換係数を水平方向にブロック端まで移動したブロックとなる。そして、水平方向変換基底生成手段３０４から入力された水平方向の変換基底を用いて、行毎に水平方向の変換を行い、２次元の変換係数を生成する。 The horizontal direction conversion unit 302 moves the conversion coefficient block converted in the vertical direction to the block end in the horizontal direction based on the region information R. In this case, FIG. 4C shows a block obtained by moving the conversion coefficient converted in the vertical direction to the block end in the horizontal direction. Then, using the horizontal conversion base input from the horizontal conversion base generation unit 304, horizontal conversion is performed for each row to generate a two-dimensional conversion coefficient.

以上の処理により、水平方向変換手段３０２では、２次元の変換係数が得られたことになり、この場合の係数は、図４（ｃ）における斜線部となる。
量子化手段３０８では、水平方向変換手段３０２で生成された２次元の変換係数に対して量子化処理が施され、量子化された変換係数ブロックＣ１が出力される。ここで量子化とは、変換係数を所定の値（量子化値）で除算する処理を指す。量子化値は、変換係数毎に異なっても良い。 With the above processing, the horizontal direction conversion means 302 has obtained a two-dimensional conversion coefficient, and the coefficient in this case is the shaded portion in FIG. 4C.
The quantizing unit 308 performs a quantization process on the two-dimensional transform coefficient generated by the horizontal direction transform unit 302, and outputs a quantized transform coefficient block C1. Here, quantization refers to a process of dividing the transform coefficient by a predetermined value (quantized value). The quantization value may be different for each transform coefficient.

逆量子化手段３０９では、逆量子化処理が施され、逆量子化された変換係数が水平方向逆変換手段３０５に対して出力される。ここで逆量子化とは、量子化された変換係数に量子化値を積算する処理を指す。
水平方向逆変換手段３０５は、ブロック２１０の領域２０３の変換係数ブロックに対して（この際の係数は図４（ｃ）の斜線部の位置にある）、水平方向変換基底生成手段３０４から入力された水平方向の変換基底を用いて、行毎に水平方向の逆変換を行い、逆変換係数ブロックを生成する。 The inverse quantization means 309 performs inverse quantization processing and outputs the inversely quantized transform coefficients to the horizontal direction inverse transform means 305. Here, the inverse quantization refers to a process of adding the quantized value to the quantized transform coefficient.
The horizontal inverse transform unit 305 is input from the horizontal direction transform base generation unit 304 with respect to the transform coefficient block in the area 203 of the block 210 (the coefficient at this time is located at the shaded portion in FIG. 4C). Using the horizontal transformation base, horizontal transformation is performed for each row to generate an inverse transformation coefficient block.

垂直方向逆変換手段３０６は、水平方向に逆変換した逆変換係数ブロックに対して、領域情報Ｒに基づいて、図４（ｂ）の状態となるように、係数を水平方向に移動させる。そして、垂直方向変換基底生成手段３０３から入力された垂直方向の変換基底を用いて、列毎に垂直方向の逆変換を行い、２次元の逆変換係数（復号画素値）を生成する。そして最後に、領域情報Ｒに基づいて、復号画素を垂直方向に移動させ、図４（ａ）の状態とする。これを復号ブロックＤ１として出力する。 Based on the region information R, the vertical direction inverse transform unit 306 moves the coefficients in the horizontal direction based on the region information R with respect to the inverse transform coefficient block that has been inversely transformed in the horizontal direction. Then, using the vertical direction conversion base input from the vertical direction conversion base generation unit 303, vertical conversion is performed for each column to generate a two-dimensional inverse conversion coefficient (decoded pixel value). Finally, based on the region information R, the decoded pixel is moved in the vertical direction to obtain the state shown in FIG. This is output as a decoding block D1.

可変長符号化手段３０７では、変換係数ブロックＣ１に対して可変長符号化を施す。この場合、図４（ｄ）に示すように、低周波数係数から高周波数係数に向かって係数の１次元化を行い、係数０の連続長と、非零係数値とを作成し、予め定めた符号帳や算術符号化を用いて符号列Ｓ１を生成し出力する。また、この際の符号量Ｂ１を出力する。
以上の処理は、ブロック２１０のうち、領域２０１の画素についても同様の処理が行われ、これによりブロック２１０に対する第１の変換手段１０２の処理が完了する。この場合に生成されるブロック２１０の符号列は図７（ａ）のようになる。 The variable length coding means 307 performs variable length coding on the transform coefficient block C1. In this case, as shown in FIG. 4 (d), the coefficients are made one-dimensionally from the low frequency coefficient to the high frequency coefficient, a continuous length of coefficient 0 and a non-zero coefficient value are created, and predetermined. A code string S1 is generated and output using a codebook or arithmetic coding. Also, the code amount B1 at this time is output.
In the above processing, the same processing is performed for the pixels in the area 201 in the block 210, thereby completing the processing of the first conversion unit 102 for the block 210. The code string of the block 210 generated in this case is as shown in FIG.

続いて、第２の変換手段１０３での処理について説明する。
図５は、第２の変換手段１０３の構成を示すブロック図である。第２の変換手段１０３は、変換手段５０１、量子化手段５０２、逆量子化手段５０３、逆変換手段５０４、変換基底群保持手段５０５、可変長符号化手段５０６、から構成される。
変換手段５０１には、各ブロックの画像信号Ｐが入力される。ここでは、図２（ｃ）に示すブロック２１０が入力されるとする。第２の変換手段１０３では、ブロック２１０のうち、領域２０１の画素と領域２０３の画素とを区別せずに符号化する。 Subsequently, processing in the second conversion unit 103 will be described.
FIG. 5 is a block diagram showing the configuration of the second conversion means 103. The second transforming unit 103 includes a transforming unit 501, a quantizing unit 502, an inverse quantizing unit 503, an inverse transforming unit 504, a transform base group holding unit 505, and a variable length coding unit 506.
The conversion unit 501 receives the image signal P of each block. Here, it is assumed that the block 210 shown in FIG. In the second conversion unit 103, the pixels in the area 201 and the pixels in the area 203 in the block 210 are encoded without being distinguished.

処理対象ブロック（ここではブロック２１０）は、変換手段５０１に入力され、変換手段５０１では、処理対象ブロックに対してＭＰ符号化を行う。変換手段５０１において用いる変換基底は、変換基底群保持手段５０５に保持されているものとする。ＭＰ符号化においては、双直交な基底関数が用いられる。例えば、双直交な基底関数の例としては、ガボール関数がある。 The processing target block (here, block 210) is input to the converting unit 501, and the converting unit 501 performs MP encoding on the processing target block. It is assumed that the conversion base used in the conversion unit 501 is held in the conversion base group holding unit 505. In MP encoding, bi-orthogonal basis functions are used. For example, an example of a bi-orthogonal basis function is a Gabor function.

第２の変換手段１０３の処理での処理方法を図６のフローチャートを用いて説明する。
図６のフローチャートにおいては、ブロックの未符号化画像信号（初期値は入力画像信号）をｆ（ｘ，ｙ）で示す。また、変換基底群保持手段５０５には、変換基底群ｇｋ（ｘ，ｙ）が保持されている。
ステップＳ６０１では、初期値としてｎを１に設定する。以降、終了条件を満たすまで、ステップＳ６０２からステップＳ６０４の処理を繰り返す。 A processing method in the processing of the second conversion means 103 will be described with reference to the flowchart of FIG.
In the flowchart of FIG. 6, an uncoded image signal (initial value is an input image signal) of a block is indicated by f (x, y). Further, the conversion base group holding unit 505 holds the conversion base group gk (x, y).
In step S601, n is set to 1 as an initial value. Thereafter, the processing from step S602 to step S604 is repeated until the end condition is satisfied.

ステップＳ６０２では、変換手段５０１において、画像信号ｆ（ｘ，ｙ）のブロック内の各位置（ｄｘ，ｄｙ）について、各変換基底ｇｋ'（ｘ，ｙ）に対する内積値を計算し、その値をα（ｋ，ｄｘ，ｄｙ）とする。ここで、変換基底ｇｋ'（ｘ，ｙ）は、変換基底群保持手段５０５で保持された変換基底群ｇｋ（ｘ，ｙ）に対して、ブロック内でのノルム値を正規化したものである。 In step S602, the conversion unit 501 calculates an inner product value for each conversion basis gk ′ (x, y) for each position (dx, dy) in the block of the image signal f (x, y), and calculates the value. Let α (k, dx, dy). Here, the transformation basis gk ′ (x, y) is obtained by normalizing the norm value in the block with respect to the transformation basis group gk (x, y) held by the transformation basis group holding means 505. .

ステップＳ６０３では、変換手段５０１において、α（ｋ，ｄｘ，ｄｙ）の中から最大値を求め、そのときの位置をｄｎ＝（ｄｎｘ，ｄｎｙ）、変換基底の番号をｋｎ、内積値をαｎとする。そして、位置ｄｎ、変換基底番号ｋｎを可変長符号化手段５０６に対して、内積値αｎを量子化手段５０２に対して出力する。
ステップＳ６０４では、量子化手段５０２において、ステップＳ６０３で選択された内積値αｎに対して量子化処理を行う。ここで量子化処理とは、予め定めた量子化値によって、内積値αｎを除算する処理のことを指す。量子化された内積値をｑｎとする。 In step S603, the conversion means 501 obtains the maximum value from α (k, dx, dy), the position at that time is dn = (dnx, dny), the conversion base number is kn, and the inner product value is αn. To do. The position dn and the transformation base number kn are output to the variable length encoding unit 506 and the inner product value αn is output to the quantization unit 502.
In step S604, the quantization unit 502 performs a quantization process on the inner product value αn selected in step S603. Here, the quantization process refers to a process of dividing the inner product value αn by a predetermined quantization value. Let qn be the quantized inner product value.

ステップＳ６０５では、逆量子化手段５０３において、量子化された内積値ｑｎに対して逆量子化処理を行う。ここで逆量子化処理とは、入力値に対する前記の量子化値の積を求める処理のことを指す。逆量子化された内積値をαｎ'とする。
ステップＳ６０６では、逆変換手段５０４において、ｇｋｎ（ｘ＋ｄｎｘ，ｙ＋ｄｎｙ）×αｎ'を求め、それをｆ'（ｘ，ｙ）として変換手段５０１に対して出力する。また、ｆ'（ｘ，ｙ）を累積加算し復号画像信号を生成する。 In step S605, the inverse quantization means 503 performs an inverse quantization process on the quantized inner product value qn. Here, the inverse quantization process refers to a process for obtaining a product of the quantization value with respect to an input value. The inversely quantized inner product value is αn ′.
In step S606, the inverse conversion unit 504 obtains gkn (x + dnx, y + dny) × αn ′ and outputs it to the conversion unit 501 as f ′ (x, y). Further, f ′ (x, y) is cumulatively added to generate a decoded image signal.

ステップＳ６０７では、ｆ（ｘ，ｙ）からｆ'（ｘ，ｙ）を引き、未符号化画像信号を更新する。
ステップ６０８では、可変長符号化手段５０６において、可変長符号化が施される。可変長符号化手段５０６では、変換手段５０１から出力された、位置ｄｎ、変換基底番号ｋｎ、および量子化手段５０２から出力された量子化された内積値ｑｎ（上記の３つを合わせてアトムと呼ぶ）に対して、可変長符号化を行って符号列を生成する。 In step S607, f ′ (x, y) is subtracted from f (x, y), and the uncoded image signal is updated.
In step 608, variable length coding is performed in the variable length coding means 506. The variable length encoding unit 506 outputs the position dn, the conversion base number kn output from the conversion unit 501, and the quantized inner product value qn output from the quantization unit 502. The code string is generated by performing variable length coding.

可変長符号化の第１の方法としては、変換手段５０１により選択した順に符号列中に記述する方法がある。この方法を用いることにより、復号化側で符号列の復号化中にエラーを検出した場合であっても、画質に対する影響を最小限に押さえることができる。なぜならば、変換手段５０１では、残差信号のエネルギーを最も小さくする事ができる順にアトムの選択がなされるからである。この場合の符号列のフォーマット例を図７（ｂ）に示す。またこの際、アトムの情報は、前の値との差分値を符号化しても良い。例えば、位置の差分、基底番号の差分、内積値の差分である。これにより、符号化効率の向上を図ることができる。 As a first method of variable length coding, there is a method of describing in the code string in the order selected by the conversion means 501. By using this method, even when an error is detected during decoding of the code string on the decoding side, the influence on the image quality can be minimized. This is because the conversion unit 501 selects atoms in the order in which the energy of the residual signal can be minimized. An example of the format of the code string in this case is shown in FIG. At this time, as the atom information, a difference value from the previous value may be encoded. For example, a position difference, a base number difference, and an inner product value difference. Thereby, the encoding efficiency can be improved.

可変長符号化の第２の方法としては、ブロック内の位置関係に基づいた所定順にアトムを並べ直す。順序としては、ブロック内で左上から右下に向かってのラスタ順等がある。そして並べ直した順序で、アトムの情報を可変長符号化する。ブロック内の各画素位置に対して、アトムの個数、アトムの個数分の変換基底番号と内積値の組、を記述した場合の符号列のフォーマット例を図７（ｃ）に示す。また、ブロック内の各画素位置に対して、アトムが存在する画素位置についてのみ、アトムの個数、アトムの個数分の変換基底番号と内積値の組、を記述した場合の符号列のフォーマット例を図７（ｄ）に示す。この場合、現在の画素位置から、アトムが存在する次の画素位置までの画素の個数を記述する。図７（ｃ）、図７（ｄ）に示すような可変長符号化列フォーマットを用いることによって、位置情報の符号量を削減することができ、符号化効率を向上することができる。 As a second method of variable length coding, atoms are rearranged in a predetermined order based on the positional relationship within the block. The order includes raster order from the upper left to the lower right in the block. Then, the atom information is variable-length encoded in the rearranged order. FIG. 7C shows a code string format example in which the number of atoms and a set of conversion base numbers and inner product values corresponding to the number of atoms are described for each pixel position in the block. Also, for each pixel position in the block, only for the pixel position where an atom exists, a code string format example in which the number of atoms and a set of conversion base numbers and inner product values for the number of atoms are described As shown in FIG. In this case, the number of pixels from the current pixel position to the next pixel position where the atom exists is described. By using a variable-length coded sequence format as shown in FIGS. 7C and 7D, the amount of code of position information can be reduced, and coding efficiency can be improved.

ステップＳ６０９では、終了条件を満たすか否かの判定を行う。終了条件としては、例えば、所定の回数だけステップＳ６０２〜Ｓ６０８の処理を行った場合や（これはｎの値により判定できる）、未符号化画像信号のエネルギーが所定値を下回った場合（これは未符号化画像信号のエネルギーが所定のしきい値を下回った場合で判定する）、発生符号量が所定量に達した場合（これは可変長符号化手段５０６により生成した符号列の符号量を計数した結果により判定する）、１ビットの情報量に対する１回の符号化処理による未符号化画像信号のエネルギーの減少値が所定値を下回った場合（これは未符号化画像信号のエネルギーと符号列の符号量を計数結果から判定する）、等がある。終了条件を満たさなければ、ステップＳ６１０でｎを１増加させた後、再びステップＳ６０２以降の処理を繰り返す。終了条件を満たせばステップ６１１の処理を行う。 In step S609, it is determined whether an end condition is satisfied. As an end condition, for example, when the processes of steps S602 to S608 are performed a predetermined number of times (this can be determined by the value of n), or when the energy of the unencoded image signal falls below a predetermined value (this is (Determined when the energy of the unencoded image signal falls below a predetermined threshold value), when the generated code amount reaches a predetermined amount (this is the code amount of the code string generated by the variable length encoding means 506) (Determined based on the counting result) When the decrease value of the energy of the unencoded image signal by one encoding process with respect to the amount of information of 1 bit falls below a predetermined value (this is the energy and code of the unencoded image signal) The code amount of the column is determined from the counting result). If the end condition is not satisfied, n is incremented by 1 in step S610, and then the processing from step S602 is repeated. If the end condition is satisfied, the process of step 611 is performed.

ステップ６１１では、逆変換手段５０４において生成した復号画像信号をＤ２として出力する。また、可変長符号化手段５０６で生成した符号列をＳ２として、符号列の符号量をＢ２として出力する。
選択手段１０４には、処理対象ブロックの入力画像Ｐと、第１の変換手段１０２により処理した処理対象ブロックの復号化画像Ｄ１、符号列Ｓ１、符号量Ｂ１と、第２の変換手段１０３により処理した処理対象ブロックの復号化画像Ｄ２、符号列Ｓ２、符号量Ｂ２とが入力される。そして、いずれかの符号列を選択する。 In step 611, the decoded image signal generated by the inverse transform means 504 is output as D2. Also, the code string generated by the variable length coding means 506 is output as S2, and the code amount of the code string is output as B2.
The selection unit 104 includes an input image P of the processing target block, a decoded image D1 of the processing target block processed by the first conversion unit 102, a code string S1, a code amount B1, and a processing performed by the second conversion unit 103. The decoded image D2, the code string S2, and the code amount B2 of the processed block are input. Then, one of the code strings is selected.

第１の選択方法としては、処理対象ブロックの入力画像Ｐと、復号化画像Ｄ１、Ｄ２との誤差エネルギーを求め、その誤差エネルギーが小さい方を選択する。
第２の選択方法としては、処理対象ブロックの入力画像Ｐと、復号化画像Ｄ１、Ｄ２との誤差エネルギーを求め（それらをそれぞれＥ１、Ｅ２とする）、Ｅ１＋λＢ１とＥ２＋λＢ２とを計算し、その値の小さい方を選択する。ここでλは重み係数である。 As a first selection method, the error energy between the input image P of the processing target block and the decoded images D1 and D2 is obtained, and the one having the smaller error energy is selected.
As a second selection method, error energies between the input image P of the block to be processed and the decoded images D1 and D2 are obtained (referred to as E1 and E2, respectively), E1 + λB1 and E2 + λB2 are calculated, and their values are calculated. Select the smaller one. Here, λ is a weighting coefficient.

選択された方の符号列Ｓと、第１の変換手段１０２の出力と第２の変換手段１０３の出力とのいずれを選択したかという選択情報Ｌとは、符号列生成手段１０５に対して出力される。
符号列生成手段１０５では、選択情報Ｌと符号列Ｓとを接続して、処理対象ブロックの符号列とする。符号列のフォーマット例を図８に示す。 The selected code sequence S and selection information L indicating which one of the output of the first conversion unit 102 and the output of the second conversion unit 103 is selected are output to the code sequence generation unit 105. Is done.
In the code string generation means 105, the selection information L and the code string S are connected to form a code string of the processing target block. A format example of the code string is shown in FIG.

以上のように、本発明の変換符号化方法においては、領域情報を有する画像信号を符号化する際に、ブロック単位に分割し、２つの変換方法で符号化を行う。第１の変換方法は、ブロック内の領域毎に直交変換を行って符号化を行い、第２の変換方法は、ブロック内を領域に分割せずに、双直交変換を行って符号化（ＭＰ符号化）を行う。そして、ブロック毎に符号化効率が高くなる方を選択し、その符号列を出力する。 As described above, in the transform encoding method of the present invention, when encoding an image signal having region information, the image signal is divided into blocks and encoded by two transform methods. The first transform method performs encoding by performing orthogonal transform for each region in the block, and the second transform method performs encoding by performing bi-orthogonal transform without dividing the block into regions (MP Encoding). Then, the block having the higher coding efficiency is selected for each block, and the code string is output.

一般に、符号化レートが高くなると直交変換の符号化効率が高く、また符号化レートが低くなると双直交変換の符号化効率が高くなる。また、動き補償後の残差信号に変換符号化を施す場合、残差成分が多い場合には直交変換の方が符号化効率が高く、残差成分が少ない場合には双直交変換の方が符号化効率が高くなる。そのため、上記の処理を行うことにより、低符号化レートから高符号化レートまで、幅広い符号化レートで高い符号化効率を達成することができ、ブロックの特性に応じて符号化効率の高い変換方法を選択することができる。また、直交変換においては、異なる領域の画像が一つのブロック内に入ると符号化効率が落ちることが知られているが、同じブロックに属する領域毎に直交変換を施すことにより、直交変換時の符号化効率を高くすることができる。 In general, the encoding efficiency of orthogonal transform increases when the encoding rate increases, and the encoding efficiency of bi-orthogonal transform increases when the encoding rate decreases. Also, when transform coding is applied to the residual signal after motion compensation, when there are many residual components, orthogonal transform has higher coding efficiency, and when there are few residual components, bi-orthogonal transform is more efficient. Encoding efficiency is increased. Therefore, by performing the above processing, a high encoding efficiency can be achieved at a wide range of encoding rates from a low encoding rate to a high encoding rate, and a conversion method having a high encoding efficiency according to block characteristics. Can be selected. In addition, in orthogonal transform, it is known that coding efficiency decreases when images of different regions fall within one block. However, by performing orthogonal transform for each region belonging to the same block, Encoding efficiency can be increased.

なお、本発明の実施の形態においては、第１の変換手段の変換基底として、離散コサイン変換基底を用いて説明したが、これは他の変換基底であっても良い。例えば、他の変換基底としては、離散フーリエ変換、離散サイン変換、アダマール変換等の変換基底がある。
また、本発明の実施の形態においては、ブロック内に存在する領域毎に変換係数を求める場合について説明したが、これは例えばブロック内で存在する画素数が少ない領域に対しては、その領域に対する変換係数を求めなくても良い。これは、ブロック内に存在するその領域の画素数が少なければ、領域別に符号化しなくても画質劣化は小さいと考えられ、かつその領域の変換係数を求めないことにより、符号量を減らすことができるためである。このような処理を行うためには、画素数に対するしきい値を設定しておき、ブロック内のある領域の画素数がそのしきい値以上であれば、変換処理を行い、しきい値未満であれば変換処理を施さない、とすれば良い。この場合、符号化側と復号化側とでしきい値を予め設定しておくことや、符号列中にそのしきい値を記述することにより、どのように各ブロックを処理するかは、領域情報から自動的に判定することができる。 In the embodiment of the present invention, the discrete cosine transform base is used as the transform base of the first transform unit, but this may be another transform base. For example, other transform bases include transform bases such as discrete Fourier transform, discrete sine transform, and Hadamard transform.
In the embodiment of the present invention, the case where the transform coefficient is obtained for each area existing in the block has been described. For example, for an area having a small number of pixels existing in the block, It is not necessary to obtain the conversion coefficient. This is because if the number of pixels in the area existing in the block is small, the image quality degradation is considered to be small even if the coding is not performed for each area, and the code amount can be reduced by not obtaining the conversion coefficient of the area. This is because it can. In order to perform such processing, a threshold for the number of pixels is set, and if the number of pixels in a certain area in the block is equal to or greater than the threshold, conversion processing is performed. If there is, conversion processing is not performed. In this case, how to process each block by setting a threshold value in advance on the encoding side and the decoding side, or describing the threshold value in the code string depends on the region. It can be automatically determined from the information.

また、本発明の実施の形態においては、ブロック毎に第１の変換手段１０２と、第２の変換手段１０３のいずれかを選択して処理を行う場合について説明したが、これはカラー画像信号の輝度成分についてのみ行い、色差成分に対しては第１の変換手段１０２または第２の変換手段１０３のいずれかのみを用いて符号化処理をしても良い。これは、輝度成分と比較して、色差成分は画質劣化が目につきにくい特性を有しているためであり、処理量の削減を図ることができる。 In the embodiment of the present invention, the case has been described where one of the first conversion unit 102 and the second conversion unit 103 is selected and processed for each block. Only the luminance component may be performed, and the color difference component may be encoded using only the first conversion unit 102 or the second conversion unit 103. This is because the color difference component has a characteristic that image quality deterioration is less noticeable than the luminance component, and the processing amount can be reduced.

また、本発明の実施の形態においては、第１の変換手段１０２と、第２の変換手段１０３の２つの変換手段を用いて各ブロックを処理し、符号化効率の高い変換手段を選択する場合について説明したが、変換手段は３つ以上あっても良い。例えば、第３の変換手段として、ブロック内の領域を考慮せずにブロック全体を離散コサイン変換等の直交変換により変換する方法を用いることができる。これにより、さらに処理量は増加するが、さらに符号化効率の向上を図ることができる。 Further, in the embodiment of the present invention, each block is processed using two conversion units of the first conversion unit 102 and the second conversion unit 103, and a conversion unit with high coding efficiency is selected. However, there may be three or more conversion means. For example, as the third transforming unit, a method of transforming the entire block by orthogonal transform such as discrete cosine transform without considering the region in the block can be used. As a result, the processing amount is further increased, but the encoding efficiency can be further improved.

また、本実施の形態においては、第２の変換手段１０３では、変換基底群保持手段５０５が保持している変換基底群を用いて、変換手段５０１で変換処理を行う場合について説明したが、これは変換手段５０１での変換基底の選択頻度によって、アトム情報についての可変長符号化方法を適応的に変換させても良い。例えば、選択手段１０４が、選択手段１０４で第１の変換手段１０２と第２の変換手段１０３の生成した符号列のうち、いずれが選択されかを示す選択情報Ｌを第２の変換手段１０３に対して出力する。そして、第２の変換手段１０３では、選択情報Ｌを可変長符号化手段５０６で受け、第２の変換手段１０３の生成した符号列が選択された場合に、その符号列を生成する際に用いたアトムの変換基底番号を計数する。この動作を所定期間（例えば１フレーム期間）行い、その期間内で選択された回数が多い変換基底番号に対しては、次の期間では符号長が短くなるように符号化方法や符号化テーブルを適応的に変更する。この処理は、変換基底番号の計数を各ブロック毎に計数を行い、所定期間（例えば１フレーム分、フレーム先頭から、シーケンス先頭から）の累積値を用いて、ブロック単位で符号化方法や符号化テーブルを適応的に変更しても良い。このような処理を行うことにより、時間的に隣接したフレームや、現在のフレームで用いられた変換基底番号の分布に従って、符号化方法や符号化テーブルを適応的に変更するため、現在のフレームやブロックを符号化する際に選択される可能性が高い変換基底番号に対して、より短い符号を割り当てることができ、全体の符号量の削減を図ることができる。また、選択手段１０４で第２の変換手段１０３の生成した符号列を選択した場合にのみ、使用した変換基底番号の計数を行うので、符号列中にどのように符号化方法や符号化テーブルを変更したかを記述する必要がなく、付加情報のための符号は不要である。また、このように適応的に、可変長符号化方法や可変長符号化テーブルを変更しながら符号化した場合、定期的に可変長符号化方法や可変長符号化テーブルの再初期化を行っても良い。また、このように適応的に、可変長符号化方法や可変長符号化テーブルを変更しながら符号化したかどうかを示すフラグを符号列のヘッダ中に記述しても良い。これにより、符号列の伝送中等にエラーが発生し、符号列を途中から（例えば次のフレームの先頭から）復号化する場合に、再初期化されているフレームまでは復号化を中止し、再初期化されているフレームから復号化を再開することにより、正しく復号化処理を行うことができる。 In the present embodiment, the second conversion unit 103 has described the case where the conversion unit 501 performs the conversion process using the conversion base group held by the conversion base group holding unit 505. The variable length coding method for atom information may be adaptively converted according to the selection frequency of the conversion base in the conversion means 501. For example, the selection unit 104 sends selection information L indicating which one of the code strings generated by the selection unit 104 generated by the first conversion unit 102 and the second conversion unit 103 to the second conversion unit 103. Output. The second conversion unit 103 receives the selection information L by the variable length encoding unit 506, and when the code sequence generated by the second conversion unit 103 is selected, the second conversion unit 103 is used to generate the code sequence. Count the converted base number of the atom. This operation is performed for a predetermined period (for example, one frame period), and for a conversion base number that is selected a large number of times within that period, an encoding method and an encoding table are set so that the code length is shortened in the next period. Change adaptively. In this process, the conversion base number is counted for each block, and an encoding method or encoding is performed on a block-by-block basis using a cumulative value for a predetermined period (for example, one frame, from the frame head to the sequence head). The table may be changed adaptively. By performing such processing, the encoding method and the encoding table are adaptively changed according to the temporally adjacent frames and the distribution of the transform base numbers used in the current frame. A shorter code can be assigned to a transform base number that is highly likely to be selected when coding a block, and the overall code amount can be reduced. In addition, since the conversion base number used is counted only when the selection unit 104 selects the code sequence generated by the second conversion unit 103, the encoding method and the encoding table can be changed in the code sequence. It is not necessary to describe whether it has been changed, and a code for additional information is not necessary. In addition, when encoding is performed while changing the variable length coding method and the variable length coding table adaptively, the variable length coding method and the variable length coding table are periodically reinitialized. Also good. In addition, a flag indicating whether or not encoding is performed while changing the variable length encoding method or the variable length encoding table may be described adaptively in the header of the code string. As a result, an error occurs during transmission of the code string, and when the code string is decoded halfway (for example, from the beginning of the next frame), the decoding is stopped until the reinitialized frame, By restarting decoding from the initialized frame, the decoding process can be performed correctly.

また、本発明の実施の形態においては、各ブロックを第１の変換方法と第２の変換方法とを用いて符号化する場合について説明したが、これはカラー画像信号の輝度成分についてのみ行い、色差成分に対しては一方の変換方法のみを用いて符号化処理をしても良い。これは、輝度成分と比較して、色差成分は画質劣化が目につきにくい特性を有しているためであり、色差成分は１つの変換方法のみを用いることにより、符号量は削減することができながら、画質劣化も最小限に押さえることができる。この場合の符号化装置の構成を図９に示す。図９においては、色差成分を第１の変換手段１０２により処理する場合の構成を示している。 Further, in the embodiment of the present invention, the case where each block is encoded using the first conversion method and the second conversion method has been described, but this is performed only for the luminance component of the color image signal, The color difference component may be encoded using only one conversion method. This is because the color difference component has a characteristic that image quality deterioration is less noticeable than the luminance component, and the code amount can be reduced by using only one conversion method for the color difference component. However, image quality degradation can be minimized. FIG. 9 shows the configuration of the encoding apparatus in this case. FIG. 9 shows a configuration in the case where the color difference component is processed by the first conversion means 102.

（実施の形態２）
図１０は、本発明の変換復号化方法を用いた動画像復号化装置１０００のブロック図であり符号列分離手段１００１、第１の逆変換手段１００２、第２の逆変換手段１００３、合成手段１００４から構成される。 (Embodiment 2)
FIG. 10 is a block diagram of the moving picture decoding apparatus 1000 using the transform decoding method of the present invention. The code sequence separating means 1001, the first inverse transform means 1002, the second inverse transform means 1003, and the synthesis means 1004 are shown. Consists of

符号列分離手段１００１には、符号列が入力される。ここでは、実施の形態１で説明した本発明の変換符号化方法を用いた動画像符号化装置により生成された符号列が入力されるものとする。符号列は図８に示すように、ブロック毎に選択情報Ｌとそのブロックの符号列Ｓの組み合わせにより構成されているとする。符号列分離手段１００１では、選択情報Ｌを復号化することにより、そのブロックの符号列Ｓが第１の変換方法と第２の変換方法のいずれにより生成されているかを判断する。そして、そのブロックの符号列Ｓが第１の変換方法により生成されている場合には、ブロックの符号列Ｓを第１の逆変換手段１００２に対して出力する。また、そのブロックの符号列Ｓが第２の変換方法により生成されている場合には、ブロックの符号列Ｓを第２の逆変換手段１００３に対して出力する。 The code string is input to the code string separation unit 1001. Here, it is assumed that a code string generated by a moving picture coding apparatus using the transform coding method of the present invention described in Embodiment 1 is input. As shown in FIG. 8, it is assumed that the code string is composed of a combination of the selection information L and the code string S of the block for each block. The code string separation unit 1001 decodes the selection information L to determine whether the code string S of the block is generated by the first conversion method or the second conversion method. When the code sequence S of the block is generated by the first conversion method, the code sequence S of the block is output to the first inverse conversion unit 1002. If the code sequence S of the block is generated by the second conversion method, the block code sequence S is output to the second inverse conversion means 1003.

第１の逆変換手段１００２での処理について説明する。
図１１は、第１の逆変換手段１００２の構成を示すブロック図である。第１の逆変換手段１００２は、可変長復号化手段１１０１、逆量子化手段１１０２、水平方向逆変換手段１１０３、垂直方向逆変換手段１１０４、合成手段１１０５、垂直方向変換基底生成手段１１０７、水平方向変換基底生成手段１１０６とから構成される。 Processing in the first inverse transform unit 1002 will be described.
FIG. 11 is a block diagram showing the configuration of the first inverse conversion means 1002. The first inverse transform unit 1002 includes a variable length decoding unit 1101, an inverse quantization unit 1102, a horizontal direction inverse transform unit 1103, a vertical direction inverse transform unit 1104, a synthesis unit 1105, a vertical direction transform base generation unit 1107, a horizontal direction. It comprises conversion base generation means 1106.

可変長復号化手段１１０１には、図７（ａ）に示すフォーマットの符号列が入力されるとする。ここではブロック２１０の符号列が入力されるとする。可変長復号化手段１１０１は、入力された符号列を、領域毎（領域２０１と領域２０３）の変換係数にして逆量子化手段１１０２に対して出力する。この際、ブロックに複数の領域の符号列が存在するか否か、また複数の領域の符号列が存在する際に、いくつの領域の符号列が存在するかは、領域情報から得ることができる。 It is assumed that the variable length decoding unit 1101 receives a code string having the format shown in FIG. Here, it is assumed that the code string of the block 210 is input. The variable length decoding unit 1101 converts the input code string into a transform coefficient for each region (region 201 and region 203), and outputs it to the inverse quantization unit 1102. At this time, it can be obtained from the region information whether the code string of a plurality of regions exists in the block and how many code sequences are present when the code sequences of a plurality of regions exist. .

逆量子化手段１１０２では、逆量子化処理が施され、逆量子化された変換係数が水平方向逆変換手段１１０３に対して出力される。ここで逆量子化とは、量子化された変換係数に量子化値を積算する処理を指す。
水平方向逆変換手段１１０３は、ブロック２１０の領域２０１と領域２０３のそれぞれの変換係数に対して、逆変換を施す。例えば、領域２０３の変換係数の場合、ブロック２１０の領域２０３の変換係数ブロックに対して（この際の係数は図４（ｃ）の斜線部の位置にある）、水平方向変換基底生成手段１１０６から入力された水平方向の変換基底（（数２）を用いて生成）を用いて、行毎に水平方向の逆変換を行い、逆変換係数ブロックを生成する。 Inverse quantization means 1102 performs inverse quantization processing, and the inversely quantized transform coefficients are output to horizontal inverse transform means 1103. Here, the inverse quantization refers to a process of adding the quantized value to the quantized transform coefficient.
The horizontal reverse conversion unit 1103 performs reverse conversion on the conversion coefficients of the area 201 and the area 203 of the block 210. For example, in the case of the transform coefficient of the area 203, the horizontal direction transform base generation means 1106 is used with respect to the transform coefficient block of the area 203 of the block 210 (the coefficient at this time is located at the shaded portion in FIG. 4C). Using the input horizontal transformation base (generated using (Equation 2)), horizontal transformation is performed for each row to generate an inverse transformation coefficient block.

垂直方向逆変換手段１１０４は、水平方向に逆変換した逆変換係数ブロックに対して、領域情報Ｒに基づいて、図４（ｂ）の状態となるように、係数を水平方向に移動させる。そして、垂直方向変換基底生成手段１１０７から入力された垂直方向の変換基底（（数１）を用いて生成）を用いて、列毎に垂直方向の逆変換を行い、２次元の逆変換係数（復号画素値）を生成する。そして最後に、領域情報Ｒに基づいて、復号画素を垂直方向に移動させ、図４（ａ）の状態とする。復号画素は合成手段１１０５に対して生成される。 Based on the region information R, the vertical inverse transform unit 1104 moves the coefficients in the horizontal direction based on the region information R with respect to the inverse transform coefficient block that has been inversely transformed in the horizontal direction. Then, using the vertical conversion base (generated using (Equation 1)) input from the vertical direction conversion base generation means 1107, vertical conversion is performed for each column, and a two-dimensional inverse conversion coefficient ( (Decoded pixel value) is generated. Finally, based on the region information R, the decoded pixel is moved in the vertical direction to obtain the state shown in FIG. Decoded pixels are generated for the synthesis unit 1105.

合成手段１１０５では、ブロック内の各領域の復号画素（ブロック２１０の場合は、領域２０１と領域２０３の復号画素）を合成して、復号ブロックＤ１として出力する。
第２の逆変換手段１００３での処理について説明する。
図１２は第２の逆変換手段１００３の構成を示すブロック図であり、可変長復号化手段１２０１、逆量子化手段１２０２、逆変換手段１２０３、変換基底群保持手段１２０４からから構成される。 The synthesizing unit 1105 synthesizes the decoded pixels of the respective areas in the block (in the case of the block 210, the decoded pixels of the area 201 and the area 203) and outputs the result as a decoded block D1.
A process in the second inverse transform unit 1003 will be described.
FIG. 12 is a block diagram showing the configuration of the second inverse transform unit 1003, which includes a variable length decoding unit 1201, an inverse quantization unit 1202, an inverse transform unit 1203, and a transform base group holding unit 1204.

可変長復号化手段１２０１には、図７（ｂ）、図７（ｃ）または図７（ｄ）に示すフォーマットの符号列が入力されるものとする。可変長復号化手段１２０１は、入力符号列に対して可変長復号化を施す。例えば符号列が図７（ｂ）のフォーマットである場合には、各ブロックに対するアトムの個数を復号化した後、順に各アトムの情報（位置情報、変換基底番号、内積値）を復号化していく。また符号列が図７（ｃ）のフォーマットである場合には、ブロック内の各画素位置に対するアトムの個数を復号化した後、順に各アトムの情報（変換基底番号、内積値）を復号化していく。また符号列が図７（ｄ）のフォーマットである場合には、ブロック内の各画素位置に対するアトムの個数を復号化した後、順に各アトムの情報（変換基底番号、内積値）を復号化していく。この場合に、画素位置は図７（ｃ）のフォーマットの場合には１ずつ増加させていくが、図７（ｄ）のフォーマットの場合には符号列に記述されている画素数分だけ増加させていく。復号化された位置情報ｄｎ（＝（ｄｎｘ，ｄｎｙ））、変換基底番号ｋｎは逆変換手段１２０３に対して出力され、量子化された内積値ｑｎは逆量子化手段１２０２に対して出力される。 It is assumed that the variable-length decoding unit 1201 is input with a code string in the format shown in FIG. 7B, FIG. 7C, or FIG. Variable length decoding means 1201 performs variable length decoding on the input code string. For example, when the code string is in the format of FIG. 7B, after decoding the number of atoms for each block, information on each atom (position information, conversion base number, inner product value) is sequentially decoded. . If the code string has the format shown in FIG. 7C, the number of atoms for each pixel position in the block is decoded, and then information on each atom (conversion base number, inner product value) is sequentially decoded. Go. If the code string has the format shown in FIG. 7D, after decoding the number of atoms for each pixel position in the block, the information (transformation base number, inner product value) of each atom is decoded in order. Go. In this case, the pixel position is increased by 1 in the case of the format of FIG. 7C, but is increased by the number of pixels described in the code string in the case of the format of FIG. 7D. To go. The decoded position information dn (= (dnx, dny)) and the transform base number kn are output to the inverse transform unit 1203, and the quantized inner product value qn is output to the inverse quantizer 1202. .

逆量子化手段１２０２では、量子化された内積値ｑｎに対して逆量子化処理を行う。ここで逆量子化処理とは、入力値と量子化値との積を求める処理のことを指す。逆量子化された内積値をαｎ'は、逆変換手段１２０３に対して出力される。
変換基底群保持手段１２０４では、逆変換手段１２０３で用いる変換基底群ｇｋ（ｘ，ｙ）を生成、保持する。ここで、逆変換手段１２０３ではＭＰ復号化を行うとし、ＭＰ符号化においては、双直交な基底関数が用いられる。例えば、双直交な基底関数の例としては、ガボール関数がある。 The inverse quantization means 1202 performs an inverse quantization process on the quantized inner product value qn. Here, the inverse quantization process refers to a process for obtaining a product of an input value and a quantized value. The inversely quantized inner product value αn ′ is output to the inverse transform means 1203.
The conversion base group holding unit 1204 generates and holds the conversion base group gk (x, y) used in the inverse conversion unit 1203. Here, it is assumed that the inverse transform unit 1203 performs MP decoding, and bi-orthogonal basis functions are used in MP coding. For example, an example of a bi-orthogonal basis function is a Gabor function.

逆変換手段１２０３においては、ｇｋｎ'（ｘ＋ｄｎｘ，ｙ＋ｄｎｙ）×αｎ'を求め、それを累積加算し復号画像信号を生成する。ここで、変換基底ｇｋ'（ｘ，ｙ）は、変換基底群保持手段１２０４で保持された変換基底群ｇｋ（ｘ，ｙ）に対して、ブロック内でのノルム値を正規化したものである。復号画像信号は復号ブロックＤ２として出力する。
合成手段１００４は、第１の逆変換手段１００２から出力された復号ブロックＤ１と、第２の逆変換手段１００３から出力された復号ブロックＤ２とを、符号列分離手段１００１から出力された選択情報Ｌを元にして、ブロック単位で合成し、復号化画像フレームを生成し、出力する。 The inverse transforming unit 1203 obtains gkn ′ (x + dnx, y + dny) × αn ′ and cumulatively adds it to generate a decoded image signal. Here, the transformation basis gk ′ (x, y) is obtained by normalizing the norm value in the block with respect to the transformation basis group gk (x, y) held by the transformation basis group holding means 1204. . The decoded image signal is output as a decoded block D2.
The synthesizing unit 1004 selects the decoded block D1 output from the first inverse transform unit 1002 and the decoded block D2 output from the second inverse transform unit 1003 from the selection information L output from the code string separation unit 1001. Based on the above, synthesis is performed in units of blocks, and a decoded image frame is generated and output.

以上のように、本発明の変換復号化方法においては、領域情報を有する画像信号を符号化する際に、ブロック単位に分割し、２つの変換方法で符号化された符号列を入力とし、符号列中に記述されている変換符号化方法に基づいて、変換復号化方法（逆変換方法）をブロック単位で切り替える。第１の逆変換方法は、ブロック内の領域毎に逆直交変換を行って復号化を行い、第２の逆変換方法は、ブロック内を領域に分割せずに、双直交変換を行って復号化（ＭＰ復号化）を行う。そして、復号化されたブロックを合成して、復号化フレームを生成する。 As described above, in the transform decoding method of the present invention, when encoding an image signal having region information, a code string divided into blocks and encoded by two transform methods is input, Based on the transform coding method described in the column, the transform decoding method (inverse transform method) is switched in units of blocks. The first inverse transform method performs decoding by performing inverse orthogonal transform for each region in the block, and the second inverse transform method performs decoding by performing biorthogonal transform without dividing the block into regions. (MP decoding). Then, the decoded blocks are combined to generate a decoded frame.

したがって、本発明の変換復号化方法を用いることによって、本発明の変換符号化方法を用いて生成した符号列を正しく復号化することができる。
なお、本発明の実施の形態においては、第１の逆変換手段の変換基底として、離散コサイン変換基底を用いて説明したが、これは他の変換基底であっても良い。例えば、他の変換基底としては、離散フーリエ変換、離散サイン変換、アダマール変換等の変換基底がある。 Therefore, by using the transform decoding method of the present invention, the code string generated using the transform coding method of the present invention can be correctly decoded.
In the embodiment of the present invention, the discrete cosine transform base is used as the transform base of the first inverse transform unit. However, this may be another transform base. For example, other transform bases include transform bases such as discrete Fourier transform, discrete sine transform, and Hadamard transform.

また、本発明の実施の形態においては、ブロック内に存在する領域毎に変換係数が求められている場合について説明したが、これは例えばブロック内で存在する画素数が少ない領域に対しては、その領域に対する変換係数が求められていなくても良い。この場合、画素数に対するしきい値を設定しておき、ブロック内のある領域の画素数がそのしきい値以上であれば、逆変換処理を行い、しきい値未満であれば逆変換処理を施さない、とすれば良い。この場合、符号化側と復号化側とでしきい値を予め設定しておくことや、符号列中にそのしきい値を記述することにより、どのように各ブロックを処理するかは、領域情報から自動的に判定することができる。 Further, in the embodiment of the present invention, the case where the transform coefficient is obtained for each area existing in the block has been described. For example, for an area where the number of pixels existing in the block is small, The conversion coefficient for that region may not be obtained. In this case, a threshold value for the number of pixels is set, and if the number of pixels in a certain area in the block is equal to or greater than the threshold value, the inverse conversion process is performed. Do not give it. In this case, how to process each block by setting a threshold value in advance on the encoding side and the decoding side, or describing the threshold value in the code string depends on the region. It can be automatically determined from the information.

また、本発明の実施の形態においては、ブロック毎に第１の逆変換手段１００２と、第２の逆変換手段１００３のいずれかを用いて処理を行う場合について説明したが、これはカラー画像信号の輝度成分についてのみ行い、色差成分に対しては第１の逆変換手段１００２または第２の逆変換手段１００３のいずれかのみを用いて復号化処理をしても良い。
また、本発明の実施の形態においては、第１の逆変換手段１００２と、第２の逆変換手段１００３の２つの逆変換手段を用いて各ブロックを処理する場合について説明したが、逆変換手段は３つ以上あっても良い。例えば、第３の逆変換手段として、ブロック内の領域を考慮せずにブロック全体を離散コサイン変換等の逆直交変換により変換する方法を用いることができる。これは、符号化側の処理と同様にすればよい。 In the embodiment of the present invention, the case has been described where processing is performed using either the first inverse transform unit 1002 or the second inverse transform unit 1003 for each block. Only the luminance component may be performed, and the color difference component may be decoded using only the first inverse conversion unit 1002 or the second inverse conversion unit 1003.
In the embodiment of the present invention, the case has been described in which each block is processed using the two inverse transform units of the first inverse transform unit 1002 and the second inverse transform unit 1003. There may be more than two. For example, as the third inverse transform means, a method of transforming the entire block by inverse orthogonal transform such as discrete cosine transform without considering the area in the block can be used. This may be the same as the processing on the encoding side.

また、本実施の形態においては、第２の逆変換手段１００３では、変換基底群保持手段１２０４が保持している変換基底群を用いて、逆変換手段１２０３で逆変換処理を行う場合について説明したが、これは可変長復号化手段１２０１において、符号列中の変換基底の出現頻度によって、アトム情報についての可変長復号化方法を適応的に変換させても良い。例えば、可変長復号化手段１２０１では、符号列を復号化して得られたアトムの変換基底番号を計数する。この動作を所定期間（例えば１フレーム期間）行い、その期間内で選択された回数が多い変換基底番号に対しては、次の期間では符号長が短くなるように復号化方法や復号化テーブルを適応的に変更する。この処理は、変換基底番号の計数を各ブロック毎に計数を行い、所定期間（例えば１フレーム分、フレーム先頭から、シーケンス先頭から）の累積値を用いて、ブロック単位で復号化方法や復号化テーブルを適応的に変更しても良い。また、このように適応的に、可変長復号化方法や可変長復号化テーブルを変更しながら復号化した場合、定期的に可変長復号化方法や可変長復号化テーブルの再初期化を行っても良い。また、符号化側で適応的に可変長符号化方法や可変長符号化テーブルを変更しながら符号化したかどうかを示すフラグが符号列のヘッダ中に記述されている場合、そのフラグを参照しながら可変長復号化方法や可変長復号化テーブルの再初期化を行えば良い。これにより、符号列の伝送中等にエラーが発生し、符号列を途中から（例えば次のフレームの先頭から）復号化する場合に、再初期化されているフレームまでは復号化を中止し、再初期化されているフレームから復号化を再開することにより、正しく復号化処理を行うことができる。 Further, in the present embodiment, a case has been described in which the second inverse transform unit 1003 performs the inverse transform process by the inverse transform unit 1203 using the transform base group retained by the transform base group retaining unit 1204. However, in this case, the variable length decoding unit 1201 may adaptively convert the variable length decoding method for the atom information according to the appearance frequency of the conversion base in the code string. For example, the variable length decoding unit 1201 counts the conversion base number of the atom obtained by decoding the code string. This operation is performed for a predetermined period (for example, one frame period), and for a transform base number that is selected a large number of times within that period, a decoding method and a decoding table are set so that the code length is shortened in the next period. Change adaptively. In this process, the conversion base number is counted for each block, and the decoding method or decoding is performed on a block-by-block basis using a cumulative value for a predetermined period (for example, one frame, from the frame head to the sequence head). The table may be changed adaptively. In addition, when decoding is performed while changing the variable length decoding method and the variable length decoding table adaptively in this way, the variable length decoding method and the variable length decoding table are periodically reinitialized. Also good. In addition, when a flag indicating whether or not encoding is performed while adaptively changing the variable length coding method or variable length coding table on the encoding side is described in the header of the code string, the flag is referred to. The variable length decoding method and the variable length decoding table may be reinitialized. As a result, an error occurs during transmission of the code string, and when the code string is decoded halfway (for example, from the beginning of the next frame), the decoding is stopped until the reinitialized frame, By restarting decoding from the initialized frame, the decoding process can be performed correctly.

また、本発明の実施の形態においては、各ブロックを第１の逆変換方法と第２の逆変換方法とを用いて符号化する場合について説明したが、これはカラー画像信号の輝度成分についてのみ行い、色差成分に対しては一方の逆変換方法のみを用いて復号化処理をしても良い。この場合、例えば、符号列分離手段１００１において、色差成分の符号列を第１の逆変換手段１００２に対して常に出力するようにすれば良い。 In the embodiment of the present invention, the case where each block is encoded using the first inverse transformation method and the second inverse transformation method has been described, but this is only for the luminance component of the color image signal. The color difference component may be decoded using only one inverse transformation method. In this case, for example, the code string separation unit 1001 may always output the color difference component code string to the first inverse conversion unit 1002.

（実施の形態３）
さらに、上記各実施の形態で示した変換符号化方法および変換復号化方法を実現するためのプログラムを、フレキシブルディスク等の記録媒体に記録するようにすることにより、上記各実施の形態で示した処理を、独立したコンピュータシステムにおいて簡単に実施することが可能となる。 (Embodiment 3)
Further, the program for realizing the transform encoding method and transform decoding method shown in each of the above embodiments is recorded on a recording medium such as a flexible disk, thereby showing the above embodiments. Processing can be easily performed in an independent computer system.

図１３は、上記各実施の形態の変換符号化方法および変換復号化方法を、フレキシブルディスク等の記録媒体に記録されたプログラムを用いて、コンピュータシステムにより実施する場合の説明図である。
図１３(b)は、フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示し、図１３(a)は、記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示している。フレキシブルディスクＦＤはケースＦ内に内蔵され、該ディスクの表面には、同心円状に外周からは内周に向かって複数のトラックＴｒが形成され、各トラックは角度方向に１６のセクタＳｅに分割されている。従って、上記プログラムを格納したフレキシブルディスクでは、上記フレキシブルディスクＦＤ上に割り当てられた領域に、上記プログラムが記録されている。 FIG. 13 is an explanatory diagram in a case where the transform coding method and the transform decoding method of each of the above embodiments are implemented by a computer system using a program recorded on a recording medium such as a flexible disk.
FIG. 13B shows an appearance, a cross-sectional structure, and a flexible disk when viewed from the front of the flexible disk, and FIG. 13A shows an example of a physical format of the flexible disk that is a recording medium body. The flexible disk FD is built in the case F, and on the surface of the disk, a plurality of tracks Tr are formed concentrically from the outer periphery toward the inner periphery, and each track is divided into 16 sectors Se in the angular direction. ing. Therefore, in the flexible disk storing the program, the program is recorded in an area allocated on the flexible disk FD.

また、図１３(c)は、フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示す。変換符号化方法および変換復号化方法を実現する上記プログラムをフレキシブルディスクＦＤに記録する場合は、コンピュータシステムＣｓから上記プログラムをフレキシブルディスクドライブを介して書き込む。また、フレキシブルディスク内のプログラムにより変換符号化方法および変換復号化方法を実現する上記変換符号化方法および変換復号化方法をコンピュータシステム中に構築する場合は、フレキシブルディスクドライブによりプログラムをフレキシブルディスクから読み出し、コンピュータシステムに転送する。 FIG. 13C shows a configuration for recording and reproducing the program on the flexible disk FD. When the program for realizing the transform encoding method and transform decoding method is recorded on the flexible disk FD, the program is written from the computer system Cs via the flexible disk drive. When the above-described transform encoding method and transform decoding method for realizing the transform encoding method and transform decoding method by a program in a flexible disk are built in a computer system, the program is read from the flexible disk by a flexible disk drive. Transfer to computer system.

なお、上記説明では、記録媒体としてフレキシブルディスクを用いて説明を行ったが、光ディスクを用いても同様に行うことができる。また、記録媒体はこれに限らず、ＩＣカード、ＲＯＭカセット等、プログラムを記録できるものであれば同様に実施することができる。 In the above description, a flexible disk is used as the recording medium, but the same can be done using an optical disk. Further, the recording medium is not limited to this, and any recording medium such as an IC card or a ROM cassette capable of recording a program can be similarly implemented.

（実施の形態４）
さらにここで、上記実施の形態で示した変換符号化方法および変換復号化方法の応用例とそれを用いたシステムを説明する。 (Embodiment 4)
Furthermore, application examples of the transform coding method and transform decoding method shown in the above embodiment and a system using the same will be described.

図１４は、コンテンツ配信サービスを実現するコンテンツ供給システムex１００の全体構成を示すブロック図である。通信サービスの提供エリアを所望の大きさに分割し、各セル内にそれぞれ固定無線局である基地局ex１０７〜ex１１０が設置されている。
このコンテンツ供給システムex１００は、例えば、インターネットex１０１にインターネットサービスプロバイダex１０２および電話網ex１０４、および基地局ｅｘ１０７〜ｅｘ１１０を介して、コンピュータex１１１、ＰＤＡ（personal digital assistant）ex１１２、カメラex１１３、携帯電話ex１１４、カメラ付きの携帯電話ｅｘ１１５などの各機器が接続される。 FIG. 14 is a block diagram showing an overall configuration of a content supply system ex100 that implements a content distribution service. The communication service providing area is divided into desired sizes, and base stations ex107 to ex110, which are fixed radio stations, are installed in each cell.
The content supply system ex100 includes, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, a camera via the Internet ex101, the Internet service provider ex102, the telephone network ex104, and the base stations ex107 to ex110. Each device such as the attached mobile phone ex115 is connected.

しかし、コンテンツ供給システムex１００は図１４のような組合せに限定されず、いずれかを組み合わせて接続するようにしてもよい。また、固定無線局である基地局ex１０７〜ex１１０を介さずに、各機器が電話網ex１０４に直接接続されてもよい。
カメラex１１３はデジタルビデオカメラ等の動画撮影が可能な機器である。また、携帯電話は、ＰＤＣ（Personal Digital Communications）方式、ＣＤＭＡ（Code Division Multiple Access）方式、Ｗ−ＣＤＭＡ（Wideband-Code Division Multiple Access）方式、若しくはＧＳＭ（Global System for Mobile Communications）方式の携帯電話機、またはＰＨＳ（Personal Handyphone System）等であり、いずれでも構わない。 However, the content supply system ex100 is not limited to the combination as shown in FIG. 14, and any of the combinations may be connected. Further, each device may be directly connected to the telephone network ex104 without going through the base stations ex107 to ex110 which are fixed wireless stations.
The camera ex113 is a device capable of shooting a moving image such as a digital video camera. The mobile phone is a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system, or a GSM (Global System for Mobile Communications) system mobile phone, Alternatively, PHS (Personal Handyphone System) or the like may be used.

また、ストリーミングサーバex１０３は、カメラex１１３から基地局ex１０９、電話網ex１０４を通じて接続されており、カメラex１１３を用いてユーザが送信する符号化処理されたデータに基づいたライブ配信等が可能になる。撮影したデータの符号化処理はカメラex１１３で行っても、データの送信処理をするサーバ等で行ってもよい。また、カメラ１１６で撮影した動画データはコンピュータex１１１を介してストリーミングサーバex１０３に送信されてもよい。カメラex１１６はデジタルカメラ等の静止画、動画が撮影可能な機器である。この場合、動画データの符号化はカメラex１１６で行ってもコンピュータex１１１で行ってもどちらでもよい。また、符号化処理はコンピュータex１１１やカメラex１１６が有するＬＳＩex１１７において処理することになる。なお、画像符号化・復号化用のソフトウェアをコンピュータex１１１等で読み取り可能な記録媒体である何らかの蓄積メディア（ＣＤ−ＲＯＭ、フレキシブルディスク、ハードディスクなど）に組み込んでもよい。さらに、カメラ付きの携帯電話ex１１５で動画データを送信してもよい。このときの動画データは携帯電話ex１１５が有するＬＳＩで符号化処理されたデータである。 In addition, the streaming server ex103 is connected from the camera ex113 through the base station ex109 and the telephone network ex104, and live distribution or the like based on the encoded data transmitted by the user using the camera ex113 becomes possible. The encoded processing of the captured data may be performed by the camera ex113 or may be performed by a server or the like that performs data transmission processing. Further, the moving image data shot by the camera 116 may be transmitted to the streaming server ex103 via the computer ex111. The camera ex116 is a device that can shoot still images and moving images, such as a digital camera. In this case, the encoding of the moving image data may be performed by the camera ex116 or the computer ex111. The encoding process is performed in the LSI ex117 included in the computer ex111 and the camera ex116. Note that image encoding / decoding software may be incorporated into any storage medium (CD-ROM, flexible disk, hard disk, etc.) that is a recording medium readable by the computer ex111 or the like. Furthermore, you may transmit moving image data with the mobile telephone ex115 with a camera. The moving image data at this time is data encoded by the LSI included in the mobile phone ex115.

このコンテンツ供給システムex１００では、ユーザがカメラex１１３、カメラex１１６等で撮影しているコンテンツ（例えば、音楽ライブを撮影した映像等）を上記実施の形態同様に符号化処理してストリーミングサーバex１０３に送信する一方で、ストリーミングサーバex１０３は要求のあったクライアントに対して上記コンテンツデータをストリーム配信する。クライアントとしては、上記符号化処理されたデータを復号化することが可能な、コンピュータex１１１、ＰＤＡex１１２、カメラex１１３、携帯電話ex１１４等がある。このようにすることでコンテンツ供給システムex１００は、符号化されたデータをクライアントにおいて受信して再生することができ、さらにクライアントにおいてリアルタイムで受信して復号化し、再生することにより、個人放送をも実現可能になるシステムである。 In this content supply system ex100, the content (for example, video shot of music live) captured by the user with the camera ex113, camera ex116, etc. is encoded and transmitted to the streaming server ex103 as in the above embodiment. On the other hand, the streaming server ex103 distributes the content data to the requested client. Examples of the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, and the like that can decode the encoded data. In this way, the content supply system ex100 can receive and reproduce the encoded data at the client, and also realize personal broadcasting by receiving, decoding, and reproducing in real time at the client. It is a system that becomes possible.

このシステムを構成する各機器の符号化、復号化には上記各実施の形態で示した画像符号化装置あるいは画像復号化装置を用いるようにすればよい。
その一例として携帯電話について説明する。
図１５は、上記実施の形態で説明した変換符号化方法および変換復号化方法を用いた携帯電話ex１１５を示す図である。携帯電話ex１１５は、基地局ex１１０との間で電波を送受信するためのアンテナex２０１、ＣＣＤカメラ等の映像、静止画を撮ることが可能なカメラ部ex２０３、カメラ部ex２０３で撮影した映像、アンテナex２０１で受信した映像等が復号化されたデータを表示する液晶ディスプレイ等の表示部ex２０２、操作キーｅｘ２０４群から構成される本体部、音声出力をするためのスピーカ等の音声出力部ex２０８、音声入力をするためのマイク等の音声入力部ex２０５、撮影した動画もしくは静止画のデータ、受信したメールのデータ、動画のデータもしくは静止画のデータ等、符号化されたデータまたは復号化されたデータを保存するための記録メディアex２０７、携帯電話ex１１５に記録メディアex２０７を装着可能とするためのスロット部ex２０６を有している。記録メディアex２０７はＳＤカード等のプラスチックケース内に電気的に書換えや消去が可能な不揮発性メモリであるＥＥＰＲＯＭ（Electrically Erasable and Programmable Read Only Memory）の一種であるフラッシュメモリ素子を格納したものである。 The image encoding device or the image decoding device described in the above embodiments may be used for encoding and decoding of each device constituting this system.
A mobile phone will be described as an example.
FIG. 15 is a diagram showing the mobile phone ex115 using the transform coding method and transform decoding method described in the above embodiment. The cellular phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a camera such as a CCD camera, a camera unit ex203 capable of taking a still image, a video shot by the camera unit ex203, and an antenna ex201. A display unit ex202 such as a liquid crystal display that displays data obtained by decoding received video and the like, a main body unit composed of a group of operation keys ex204, an audio output unit ex208 such as a speaker for audio output, and audio input To store encoded data or decoded data such as a voice input unit ex205 such as a microphone, captured video or still image data, received mail data, video data or still image data, etc. Recording medium ex207, and slot portion ex20 for enabling recording medium ex207 to be attached to mobile phone ex115 The it has. The recording medium ex207 stores a flash memory element which is a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory) which is a nonvolatile memory that can be electrically rewritten and erased in a plastic case such as an SD card.

さらに、携帯電話ex１１５について図１６を用いて説明する。携帯電話ex１１５は表示部ex２０２及び操作キーｅｘ２０４を備えた本体部の各部を統括的に制御するようになされた主制御部ex３１１に対して、電源回路部ex３１０、操作入力制御部ex３０４、画像符号化部ex３１２、カメラインターフェース部ex３０３、ＬＣＤ（Liquid Crystal Display）制御部ex３０２、画像復号化部ex３０９、多重分離部ex３０８、記録再生部ex３０７、変復調回路部ex３０６及び音声処理部ex３０５が同期バスex３１３を介して互いに接続されている。 Further, the cellular phone ex115 will be described with reference to FIG. The cellular phone ex115 controls the power supply circuit ex310, the operation input control unit ex304, and the image coding for the main control unit ex311 which is configured to control the respective units of the main body unit including the display unit ex202 and the operation key ex204. Unit ex312, camera interface unit ex303, LCD (Liquid Crystal Display) control unit ex302, image decoding unit ex309, demultiplexing unit ex308, recording / reproducing unit ex307, modulation / demodulation circuit unit ex306, and audio processing unit ex305 via a synchronization bus ex313 Are connected to each other.

電源回路部ex３１０は、ユーザの操作により終話及び電源キーがオン状態にされると、バッテリパックから各部に対して電力を供給することによりカメラ付ディジタル携帯電話ex１１５を動作可能な状態に起動する。
携帯電話ex１１５は、ＣＰＵ、ＲＯＭ及びＲＡＭ等でなる主制御部ex３１１の制御に基づいて、音声通話モード時に音声入力部ex２０５で集音した音声信号を音声処理部ex３０５によってディジタル音声データに変換し、これを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。また携帯電話機ex１１５は、音声通話モード時にアンテナex２０１で受信した受信信号を増幅して周波数変換処理及びアナログディジタル変換処理を施し、変復調回路部ex３０６でスペクトラム逆拡散処理し、音声処理部ex３０５によってアナログ音声信号に変換した後、これを音声出力部ｅｘ２０８を介して出力する。 When the end call and power key are turned on by a user operation, the power supply circuit ex310 activates the camera-equipped digital mobile phone ex115 by supplying power from the battery pack to each unit. .
The mobile phone ex115 converts the voice signal collected by the voice input unit ex205 in the voice call mode into digital voice data by the voice processing unit ex305 based on the control of the main control unit ex311 including a CPU, a ROM, a RAM, and the like. The modulation / demodulation circuit unit ex306 performs spread spectrum processing, and the transmission / reception circuit unit ex301 performs digital analog conversion processing and frequency conversion processing, and then transmits the result via the antenna ex201. In addition, the cellular phone ex115 amplifies the received signal received by the antenna ex201 in the voice call mode, performs frequency conversion processing and analog-digital conversion processing, performs spectrum despreading processing by the modulation / demodulation circuit unit ex306, and analog audio by the voice processing unit ex305. After conversion into a signal, this is output via the audio output unit ex208.

さらに、データ通信モード時に電子メールを送信する場合、本体部の操作キーｅｘ２０４の操作によって入力された電子メールのテキストデータは操作入力制御部ex３０４を介して主制御部ex３１１に送出される。主制御部ex３１１は、テキストデータを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して基地局ex１１０へ送信する。 Further, when an e-mail is transmitted in the data communication mode, text data of the e-mail input by operating the operation key ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex306, performs digital analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex301, and then transmits the text data to the base station ex110 via the antenna ex201.

データ通信モード時に画像データを送信する場合、カメラ部ex２０３で撮像された画像データをカメラインターフェース部ex３０３を介して画像符号化部ex３１２に供給する。また、画像データを送信しない場合には、カメラ部ex２０３で撮像した画像データをカメラインターフェース部ex３０３及びＬＣＤ制御部ex３０２を介して表示部ex２０２に直接表示することも可能である。 When transmitting image data in the data communication mode, the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303. When image data is not transmitted, the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.

画像符号化部ex３１２は、本願発明で説明した画像符号化装置を備えた構成であり、カメラ部ex２０３から供給された画像データを上記実施の形態で示した画像符号化装置に用いた符号化方法によって圧縮符号化することにより符号化画像データに変換し、これを多重分離部ex３０８に送出する。また、このとき同時に携帯電話機ex１１５は、カメラ部ex２０３で撮像中に音声入力部ex２０５で集音した音声を音声処理部ex３０５を介してディジタルの音声データとして多重分離部ex３０８に送出する。 The image encoding unit ex312 has a configuration including the image encoding device described in the present invention, and an encoding method using the image data supplied from the camera unit ex203 in the image encoding device described in the above embodiment. The encoded image data is converted into encoded image data by compression encoding, and sent to the demultiplexing unit ex308. At the same time, the cellular phone ex115 sends the sound collected by the audio input unit ex205 during imaging by the camera unit ex203 to the demultiplexing unit ex308 as digital audio data via the audio processing unit ex305.

多重分離部ex３０８は、画像符号化部ex３１２から供給された符号化画像データと音声処理部ex３０５から供給された音声データとを所定の方式で多重化し、その結果得られる多重化データを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。 The demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method, and the multiplexed data obtained as a result is a modulation / demodulation circuit unit A spectrum spread process is performed in ex306, a digital analog conversion process and a frequency conversion process are performed in the transmission / reception circuit unit ex301, and then the signal is transmitted through the antenna ex201.

データ通信モード時にホームページ等にリンクされた動画像ファイルのデータを受信する場合、アンテナex２０１を介して基地局ex１１０から受信した受信信号を変復調回路部ex３０６でスペクトラム逆拡散処理し、その結果得られる多重化データを多重分離部ex３０８に送出する。
また、アンテナex２０１を介して受信された多重化データを復号化するには、多重分離部ex３０８は、多重化データを分離することにより画像データの符号化ビットストリームと音声データの符号化ビットストリームとに分け、同期バスex３１３を介して当該符号化画像データを画像復号化部ex３０９に供給すると共に当該音声データを音声処理部ex３０５に供給する。 When receiving data of a moving image file linked to a home page or the like in the data communication mode, the received signal received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex306, and the resulting multiplexing is obtained. Is sent to the demultiplexing unit ex308.
In addition, in order to decode the multiplexed data received via the antenna ex201, the demultiplexing unit ex308 separates the multiplexed data to generate an encoded bitstream of image data and an encoded bitstream of audio data. The encoded image data is supplied to the image decoding unit ex309 via the synchronization bus ex313, and the audio data is supplied to the audio processing unit ex305.

次に、画像復号化部ex３０９は、本願発明で説明した画像復号化装置を備えた構成であり、画像データの符号化ビットストリームを上記実施の形態で示した符号化方法に対応した復号化方法で復号することにより再生動画像データを生成し、これをＬＣＤ制御部ex３０２を介して表示部ex２０２に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まれる動画データが表示される。このとき同時に音声処理部ex３０５は、音声データをアナログ音声信号に変換した後、これを音声出力部ex２０８に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まる音声データが再生される。 Next, the image decoding unit ex309 is configured to include the image decoding device described in the present invention, and a decoding method corresponding to the encoding method described in the above embodiment for an encoded bit stream of image data. To generate playback moving image data, which is supplied to the display unit ex202 via the LCD control unit ex302, thereby displaying, for example, moving image data included in the moving image file linked to the homepage . At the same time, the audio processing unit ex305 converts the audio data into an analog audio signal, and then supplies the analog audio signal to the audio output unit ex208. Thus, for example, the audio data included in the moving image file linked to the home page is reproduced. The

なお、上記システムの例に限られず、最近は衛星、地上波によるディジタル放送が話題となっており、図１７に示すようにディジタル放送用システムにも上記実施の形態の少なくとも画像符号化装置または画像復号化装置のいずれかを組み込むことができる。具体的には、放送局ex４０９では映像情報の符号化ビットストリームが電波を介して通信または放送衛星ex４１０に伝送される。これを受けた放送衛星ex４１０は、放送用の電波を発信し、この電波を衛星放送受信設備をもつ家庭のアンテナex４０６で受信し、テレビ（受信機）ex４０１またはセットトップボックス（ＳＴＢ）ex４０７などの装置により符号化ビットストリームを復号化してこれを再生する。また、記録媒体であるCDやDVD等の蓄積メディアex４０２に記録した符号化ビットストリームを読み取り、復号化する再生装置ex４０３にも上記実施の形態で示した画像復号化装置を実装することが可能である。この場合、再生された映像信号はモニタex４０４に表示される。また、ケーブルテレビ用のケーブルex４０５または衛星／地上波放送のアンテナex４０６に接続されたセットトップボックスex４０７内に画像復号化装置を実装し、これをテレビのモニタex４０８で再生する構成も考えられる。このときセットトップボックスではなく、テレビ内に画像復号化装置を組み込んでも良い。また、アンテナex４１１を有する車ex４１２で衛星ex４１０からまたは基地局ex１０７等から信号を受信し、車ex４１２が有するカーナビゲーションex４１３等の表示装置に動画を再生することも可能である。 Note that the present invention is not limited to the above-described system, and recently, digital broadcasting by satellite and terrestrial has become a hot topic, and as shown in FIG. Any of the decoding devices can be incorporated. Specifically, in the broadcasting station ex409, the encoded bit stream of the video information is transmitted to the communication or broadcasting satellite ex410 via radio waves. Receiving this, the broadcasting satellite ex410 transmits a radio wave for broadcasting, and receives the radio wave with a home antenna ex406 having a satellite broadcasting receiving facility, such as a television (receiver) ex401 or a set top box (STB) ex407. The device decodes the encoded bit stream and reproduces it. In addition, the image decoding apparatus described in the above embodiment can also be implemented in a playback apparatus ex403 that reads and decodes an encoded bitstream recorded on a storage medium ex402 such as a CD or DVD that is a recording medium. is there. In this case, the reproduced video signal is displayed on the monitor ex404. Further, a configuration in which an image decoding device is mounted in a set-top box ex407 connected to a cable ex405 for cable television or an antenna ex406 for satellite / terrestrial broadcasting, and this is reproduced on the monitor ex408 of the television is also conceivable. At this time, the image decoding apparatus may be incorporated in the television instead of the set top box. In addition, a car ex412 having an antenna ex411 can receive a signal from a satellite ex410 or a base station ex107 and the like, and a moving image can be reproduced on a display device such as a car navigation ex413 that the car ex412 has.

更に、画像信号を上記実施の形態で示した画像符号化装置で符号化し、記録媒体に記録することもできる。具体例としては、DVDディスクｅｘ４２１に画像信号を記録するDVDレコーダや、ハードディスクに記録するディスクレコーダなどのレコーダｅx４２０がある。更にSDカードｅｘ４２２に記録することもできる。レコーダｅｘ４２０が上記実施の形態で示した画像復号化装置を備えていれば、DVDディスクｅｘ４２１やSDカードｅｘ４２２に記録した画像信号を再生し、モニタｅｘ４０８で表示することができる。 Further, the image signal can be encoded by the image encoding device shown in the above embodiment and recorded on a recording medium. As a specific example, there is a recorder ex420 such as a DVD recorder that records an image signal on a DVD disk ex421 or a disk recorder that records on a hard disk. Further, it can be recorded on the SD card ex422. If the recorder ex420 includes the image decoding device described in the above embodiment, the image signal recorded on the DVD disc ex421 or the SD card ex422 can be reproduced and displayed on the monitor ex408.

なお、カーナビゲーションex４１３の構成は例えば図１６に示す構成のうち、カメラ部ex２０３とカメラインターフェース部ex３０３、画像符号化部ｅｘ３１２を除いた構成が考えられ、同様なことがコンピュータex１１１やテレビ（受信機）ex４０１等でも考えられる。
また、上記携帯電話ex１１４等の端末は、符号化器・復号化器を両方持つ送受信型の端末の他に、符号化器のみの送信端末、復号化器のみの受信端末の３通りの実装形式が考えられる。 For example, the configuration of the car navigation ex413 may be a configuration excluding the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312 in the configuration illustrated in FIG. 16, and the same applies to the computer ex111 and the television (receiver). ) Ex401 can also be considered.
In addition to the transmission / reception type terminal having both the encoder and the decoder, the terminal such as the mobile phone ex114 has three mounting formats: a transmitting terminal having only an encoder and a receiving terminal having only a decoder. Can be considered.

このように、上記実施の形態で示した変換符号化方法および変換復号化方法を上述したいずれの機器・システムに用いることは可能であり、そうすることで、上記実施の形態で説明した効果を得ることができる。
なお、本発明はかかる上記実施形態に限定されるものではなく、本発明の範囲を逸脱することなく種々の変形または修正が可能である。 As described above, the transform coding method and transform decoding method shown in the above embodiment can be used in any of the above-described devices / systems, and as a result, the effects described in the above embodiment can be achieved. Can be obtained.
It should be noted that the present invention is not limited to the above-described embodiment, and various changes and modifications can be made without departing from the scope of the present invention.

本発明にかかる変換符号化方法および変換復号化方法は、領域情報を有する画像信号を符号化する際に、ブロック単位に分割し、２つの変換方法で符号化を行い、ブロック毎に符号化効率が高くなる方を選択することにより、符号化効率の向上を図ることができるという効果を有し、蓄積、伝送、通信等における変換符号化方法および変換復号化方法として有用である。 The transform coding method and transform decoding method according to the present invention, when coding an image signal having region information, divides into block units, performs coding by two transform methods, and performs coding efficiency for each block. By selecting the higher one, the encoding efficiency can be improved, which is useful as a transform encoding method and transform decoding method in storage, transmission, communication, and the like.

本発明の変換符号化方法を用いた変換符号化装置の構成を示すブロック図（実施の形態１）である。1 is a block diagram (Embodiment 1) showing the configuration of a transform coding apparatus using the transform coding method of the present invention. 本発明の変換符号化方法を説明するための画像信号と領域情報を示す模式図（実施の形態１）である。It is a schematic diagram (Embodiment 1) which shows the image signal and area | region information for demonstrating the transform coding method of this invention. 本発明の変換符号化装置における第１の変換手段の構成を示すブロック図（実施の形態１）である。It is a block diagram (Embodiment 1) which shows the structure of the 1st conversion means in the conversion coding apparatus of this invention. 本発明の変換符号化装置における第１の変換手段の動作を説明するための模式図（実施の形態１）である。It is a schematic diagram (Embodiment 1) for demonstrating operation | movement of the 1st conversion means in the conversion coding apparatus of this invention. 本発明の変換符号化装置における第２の変換手段の構成を示すブロック図（実施の形態１）である。It is a block diagram (Embodiment 1) which shows the structure of the 2nd conversion means in the conversion coding apparatus of this invention. 本発明の変換符号化装置における第２の変換手段の動作を示すフローチャート（実施の形態１）である。It is a flowchart (Embodiment 1) which shows the operation | movement of the 2nd conversion means in the conversion coding apparatus of this invention. 本発明の変換符号化装置により生成した符号列の構成例を示す模式図（実施の形態１）である。It is a schematic diagram (Embodiment 1) which shows the structural example of the code sequence produced | generated by the conversion coding apparatus of this invention. 本発明の変換符号化装置により生成した符号列の構成例を示す模式図（実施の形態１）である。It is a schematic diagram (Embodiment 1) which shows the structural example of the code sequence produced | generated by the conversion coding apparatus of this invention. 本発明の変換符号化方法を用いた変換符号化装置の一変形例を示すブロック図（実施の形態１）である。It is a block diagram (Embodiment 1) which shows the modification of the transform coding apparatus using the transform coding method of this invention. 本発明の変換復号化方法を用いた変換復号化装置の構成を示すブロック図（実施の形態２）である。It is a block diagram (Embodiment 2) which shows the structure of the transform decoding apparatus using the transform decoding method of this invention. 本発明の変換復号化装置における第１の逆変換手段の構成を示すブロック図（実施の形態２）である。It is a block diagram (Embodiment 2) which shows the structure of the 1st reverse conversion means in the conversion decoding apparatus of this invention. 本発明の変換復号化装置における第２の逆変換手段の構成を示すブロック図（実施の形態２）である。It is a block diagram (Embodiment 2) which shows the structure of the 2nd reverse conversion means in the conversion decoding apparatus of this invention. 上記各実施の形態の変換符号化方法および変換復号化方法をコンピュータシステムにより実現するためのプログラムを格納するための記録媒体についての説明図（実施の形態３）である。It is explanatory drawing (Embodiment 3) about the recording medium for storing the program for implement | achieving the conversion encoding method and conversion decoding method of said each embodiment by a computer system. コンテンツ供給システムの全体構成を示すブロック図（実施の形態４）である。FIG. 10 is a block diagram showing the overall configuration of a content supply system (Embodiment 4). 変換符号化方法および変換復号化方法を用いた携帯電話の例（実施の形態４）である。It is an example (Embodiment 4) of the mobile phone using the transform coding method and transform decoding method. 携帯電話のブロック図（実施の形態４）である。FIG. 10 is a block diagram of a mobile phone (Embodiment 4). ディジタル放送用システムの例（実施の形態４）である。It is an example (Embodiment 4) of the system for digital broadcasting.

Explanation of symbols

１０１分離手段
１０２第１の変換手段
１０３第２の変換手段
１０４選択手段
１０５符号列生成手段
１００１符号列分離手段
１００２第１の逆変換手段
１００３第２の逆変換手段
１００４合成手段
Ｃｓコンピュータ・システム
ＦＤフレキシブルディスク
ＦＤＤフレキシブルディスクドライブ 101 Separation Unit 102 First Conversion Unit 103 Second Conversion Unit 104 Selection Unit 105 Code String Generation Unit 1001 Code String Separation Unit 1002 First Inverse Conversion Unit 1003 Second Inverse Conversion Unit 1004 Synthesis Unit Cs Computer System FD Flexible disk FDD Flexible disk drive

Claims

With an image signal composed of M areas and area information indicating the areas as inputs,
A dividing step of dividing the image signal into image blocks;
N (N ≧ 2) conversion steps for converting the image block into a code string;
A selection step of selecting one code sequence from the N code sequences generated by the N conversion steps;
A code string generation step of synthesizing the code strings selected in the selection step and generating an output code string;
The N conversion steps include a first conversion step that determines a conversion basis function based on the region information and performs conversion processing, and a first conversion step that determines a conversion basis function regardless of the region information and performs conversion processing. A transform encoding method characterized by comprising two transform steps.

2. The transform coding method according to claim 1, wherein in the code string generation step, an identifier of the code string selected in the selection step is described.

The first conversion step includes:
A transform basis determination step for generating a transform basis function based on region information in the block;
The transform coding method according to claim 1, further comprising: a transform step of transforming the pixel value of the block into a transform coefficient using the transform basis function generated by the transform basis determining step.

In the second conversion step,
The image signal in the block is expressed by a linear sum of conversion bases selected from a predetermined conversion base group, and the conversion base number, the pixel position in the block in which the conversion base is arranged, and the conversion base are added. The transform encoding method according to claim 1, wherein the weight at the time is described in the code string.

In the second conversion step,
The number of the said conversion base, the position in the block which arrange | positions the said conversion base, and the weight at the time of adding the said conversion base are described in a code sequence in the order which selected the said conversion base. Transform coding method.

In the second conversion step,
5. The transform coding method according to claim 4, wherein a number of the transform base and a weight for adding the transform base are described in a code sequence in a predetermined pixel order in the block.

In the second conversion step,
When the code sequence of the second conversion step is selected by the selection step, the number of the conversion base selected when generating the code sequence is counted, and the code of the number of the conversion base is counted according to the count result The transform encoding method according to claim 4, wherein the conversion method is changed.

In the second conversion step,
When the code sequence of the second conversion step is selected by the selection step, the number of the conversion base selected when generating the code sequence is counted, and the code of the number of the conversion base is counted according to the count result Change the conversion method,
In the code string generation step,
The transform coding method according to claim 4, wherein the second transform step describes information indicating that the transform base number coding method has been changed in the code string.

A code string generated by encoding an image signal composed of M regions using N (N ≧ 2) conversion methods and region information indicating the region are input,
A code string dividing step of dividing the code string based on information indicating the conversion method described in the code string;
N inverse transform steps for performing inverse transform on the code string divided by the code string dividing step to generate a decoded image;
Combining a decoded image generated by the inverse conversion step,
The N inverse transform steps include a first inverse transform step that performs a transform process by determining a transform basis function based on the region information, and a transform basis function that determines a transform basis function regardless of the region information. A transform decoding method comprising: a second inverse transform step for performing processing.

The transform decoding method according to claim 9, wherein in the code string dividing step, the code string is divided based on an identifier described in the code string.

The first inverse transformation step includes:
A transform basis determination step for generating a transform basis function based on region information in the block;
The transform decoding method according to claim 9, further comprising: an inverse transform step of transforming transform coefficients of the block into pixel values using the transform basis function generated by the transform basis determining step.

In the second inverse transformation step,
The decoded image is generated based on a number of the conversion base described in the code string, a position in a block where the conversion base is arranged, and a weight when adding the conversion base. Item 10. The transform decoding method according to Item 9.

In the second inverse transformation step,
13. The decoded image is generated by adding the decoded number of the conversion base and the weight for adding the conversion base based on the position in the block in which the conversion base is arranged. Conversion decoding method.

In the second inverse transformation step,
13. The transform decoding method according to claim 12, wherein the decoded image is generated by adding the decoded transform base number and the weight for adding the transform base in a predetermined pixel order in the block.

In the second inverse transformation step,
13. The transform decoding method according to claim 12, wherein the number of the transform base selected when generating the code string is counted, and the decoding method of the transform base number is changed according to the count result. .

In the second inverse transformation step,
Information indicating whether the code string was generated while changing the encoding method of the conversion base number is acquired from the code string, and the code string is generated while changing the encoding method of the conversion base number When the code base is generated, the number of the transform base selected when generating the code string is counted, and the decoding method of the transform base number is changed according to the count result. 12. The transform decoding method according to 12.

A program for performing the transform encoding method according to claim 1 by a computer,
The above program is stored on the computer.
With an image signal composed of M areas and area information indicating the areas as inputs,
A dividing step of dividing the image signal into image blocks;
N (N ≧ 2) conversion steps for converting the image block into a code string;
A selection step of selecting one code sequence from the N code sequences generated by the N conversion steps;
Synthesizing the code strings selected in the selection step, and generating a code string generation step for generating an output code string;
The N conversion steps include a first conversion step that determines a conversion basis function based on the region information and performs conversion processing, and a first conversion step that determines a conversion basis function regardless of the region information and performs conversion processing. And a conversion step.

A program for performing the conversion decoding method according to claim 9 by a computer,
The above program is stored on the computer.
A code string generated by encoding an image signal composed of M regions using N (N ≧ 2) conversion methods and region information indicating the region are input,
A code string dividing step of dividing the code string based on information indicating the conversion method described in the code string;
N inverse transform steps for performing inverse transform on the code string divided by the code string dividing step to generate a decoded image;
Performing a synthesizing step of synthesizing the decoded image generated by the inverse transformation step;
The N inverse transform steps include a first inverse transform step that performs a transform process by determining a transform basis function based on the region information, and a transform basis function that determines a transform basis function regardless of the region information. And a second inverse conversion step for performing processing.

A recording medium storing data, wherein the data is
With an image signal composed of M areas and area information indicating the areas as inputs,
A dividing step of dividing the image signal into image blocks;
N (N ≧ 2) conversion steps for converting the image block into a code string;
A selection step of selecting one code sequence from the N code sequences generated by the N conversion steps;
Generated by the code sequence generation step of synthesizing the code sequences selected in the selection step and generating an output code sequence;
The recording medium, wherein the data describes an identifier of a code string selected in the selection step in the code string generation step.

A recording medium storing data, wherein the data is
With an image signal composed of M areas and area information indicating the areas as inputs,
A dividing step of dividing the image signal into image blocks;
N (N ≧ 2) conversion steps for converting the image block into a code string;
A selection step of selecting one code sequence from the N code sequences generated by the N conversion steps;
Generated by the code sequence generation step of synthesizing the code sequences selected in the selection step and generating an output code sequence;
The N conversion steps represent an image signal in the block by a linear sum of conversion bases selected from a predetermined conversion base group, the number of the conversion base, and the pixel in the block in which the conversion base is arranged. A conversion step for describing a position, a weight for adding the conversion bases in a code string,
The data describes the number of the conversion base, the position in the block in which the conversion base is arranged, and the weight for adding the conversion base in the order in which the conversion bases are selected. .

A recording medium storing data, wherein the data is
With an image signal composed of M areas and area information indicating the areas as inputs,
A dividing step of dividing the image signal into image blocks;
N (N ≧ 2) conversion steps for converting the image block into a code string;
A selection step of selecting one code sequence from the N code sequences generated by the N conversion steps;
Generated by the code sequence generation step of synthesizing the code sequences selected in the selection step and generating an output code sequence;
The N conversion steps represent an image signal in the block by a linear sum of conversion bases selected from a predetermined conversion base group, the number of the conversion base, and the pixel in the block in which the conversion base is arranged. A conversion step for describing a position, a weight for adding the conversion bases in a code string,
The recording medium is characterized in that the data describes a number of the conversion base and a weight for adding the conversion base in a predetermined pixel order in the block.