JPH06253281A

JPH06253281A - Picture signal orthogonal transformation method

Info

Publication number: JPH06253281A
Application number: JP3582393A
Authority: JP
Inventors: Hiroyuki Tsuji; 裕之辻; Kazuto Kamikura; 一人上倉
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1993-02-24
Filing date: 1993-02-24
Publication date: 1994-09-09

Abstract

PURPOSE:To concentrate transformation coefficients onto a base of a low frequency with less calculation processing when signals at an optionally shaped area in a picture are orthogonally transformed. CONSTITUTION:A picture analysis section 2 analyzes input picture data from a terminal 1 and outputs data 21 comprising lateral picture elements (a) and longitudinal picture elements (b) of a square area V surrounding a processing object picture segment W, W shape data 23, and W picture data 24. A base arithmetic operation section 3 calculates aXb sets of bases with respect to the area V and outputs a base picture in which the bases are arranged sequentially from a lower frequency in both longitudinal and lateral directions. A base section 4 applies zigzag scanning to the base picture from a low frequency to select n-set of base objects. A projection transformation section 5 projects the base objects onto the segment W and a base orthogonal section 6 makes orthogonal processing. DCT is applied to the picture data 24 by using the orthogonal bases for bases of the segment W and a transformation coefficient is outputted to a terminal 7.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、画像信号を効率よく伝
送または蓄積するための画像符号化方法において、画像
内の任意形状領域の信号を直交変換する画像信号直交変
換方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image signal orthogonal transformation method for orthogonally transforming a signal of an arbitrary shape region in an image in an image coding method for efficiently transmitting or accumulating an image signal.

【０００２】[0002]

【従来の技術】画像内の任意形状領域の信号を直交変換
して符号化する方式は、例えば"Ｃoding of Ａrbitrary
Ｓhaped Ｉmage Ｓegments Ｂased on a Ｇeneralized
Ｏrthogonal Ｔransform”（Ｍ．Ｇilge，et al，Ｓign
al Ｐrocessing：ＩmageCommunication Ｖol.１，１９
９０）に記載されている。ここでは、動画像信号に対し
てフレーム間予測を行い、その予測誤差値がある定めら
れたしきい値以上である任意形状領域Ｗに対して直交変
換を行い、その変換係数を量子化・符号化する方式が示
されている。更に、この文献には、ｎ画素からなる任意
形状領域Ｗに対する直交変換基底を求める方法として、
その任意形状領域Ｗを取り囲む横ａ画素、縦ｂ画素から
なる方形領域Ｖでのａ×ｂ個の直交変換基底を求め、そ
の直交変換基底を任意形状領域Ｗに射影して新たにａ×
ｂ個の基底を求め直し、それらの基底からｎ個を選定し
再直交化して直交変換基底を求めることが記述されてい
るが、ａ×ｂ個の基底を選定する具体的な方法について
は何ら述べられていない。2. Description of the Related Art A method of orthogonally transforming and coding a signal in an arbitrarily shaped region in an image is, for example, "Coding of Arbitrary".
Shaped Image Segments Based on a Generalized
Orthogonal Transform "(M. Gilge, et al, Sign
al Processing: ImageCommunication Vol.1, 19
90). Here, inter-frame prediction is performed on a moving image signal, orthogonal transformation is performed on an arbitrary shape region W whose prediction error value is greater than or equal to a predetermined threshold value, and the transform coefficient is quantized and encoded. The method of conversion is shown. Further, in this document, as a method for obtaining an orthogonal transformation basis for an arbitrary shape region W consisting of n pixels,
A × b orthogonal transformation bases in a rectangular region V, which is composed of horizontal a pixels and vertical b pixels surrounding the arbitrary shape region W, are obtained, and the orthogonal transformation bases are projected onto the arbitrary shape region W to newly obtain a ×
Although it is described that b bases are recalculated, n bases are selected from these bases, and orthogonal transformation bases are calculated by reorthogonalization, there is no specific method for selecting a × b bases. Not mentioned.

【０００３】一方、ａ×ｂ個の基底からｎ個の基底を選
定する従来技術としては、例えば”Ｓegment-based Ｉm
age coding by Ｓhape Ｏrthogonal Ｔransform”
（Ｙ．Ｋato，Ｐicture Ｃoding Ｓymposium’９１，１
２．４−１，１９９１）に記載の方法がある。この方法
は、任意形状領域Ｗに射影したａ×ｂ個の基底を正規化
し、モデル化された画像信号をそれらの基底にもとづい
て変換した際にエネルギー集中度の高くなる基底から順
にｎ個を選定するというものである。この方法により選
定されたｎ個の基底を再直交化すると、それらの直交基
底へのエネルギー集中度は大きく偏るため、それによる
直交変換係数を量子化・符号化することにより、画像信
号を効率的に符号化できる。On the other hand, a conventional technique for selecting n bases from a × b bases is, for example, "Segment-based Im".
age coding by Shape Orthogonal Transform ”
(Y. Kato, Picture Coding Symposium '91, 1
2.4-1, 1991). This method normalizes a × b bases projected on an arbitrary shape region W, and when the modeled image signal is converted based on these bases, n bases are selected in order from the base having the highest energy concentration. It is to select. When n bases selected by this method are re-orthogonalized, the degree of energy concentration on the orthogonal bases is greatly biased. Therefore, the orthogonal transform coefficient is quantized and coded to efficiently convert the image signal. Can be encoded as

【０００４】[0004]

【発明が解決しようとする課題】上記従来技術によりａ
×ｂ個の基底からｎ個の基底を選定するためには、まず
画像信号をモデル化し、その信号を用いて各基底に対す
るエネルギー集中度を計算しなければならず、さらに、
その計算を領域の形状が異なるたびに行わなければなら
ず、計算量が非常に膨大となる問題がある。According to the above conventional technique, a
In order to select n bases from xb bases, the image signal must first be modeled, and the energy concentration degree for each base must be calculated using the signal.
The calculation has to be performed every time the shape of the region is different, which causes a problem that the amount of calculation becomes very large.

【０００５】本発明の目的は、少ない計算量でａ×ｂ個
の基底からｎ個の基底を選定し、しかも上記従来手法と
同程度に変換係数を低域の基底へ集中させることができ
る、画像内のｎ画素からなる任意形状領域の信号の直交
変換方法を提供することにある。The object of the present invention is to select n bases from a × b bases with a small amount of calculation, and to concentrate the conversion coefficients on the low-frequency bases to the same extent as in the above conventional method. An object of the present invention is to provide a method for orthogonally transforming a signal in an arbitrarily shaped region consisting of n pixels in an image.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、画像内のｎ画素からなる任意形状領域Ｗ
を取り囲む横ａ画素、縦ｂ画素の方形領域Ｖでのａ×ｂ
個の基底を求め、その直交変換基底を任意形状領域Ｗに
射影して新たにａ×ｂ個の基底を求め直し、それらの基
底からｎ個を選定し、再直交化して直交変換基底を求め
るにあたり、任意形状領域Ｗを取り囲む横ａ画素、縦ｂ
画素からなる方形領域Ｖでのａ×ｂ個の直交変換基底を
求める際に、横方向、縦方向の各々に関して低い周波数
成分を表す基底から高い周波数成分を表す基底の順序に
横ａ×縦ｂ個の基底を並べ、それらの基底を任意形状領
域Ｗに射影し直した基底からｎ個を選定する際に、横ａ
×縦ｂ個の基底の中で横縦共に最も低い周波数成分を表
している基底から最も高い周波数成分を表している基底
へ向かって単純にジグザグの順序でスキャンをして、上
位ｎ個の基底を選定するようにしたことである。In order to achieve the above object, the present invention provides an arbitrary shape region W consisting of n pixels in an image.
A × b in a rectangular area V of horizontal a pixels and vertical b pixels surrounding the
Number of bases, the orthogonal transformation bases are projected onto the arbitrary shape region W, a * b bases are newly obtained, n pieces are selected from these bases, and the orthogonal transformation bases are obtained by re-orthogonalization. At this time, a horizontal pixel and a vertical b surrounding the arbitrarily shaped region W
When obtaining a × b orthogonal transformation bases in a rectangular region V made up of pixels, horizontal a × vertical b are arranged in the order of a base representing a low frequency component to a base representing a high frequency component in each of the horizontal and vertical directions. When n bases are arranged and n bases are selected from the bases obtained by reprojecting these bases on the arbitrary shape region W, the horizontal a
× The top n bases are simply scanned in a zigzag order from the base that represents the lowest frequency component in the horizontal and vertical directions to the base that represents the highest frequency component in the b vertical bases. Is to be selected.

【０００７】[0007]

【作用】基底を低い周波数成分から高い周波数成分の順
序に並べることは、直交変換基底を求めると同時に容易
に行うことが可能である。また、画像信号は一般に低周
波数成分にエネルギーが集中しているため、最も低い周
波数成分を表している基底から単純にジグザグの順序で
スキャンをして、その上位ｎ個の基底を選定することに
より、画像信号のモデルを用いるまでもなく、上記従来
手法と同程度に各基底へのエネルギー集中度を偏らせる
ことが可能であり、やはり効率的な符号化ができる。It is possible to arrange the bases in order from the low frequency component to the high frequency component at the same time as obtaining the orthogonal transform base. In addition, since image signals generally have energy concentrated in low-frequency components, by simply scanning in a zigzag order from the basis representing the lowest frequency component and selecting the upper n basis thereof. Even without using a model of an image signal, it is possible to bias the degree of energy concentration in each base to the same extent as in the above-mentioned conventional method, and efficient encoding can be performed.

【０００８】[0008]

【実施例】以下、本発明の一実施例について図面により
詳述する。An embodiment of the present invention will be described in detail below with reference to the drawings.

【０００９】図１は画像内の任意形状領域の信号を直交
変換し、その直交変換係数を量子化・符号化する際の、
本発明にかかわる変換基底導出の部分の一実施例を示す
ブロック図である。なお、本実施例では直交変換として
ＤＣＴを用いている。図において、１は入力端子、２は
画像解析部、３はＤＣＴ基底演算部、４は基底選択部、
５は射影変換部、６は基底直交化部、７は出力端子であ
る。また、２１は任意形状領域である対象セグメントＷ
を取り囲む方形領域Ｖの縦横の長さを示すデータ、２２
は対象セグメントＷを構成する画素数を示すデータ、２
３は対象セグメントＷの形状を表すデータ、２４は対象
セグメントＷ内に於ける輝度成分等の画像データを示し
ている。以下、順を追って図１の動作を説明をする。FIG. 1 shows a case where a signal in an arbitrary shape region in an image is orthogonally transformed and the orthogonal transformation coefficient is quantized / encoded.
It is a block diagram which shows one Example of the conversion base derivation | leading-out part which concerns on this invention. In this embodiment, DCT is used as the orthogonal transform. In the figure, 1 is an input terminal, 2 is an image analysis unit, 3 is a DCT basis calculation unit, 4 is a base selection unit,
Reference numeral 5 is a projective transformation unit, 6 is a base orthogonalization unit, and 7 is an output terminal. Further, 21 is a target segment W that is an arbitrarily shaped region
Data indicating vertical and horizontal lengths of a rectangular area V surrounding the
Is data indicating the number of pixels forming the target segment W, 2
Reference numeral 3 indicates data representing the shape of the target segment W, and reference numeral 24 indicates image data such as a luminance component in the target segment W. Hereinafter, the operation of FIG. 1 will be described step by step.

【００１０】入力端子１に画像データが送り込まれる。
本例では、フレーム間差分をとった予測誤差画像データ
が送り込まれると想定する。この画像データは画像解析
部２に入り、いくつかのセグメントに分けられるととも
に、各セグメントに対して領域の解析が行われる。この
画像解析部２から、セグメントＷを取り囲む方形領域Ｖ
の縦横の長さを示すデータ２１、セグメントＷを構成す
る画素数を示すデータ２２、および、セグメントＷの形
状を表すデータ２３が、領域に関する情報として送り出
される。本例では、画像内に図２のようなセグメントＷ
が含まれていたと想定する。図２は、対象セグメントＷ
を取り囲む方形領域Ｖは横１１×縦１２の計１３２画素
からなる領域で、対象セグメントＷは６７画素からなる
部分領域であることを示している。Image data is sent to the input terminal 1.
In this example, it is assumed that the prediction error image data obtained by taking the inter-frame difference is sent. This image data enters the image analysis unit 2, is divided into several segments, and the area is analyzed for each segment. From this image analysis unit 2, a rectangular area V surrounding the segment W is displayed.
Data 21 indicating the length and width of the segment W, data 22 indicating the number of pixels forming the segment W, and data 23 indicating the shape of the segment W are sent out as information regarding the area. In this example, the segment W as shown in FIG.
Was included. 2 shows the target segment W
A rectangular area V surrounding 11 is a total of 132 pixels in the horizontal direction and 12 in the vertical direction, and the target segment W is a partial area including 67 pixels.

【００１１】データ２１はＤＣＴ基底演算部３に送り込
まれ、方形領域Ｖに対するＤＣＴ基底が計算される。こ
のＤＣＴ基底を求める際に、横方向、縦方向の各々に関
して、低い周波数成分を表す基底から高い周波数成分を
表す基底の順序に、横ａ×縦ｂ個の基底をならべる。図
３は、図２の例について、方形領域Ｖの横１１×縦１２
の計１３２画素のＤＣＴ基底を縦横方向がそれぞれ低域
からの順番になるように並べた様子を表し、ひとつの方
形がそれぞれ基底画像に対応している。The data 21 is sent to the DCT basis calculation unit 3 and the DCT basis for the rectangular region V is calculated. When obtaining the DCT bases, horizontal a × vertical b bases are arranged in the order of a base representing a low frequency component to a base representing a high frequency component in each of the horizontal and vertical directions. FIG. 3 shows, for the example of FIG.
The DCT bases of a total of 132 pixels are arranged so that the vertical and horizontal directions are arranged in order from the low frequency range, and one square corresponds to each base image.

【００１２】こうして得られた基底画像は、対象セグメ
ントＷを構成する画素数（図２では６７画素）を示した
データ２２とともに基底選定部４に渡される。基底選定
部４では、方形領域Ｖの基底画像について、横縦共に最
も低い周波数成分を表している基底から最も高い周波数
成分を表している基底へ向かってジグザグにスキャン
し、その上位のセグメントＷを構成する画素数分の基底
を選定する。本例では、図３の矢線に示したようなジグ
ザクスキャンの順番で、６７個の基底候補が選び出され
る。The base image thus obtained is passed to the base selecting section 4 together with the data 22 indicating the number of pixels (67 pixels in FIG. 2) forming the target segment W. The base selection unit 4 scans the base image of the rectangular region V in a zigzag manner from the base representing the lowest frequency component in the horizontal and vertical directions to the base representing the highest frequency component, and the upper segment W thereof is scanned. Select as many bases as there are pixels to configure. In this example, 67 basis candidates are selected in the zigzag scan order as shown by the arrow in FIG.

【００１３】こうして選定された６７個の基底候補は射
影変換部５に送られる。射影変換部５では、対象セグメ
ントＷの形状を示したデータ２３を参照しながら、６７
個の基底を該セグメントＷに射影する。この射影された
基底は、一般に直交性がくずれているため、さらに基底
直交化部６に送られ、ここで直交化の手続きがなされ
る。直交化は、例えば、Ｓchmidtの方法やＨouseholder
の方法によれば良い。The 67 basis candidates thus selected are sent to the projective transformation unit 5. The projective transformation unit 5 refers to the data 23 showing the shape of the target segment W and refers to 67
Project the bases onto the segment W. Since the orthogonality of the projected base is generally broken, the base is further sent to the base orthogonalization unit 6, where the orthogonalization procedure is performed. For orthogonalization, for example, the Schmidt method or Householder
Method.

【００１４】以上のようにして、最終的に得られた６７
個の要素からなる直交基底を対象セグメントＷの基底と
して用い、画像解析部２から送られてくる該セグメント
Ｗ内の輝度成分等の画像データ２４に対してＤＣＴを施
す。こうして得られた変換係数が端子７から出力され
る。As described above, 67 finally obtained.
The orthogonal base composed of these elements is used as the base of the target segment W, and DCT is applied to the image data 24 such as the luminance component in the segment W sent from the image analysis unit 2. The conversion coefficient thus obtained is output from the terminal 7.

【００１５】[0015]

【発明の効果】以上説明したように、本発明によれば、
画像内の任意形状領域の信号を直交変換する際、より少
ない計算処理で従来手法と同程度に変換係数を低域の基
底へ集中させることができ、効率的な符号化が可能とな
る。As described above, according to the present invention,
When orthogonally transforming a signal in an arbitrarily shaped region in an image, the transform coefficients can be concentrated on the low-frequency base to the same extent as in the conventional method with less calculation processing, and efficient encoding is possible.

[Brief description of drawings]

【図１】本発明の一実施例のブロック図である。FIG. 1 is a block diagram of an embodiment of the present invention.

【図２】任意形状領域Ｗとそれを取り囲む方形領域Ｖの
一例を示す図である。FIG. 2 is a diagram showing an example of an arbitrarily shaped area W and a rectangular area V surrounding it.

【図３】ＤＣＴ基底を縦横方向にそれぞれ低域から高域
へ並べ、ジグザグスキャンで基底を選定することを説明
する図である。FIG. 3 is a diagram for explaining arranging DCT bases in the vertical and horizontal directions from a low band to a high band and selecting a base by zigzag scanning.

[Explanation of symbols]

１入力端子２画像解析部３ＤＣＴ基底演算部４基底選定部５射影変換部６基底直交化部７出力端子 1 Input Terminal 2 Image Analysis Section 3 DCT Basis Operation Section 4 Basis Selection Section 5 Projection Transformation Section 6 Base Orthogonalization Section 7 Output Terminal

Claims

[Claims]

1. A method of orthogonally transforming a signal of an arbitrary shape region in an image, wherein a × b is set for a rectangular region V of horizontal a pixels and vertical b pixels surrounding an arbitrary shape region W consisting of n pixels in the image. Number of orthogonal transformation bases are obtained, the a × b orthogonal transformation bases are projected onto the arbitrary shape region W, a × b bases are newly obtained, and the newly obtained a × b pieces are obtained. From the n pixels by selecting n bases from the bases, re-orthogonalizing the selected n bases to obtain n orthogonal bases, and using the re-orthogonalized orthogonal bases. A method of orthogonally transforming a signal of an arbitrary shape region W, wherein when obtaining a × b orthogonal transform bases for the rectangular region V, a base higher than a base representing a low frequency component in each of a horizontal direction and a vertical direction is obtained. The order of bases representing frequency components is a ×
When n bases are selected from new axb bases re-obtained by arraying b bases and projecting the axb orthogonal transformation bases onto the arbitrary shape region W, Of the b bases, the top n bases are selected by scanning in a zigzag order from the base representing the lowest frequency component in the horizontal and vertical directions to the base representing the highest frequency component. An image signal orthogonal transformation method characterized by: