JP2005135249A

JP2005135249A - Image region dividing device

Info

Publication number: JP2005135249A
Application number: JP2003372222A
Authority: JP
Inventors: Haruhisa Kato; 晴久加藤; Yasuyuki Nakajima; 康之中島
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2003-10-31
Filing date: 2003-10-31
Publication date: 2005-05-26

Abstract

<P>PROBLEM TO BE SOLVED: To provide a region dividing device capable of forming divided areas with high speed and accuracy by using information obtained through the partial decoding of compression-encoded data. <P>SOLUTION: A partial decoding part 1 partially decodes the compression-encoded data. First to third area dividing process parts 2-4 are hierarchized. The first area dividing process part 2 comprises a conversion coding information determining part 2-1, a coding mode information determining part 2-2, and a movement prediction information determining part 2-3 and provides rough area division. The second area dividing process part 3 comprises a movement prediction error information determining part 3-1 and subdivides the result of division by the first area division process part 2. The third area division process part 4 comprises an edge detection part 4-1 and subdivides the result of division by the second area division process part 3. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、映像の領域分割装置に関し、特に、圧縮符号化データを部分的に復号して得られる情報を用いて、高速かつ高精度に分割された領域を形成することができる領域分割装置に関するものである。 The present invention relates to an image area dividing device, and more particularly, to an area dividing device capable of forming an area divided at high speed and with high accuracy using information obtained by partially decoding compression-encoded data. Is.

従来、映像の領域分割技術として、Cannyエッジ検出方式、動的輪郭モデルであるスネークに基づく手法が知られている。映像は非常に広い信号帯域を持ち、膨大なデータ量を必要とするので、その伝送や記録は圧縮符号化データの形で行われるのが一般的であり、このような圧縮符号化データの形の映像に、従来の領域分割技術を適用する場合、圧縮符号化データは一旦画素領域まで復号され、画素領域で分割された領域が形成される。
J.Canny“A Computational Approach to Edge Detection”IEEE Trans. on Pattern Analysis and Machine Intelligence 8(6),1986. Conventionally, methods based on Canny edge detection and a snake that is an active contour model are known as image segmentation techniques. Since a video has a very wide signal band and requires a huge amount of data, transmission and recording are generally performed in the form of compression-encoded data. When the conventional area division technique is applied to this video, the compression encoded data is once decoded to the pixel area, and an area divided by the pixel area is formed.
J. Canny “A Computational Approach to Edge Detection” IEEE Trans. On Pattern Analysis and Machine Intelligence 8 (6), 1986.

しかしながら、従来の領域分割技術では、処理負荷の大きい復号処理により圧縮符号化データを一旦画素領域まで復号した後、分割された領域を形成するため、本来の領域分割処理に加えて圧縮符号化データの復号処理を必要とし、多大の計算コストがかかるという課題がある。また、処理速度が遅いという課題もある。 However, in the conventional area division technique, the compression-encoded data is decoded in addition to the original area division process in order to form a divided area after decoding the compression-encoded data to the pixel area by a decoding process with a large processing load. There is a problem that a large amount of calculation cost is required. There is also a problem that the processing speed is slow.

動的輪郭モデルであるスネークに基づく手法は、領域分割の精度に優れているが、膨大な計算量の反復演算を必要としており、また、領域分割の精度が初期形状やパラメータ設定に依存しやすいという課題がある。 The snake-based method, which is an active contour model, excels in segmentation accuracy, but requires an iterative calculation with a large amount of calculation, and the segmentation accuracy tends to depend on the initial shape and parameter settings. There is a problem.

本発明の目的は、前述した従来技術の課題を解決し、圧縮符号化データを部分的に復号して得られる情報を用いて、高速かつ高精度に分割された領域を形成することができる領域分割装置を提供することにある。 An object of the present invention is to solve the above-described problems of the prior art and form an area that can be divided at high speed and with high accuracy using information obtained by partially decoding compression-encoded data. It is to provide a dividing device.

前記した課題を解決するために、本発明は、圧縮符号化された映像を複数の領域に分割する映像の領域分割装置において、圧縮符号化データを部分的に復号する部分復号手段と、前記部分復号手段で部分的に復号された情報を用いて分割された領域を形成する少なくとも１つの領域分割処理手段を含み、互いに異なる分解能で分割された領域を形成する複数の領域分割処理手段とを備えた点に第１の特徴がある。 In order to solve the above-described problem, the present invention provides a partial decoding unit that partially decodes compression-encoded data in a video region dividing device that divides a compression-encoded video into a plurality of regions, A plurality of area division processing means including at least one area division processing means for forming a divided area using information partially decoded by the decoding means, and forming areas divided at different resolutions. There is a first feature.

また、本発明は、前記複数の領域分割処理手段が、領域分割の分解能の低いものから高いものへと階層化され、該階層に従って順次、分割された領域を形成する点に第２の特徴がある。 Further, the present invention has a second feature in that the plurality of area division processing means are hierarchized from one having a low resolution of area division to one having a high resolution, and sequentially forming divided areas according to the hierarchy. is there.

また、本発明は、前記複数の領域分割処理手段の各々が、上位階層の領域分割処理手段の分割結果を利用して分割された領域を形成する点に第３の特徴がある。 Further, the present invention has a third feature in that each of the plurality of area division processing means forms a divided area by using a division result of the upper layer area division processing means.

また、本発明は、前記複数の領域分割処理手段が、符号化情報の割り当て単位に応じて階層化された領域分割処理手段を含む点に第４の特徴がある。 In addition, the present invention has a fourth feature in that the plurality of area division processing means include area division processing means hierarchized according to an encoding information allocation unit.

また、本発明は、前記複数の領域分割処理手段が、符号化情報の割り当て単位ごとの符号化情報を用いて分割された領域を形成する領域分割処理手段を含む点に第５の特徴がある。 In addition, the present invention has a fifth feature in that the plurality of area division processing means includes area division processing means for forming a divided area using coding information for each encoding information allocation unit. .

また、本発明は、前記複数の領域分割処理手段が、変換符号化情報と符号化モード情報と動き予測情報を用いて分割された領域を形成する低分解能の領域分割処理手段を含む点に第６の特徴がある。 Further, the present invention is characterized in that the plurality of area division processing means include a low resolution area division processing means for forming an area divided by using transform coding information, coding mode information, and motion prediction information. There are 6 features.

また、本発明は、前記低分解能の領域分割処理手段が、色差信号についての変換符号化情報からブロック間の色信号ＤＣ成分の差が閾値以下かどうかを判別し、この差が閾値以下と判別したブロックを統合して分割された領域を形成する手段を有する点に第７の特徴がある。 Further, according to the present invention, the low resolution region division processing unit determines whether or not the difference of the color signal DC component between the blocks is less than a threshold from the transform coding information about the color difference signal, and determines that the difference is less than the threshold. There is a seventh feature in that it has means for forming divided regions by integrating the blocks.

また、本発明は、前記低分解能の領域分割処理手段が、符号化モード情報から一定時間内の最頻符号化モード情報を位置ごとに抽出し、該最頻符号化モードの同一性を利用して分割された領域を形成する手段を有する点に第８の特徴がある。 Further, according to the present invention, the low resolution region segmentation processing means extracts the mode encoding mode information within a predetermined time from the encoding mode information for each position, and uses the identity of the mode encoding mode. There is an eighth feature in that it has means for forming divided regions.

また、本発明は、前記低分解能の領域分割処理手段が、動き予測情報から得られる動きベクトルの方向および長さの類似性を利用して分割された領域を形成する手段を有する点に第９の特徴がある。 The ninth aspect of the present invention is that the low-resolution area dividing processing means includes means for forming a divided area by using the similarity of the direction and length of the motion vector obtained from the motion prediction information. There are features.

また、本発明は、前記複数の領域分割処理手段が、変換符号化情報を用いて分割された領域を形成する領域分割処理手段を含む点に第１０の特徴がある。 In addition, the present invention has a tenth feature in that the plurality of area division processing means include area division processing means for forming an area divided using transform coding information.

また、本発明は、前記変換符号化情報を用いて分割された領域を形成する領域分割処理手段が、輝度信号についての変換符号化情報からブロック間の輝度信号ＤＣ成分の差が閾値以下かどうかを判別し、この差が閾値以下と判別したブロックを統合して分割された領域を形成する手段を有する点に第１１の特徴がある。 In the present invention, the region division processing means for forming a region divided using the transform coding information may determine whether the difference in luminance signal DC component between blocks from the transform coding information for the luminance signal is equal to or less than a threshold value. There is an eleventh feature in that it has means for forming divided areas by integrating blocks determined as having a difference equal to or less than a threshold.

また、本発明は、前記変換符号化情報を用いて分割された領域を形成する領域分割処理手段が、輝度信号についての変換符号化情報のＡＣ成分から輝度信号のエッジの有無と方向を推定するエッジ推定手段を有し、この推定結果を利用して分割された領域を形成する点に第１２の特徴がある。 Further, according to the present invention, an area division processing unit that forms an area divided using the transform coding information estimates the presence / absence and direction of an edge of the luminance signal from the AC component of the transform coding information for the luminance signal. There is a twelfth feature in that it has an edge estimation means and forms a divided region using this estimation result.

また、本発明は、前記エッジ推定手段が、輝度信号についての変換符号化情報の水平方向および垂直方向それぞれのＡＣ成分部分絶対値和を利用して輝度信号のエッジの有無と方向を推定する点に第１３の特徴がある。 Further, according to the present invention, the edge estimation means estimates the presence / absence and direction of an edge of a luminance signal using the AC component partial absolute value sums in the horizontal direction and the vertical direction of the transform coding information for the luminance signal. There is a thirteenth feature.

また、本発明は、前記エッジ推定手段が、前記ＡＣ成分部分絶対値和と閾値の比較および水平方向および垂直方向それぞれのＡＣ成分部分絶対値和の大小関係あるいは両者の比を利用して輝度信号のエッジの有無と方向を推定する点に第１４の特徴がある。 Further, according to the present invention, the edge estimator uses a comparison between the AC component partial absolute value sum and a threshold value, a magnitude relationship between the AC component partial absolute value sums in the horizontal direction and the vertical direction, or a ratio between the two. There is a fourteenth feature in that the presence / absence and direction of the edge are estimated.

また、本発明は、前記変換符号化情報を用いて分割された領域を形成する領域分割処理手段が、ブロック単位以上の分解能で分割された領域を形成する点に第１５の特徴がある。 In addition, the present invention has a fifteenth feature in that the area division processing means for forming an area divided using the transform coding information forms an area divided at a resolution equal to or greater than a block unit.

また、本発明は、前記複数の領域分割処理手段が、輝度信号と色差信号の画素情報を用いて分割された領域を形成する領域分割処理手段を含む点に第１６の特徴がある。 In addition, the present invention has a sixteenth feature in that the plurality of area division processing means includes area division processing means for forming a divided area using pixel information of a luminance signal and a color difference signal.

さらに、本発明は、前記輝度信号と色差信号の画素情報を用いて分割された領域を形成する領域分割処理手段が、輝度信号と色差信号のエッジを個別に検出し、検出されたエッジを利用して分割された領域を形成する点に第１７の特徴がある。 Further, according to the present invention, an area division processing unit that forms an area divided using pixel information of the luminance signal and the color difference signal individually detects edges of the luminance signal and the color difference signal and uses the detected edges. Thus, there is a seventeenth feature in that a divided region is formed.

本発明では、圧縮符号化データを部分的に復号することにより得られる情報を用いて分割された領域を形成し、また、互いに異なる分解能で分割された領域を形成するので、計算量を少なくして処理コストを大幅に低減させることができ、また、要求される処理速度や分解能を勘案して適当な分解能での領域分割結果を適宜選択して得ることができる。 In the present invention, divided areas are formed using information obtained by partially decoding compression-encoded data, and areas divided at different resolutions are formed, so that the amount of calculation is reduced. Thus, the processing cost can be greatly reduced, and the region segmentation result with an appropriate resolution can be appropriately selected in consideration of the required processing speed and resolution.

また、複数の領域分割処理手段を階層化し、上位の分割結果を順次利用して分割された領域を形成することにより、高速かつ効率的に分割された領域を形成することが可能になる。さらに、符号化情報を用いてエッジ推定を行うことにより、符号化情報を用いるにも拘わらず、符号化単位以上の分解能で分割された領域を形成することが可能になる。 In addition, by layering a plurality of area division processing means and using the upper division results in order to form divided areas, it is possible to form divided areas at high speed and efficiently. Furthermore, by performing edge estimation using the encoded information, it is possible to form a region divided with a resolution equal to or higher than the encoding unit despite using the encoded information.

以下、図面を参照して本発明を詳細に説明する。図１は、本発明に係る映像の領域分割装置の一実施形態を示すブロック図である。以下では、国際標準であるMPEG-1ビデオ（ISO/IEC11172-2）で符号化された動画像に対して分割処理を適用した例について説明するが、本発明は、この符号化方式に限定されず、他の符号化方式にも適用できる。 Hereinafter, the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of a video area dividing apparatus according to the present invention. In the following, an example in which segmentation processing is applied to a moving image encoded with the MPEG-1 video (ISO / IEC11172-2), which is an international standard, will be described. However, the present invention is limited to this encoding method. In addition, the present invention can be applied to other encoding methods.

図１において、システム全体の入力として映像の圧縮符号化データが与えられる。この圧縮符号化データは、部分復号部１に入力される。部分復号部１は、圧縮符号化データを部分的に復号し、輝度信号および色差信号についての変換符号化情報（ＤＣＴ係数）、マクロブロックごとの動き予測（動きベクトル情報）および符号化モード情報を生成する。 In FIG. 1, video compression-encoded data is given as input of the entire system. The compressed encoded data is input to the partial decoding unit 1. The partial decoding unit 1 partially decodes the compression-encoded data, and receives transform coding information (DCT coefficient), motion prediction (motion vector information) and coding mode information for each macroblock for the luminance signal and the color difference signal. Generate.

なお、ここでの変換符号化情報（ＤＣＴ係数）とは、MPEG方式のＰ，Ｂピクチャのようにフレーム間の誤差が変換符号化されたものではなく、Ｉピクチャのようにフレームそのものが変換符号化されて生成される情報を意味する。したがって、Ｐ，Ｂピクチャの場合には、その情報をここで意味する変換符号化情報に再構成して処理対象にしたり、当該フレームではマクロブロックごとの動き予測（動きベクトル情報）や符号化モード情報処理を用いた領域分割処理のみを行うなどといった配慮を必要とする。 Note that the transform coding information (DCT coefficient) here is not information obtained by transform coding an error between frames as in the P and B pictures of the MPEG system, but the frame itself as in the I picture. It means the information that is generated as a result. Therefore, in the case of P and B pictures, the information is reconstructed into transform coding information as used herein to be processed, or motion prediction (motion vector information) and coding mode for each macroblock in the frame. It is necessary to consider that only area division processing using information processing is performed.

第１の領域分割処理部２は、変換符号化情報判定部２−１と符号化モード情報判定部２−２と動き予測情報判定部２−３とから構成され、色差信号についての情報を用いて大まかな領域分割を行う。 The first region division processing unit 2 includes a transform coding information determination unit 2-1, a coding mode information determination unit 2-2, and a motion prediction information determination unit 2-3, and uses information about color difference signals. And rough segmentation.

変換符号化情報判定部２−１は、色差信号についての変換符号化情報のＤＣ成分の差を調べ、その差が閾値以下のブロックを統合することにより大まかに分割された領域を形成する。 The transform coding information determination unit 2-1 examines the difference in DC components of the transform coding information for the color difference signal, and forms roughly divided regions by integrating blocks whose difference is equal to or less than a threshold value.

符号化モード情報判定部２−２は、まず、対象フレームを含む時間的前後のフレームにおける同位置の符号化モード情報を累計し、出現頻度が多い符号化モードを代表値とする。次に、変換符号化情報判定部２−１の分割結果に対して符号化モード代表値ごとの領域分割処理を施すことにより、より細かく分割された領域を形成する。 The encoding mode information determination unit 2-2 first accumulates the encoding mode information at the same position in frames before and after the time including the target frame, and sets an encoding mode having a high appearance frequency as a representative value. Next, a region divided more finely is formed by performing region division processing for each coding mode representative value on the division result of the transform coding information determination unit 2-1.

動き予測情報判定部２−３は、符号化モード情報判定部２−２による分割領域内のマクロブロックの動き予測情報の向きおよび長さの類似性を調べ、その類似性を基に符号化モード情報判定部２−２の分割結果に対して領域分割を施すことにより、さらに細かく分割された領域を形成する。 The motion prediction information determination unit 2-3 examines the similarity of the direction and length of the motion prediction information of the macroblock in the divided area by the encoding mode information determination unit 2-2, and encodes the coding mode based on the similarity. By performing region division on the division result of the information determination unit 2-2, a more finely divided region is formed.

MPEG方式では動き予測（動きベクトル情報）や符号化モード情報がマクロブロック（１６×１６画素）単位の情報であるので、第１の領域分割処理部２によりマクロブロック単位の精度で分割された領域が形成される。 In the MPEG system, since motion prediction (motion vector information) and coding mode information are information in units of macroblocks (16 × 16 pixels), areas divided by the first area division processing unit 2 with accuracy in units of macroblocks Is formed.

第１の領域分割処理部２の分割結果は、本装置の第１の領域分割結果として出力される。また、より細かい分割結果が要求される場合、第１の分割結果は第２の領域分割処理部３に与えられる。 The division result of the first area division processing unit 2 is output as the first area division result of the present apparatus. When a finer division result is required, the first division result is given to the second area division processing unit 3.

なお、ＰピクチャやＢピクチャの場合には、上述したように、変換符号化情報判定部２−１での処理は行わず、符号化モード情報判定部２−２や動き予測情報判定部２−３での処理で分割された領域を分割結果として出力するようにすればよい。 In the case of a P picture or a B picture, as described above, the process in the transform coding information determination unit 2-1 is not performed, and the coding mode information determination unit 2-2 or the motion prediction information determination unit 2- It suffices to output the region divided by the processing in 3 as the division result.

第２の領域分割処理部３は、変換符号化情報判定部３−１で構成され、第１の分割結果および輝度信号についての情報を用いて、さらに細かく分割された領域を形成する。 The second area division processing unit 3 includes a transform coding information determination unit 3-1, and forms a further finely divided area by using information about the first division result and the luminance signal.

変換符号化情報判定部３−１は、輝度信号についての変換符号化情報のＤＣ成分の差を調べ、第１の分割結果に対してこのＤＣ成分の差が閾値以下のブロックを分離することで、さらに細かく分割された領域を形成する。MPEG方式では変換符号化情報がブロック（８×８画素）単位の情報であるので、これによりブロック単位の精度で分割された領域が形成される。 The transform coding information determination unit 3-1 examines the difference in DC component of the transform coding information for the luminance signal, and separates blocks whose DC component difference is equal to or less than a threshold value from the first division result. Then, a further finely divided region is formed. In the MPEG system, the transform coding information is information in units of blocks (8 × 8 pixels), so that an area divided with block unit accuracy is formed.

さらに、輝度信号についての変換符号化情報のＡＣ成分からブロックごとのエッジ方向を推定し、推定されたエッジ方向を基にブロック単位以上に細かく分割された領域を形成する。 Further, the edge direction for each block is estimated from the AC component of the transform coding information for the luminance signal, and a region finely divided into block units or more is formed based on the estimated edge direction.

第２の領域分割処理部３による分割結果は、本装置の第２の分割結果として出力される。また、さらに細かい分割結果が要求される場合、第２の分割結果は第３の領域分割処理部４に与えられる。 The division result by the second area division processing unit 3 is output as the second division result of the present apparatus. Further, when a finer division result is required, the second division result is given to the third area division processing unit 4.

以下に、第２の領域分割処理部３におけるエッジ方向の推定とそれに基づく領域分割について具体例を上げて説明する。ブロックにおけるエッジ方向を推定するために、まず、下記（１），（２）式により８×８ＤＣＴ係数（変換符号化情報）のブロックの０行目のＡＣ成分の絶対値和Ｈと０列目のＡＣ成分の絶対値和Ｖを求め、閾値と比較する。 Hereinafter, the estimation of the edge direction in the second area division processing unit 3 and the area division based thereon will be described with specific examples. In order to estimate the edge direction in the block, first, the absolute value sum H of the AC component of the 0th row of the block and the 0th column of the 8 × 8 DCT coefficient (transform coding information) according to the following equations (1) and (2): The absolute value sum V of the AC components is obtained and compared with a threshold value.

ここで、ｄ_ｊ，ｉは、ｊ行ｉ列のＤＣＴ係数を表す。 Here, d _{j, i} represents a DCT coefficient of j rows and i columns.

閾値との比較の結果、０行目のＡＣ成分の絶対値和Ｈが閾値以下、かつ０列目のＡＣ成分の絶対値和Ｖも閾値以下のときは、当該ブロック部分にエッジは存在しないと推定する。 As a result of the comparison with the threshold value, when the absolute value sum H of the AC component in the 0th row is equal to or smaller than the threshold value and the absolute value sum V of the AC component in the 0th column is also equal to or smaller than the threshold value, there is no edge in the block portion. presume.

また、０行目のＡＣ成分の絶対値和Ｈが閾値以上、かつ０列目のＡＣ成分の絶対値和Ｖが閾値以下のときは、当該ブロック部分に水平エッジが存在すると推定し、０行目のＡＣ成分の絶対値和Ｈが閾値以下、かつ０列目のＡＣ成分の絶対値和Ｖが閾値以上のときは、当該ブロック部分に垂直エッジが存在すると推定する。 When the absolute value sum H of the AC components in the 0th row is equal to or greater than the threshold value and the absolute value sum V of the AC components in the 0th column is equal to or less than the threshold value, it is estimated that a horizontal edge exists in the block portion. When the absolute value sum H of the AC components of the eyes is equal to or smaller than the threshold and the absolute value sum V of the AC components in the 0th column is equal to or larger than the threshold, it is estimated that a vertical edge exists in the block portion.

０行目のＡＣ成分の絶対値和Ｈが閾値以上、かつ０列目のＡＣ成分の絶対値和Ｖも閾値以上のときは、低域要素の符号（正負）を調べるとともに絶対値Ｈ，Ｖ同士の大小を比較してエッジ方向を推定する。 When the absolute value sum H of the AC component in the 0th row is equal to or greater than the threshold and the absolute value sum V of the AC component in the 0th column is also equal to or greater than the threshold, the sign (positive / negative) of the low frequency element is checked and the absolute values H and V The edge direction is estimated by comparing the size of each other.

図２は、右上がり階段状映像のレベルおよびに対するＤＣＴ係数の例を示し、同図（ａ）は右下部分が黒で、左上部分が白の映像の場合であり、同図（ｂ）は、右下部分が白で、左上部分が黒の映像の場合である。 FIG. 2 shows an example of the DCT coefficient with respect to the level of the stepped video image that rises to the right. FIG. 2A shows a case where the lower right portion is black and the upper left portion is white, and FIG. This is a case where the lower right portion is white and the upper left portion is black.

また、図３は、右下がり階段状映像のレベルおよびに対するＤＣＴ係数の例を示し、同図（ａ）は左下部分が黒で、右上部分が白の映像の場合であり、同図（ｂ）は、左下部分が白で、右上部分が黒の映像の場合である。 FIG. 3 shows an example of the DCT coefficient with respect to the level of the step-down video image. FIG. 3A shows a case where the lower left portion is black and the upper right portion is white, and FIG. Is the case where the lower left part is white and the upper right part is black.

これらの図から明らかなように、右上がり方向のエッジを持つ場合にはＤＣＴ係数の低域要素（0,1）と（1,0）の正負が一致し、右下がり方向のエッジを持つ場合にはＤＣＴ係数の低域要素（0,1）と（1,0）の正負は一致しない。また、０行目のＡＣ成分の絶対値和Ｈと０列目のＡＣ成分の絶対値和Ｖとの大小関係（あるいはその比（Ｈ／Ｖ））によって右上がり、あるいは右下がりの角度が４５度より大きいか小さいかが推定できる。 As is clear from these figures, when there is an edge in the upward direction to the right, the low-frequency elements (0,1) and (1,0) of the DCT coefficient match and the edge in the downward direction is to the right Does not match the sign of the low frequency elements (0,1) and (1,0) of the DCT coefficient. In addition, depending on the magnitude relationship (or the ratio (H / V)) between the absolute value sum H of the AC components in the 0th row and the absolute value sum V of the AC components in the 0th column, the angle of rising to the right or falling to the right is 45. It can be estimated whether it is larger or smaller.

以上のことを踏まえると、図２のように、低域要素（0,1）と（1,0）の正負が一致し、０行目のＡＣ成分の絶対値和Ｈが０列目のＡＣ成分の絶対値和Ｖより大きい（比（Ｈ／Ｖ）が１より大きい）とき、エッジ方向は０〜４５度の範囲内にあると推定できる。 Based on the above, as shown in FIG. 2, the low-frequency elements (0,1) and (1,0) have the same sign, and the absolute value sum H of the AC components in the 0th row is the AC in the 0th column. When the sum of absolute values of components is larger than V (ratio (H / V) is greater than 1), it can be estimated that the edge direction is in the range of 0 to 45 degrees.

また、低域要素（0,1）と（1,0）の正負が一致し、０行目のＡＣ成分の絶対値和Ｈが０列目のＡＣ成分の絶対値和Ｖより小さい（比（Ｈ／Ｖ）が１より小さい）とき、エッジ方向は４５〜９０度の範囲内にあると推定できる。 Further, the sign of the low frequency element (0,1) and (1,0) match, and the absolute value sum H of the AC component in the 0th row is smaller than the absolute value sum V of the AC component in the 0th column (ratio ( When H / V) is less than 1, it can be estimated that the edge direction is in the range of 45 to 90 degrees.

また、図３のように、低域要素（0,1）と（1,0）の正負が一致せず、０行目のＡＣ成分の絶対値和Ｈが０列目のＡＣ成分の絶対値和Ｖより大きい（比（Ｈ／Ｖ）が１より大きい）とき、エッジ方向は１３５〜１８０度の範囲内にあると推定できる。 Also, as shown in FIG. 3, the low frequency elements (0,1) and (1,0) do not match, and the absolute value sum H of the AC components in the 0th row is the absolute value of the AC components in the 0th column. When the sum is greater than V (ratio (H / V) is greater than 1), it can be estimated that the edge direction is in the range of 135 to 180 degrees.

また、低域要素（0,1）と（1,0）の正負が一致せず、０行目のＡＣ成分の絶対値和Ｈが０列目のＡＣ成分の絶対値和Ｖより小さい（比（Ｈ／Ｖ）が１より小さい）とき、エッジ方向は９０〜１３５度の範囲内にあると推定できる。 Further, the sign of the low frequency elements (0,1) and (1,0) do not match, and the absolute value sum H of the AC component in the 0th row is smaller than the absolute value sum V of the AC component in the 0th column (ratio) When (H / V) is smaller than 1, it can be estimated that the edge direction is in the range of 90 to 135 degrees.

以上のようにして推定されたエッジ方向で、これまでに得られた領域分割結果を滑らかに連結し直すことにより、符号化情報単位（ブロック）以上の高解像度で分割された領域を形成することができる。 By smoothly reconnecting the region division results obtained so far with the edge direction estimated as described above, a region divided at a higher resolution than the encoded information unit (block) is formed. Can do.

第３の領域分割処理部４は、輝度信号および色差信号エッジ検出部４−１で構成され、第２の分割結果および画素情報を用いて、より細かく画素単位で分割された領域を形成する。 The third region division processing unit 4 includes a luminance signal and chrominance signal edge detection unit 4-1, and forms a region divided more finely in units of pixels using the second division result and pixel information.

輝度信号エッジ検出では、例えばCannyエッジ検出器を用いて輝度信号のエッジを検出し、色差信号エッジ検出では、色差信号を輝度信号と同じサイズにスケーリングした後、Cannyエッジ検出器を用いて色差信号のエッジを検出する。これらのエッジ検出結果を第２の分割結果に重ね合わせて最適化を図ることにより、さらに細かく画素単位で分割された領域を形成することができる。第３の領域分割処理部４による分割結果は、本装置の第３の分割結果として出力される。 In luminance signal edge detection, for example, the edge of the luminance signal is detected using a Canny edge detector, and in color difference signal edge detection, the color difference signal is scaled to the same size as the luminance signal, and then the color difference signal is detected using the Canny edge detector. Detect edges of By superimposing these edge detection results on the second division result for optimization, it is possible to form a more finely divided region. The division result by the third area division processing unit 4 is output as the third division result of the present apparatus.

上記実施形態では、第１〜第３の領域分割処理部２〜４を階層的に構成し、各領域分割処理部の分割結果をそれぞれ出力させるので、必要な分解能での分割結果を適宜選択して得ることができる。なお、第３の領域分割処理部４は、画素レベルまで復号された情報を必要とするが、第３の領域分割処理部４による処理は必要時のみ行えばよいので、装置全体としての処理負担は軽くすることができる。また、第３の領域分割処理部４による分割結果までは要求されない場合には第３の領域分割処理部４は不要である。 In the above embodiment, since the first to third region division processing units 2 to 4 are hierarchically configured and the division results of each region division processing unit are output, respectively, the division result with the necessary resolution is appropriately selected. Can be obtained. Note that the third region division processing unit 4 needs information decoded to the pixel level, but the processing by the third region division processing unit 4 may be performed only when necessary, so that the processing burden on the entire apparatus is increased. Can be lightened. In addition, the third region division processing unit 4 is not required when the division result by the third region division processing unit 4 is not required.

以上の説明から明らかなように、本発明によれば、映像の圧縮符号化データを部分的に復号することにより得られる情報を用いて分割された領域を形成するので、処理コストを大幅に低減させることができる。また、複数の領域分割処理部を階層的に構成して段階的に細かく分割された領域を形成し、各段階の分割結果を得ることができるようにしているので、処理速度や分解能を勘案して適当な分解能での分割結果を適宜選択して出力させることができる。 As is apparent from the above description, according to the present invention, a divided area is formed using information obtained by partially decoding compressed and encoded data of a video, thereby greatly reducing processing costs. Can be made. In addition, a plurality of area division processing units are hierarchically formed to form areas that are finely divided in stages, so that the division results at each stage can be obtained, so the processing speed and resolution are taken into account. Thus, the division result with an appropriate resolution can be appropriately selected and output.

本発明に係る映像の領域分割装置の一実施形態を示すブロック図である。It is a block diagram which shows one Embodiment of the area | region dividing device of the image | video which concerns on this invention. エッジ方向（右上がり階段状映像の例）の推定処理の説明図である。It is explanatory drawing of the estimation process of an edge direction (example of a step-up image to the right). エッジ方向（右下がり階段状映像の例）の推定処理の説明図である。It is explanatory drawing of the estimation process of an edge direction (an example of a step-down image to the right).

Explanation of symbols

１・・・部分復号部、２・・・第１の領域分割処理部、２−１，３−１・・・変換符号化情報判定部、２−２・・・符号化モード情報判定部、２−３・・・動き予測情報判定部、３・・・第２の領域分割処理部、４・・・第３の領域分割処理部、４−１・・・エッジ検出部 DESCRIPTION OF SYMBOLS 1 ... Partial decoding part, 2 ... 1st area | region division process part, 2-1, 3-1 ... Conversion encoding information determination part, 2-2 ... Coding mode information determination part, 2-3 ... motion prediction information determination unit, 3 ... second region division processing unit, 4 ... third region division processing unit, 4-1 ... edge detection unit

Claims

In an image area dividing device for dividing a compression-coded image into a plurality of areas,
Partial decoding means for partially decoding the compressed encoded data;
A plurality of area division processing means for forming areas divided at different resolutions, including at least one area division processing means for forming an area divided by using information partially decoded by the partial decoding means; An image area dividing device comprising: an image area dividing device;

2. The video according to claim 1, wherein the plurality of area division processing units are hierarchized from a low one to a high resolution of the area division, and sequentially form divided areas according to the hierarchy. Area dividing device.

3. The video area dividing device according to claim 2, wherein each of the plurality of area dividing processing means forms a divided area by using a division result of an upper layer area dividing processing means.

3. The video area dividing device according to claim 2, wherein the plurality of area dividing processing means includes area dividing processing means hierarchized according to an encoding information allocation unit.

5. The video area according to claim 4, wherein the plurality of area division processing means includes area division processing means for forming a divided area by using encoding information for each encoding information allocation unit. Splitting device.

The plurality of area division processing means includes low resolution area division processing means for forming a divided area using transform coding information, coding mode information, and motion prediction information. The video image segmentation apparatus described.

The low resolution region division processing unit determines whether or not the difference of the color signal DC component between the blocks is less than a threshold value from the transform coding information about the color difference signal, and integrates the blocks where the difference is determined to be less than the threshold value. 7. The image area dividing apparatus according to claim 6, further comprising means for forming a divided area.

The low resolution region dividing processing means extracts the mode encoding mode information within a predetermined time from the encoding mode information for each position, and uses the identity of the mode encoding mode to divide the divided region. 8. The image area dividing device according to claim 6, further comprising means for forming the image.

8. The low-resolution area dividing processing means includes means for forming a divided area using similarity in direction and length of motion vectors obtained from motion prediction information. The image segmentation device described in 1.

6. The video area dividing device according to claim 5, wherein the plurality of area dividing processing means includes area dividing processing means for forming an area divided by using transform coding information.

An area division processing unit that forms an area divided using the transform coding information determines whether or not the difference in luminance signal DC component between blocks is equal to or less than a threshold from the transform coding information about the luminance signal. 11. The video area dividing device according to claim 10, further comprising means for forming a divided area by integrating blocks determined to be equal to or less than a threshold value.

The area division processing means for forming a divided area using the transform coding information has edge estimation means for estimating the presence and direction of the edge of the luminance signal from the AC component of the transform coding information for the luminance signal. 12. The image region dividing apparatus according to claim 10, wherein divided regions are formed using the estimation result.

The edge estimation means estimates the presence / absence and direction of an edge of a luminance signal using the sum of absolute values of AC components in the horizontal direction and vertical direction of transform coding information for the luminance signal. 12. The image area dividing device according to 12.

The edge estimation means compares the AC component partial absolute value sum with a threshold value and uses the magnitude relationship between the AC component partial absolute value sums in the horizontal direction and the vertical direction or the ratio of the two to determine whether or not there is an edge of the luminance signal. The image region dividing device according to claim 13, wherein the image region dividing device is estimated.

13. The video area dividing device according to claim 12, wherein the area dividing processing means for forming an area divided using the transform coding information performs area division with a resolution equal to or higher than a block unit.

2. The video area dividing device according to claim 1, wherein the plurality of area dividing processing means include area dividing processing means for forming a divided area using pixel information of a luminance signal and a color difference signal.

An area division processing unit for forming an area divided using pixel information of the luminance signal and the color difference signal individually detects edges of the luminance signal and the color difference signal, and uses the detected edges to divide the area The image region dividing device according to claim 16, wherein: