JP2002223440A

JP2002223440A - Shape coder

Info

Publication number: JP2002223440A
Application number: JP2001017607A
Authority: JP
Inventors: Kazuhisa Iguchi; 和久井口; Osamu Mizuno; 修水野; Yutaka Kaneko; 金子　　豊; Yoshiaki Shishikui; 善明鹿喰
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2001-01-25
Filing date: 2001-01-25
Publication date: 2002-08-09

Abstract

PROBLEM TO BE SOLVED: To provide a shape coder that can code shape information with a small amount of codes without causing overlapping or gap among objects in the case of combining decoded objects to configure an image. SOLUTION: This shape coder 10, that divides an image in the unit of a prescribed image area and codes the shape information of image division areas, is provided with a shape noise elimination filter section 20 that eliminates as very small change included in the shape information and a shape coding section 30 that applies reverse coding to the shape information from which the very small change is eliminated.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、形状符号化装置に
係り、特に、画像の一部を構成する画像領域の形状情報
を符号化する形状符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a shape coding apparatus, and more particularly to a shape coding apparatus for coding shape information of an image area forming a part of an image.

【０００２】[0002]

【従来の技術】例えばＭＰＥＧ−４映像符号化規格（IS
O/IEC 14496-2:“Information technology-Generic cod
ing of audio-visual objects-Part2:Visual”）に代表
されるオブジェクトベース符号化方式は、フレーム又は
フィールド単位でなく、画像を前景や背景などの画像領
域（以下、「オブジェクト」という）に分割し、分割し
たオブジェクト単位に個別符号化するものである。2. Description of the Related Art For example, the MPEG-4 video coding standard (IS
O / IEC 14496-2: “Information technology-Generic cod
ing of audio-visual objects-Part2: Visual ”), the image is divided into image areas such as foreground and background (hereinafter referred to as“ objects ”) instead of frame or field units. , Is individually encoded in divided object units.

【０００３】図６は、画像をオブジェクト単位に分割す
る処理の一例について説明する図を示す。図６中、画像
１０１は人間，自動車，木，背景を表すオブジェクトか
ら構成されている。したがって、この画像１０１は、人
間のオブジェクト１０２，自動車のオブジェクト１０
３，背景のオブジェクト１０４，木のオブジェクト１０
５に分割される。FIG. 6 is a diagram for explaining an example of processing for dividing an image into object units. In FIG. 6, an image 101 is composed of objects representing a person, a car, a tree, and a background. Therefore, this image 101 is composed of a human object 102 and a car object 10.
3, background object 104, tree object 10
It is divided into five.

【０００４】例えば図６のように分割された夫々のオブ
ジェクト１０２〜１０５は、オブジェクトの絵柄を表す
テクスチャ画像とオブジェクトの形状を表すオブジェク
ト形状情報とから構成される。オブジェクトベース符号
化方式は、テクスチャ画像を符号化するテクスチャ符号
化とオブジェクト形状情報を符号化する形状符号化とを
行う。For example, each of the objects 102 to 105 divided as shown in FIG. 6 is composed of a texture image representing a picture of the object and object shape information representing the shape of the object. The object-based coding method performs texture coding for coding a texture image and shape coding for coding object shape information.

【０００５】オブジェクト形状情報は、画素がそのオブ
ジェクトに含まれるか否かを示す二値ビットパターン画
像，又はオブジェクト形状の輪郭情報で表される。つま
り、形状符号化を行う形状符号化装置は、二値ビットパ
ターン画像を符号化するものと輪郭情報を符号化するも
のとが含まれる。[0005] The object shape information is represented by a binary bit pattern image indicating whether or not a pixel is included in the object, or contour information of the object shape. That is, the shape encoding device that performs shape encoding includes one that encodes a binary bit pattern image and one that encodes contour information.

【０００６】さらに、形状符号化装置は、符号化前のオ
ブジェクト形状情報と復号化後のオブジェクト形状情報
とが同一となる可逆な符号化を行うものと、符号化前の
オブジェクト形状情報と復号化後のオブジェクト形状情
報とが異なる非可逆な符号化を行うものとが含まれる。[0006] Further, the shape encoding apparatus performs reversible encoding in which the object shape information before encoding and the object shape information after decoding are the same, and performs the object encoding with the object shape information before encoding. This includes irreversible encoding that differs from the subsequent object shape information.

【０００７】具体的に説明すると、二値ビットパターン
画像を符号化する方式としては、フレーム内符号化方式
のＪＢＩＧ規格（ISO/IEC 11544:“Progressive Bi-lev
el Compression”）と、ＭＭＲ（Modified Modified Re
ad）と呼ばれる符号化規格（ITU-T T.6:“Facsimile co
ding schemes and coding control functions for Grou
p 4 facsimile apparatus”）がある。また、二値ビッ
トパターン画像で表されたオブジェクト形状情報を時間
方向の相関を利用して符号化する方式としては、ＭＰＥ
Ｇ−４の形状符号化方式がある。More specifically, as a method for encoding a binary bit pattern image, the JBIG standard of the intra-frame encoding method (ISO / IEC 11544: “Progressive Bi-lev
el Compression ”) and MMR (Modified Modified Re
ad) (ITU-T T.6: “Facsimile co
ding schemes and coding control functions for Grou
p 4 facsimile apparatus ”). As a method of encoding the object shape information represented by the binary bit pattern image using the correlation in the time direction, MPE is used.
There is a G-4 shape coding system.

【０００８】一方、輪郭情報を符号化する方式として
は、フレーム内符号化方式のウェーブレット記述子を用
いた符号化方式（George Muller,etal:“Progressive T
ransmission of Line Drawings Using the Wavelet Tra
nsform”,IEEE Transaction onImage Processing,Vol.5
No.4,pp.666-672,April 1996）がある。また、輪郭情
報で表されたオブジェクト形状情報を時間方向の相関を
利用して符号化する方式としては、符号化方式（浅見和
弘他：“ウェーブレット記述子を用いた動輪郭画像の符
号化”,PCSJ97,P-5.13,pp.113-114,1997）等がある。On the other hand, as a method for encoding contour information, an encoding method using a wavelet descriptor of an intra-frame encoding method (George Muller, et al .: “Progressive T
ransmission of Line Drawings Using the Wavelet Tra
nsform ”, IEEE Transaction on Image Processing, Vol.5
No.4, pp.666-672, April 1996). In addition, as a method of encoding the object shape information represented by the contour information using the correlation in the time direction, there are encoding methods (Kazuhiro Asami et al .: “Coding of moving contour image using wavelet descriptor”, PCSJ97, P-5.13, pp.113-114, 1997).

【０００９】上記の符号化方式は可逆な符号化を行うも
のであり、以下、非可逆な符号化を行う符号化方式につ
いて説明する。非可逆な符号化を行う符号化方式として
は、Ｍｙｒｏｎ他によるスプライン関数を用いた輪郭情
報の符号化方式（Myron Flickner,etal:“Periodic Qua
si-Orthogonal Spline Bases and Applications to Lea
st-Squares Curve Fitting of Digital Images”,IEEE
Transaction on ImageProcessing,vol.5 No.1,pp.71-8
8,Jun.1996）がある。この符号化方式は、輪郭情報を近
似し、非可逆な符号化を行うものである。The above-mentioned encoding system performs lossless encoding. Hereinafter, an encoding system that performs irreversible encoding will be described. As an encoding method for performing irreversible encoding, an encoding method for contour information using a spline function by Myron et al. (Myron Flickner, et al .: “Periodic Qua
si-Orthogonal Spline Bases and Applications to Lea
st-Squares Curve Fitting of Digital Images ”, IEEE
Transaction on ImageProcessing, vol.5 No.1, pp.71-8
8, Jun. 1996). This encoding method approximates contour information and performs irreversible encoding.

【００１０】また、ＭＰＥＧ−４の形状符号化方式に
は、非可逆な符号化を行うモードもある。ＭＰＥＧ−４
の形状符号化方式は、オブジェクト形状情報を１／２，
１／４等に縮小し、その縮小したオブジェクト形状情報
で可逆な符号化を行うものである。In the MPEG-4 shape encoding method, there is a mode for performing irreversible encoding. MPEG-4
Is the object shape information of 1/2,
The image data is reduced to 1/4 or the like, and lossless encoding is performed using the reduced object shape information.

【００１１】[0011]

【発明が解決しようとする課題】しかしながら、非可逆
な形状符号化の場合、符号化前と復号化後とでオブジェ
クト形状情報が異なることになる。つまり、画像をオブ
ジェクトに分割して非可逆な形状符号化を行った場合、
符号化前のオブジェクトの形状と復号化後のオブジェク
トの形状とが異なることになる。したがって、復号化し
たオブジェクトを組み合わせて画像を構成するときに、
オブジェクト間に重なり，隙間が生じるという問題があ
った。However, in the case of irreversible shape coding, object shape information differs between before coding and after decoding. In other words, when an image is divided into objects and irreversible shape encoding is performed,
The shape of the object before encoding is different from the shape of the object after decoding. Therefore, when composing the image by combining the decrypted objects,
There was a problem that a gap was created between objects.

【００１２】また、可逆な形状符号化の場合、符号化前
と復号化後とでオブジェクト形状情報が同一であり、符
号化前のオブジェクトの形状と復号化後のオブジェクト
の形状とが同一である。したがって、復号化したオブジ
ェクトを組み合わせて画像を構成するときに、オブジェ
クト間に重なり，隙間が生じるという問題はない。しか
し、可逆な形状符号化は、非可逆な形状符号化と比較す
ると符号化後の情報量が多くなるという問題があった。In the case of lossless shape coding, object shape information is the same before coding and after decoding, and the shape of the object before coding and the shape of the object after decoding are the same. . Therefore, when an image is formed by combining the decoded objects, there is no problem that the objects overlap and a gap is generated. However, reversible shape coding has a problem that the amount of information after coding is larger than irreversible shape coding.

【００１３】その為、符号化後の情報量が少なく、復号
化したオブジェクトを組み合わせて画像を構成するとき
にオブジェクト間に重なり，隙間が生じないような形状
符号化が望まれていた。[0013] Therefore, it has been desired to perform shape coding in which the amount of information after coding is small, and when the decoded objects are combined to form an image, the objects do not overlap with each other and no gap occurs.

【００１４】本発明は、上記の点に鑑みなされたもの
で、復号化したオブジェクトを組み合わせて画面を構成
するときにオブジェクト間に重なりや隙間が生じず、少
ない符号量で符号化することが可能な形状符号化装置を
提供することを目的とする。[0014] The present invention has been made in view of the above points, and when composing a screen by combining decoded objects, there is no overlap or gap between objects, and encoding can be performed with a small code amount. It is an object to provide a simple shape encoding device.

【００１５】[0015]

【課題を解決するための手段】そこで、上記課題を解決
するため、本発明は、画像を所定の画像領域単位に分割
し、分割した画像領域の形状情報を符号化する形状符号
化装置において、前記形状情報に含まれる微細な変化を
除去する形状ノイズ除去フィルタ部と、前記微細な変化
が除去された形状情報を可逆符号化する形状符号化部と
を有することを特徴とする。In order to solve the above-mentioned problems, the present invention provides a shape encoding apparatus for dividing an image into predetermined image area units and encoding shape information of the divided image areas. It is characterized by having a shape noise removal filter unit for removing a minute change included in the shape information, and a shape encoding unit for reversibly encoding the shape information from which the minute change has been removed.

【００１６】このような形状符号化装置では、形状情報
を可逆符号化する前に、形状情報に含まれる微細な変化
が形状ノイズ除去フィルタで除去される。つまり、可逆
符号化する前に形状情報に含まれる微細な変化を除去す
ることができるので、可逆符号化における情報量を削減
することが可能である。したがって、本発明の形状符号
化装置では、形状符号化を少ない符号量で行うことがで
き、復号したオブジェクト間に重なり，隙間が生じるこ
ともない。In such a shape coding apparatus, before lossless coding of the shape information, fine changes included in the shape information are removed by a shape noise removal filter. That is, since a minute change included in the shape information can be removed before the lossless encoding, the amount of information in the lossless encoding can be reduced. Therefore, with the shape encoding device of the present invention, shape encoding can be performed with a small amount of code, and there is no overlap or gap between decoded objects.

【００１７】また、本発明は、前記形状ノイズ除去フィ
ルタ部が、動き補正処理を伴う時空間フィルタ部を有す
ることを特徴とする。Further, the present invention is characterized in that the shape noise elimination filter section has a spatiotemporal filter section accompanied by a motion correction process.

【００１８】このような形状符号化装置では、動き補正
処理を伴う時空間フィルタ部を用いて形状情報に含まれ
る微細な変化を除去することができる。In such a shape encoding apparatus, a minute change included in the shape information can be removed by using a spatio-temporal filter unit including a motion correction process.

【００１９】また、本発明は、前記形状ノイズ除去フィ
ルタ部が、同一の形状情報に含まれる画素に同一のラベ
ル情報を付加したラベル付き画像情報を生成するラベル
付き画像化部と、前記ラベル付き画像情報に時空間多数
決フィルタ処理を施す時空間多数決フィルタ部と、前記
時空間多数決フィルタ処理を施されたラベル付き画像情
報を形状情報に変換する形状情報化部とを有することを
特徴とする。Further, according to the present invention, the shape noise removal filter unit generates a labeled image information in which the same label information is added to the pixels included in the same shape information, It has a spatio-temporal majority filter unit that performs spatio-temporal majority filter processing on image information, and a shape information conversion unit that converts labeled image information that has been subjected to the spatio-temporal majority filter process into shape information.

【００２０】このような形状符号化装置では、時空間多
数決フィルタ処理を用いて形状情報に含まれる微細な変
化を除去することができる。In such a shape encoding device, fine changes included in shape information can be removed by using a spatio-temporal majority filtering process.

【００２１】また、本発明は、前記形状ノイズ除去フィ
ルタ部での動き補正処理に用いる動きベクトルと、前記
形状符号化部で可逆符号化に用いる動きベクトルとを同
一とすることを特徴とする。Further, the present invention is characterized in that the motion vector used for the motion correction processing in the shape noise removal filter unit and the motion vector used for the lossless coding in the shape encoding unit are the same.

【００２２】このような形状符号化装置では、形状ノイ
ズ除去フィルタ部での動き補正処理に用いる動きベクト
ルと、形状符号化部で可逆符号化に用いる動きベクトル
とを同一にすることで、形状ノイズ除去フィルタ部での
形状ノイズ除去効果を高めることができる。In such a shape encoding apparatus, the motion vector used for the motion correction processing in the shape noise elimination filter unit is the same as the motion vector used for the lossless encoding in the shape encoding unit, so that the shape noise is reduced. The shape noise removal effect in the removal filter unit can be enhanced.

【００２３】[0023]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面に基づいて説明する。まず、本発明の理解を容易
とする為に、形状符号化において符号量が増加する原因
について説明する。Next, embodiments of the present invention will be described with reference to the drawings. First, in order to facilitate understanding of the present invention, the cause of an increase in the code amount in shape coding will be described.

【００２４】形状符号化において情報量が増加する原因
は、静止画像の場合、オブジェクト形状の外縁部に存在
する空間的に微細な凹凸の存在である。図１は、オブジ
ェクト形状の外縁部に存在する凹凸の一例について説明
する図を示す。図１中、オブジェクト１〜３は画像から
分割されたものである。オブジェクト１〜３は、その形
状の外縁部に空間的に微細な凹凸を有している。このオ
ブジェクト形状の外縁部に存在する空間的に微細な凹凸
を形状符号化すると、形状符号化における情報量が増加
する。The cause of an increase in the amount of information in shape coding is the presence of spatially minute irregularities at the outer edge of the object shape in the case of a still image. FIG. 1 is a diagram illustrating an example of unevenness existing at an outer edge of an object shape. In FIG. 1, objects 1 to 3 are divided from an image. The objects 1 to 3 have spatially fine irregularities at the outer edge of the shape. If the spatially minute unevenness existing at the outer edge of the object shape is shape-coded, the amount of information in shape coding increases.

【００２５】また、動画像の場合、形状符号化において
情報量が増加する原因は、オブジェクト形状の外縁部に
存在する空間的に微細な凹凸の存在に加え、時間的に短
い間隔で属するオブジェクトが変化する画素の存在であ
る。図２は、時間の経過によるオブジェクト形状の変化
の一例について説明する図を示す。図２（ａ）は初期の
オブジェクト形状であり、時間の経過により図２
（ｂ），図２（ｃ）のオブジェクト形状に順次変化して
いる。Further, in the case of a moving image, the amount of information increases in the shape encoding due to the presence of spatially minute irregularities existing at the outer edge of the object shape and the presence of objects belonging to the object at short time intervals. The presence of changing pixels. FIG. 2 is a diagram illustrating an example of a change in an object shape over time. FIG. 2A shows an initial object shape, and FIG.
2 (b), and sequentially changes to the object shape of FIG. 2 (c).

【００２６】例えば図２（ａ）におけるオブジェクト
４，５と，図２（ｂ）におけるオブジェクト６，７と，
図２（ｃ）におけるオブジェクト８，９とは、そのオブ
ジェクト形状が異なってる。つまり、図２（ａ）〜図２
（ｃ）のオブジェクト形状は、時間の経過によりオブジ
ェクト形状が変化している。このような時間の経過によ
るオブジェクト形状の変化が短時間のうちに繰り返され
ると、形状符号化における情報量が増加する。For example, objects 4 and 5 in FIG. 2A, objects 6 and 7 in FIG.
The objects 8 and 9 in FIG. 2C have different object shapes. That is, FIGS.
In the object shape of (c), the object shape changes over time. If the change of the object shape due to the lapse of time is repeated within a short time, the amount of information in shape coding increases.

【００２７】つまり、形状符号化における情報量を削減
する為には、オブジェクト形状に存在する空間的或いは
時空間的に微細な変化を取り除けばよい。この為、本発
明では、形状符号化の前処理としてオブジェクト形状の
空間的或いは時空間的に微細な変化をフィルタリングす
る。具体的には、形状符号化を行う形状符号化部の前段
にオブジェクト形状の空間的或いは時空間的に微細な変
化を除去する形状ノイズ除去フィルタを設ける。That is, in order to reduce the amount of information in shape coding, it is only necessary to remove spatially or spatiotemporally minute changes in the object shape. For this reason, according to the present invention, as a pre-process of shape coding, a spatial or spatio-temporal minute change in the object shape is filtered. Specifically, a shape noise elimination filter for removing a spatially or spatiotemporally minute change in the object shape is provided at a stage preceding the shape encoding unit for performing the shape encoding.

【００２８】以下、図３を参照しつつ本発明の形状符号
化装置の構成について説明する。図３は、本発明の形状
符号化装置の一実施例の構成図を示す。図３中、形状符
号化装置１０は、形状ノイズ除去フィルタ部２０と可逆
形状符号化部３０とを含むように構成される。Hereinafter, the configuration of the shape encoding apparatus according to the present invention will be described with reference to FIG. FIG. 3 shows a configuration diagram of an embodiment of the shape encoding apparatus according to the present invention. 3, the shape encoding device 10 is configured to include a shape noise removal filter unit 20 and a lossless shape encoding unit 30.

【００２９】本実施例では、形状ノイズ除去フィルタの
一例として３フレームの時空間多数決フィルタを用いた
例について説明するが、時空間多数決フィルタに限定さ
れるものでなく任意のノイズ除去フィルタで実現可能で
ある。３フレームの時空間多数決フィルタを用いる場
合、形状ノイズ除去フィルタ部２０は、ラベル付き画像
化部２１，フレームメモリ２２，フレームメモリ２３，
動きベクトル検出部２４，時空間多数決フィルタ部２
５，形状情報化部２６を含むように構成される。In this embodiment, an example in which a three-frame spatiotemporal majority filter is used as an example of a shape noise elimination filter will be described. It is. When a three-frame spatio-temporal majority filter is used, the shape noise removal filter unit 20 includes a labeled imaging unit 21, a frame memory 22, a frame memory 23,
Motion vector detector 24, spatiotemporal majority filter 2
5, It is configured to include the shape information generating unit 26.

【００３０】まず、オブジェクト形状情報が形状ノイズ
除去フィルタ部２０のラベル付き画像化部２１に供給さ
れる。ラベル付き画像化部２１は供給されたオブジェク
ト形状情報をラベル付き画像に変換する。ラベル付き画
像とは、同一オブジェクトに属する画素が同一のラベル
を持ち、異なるオブジェクトに属する画素が異なるラベ
ルを持つ画像である。ラベル付き画像化部２１は変換し
たラベル付き画像をフレームメモリ２２，動きベクトル
検出部２４，時空間多数決フィルタ部２５に供給する。First, the object shape information is supplied to the labeled imaging unit 21 of the shape noise elimination filter unit 20. The labeled imaging unit 21 converts the supplied object shape information into a labeled image. A labeled image is an image in which pixels belonging to the same object have the same label and pixels belonging to different objects have different labels. The labeled imaging unit 21 supplies the converted labeled image to the frame memory 22, the motion vector detection unit 24, and the spatiotemporal majority filter unit 25.

【００３１】フレームメモリ２２は供給されたラベル付
き画像を格納しておき、所定のタイミングでラベル付き
画像をフレームメモリ２３，動きベクトル検出部２４，
時空間多数決フィルタ部２５に供給する。また、フレー
ムメモリ２３は供給されたラベル付き画像を格納してお
き、所定のタイミングでラベル付き画像を動きベクトル
検出部２４，時空間多数決フィルタ部２５に供給する。
つまり、ラベル付き画像化部２１から出力されるラベル
付き画像は、フレームメモリ２２，フレームメモリ２３
に順次シフトされる。The frame memory 22 stores the supplied labeled image and stores the labeled image at a predetermined timing in the frame memory 23, the motion vector detecting section 24,
It is supplied to the spatiotemporal majority filter unit 25. The frame memory 23 stores the supplied labeled image, and supplies the labeled image to the motion vector detecting unit 24 and the spatiotemporal majority filter unit 25 at a predetermined timing.
In other words, the labeled image output from the labeled imaging unit 21 is stored in the frame memory 22 and the frame memory 23.
Are sequentially shifted.

【００３２】動きベクトル検出部２４は、ラベル付き符
号化部２１，フレームメモリ２２，フレームメモリ２３
から３フレームのラベル付き画像が供給される。つま
り、動きベクトル検出部２４は、時間の異なる３フレー
ムのラベル付き画像が供給される。なお、フレームメモ
リ２３から供給されるラベル付き画像が最も古く、ラベ
ル付き画像化部２１から供給されるラベル付き画像が最
も新しい。The motion vector detecting section 24 includes a labeled encoding section 21, a frame memory 22, and a frame memory 23.
Supplies three frames of labeled images. That is, the motion vector detecting unit 24 is supplied with labeled images of three frames at different times. The labeled image supplied from the frame memory 23 is the oldest, and the labeled image supplied from the labeled imaging unit 21 is the newest.

【００３３】動きベクトル検出部２４は３フレームのラ
ベル付き画像について動きベクトル検出を行い、検出し
た動きベクトルを時空間多数決フィルタ部２５，可逆形
状符号化部３０に供給する。The motion vector detection unit 24 performs motion vector detection on the labeled image of three frames, and supplies the detected motion vector to the spatiotemporal majority filter unit 25 and the lossless shape encoding unit 30.

【００３４】時空間多数決フィルタ部２５は、ラベル付
き符号化部２１，フレームメモリ２２，フレームメモリ
２３から３フレームのラベル付き画像が供給される。な
お、説明の便宜上、フレームメモリ２３から供給される
１フレームのラベル付き画像を前フレーム，フレームメ
モリ２２から供給される１フレームのラベル付き画像を
中央フレーム，ラベル付き画像化部２１から供給される
１フレームのラベル付き画像を後フレームと呼ぶ。The spatiotemporal majority decision filter unit 25 is supplied with three frames of labeled images from the labeled encoding unit 21, the frame memory 22, and the frame memory 23. For convenience of description, one frame of labeled image supplied from the frame memory 23 is supplied from the previous frame, and one frame of labeled image supplied from the frame memory 22 is supplied from the central frame and labeled image forming unit 21. One frame of the labeled image is called a subsequent frame.

【００３５】時空間多数決フィルタ部２５は、動きベク
トル検出部２４から供給される動きベクトルを用いてウ
インドウを設定し、中央フレームの各画素について時空
間多数決フィルタ処理を施し、その時空間多数決フィル
タ処理を施した中央フレーム，言い換えればラベル付き
画像を形状情報化部２６に供給する。The spatio-temporal majority filter unit 25 sets a window using the motion vector supplied from the motion vector detection unit 24, performs spatio-temporal majority filter processing on each pixel of the central frame, and performs the spatio-temporal majority filter processing. The applied center frame, in other words, the labeled image is supplied to the shape information generating unit 26.

【００３６】ここで、時空間多数決フィルタ部の処理に
ついて図４，図５を参照しつつ説明する。図４は、ある
画素に対する時空間多数決フィルタ部の処理の一例につ
いて説明する図を示す。例えばウインドウ４１，４２，
４３は、前フレーム上，中央フレーム上，後フレーム上
に構成される。Here, the processing of the spatiotemporal majority filter unit will be described with reference to FIGS. FIG. 4 is a diagram illustrating an example of processing of a spatiotemporal majority filter unit for a certain pixel. For example, windows 41, 42,
Reference numeral 43 is formed on the front frame, the center frame, and the rear frame.

【００３７】着目画素４５は、時空間多数決フィルタ処
理を行う画素である。また、ウインドウ４２は着目画素
４５を中心とした５×５のウインドウである。動きベク
トル４７は動きベクトル検出部２４から供給されたもの
であり、着目画素４５の前フレームでの対応する位置を
表す。画素４４は、動きベクトル４７により着目画素４
５と対応付けられた前フレームにおける画素である。ま
た、ウインドウ４１は画素４４を中心とした５×５のウ
インドウである。The pixel of interest 45 is a pixel for which a spatio-temporal majority filtering process is performed. The window 42 is a 5 × 5 window centered on the pixel 45 of interest. The motion vector 47 is supplied from the motion vector detection unit 24, and indicates a corresponding position of the target pixel 45 in the previous frame. The pixel 44 is a pixel 4 of interest based on the motion vector 47.
5 is a pixel in the previous frame associated with 5. The window 41 is a 5 × 5 window centered on the pixel 44.

【００３８】動きベクトル４８は動きベクトル検出部２
４から供給されたものであり、着目画素４５の後フレー
ムでの対応する位置を表す。画素４６は、動きベクトル
４８により着目画素４５と対応付けられた後フレームに
おける画素である。また、ウインドウ４３は画素４６を
中心とした５×５のウインドウである。The motion vector 48 is the motion vector detector 2
4 and indicates the corresponding position in the subsequent frame of the target pixel 45. The pixel 46 is a pixel in the subsequent frame associated with the pixel of interest 45 by the motion vector 48. The window 43 is a 5 × 5 window centered on the pixel 46.

【００３９】時空間多数決フィルタ部２５は、３つのウ
インドウ４１，４２，４３に含まれる７５画素に割り当
てられたラベルのうち最も数の多いラベルを求め、着目
画素４５のラベルを最も数の多いラベルに置き換える。
図５は、着目画素のラベルを最も数の多いラベルに置き
換える処理の一例について説明する図を示す。The spatiotemporal majority filter unit 25 finds the label with the largest number among the labels assigned to the 75 pixels included in the three windows 41, 42, and 43, and replaces the label of the target pixel 45 with the label with the largest number. Replace with
FIG. 5 is a diagram illustrating an example of a process of replacing the label of the target pixel with the label having the largest number.

【００４０】ウインドウ４１，４２，４３は、それぞれ
２５画素を表している。ウインドウ４１，４２，４３に
マトリックス状に記載されている数字１〜３はラベルを
表しており、同一のラベルを持つ画素が同一オブジェク
トに属する。例えば図５の場合、ウインドウ４１，４
２，４３に含まれるラベル「１」の数が３５個，ラベル
「２」の数が１８個，ラベル「３」の数が２２個であ
る。Each of the windows 41, 42, and 43 represents 25 pixels. Numerals 1 to 3 described in a matrix in the windows 41, 42, and 43 represent labels, and pixels having the same label belong to the same object. For example, in the case of FIG.
The number of labels “1” included in 2, 43 is 35, the number of labels “2” is 18, and the number of labels “3” is 22.

【００４１】したがって、３つのウインドウ４１，４
２，４３に含まれる７５画素に割り当てられたラベルの
うち最も数の多いラベルはラベル「１」であり、着目画
素４５のラベル「２」がラベル「１」に書き換えられ
る。この結果、時空間多数決フィルタ処理をフレームに
順次施すと、オブジェクト形状に存在する空間的或いは
時空間的に微細な変化が取り除かれる。Therefore, the three windows 41, 4
The label with the largest number among the labels assigned to the 75 pixels included in the pixels 2 and 43 is the label “1”, and the label “2” of the target pixel 45 is rewritten to the label “1”. As a result, when the spatio-temporal majority filtering process is sequentially performed on a frame, minute changes in the object shape that are spatially or spatio-temporally small are removed.

【００４２】再び図３に戻り説明を続けると、時空間多
数決フィルタ部２５は時空間多数決フィルタ処理を施し
たラベル付き画像を形状情報化部２６に供給する。形状
情報化部２６は、供給されたラベル付き画像をオブジェ
クト形状情報に変換して可逆形状符号化部３０に出力す
る。Returning to FIG. 3 again, the spatiotemporal majority filter unit 25 supplies the labeled image subjected to the spatiotemporal majority filter processing to the shape information generating unit 26. The shape information conversion unit 26 converts the supplied labeled image into object shape information, and outputs the object shape information to the lossless shape encoding unit 30.

【００４３】可逆形状符号化部３０は、動きベクトル検
出部２４から動きベクトルが供給されると共に、形状情
報化部２６からオブジェクト形状情報が供給される。可
逆形状符号化部３０は、動きベクトルを利用してオブジ
ェクト形状情報の可逆な形状符号化を行い、符号化ビッ
ト列を出力する。The lossless shape encoder 30 receives the motion vector from the motion vector detector 24 and the object shape information from the shape information generator 26. The lossless shape encoding unit 30 performs lossless shape encoding of the object shape information using the motion vector, and outputs an encoded bit sequence.

【００４４】なお、時間方向の相関を利用して符号化を
行う形状符号化と時空間的な形状ノイズ除去フィルタ処
理とが共に動き補正処理を行う場合、両者で同じ動きベ
クトルを用いると時空間的な形状ノイズ除去フィルタ処
理のノイズ除去効果が高くなる。したがって、動きベク
トル検出部２４は動きベクトルを時空間多数決フィルタ
部２５及び可逆形状符号化部３０に供給している。When both the shape encoding for performing the encoding using the correlation in the time direction and the spatio-temporal shape noise removal filter process perform the motion correction process, if both use the same motion vector, The noise removal effect of the typical shape noise removal filter processing increases. Therefore, the motion vector detection unit 24 supplies the motion vector to the spatiotemporal majority filter unit 25 and the lossless shape encoding unit 30.

【００４５】本実施例では、３フレームの時空間多数決
フィルタを用いる例について説明したが、形状ノイズ除
去フィルタ部２０のフレームメモリの数を増減すること
で、より多くのフレームを用いた時空間多数決フィル
タ，１フレーム内の空間多数決フィルタに変更すること
ができる。なお、１フレーム内の空間多数決フィルタ処
理は、各画素を中心にＮ×Ｎのウインドウを設定し、中
心画素のラベルをウインドウ内で最も数の多いラベルに
置き換えるものである。また、時空間多数決フィルタ部
２５を他のラベルを持たない画素や、１つの画素に複数
のラベルが割り当てられることのないノイズ除去フィル
タに置き換えることも可能である。In this embodiment, an example in which a spatio-temporal majority filter of three frames is used has been described. However, by increasing or decreasing the number of frame memories in the shape noise elimination filter section 20, a spatio-temporal majority decision filter using more frames is used. The filter can be changed to a spatial majority filter in one frame. Note that the spatial majority filter processing in one frame sets an N × N window centering on each pixel, and replaces the label of the center pixel with the label with the largest number in the window. It is also possible to replace the spatio-temporal majority filter unit 25 with a pixel having no other label or a noise removal filter in which a plurality of labels are not assigned to one pixel.

【００４６】[0046]

【発明の効果】上述の如く、本発明によれば、形状情報
を可逆な形状符号化を行う前に、形状情報に含まれる微
細な変化が形状ノイズ除去フィルタで除去される。つま
り、形状符号化する前に形状情報に含まれる微細な変化
を除去することができるので、形状符号化における情報
量を削減することが可能である。また、ラベルを持たな
い画素や１つの画素に複数のラベルが割り当てられるこ
とのないノイズ除去フィルタを用いるので、オブジェク
ト間に重なりや隙間が生じない。したがって、本発明の
形状符号化装置では、形状符号化を少ない符号量で行う
ことができ、符号化したオブジェクト間に重なり，隙間
が生じることもない。As described above, according to the present invention, a small change included in the shape information is removed by the shape noise removing filter before the shape information is reversibly shaped. That is, since a minute change included in the shape information can be removed before shape coding, the amount of information in shape coding can be reduced. In addition, since a noise removal filter is used in which a pixel having no label or a plurality of labels is not assigned to one pixel, there is no overlap or gap between objects. Therefore, with the shape encoding device of the present invention, shape encoding can be performed with a small code amount, and there is no overlap or gap between encoded objects.

【００４７】また、本発明によれば、動き補正処理を伴
う時空間フィルタ部を用いて形状情報に含まれる微細な
変化を除去することができる。Further, according to the present invention, it is possible to remove a minute change included in the shape information by using a spatio-temporal filter unit involving a motion correction process.

【００４８】また、本発明によれば、時空間多数決フィ
ルタ処理を用いて形状情報に含まれる微細な変化を除去
することができる。Further, according to the present invention, fine changes included in shape information can be removed by using a spatio-temporal majority filter process.

【００４９】また、本発明によれば、形状ノイズ除去フ
ィルタ部での動き補正処理に用いる動きベクトルと、形
状符号化部で可逆符号化に用いる動きベクトルとを同一
にすることで、形状ノイズ除去フィルタ部での形状ノイ
ズ除去効果を高めることができる。Further, according to the present invention, the motion vector used for the motion correction processing in the shape noise elimination filter unit and the motion vector used for the lossless encoding in the shape encoding unit are made the same, thereby eliminating the shape noise. The shape noise removal effect in the filter unit can be enhanced.

【００５０】[0050]

[Brief description of the drawings]

【図１】オブジェクト形状の外縁部に存在する凹凸の一
例について説明する図である。FIG. 1 is a diagram illustrating an example of unevenness existing at an outer edge portion of an object shape.

【図２】時間の経過によるオブジェクト形状の変化の一
例について説明する図である。FIG. 2 is a diagram illustrating an example of a change in an object shape over time.

【図３】本発明の形状符号化装置の一実施例の構成図で
ある。FIG. 3 is a configuration diagram of an embodiment of a shape encoding device according to the present invention.

【図４】ある画素に対する時空間多数決フィルタ部の処
理の一例について説明する図である。FIG. 4 is a diagram illustrating an example of processing of a spatiotemporal majority filter unit for a certain pixel;

【図５】着目画素のラベルを最も数の多いラベルに置き
換える処理の一例について説明する図である。FIG. 5 is a diagram illustrating an example of a process of replacing a label of a target pixel with a label having the largest number.

【図６】画像をオブジェクト単位に分割する処理の一例
について説明する図である。FIG. 6 is a diagram illustrating an example of a process of dividing an image into object units.

[Explanation of symbols]

１〜９オブジェクト１０形状符号化装置２０形状ノイズ除去フィルタ部２１ラベル付き画像化部２２，２３フレームメモリ２４動きベクトル検出部２５時空間多数決フィルタ部２６形状情報化部３０可逆形状符号化部４１〜４３ウインドウ４４，４６画素４５着目画素４７，４８動きベクトル 1-9 Object 10 Shape Encoding Device 20 Shape Noise Elimination Filter Unit 21 Labeled Imaging Unit 22, 23 Frame Memory 24 Motion Vector Detection Unit 25 Spatio-Temporal Majority Decision Filter Unit 26 Shape Information Unit 30 Reversible Shape Encoding Unit 41 43 window 44,46 pixel 45 pixel of interest 47,48 motion vector

───────────────────────────────────────────────────── フロントページの続き (72)発明者金子豊東京都世田谷区砧一丁目10番11号日本放送協会放送技術研究所内 (72)発明者鹿喰善明東京都世田谷区砧一丁目10番11号日本放送協会放送技術研究所内Ｆターム(参考） 5B057 CA12 CA16 CB12 CB16 CE02 CE09 CE10 5C059 KK01 MA00 MB16 NN01 PP28 RC16 RC19 RC37 UA17 UA18 UA34 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Yutaka Kaneko 1-10-11 Kinuta, Setagaya-ku, Tokyo Japan Broadcasting Corporation Japan Broadcasting Research Institute (72) Inventor Yoshiaki Kakuro 1-1-10 Kinuta, Setagaya-ku, Tokyo No. 11 Japan Broadcasting Corporation Broadcasting Technology Laboratory F-term (reference) 5B057 CA12 CA16 CB12 CB16 CE02 CE09 CE10 5C059 KK01 MA00 MB16 NN01 PP28 RC16 RC19 RC37 UA17 UA18 UA34

Claims

[Claims]

1. A shape encoding device that divides an image into predetermined image region units and encodes shape information of the divided image regions, wherein a shape noise removal filter unit that removes a minute change included in the shape information. And a shape encoding unit that losslessly encodes the shape information from which the fine change has been removed.

2. The shape encoding apparatus according to claim 1, wherein the shape noise removal filter unit includes a spatiotemporal filter unit that performs a motion correction process.

3. A labeling image forming section for generating labeled image information in which the same label information is added to the pixels included in the same shape information, wherein the shape noise removing filter section includes: 3. The image processing apparatus according to claim 1, further comprising: a spatio-temporal majority filtering unit that performs a spatial majority filtering process; and a shape information generating unit that converts the labeled image information subjected to the spatio-temporal majority filtering process into shape information. The shape encoding device according to claim 1.

4. A motion vector used for a motion correction process in the shape noise removal filter unit and a motion vector used for lossless coding in the shape encoding unit are set to be the same. The shape encoding device according to claim 1.