KR100554632B1

KR100554632B1 - Shape encoding method and device for providing real-time object-based video service

Info

Publication number: KR100554632B1
Application number: KR1020030059129A
Authority: KR
Inventors: 장의선; 문석주; 김민훈; 이선영
Original assignee: 학교법인 한양학원
Priority date: 2003-08-26
Filing date: 2003-08-26
Publication date: 2006-02-22
Also published as: KR20050023132A

Abstract

본 발명은 실시간 객체 기반 서비스를 제공하기 위한 형상 정보 부호화 방법 및 장치에 관에 관한 것으로서, 보다 상세하게는 MPEG-4에서 사용되는 형상 정보 부호화(SHAPE INFORMATION ENCODING)의 복잡도를 감소시켜 실시간으로 형상 정보의 부호화가 가능하도록 구성된 형상 정보 부호화 방법 및 장치에 관한 것이다. 본 발명의 바람직한 실시예에 따른 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 방법은 형상 부호화 방법은 대상물의 영상 객체를 포함하는 최소 사각형 경계(TRB, Tightest Rectangular Boundary)를 설정하는 단계, 최소 사각형 경계를 이용하여, 대상물의 형상에 따라 형성된 VOP를 M×N의 화소를 가지는 매크로블록으로 구획하는 단계(여기서, M, N은 자연수) 및 구획 단계에서 구획된 매크로블록의 그리드 시작점을 이동시키면서 대상물의 영상 정보량 감소 위치를 서치하는 과정을 수행하지 아니하고, 매크로블록 단위로 부호화 하는 단계를 포함할 수 있다.
The present invention relates to a method and apparatus for shape information encoding for providing a real-time object-based service, and more particularly, shape information in real time by reducing the complexity of shape information encoding used in MPEG-4. The present invention relates to a shape information encoding method and apparatus configured to enable encoding of. In the shape encoding method for providing a real-time object-based service according to the preferred embodiment of the present invention, the shape encoding method may include: setting a minimum rectangular boundary (TRB) including an image object of a target; Partitioning the VOP formed according to the shape of the object into a macroblock having M × N pixels (where M and N are natural numbers) and moving the grid starting point of the partitioned macroblock in the partitioning step The method may include encoding in units of macroblocks without performing a process of searching for an information reduction position.

실시간, 부호화, VOPF, MEMC, 객체, MPEG, VIDEO, SHAPEReal time, Encoding, VOPF, MEMC, Object, MPEG, VIDEO, SHAPE

Description

Shape encoding method and device for providing real-time object-based video service

도 1은 본 발명의 바람직한 일 실시예에 따른 알고리즘 툴(MPEG-4 OVC encoding tools)을 정리한 도면.BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a diagram illustrating an algorithm tool (MPEG-4 OVC encoding tools) according to an embodiment of the present invention.

도 2는 본 발명의 바람직한 일 실시예에 따른 형상 신호 부호화 절차를 나타낸 순서도. 2 is a flowchart illustrating a shape signal encoding procedure according to an embodiment of the present invention.

도 3은 본 발명의 바람직한 일 실시예에 따른 실시간 형상 신호 부호화 시스템의 전체 구성을 나타낸 도면.3 is a diagram showing the overall configuration of a real-time shape signal encoding system according to an embodiment of the present invention.

도 4는 본 발명의 바람직한 일 실시예에 따른 실시간 형상 신호 부호화부의 구성을 나타낸 도면. 4 is a diagram illustrating a configuration of a real-time shape signal encoder according to an embodiment of the present invention.

도 5는 본 발명의 바람직한 일 실시예에 따른 실시간 VOP 형성부의 동작 원리를 나타낸 도면.5 is a view showing the operation principle of the real-time VOP forming unit according to an embodiment of the present invention.

도 6은 본 발명의 바람직한 일 실시예에 따른 움직임 추정 방법을 포함한 부호화 방법을 나타낸 도면.6 is a diagram illustrating an encoding method including a motion estimation method according to an exemplary embodiment of the present invention.

도 7을 본 발명에서 사용한 테스트 이미지 중 일 실시예를 도시한 도면.Figure 7 illustrates one embodiment of a test image used in the present invention.

도 8은 종래 방식, 실시간 VOP 형성 방식, 실시간 움직임 추정 방식, 두 방 식을 결합하여 사용한 경우를 종합적으로 비교한 결과를 나타낸 그래프.8 is a graph showing a result of comprehensively comparing a conventional method, a real-time VOP formation method, a real-time motion estimation method, and a combination of the two methods.

도 9a 내지 도 9c는 본 발명에 따른 실시간 VOP 형성 방법, 실시간 움직임 추정 방법, 상기 두 방식을 동시에 적용한 실시간 부호화 방식을 종래 방식과 비교한 성능 비교 그래프.
9A to 9C are graphs comparing performances of a real-time VOP forming method, a real-time motion estimation method, and a real-time coding method in which the two methods are simultaneously applied according to the present invention.

<도면의 주요부분에 대한 부호의 설명><Description of the symbols for the main parts of the drawings>

310 : 실시간 VOP 형성부310: real-time VOP forming unit

320 : 실시간 VOP 부호화부320: real-time VOP encoder

330 : 멀티플랙서330: Multiplexer

321 : 실시간 움직임 추정부321: real-time motion estimation unit

600 : MVP의 후보600: MVP candidate

610 : MV for shape610: MV for shape

620 : MV for texture
620: MV for texture

본 발명은 실시간 객체 기반 서비스를 제공하기 위한 형상 정보 부호화 방법 및 장치에 관에 관한 것으로서, 보다 상세하게는 MPEG-4에서 사용되는 형상 정보 부호화(SHAPE INFORMATION ENCODING)의 복잡도를 감소시켜 실시간으로 형상 정보의 부호화가 가능하도록 구성된 형상 정보 부호화 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for shape information encoding for providing a real-time object-based service, and more particularly, shape information in real time by reducing the complexity of shape information encoding used in MPEG-4. The present invention relates to a shape information encoding method and apparatus configured to enable encoding of.

MPEG-4는 영상의 내용에 대한 이해를 바탕으로 한 내용기반 부호화(Content-based Coding) 또는 객체 지향 부호화(Object-based Video Coding, 이하 'OVC' 라 칭함)라는 강점을 가지고 있다. 이러한 OVC에서는 영상 내용을 객체(Object)와 배경으로 나누어 처리하여 전송하므로, 사용자의 의도에 따라 다양한 형태의 조작이나 디스플레이가 가능하며, 사용자는 기존의 수동적인 입장에서 탈피하여, 사용하는 서비스의 내용을 자신의 기호에 따라 좀더 능동적으로 조작하여 자신에게 적합한 환경을 만들 수 있다. MPEG-4 has an advantage of content-based coding or object-based video coding (hereinafter referred to as OVC) based on an understanding of the content of an image. In this OVC, video contents are divided into objects and backgrounds, processed, and transmitted. Therefore, various types of operations and displays are possible according to the user's intention. Can be manipulated more actively according to their preferences to create an environment suitable for them.

이러한 MPEG-4의 visual 파트(Part)의 초판(first edition) 규격에 하면, MPEG-4 visual은 19개의 profile을 가지고 있으나, 실질적으로 서비스되는 MPEG-4는 OVC가 아니라, FVC(Frame-based video coding)를 기반으로 제공되고 있다. 즉, 19개 profile 중 모바일이나 인터넷용 애플리케이션으로 널리 사용되는 프로파일은 저전송율(low bit rate) 애플리케이션의 필요에 의하여 설계된 FVC 기반의 심플 프로파일(Simple profile)이다. 현재, 상업적으로 유용한 MPEG-4 OVC의 애플리케이션의 수는 많지 않으며, 현재 인프라 내에서 서비스 가능한 저전송율 애플리케이션에 대한 연구는 부족한 상태이다. According to the first edition of MPEG-4's visual part, MPEG-4 visual has 19 profiles, but MPEG-4, which is actually serviced, is not OVC, but frame-based video. coding). That is, among the 19 profiles, a profile widely used as a mobile or internet application is an FVC-based simple profile designed by a need of a low bit rate application. At present, the number of commercially available MPEG-4 OVC applications is not large, and there is a lack of research on low-rate applications that can be serviced within the current infrastructure.

2000년 이후로 frame-based MPEG-4 비디오가 활발하게 서비스 되었음에도 불구하고, 실시간 object-based 서비스가 상용화되지 못하는 핵심적인 문제 중에 하나는 frame-based에 비해 object-based 비디오 부호화의 복잡함 때문이다. Arbitrary shape으로 표현된 비디오는 비트스트림(bitstream)에 모션(motion 또는 움직임, 이하 '모션' 또는 'motion'이라 칭함)과 텍스쳐(texture 또는 대상물 내용, 이하 '텍스쳐' 또는 'texture'라 칭함) 정보 외에 형상(shape) 정보를 가진다. 이것은 복호기(decoder)뿐만 아니라 부호기(encoder) 측에 추가적인 복잡함을 제공한다. Although frame-based MPEG-4 video has been actively serviced since 2000, one of the key problems in which real-time object-based services are not commercialized is the complexity of object-based video encoding compared to frame-based. Video represented as an Arbitrary shape is a motion information (motion or motion, hereinafter 'motion' or 'motion') and texture (texture or object content, hereinafter referred to as 'texture' or 'texture') information in the bitstream In addition, it has shape information. This adds additional complexity to the encoder side as well as to the decoder.

이와 같이, 종래 기술에 의한 MPEG-4 OVC는 다음과 같은 문제점이 있다.As described above, the MPEG-4 OVC according to the prior art has the following problems.

첫째, 종래 기술에 의할 때, 제한된 실시간 arbitrarily shaped object 생성 부분에 있어서, 각 video object의 boundary는 매 frame마다 설정되어야 하는 번거로움이 있다. 여기서, Chroma keying을 사용한 Blue screen 기술이 shape 분리 방법을 주로 사용하나, 이것은 한정된 분야에서만 사용가능할 뿐이다.First, according to the prior art, in the limited real-time arbitrarily shaped object generation part, the boundary of each video object has to be set every frame. Here, the blue screen technique using chroma keying mainly uses the shape separation method, but this is only available in limited fields.

둘째, 종래 기술은 형상 부호화(shape coding)로 인한 복잡도 증가로 인하여 효율성이 저하되는 문제점이 있다. FVC 기반의 코딩 방식은 움직임(motion)과 텍스쳐(texture) 두 부분으로 구성된다. 일반적으로 복호화 절차보다 부호화 절차에 많은 비용과 시간이 소요되며, 이러한 부호화 절차 중 특히, 움직임 추정/움직임 보상(MEMC, motion estimation motion compensation, 이하 'MEMC'라 칭함)에 많은 시간 및 비용이 소요된다. 이러한 MEMC로 인하여, MEMC를 포함한 실시간 부호화는 어려운 문제점이 있다. 여기서, OVC는 FVC의 부호화에 더하여, 형상 부호화(shape coding)라는 복잡도가 추가되며, 이러한 부호화의 복잡성으로 인하여 효율성이 저하되는 문제점이 있다.Second, the prior art has a problem that the efficiency is lowered due to the increased complexity due to shape coding. The FVC-based coding scheme consists of two parts, motion and texture. In general, the encoding procedure is more expensive and time consuming than the decoding procedure, and among these encoding procedures, the motion estimation / motion compensation (MEMC) is more time consuming and expensive. . Due to such a MEMC, real-time encoding including the MEMC has a difficult problem. Here, in addition to the coding of the FVC, the OVC adds a complexity called shape coding, and there is a problem in that efficiency decreases due to the complexity of the coding.

특히, MPEG-4 object-based 비디오 부호화 과정 중 형상 부호화(shape coding)의 움직임 추정(Motion estimation)이 차지하는 비중이 전체 압축시간의 75%이상이다. 그러나 종래 기술에 의할 때, 복호화기(Decoder)에 비해 복잡한 처리가 요구되는 부호화기(Encoder)를 최적화하려는 연구가 여러 차례 있었으나, 텍스쳐(Texture)용 MEMC의 복잡성을 줄이기 위한 여러 가지 기술들이 시도되어질 뿐, 형상(shape) 용 MEMC에 대한 연구는 미비한 상태이다.
In particular, the proportion of motion estimation of shape coding in the MPEG-4 object-based video encoding process is 75% or more of the total compression time. However, according to the prior art, there have been several studies to optimize an encoder that requires more complicated processing than a decoder, but various techniques have been tried to reduce the complexity of MEMC for texture. However, research on shape MEMC is insufficient.

따라서 본 발명은 상기의 제반 문제점을 해결하기 위하여 안출한 것으로서, MPEG-4에서 사용되는 다수의 부호화 툴 중에 형상 정보 부호화(SHAPE INFORMATION ENCODING)의 복잡도를 감소시켜 실시간으로 형상 정보의 부호화가 가능한 부호화 방법 및 장치를 제공함에 있다. 즉, 본 발명에서는 현재 MPEG-4 비디오 부호화에서 사용되는 툴을 분석하여 부호화기(encoder)의 복잡함을 줄일 수 있는 효과적인 부호화기술 및 기존에는 연구되지 않았던 형상 부호화에 있어서의 MEMC 알고리즘을 제공할 수 있다.Accordingly, the present invention has been made to solve the above problems, and is a coding method capable of encoding shape information in real time by reducing the complexity of SHAPE INFORMATION ENCODING among a plurality of coding tools used in MPEG-4. And providing an apparatus. In other words, the present invention can provide an effective encoding technique that can reduce the complexity of an encoder by analyzing a tool currently used in MPEG-4 video encoding and a MEMC algorithm in shape encoding that has not been studied before.

본 발명의 다른 목적은 기존에 부호화의 효율성을 위하여 필수적으로 여겨지던 알고리즘의 생략을 통하여, 실질적인 성능 감소없이 실시간 부호화 가능한 형상 정보의 부호화 방법 및 장치를 제공함에 있다. 즉, 본 발명은 VOP 형성(Formation)과 MEMC부분에 사용하던 방식에 부호화의 효율성을 증가시키기 위한 별도의 알고리즘 처리를 부가하지 아니함으로써, 부호화의 복잡함을 근본적으로 해결할 수 있는 부호화 방법 및 장치를 제공함에 있다. 특히, 본 발명은 반복적으로 처리되는 나선형 검색 자체를 생략함으로 실행속도를 기존의 어떤 방법 보다 효과적으로 절약할 수 있다.Another object of the present invention is to provide a method and apparatus for encoding shape information that can be encoded in real time without substantially reducing performance by omitting an algorithm previously considered essential for the efficiency of encoding. That is, the present invention provides an encoding method and apparatus that can fundamentally solve the complexity of encoding by not adding a separate algorithm process for increasing the efficiency of encoding to the method used for VOP formation and MEMC part. Is in. In particular, the present invention saves execution speed more effectively than any conventional method by omitting the recursive spiral search itself.

또한, 본 발명의 또 다른 목적은 VOP 위치를 설정하는 VOP 형성 단계에 있어서, 지능형 VOP 형성(IVOPF, Intelligent VOP Formation) 알고리즘의 복잡한 처리에 대한 문제점 제기와 이에 대한 해결책을 제시함에 있다.
In addition, another object of the present invention is to propose a problem and a solution to the complicated processing of the Intelligent VOP Formation (IVOPF) algorithm in the VOP forming step of setting the VOP position.

상술한 목적을 달성하기 위하여 본 발명의 제1 측면에 따르면, 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 방법이 제공될 수 있다.According to a first aspect of the present invention, a shape encoding method for providing a real-time object-based service may be provided.

바람직한 실시예에 의할 때, 상기 형상 부호화 방법은 대상물의 영상 객체를 포함하는 최소 사각형 경계(TRB, Tightest Rectangular Boundary)를 설정하는 단계, 상기 최소 사각형 경계를 이용하여, 대상물의 형상에 따라 형성된 VOP를 M×N의 화소를 가지는 매크로블록으로 구획하는 단계(여기서, M, N은 자연수) 및 상기 구획 단계에서 구획된 매크로블록의 그리드 시작점을 이동시키면서 대상물의 영상 정보량 감소 위치를 찾는 과정을 수행하지 아니하고, 상기 매크로블록 단위로 부호화 하는 단계를 포함할 수 있다.According to a preferred embodiment, the shape encoding method may include setting a minimum rectangular boundary (TRB) including an image object of an object, and using the minimum rectangular boundary, a VOP formed according to the shape of the object. Is divided into a macroblock having pixels of M × N (where M and N are natural numbers) and the process of finding a position of decreasing image information amount of an object while moving the grid starting point of the partitioned macroblock in the partitioning step. In addition, the method may include encoding the macroblock unit.

여기서, 상기 방법은 상기 부호화 방식이 실시간인지 여부를 판단하는 단계를 더 포함할 수 있다. 그리고 상기 방법은 움직임 추정을 위한 후보군을 설정함에 있어, 어떠한 블록 비교를 포함하는 알고리즘을 수행하지 아니하고, 현재 이진 알파 블록(BAB, Binary Alpha Block)의 MV를 MVP로 설정하는 단계를 더 포함할 수도 있다. Here, the method may further include determining whether the encoding scheme is real time. The method may further include setting an MV of a current binary alpha block (BAB) as MVP without performing an algorithm including any block comparison in setting a candidate group for motion estimation. have.

바람직한 다른 실시예에 의할 때, 상기 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 방법은 형상 부호화를 위한 VOP를 형상하는 단계, 대상물의 형상에 따라 형성된 VOP를 M×N의 화소를 가지는 매크로블록으로 구획하는 단계(여기서, M, N은 자연수) 및 상기 매크로블록 단위로 움직임 추정을 위한 후보군을 설정함에 있어, 어떠한 블록 비교를 포함하는 알고리즘을 수행하지 아니하고, 현재 이진 알파 블록(BAB, Binary Alpha Block)의 MV를 MVP로 설정하는 단계를 포함할 수 있다.
According to another preferred embodiment, the shape coding method for providing a real-time object-based service is to form a VOP for shape coding, VOP formed according to the shape of the object as a macroblock having M × N pixels In the step of partitioning (where M and N are natural numbers) and setting a candidate group for motion estimation in units of the macroblock, the current binary alpha block (BAB) is not performed without performing an algorithm including any block comparison. The MV of) may be set to MVP.

상술한 목적을 달성하기 위하여 본 발명의 제2 측면에 따르면, 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 장치가 제공될 수 있다.In order to achieve the above object, according to the second aspect of the present invention, a shape encoding apparatus for providing a real-time object-based service may be provided.

바람직한 실시예에 의할 때, 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 장치는 대상물의 영상 객체를 포함하는 최소 사각형 경계(TRB, Tightest Rectangular Boundary)를 설정하고, 상기 최소 사각형 경계를 이용하여, 대상물의 형상에 따라 형성된 VOP를 M×N의 화소를 가지는 매크로블록으로 구획하는 구획하고, 상기 구획 단계에서 구획된 매크로블록의 그리드 시작점을 이동시키면서 대상물의 영상의 정보량 감소 위치를 서치하는 과정을 수행하지 아니하고, 상기 매크로블록 단위로 부호화하기 위한 VOP 형성부(여기서, M, N은 자연수), 상기 VOP 형성부에서 형성된 VOP를 부호화하기 위한 형상 부호화부 및 상기 부호화된 형상 신호를 다중화하여, 비트스트림으로 전송하기 위한 멀티플렉서를 포함할 수 있다.According to a preferred embodiment, the shape encoding apparatus for providing a real-time object-based service sets the minimum rectangular boundary (TRB) including the image object of the object, and using the minimum rectangular boundary, The VOP is partitioned into macroblocks having M × N pixels, and the search for the information reduction position of the image of the object is performed while moving the grid starting point of the partitioned macroblock in the partitioning step. Instead, the VOP forming unit for encoding the macroblock unit (where M and N are natural numbers), the shape encoder for encoding the VOP formed in the VOP forming unit, and the encoded shape signal are multiplexed into a bitstream. It may include a multiplexer for transmission.

여기서, 상기 형상 부호화 장치는 상기 부호화 방식이 실시간인지 여부를 판 단할 수 있다. 그리고, 상기 장치는 움직임 추정을 위한 후보군을 설정함에 있어, 어떠한 블록 비교를 포함하는 알고리즘을 수행하지 아니하고, 현재 이진 알파 블록(BAB, Binary Alpha Block)의 MV를 MVP로 설정하는 실시간 움직임 추정부를 더 포함할 수 있다.Here, the shape encoding apparatus may determine whether the encoding scheme is real time. Further, in setting a candidate group for motion estimation, the apparatus does not perform an algorithm including any block comparison, and further comprises a real-time motion estimator for setting an MV of a current binary alpha block (BAB) as an MVP. It may include.

바람직한 다른 실시예에 의할 때, 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 장치는 형상 부호화를 위한 VOP를 형성하고, 대상물의 형상에 따라 형성된 VOP를 M×N의 화소를 가지는 매크로블록으로 구획하기 위한 VOP 형성부, 상기 VOP 형성부에서 형성된 VOP를 부호화하기 위한 형상 부호화부 및 상기 부호화된 형상 신호를 다중화하여, 비트스트림으로 전송하기 위한 멀티플렉서, 상기 매크로블록 단위로 움직임 추정을 위한 후보군을 설정함에 있어, 어떠한 블록 비교를 포함하는 알고리즘을 수행하지 아니하고, 현재 이진 알파 블록(BAB, Binary Alpha Block)의 MV를 MVP로 설정하기 위한 움직임 추정부를 포함할 수 있다.
According to another exemplary embodiment, a shape encoding apparatus for providing a real-time object-based service forms a VOP for shape coding, and divides the VOP formed according to the shape of the object into a macroblock having pixels of M × N. A multiplexer for multiplexing the encoded shape signal, a multiplexer for transmitting the encoded shape signal, and a candidate group for motion estimation in units of macroblocks; Therefore, a motion estimation unit for setting an MV of a current binary alpha block (BAB) as an MVP may be included without performing an algorithm including any block comparison.

이하, 첨부한 도면들을 참조하여 본 발명에 따른 실시간 객체 기반 서비스를 제공하기 위한 형상 부호화 방법의 바람직한 실시예를 상세히 설명하기로 하며, 첨부 도면을 참조하여 설명함에 있어 도면 부호에 상관없이 동일하거나 대응하는 구성 요소는 동일한 참조번호를 부여하고 이에 대한 중복되는 설명은 생략하기로 한다.
Hereinafter, exemplary embodiments of a shape encoding method for providing a real-time object-based service according to the present invention will be described in detail with reference to the accompanying drawings. The components to be given the same reference numerals and duplicate description thereof will be omitted.

압축 알고리즘 툴 및 객체 기반 부호화 순서도Compression Algorithm Tool and Object-Based Coding Flowchart

기존 부호화 툴 중 압축률을 높이기 위해 실행하는 IVOPF 등을 포함하는 지능형 VOP 형성 알고리즘 및 움직임 추정 알고리즘은 많은 처리시간은 요구하여 실시간 부호화에 많은 장애가 된다. VOP 형성 및 움직임 추정 등을 포함하는 부호화 알고리즘에 대한 그 동안의 연구의 대부분은 압축률을 높이거나, 실행속도를 절약하기 위해서 새로운 알고리즘을 제시한다. 그러나 종래 기술에 의할 때, MV(motion vector)의 계산을 복잡하게 만드는 나선형 검색 문제를 근본적으로 해결하지는 못하는 실정이다. 이하, 도 1에서 다양한 부호화 알고리즘 툴을 소개하고, 도 2에는 본 발명에 따른 부호화 알고리즘의 절차를 설명하기로 한다.
Intelligent VOP formation algorithm and motion estimation algorithm including IVOPF which is executed to increase the compression rate among existing coding tools require much processing time and become a lot of obstacle in real time coding. Most of the previous studies on coding algorithms including VOP formation and motion estimation suggest new algorithms to increase the compression rate or to reduce the execution speed. However, according to the prior art, it is not possible to fundamentally solve the spiral search problem which complicates the calculation of the motion vector (MV). Hereinafter, various encoding algorithm tools will be introduced in FIG. 1, and a procedure of an encoding algorithm according to the present invention will be described in FIG. 2.

도 1은 본 발명의 바람직한 일 실시예에 따른 알고리즘 툴(MPEG-4 OVC encoding tools)을 정리한 도면이다.1 is a diagram summarizing an algorithm tool (MPEG-4 OVC encoding tools) according to an embodiment of the present invention.

도 1을 참조하면, 이진 형상신호의 부호화 과정에서 사용하는 일반적인 부호화 알고리즘이 정리되어 있으며, 이중 많은 시간이 소요되는 부분에 대하여 체크 표시가 되어 있다. 본 발명은 실시간 서비스인 경우, 기존의 복잡한 알고리즘을 사용하지 않고도 큰 성능 저하없이 부호화할 수 있는 방법 및 장치를 제공할 수 있다. 이하, 도 1을 참조하여 객체 기반 서비스의 다양한 알고리즘 툴, 예를 들면 MPEG-4 OVC의 기본 요소들인 MPEG-4 OVC, MPEG-4 OVC decoding process, MPEG-4 OVC encoding process, VOP Formation, Binary 형상 부호화(shape coding) 등에 대하여 간략히 설명하기로 한다. Referring to FIG. 1, a general encoding algorithm used in a process of encoding a binary shape signal is summarized, and a check mark is placed on a portion which takes a lot of time. The present invention can provide a method and apparatus capable of encoding without significant performance degradation in the case of a real-time service without using an existing complex algorithm. Hereinafter, various algorithm tools of the object-based service, for example, MPEG-4 OVC, MPEG-4 OVC decoding process, MPEG-4 OVC encoding process, VOP Formation, Binary shapes Shape coding and the like will be briefly described.

형상 부호화 절차는 이진 형상 부호화(Binary shape coding)(140) 및 VOP 형 성(Formation)(100)으로 구분될 수 있다.The shape coding procedure may be divided into a binary shape coding 140 and a VOP formation 100.

이진 형상 부호화(Binary shape coding)(140)에서 8 bit 값을 가진 알파맵(alpha map)은 주어진 이미지의 투명도를 표시한다. 만약 pixel 값이 0이면 transparent pixel이 되고, 값이 255이면 opaque pixel이 된다. 그리고 pixel값이 0과 255 사이라면, 그 pixel은 배경 pixel값과 겹쳐지는 경우를 의미한다. 여기서, binary shape은 alpha value로 0과 255 값을 가질 수 있으며, 이러한 이진 형상 정보(Binary shape data)는 색상 정보가 아닌 픽셀(pixel)의 불투명도를 표현하기 때문에, Binary shape과 해당 texture data의 관계는 motion과 texture 관계처럼 가깝지 않다. 만약 알파맵 데이터(alpha map data)가 1과 255 사이 값을 갖는 경우는, 그레이스케일 형상 부호화(grayscale shape coding)(150) 이미지라 지칭한다. 그리고 이진 형상 부호화(Binary shape coding)(140)는 모드 결정(mode decision), 움직임 추정/보상(MEMC), 서브샘플링(sub-sampling), 스캔 순서 결정(scan order decision), inter/intra CAE와 scalable shape 등을 포함할 수 있다.In binary shape coding 140, an alpha map with an 8-bit value indicates the transparency of a given image. If the pixel value is 0, it becomes a transparent pixel. If the value is 255, it becomes an opaque pixel. If the pixel value is between 0 and 255, the pixel overlaps with the background pixel value. Here, the binary shape may have a value of 0 and 255 as an alpha value, and since the binary shape data represents the opacity of the pixel rather than the color information, the relationship between the binary shape and the corresponding texture data Is not as close as motion and texture. If the alpha map data has a value between 1 and 255, it is referred to as a grayscale shape coding 150 image. The binary shape coding 140 may include mode decision, motion estimation / compensation (MEMC), subsampling, scan order decision, inter / intra CAE, and the like. It may include a scalable shape.

그리고 VOP Formation(100)은 객체를 추출하여 형성하는 알고리즘을 지칭한다. 이하, Video object의 각 frame을 video object plane(VOP)를 지칭하기로 한다. 비디오 객체(video object)가 rectangular가 아닌 arbitrarily shaped video이라면, shape 부분의 각 Binary Alpha Block (BAB)은 전체 픽셀 값이 255인 all opaque와 일부 pixel 값이 255인 partially opaque, 전체 pixel 값이 0인 all transparent로 나누어진다. 여기서, BAB을 생성하고 참조하기 위해 전체 프레임을 쓰는 대신, 추출할 형상(shape)을 포함하는 가장 작은 사각형(tightest rectangular boundary, 이하 TRB라 칭함)을 사용할 수 있다.The VOP Formation 100 refers to an algorithm for extracting and forming an object. Hereinafter, each frame of the video object will be referred to as a video object plane (VOP). If the video object is arbitrarily shaped video rather than rectangular, each Binary Alpha Block (BAB) in the shape part has an all opaque with a total pixel value of 255, a partial opaque with a partial pixel value of 255, and a total pixel value of zero. It is divided into all transparent. Here, instead of writing the entire frame to generate and refer to the BAB, one may use the smallest rectangular boundary (hereinafter referred to as TRB) containing the shape to be extracted.

여기서, TRB의 크기는 종종 16의 배수가 아니므로, 일부 매크로블록은 불필요한 transparent pixel을 가지게 된다. IVOPF(Intelligent VOP formation)를 포함하는 종래의 부호화 알고리즘은 압축률을 높이기 위해서 비디오 객체의 첫 번째 매크로블록의 시작 pixel값을 계산하는 과정을 대부분 포함하고 있다. 본 발명에 의한 VOP 형성 알고리즘은 불필요한 transparent pixel를 줄이기 위한 알고리즘을 포함하지 않더라도 성능 저하가 없는 부호화 방법을 제시한다.Here, since the size of the TRB is often not a multiple of 16, some macroblocks have unnecessary transparent pixels. Conventional coding algorithms including intelligent VOP formation (IVOPF) include most of the process of calculating the starting pixel value of the first macroblock of a video object in order to increase the compression rate. The VOP formation algorithm according to the present invention proposes an encoding method without deterioration in performance even without including an algorithm for reducing unnecessary transparent pixels.

하기의 표 2(BAB Coding Modes)에서 보여주듯이 형상 부호화(shape coding)에는 7가지 mode가 있으며, 모드 결정 알고리즘(Mode decision)(110)은 이러한 모드를 결정하는 절차를 지칭한다.As shown in Table 2 (BAB Coding Modes), there are seven modes of shape coding, and a mode decision 110 refers to a procedure for determining such a mode.

Type NumberType Number Coding ModeCoding mode UsageUsage 00 MVDs==0 && no updateMVDs == 0 && no update P- or B-VOPsP- or B-VOPs 1One MVDs!=0 && no updateMVDs! = 0 && no update P- or B-VOPsP- or B-VOPs 22 All transparentAll transparent All VOPsAll VOPs 33 All opaqueAll opaque All VOPsAll VOPs 44 intraCAEintraCAE All VOPsAll VOPs 55 MVDs==0 && interCAEMVDs == 0 && interCAE P- or B-VOPsP- or B-VOPs 66 MVDs!=0 && interCAEMVDs! = 0 && interCAE P- or B-VOPsP- or B-VOPs

상기 표 1에서, 0, 2, 3 mode의 경우 더 이상의 추가적인 부호화가 필요하지 않으며, IntraCAE는 intra-mode CAE 과정을 요구한다. no update는 전 VOP의 BAB을 복사하여 MC를 실행하며, InterCAE는 MC과정과 inter-mode CAE를 실행하고, MVD는 현재 motion vector와 참조되는 motion vector의 차이를 의미한다.In Table 1, in case of 0, 2, and 3 modes, no further encoding is required, and IntraCAE requires an intra-mode CAE process. no update copies the BAB of the previous VOP and executes the MC. InterCAE executes the MC process and inter-mode CAE, and MVD means the difference between the current motion vector and the referenced motion vector.

움직임 추정/움직임 보상(MEMC) 알고리즘(120)은 특히 video object가 움직임이 거의 없거나, 천천히 움직이는 경우 효율적인 알고리즘이다. 움직임 추정/움 직임 보상(MEMC) 알고리즘(110)은 MVP(motion vector predictor)를 결정 과정, 움직임 추정(motion estimation) 과정 및 움직임 보상(motion compensation) 과정을 포함한다.
The motion estimation / motion compensation (MEMC) algorithm 120 is an efficient algorithm, especially when the video object has little motion or moves slowly. The motion estimation / motion compensation (MEMC) algorithm 110 includes a process of determining a motion vector predictor (MVP), a motion estimation process, and a motion compensation process.

도 2는 본 발명의 바람직한 일 실시예에 따른 형상 신호 부호화 절차를 나타낸 순서도이다. 설명의 번잡을 회피하기 위하여 도 2의 순서도에서 기본적인 부호화 과정의 절차는 생략하고 본 발명에서 특징있는 부호화 과정만을 도시하였다.2 is a flowchart illustrating a shape signal encoding procedure according to an embodiment of the present invention. In order to avoid the confusing explanation, the procedure of the basic encoding process is omitted in the flowchart of FIG. 2 and only the encoding process characteristic of the present invention is illustrated.

MPEG-4 OVC 부호와 단계는 형상 부호화(shape coding), 텍스쳐 보상(texture coding)과 움직임 추정/보상(motion estimation and compensation)으로 나누어지며, 일반적으로 16*16 pixel의 MB을 기준으로 하는 block-based coding 이다. 이와 같은 부호화 절차(encoding process)는 상기 표 1에서 보여주듯이 많은 부호화 tool을 가지고 있으며, 많은 시간이 소요되는 알고리즘으로 인하여, 실시간 부호화를 수행함에 어려움이 발생한다. 본 발명은 이 중 형상 부호화와 관련하여 많은 시간이 소요되는 VOP 형성 및 형상용 움직임 추정/보상에 대한 근본적인 해결 제안을 제시할 수 있다.MPEG-4 OVC codes and steps are divided into shape coding, texture coding and motion estimation and compensation, and are generally based on 16 * 16 pixel MB. based coding. As shown in Table 1, the encoding process has many encoding tools, and due to algorithms that require a lot of time, it is difficult to perform real-time encoding. The present invention can propose a fundamental solution proposal for VOP formation and shape motion estimation / compensation which takes a lot of time in relation to shape coding.

이하, 도 2를 참조하여, 본 발명에 따른 실시간 객체 기반 서비스를 위한 부호화 방법을 설명하기로 한다.Hereinafter, an encoding method for a real-time object based service according to the present invention will be described with reference to FIG. 2.

먼저, 단계 S200에서 VOP 형성 단계인지 판단하며, VOP 형성 단계인 경우, S210에서, 상기 서비스가 실시간 서비스인지 여부를 판단한다. 판단 결과, 실시간 서비스가 아닌 경우, 단계 S220에서 종래 방식(예를 들면, IVOPF)에 따라 VOP를 형 성하고, 실시간 서비스인 경우, 단계 S230에서 본 발명에 따른 실시간 VOP 형성 방식에 따라 VOP를 형성한다. 본 발명의 바람직한 실시예에 따른 실시간 VOP 형성 방식은 TRB만을 이용하여 VOP를 형성하는 것이다(도 5참조)First, it is determined whether or not the VOP forming step in step S200, and in the case of the VOP forming step, in step S210, it is determined whether the service is a real-time service. As a result of the determination, if it is not a real time service, a VOP is formed according to a conventional method (for example, IVOPF) in step S220, and in case of a real time service, a VOP is formed according to the real time VOP formation method according to the present invention in step S230. do. Real-time VOP formation method according to a preferred embodiment of the present invention is to form a VOP using only TRB (see Fig. 5)

본 발명은 이러한 종래 기술에 따른 VOP 형성부분에 대한 실용성을 복잡도 계산과 압축효율을 분석함으로 검증함으로써, 최소의 non-transparent 블록을 포함하지 않더라도 성능에 큰 차이가 없는 실시간 VOP 형성 알고리즘을 선택적으로 사용함으로써, 실시간 부호화 서비스를 제공할 수 있다. 여기서, 실시간 서비스 여부에 관계없이, TRB를 사용하여 부호화하도록 설정할 수 있음은 물론이다.The present invention verifies the practicality of the VOP forming portion according to the prior art by analyzing the complexity calculation and the compression efficiency, thereby selectively using a real-time VOP forming algorithm with no significant difference in performance even if it does not include the minimum non-transparent block. Thus, a real time encoding service can be provided. Here, of course, it may be set to encode using TRB regardless of the real-time service.

이후, 단계 S240에서, 부호화 단계 중 상기 단계가 형상용 움직임 추정/움직임 보상(MEMC) 단계인지 여부를 판단한다. 판단 결과, 형상용 움직임 추정/움직임 보상(MEMC) 단계가 아닌 경우, 단계 S260에서 종래 방식에 따라 블록 비교를 수행하는 움직임 추정/움직임 보상(MEMC) 절차를 수행할 수 있다. 그리고 판단 결과, 움직임 추정/움직임 보상(MEMC) 단계인 경우, 단계 S250에서, 상기 서비스가 실시간 서비스인지 여부를 판단하도록 구성될 수 있다.Thereafter, in step S240, it is determined whether the step of the encoding step is a shape motion estimation / motion compensation (MEMC) step. As a result of the determination, if it is not a shape motion estimation / motion compensation (MEMC) step, a motion estimation / motion compensation (MEMC) procedure of performing a block comparison according to the conventional method may be performed in step S260. If it is determined that the motion estimation / motion compensation (MEMC) step is performed, it may be configured in step S250 to determine whether the service is a real-time service.

판단 결과, 실시간 서비스가 아닌 경우, 단계 S260에서 종래 방식에 따른 움직임 추정/움직임 보상(MEMC) 알고리즘을 수행하고, 실시간 서비스인 경우,단계 S270에서 본 발명에 따른 실시간 움직임 추정/움직임 보상(MEMC) 알고리즘을 수행한다.As a result of the determination, if it is not a real time service, a motion estimation / motion compensation (MEMC) algorithm according to the conventional method is performed in step S260, and if it is a real time service, real time motion estimation / motion compensation according to the present invention in step S270 Perform the algorithm.

움직임 추정/움직임 보상(MEMC)은 형상(shape)과 텍스쳐(texture) 두 곳에서 사용될 수 있다. 이러한 MEMC는 encoding process 중에서도 복잡하여 많은 시간이 소요된다. 기존의 연구는 텍스쳐(texture) 부호화시의 MEMC의 복잡함을 해결하려는 부분에 집중되어 있으며, 유사한 알고리즘을 형상(shape)에도 사용하였다. 본 발명에 의할 때, 형상 부호화시의 움직임 추정/움직임 보상(MEMC) 단계에서는 블록 비교 절차가 생략된 실시간 움직임 추정/움직임 보상(MEMC) 알고리즘을 제공함으로써, 성능 저하 없이 실시간 객체 기반 서비스를 제공할 수 있다(도 6참조). Motion estimation / motion compensation (MEMC) can be used in both shapes and textures. This MEMC is complicated and time consuming during the encoding process. Previous research has focused on solving the complexity of MEMC in texture coding, and similar algorithms have been used for shapes. According to the present invention, in the motion estimation / motion compensation (MEMC) step in shape coding, by providing a real-time motion estimation / motion compensation (MEMC) algorithm in which a block comparison procedure is omitted, a real-time object-based service is provided without degrading performance. (See FIG. 6).

본 발명에 따르면 실시간 서비스인 경우, 형상(Shape)에 대한 MEMC 과정을 단순화함으로써, MVD(Motion Vector Difference)는 항상 0으로 설정될 수 있다. 따라서 BAB Coding 의 여러 가지 Mode가 도시된 상기 표 2에서, 1번, 6번 Mode는 발생하지 아니한다.
According to the present invention, in the real-time service, the motion vector difference (MVD) may be always set to 0 by simplifying the MEMC process for the shape. Therefore, in Table 2 in which various modes of BAB Coding are shown, Modes 1 and 6 do not occur.

실시간 객체 기반 시스템의 구성도Diagram of Real-time Object-based System

이하, 본 발명에 따른 실시간 VOP 형성부 및 실시간 VOP 부호화부의 구성을 첨부 도면을 참조하여 보다 상세히 설명하기로 한다. 도 3에서는 실시간 VOP 형성부를 포함하는 전체 시스템의 구성을 설명하고, 도 4에서는 실시간 부호화부의 상세 구성을 설명하기로 한다.
Hereinafter, the configurations of the real-time VOP forming unit and the real-time VOP encoder according to the present invention will be described in detail with reference to the accompanying drawings. In FIG. 3, the configuration of the entire system including the real-time VOP forming unit will be described. In FIG. 4, the detailed configuration of the real-time encoder will be described.

도 3은 본 발명의 바람직한 일 실시예에 따른 실시간 형상 신호 부호화 시스템의 전체 구성을 나타낸 도면이다.3 is a diagram showing the overall configuration of a real-time shape signal encoding system according to an embodiment of the present invention.

MPEG-4 OVC는 하나의 장면을 배경(rectangular shape)과 객체(arbitrary shape)로 분류되며, 형상(shape or alpha map)이라 불리는 부가적인 정보가 비트스 트림(bitstream)으로 부호화된다. 형상(Shape) 정보는 주어진 점(pixel)이transparent(shape밖의 부분), opaque(shape내 부분) 인지를 구별하는데 사용되며 이러한 형상 정보를 이진 형상(binary shape)으로 칭하기로 한다.MPEG-4 OVC classifies a scene into a background shape and an object shape, and additional information, called a shape or alpha map, is encoded in a bitstream. Shape information is used to distinguish whether a given pixel is a transparent (part outside the shape) or an opaque (part inside the shape). This shape information will be referred to as a binary shape.

이하, 본 발명에 따른 형상 정보 부호화 시스템을 수행하는 부호화기(300) 및 복호화를 수행하는 복호화기(350)로 나누어 설명하기로 한다.Hereinafter, a description will be given of an encoder 300 that performs a shape information encoding system and a decoder 350 that performs decoding according to the present invention.

먼저, MPEG-4 OVC의 부호화기(300)는 실시간 VOP 형성부(310), 실시간 VOP 부호화부(320) 및 멀티플랙서(330)를 포함하여 구성된다.First, the encoder 300 of the MPEG-4 OVC includes a real time VOP forming unit 310, a real time VOP encoder 320, and a multiplexer 330.

여기서, 실시간 VOP 형성부(VOP FORMATION)(310)는 전송 또는 저장할 영상 시퀀스(SEQUENCE)가 입력될 경우에 이를 대상물 영상 단위로 나누어 각기 다른 VOP로 형성한다. 본 발명에 따른 실시간 VOP 형성부(VOP FORMATION)(310)는 실시간 전송인 경우, 기존의 복잡도 높은 과정을 거치지 아니하고, 최소 TRB를 추출하도록 구성된다(도 5 참조). 즉, 본 발명은 기존의 방식과 같이, 압축률을 높이기 위하여 수행하는 최적화 알고리즘을 수행하지 않고, 미리 설정된 방식(예를 들면, TRB)에 의하여 내용 객체만을 추출하도록 구성된다. 기존 방식에 의할 때, non-transparent MB을 최소 또는 적정 수준에 유지하면서도, 압축 효율을 증가시키는 부분에 한정되어 있었다.In this case, when a real time VOP forming unit 310 receives an image sequence (SEQUENCE) to be transmitted or stored, it is divided into object image units to form different VOPs. The real-time VOP formation unit 310 according to the present invention is configured to extract the minimum TRB without going through a high complexity process in the case of real-time transmission (see FIG. 5). That is, the present invention is configured to extract only the content object by a preset method (for example, TRB) without performing an optimization algorithm performed to increase the compression ratio as in the conventional method. According to the conventional method, it was limited to increase the compression efficiency while maintaining non-transparent MB at the minimum or proper level.

한편, 상기 실시간 VOP 형성부(310)에서 형성된 각각의 VOP는 VOP 부호화부(320-1, 320-2, …)에 각각 입력되어 VOP 별로 부호화되고, 멀티플렉서(330)에서 다중화되어 비트스트림(BIT STREAM)으로 전송된다.Meanwhile, each VOP formed by the real-time VOP forming unit 310 is input to the VOP encoders 320-1, 320-2, ..., respectively, encoded for each VOP, multiplexed by the multiplexer 330, and then bit stream (BIT). STREAM).

복호화기(decoder)(350)의 구성을 다음과 같다. 상기 부호화기(300)를 통해 부호화되고, 비트스트림으로 전송되는 정보인 VOP의 부호화 신호는 디멀티플렉서(360)에서 VOP별로 각기 분리된다. 그리고 상기 분리된 각각의 VOP 부호화 신호는 복수개의 복호화부(370-1, 370-2, 370-3, …)를 포함할 수 있는 복호화부(370)에 의해 각기 복호화되며, 상기 복호화기(370)에서 출력되는 복호화 신호는 합성부(23)에서 합성되어 원래의 영상으로 출력된다. 이와 같이 실시간 VOP 형성부에 의하여 추출된 VOP는 하기에 설명한 실시간 부호화기을 통하여 부호화될 수 있다.
The structure of the decoder 350 is as follows. The coded signal of the VOP, which is coded through the encoder 300 and is transmitted in the bitstream, is separated for each VOP in the demultiplexer 360. Each of the separated VOP coded signals is decoded by the decoder 370, which may include a plurality of decoders 370-1, 370-2, 370-3,... And the decoder 370. The decoded signal output from) is synthesized by the combining unit 23 and output as an original image. As such, the VOP extracted by the real-time VOP forming unit may be encoded through the real-time encoder described below.

도 4는 본 발명의 바람직한 일 실시예에 따른 실시간 형상 신호 부호화부의 구성을 나타낸 도면이다.4 is a diagram illustrating a configuration of a real-time shape signal encoder according to an exemplary embodiment of the present invention.

실시간 VOP 부호화부의 구성을 도 4를 참조하여 설명하면, 상기 실시간 VOP 형성부(310)에서 형성된 각각의 대상물 영상에 대한 VOP가 움직임 추정부(MOTION ESTIMATION)(321)에 입력되면, 상기 실시간 움직임 추정부(321)는 인가된 VOP로부터 매크로블록 단위의 움직임을 추정할 수 있다. 상기 실시간 움직임 추정부(321)에서 추정된 움직임 정보는 움직임 보상부(MOTION COMPENSATION)(322)에 입력되어 움직임이 보상된다.The configuration of the real-time VOP encoder will be described with reference to FIG. 4. When a VOP for each target image formed by the real-time VOP forming unit 310 is input to the motion estimation unit 321, the real-time motion estimation is performed. The government 321 may estimate the motion of the macroblock unit from the authorized VOP. The motion information estimated by the real-time motion estimation unit 321 is input to a motion compensation unit 322 to compensate for the motion.

그리고 상기 움직임 보상부(322)에서 움직임이 보상된 VOP의 각 MB는 상기 VOP 형성부(310)에서 형성된 참조되는 VOP의 각 MB와 함께 감산기(323)에 입력되어 차이값이 검출되고, 상기 감산기(323)에서 검출된 차이값은 텍스쳐 부호화부(324)에 입력되어 매크로블록의 서브 블록 단위로 대상물의 텍스쳐 정보가 부호화된다. 한편, 상기 움직임 보상부(322)에서 움직임이 보상된 VOP와, 상기 대상물 텍스쳐 정보를 부호화하기 위한 텍스쳐 부호화부(324)에서 부호화된 대상물의 텍스쳐 정보는 가산기(325)에서 입력되어 가산되고, 상기 가산기(325)의 출력신호는 이전VOP 검출부(PREVIOUS RECONSTRUCTED VOP)(326)에 입력되어 현재 영상의 바로 전 영상의 VOP에 해당하는 이전 VOP가 검출된다. 상기 이전 VOP 검출부(326)에서 검출된 상기 이전 VOP는 상기 실시간 움직임 추정부(321) 및 움직임 보상부(322)에 입력되어 움직임 추정 및 움직임 보상에 사용된다.Each MB of the VOP whose motion is compensated by the motion compensator 322 is input to the subtractor 323 together with each MB of the referenced VOP formed by the VOP forming unit 310 to detect a difference value. The difference value detected at 323 is input to the texture encoder 324 to encode texture information of the object in sub-block units of the macroblock. Meanwhile, the VOP whose motion is compensated by the motion compensator 322 and the texture information of the object encoded by the texture encoder 324 for encoding the object texture information are inputted and added by the adder 325. The output signal of the adder 325 is input to a PREVIOUS RECONSTRUCTED VOP 326 to detect a previous VOP corresponding to the VOP of the image immediately before the current image. The previous VOP detected by the previous VOP detector 326 is input to the real time motion estimator 321 and the motion compensator 322 and used for motion estimation and motion compensation.

그리고 상기 실시간 VOP 형성부(310)에서 형성된 VOP는 실시간 형상 부호화부(327)에 입력되어 형상 정보가 부호화된다. 상기 형상 부호화부(327)의 출력신호를 움직임 추정부(321), 움직임 보상부(322) 및 텍스쳐 부호화부(324)에 입력시켜 움직임 추정, 움직임 보상 및 대상물의 내부 정보를 부호화 하는데 사용할 수 있다. 또한, 상기 실시간 움직임 추정부(321)에서 추정된 움직임 정보와 상기 텍스쳐 부호화부(324)에서 부호화된 대상물 텍스쳐 정보 및 상기 형상 부호화부(327)에서 부호화된 모양 정보는 멀티플렉서(330)에 인가되어 다중화 된 후, 버퍼(340)를 통해 멀티플렉서(330)로 출력되어 비트스트림 전송된다.The VOP formed by the real-time VOP forming unit 310 is input to the real-time shape encoder 327 to encode shape information. The output signal of the shape encoder 327 may be input to the motion estimator 321, the motion compensator 322, and the texture encoder 324 to be used for encoding motion estimation, motion compensation, and internal information of an object. . In addition, motion information estimated by the real-time motion estimation unit 321, object texture information encoded by the texture encoder 324, and shape information encoded by the shape encoder 327 may be applied to the multiplexer 330. After the multiplexing, the data is output to the multiplexer 330 through the buffer 340 and transmitted in a bitstream.

이러한 MPEG-4에 있어서, 상기 실시간 VOP 형성부(310)에서 전송된 각각의 VOP를 부호화하는 상기 형상 부호화부(320)에 적용되는 기술로는, N×N 블록(N=16, 8, 4)을 기반으로 하는 모양 정보를 부호화하는 MMR 모양 정보 부호화 기술(MMR 형상 부호화(shape coding) TECHNIQUE), 정점을 기반으로 하여 모양 정보를 부호화하는 정점 기반 모양 정보 부호화 기술(VERTEX-BASED 형상 부호화(shape coding) TECHNIQUE), 기초선 기반 모양 정보 부호화 기술(BASELINE-BASED 형상 부호화(shape coding) TECHNIQUE) 및 내용 기반 산술 부호화 기술(CONTEXT-BASED ARITHMETIC CODING) 등이 있으나, 이러한 종래 기술은 기본적으로 움직임 추정을 수행함에 있어서 반복적인 블록 비교 과정을 필수적으로 요구하고 있다.In the MPEG-4, a technique applied to the shape encoder 320 for encoding each VOP transmitted from the real-time VOP forming unit 310 includes N × N blocks (N = 16, 8, 4). MMR shape information coding technology (MMR shape coding TECHNIQUE) for encoding shape information based on the shape information, vertex-based shape information coding technology (VERTEX-BASED shape coding) TECHNIQUE), BASELINE-BASED shape coding TECHNIQUE, and CONTEXT-BASED ARITHMETIC CODING, but these conventional techniques basically provide motion estimation. In doing so, an iterative block comparison process is required.

본 발명에 따른 실시간 움직임 추정부(321)는 실시간 서비스인 경우, 형상 정보의 부호화시, 이러한 블록 비교 연상을 수행하지 않고 미리 설정된 MVP를 현재 MV로 설정하도록 구성된다.
The real-time motion estimation unit 321 according to the present invention is configured to set a predetermined MVP as a current MV without performing such block comparison association when encoding shape information in the case of a real-time service.

실시간 VOP 형성 방법 및 실시간 움직임 추정을 이용한 부호화 방법Real-time VOP formation method and coding method using real-time motion estimation

이하, 본 발명에 따른 실시간 VOP 형성 방법 및 실시간 VOP 부호화 방법을 도 5 및 도 6을 참조하여 보다 상세히 설명하기로 한다. 도 5에서는 실시간 VOP 형성 방법을 설명하고, 도 6에서는 실시간 VOP 부호화 방법을 설명하기로 한다.
Hereinafter, a method for forming a real time VOP and a method for encoding a real time VOP according to the present invention will be described in more detail with reference to FIGS. 5 and 6. A method of forming a real-time VOP is described in FIG. 5, and a method of encoding a real-time VOP is described in FIG. 6.

도 5는 본 발명의 바람직한 일 실시예에 따른 실시간 VOP 형성부의 동작 원리를 나타낸 도면이다.5 is a view showing the operation principle of the real-time VOP forming unit according to an embodiment of the present invention.

도 5에서는 압축 효율을 높이기 위하여, 기존의 최적화 과정이 포함된 다양한 VOP 형성 방법 중 일 실시예에 따른 IVOPF와 본 발명에 따른 실시간 VOP 형성 방법을 비교하며 설명하기로 한다.In FIG. 5, in order to increase compression efficiency, an IVOPF according to an embodiment and a real-time VOP forming method according to the present invention will be described with reference to various VOP forming methods including the existing optimization process.

IVOP는 non-transparent MB수가 적은 경우는 motion과 texture coding을 최소화하고 더 좋은 압축을 이끌 수 있다는 가정 하에, non-transparent MB수를 가장 적게 포함하는 VOP의 위치를 찾는 최적화 알고리즘을 포함하고 있다. 하기의 표 2를 참조하면, 다양한 VOP 위치를 설정 방식이 도시되어 있다.IVOP includes an optimization algorithm that finds the location of the VOP that contains the least number of non-transparent MBs on the assumption that a small number of non-transparent MBs can minimize motion and texture coding and lead to better compression. Referring to Table 2 below, various VOP location setting methods are illustrated.

Selection ModeSelection Mode SemanticsSemantics IVOPFIVOPF Find the minimum number of nontransparent MBs.Find the minimum number of nontransparent MBs. 8x8 Bnd (8x8)8x8 Bnd (8x8) Find the minimum number of 8x8 boundary blocksFind the minimum number of 8x8 boundary blocks WorstWorst Find the maximum number of nontransparent MBsFind the maximum number of nontransparent MBs Trans/Bnd Min. (TBM)Trans / Bnd Min. (TBM) Find the minimum number of transparent and boundary MBs.Find the minimum number of transparent and boundary MBs. All Min (AM)All Min (AM) Find the minimum number of MBs (or try to find the minimal size of VOP)Find the minimum number of MBs (or try to find the minimal size of VOP) Tightest Rectangle (TR)Tightest Rectangle (TR) Do not use IVOPF at all.Do not use IVOPF at all.

하기의 표 3을 참조하면, 상술한 VOP 형성 방법을 미리 설정된 소정의 테스트 이미지(도 7 참조)를 이용하여 실험한 결과값이 도시되어 있다.Referring to Table 3 below, the results of experimenting with the above-described VOP forming method using a predetermined test image (see FIG. 7) are shown.

Test sequenceTest sequence IVOPFIVOPF 8x88x8 WorstWorst TBMTBM AMAM TRTR akiyoakiyo 105.3105.3 142.9142.9 111.1111.1 144.4144.4 105.3105.3 93.793.7 coastguardcoastguard 169.3169.3 170.7170.7 169.9169.9 170.6170.6 169.3169.3 168.8168.8 containercontainer 122.5122.5 123.1123.1 122.7122.7 123.4123.4 122.7122.7 121.8121.8 dancer1dancer1 73.473.4 88.088.0 80.980.9 86.886.8 74.974.9 74.474.4 dancer2dancer2 120.4120.4 142.3142.3 127.9127.9 141.5141.5 121.8121.8 118.1118.1 foremanforeman 118.4118.4 127.9127.9 120.1120.1 128.0128.0 118.5118.5 116.3116.3 newsnews 141.2141.2 144.8144.8 141.9141.9 145.5145.5 137.2137.2 138.2138.2 saxophonesaxophone 200.6200.6 243.0243.0 207.8207.8 246.0246.0 203.9203.9 191.3191.3 singersinger 62.062.0 79.179.1 69.469.4 78.278.2 63.863.8 62.162.1 stefanstefan 65.565.5 77.177.1 72.772.7 76.976.9 67.167.1 67.367.3 breambream 120.8120.8 144.9144.9 126.7126.7 143.2143.2 120.7120.7 116.3116.3 childchild 103.6103.6 135.3135.3 114.4114.4 136.2136.2 106.8106.8 99.399.3 cyclamencyclamen 201.8201.8 211.1211.1 203.1203.1 206.1206.1 207.5207.5 207.3207.3 fishandlogofishandlogo 127.2127.2 155.3155.3 138.9138.9 155.8155.8 131.7131.7 125.7125.7 weatherweather 76.476.4 97.697.6 81.881.8 98.998.9 78.178.1 71.971.9 AverageAverage 120.6120.6 138.9138.9 126.0126.0 138.8138.8 121.9121.9 118.2118.2

상기 표 3을 참조하면, 본 발명에 따른 실시간 VOP 형성 방법(TR)을 제외하 고 IVOPF보다 더 나은 결과를 제공하는 형성 방법은 존재하지 아니한다. 복잡도나 실행속도 외에 얼마만큼 원본 이미지와 같은지에 대한 평가에서도 IVOPF와 TR에서 거의 차이가 나지 않음을 알 수 있다(도 9a 참조).Referring to Table 3 above, there is no formation method that provides better results than IVOPF except the real-time VOP formation method (TR) according to the present invention. In addition to the complexity and execution speed, it can be seen that the evaluation of how much the same as the original image shows little difference between the IVOPF and the TR (see FIG. 9A).

도 5를 참조하면, BAB(Binary Alpha Block)정하기 위해 하는 VOP Formation 원리가 도시되어 있다. 여기서, 이러한 VOP의 가로 방향 크기는 VOP폭으로 정의되고, 세로 방향의 크기는 VOP높이로 정의되며, 형성된 VOP는 좌측 상단을 그리드 시작점으로 하고, X축 및 Y축으로 각기 M개 및 N개의 화소를 가지는 M×N 매크로블록으로 구획하기로 한다. 예를 들면 X축 및 Y축으로 각기 16개의 화소를 가지는 16×16매크로블록으로 구획될 수 있으며, 상기 M 및 N은 후술하는 텍스쳐 부호화부(TEXTURE CODING)에서 서브 블록의 단위로 부호화를 수행할 수 있도록 하기 위하여 각기 짝수로 설정될 수 있다.Referring to FIG. 5, a VOP Formation principle for determining a binary alpha block (BAB) is illustrated. Here, the horizontal size of the VOP is defined by the width of the VOP, the vertical size is defined by the height of the VOP, the formed VOP is the grid starting point in the upper left corner, M and N pixels in the X axis and Y axis, respectively It will be partitioned into M × N macroblocks with. For example, the X and Y axes may be divided into 16 × 16 macroblocks each having 16 pixels, and M and N may be encoded in units of subblocks by a texture coding unit described later. Each can be set to an even number to make it possible.

먼저, 종래 방식에 따른 VOP 형성 방식을 설명하면 다음과 같다. 일반적으로 주어진 object(500)의 TR(510)은 MB의 배수와 정확히 일치하지는 않는다. 따라서 VOP 위치를 정하기 위하여 TR의 좌측상단(510)을 첫 번째 MB를 지정하는 일이다. First, the VOP formation method according to the conventional method will be described. In general, the TR 510 of a given object 500 does not exactly match a multiple of MB. Therefore, in order to determine the VOP position, the upper left 510 of the TR is designated as the first MB.

종래 방식 중 바람직한 일 실시예인 IVOPF에서는 TR의 좌측상단(510)을 control MB의 우측하단으로 여기고 control MB(520)를 정한다. Control MB(520)에서는 한 pixel 건너 pixel들이 VOPF와 MB의 수를 분석하기 위해 사용되어진다. 예를 들면, 전체 사용되어지는 pixel의 수는 64(8*8)이다. 한 pixel 건너 한 pixel이 사용되는 이유는 video format이 YUV(4:2:0)으로 색채(U and V)값이 4개의 휘도(Y)값에 포함되기 때문이다. Control block(520)내 64 pixel들의 각 pixel들은 첫 번 째 MB의 위치 후보 pixel이 된다. 그런 후 전체 MB수, all transparent MB수, non-transparent(all opaque와 partially transparent) MB수를 계산할 수 있다. 모든 경우(64 pixels)를 계산한 후, non-transparent MB수가 가장 적은 pixel을 기준점으로 하여 VOP가 형성되도록 구성된 종래 기술은 반복 처리 과정으로 인하여 많은 시간이 소요된다는 것이다. 가장 최악의 경우는 64가지를 다 계산하고 non-transparent MB수를 비교하도록 구성되며, 이러한 반복 처리 알고리즘은 실시간 부호화를 어렵게 만드는 원인 중의 하나이다. In IVOPF, a preferred embodiment of the conventional method, the upper left 510 of the TR is regarded as the lower right of the control MB, and the control MB 520 is determined. In Control MB 520, pixels across one pixel are used to analyze the number of VOPFs and MBs. For example, the total number of pixels used is 64 (8 * 8). The reason why one pixel is used across one pixel is that the video format is YUV (4: 2: 0), and the color (U and V) values are included in four luminance values (Y). Each pixel of the 64 pixels in the control block 520 becomes a position candidate pixel of the first MB. You can then calculate the total number of MBs, all transparent MBs, and the number of non-transparent (all opaque and partially transparent) MBs. After all cases (64 pixels) have been calculated, the conventional technique in which the VOP is formed based on the pixel having the smallest number of non-transparent MBs takes a long time due to the iterative processing. The worst case is to calculate all 64 and compare the number of non-transparent MBs, and this iterative algorithm is one of the factors that makes real-time coding difficult.

본 발명에 따른 실시간 VOP 형성 방법에 의하면, 반복 처리 알고리즘이 포함되는 기존의 VOP 형성 알고리즘 대신, TR(510)만을 이용하여 VOP를 형성하도록 구성된다. 물론, 본 발명에 의하더라도, 실시간 서비스가 아닌 경우에는 다양한 지능형 VOP 형성 알고리즘을 사용하여 VOP를 형성할 수 있음을 물론이다. 또한 실시간 서비스 여부에 관계없이 본 발명에 따른 실시간 VOP 형성 방법을 사용할 수 있음은 물론이다. 하기의 도 9a를 참조하면, 종래 방식에 따른 VOP 형성 방법에 대한 화질과 압축량을 분석함으로 검증함으로써, 형상 부호화에 있어서 종래의 VOP 형성 과정이 포함되지 아니하더라도 성능에 큰 차이가 없음을 알 수 있다.
According to the real-time VOP formation method according to the present invention, instead of the existing VOP formation algorithm including the iterative processing algorithm, it is configured to form the VOP using only the TR (510). Of course, according to the present invention, in the case of not the real-time service, it is a matter of course that the VOP can be formed using various intelligent VOP formation algorithms. In addition, the real-time VOP forming method according to the present invention can be used regardless of the real-time service. Referring to FIG. 9A, by verifying by analyzing the image quality and the amount of compression for the conventional VOP forming method, it can be seen that there is no significant difference in performance even if the conventional VOP forming process is not included in the shape coding. have.

도 6은 본 발명의 바람직한 일 실시예에 따른 움직임 추정 방법을 포함한 부호화 방법을 나타낸 도면이다.6 is a diagram illustrating an encoding method including a motion estimation method according to an exemplary embodiment of the present invention.

Binary 형상 부호화(shape coding) MPEG-4에서 MEMC는 형상(shape) 및 텍스쳐(texture)를 위해 사용되며 현재 BAB과 참조 되는 BAB과의 유사성을 계산하기 위 해 pixel 값을 비교하여 더하는 방식을 사용한다. 형상(Shape) block은 binary 값을 비교하기 때문에 형상(Shape)을 위한 MEMC는 텍스쳐(texture)용 MEMC보다 덜 복잡하기는 하나, block 비교 연산이라는 공통된 부분이 있으며, 표준에 의한 Reference software에서는 block 비교 연산 부분에 나선형 검색을 사용하도록 규정되어 있다. 예상된 MVP의 중앙 점으로부터 다음에 나오는 MB 후보는 이러한 나선형 검색을 통해 얻어진다. 이 방법은 중앙 점으로부터 가장 가까운 pixel을 설정할 수 는 있으나, +/- 16 pixel을 거쳐 반복처리를 하게 만드는 알고리즘으로 인하여, 부호화기(encoder)의 복잡도를 증가시키는 큰 원인중의 하나이다. 이러한 MEMC의 복잡도를 줄이기 위한 여러 가지 기술들이 제안되고 있으나, 대부분 텍스쳐(texture)용 MEMC에 집중되어 있으며, 형상(shape)용 MEMC에 대하여는 텍스쳐(texture)용 MEMC을 응용하여 사용하였다.Binary Shape Coding In MPEG-4, MEMC is used for shape and texture and compares and adds pixel values to calculate the similarity between the current BAB and the referenced BAB. . Because shape blocks compare binary values, MEMCs for shapes are less complex than MEMCs for textures, but there is a common part of block comparison operations. It is specified to use spiral search for the calculation part. The MB candidates that follow from the center point of the expected MVP are obtained through this spiral search. This method can set the pixel that is closest to the center point, but due to the algorithm that iterates through +/- 16 pixels, it is one of the major causes of increasing the complexity of the encoder. Various techniques for reducing the complexity of the MEMC have been proposed, but most of them are concentrated in the texture MEMC, and for the shape MEMC, the texture MEMC is used as an application.

도 6을 참조하면, 해당 BAB에 대한 MVP의 후보의 예(600)가 도시되어 있다. 실시예에 의할 때, MVP의 후보(600)는 MV for shape(610) 및 MV for texture(620)로 나누어진다. 종래 방식에 의할 때, MVP를 정하기 위해 shape 뿐만 아니라 texture의 이전 MV를 활용하며, Shape-only coding시에는 MV for shape만 사용한다. 현재 BAB의 주위 MV들이 도시되어 있다. MVP는 0이 아니면서 처음 접한 MV로 정해지며, 유효한 MV가 없을 경우는 0으로 정해진다. 일단 MVP가 결정되면, 현재 BAB과 참조 되는 BAB과의 유사성을 계산하기 위해 pixel 값을 비교하여 나온 결과값이 특정(Threshold)값 이하 이면 정해진 MVP를 해당 BAB의 MV로 정한다. 반면 특정값 이상의 차이가 난다면, MVP 중심점 -/+ 16 pixel 범위내의 참조되는 MB 와 현 재 MB의 오차를 계산함으로 움직임 추정(motion estimation)을 시작한다. 현재의 검색 방법은 texture motion estimation 하는 것과 동일하게 shape에서도 나선형 검색을 한다. 중요한 점은 shape motion estimation이 texture 만큼 복잡하다는 것이다. 그래서 texture MEMC 만의 최적화는 MEMC의 복잡도 문제를 단지 반만 해결하는 것이다.Referring to FIG. 6, an example 600 of a candidate of an MVP for a corresponding BAB is shown. According to an embodiment, the candidate 600 of MVP is divided into MV for shape 610 and MV for texture 620. According to the conventional method, the previous MV of the texture as well as the shape is used to determine the MVP, and only MV for shape is used for shape-only coding. Currently the surrounding MVs of BAB are shown. MVP is set to the first MV that is not 0, and 0 if there is no valid MV. Once the MVP has been determined, if the result of comparing pixel values to calculate the similarity between the current BAB and the referenced BAB is less than or equal to the threshold value, the determined MVP is set as the MV of the corresponding BAB. On the other hand, if there is a difference over a certain value, motion estimation is started by calculating the error between the referenced MB within the MVP center point-/ + 16 pixel range and the current MB. The current search method performs a spiral search on the shape as well as texture motion estimation. The important point is that shape motion estimation is as complex as texture. So texture MEMC only optimization solves the complexity problem of MEMC only half way.

본 발명은 형상(shape)용 MEMC는 텍스쳐(texture)용 MEMC와 달리, 최적화알고리즘에 포함되지 않으므로, 어떠한 종류의 block 비교도 수행하지 않도록 구성된다. 예를 들면, MVP가 현재 BAB의 MV로 설정하고 부호화하도록 구성된다. 따라서 본 발명에 따른 부호화기(encoder)는 7 coding mode중 가장 적합한 것을 설정할 수 있다. 나선형 검색을 생략함으로 인해 형상 부호화(shape coding)에서의 처리량을 줄어 즉, 움직임 추정/움직임 보상 단계에서 실시간 부호화가 아닌 서비스인 경우 종래 방식에 따른 움직임 추정 방식을 사용하고, 실시간 서비스인 경우 블록 비교를 하지 않는 움직임 추정 방식을 사용하도록 구성된다. According to the present invention, unlike the MEMC for texture, the MEMC for shape is not included in the optimization algorithm, and thus, no block comparison is performed. For example, the MVP is configured to set and encode the MV of the current BAB. Therefore, the encoder according to the present invention can set the most suitable among 7 coding modes. By eliminating the helical search, the throughput in shape coding is reduced, that is, the motion estimation method according to the conventional method is used in the case of a service other than the real time coding in the motion estimation / motion compensation step, and the block comparison in the real time service. It is configured to use a motion estimation method that does not.

본 발명에 따르면 형상 부호화시에 움직임 추정 처리를 수행하지 않아도 화질의 차이가 없으므로, 형상 부호화시에는 움직임 추정의 과정을 수행하지 않도록 구성될 수 있다. 따라서 본 발명에 의하면, 형상 부호화에 있어서는 현재 BAB의 MV를 MVP로 설정하도록 구성된다.According to the present invention, since there is no difference in image quality even when the motion estimation process is not performed at the time of shape coding, the motion estimation process may not be performed at the time of shape coding. Therefore, according to the present invention, in shape coding, the MV of the current BAB is set to MVP.

하기의 도 9b를 참조하면, 종래 방식에 따른 VOP 부호화 방법에 대한 화질과 압축량을 분석함으로 검증함으로써, 형상 부호화에 있어서 움직임 추정 과정이 포함되지 아니하더라도 성능에 큰 차이가 없음을 알 수 있다.
Referring to FIG. 9B, by verifying by analyzing the image quality and the amount of compression of the conventional VOP coding method, it can be seen that there is no significant difference in performance even if the motion estimation process is not included in the shape coding.

성능 및 속도 비교Performance and speed comparison

이하, 본 발명에 따른 실시간 객체 기반 서비스를 위한 형상 부호화 방법과 기존의 형상 부호화 방법의 성능 및 효과를 비교하여 설명하기로 한다. 도 7을 참조하면, 본 발명에서 사용한 테스트 이미지 중 일 실시예가 도시되어 있다. 도 8에서는 종래 방식, 실시간 VOP 형성 방식, 실시간 움직임 추정 방식, 두 방식을 결합하여 사용한 경우를 종합적으로 비교한 결과를 설명하기로 한다. 그리고 도 9a 내지 도 9c 에서는 각 방식에서의 성능을 기존 방식과 비교하여 설명하기로 한다.
Hereinafter, the performance and effects of the shape coding method for the real-time object-based service according to the present invention and the existing shape coding method will be described. Referring to Figure 7, one embodiment of the test image used in the present invention is shown. In FIG. 8, a result of comprehensively comparing a conventional method, a real-time VOP forming method, a real-time motion estimation method, and a combination of the two methods will be described. 9A to 9C, the performance of each scheme will be described in comparison with the conventional scheme.

도 7은 본 발명에서 사용한 테스트 이미지 중 일 실시예를 도시한 도면이다. 도 7을 참조하면, 본 발명에서 사용한 테스트 이미지 중 일 실시예가 도시되어 있다. 도 7에서는 대표적인 테스트 이미지만이 도시되어 있으나, 하기의 표 4와 표기된 복수개의 테스트 이미지를 사용하여 실험하였다.
7 is a view showing an embodiment of a test image used in the present invention. Referring to Figure 7, one embodiment of the test image used in the present invention is shown. In FIG. 7, only a representative test image is illustrated, but experiments were performed using a plurality of test images indicated in Table 4 below.

　 Test sequenceTest sequence No. of framesNo. of frames FormatFormat akiyoakiyo 300300 CIFCIF coastguardcoastguard 300300 CIFCIF containercontainer 300300 CIFCIF dancer1dancer1 250250 CIFCIF dancer2dancer2 250250 CIFCIF foremanforeman 300300 CIFCIF newsnews 300300 CIFCIF saxophonesaxophone 250250 CIFCIF singersinger 250250 CIFCIF stefanstefan 300300 CIFCIF breambream 300300 SIFSIF childchild 300300 SIFSIF cyclamencyclamen 300300 SIFSIF fishandlogofishandlogo 300300 SIFSIF weatherweather 300300 SIFSIF

도 8은 종래 방식, 실시간 VOP 형성 방식, 실시간 움직임 추정 방식, 두 방식을 결합하여 사용한 경우를 종합적으로 비교한 결과를 나타낸 그래프이다.FIG. 8 is a graph showing a result of comprehensive comparison between a conventional method, a real-time VOP formation method, a real-time motion estimation method, and a combination of the two methods.

이하, 도 8을 참조하여, 본 발명은 부호화기(encoder)의 복잡성을 줄이고, 부호화 시간(encoding time)을 감소함으로써, 복호화기와의 시간 차이를 극복하면서 실시간 객체 기반 서비스를 성능 저하없이 제공할 수 있음을 설명하기로 한다. Hereinafter, with reference to FIG. 8, the present invention can provide a real-time object-based service without compromising performance by reducing the complexity of an encoder and reducing an encoding time, thereby overcoming a time difference with a decoder. Will be described.

도 8은 상기 표 7에서 도시한 테스트 이미지를 이용하여 실험한 부호화 시간을 나타낸 하기 표 5를 결과값이 그래프로 도시되어 있다. 여기서, 숫자는 부호화 실행 시간 결과의 결과를 나타내며, Ref SW(Reference Software)(800)는 표준에서 정의하는 부호화 방식을 지칭하고, TR(Tightest Rectangular)(810)는 본 발명에 따른 TRB를 이용하여 VOP를 형성한 경우의 부호화 방식을 지칭한다. 그리고 NMS(No Motion Search)(820)는 본 발명에 따른 실시간 움직임 추청 방식을 이용한 부호화 방식을 지칭한다. 그리고 PEC(Proposed Encoding Conditions)(830)은 실시간 VOP 형성 방식 및 실시간 움직임 추정 방식을 결합하여 사용한 실시간 형상 부호화 방 식을 지칭한다. 각 테스트 이미지에 대한 부호화 속도가 도시되어 있으며, 평균값(840)을 비교하면, 실시간 VOP 형성 방식, 실시간 움직임 추정 방식을 결합한 실시간 부호화 방식이 가장 시간이 적게 소요됨을 알 수 있다.
FIG. 8 is a graph showing the results of Table 5, which shows the encoding time experimented using the test image shown in Table 7. Here, the number represents the result of the encoding execution time result, Ref SW (Reference Software) 800 refers to the encoding scheme defined in the standard, and the T (Tightest Rectangular) 810 uses the TRB according to the present invention. Refers to an encoding method when a VOP is formed. And NMS (No Motion Search) 820 refers to the coding method using the real-time motion tracking method according to the present invention. Proposed Encoding Conditions (PEC) 830 refers to a real-time shape encoding method using a combination of a real-time VOP formation method and a real-time motion estimation method. The coding speeds for the respective test images are shown, and when the average value 840 is compared, it can be seen that the real time coding method combining the real time VOP formation method and the real time motion estimation method requires the least time.

Test sequenceTest sequence Ref. SWRef. SW TRTR NMSNMS PECPEC akiyoakiyo 105.3105.3 93.793.7 98.198.1 88.788.7 coastguardcoastguard 169.3169.3 168.8168.8 161.0161.0 161.6161.6 containercontainer 122.5122.5 121.8121.8 118.8118.8 119.0119.0 dancer1dancer1 73.473.4 74.474.4 62.462.4 62.162.1 dancer2dancer2 120.4120.4 118.1118.1 101.9101.9 99.099.0 foremanforeman 118.4118.4 116.3116.3 107.6107.6 105.9105.9 newsnews 141.2141.2 138.2138.2 137.1137.1 134.6134.6 saxophonesaxophone 200.6200.6 191.3191.3 177.5177.5 167.4167.4 singersinger 62.062.0 62.162.1 54.554.5 53.753.7 stefanstefan 65.565.5 67.367.3 55.455.4 55.955.9 breambream 120.8120.8 116.3116.3 104.8104.8 101.2101.2 childchild 103.6103.6 99.399.3 86.886.8 81.081.0 cyclamencyclamen 201.8201.8 207.3207.3 178.4178.4 179.6179.6 fishandlogofishandlogo 127.2127.2 125.7125.7 97.997.9 94.494.4 weatherweather 76.476.4 71.971.9 69.469.4 64.564.5 AverageAverage 120.6120.6 118.2118.2 107.4107.4 104.6104.6

도 8 및 표 5를 참조하면, MS(Motion Search) 하지 않는 실시간 움직임 추정 방식(820)을 사용하는 부호화기(encoder)의 경우 RF(reference software)(800)에 비해 10.9% 실행시간이 단축되었다. 특히, Fishandlogo 이미지의 경우 최대 23%의 속도가 빨라짐을 알 수 있다.Referring to FIG. 8 and Table 5, in the case of an encoder using a real-time motion estimation method 820 without a motion search (MS), execution time is shortened by 10.9% compared to the reference software (RF) 800. In particular, fishandlogo images can be up to 23% faster.

본 발명이 제시한 TR(Tightest Rectangle)을 이용한 실시간 VOP 형성 방법(810)과 실시간 움직임 추정 방법(no MEMC for shape)(820)을 동시에 적용한 실시간 부호화 방식(830)은 평균 부호화 속도(840)가 종래의 속도에 비해 13.2%가 향상되었다. 가장 두드려진 효과를 보인 것은 fishandlogo 이미지로써 25.7%의 속도가 줄었다. According to the present invention, a real-time encoding method 830 that simultaneously applies a real-time VOP formation method 810 and a real-time motion estimation method (no MEMC for shape) 820 using a Tightest Rectangle (TR) has an average coding rate 840. 13.2% improvement over the conventional speed. The most striking effect was the fishandlogo image, which was reduced by 25.7%.

그리고 하기의 표 6을 참조하면 Shape-only coding 의 경우의 실험 결과가 도시되어 있다.
And the results of the experiment in the case of Shape-only coding with reference to Table 6 below.

Test sequenceTest sequence Ref. SWRef. SW PECPEC Speedup (%)Speedup (%) akiyoakiyo 28.628.6 9.69.6 66.366.3 coastguardcoastguard 19.319.3 11.611.6 40.040.0 containercontainer 13.013.0 9.59.5 27.327.3 dancer1dancer1 24.524.5 9.99.9 59.759.7 dancer2dancer2 38.238.2 13.513.5 64.764.7 foremanforeman 28.228.2 13.413.4 52.552.5 newsnews 15.915.9 10.310.3 34.834.8 saxophonesaxophone 53.553.5 16.216.2 69.769.7 singersinger 22.822.8 9.89.8 56.956.9 stefanstefan 22.222.2 9.19.1 59.159.1 breambream 33.033.0 11.611.6 64.864.8 childchild 39.539.5 12.912.9 67.367.3 cyclamencyclamen 44.744.7 20.220.2 54.754.7 fishandlogofishandlogo 54.254.2 16.616.6 69.569.5 weatherweather 22.722.7 8.48.4 62.862.8 AverageAverage 30.730.7 12.212.2 56.756.7

Shape-only coding 이란 오로지 shape 정보만 부호화와 복호화됨을 의미한다. 상기 표 6을 참조하면, 형상 부호화(shape coding) 부분에서 얼마만큼 복잡도가 감소되었는지는 알 수 있다. 실험 결과, 본 발명의 평균 실행속도가 종래의 평균 속도의 절반이하를 차지함을 알 수 있다.Shape-only coding means that only shape information is encoded and decoded. Referring to Table 6, it can be seen how much the complexity is reduced in the shape coding portion. As a result, it can be seen that the average execution speed of the present invention accounts for less than half of the conventional average speed.

그리고 하기 표 7에는 본 발명에 따른 실시간 부호화 방식의 명령 처리수(Comparison of the Number of Instructions)가 정리되어 있다.
Table 7 below summarizes the Comparison of the Number of Instructions according to the present invention.

Test sequenceTest sequence Ref. SWRef. SW PECPEC Reduction (%)Reduction (%) akiyoakiyo 1875318753 1297612976 30.830.8 fishandlogofishandlogo 1921519215 1570115701 18.318.3

표 7을 참조하면, 전체 실행되는 명령(instruction)을 상당히 감소함을 알 수 있다.
Referring to Table 7, it can be seen that the overall instruction executed is significantly reduced.

도 9a 내지 도 9c는 본 발명에 따른 실시간 VOP 형성 방법, 실시간 움직임 추정 방법, 상기 두 방식을 동시에 적용한 실시간 부호화 방식을 종래 방식과 비교한 성능 비교 그래프가 도시되어 있다.9A to 9C illustrate performance comparison graphs comparing a real-time VOP forming method, a real-time motion estimation method, and a real-time encoding method in which the two methods are applied simultaneously with the conventional methods.

이하, 도 9a에서는 종래 방식과 실시간 VOP 형성방법을 비교하고, 도 9b에서는 종래 방식과 실시간 움직임 추정 방식을 비교하며, 도 9c에서는 종래 방식과 실시간 VOP형성, 실시간 움직임 추정 방식을 결합한 실시간 형상 부호화 방식을 비교하여 설명하기로 한다. 상기 그래프는 도 7의 테스트 이미지 중 (Akiyo sequence)을 기준으로 실험한 결과이다.Hereinafter, FIG. 9A compares the conventional method with the real-time VOP forming method, FIG. 9B compares the conventional method with the real-time motion estimation method, and FIG. 9C shows the real-time shape coding method combining the conventional method with the real-time VOP formation and real-time motion estimation method. It will be described by comparing. The graph is a result of the experiment based on the (Akiyo sequence) of the test image of FIG.

도 9a 내지 도 9c에서 X축은 압축률을 나타내고, Y축은 PSNR로 측정한 잡음비가 도시되어 있다. 여기서, PSNR(Peak Signal to Noise Ratio)이란 원본 이미지와 부호화되어진 이미지와의 차이를 의미한다. 9A to 9C, the X axis represents the compression ratio and the Y axis represents the noise ratio measured by PSNR. Here, PSNR (Peak Signal to Noise Ratio) means the difference between the original image and the encoded image.

도 9a 내지 도 9c를 참조하면, 본 발명에 따른 실시간 부호화 방식에 따른 부호화기에서 의한 PSNR(920, 940, 960)은 기존 방식의 부호화기(encoder)의 PSNR(910, 930, 950)과 거의 차이가 없음을 보여준다. 그러므로 본 발명은 전체 압축률을 떨어뜨리지 않고 화질 열화도 발생하지 않으면서 부호화기(encoder)의 복잡성을 줄일 수 있으며, 사람이 시각적으로 인식할 수 있는 화질상의 변화가 없음을 알 수 있다.9A to 9C, the PSNRs 920, 940, and 960 of the encoder according to the real-time coding scheme according to the present invention are almost different from the PSNRs 910, 930, and 950 of the encoder. Shows none. Therefore, the present invention can reduce the complexity of the encoder without degrading the overall compression rate and the image quality deterioration, and it can be seen that there is no change in image quality that can be visually recognized by a human.

상술한 바와 같이 본 발명은 움직임(Motion), 형상(shape), 텍스쳐(texture)의 압축효율을 보전하면서, 절반 이상의 형상 부호화(shape coding)의 복잡성을 줄여줄 수 있는 방법을 제시할 수 있다. 본 발명의 일 실시예에 의할 때, 본 발명은 형상 부호화(shape coding)에서의 복잡함을 줄이기 위해 IVOPF(Intelligent VOP Formation)이 아닌 TRB(Tightest Rectangular Boundary) 사용과 MEMC for shape의 처리 과정 중 나선형 검색을 생략한 실시간 형상 부호화 방식 및 장치를 제공함으로써, 형상 부호화(shape coding)의 복잡함을 56% 줄이는 효과가 나타남을 알 수 있다.As described above, the present invention can provide a method that can reduce the complexity of shape coding more than half while preserving the compression efficiency of motion, shape, and texture. According to an embodiment of the present invention, the present invention is a spiral during the processing of the use of the TBT (Tightest Rectangular Boundary) and MEMC for shape rather than IVOPF (Intelligent VOP Formation) to reduce the complexity in shape coding By providing a real-time shape coding method and apparatus in which the search is omitted, it is understood that the complexity of shape coding is reduced by 56%.

본 발명의 기술 사상이 상술한 실시예에 따라 구체적으로 기술되었으나, 상기한 실시예는 그 설명을 위한 것이며 그 제한을 위한 것이 아니며, 본 발명의 기술 분야의 통상의 전문가라면 본 발명의 기술 사상의 범위 내에서 다양한 실시예가 가능함을 이해할 수 있을 것이다.
Although the technical idea of the present invention has been described in detail according to the above-described embodiment, the above-described embodiment is for the purpose of description and not of limitation, and a person of ordinary skill in the art of the present invention It will be understood that various embodiments are possible within the scope.

본 발명은 상기의 제반 문제점을 해결하기 위하여 안출한 것으로서, MPEG-4 에서 사용되는 다수의 부호화 툴 중에 형상 정보 부호화(SHAPE INFORMATION ENCODING)의 복잡도를 감소시켜 실시간으로 형상 정보의 부호화가 가능한 부호화 방법 및 장치를 제공할 수 있는 효과가 있다.SUMMARY OF THE INVENTION The present invention has been made to solve the above problems, and is an encoding method capable of encoding shape information in real time by reducing the complexity of SHAPE INFORMATION ENCODING among a plurality of encoding tools used in MPEG-4, and There is an effect that can provide a device.

또한, 본 발명은 객체 기반 부호화에 있어서, 가장 큰 해결 과제였던 MPEG-4 OVC의 MEMC 문제를 근본적으로 해결할 수 있다. MPEG-4 OVC의 형상 부호화(shape coding)에서는 ME가 부호화 실행시간의 75% 이상을 차지하고 있다. 그러나 기존의 연구 방식은 이러한 움직임 추정을 필수적인 것으로 인식하고 이의 효율성 향상에 집중되어졌다. 본 발명은 형상 부호화는 텍스쳐 부호화와는 달리, 움직임 추정의 필요성에 의문을 제기하면서, 이를 생략한 부호화 방법을 제공함으로써, 객체기반서비스의 부호화 실행 속도를 대폭 감소시킬 수 있는 효과가 있다.In addition, the present invention can fundamentally solve the MEMC problem of MPEG-4 OVC, which is the biggest problem in object-based encoding. In shape coding of MPEG-4 OVC, ME occupies 75% or more of coding execution time. However, the existing research method recognizes this movement estimation as essential and concentrates on improving its efficiency. According to the present invention, unlike the texture encoding, the shape encoding raises the question of the necessity of motion estimation and provides an encoding method that omits it, thereby significantly reducing the encoding execution speed of the object-based service.

또한, 본 발명은 VOP 형성시에, 압축 효율을 높이기 위한 알고리즘의 사용없이도, 타이티스트 사각형(TR) 부호화 방법을 사용하여 성능의 저하 없이 실시간 객체 서비스를 제공할 수 있는 효과도 있다.In addition, the present invention has the effect of providing a real-time object service without deterioration in performance by using a titanic square (TR) coding method without using an algorithm for increasing compression efficiency when forming a VOP.

본 발명은 디지털 방송 스튜디오, 모바일용 hand-set과 인터넷 PC 등과 같은 네트워크 환경에서 실시간 객체 기반 서비스를 상용화할 수 있다.The present invention can commercialize real-time object-based services in network environments such as digital broadcasting studios, mobile handsets and Internet PCs.

상기에서는 본 발명의 바람직한 실시예를 참조하여 설명하였지만, 해당 기술 분야에서 통상의 지식을 가진 자라면 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.Although the above has been described with reference to a preferred embodiment of the present invention, those skilled in the art to which the present invention pertains without departing from the spirit and scope of the present invention as set forth in the claims below It will be appreciated that modifications and variations can be made.

Claims

In the shape encoding method for providing a real-time object-based service,

Determining whether the encoding is a real-time service;

In the case of encoding for a real-time service, setting a minimum rectangular boundary (TRB) including an image object of an object;

Partitioning the VOP formed according to the shape of the object into a macroblock having M × N pixels using the minimum rectangular boundary, where M and N are natural numbers; And

Encoding the data in units of macroblocks without moving the grid start point of the partitioned macroblocks in the partitioning step and searching for a position of reducing the amount of video information of an object;

Shape encoding method of an object-based service comprising a.

delete

The method of claim 1,

In setting a candidate group for motion estimation, without performing an algorithm including any block comparison, setting the MV of the current binary alpha block (BAB) as MVP

Shape encoding method of the object-based service, characterized in that it further comprises.

In the shape encoding method for providing a real-time object-based service,

Determining whether the encoding is a real-time service;

In the case of encoding for a real-time service, forming a VOP for shape encoding;

Dividing the VOP formed according to the shape of the object into a macroblock having M × N pixels, where M and N are natural numbers; And

In setting a candidate group for motion estimation in units of macroblocks, without performing an algorithm including any block comparison, setting an MV of a current binary alpha block (BAB) as an MVP

Shape encoding method of an object-based service comprising a.

delete

In the shape encoding apparatus for providing a real-time object-based service,

In the case of encoding for a real-time service, a Tightest Rectangular Boundary (TRB) including an image object of an object is set, and the VOP formed according to the shape of the object by using the minimum rectangle boundary is a pixel of M × N. A VOP forming unit for dividing into macroblocks having a symbol and encoding the macroblock unit without performing a process of searching for a reduced information position of an image of an object while moving a grid start point of the partitioned macroblocks. , N is a natural number);

A shape encoder for encoding the VOP formed by the VOP forming unit: and

Multiplexer for multiplexing the coded shape signal and transmitting it in a bitstream

Shape encoding apparatus of the object-based service comprising a.

delete

The method of claim 6,

In setting a candidate group for motion estimation, a real-time motion estimation unit that sets an MV of a current binary alpha block (BAB) as an MVP without performing an algorithm including any block comparison.

Shape encoding apparatus of the object-based service further comprises.

In the shape encoding apparatus for providing a real-time object-based service,

In the case of encoding for a real-time service, a VOP forming unit for forming a VOP for shape coding, and partitioning the VOP formed according to the shape of the object into a macroblock having M × N pixels;

A shape encoder for encoding the VOP formed by the VOP forming unit:

A multiplexer for multiplexing the encoded shape signals and transmitting them in a bitstream; and

In the case of encoding for a real-time service, in setting a candidate group for motion estimation in units of macroblocks, MVP does not perform an algorithm including any block comparison, but MVP wins the MV of a current binary alpha block (BAB). Motion estimator for setting to

Shape encoding apparatus of the object-based service comprising a.

delete