KR19990065274A

KR19990065274A - Shape Information Coding Method for Progressive Scan

Info

Publication number: KR19990065274A
Application number: KR1019980000491A
Authority: KR
Inventors: 천승문; 신동규; 문주희
Original assignee: 김영환; 현대전자산업 주식회사
Priority date: 1998-01-10
Filing date: 1998-01-10
Publication date: 1999-08-05

Abstract

본 발명은 동영상의 격행주사를 위한 부호화시 모양정보의 움직임 정도를 검출하여 그 움직임량에 따라 프레임 단위 또는 필드단위로 모양정보를 부호화하는 격행주사를 위한 모양정보 부호화 방법에 관한 것으로서, 이러한 본 발명은 격행주사를 위한 모양정보 부호화에 있어서, 현재(Current) 프레임의 홀수 필드의 움직임 예측방향을 결정할 때 이전(Reference) 프레임의 홀수 필드에서 움직임 추정을 한 SAD(Sum Absolute Difference)의 최소값을 움직임 예측방향을 결정하고, 현재(Current) 프레임의 짝수 필드의 움직임 예측방향을 결정할 때 이전(Reference) 프레임의 짝수 필드에서 움직임 추정을 한 SAD(Sum Absolute Difference)의 최소값으로 움직임 추정을 수행하여 모양정보를 부호화하는 것이다.The present invention relates to a shape information encoding method for a parallel scan which detects a motion degree of shape information during encoding for a progressive scan of a video and encodes the shape information in a frame unit or a field unit according to the amount of motion. According to the present invention, in shape information encoding for a progressive scan, when determining a motion prediction direction of an odd field of a current frame, a minimum value of sum absorptive difference (SAD) estimated from an odd field of a previous frame is estimated. Determining the direction of motion prediction and determining the motion prediction direction of the even field of the current frame by performing the motion estimation to the minimum value of Sum Absolute Difference (SAD) that estimated the motion in the even field of the previous frame. Encoding shape information.

Description

Shape Information Coding Method for Progressive Scan

본 발명은 격행주사를 위한 모양정보 부호화에 관한 것으로, 특히 동영상의 격행주사를 위한 부호화시 모양정보의 움직임 정도를 검출하여 그 움직임량에 따라 프레임 단위 또는 필드단위로 모양정보를 부호화하는 격행주사를 위한 모양정보 부호화 방법에 관한 것이다.The present invention relates to shape information encoding for a progressive scan, and more particularly, to detect the degree of motion of shape information in encoding for a progressive scan of a video and to encode shape information in units of frames or fields according to the amount of motion. A shape information encoding method for row scanning.

일반적으로, 영상 및 음향 부호화 기술 및 시스템 구성에 관한 국제 표준안 (MPEG-1, MPEG-2)을 개발하고 의결한 MPEG 그룹이 국제 표준으로 채택할 예정의 차세대 영상 및 음향 부호화 기술 및 시스템 구성에 관한 국제 표준안 (MPEG-4)을 연구, 개발중에 있다. MPEG-4의 개발은 기존의 알려진 표준안으로는 지원할 수 없는 차세대 영상 및 음향 응용물들을 지원할 필요성에서 출발했다.In general, the MPEG-4 group has developed and decided on international standards on video and audio coding technology and system construction, and the next generation of video and audio coding technology and system construction to be adopted as an international standard. International Standard (MPEG-4) is being researched and developed. The development of MPEG-4 began with the need to support the next generation of video and audio applications that cannot be supported by existing known standards.

MPEG-4는 영상 및 음향 데이타의 통신과 접속, 그리고 조작을 위한 새로운 방법들(예를 들자면, 특성이 다른 네트워크를 통한 물체 중심 대화형 기능 및 접속 등)을 제공한다.MPEG-4 provides new methods for communicating and accessing and manipulating video and audio data, such as object-oriented interactive functionality and access over networks of differing characteristics.

또한 에러가 쉽게 발생되는 통신 환경과 저전송율의 통신환경에서도 유용하게 동작하는 특성을 제공한다. 더욱이 컴퓨터 그래픽 기술을 통합하여 자연영상 및 음향과 인공영상 및 음향들을 함께 부호화하고 조작할 수 있는 기능들을 제공한다.In addition, it provides a characteristic that operates usefully in a communication environment where errors are easily generated and a low transmission rate communication environment. In addition, it integrates computer graphics technology to provide the ability to encode and manipulate natural and artificial images and sounds together.

요약컨대, MPEG-4는 여러 응용분야에서 요구되고 예상되는 모든 기능들을 지원해야 한다. 따라서, 멀티미디어 정보의 급팽창과 기술 향상에 의해 새롭게 개발됐거나 개발될 저가, 고기능의 모든 가능한 응용 분야들에 요구되는 기능들을 지원할 수 있도록 확장 가능하고 개방적인 구조를 가지게 된다. 그 중에는 전송 및 저장 기능과 비용 절감에 필요한 부호화 효과의 향상 기능(Improved Compression Efficiency)이 있다.In summary, MPEG-4 must support all the features required and expected in many applications. As a result, the expansion and openness of multimedia information is possible to support the functions required for all possible low-cost, high-performance applications that are newly developed or will be developed. Among them are the transmission and storage functions and the improved compression efficiency required for cost reduction.

현재 MPEG-4의 기술이 응용될 것으로 기대되는 응용물로는 인터넷 멀티미디어(IMM: Internet Multimedia), 대화형 비디오 게임(IVG: Interactive Video Games), 영상회의 및 영상전화 등의 상호 통신(IPC: Interpersonal Communications), 쌍방향 저장매체(ISM: Interactive Storage Media), 멀티미디어 전자우편(MMM: Multimedia Mailing), 무선 멀티미디어 (WMM: Wireless Multimedia), ATM망 등을 이용한 네트웍 데이타베이스서비스(NDB: Networked Database Service), 원격 응급 시스템(RES: Remote Emergency Systems), 원격 영상 감시(RVS: Remote Video Surveillance) 등이 있다.Applications currently expected to apply MPEG-4 technology include interpersonal communication (IPC) such as Internet multimedia (IMM), interactive video games (IVG), video conferencing and video telephony. Networked Database Services (NDB) using Communications, Interactive Storage Media (ISM), Multimedia Mailing (MMM), Wireless Multimedia (WMM), ATM networks, Remote Emergency Systems (RES) and Remote Video Surveillance (RVS).

기존의 응용물이나 앞으로 기대되는 응용물들을 지원하기 위해서는 유저들이 영상 내의 원하는 객체만을 통신할 수 있고, 찾고 읽을 수 있도록 접근할 수 있으며, 자르고 붙일 수 있도록 편집할 수 있는 영상 부호화기술이 필요하다.In order to support existing applications or applications expected in the future, an image encoding technology that can be edited so that users can communicate only with the desired objects in the image, can be found and read, and can be cut and pasted can be edited.

현재 세계 표준화 작업이 진행중인 새로운 영상 및 음향 부호화 기술인 MPEG-4는 이러한 필요를 충족시키기 위한 것이다.MPEG-4, a new video and audio encoding technology that is currently under global standardization, is designed to meet this need.

도 1은 현재 국제표준 산하기구에서 1차적으로 확정한 MPEG-4의 VOP 영상 부호화기의 구성도이다.FIG. 1 is a block diagram of a MPEG-4 VOP video encoder first determined by the present International Standards Organization.

이는 기존의 영상부호화 세계표준화인 H.261, H.263, MPEG-1, MPEG-2의 영상 부호화기 구조와는 다른 구조를 지닌다. 특히 모양정보 부호화부(Shape Coder)와 VOP(Video Object Planes)라는 개념의 도입이 가장 두드러진 차이를 보이고 있다. VOP는 유저가 접근 및 편집할 수 있는 임의 모양의 내용물의 시간축상의 한 시점의 객체를 의미하며, 내용물 기반의 기능성(content-based functionality)을 지원하기 위해서는 VOP별로 부호화 되어야 한다.This is different from the video encoder structure of H.261, H.263, MPEG-1, MPEG-2, which is the world standardized video encoding. In particular, the introduction of the concepts of Shape Coder and VOP (Video Object Planes) shows the most significant difference. The VOP refers to an object at a point in time on an arbitrary shape of content that can be accessed and edited by the user. The VOP must be encoded for each VOP to support content-based functionality.

이러한 VOP 부호화기는, VOP형성부(11)에서 형성된 각각의 대상물 영상에 대한 VOP가 움직임 추정부(MOTION ESTIMATION)(13)에 입력되면, 움직임 추정부(13)는 인가된 VOP로부터 매크로 블록 단위의 움직임을 추정하게 된다.When the VOP encoder inputs a VOP for each object image formed in the VOP forming unit 11 to the MOTION ESTIMATION 13, the motion estimating unit 13 performs the macroblock unit from the applied VOP. The motion is estimated.

또한, 상기 움직임 추정부(13)에서 추정된 움직임 정보는 움직임 보상부(MOTION COMPENSATION)(14)에 입력되어 움직임이 보상된다. 그리고, 움직임 보상부(14)에서 움직임이 보상된 VOP는 상기 VOP형성부(11)에서 형성된 VOP와 함께 감산기(16)에 입력되어 차이 값이 검출되고, 감산기(16)에서 검출된 차이 값은 대상물 내부 부호화부(18)에 입력되어 매크로 블록의 서브 블록 단위로 대상물의 내부정보가 부호화 된다.In addition, the motion information estimated by the motion estimator 13 is input to a motion compensation unit 14 to compensate for the motion. The VOP whose motion is compensated by the motion compensator 14 is input to the subtractor 16 together with the VOP formed by the VOP forming unit 11 to detect a difference value, and the difference value detected by the subtractor 16 is The internal information of the object is encoded by the object internal encoding unit 18 in units of sub blocks of the macro block.

예를 들면, 매크로 블록의 X축 및 Y축이 M/2 × N/2로 각기 8개의 화소를 가지는 8 × 8의 서브 블록으로 세분화된 후 대상물 내부정보가 부호화 된다.For example, after the X and Y axes of the macroblock are subdivided into 8x8 subblocks having 8 pixels each with M / 2 x N / 2, the object internal information is encoded.

한편, 움직임 보상부(14)에서 움직임이 보상된 VOP와, 대상물 내부 부호화부(18)에서 부호화된 대상물의 내부정보는 가산기(17)에 입력되어 가산되고, 가산기(17)의 출력신호는 이전 VOP 검출부(PREVIOUS RECONSTRUCTED VOP)(15)에 입력되어 현재 영상 바로 이전 영상의 VOP인 이전 VOP가 검출된다.On the other hand, the VOP whose motion is compensated by the motion compensator 14 and the internal information of the object encoded by the object internal encoder 18 are added to the adder 17, and the output signal of the adder 17 is transferred. The previous VOP which is input to the VEV detector 15 (PREVIOUS RECONSTRUCTED VOP) 15 and is the VOP of the image immediately before the current image is detected.

또한, 이전 VOP검출부(15)에서 검출된 이전 VOP는 상기 움직임 추정부(13) 및 움직임 보상부(14)에 입력되어 움직임 추정 및 움직임 보상에 사용된다.In addition, the previous VOP detected by the previous VOP detector 15 is input to the motion estimator 13 and the motion compensator 14 and used for motion estimation and motion compensation.

그리고, VOP형성부(11)에서 형성된 VOP는 모양정보 부호화부(SHAPE CODING)(12)에 입력되어 모양 정보가 부호화 된다.The VOP formed by the VOP forming unit 11 is input to a shape coding unit 12, and shape information is encoded.

여기서, 모양정보 부호화부(12)의 출력신호는 VOP 부호화기가 적용되는 분야에 따라 사용 여부가 가변 되는 것으로, 점선으로 표시된 바와 같이, 모양정보 부호화부(12)의 출력신호를 움직임 추정부(13), 움직임 보상부(14) 및 대상물 내부 부호화부(18)에 입력시켜 움직임 추정, 움직임 보상 및 대상물의 내부 정보를 부호화 하는 데 사용할 수 있다.Here, the output signal of the shape information encoder 12 may vary depending on the field in which the VOP encoder is applied, and as shown by a dotted line, the output signal of the shape information encoder 12 may be used as the motion estimation unit 13. ), And may be input to the motion compensator 14 and the object internal encoder 18 to be used for encoding motion estimation, motion compensation, and internal information of the object.

또한, 움직임 추정부(13)에서 추정된 움직임 정보와, 대상물 내부 부호화부(18)에서 부호화된 대상물 내부 정보 및 상기 모양정보 부호화부(12)에서 부호화된 모양 정보는 다중화부(19)에 인가되어 다중화된 후, 비트스트림으로 도면에는 도시하지 않았지만 다수개의 부호화기의 출력을 다시 다중화 하여 전송하는 다중화기에 전달되어 전송되어진다.In addition, motion information estimated by the motion estimation unit 13, object internal information encoded by the object internal encoding unit 18, and shape information encoded by the shape information encoding unit 12 are applied to the multiplexer 19. After being multiplexed and multiplexed, the bitstream is transmitted to a multiplexer which multiplexes and outputs the outputs of the plurality of encoders, although not shown in the figure.

이러한 개념의 MPEG-4(Moving Picture Expert Group-4)의 가장 큰 특징 중 하나가 객체(object)를 기반으로 처리를 한다는 것이다. 즉, 한 영상을 여러 개의 객체로 나누고 그 각각의 객체를 개별적으로 부호화하고 처리할 수 있는 것이다. 따라서 객체를 만들기 위해서 모양정보를 알아야 한다.One of the biggest features of this concept of Moving Picture Expert Group-4 (MPEG-4) is that it is based on objects. In other words, one image can be divided into several objects and each object can be individually encoded and processed. So you need to know the shape information to make an object.

상기에서 말하는 모양정보를 흔히 마스크(mask)라고 하는데 영상에서 객체 부분은 '1'로 표현하고 객체 바깥 부분(배경 부분)은 '0'으로 표현한다. 이 모양정보를 이용하여 영상에서 한 객체를 얻을 수 있다. 그리고 이 모양정보를 이용하여 복호기 측에서 객체 부분을 복호하기 때문에 모양정보를 부호화 하여 복호기측에 전송해주어야 한다.The shape information referred to above is commonly referred to as a mask. In the image, the object part is represented by '1' and the outside part of the object (background part) is represented by '0'. This shape information can be used to obtain an object from the image. Since the decoder decodes the object part using the shape information, the shape information should be encoded and transmitted to the decoder side.

영상에서 객체를 분리하기 위한 모양 정보(shape information)와 객체의 움직임 정보(motion information)및 영상 정보(texture information)를 가지고 있어야하고, 정보들을 각각 부호화하여 전송하여야 한다. 블럭단위로 움직임 추정(Motion Estimation)을 하고, 여기에서 생성된 움직임 벡터(Motion vector)로 움직임 보상(Motion Compensation)을 한후에 움직임 보상이 되지않은 부분에 대하여 영상정보 부호화(Texture coding)를 수행한다. 그리고 객체의 모양정보에 대해 모양정보 부호화(Shape coding)를 수행한다.Shape information for separating an object from an image, shape motion information and image information of the object must be included, and the information must be encoded and transmitted. Motion estimation is performed in units of blocks, and motion coding is performed using the motion vector generated therein, and then image coding is performed on a portion where motion compensation is not performed. Shape coding is performed on the shape information of the object.

일반적으로, 움직임 영상을 전송하는 방법으로 순행주사(Progressive)와 격행주사(Interlaced)가 있다. 순행주사는 한 프레임(Frame)단위로 영상을 부호화 하고 전송하며 디스플레이 한다.In general, there are progressive and interlaced methods for transmitting a motion image. Progressive scanning encodes, transmits and displays images in units of frames.

그러나 격행주사는 한 프레임을 첫번째 필드(Top Field :홀수 필드)와 두번째 필드(Bottom Field : 짝수 필드)로 나누어 영상을 부호화하고 전송하며 디스플레이 한다.However, a parallel scan divides one frame into a first field (top field) and a second field (bottom field) and encodes, transmits, and displays the image.

이때 기존의 격행주사는 한 프레임을 두 필드로 나누어서 전송하기 때문에 두 필드간에는 시간축상으로 움직임이 발생하여 움직임의 크기만큼 영상의 차이가 발생하는 문제점을 유발 하였다.In this case, conventional interpolation scan transmits one frame into two fields, which causes motion on the time axis between two fields, causing a difference in image size by the size of the motion.

이에 본 발명은 상기와 같은 종래 격행주사시 발생하는 제반 문제점을 해결하기 위해서 제안된 것으로,Therefore, the present invention has been proposed to solve all the problems occurring during the conventional fleeting scan,

본 발명은 동영상의 격행주사를 위한 부호화시 모양정보의 움직임 정도를 검출하여 그 움직임량에 따라 프레임 단위 또는 필드단위로 모양정보를 부호화하는 격행주사를 위한 모양정보 부호화 방법을 제공하는 데 그 목적이 있다.The present invention provides a shape information encoding method for a progressive scan which detects a motion degree of shape information during encoding for a progressive scan of a video and encodes the shape information in a frame unit or a field unit according to the amount of movement thereof. There is a purpose.

상기와 같은 목적을 달성하기 위한 본 발명에 의한 방법은,Method according to the present invention for achieving the above object,

격행주사를 위한 모양정보 부호화에 있어서,In shape information encoding for a progressive scan,

현재(Current) 프레임의 홀수 필드의 움직임 예측방향을 결정할 때 이전(Reference) 프레임의 홀수 필드에서 움직임 추정을 한 SAD(Sum Absolute Difference)의 최소값을 움직임 예측방향을 결정하고, 현재(Current) 프레임의 짝수 필드의 움직임 예측방향을 결정할 때 이전(Reference) 프레임의 짝수 필드에서 움직임 추정을 한 SAD(Sum Absolute Difference)의 최소값으로 움직임 추정을 수행하여 모양정보를 부호화하는 것을 특징으로 한다.When determining the motion prediction direction of the odd field of the current frame, the minimum value of SAD (Sum Absolute Difference) that has been estimated in the odd field of the reference frame is determined to determine the motion prediction direction, and When determining the motion prediction direction of the even field, the shape information is encoded by performing motion estimation to a minimum value of SAD (Sum Absolute Difference) in which the motion estimation is performed in the even field of the previous frame.

상기와 같은 목적을 달성하기 위한 본 발명에 의한 다른 방법은,Another method according to the present invention for achieving the above object,

이진 모양정보(Binary Alpha Block: BAB)에 대하여 이전 프레임의 정보를 이용한 움직임 추정(Motion Estimation)을 수행하여 한개의 16x16 프레임 움직임 벡터 혹은 두개의 8x16 프레임 움직임 벡터를 구한 후 MVDs가 0이 아닌경우 한개의 16x16 움직임 벡터를 이용하였는지 두개의 8x16 움직임 벡터를 이용하였는가의 여부인 shape_field_prediction(ON: 필드 예측, OFF: 프레임 예측)을 결정하여 모양정보를 부호화하는 것을 특징으로 한다.One 16x16 frame motion vector or two 8x16 frame motion vectors are obtained by performing motion estimation on the binary alpha block (BAB) using the information of the previous frame, and then one if the MVDs are not 0. The shape information is encoded by determining shape_field_prediction (ON: field prediction, OFF: frame prediction), which is used as a 16x16 motion vector or two 8x16 motion vectors.

상기에서, shape_field_prediction이 필드예측이면 1을 전송하고 프레임 예측이면 0을 전송하는 것을 특징으로 한다.In the above, when shape_field_prediction is field prediction, 1 is transmitted, and when frame prediction is transmitted, 0 is transmitted.

또한, 상기에서 shape_field_prediction이 필드예측이면 0을 전송하고 프레임 예측이면 1을 전송하는 것을 특징으로 한다.In addition, if shape_field_prediction is field prediction, 0 is transmitted. If frame_prediction is frame prediction, 1 is transmitted.

상기와 같은 목적을 달성하기 위한 본 발명에 의한 또 다른 방법은,Another method according to the present invention for achieving the above object,

이진 모양정보(Binary Alpha Block: BAB)에 대하여 이전 프레임의 정보를 이용한 움직임 추정(Motion Estimation)을 수행하여 한개의 16x16 프레임 움직임 벡터 혹은 두개의 8x16 프레임 움직임 벡터를 구한 후 MVDs가 0이 아닌경우 한개의 16x16 움직임 벡터를 이용하였는지 두개의 8x16 움직임 벡터를 이용하였는가의 여부인 shape_field_prediction(ON: 필드 예측, OFF: 프레임 예측)을 결정하는 과정과; 모양정보 부가정보(mode)인 BAB_type을 결정하는 과정과; 이진 모양정보의 타입이 All_0이거나 All_255이거나 MVDs=0No_update인 경우에는 BAB에 대한 부호화를 수행하는 과정과; 상기 BAB 타입이 상기 세가지의 경우가 아닐 경우에는 선택적인 격행주사 모양정보 부호화를 수행하기 위해 우선 프레임 CAE 모드인지 필드 CAE 모드 인지의 여부를 확인하여 CAE_type을 결정하는 과정과; 필드 CAE 모드가 선택되면 두 개의 8x16 필드 BAB에 대하여 동일한BAB_type을 적용하여 부호화를 한뒤, 부호화 비트수가 적은 쪽으로 IntraCAE냐 MVDs!=0 No_Update냐 MVDs==0 InterCAE 혹은 MVDs!=0InterCAE냐를 결정하여 BAB_type을 결정하는 과정을 순차 실행시킴을 특징으로 한다.One 16x16 frame motion vector or two 8x16 frame motion vectors are obtained by performing motion estimation on the binary alpha block (BAB) using the information of the previous frame, and then one if the MVDs are not 0. Determining shape_field_prediction (ON: field prediction, OFF: frame prediction), which is whether a 16x16 motion vector or two 8x16 motion vectors are used; Determining a BAB_type that is a shape information additional information mode; If the type of the binary shape information is All_0, All_255, or MVDs = 0No_update, performing encoding on the BAB; If the BAB type is not the three cases, determining CAE_type by first checking whether the CAB mode or the field CAE mode is used to perform selective interpolation scan shape information encoding; When the field CAE mode is selected, two 8x16 field BABs are encoded by applying the same BAB_type, and then the BAB_type is determined by determining IntraCAE or MVDs! = 0 No_Update or MVDs == 0 InterCAE or MVDs! = 0InterCAE. Characterized in that the decision making process is carried out sequentially.

상기에서, CAE_type이 필드 CAE인지 프레임 CAE인지의 결정은, 필드 혹은 프레임 단위로 CAE를 수행한 후 발생되는 비트수가 적은 쪽으로 CAE_Type을 결정하는 것을 특징으로 한다.In the above, the determination of whether the CAE_type is a field CAE or a frame CAE is characterized in that the CAE_Type is determined so that the number of bits generated after performing CAE on a field or frame basis is small.

상기에서, CAE_type이 필드 모드일 때 ST(Scan_type)을 한 개만 이용하는 것을 특징으로 한다.In the above, only one ST (Scan_type) is used when the CAE_type is the field mode.

상기에서, CAE_type이 필드 CAE 모드일 때 부호화 효율을 높이기 위하여 각 필드 BAB별로 Flusih bit을 생성하지 않고, 홀수(Top) 필드 BAB을 부호화하고 난 후에는 Flush bit를 보내지 않고 짝수(Bottom) 필드 BAB을 부호화 한후에 플러쉬 비트를 한 번만 생성하여 전송하는 것을 특징으로 한다.In the above, when CAE_type is in the field CAE mode, in order to increase coding efficiency, after generating the top field BAB without encoding the Flusih bit for each field BAB, the even field BAB is sent without sending the flush bit. After encoding, the flush bit is generated and transmitted only once.

상기에서, BAB_type결정을 CR과 연관지어 수행함으로써 손실부호화(Lossy Coding)일때 원래 형태의 BAB에 CR를 적용하여 All_0,All_255 여부를 결정하고, 재배치 BAB (Permuted BAB)에 CR을 적용하여 All_0, All_255을 여부를 결정한후, 두 경우중 하나라도 All_0 또는All_255인 경우가 있으면 BAB_type을 All_0, All_255로 결정하는 거을 특징으로 한다.In the above, BAB_type determination is performed in association with CR to determine whether All_0, All_255 is applied by applying CR to BAB of the original form in lossy coding, and CR is applied to relocated BAB (Permuted BAB). After determining whether or not, if either case is All_0 or All_255, BAB_type is characterized as All_0, All_255.

상기에서, BAB_type결정을 CR과 연관지어 수행함으로써 손실부호화(Lossy Coding)일때 BAB에 CR를 적용하여 MVDs==0No_Update, MVDs!=0 No_Update 여부를 결정하고, 재배치 BAB(Permuted BAB)에 CR을 적용하여 MVDs==0 No_Update, MVDs!=0 No_Update여부를 결정한 후, 두 경우중 하나라도 MVDs==0No_Update 또는 MVDs!=0 No_Update인 경우가 있으면 BAB_type을MVDs==0No_Update 또는 MVDs!=0 No_Update로 결정하는 것을 특징으로 한다.In the above, by performing BAB_type determination in association with CR, CR is applied to BAB in lossy coding, and MVDs == 0No_Update, MVDs! = 0 No_Update is determined, and CR is applied to relocated BAB (Permuted BAB). After deciding whether MVDs == 0 No_Update, MVDs! = 0 No_Update, if either case MVDs == 0No_Update or MVDs! = 0 No_Update Characterized in that.

도1은 현재 국제표준 산하기구에서 1차적으로 확정한 MPEG-4의 VOP 영상 부호화기의 구성도,1 is a block diagram of a MPEG-4 VOP video encoder primarily determined by the present International Standards Organization;

도2는 순행주사(Progressive)와 격행주사(Interlaced)시 프레임과 필드의 시간축상의 전송순서도,FIG. 2 is a flow chart of time frames and fields in progressive and interlaced scans. FIG.

도3은 적응적인 격행주사 모양정보 부호화(Adaptive Interlaced shape coding)의 순서도,3 is a flow chart of adaptive interlaced shape coding.

도4는 필드 예측을 수행할 때 이전의 필드를 참조하는 방향의 예시도,4 is an exemplary view of a direction of referring to a previous field when performing field prediction;

도5는 매크로 블럭(MB)단위로 부호화할 프레임 MB과 필드 MB의 구성도,5 is a configuration diagram of a frame MB and a field MB to be encoded in units of macro blocks (MB);

도6은 도5에서 이진 모양정보(Mask)가 존재할 경우 움직임이 적은 영상 예시도,FIG. 6 is a view showing an image with little motion when binary shape information Mask is present in FIG. 5;

도7은 도5에서 이진 모양정보(Mask)가 존재할 경우 움직임이 많은 영상 예시도,FIG. 7 is a diagram illustrating an image having a lot of motion when binary shape information Mask is present in FIG. 5;

도8은 가장자리를 16x16의 매크로 블럭으로 나타낸 움직임 보상 이진 모양 블럭도(Bordered MC BAB),8 is a motion compensated binary shape block diagram (Bordered MC BAB) with edges represented by 16 × 16 macroblocks,

도9는 가장자리를 16x16의 매크로 블럭으로 나타낸 현재의 이진모양 블럭도(Bordered Current BAB),9 is a current binary ordered block diagram (Bordered Current BAB) showing the edges as 16x16 macroblocks,

도10은 원래 형태의 프레임에 대하여 CR(Conversion ratio)을 결정하는 과정에 대한 흐름도(이 경우 결정되는 CR은 1개),10 is a flowchart of a process of determining a conversion ratio (CR) for a frame of an original form (in this case, one CR is determined),

도11은 CR을 결정하기 전에 필드별 재배치(Reordering)를 한후 재배치 BAB을 구성하고 여기서 CR (Conversion ratio)을 결정하는 순서도(이 경우 결정되는 CR은 1개),FIG. 11 is a flow chart of reordering BAB after field reordering before determining CR, and determining a conversion ratio (CR) in this case (1 CR is determined in this case).

도12는 홀수(Top) 필드 BAB와 짝수(Bottom) 필드 BAB에 대하여 각각 CR(Conversion ratio)을 결정하는 과정에 대한 순서도(이 경우 결정되는 CR은 필드 BAB별로 총 2개),12 is a flowchart for a process of determining a conversion ratio (CR) for each of the top field BAB and the even field BAB (in this case, two CRs determined for each field BAB);

도13은 CAE_type이 필드 CAE 모드일 경우 재배치 BAB에 대하여CR (Conversion Ratio)를 결정하는 순서도(이 경우 결정되는 CR은 1개),13 is a flowchart for determining a conversion ratio (CR) for a relocation BAB when CAE_type is in field CAE mode (in this case, one CR is determined),

도14는 도3에서 추가되는 CAE_type의 1비트의 Syntax구성도,FIG. 14 is a 1-bit Syntax configuration diagram of CAE_type added in FIG. 3; FIG.

도15는 도3에서 추가되는 CAE_type의 1비트와 모양정보 움직임 예측의 shape_field_prediction 1비트와 Bottom 필드의 추가 움직임 벡터 Syntax구성도,15 is an additional motion vector syntax structure of 1 bit of CAE_type and 1 bit of shape_field_prediction of shape information motion prediction and Bottom field added in FIG.

도16은 기존의 움직임 벡터의 가변길이 부호화 테이블,16 is a variable length coding table of an existing motion vector;

도17은 움직임 벡터의 가로방향(x축)의 좌표가 0일때 적용되는 가변길이 부호화 테이블.Fig. 17 is a variable length coding table applied when the coordinate in the horizontal direction (x axis) of a motion vector is zero.

도면의 주요 부분에 대한 부호의 설명Explanation of symbols for the main parts of the drawings

11:VOP 형성부11: VOP forming part

12:모양정보 부호화부12: shape information encoder

13:움직임 추정부13: Motion estimation part

14:움직임 보상부14: Motion compensation department

18:대상물 내부 부호화부18: object internal encoding unit

19:다중화부19: multiplexing unit

20:버퍼20: buffer

이하, 본 발명의 바람직한 실시예를 첨부한 도면에 의거 상세히 설명하면 다음과 같다.Hereinafter, with reference to the accompanying drawings, preferred embodiments of the present invention will be described in detail.

도3은 선택적인 격행주사 모양정보 부호화에 관한 순서도를 나타내었다.3 shows a flow diagram for selective encoding of the scan information.

이진 모양정보(Binary Alpha Block: BAB)에 대하여 이전 프레임의 정보를 이용한 움직임 추정(Motion Estimation)을 수행하여 한개의 16x16 프레임 움직임 벡터 혹은 두개의 8x16 프레임 움직임 벡터를 구한 후 MVDs가 0이 아닌경우 한개의 16x16 움직임 벡터를 이용하였는지 두개의 8x16 움직임 벡터를 이용하였는가의 여부인 shape_field_prediction(ON: 필드 예측, OFF: 프레임 예측)을 결정한다.One 16x16 frame motion vector or two 8x16 frame motion vectors are obtained by performing motion estimation on the binary alpha block (BAB) using the information of the previous frame, and then one if the MVDs are not 0. The shape_field_prediction (ON: field prediction, OFF: frame prediction) is determined whether the 16x16 motion vector or two 8x16 motion vectors are used.

그 후 모양정보 부가정보(mode)인 BAB_type을 결정한다.Thereafter, BAB_type as the shape information additional information mode is determined.

상기 BAB_type은All_0, All_255, MVDs=0No_update, MVDs!=0No_update, IntraCAE , MVDs=0InterCAE, MVDs!=0 InterCAE 로 구분된다.The BAB_type is classified into All_0, All_255, MVDs = 0No_update, MVDs! = 0No_update, IntraCAE, MVDs = 0InterCAE, MVDs! = 0 InterCAE.

여기에서 블럭(BAB)의 모양정보가 모두 0일 경우에는 All_0로 정의하고 블럭(BAB)의 모양정보가 모두 255일 경우에는 All_255로 정의한다. MVDs=0No_update의 정의는 BAB의 모양정보 움직임 벡터가 0이고 이전 BAB와 동일하여 부호화를 하지 않는 경우이고 MVDs!=0No_update의 정의는 BAB의 모양정보 움직임 벡터가 0이 아니고 이전 BAB와 움직임 보상을 하였을 때 동일하여 부호화를 하지 않는 경우이다.Here, if the shape information of the block BAB is all 0, it is defined as All_0. If the shape information of the block BAB is all 255, it is defined as All_255. The definition of MVDs = 0No_update means that the shape information motion vector of BAB is 0 and is not encoded because it is the same as the previous BAB. This is the case when the same code is not used.

Intra CAE는 BAB를 Intra CAE 부호화 하는 것을 말하고 MVDs=0InterCAE의 정의는 BAB의 모양정보 움직임 벡터가 0이고 이전 BAB와 동일하지 않아서 Inter CAE 부호화를 해야 하는 경우이고 MVDs!=0InterCAE 의 정의는 BAB의 모양정보 움직임 벡터가 0이 아니고 이전 BAB와 움직임 보상을 하였을 때 동일하지 않아서 Inter CAE 부호화를 해야 하는 경우이다.Intra CAE refers to Intra CAE encoding of BAB. The definition of MVDs = 0InterCAE is when the shape information motion vector of BAB is 0 and it is not the same as the previous BAB. This is the case where Inter CAE coding is required because the information motion vector is not 0 and is not the same when the motion compensation is performed with the previous BAB.

All_0, All_255, MVDs==0No_update, 세가지의 경우BAB_type이 부호화됨으로써 BAB에 대한 부호화가 종료된다. 위의 세가지가 아닌 경우 선택적인 격행주사 모양정보 부호화를 수행하기 위해 우선 프레임 CAE 모드인지 필드 CAE 모드 인지의 여부를 확인하여CAE_type을 결정한다.All_0, All_255, MVDs = 0 No_update, in three cases, the BAB_type is encoded, so that the encoding for the BAB is terminated. If the above three are not performed, CAE_type is determined by first checking whether the frame CAE mode or the field CAE mode is used to perform selective encoding of the scan information.

상기 CAE_type 결정에는 여러가지 방법이 있을 수 있다.There may be various methods for determining the CAE_type.

프레임 CAE 모드가 선택되면 16x16 BAB 단위로 부호화를 한뒤, 부호화 비트수가 적은 쪽으로 IntraCAE냐 MVDs!=0No_Update냐 MVDs==0InterCAE 혹은 MVDs!=0InterCAE냐를 결정하여 BAB_type을 결정한다.When the frame CAE mode is selected, after encoding in units of 16 × 16 BAB, BAB_type is determined by determining whether IntraCAE or MVDs! = 0No_Update or MVDs == 0InterCAE or MVDs! = 0InterCAE in the less coded bits.

필드 CAE 모드가 선택되면 16x16 BAB가 두개의 8x16 서브블럭(이하 이를 필드 BAB이라 칭한다.)으로 나뉘게 되는데, 두 8x16 필드 BAB에 대하여 동일한BAB_type을 적용하여 부호화를 한뒤, 부호화 비트수가 적은 쪽으로 IntraCAE냐 MVDs!=0 No_Update냐 MVDs==0 InterCAE 혹은 MVDs!=0InterCAE냐를 결정하여 BAB_type을 결정한다.When the field CAE mode is selected, the 16x16 BAB is divided into two 8x16 subblocks (hereinafter referred to as field BABs). The same BAB_type is applied to two 8x16 field BABs, and then IntraCAE or MVDs ! = 0 No_Update or MVDs == 0 InterCAE or MVDs! = 0InterCAE to determine BAB_type.

제4도는 현재(Current) 프레임의 홀수 필드(Top field)와 짝수 필드(Bottom field)의 모양정보 움직임 추정 방향에 대한 그림을 나타내었다.4 is a diagram illustrating a shape information motion estimation direction of an odd field and an even field of a current frame.

현재(Current) 프레임의 홀수 필드(Top field)의 움직임 예측방향을 결정할 때, 이전(Reference) 프레임의 홀수 필드(Top field)에서 움직임 추정을 해보아 SAD(Sum Absolute Difference)의 최소값이 얻어지는움직임 벡터를 결정한다.When determining the direction of motion estimation of the odd field of the current frame, the motion vector obtains the minimum value of sum absolute difference (SAD) by estimating the motion of the odd field of the reference frame. Determine.

또한, 현재(Current) 프레임의 짝수 필드(Bottom field)의 움직임 예측방향을 결정할 때 이전(Reference) 프레임의 짝수 필드(Bottom field)에서 움직임 추정을 해보아 SAD(Sum Absolute Difference)의 최소값이 얻어지는 움직임 벡터를 결정한다.In addition, when determining the motion prediction direction of the even field of the current frame, a motion is estimated in the even field of the previous frame to obtain a minimum value of the sum absolute difference (SAD). Determine the vector.

도5는 두 필드를 한 프레임으로 구성한 후에 예를 들어 매크로 블럭사이즈로 부호화를 할 때 프레임 단위로 부호화 할 것인가 아니면 필드단위로 부호화 할 것인가의 영상 구성에 대한 도면이다.FIG. 5 is a diagram illustrating an image configuration of whether two fields are configured in one frame and then encoded in frame units or field units when encoded in a macroblock size, for example.

흰색 라인은 홀수필드(Top field)이고, 회색 라인은 짝수필드(Bottom field)를 나타낸다.The white line represents the odd field and the gray line represents the even field.

도6은 움직임이 적은 영상이나 정지 영상일 때 이진 모양정보(Binary Mask)를 구성하여 프레임 단위와 필드 단위로 부호화 하는 경우를 나타낸다.FIG. 6 illustrates a case in which binary shape information is configured and encoded in a frame unit and a field unit when the moving image is a still image or a still image.

시간상의 변화가 아주 적기 때문에 두 필드 간의 영상 모양의 변화가 거의 없다. 이 경우에는 프레임 단위로 모양정보 부호화 하는 것이 유리하다. 프레임 단위로 부호화할 때 이진 모양정보의 변화가 심하지 않기 때문이다Since the change in time is very small, there is almost no change in image shape between the two fields. In this case, it is advantageous to encode shape information in units of frames. This is because binary shape information is not severely changed when encoded in frame units.

도7은 움직임이 많은 영상일 때 이진 모양정보(Binary Mask)를 구성하여 프레임 단위와 필드 단위로 부호화 하는 경우를 나타낸다. 시간상의 변화가 많기 때문에 두필드간의 영상모양의 변화가 심하다. 이 경우에는 필드단위로 모양정보 부호화 하는 것이 유리하다. 필드단위로 부호화할 때 이진 모양정보의 변화가 심하지 않기 때문이다.FIG. 7 illustrates a case in which binary shape information is configured and encoded in a frame unit and a field unit when an image having a lot of motions. Since there are many changes in time, the change in image shape between two fields is severe. In this case, it is advantageous to encode shape information in field units. This is because the binary shape information is not severely changed when coding in field units.

상기 도6과 도7의 예에서 보듯이 움직임이 많고 적음에 따라 격행주사(Interlaced)의 부호화 효율이 다르게 나타난다. 선택적(Adaptive)으로 프레임과 필드를 구분하여 부호화 함으로서 최적의 부호화 효율을 얻을 수 있게 된다. 부호화 방법으로는 현재 국제표준MPEG-4 CD(Committee Draft)에 채택된 CAE(Context-based Arithmetic Encoding)가 이용된다.As shown in the examples of FIG. 6 and FIG. 7, the encoding efficiency of the interlaced appears differently as the motion is large and small. Optimum coding efficiency can be obtained by separately coding the frames and the fields. As the encoding method, CAE (Context-based Arithmetic Encoding) adopted in the international standard MPEG-4 CD (Committee Draft) is used.

매크로블럭 내의 움직임이 많고 적고 하는 기준은 다음의 수식으로 정할 수 있다. 도6에서 보듯이 이진 모양정보가 존재하는 화소를 1로 정하고 이진 모양정보가 존재하지 않는 화소를 0이라고 정한다.The criterion that there is a lot of motion in the macroblock is small and can be determined by the following equation. As shown in Fig. 6, a pixel having binary shape information is set to 1, and a pixel having no binary shape information is set to 0.

｜P_2i,j-P_2i+1,j｜+｜P_2i+1,j-P_2i+2,j｜) ｜P_2i,j-P_2i+2,j｜+｜P_2i+1,j-P_2i+3,j｜) | P _{2i, j} -P _{2i + 1, j} | + | P _{2i + 1, j} -P _{2i + 2, j} |) | P _{2i, j} -P _{2i + 2, j} | + | P _{2i + 1, j} -P _{2i + 3, j} |)

여기에서 Pi,j는 이진 모양정보 데이타이다 즉 모양정보가 존재하면 1이고 없으면 0이다.Where Pi, j is binary shape information data, i.e. 1 if shape information is present and 0 if it is present.

좌변값이 크면 필드 CAE 모드가 선택되고 우변값이 크면 프레임 CAE 모드가 선택된다. 여기에 추가로 필드 CAE 모드인가 프레임 CAE 모드인가의 여부가 CAE_type(1비트)으로 전송된다. CAE_type에 따라 필드 혹은 프레임 단위로 CAE가 수행된다. 위와 같은 수식을 사용하지 않고, 필드 혹은 프레임 단위로CAE를 수행한 후 발생되는 비트수가 적은 쪽으로 CAE_Type을 결정할 수도 있다.If the left side is large, the field CAE mode is selected. If the right side is large, the frame CAE mode is selected. In addition, whether the field CAE mode or the frame CAE mode is transmitted is transmitted as CAE_type (1 bit). CAE is performed in units of fields or frames according to CAE_type. Instead of using the above formula, the CAE_Type may be determined so that the number of bits generated after performing CAE on a field or frame basis is small.

도8의 왼쪽그림은 16x16의 매크로 블럭의 가장자리를 나타낸 움직임 보상 이진 모양 블럭(Bordered MC BAB:Bordered Motion Compensated Binary Alpha Block)을 나타낸다. 굵은선 내부는 부호화 하고자 하는 매크로 블럭(16x16)을 의미한다. 도8의 오른쪽 그림은 필드(Field) CAE 모드가 선택된 경우, 8x16의 홀수필드와 짝수필드로 나누어 지고 각각의 8x16의 서브블럭의 가장자리의 데이타는 각각의 필드와 관련있는 부분에서 가져와 구성됨을 나타낸다. 즉 Bordered MC BAB가 두필드로 나누어 사용된다.The left figure of FIG. 8 shows a Bordered MC BAB (Bordered Motion Compensated Binary Alpha Block) showing the edge of a 16x16 macroblock. The thick line inside means a macroblock 16x16 to be encoded. The right figure of FIG. 8 shows that when the Field CAE mode is selected, the 8x16 odd field and the even field are divided, and the data of the edge of each 8x16 subblock is obtained from the part associated with each field. In other words, Bordered MC BAB is divided into two fields.

도9의 왼쪽그림은 16x16의 매크로 블럭의 가장자리를 나타낸 현재 이진 모양 블럭(Bordered Current BAB: Bordered Current Binary Alpha Block)을 나타낸다. 도9의 오른쪽 그림은 필드(Field) CAE 모드가 선택된 경우, 8x16의 홀수필드와 짝수필드로 나누어 지고 각각의 8x16의 서브블럭의 가장자리의 데이타는 각각의 필드The left figure of FIG. 9 shows a Bordered Current BAB (Bordered Current Binary Alpha Block) showing the edge of a 16x16 macroblock. In the right figure of Fig. 9, when the Field CAE mode is selected, the 8x16 odd field and the even field are divided and the data of the edge of each 8x16 subblock is displayed in each field.

와 관련있는 부분에서 가져와 구성됨을 나타낸다. 즉 Bordered Current BAB가 두필드로 나누어 사용된다. 위쪽 가장자리(Top border)와 왼쪽 가장자리(Left border)만 각각의 필드에서 데이타를 가져와 구성하게 된다.It is taken from the part related to In other words, Bordered Current BAB is divided into two fields. Only the top and left borders get their data from each field.

손실 부호화(Lossy coding)를 위한 모양정보 부호화 과정에서 대상 BAB를 어떠한 형태로 구성하여 CR(Conversion Ratio)을 결정하는 가에 관하여 도10,도11,도12,도13을 이용하여 나타내었다.10, 11, 12, and 13 illustrate how the target BAB is configured to determine the conversion ratio (CR) in the shape information encoding process for lossy coding.

대상 BAB를 구성하는 방법은 두가지가 있다.There are two ways to construct the target BAB.

도5를 참조하여 설명하면, 왼쪽의 그림과 같이 원래 형태의 BAB을 유지할 수도 있고 오른쪽의 그림과 같이 필드별로 모아 재배치하여 새로운 형태의 BAB을 구성할 수도 있다. 이하 필드별로 모아 재배치하여 구성한 BAB을 재배치 BAB (Permuted BAB)이라 칭한다.Referring to FIG. 5, the original form of the BAB may be maintained as shown in the left figure, or the new type of BAB may be configured by rearranging and rearranging by field as shown in the figure on the right. Hereinafter, the BAB formed by collecting and rearranging by field is referred to as relocated BAB (Permuted BAB).

도10은 원래 형태의 Frame에 대하여 CR(Conversion ratio)을 결정하는 과정에 대한 블럭도이다. 이 경우 결정되는 CR은 1개이다.FIG. 10 is a block diagram illustrating a process of determining a conversion ratio (CR) for a frame having an original shape. In this case, one CR is determined.

도11은 상기 CR을 결정하기 전에 필드별 재배치(Reordering)를 한후 재배치 BAB을 구성하고 여기서 CR (Conversion ratio)을 결정하는 블럭도이다. 이 경우 결정되는 CR은 1개이다.FIG. 11 is a block diagram of relocation BAB after field reordering prior to determining the CR, and determining a conversion ratio (CR). In this case, one CR is determined.

도12는 홀수(Top) 필드 BAB와 짝수(Bottom) 필드 BAB에 대하여 각각 CR(Conversion ratio)을 결정하는 과정에 대한 블럭도이다. 이 경우 결정되는 CR은 필드 BAB별로 총 2개 이다.12 is a block diagram of a process of determining a conversion ratio (CR) for an odd field BAB and an even field BAB, respectively. In this case, two CRs are determined for each field BAB.

도13은 CAE_type이 필드 CAE 모드일 경우 재배치 BAB에 대하여CR (Conversion Ratio)를 결정한다. CAE_type이 프레임 CAE 모드이면 BAB에 대하여 CR을 결정한다. 이 경우 결정되는 CR은 1개 이다.FIG. 13 determines a conversion ratio (CR) for relocation BAB when CAE_type is in field CAE mode. If CAE_type is frame CAE mode, CR is determined for BAB. In this case, one CR is determined.

위에 기술된 CR 결정방법에 덧붙여 BAB_type결정을 CR과 연관지어 수행함으로써 손실부호화(Lossy Coding)일때 부호화 이득을 얻는 방법들을 다음과 같다.In addition to the CR determination method described above, a method of obtaining coding gain in lossy coding by performing BAB_type determination in association with CR is as follows.

원래 형태의 BAB에 CR를 적용하여 All_0,All_255 여부를 결정하고, 재배치 BAB (Permuted BAB)에 CR을 적용하여 All_0, All_255을 여부를 결정한후, 두 경우중 하나라도 All_0 또는All_255인 경우가 있으면 BAB_type을 All_0, All_255로 결정한다.After CR is applied to BAB of original form, All_0, All_255 is determined, CR is applied to relocated BAB (Permuted BAB), and All_0, All_255 is determined. If either case is All_0 or All_255, BAB_type Determine as All_0, All_255.

BAB에 CR를 적용하여 MVDs==0No_Update, MVDs!=0 No_Update 여부를 결정하고, 재배치 BAB(Permuted BAB)에 CR을 적용하여 MVDs==0 No_Update, MVDs!=0 No_Update여부를 결정한 후, 두 경우중 하나라도 MVDs==0No_Update 또는 MVDs!=0 No_Update인 경우가 있으면 BAB_type을MVDs==0No_Update 또는 MVDs!=0 No_Update로 결정한다.Apply CR to BAB to determine whether MVDs == 0No_Update, MVDs! = 0 No_Update and apply CR to Permuted BAB to determine whether MVDs == 0 No_Update, MVDs! = 0 No_Update If either of MVDs = 0 No_Update or MVDs! = 0 No_Update, BAB_type is determined as MVDs = 0 No_Update or MVDs! = 0 No_Update.

이 밖에도 부호화 효율 향상을 위하여 다음 사항들을 고려하여야 한다In addition, the following items should be considered to improve coding efficiency.

CAE_type이 필드 모드일 때 ST(Scan_type)을 한 개만 이용한다. 즉 홀수(Top) 필드 BAB과 짝수(Bottom) 필드 BAB에 동일한 ST를 이용한다. 각 8x16 필드BAB단위로 총 2개를 보내면 부가정보가 늘어나서 비트량에서 손해를 보기 때문이다.Only one ST (Scan_type) is used when CAE_type is in field mode. That is, the same ST is used for the odd field BAB and the even field BAB. This is because sending a total of two in each 8x16 field BAB unit increases the additional information, which causes a loss in bits.

일반적으로 CAE(Context Arithmetic Encoding)할 때 부호화 단위별(BAB단위)로 각화소의 발생확률을 이용하여 Arithmetic Coding을 한후에 Flush bit(2-3비트)를 보내는 방법을 이용한다. CAE_type이 필드 CAE 모드일 때 부호화 효율을 높이기 위하여 각 필드 BAB별로 Flusih bit을 생성하지 않고, 홀수(Top) 필드 BAB을 부호화하고 난 후에는 Flush bit를 보내지 않고 짝수(Bottom) 필드BAB을 부호화 한후에 한 번만 생성한다.In general, in case of CAE (Context Arithmetic Encoding), Arithmetic Coding is performed by using occurrence probability of each pixel in each coding unit (BAB unit), and then Flush bit (2-3 bits) is transmitted. In order to improve coding efficiency when CAE_type is in field CAE mode, after encoding the top field BAB without encoding the Flusih bit for each field BAB and encoding the even field BAB without sending the flush bit after encoding the top field BAB, Generate only once.

도14는 움직임 벡터가 16x16 한 개의 벡터로 전송될때 CAE_type 1비트의 추가정보의 syntax상의 위치를 나타내었다.FIG. 14 shows the syntax position of additional information of CAE_type 1 bit when the motion vector is transmitted in one 16x16 vector.

도15는 움직임 벡터가 8x16 두 개의 벡터로 전송될때 CAE_type 1비트의 추가정보의 syntax상의 위치와 shape_field_prediction, 추가의 움직임 벡터의 syntax상의 위치를 나타내었다.Figure 15 shows the position of the syntax of the CAE_type 1 bit additional information, shape_field_prediction, and the position of the additional motion vector when the motion vector is transmitted as two 8x16 vectors.

도16은 기존의 움직임 벡터의 가변길이 부호화 테이블 이고, 도17은 움직임 벡터의 가로방향(x축)의 좌표가 0일때 적용되는 가변길이 부호화 테이블이다.FIG. 16 is a conventional variable length coding table of a motion vector, and FIG. 17 is a variable length coding table applied when a coordinate in the horizontal direction (x-axis) of a motion vector is zero.

도16의 가변길이 부호화 테이블을 shape_field_prediction이 ON일 때 x축,y축 움직임 벡터의 가변길이 부호화 테이블로 사용해야 한다.(이 때 움직임 벡터의 가로방향(x축)의 좌표가 0일때도 도16을 이용한다)The variable length coding table of FIG. 16 should be used as the variable length coding table of the motion vector of the x-axis and the y-axis when shape_field_prediction is ON. (At this time, even if the coordinate in the horizontal direction (x-axis) of the motion vector is 0. I use it)

이상에서 상술한 바와 같이 본 발명은, 선택적인 격행주사 모양정보 부호화시 프레임과 필드 사이에서 모양정보의 움직임 량에 따라 부호화 모드를 선택 함으로써 격행주사시 모양정보의 부효화 효율을 높일 수 있는 효과가 있다.As described above, the present invention has the effect of increasing the invalidation efficiency of shape information during the progressive scan by selecting the encoding mode according to the amount of motion of the shape information between the frame and the field when encoding the optional progressive scan shape information. There is.

Claims

In shape information encoding for a progressive scan,

When determining the motion prediction direction of the odd field of the current frame, the minimum value of SAD (Sum Absolute Difference) that has been estimated in the odd field of the reference frame is determined to determine the motion prediction direction, and When determining the motion prediction direction of the even field, the shape information is encoded by performing the motion estimation to the minimum value of SAD (Sum Absolute Difference) which estimates the motion in the even field of the previous frame. Shape information encoding method.

In shape information encoding for a progressive scan,

One 16x16 frame motion vector or two 8x16 frame motion vectors are obtained by performing motion estimation on the binary alpha block (BAB) using the information of the previous frame, and then one if the MVDs are not 0. Shape information is encoded for a perimeter scan by determining shape_field_prediction (ON: field prediction, OFF: frame prediction), which is whether a 16x16 motion vector or two 8x16 motion vectors are used. Way.

The method of claim 2,

And if the shape_field_prediction is field prediction, 1 is transmitted, and if frame shape is predicted, 0 is transmitted.

The method of claim 2,

And if the shape_field_prediction is field prediction, 0 is transmitted, and if frame shape prediction is 1, 1 is transmitted.

In shape information encoding for a progressive scan,

One 16x16 frame motion vector or two 8x16 frame motion vectors are obtained by performing motion estimation on the binary alpha block (BAB) using the information of the previous frame, and then one if the MVDs are not 0. Determining shape_field_prediction (ON: field prediction, OFF: frame prediction), which is whether a 16x16 motion vector or two 8x16 motion vectors are used; Determining a BAB_type that is a shape information additional information mode; If the type of the binary shape information is All_0, All_255, or MVDs = 0No_update, performing encoding on the BAB; If the BAB type is not the three cases, determining CAE_type by first checking whether the CAB mode or the field CAE mode is used to perform selective interpolation scan shape information encoding; When the field CAE mode is selected, two 8x16 field BABs are encoded by applying the same BAB_type, and then the BAB_type is determined by determining IntraCAE or MVDs! = 0 No_Update or MVDs == 0 InterCAE or MVDs! = 0InterCAE. A shape information encoding method for a percussion scan, characterized in that to execute the determining process sequentially.

The method of claim 5,

And determining whether the CAE_type is a field CAE or a frame CAE to determine the CAE_Type in the number of bits generated after performing CAE on a field or frame basis.

The method of claim 5,

And only one ST (Scan_type) is used when the CAE_type is the field mode.

The method of claim 5,

When the CAE_type is the field CAE mode, in order to increase coding efficiency, after generating the top field BAB without encoding the Flusih bit for each field BAB, after encoding the even field BAB without sending the flush bit, Shape information encoding method for a perimeter scan, characterized in that to generate the flush bit only once.

The method of claim 5,

The BAB_type determination is performed in association with CR to determine whether All_0, All_255 is applied by applying CR to BAB of the original form in lossy coding, and CR is applied to relocated BAB (Permuted BAB) to determine whether All_0, All_255 And then determine if BAB_type is All_0 or All_255 if either of them is All_0 or All_255.

The method of claim 5,

The BAB_type decision is performed in association with CR to determine whether MVDs = 0 No_Update, MVDs! = 0 No_Update by applying CR to BAB when lossy coding and MVDs by applying CR to relocated BAB (Permuted BAB). == 0 No_Update, MVDs! = 0 After deciding whether or not to update, if either case MVDs == 0No_Update or MVDs! = 0 No_Update, BAB_type is determined to be MVDs == 0No_Update or MVDs! = 0 No_Update. A shape information encoding method for a percussion scan.