KR101295158B1

KR101295158B1 - Image codec system for supporting spatial random access and image encoding/decoding equipment and method thereof

Info

Publication number: KR101295158B1
Application number: KR1020070016921A
Authority: KR
Inventors: 문주희; 박민철
Original assignee: 세종대학교산학협력단
Priority date: 2007-02-20
Filing date: 2007-02-20
Publication date: 2013-08-09
Also published as: KR20080077430A

Abstract

An image codec system supporting spatial random access, and an image encoding / decoding apparatus and method configuring the same, are described. In an image encoding apparatus of an image codec system according to an embodiment of the present invention, a decoded pixel value of a pixel of an adjacent block, which is used as a prediction value in intra prediction encoding, together with an image bitstream, is separately stored as prediction data. When requested by the user, prediction data is separately extracted and transmitted along with the image bitstream of the target image. The image decoding apparatus of the image codec system according to the present embodiment reconstructs the target image using only the received image bitstream and the prediction data of the target image. According to such an embodiment of the present invention, since the video bitstream does not need to be transmitted to an area not included in the target video, the transmission efficiency can be improved.

Description

Image codec system for supporting spatial random access and image encoding / decoding equipment and method

도 1a 내지 도 1c는 각각 조각 영상의 크기에 따른 전송 영역의 관계를 보여주는 도면이다.1A to 1C are diagrams showing a relationship of a transmission area according to the size of a fragment image, respectively.

도 2는 현재 매크로블록 및 이의 복호화에 필요한 주변 매크로블록과의 관계를 보여주는 도면이다.2 is a diagram illustrating a relationship between a current macroblock and neighboring macroblocks necessary for decoding thereof.

도 3은 본 발명의 제1 실시예에 따른 영상 코덱 시스템의 구성을 보여주는 블록도이다.3 is a block diagram illustrating a configuration of an image codec system according to a first embodiment of the present invention.

도 4는 현재 매크로블록 내의 16개 4×4 블록과 상기 4×4 블록의 복호화에 모드 정보가 요구되는 주변 4×4 블록을 보여주는 도면이다.4 is a diagram showing 16 4x4 blocks in a current macroblock and neighboring 4x4 blocks for which mode information is required for decoding the 4x4 blocks.

도 5는 대상 영상에서 각 매크로블록의 위치에 따른 유형을 분류하여 보여주는 도면이다.FIG. 5 is a diagram illustrating types classified according to positions of respective macroblocks in a target image.

도 6은 본 발명의 일 실시예에 따라서 불일치 데이터(M-Data)를 구하는 과정을 도식적으로 보여주는 블록도이다.6 is a block diagram schematically illustrating a process of obtaining mismatch data (M-Data) according to an embodiment of the present invention.

도 7은 본 발명의 제2 실시예에 따른 영상 코덱 시스템의 구성을 보여주는 블록도이다.7 is a block diagram illustrating a configuration of an image codec system according to a second embodiment of the present invention.

본 발명은 영상 코덱에 관한 것으로, 보다 구체적으로 공간임의접근(spatial random access)을 지원하는 영상 코덱 시스템과 이를 구성하는 영상 부호화/복호화 장치 및 방법에 관한 것이다.The present invention relates to an image codec, and more particularly, to an image codec system supporting spatial random access, and an image encoding / decoding apparatus and method for configuring the same.

최근 방송의 디지털화와 더불어 방송과 통신의 융합이 활발히 진행되고 있다. 이에 따라서, 방송 환경도 기존의 단방향 수신에서 양방향 통신이 가능해짐과 아울러 이종망의 연동과 다양한 단말기를 수용하는 복잡한 형태로 변화하고 있다. 이러한 흐름을 반영하는 대표적인 예로 유비쿼터스 텔레비젼(TV)을 들 수 있다. 유비쿼터스 TV란 디지털 기기가 인터넷에 연결되어 있으면 언제 어디서나 시청이 가능한 TV를 말한다. 유비쿼터스 TV를 시청하기 위해서는 유비쿼터스 TV용 튜너를 케이블 또는 지상파 TV의 안테나에 연결하고 또한 이를 초고속 인터넷 선에 연결하면 된다. 이러한 유비쿼터스 TV를 이용하면, 노트북, 개인 휴대용 정보 단말기(PDA), 와이브로 단말기, 휴대용 멀티미디어 플레이어(Portable Multi-media Player, PMP) 등의 휴대용 단말기를 이용해 원격으로 유비쿼터스 TV용 튜너를 조정해 TV 프로그램만이 아니라, 영화, 사진, 애니메이션 등의 다양한 미디어를 볼 수 있다. 이러한 유비쿼터스 TV 서비스는 사용자가 원하는 형태의 단말기를 이용하여 언제, 어디서나 원하는 정보를 획득하고 또는 영상의 시청이 가능하다는 점에서 차세대 서비스로 급부상하고 있다.Recently, with the digitization of broadcasting, convergence of broadcasting and communication has been actively progressed. Accordingly, the broadcasting environment is also changing to a complex form that allows two-way communication in the existing one-way reception and interworking heterogeneous networks and accommodates various terminals. A representative example of this trend is ubiquitous television (TV). Ubiquitous TV is a TV that can be watched anytime and anywhere when digital devices are connected to the Internet. In order to watch ubiquitous TV, a ubiquitous TV tuner can be connected to an antenna of a cable or terrestrial TV and connected to a high speed internet line. With such ubiquitous TV, the ubiquitous TV tuner can be remotely controlled using a portable terminal such as a laptop, a personal digital assistant (PDA), a WiBro terminal, or a portable multimedia player (PMP). In addition, various media such as movies, photos, and animations can be viewed. The ubiquitous TV service is rapidly emerging as a next-generation service in that a user can obtain desired information anytime, anywhere, or view a video using a terminal of a user's desired form.

그런데 다양한 단말기에서 정보를 공유할 수 있는 이러한 유비쿼터스 환경은 단말기간의 해상도 불일치라는 새로운 문제를 만들어 내고 있다. 일례로, 최근의 고화질 텔레비젼(HDTV)의 경우에는 1920×1080의 해상도까지 지원하고 있고, 미국 중심의 영화 업계에서 새롭게 기획하고 있는 영화 전용 디지털 포맷은 4K×2K 크기의 디지털 시네마 영상을 기본으로 하고 있다. 반면에 PDA, 휴대용 멀티미디어 플레이어(PMP), 와이브로 단말기 등은 CIF(Common Intermediate Format) 또는 QCIF(Quater Common Intermediate Format) 크기까지의 해상도만을 지원한다. However, such a ubiquitous environment that can share information in various terminals is creating a new problem of resolution mismatch between terminals. For example, the recent high-definition television (HDTV) supports up to 1920 × 1080 resolution, and the new film-only digital format in the US-based film industry is based on 4K x 2K digital cinema video. have. On the other hand, PDAs, portable multimedia players (PMPs), and WiBro terminals only support resolutions up to Common Intermediate Format (CIF) or Quad Common Intermediate Format (QCIF).

이와 같이, 단말기간의 해상도가 서로 상이하면, 하나의 동영상 데이터를 여러 단말기 사이에서 호환하여 이용할 수 없는 문제가 발생할 수 있다. 그리고 이러한 문제는 유비쿼터스 TV 서비스 등과 같은 유비쿼터스 환경을 현실화하는데 장애로 작용한다.As such, when the resolutions of the terminals are different from each other, a problem may occur in that one video data cannot be used interchangeably among several terminals. And this problem is an obstacle to realizing a ubiquitous environment such as ubiquitous TV service.

전술한 해상도 불일치라는 문제를 해결하는 한 가지 방법은 TV 등으로 전송되는 고해상도의 영상을 다운샘플링(Down-Sampling)한 후에 저해상도의 영상을 만들어서 휴대용 단말기로 전송하는 것이다. 그러나 다운샘플링된 저해상도 영상을 제공하는 것은 휴대용 단말기 등을 통해서도 고화질의 화면을 시청하고자 하는 사용자의 욕구를 만족시킬 수가 없다. 특히, 뮤직 비디오, 축구와 같은 스포츠 경기, 및 홈쇼핑 등을 시청하는 사용자는 경우에 따라서 영상의 특정 부분을 자세하게 시청하기를 원할 경우도 있는데, 다운샘플링 방법으로는 이러한 사용자의 욕구를 만족시키기가 더욱 어렵다. 그렇다고 고해상도 영상 전체를 휴대용 단말기로 전송하는 것은 한정된 전송 채널 자원을 낭비하는 것일 뿐만 아니라 휴대용 단말기에 서는 불필요한 영상 부분에 대해서도 복호화를 수행해야 하는 문제가 발생한다.One way to solve the problem of resolution mismatch described above is to down-sample a high resolution image transmitted to a TV or the like, and then make a low resolution image and transmit it to a portable terminal. However, providing a downsampled low resolution image cannot satisfy a user's desire to view a high quality screen even through a portable terminal. In particular, a user watching a music video, a sports game such as soccer, and a home shopping may sometimes want to watch a specific part of an image in detail, and the downsampling method is more suitable to satisfy the needs of these users. it's difficult. However, transmitting the entire high resolution image to the portable terminal not only wastes limited transmission channel resources, but also causes a problem that the portable terminal needs to perform decoding on an unnecessary portion of the image.

이러한 문제점을 해결하기 위하여 제시된 방법 중의 하나가 공간임의접근(Spatial Random Access)이다. 공간임의접근은 영상 중의 임의의 위치에 있는 일부 영상만을 검색하고 또한 전송 및 복원이 가능하도록 하기 위한 것으로서, 이를 이용하면 전체 영상에 관한 데이터가 아니라 사용자의 의도 및 디스플레이 윈도우의 크기를 고려한 특정 부분(대상 영상)을 디스플레이하는데 필요한 최소한의 데이터만을 전송하여 복호화하는 지역 복호화(partial decoding)가 가능하다. 따라서 공간임의접근을 지원하는 영상 코덱 시스템을 이용하면 휴대용 단말기로도 HDTV와 같은 고화질의 영상을 보는 것이 가능해질 수 있다.One method proposed to solve this problem is spatial random access (Spatial Random Access). Spatial random access is to search and transmit and restore only a part of an image in an arbitrary position of the image. When using this method, a specific part considering the user's intention and the size of the display window (not data about the entire image) can be used. It is possible to perform partial decoding for transmitting and decoding only the minimum data necessary for displaying the target image). Therefore, by using an image codec system that supports spatial random access, it may be possible to view a high-definition image such as an HDTV using a portable terminal.

휴대용 단말기의 다양화 및 대중화로 인하여 이러한 공간임의접근을 지원하는 영상 코덱 시스템은 그 필요성이 점점 증가하고 있다. 현재 엠펙(MPEG)의 하부 그룹(Subgroup) 중의 한 곳에서는 다시점 비디오 코딩(Multi-view Video Coding, MVC)과 관련하여 F-TV(Free view-point TV)와 3차원 TV(3D TV)를 타깃 어플리케이션(Target Application)으로 하여 표준화를 진행하고 있는데, 2006년 4월에 개최된 76차 엠펙 스위스 회의(76th MPEG Swiss Meeting)의 결과로 영상의 일부분만을 접근 가능하도록 하는 프리 뷰 스케일러빌러티(Free View Scalability)를 요구안으로 내놓았다. 프리 뷰 스케일러빌러티란 다시점 비디오 코딩에서 각 카메라를 통해 획득한 영상의 일정 부분만을 분리하여 복호화 가 가능하도록 하는 기술이다.Due to the diversification and popularization of portable terminals, the necessity of an image codec system supporting such a random access is increasing. At present, one of the subgroups of MPEG is the Free View-point TV (F-TV) and the 3D TV (3D TV) in relation to Multi-view Video Coding (MVC). Standardization is underway as a Target Application. As a result of the 76th MPEG Swiss Meeting held in April 2006, only part of the image is accessible. View Scalability is a requirement. Free view scalability is a technique that enables decoding by separating only a part of an image acquired through each camera in multiview video coding.

3D 영상 뿐만 아니라 파노라마 영상(panoramic image)의 경우에도 이러한 공간임의접근은 필수적으로 적용이 되어야 할 것으로 여겨지고 있다. 일반적으로 파 노라마 영상은 다양한 시점에서 촬영된 영상들을 단일 영상으로 재구성한 영상이기 때문에, 영상의 크기가 일반 영상의 수십 배 이상이 된다. 이러한 파노라마 영상 전체를 전송하고 또한 복호화하도록 하는 것은 전송 채널의 한계 및 디코더의 처리 능력 등을 고려할 때 현실적으로 불가능하다. 따라서 파노라마 영상의 경우에는 공간임의접근을 지원하는 영상 코덱 시스템을 이용하여 그 일부분만을 전송하고 복호화할 수 있다면, 특정 부분에 대하여 고화질 영상을 시청하고자 하는 사용자의 욕구를 충족시킬 뿐만 아니라 전송의 측면에서 상당히 효율적임을 알 수 있다. In the case of panoramic images as well as 3D images, such spatial random access is considered to be essential. In general, since a panorama image is a reconstructed image of images taken at various viewpoints into a single image, the size of the image is more than tens of times that of the general image. It is practically impossible to transmit and decode the entire panoramic image considering the limitations of the transmission channel and the processing power of the decoder. Therefore, in the case of panoramic video, if only a part of the video can be transmitted and decoded using an image codec system that supports spatial random access, it not only satisfies a user's desire to watch high-definition video for a specific part, but also in terms of transmission. It can be seen that it is quite efficient.

그런데 현재까지 개발되거나 제안된 영상 코덱 시스템, 즉 영상 부호화/복호화 장치 및 방법으로는 이러한 공간임의접근을 효율적으로 실현하기가 어렵다. 그리고 현재까지는 해상도가 다른 여러 가지 디스플레이 장치가 공존하는 방송이나 통신 서비스의 필요성이 높지 않았기 때문에, 이러한 공간임의접근을 실현할 수 있는 구체적인 방법에 대한 폭 넓은 연구도 이루어지지 않았다. 다만, 최근에 이에 대한 연구가 서서히 진행되고 있는데, 대표적인 것으로 Carsten Grnheit 등에 의하여 제안된 논문 "Efficient Representation and Interactive Streaming"(Proc. ICIP 2002, IEEE Internation Conference on Image Processing, Rochester, NY, USA, September 22-25, 2002)이 있다.However, it is difficult to efficiently realize such spatial random access with the image codec system developed or proposed to date, that is, the image encoding / decoding apparatus and method. Since the necessity of broadcasting or communication service in which various display devices having different resolutions coexist so far has not been high, there has been no extensive research on a concrete method for realizing such a random access. In recent years, however, research on this topic has been slowly progressing. A representative article, "Efficient Representation and Interactive Streaming" proposed by Carsten Grnheit et al. (Proc. ICIP 2002, IEEE Internation Conference on Image Processing, Rochester, NY, USA, September 22) -25, 2002).

상기 논문에서는 인트라 모드(Intra Mode)인 전체 영상을 여러 개의 조각 영상으로 나누어서 각 조각 영상 단위로 부호화하여 전송하고 복호화하는 영상 코덱 시스템을 제안하고 있다. 상기 논문에서 제안된 영상 코덱 시스템은 디스플레이의 크기와 전체 영상에서 사용자가 시청하기를 원하는 영역의 영상(대상 영상)의 위치 및 크기를 고려하여, 전체 영상에 대한 데이터 전부가 아니라 대상 영상이 포함되는 조각 영상에 대한 데이터만을 전송하여 복호화하도록 한다.In this paper, we propose an image codec system that encodes, transmits, and decodes an intra mode whole image into a plurality of fragment images, and encodes each fragment image unit. The image codec system proposed in the above paper includes the target image, not all of the data for the entire image, in consideration of the size of the display and the position and size of the image (target image) of the region that the user wants to view in the entire image. Only the data of the fragment image is transmitted and decoded.

도 1a 내지 도 1c에는 상기 논문에서 제시한 방법에 의할 경우에, 전체 영상을 구획하는 조각 영상의 크기와 대상 영상을 디스플레이하기 위해서 복호화 장치로 전송해야 하는 영역과의 관계를 보여주는 도면이 도시되어 있다. 여기서, '대상 영상'이라 함은, 전체 영상에서 사용자가 시청하기를 원하는 디스플레이 윈도우 크기에 해당하는 부분 영상을 일컫는다.1A to 1C are diagrams showing the relationship between the size of a fragment image partitioning an entire image and an area to be transmitted to a decoding apparatus in order to display a target image in the case of the method presented in the paper. have. Here, the “target image” refers to a partial image corresponding to a display window size that the user wants to watch in the entire image.

도 1a 내지 도 1c를 참조하면, 대상 영상의 크기가 일정한 경우에, 조각 영상의 단위 크기가 작아질수록 복호화 장치로 전송해야 하는 영역이 상대적으로 작아진다는 것을 알 수 있다. 예컨대, 도 1c에 도시된 바와 같이, 조각 영상의 크기를 아주 작게 할 경우에는 대상 영상의 크기와 전송 영역의 크기가 거의 비슷하게 된다. 그리고 이와 같이 복호화 장치로 전송해야 하는 영역이 작아질 경우에는, 그에 비례하여 대상 영상에 포함되지 않는 영역에 대한 데이터의 전송을 감소시킬 수 있기 때문에, 그 만큼 전송 효율을 향상시킬 수가 있다.1A to 1C, it can be seen that when the size of the target image is constant, as the unit size of the fragment image is smaller, the area to be transmitted to the decoding apparatus is relatively smaller. For example, as shown in FIG. 1C, when the size of the fragment image is made very small, the size of the target image and the size of the transmission area are almost similar. When the area to be transmitted to the decoding apparatus is reduced in this way, the transmission of data to the area not included in the target video can be reduced in proportion to the area, so that the transmission efficiency can be improved by that amount.

그런데 일반적인 영상 부호화 기술들은 인트라 모드 영상에 대하여 화면내 예측 부호화를 수행한다. 이것은 화면내 예측 부호화가 인접 블록의 화소값을 이용하여 부호화하기 때문에 예측없이 부호화하는 것에 비하여 압축율을 향상시킬 수 있기 때문이다. 따라서 전체 화면을 다수의 조각 영상으로 분할하여, 각 조각 영상 단위로 독립적으로 부호화를 수행하는 것은 이러한 화면내 예측 부호화를 수행하는 취지에 반한다. 결국, 화면내 예측 부호화를 수행하는 인트라 영상인 경우에는 조 각 영상의 단위 크기를 일정 크기 이하(예컨대, 도 1c와 같은 크기)로 줄이는 것은 부호화 효율을 고려할 때 바람직하지 못하다.However, general image encoding techniques perform intra prediction encoding on intra mode images. This is because the intra-prediction encoding uses the pixel value of the adjacent block to encode the compression ratio compared to the encoding without the prediction. Therefore, splitting the entire screen into a plurality of fragment images and performing encoding independently for each fragment image unit is contrary to the purpose of performing such intra prediction encoding. As a result, in the case of an intra image performing intra prediction encoding, it is not preferable to reduce the unit size of the fragment image to a predetermined size or less (for example, the size as shown in FIG. 1C) in consideration of coding efficiency.

전술한 바와 같이, 전체 영상을 조각 영상으로 나누고 모든 조각 영상에 대하여 일률적으로 부호화하는 기존의 공간임의접근을 지원하는 영상 부호화 방법은 조각 영상의 크기를 작게 줄여야만 공간임의접근의 효율, 즉 전송 효율과 복호화 효율을 높일 수 있다. 하지만 조각 영상의 크기를 줄이는 것은 압축 효율을 떨어뜨린다는 점에서 문제가 있다. 반대로 조각 영상의 크기를 크게 하게 하는 것은 조각 영상 단위로 부호화를 수행하고자 하는 취지에 반하는 것으로서, 불필요한 영역에 대한 전송 및 복호화라는 문제를 발생시킨다. As described above, an image encoding method that supports an existing spatial random access method that divides an entire image into fragment images and uniformly encodes all fragment images, reduces the size of the fragment image to reduce the size of the fragment image. And the decoding efficiency can be improved. However, reducing the size of the fragment image is problematic in that it reduces the compression efficiency. On the contrary, increasing the size of the fragment image is contrary to the purpose of encoding the fragment image unit, and causes a problem of transmitting and decoding an unnecessary region.

본 발명은 이러한 문제들을 해결하기 위한 것으로서, 본 발명이 이루고자 하는 기술적 과제는 한정된 전송 채널을 효율적으로 이용할 수 있으며, 디스플레이 기기의 종류에 상관없이 3차원 영상이나 파노라마 영상 또는 일반 영상에 대하여 사용자가 원하는 특정 부분에 대하여 고해상도의 영상을 시청할 수 있도록 공간임의접근을 지원하는 영상 코덱 시스템, 이를 구성하는 영상 부호화 서버 및 영상 복호화 장치와 그 방법을 제공하는 것이다.SUMMARY OF THE INVENTION The present invention has been made to solve these problems, and the technical problem to be achieved by the present invention is to efficiently use a limited transmission channel, regardless of the type of display device, the user wants for 3D, panoramic or general video The present invention provides an image codec system that supports spatial random access to view a high resolution image of a specific portion, an image encoding server, an image decoding apparatus, and a method thereof.

상기한 기술적 과제를 달성하기 위한 본 발명의 일 실시예는 전체 영상의 일 부인 대상 영상에 대한 공간임의접근을 지원하는 영상 부호화 장치로서, 상기 전체 영상에 대하여 화면내 예측 부호화를 수행하여 영상 비트스트림을 생성하기 위한 영상 부호화 유닛, 화면내 예측 부호화 과정에서 상기 전체 영상의 현재 블록에 대한 예측 블록을 생성하는데 이용된 예측 데이터(Prediction Data, P-Data)를 저장하기 위한 예측 데이터 메모리, 및 상기 전체 영상에 대한 영상 비트스트림과 상기 예측 데이터 메모리에 저장된 상기 예측 데이터에서 상기 대상 영상에 대한 영상 비트스트림과 예측 데이터를 각각 추출하기 위한 추출기를 포함한다.An embodiment of the present invention for achieving the above technical problem is an image encoding apparatus that supports a spatial random access to the untitled target image of the entire image, the image bitstream by performing intra prediction encoding on the entire image An image coding unit for generating an image, a prediction data memory for storing prediction data (P-Data) used for generating a prediction block for a current block of the entire image in an intra prediction encoding process, and the whole And an extractor for extracting the image bitstream and the prediction data for the target image from the image bitstream for the image and the prediction data stored in the prediction data memory.

상기 실시예의 일 측면에 의하면, 상기 예측 데이터는 상기 현재 블록에 인접한 픽셀의 복호화된 화소값을 포함할 수 있다. 그리고 상기 예측 데이터 메모리에는 상기 현재 블록에 인접한 블록의 예측 모드 데이터 및/또는 상기 현재 블록에 인접한 블록의 영이 아닌 계수의 개수(Number of Non-Zero coefficients, NNZ) 데이터도 함께 저장될 수 있다.According to an aspect of the embodiment, the prediction data may include a decoded pixel value of a pixel adjacent to the current block. The prediction data memory may also store prediction mode data of a block adjacent to the current block and / or number of non-zero coefficients (NNZ) data of a block adjacent to the current block.

상기 실시예의 다른 측면에 의하면, 상기 추출기는 상기 전체 영상에 대한 영상 비트스트림 중에서 상기 대상 영상에 대한 영상 비트스트림을 추출하기 위한 영상 비트스트림 추출기 및 상기 전체 영상에 대한 예측 데이터 중에서 상기 대상 영상에 대한 예측 데이터를 추출하기 위한 예측 데이터 추출기를 포함할 수 있다.이 경우에 상기 예측 데이터 추출기는 상기 전체 영상에 대한 예측 데이터 중에서 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 매크로블록에 대한 예측 데이터를 추출할 수 있다.According to another aspect of the embodiment, the extractor extracts an image bitstream for the target image from the image bitstream of the entire image and the prediction data for the entire image from the image bitstream extractor The predictive data extractor may include a predictive data extractor for extracting predictive data. In this case, the predictive data extractor may extract a predictive data extractor for the uppermost, leftmost, and / or rightmost macroblock of the target image. Predictive data can be extracted.

상기한 기술적 과제를 달성하기 위한 본 발명의 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 복호화 장치로서, 상기 대상 영상 전체에 영상 비트스트림과 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 블록에 대한 예측 데이터(P-Data)를 이용하여 화면내 예측 복호화를 수행함으로써 상기 대상 영상을 복원하기 위한 영상 복호화 유닛을 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image decoding apparatus that supports the spatial random access to the target image that is part of the entire image, the image bitstream and the uppermost side of the target image, And an image decoding unit for reconstructing the target image by performing intra prediction prediction decoding using prediction data (P-Data) for the leftmost and / or rightmost block.

상기 실시예의 일 측면에 의하면, 상기 영상 복호화 유닛으로 보내지는 상기 예측 데이터를 저장하기 위한 예측 데이터 메모리를 더 포함할 수 있다.According to an aspect of the embodiment, it may further comprise a prediction data memory for storing the prediction data sent to the image decoding unit.

상기한 기술적 과제를 달성하기 위한 본 발명의 일 실시예에 따른 영상 코덱 시스템은 전술한 본 발명의 실시예에 따른 영상 부호화 장치와 영상 복호화 장치를 포함한다.An image codec system according to an embodiment of the present invention for achieving the above technical problem includes an image encoding apparatus and an image decoding apparatus according to the embodiment of the present invention described above.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 부호화 장치로서, 상기 전체 영상에 대하여 소정 크기의 블록 단위로 독립적으로 부호화를 수행하여 제1 영상 비트스트림을 생성하기 위한 제1 영상 부호화 유닛, 상기 전체 영상에 대하여 화면내 예측 부호화를 수행하여 제2 영상 비트스트림을 생성하기 위한 제2 영상 부호화 유닛, 상기 제1 영상 부호화 유닛에서 생성된 상기 제1 영상 비트스트림의 복호화된 제1 화소값과 상기 제2 영상 부호화 유닛에서 제공된 상기 제2 영상 비트스트림의 복호화된 제2 화소값의 차분값인 불일치 데이터(Mismatch Data, M-Data)를 블록 단위로 구하여 저장하기 위한 불일치 데이터 메모리, 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 블록에 대해서는 상기 제1 영상 비트스트림으로부터 영상 비트스트림을 추출하고, 상기 대상 영상의 블록들 중에서 상기 최상측, 최좌 측, 및 최우측 블록이 아닌 블록들에 대해서는 상기 제2 영상 비트스트림으로부터 비트스트림을 추출하기 위한 영상 비트스트림 추출기, 및 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 블록에 대해서는 상기 불일치 데이터를 추출하기 위한 불일치 데이터 추출기를 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image encoding apparatus that supports the spatial random access to the target image that is part of the entire image, the encoding independently of the entire image in a block unit of a predetermined size A first image encoding unit for performing a first image bitstream by performing an intra-prediction encoding on the entire image to generate a second image bitstream, and the first image encoding unit Mismatch data, M−, which is a difference value between the decoded first pixel value of the first image bitstream generated by the decoded second pixel value of the second image bitstream provided by the second image coding unit. Inconsistent data memory for acquiring and storing Data in blocks, and the top, leftmost, and / or rightmost sides of the target image. A video bitstream is extracted from the first video bitstream for the block, and a bitstream from the second video bitstream for the blocks other than the top, left, and rightmost blocks among the blocks of the target video. And a mismatch data extractor for extracting the mismatch data for the uppermost, leftmost, and / or rightmost blocks of the target image.

상기 실시예의 일 측면에 의하면, 상기 불일치 데이터 메모리에는 각 블록의 최하측열 및 최우측행에 위치한 화소들에 대한 상기 차분값을 저장할 수 있다. 그리고 상기 불일치 데이터 메모리에는 인접 블록의 예측 모드 데이터 및/또는 인접 블록의 영이 아닌 계수의 개수(Number of Non-Zero coefficients, NNZ) 데이터를 더 저장될 수 있다.According to an aspect of the embodiment, the difference value for the pixels located in the lowermost column and the rightmost row of each block may be stored in the mismatched data memory. The mismatch data memory may further store prediction mode data of adjacent blocks and / or number of non-zero coefficients (NNZ) data of adjacent blocks.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 복호화 장치로서, 상기 대상 영상의 제1 영역은 소정 크기의 블록 단위로 독립 복호화를 수행하고, 상기 대상 영상의 제2 영역은 소정 크기의 블록 단위로 화면내 예측 복호화를 수행하며, 상기 제2 영역에 대한 화면내 예측 복호화에 이용되는 인접 블록의 복호화된 화소값이 상기 제1 영역에 속하는 픽셀인 경우에는 상기 독립 복호화된 화소값에 불일치 데이터를 가산한 후에 화면내 예측 복호화를 수행하는 영상 복호화 유닛을 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image decoding apparatus that supports the spatial random access to the target image that is part of the entire image, the first region of the target image is independent of the block unit of a predetermined size Decoding is performed, and an intra prediction prediction decoding is performed on a second area of the target image in units of blocks of a predetermined size, and a decoded pixel value of an adjacent block used for intra prediction prediction decoding is applied to the second area. In the case of a pixel belonging to one region, an image decoding unit which performs intra prediction prediction decoding after adding mismatched data to the independent decoded pixel value is included.

상기 실시예의 일 측면에 의하면, 상기 제1 영역은 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측에 위치하는 블록들을 포함한다. 그리고 상기 불일치 데이터를 가산하는 화소값은 상기 제1 영역에 속하는 각 블록에서 최우측행 및/또 는 최하측열에 위치하는 픽셀들의 화소값일 수 있다.According to an aspect of the embodiment, the first area includes blocks located at the top, leftmost, and / or rightmost sides of the target image. The pixel value for adding the mismatched data may be pixel values of pixels located in the rightmost row and / or the lowest column in each block belonging to the first region.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예에 따른 영상 코덱 시스템은 전술한 본 발명의 실시예에 따른 영상 부호화 장치와 영상 복호화 장치를 포함한다.An image codec system according to another embodiment of the present invention for achieving the above technical problem includes an image encoding apparatus and an image decoding apparatus according to the embodiment of the present invention described above.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 영상 부호화 방법은 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 부호화 방법으로서, 상기 전체 영상에 대하여 화면내 예측 부호화를 수행하여 영상 비트스트림을 생성함과 동시에 상기 화면내 예측 부호화를 수행하는 과정에서 예측 블록을 생성하는데 이용된 예측 데이터(Prediction Data, P-Data)를 저장해두는 영상 부호화 단계, 및 사용자로부터 상기 대상 영상에 대한 전송 요청이 있는 경우에 상기 영상 비트스트림과 상기 예측 데이터에서 상기 대상 영상에 대한 영상 비트스트림과 예측 데이터를 각각 추출하여 전송하는 단계를 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image encoding method that supports spatial random access to a target image that is part of an entire image, and performs intra prediction encoding on the entire image. Generating an image bitstream and storing prediction data (P-Data) used to generate a prediction block in the process of performing the intra prediction encoding, and from the user to the target image. And extracting and transmitting the image bitstream and the prediction data of the target image from the image bitstream and the prediction data, respectively, when there is a request for transmission.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 복호화 방법으로서, 상기 대상 영상에 영상 비트스트림과 함께 예측 데이터를 수신하는 단계, 상기 예측 데이터를 이용하여 예측 블록을 생성하는 단계, 및 상기 영상 비트스트림과 상기 예측 블록을 이용하여 상기 대상 영상을 복원하는 단계를 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image decoding method for supporting a spatial random access to the target image that is part of the entire image, the step of receiving prediction data with the image bitstream in the target image Generating a prediction block using the prediction data, and reconstructing the target image using the image bitstream and the prediction block.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 부호화 방법으로서, 상기 전체 영상에 대하여 소정 크기의 블록 단위로 독립적으로 부호화를 수행하여 제 1 영상 비트스트림을 생성하는 단계, 상기 전체 영상에 대하여 화면내 예측 부호화를 수행하여 제2 영상 비트스트림을 생성하는 단계, 상기 제1 영상 비트스트림의 복호화된 제1 화소값과 상기 제2 영상 비트스트림의 복호화된 제2 화소값의 차분값인 불일치 데이터(Mismatch Data, M-Data)를 블록 단위로 구하여 저장하는 단계, 사용자로부터 상기 대상 영상에 대한 전송 요청이 있는 경우에 상기 제1 영상 비트스트림, 제2 영상 비트스트림 및 상기 불일치 데이터 중에서, 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 블록에 대해서는 상기 제1 영상 비트스트림으로부터 영상 비트스트림을 추출하고, 상기 대상 영상의 블록들 중에서 상기 최상측, 최좌측, 및 최우측 블록이 아닌 블록들에 대해서는 상기 제2 영상 비트스트림으로부터 영상 비트스트림을 추출하며, 상기 대상 영상의 최상측, 최좌측, 및/또는 최우측 블록에 대해서는 상기 불일치 데이터도 함께 추출하여 전송하는 단계를 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image encoding method that supports the spatial random access to the target image that is part of the entire image, the encoding independently of the entire image in a block unit of a predetermined size Generating a first image bitstream by performing intra-prediction encoding on the entire image to generate a second image bitstream, and decoding the first pixel value of the first image bitstream and the first image bitstream. Obtaining and storing mismatch data (M-Data), which is a difference value of the decoded second pixel value of the second image bit stream, in units of blocks, and when the user requests a transmission of the target image, Among the image bitstream, the second image bitstream, and the mismatched data, the topmost, leftmost, and / or rightmost of the target image. An image bitstream is extracted from the first image bitstream for a block, and image bits are extracted from the second image bitstream for blocks other than the uppermost, leftmost, and rightmost blocks among the blocks of the target image. Extracting a stream, and extracting and transmitting the mismatched data for the uppermost, leftmost, and / or rightmost blocks of the target video.

상기한 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예는 전체 영상의 일부인 대상 영상에 대한 공간임의접근을 지원하는 영상 복호화 방법으로서, 상기 대상 영상에 제1 영상 비트스트림, 제2 영상 비트스트림, 및 불일치 데이터를 수신하는 단계, 상기 제1 영상 비트스트림을 복호화하여 상기 대상 영상의 제1 영역에 대한 화소값을 복원하는 단계, 상기 불일치 데이터를 이용하여 상기 복원한 제1 영역에 대한 화소값을 보정하는 단계, 상기 보정된 상기 제1 영역에 대한 화소값과 상기 제2 영상 비트스트림을 이용하여 화면내 예측 복호화를 수행함으로써 상기 대상 영상의 제2 영역에 대한 화소값을 복원하는 단계를 포함한다.Another embodiment of the present invention for achieving the above technical problem is an image decoding method for supporting a spatial random access to the target image that is part of the entire image, the first image bitstream, the second image bitstream to the target image And restoring pixel values for the first region of the target image by decoding the first image bitstream, and pixel values for the restored first region using the mismatch data. Reconstructing the pixel value of the second region of the target image by performing intra prediction prediction decoding using the corrected pixel value of the first region and the second image bitstream. do.

이하에서는, 첨부 도면을 참조하여 본 발명의 바람직한 실시예에 대하여 상세하게 설명한다. 후술하는 실시예는 본 발명의 기술적 사상을 설명하기 위한 목적이므로, 본 발명의 기술적 사상은 이 실시예에 의하여 한정되는 것으로 해석되어서는 안된다. 그리고 이하의 실시예의 설명에서 각각의 구성요소의 명칭은 당업계에서 다른 명칭으로 호칭될 수 있는데, 이들이 기능적 유사성과 동일성이 있다면 비록 다른 명칭을 사용하더라도 본 발명의 실시예와 균등한 구성이라고 볼 수 있다. 마찬가지로 도면상의 구성을 일부 변형한 실시예가 채용되더라도 기능적 유사성과 동일성이 있다면 양자는 균등한 구성으로 볼 수 있다. 본 실시예에 대한 설명 및 도면에서 각각의 구성요소에 부가된 참조 부호는 단지 설명의 편의를 위하여 기재된 것일 뿐이다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. Since the embodiments described below are for the purpose of illustrating the technical idea of the present invention, the technical idea of the present invention should not be construed as being limited by the embodiments. In the description of the following embodiments, the names of each component may be referred to as other names in the art, and if they have functional similarity and identity, they may be regarded as equivalent to the embodiments of the present invention even though other names are used. have. Similarly, even if an embodiment in which the configuration on the drawings is partially modified is adopted, both can be regarded as an equivalent configuration if there is functional similarity and identity. Reference numerals added to the respective components in the description of the embodiment and the drawings are merely described for convenience of description.

본 발명에 따른 공간임의접근을 지원하는 영상 코덱 시스템, 이를 구성하는 영상 부호화 서버 및 영상 복호화 장치와 그 방법은 인트라 모드 영상을 부호화 또는 복호화하기 위한 장치 또는 방법이다. 그리고 본 발명에서는 상기한 기술적 과제를 달성하기 위하여 예측 데이터(Prediction Data, P-Data) 또는 불일치 데이터(Mismatch Data, M-Data)를 영상 비트스트림과는 별도로 영상 복호화 장치로 전송한다. 이하에서는 이들 각각에 대하여 상세히 설명한다.An image codec system supporting spatial random access according to the present invention, an image encoding server and an image decoding apparatus constituting the same, and a method thereof are an apparatus or method for encoding or decoding an intra mode image. In order to achieve the above technical problem, the present invention transmits prediction data (P-Data) or mismatch data (M-Data) to the image decoding apparatus separately from the image bitstream. Hereinafter, each of these will be described in detail.

제1 실시예First Embodiment

본 발명의 제1 실시예에 따른 영상 코덱 시스템은 P-Data를 이용한다. 'P- Data'는 화면내 예측 부호화시에 현재 블록의 화소값을 예측하는데 이용되는 인접 픽셀의 복호화된 화소값을 가리킨다. 래스터 스캔(Raster Scan) 순서에 따라서 각 매크로블록을 순차적으로 부호화/복호화하는 경우에, 상기 P-Data는 현재 매크로블록의 왼쪽 및/또는 위쪽에 위치하는 매크로블록들로부터 제공된다.The video codec system according to the first embodiment of the present invention uses P-Data. 'P-Data' refers to the decoded pixel values of adjacent pixels used to predict the pixel values of the current block during intra prediction encoding. In the case of sequentially encoding / decoding each macroblock according to a raster scan order, the P-Data is provided from macroblocks located to the left and / or above the current macroblock.

도 2는 래스터 스캔 순서에 따라서 복호화를 하는 경우에 현재 매크로블록의 복호화에 필요한 P-Data를 제공하는 주변 매크로블록을 보여주는 블록도이다. 도 2를 참조하면, 현재 매크로블록을 복호화하기 위해서는 상기 현재 매크로블록에 대하여 왼쪽과 위쪽으로 인접한 주변 매크로블록들의 복호화된 화소값이 요구된다. 그리고 상기 주변 매크로블록을 복호화하기 위해서는 또 다시 이 매크로블록의 왼쪽으로 인접한 주변 매크로블록들과 위쪽으로 인접한 주변 매크로블록들의 복호화된 화소값이 요구된다. 결국, 현재 매크로블록을 복호화하기 위해서는 상기 현재 매크로블록에 대하여 왼쪽에 위치하는 주변 매크로블록들과 위쪽에 위치하는 주변 매크로블록들 모두에 대한 복호화가 선행되어야 한다.FIG. 2 is a block diagram illustrating neighboring macroblocks that provide P-Data required for decoding a current macroblock when decoding according to a raster scan order. Referring to FIG. 2, in order to decode a current macroblock, decoded pixel values of neighboring macroblocks left and up adjacent to the current macroblock are required. In order to decode the neighboring macroblock, the decoded pixel values of neighboring macroblocks adjacent to the left side of the macroblock and neighboring macroblocks upwardly are required. As a result, in order to decode the current macroblock, decoding of both neighboring macroblocks located on the left side and neighboring macroblocks located on the upper side should be preceded.

그런데, 상기 현재 매크로블록의 복호화를 위하여 도 2에 도시된 모든 블록에 대한 영상 비트스트림을 영상 복호화 장치로 전송하고 또한 복호화하도록 하는 것은 과다한 데이터의 전송으로 전송 효율을 떨어뜨릴 뿐만 아니라 디스플레이할 필요가 없는 영상 비트스트림에 대해서도 복호화를 수행해야 하는 문제가 있다. 이러한 문제는 전체 영상을 하나의 인트라 모드 영상으로 간주하고 부호화/복호화를 수행하는 경우뿐만 아니라 전체 영상을 다수의 조각 영상으로 나누어서 조각 영상 단위로 부호화/복호화를 수행하는 경우에도 발생한다. However, transmitting and decoding the video bitstreams of all the blocks shown in FIG. 2 to the video decoding apparatus for decoding the current macroblock need not only reduce the transmission efficiency but also display the excessive data transmission. There is a problem that decoding should be performed even for a missing image bitstream. This problem occurs not only when the entire image is regarded as an intra mode image and the encoding / decoding is performed, but also when the entire image is divided into a plurality of fragment images and the encoding / decoding is performed in units of fragment images.

본 실시예에서는 이러한 문제점들을 해결하기 위하여 부호화시에 P-Data 등을 별도로 저장하여 두며, 사용자가 디스플레이하기를 원하는 대상 영상에 대한 영상 비트스트림을 복원하는데 있어서 필요한 P-Data 등만을 추출하여 영상 복호화 장치로 전송하도록 한다. 그리고 영상 복호화 장치에서는 수신된 대상 영상에 대한 영상 비트스트림과 수신된 P-Data 등을 이용하여 복호화를 수행한다.In this embodiment, in order to solve these problems, P-Data and the like are separately stored during encoding, and only the P-Data and the like necessary for reconstructing the image bitstream for the target image that the user wants to display are extracted and decoded. To the device. In addition, the image decoding apparatus performs decoding by using the image bitstream and the received P-Data of the received target image.

도 3은 본 발명의 일 실시예에 따른 영상 코덱 시스템(100)의 구성을 보여주는 블록도이다. 도 3을 참조하면, 영상 코덱 시스템(100)은 영상 부호화 장치(110)와 영상 복호화 장치(120)를 포함한다. 상기 영상 부호화 장치(110)는 영상 부호화 유닛(112), 제1 P-Data 메모리(114), 영상 비트스트림 추출기(116), 및 P-Data 추출기(118)를 포함한다. 그리고 영상 복호화 장치(120)는 영상 복호화 유닛(122)과 제2 P-Data 메모리(124)를 포함한다.3 is a block diagram illustrating a configuration of an image codec system 100 according to an embodiment of the present invention. Referring to FIG. 3, the image codec system 100 includes an image encoding apparatus 110 and an image decoding apparatus 120. The image encoding apparatus 110 includes an image encoding unit 112, a first P-Data memory 114, an image bitstream extractor 116, and a P-Data extractor 118. The image decoding apparatus 120 includes an image decoding unit 122 and a second P-Data memory 124.

먼저, 영상 부호화 장치(110)에 대하여 설명한다. First, the video encoding apparatus 110 will be described.

영상 부호화 장치(110)의 영상 부호화 유닛(112)은 입력되는 전체 영상을 소정의 부호화 방법을 이용하여 압축함으로써 영상 비트스트림을 생성하기 위한 수단이다. 이러한 영상 부호화 유닛(112)은 H.264/AVC 또는 VC-1 등과 같은 영상 압축에 관한 표준 규격에 부합하는 엔코더(Encoder)인 것이 바람직하지만, 이에 한정되는 것은 아니다. 생성된 영상 비트스트림은 소정의 저장 수단에 저장되거나 또는 실시간 전송의 경우에는 버퍼 메모리 등에 저장되어 있을 수 있다.The image encoding unit 112 of the image encoding apparatus 110 is a means for generating an image bitstream by compressing the entire input image using a predetermined encoding method. The image encoding unit 112 is preferably an encoder conforming to a standard for image compression such as H.264 / AVC or VC-1, but is not limited thereto. The generated image bitstream may be stored in a predetermined storage means or in a buffer memory or the like in the case of real time transmission.

그리고 본 실시예에 따른 영상 부호화 유닛(112)은 인트라 모드 영상에 대하여 화면내 예측 부호화를 지원하는 엔코더로서, 입력 영상을 소정의 크기를 갖는 블록 단위로 부호화를 수행한다. 예를 들어, 영상 부호화 유닛(112)은 인트라 모드 영상에 대하여 매크로블록 단위로 화면내 예측 부호화를 수행할 수 있지만, 여기에만 한정되는 것은 아니다. 예를 들어, 상기 영상 부호화 유닛(112)은 매크로블록, 4×4 블록 또는 8×8 블록 단위로 부호화를 수행하거나 또는 단위 블록의 크기를 적응적으로 변경하면서 부호화를 수행할 수도 있다.The image encoding unit 112 according to the present embodiment is an encoder that supports intra prediction encoding on an intra mode image, and encodes an input image in units of blocks having a predetermined size. For example, the image encoding unit 112 may perform intra prediction encoding on a macroblock basis for an intra mode image, but is not limited thereto. For example, the image encoding unit 112 may perform encoding in units of macroblocks, 4 × 4 blocks, or 8 × 8 blocks, or adaptively change the size of the unit block.

제1 P-Data 메모리(114)는 P-Data 즉, 영상 부호화 유닛(112)에서 부호화시에 현재 매크로블록의 화소값을 예측하는데 이용되는 인접 픽셀의 복호화된 화소값을 저장하기 위한 수단이다. 그리고 실시예에 따라서는 제1 P-Data 메모리(114)는 현재 매크로블록을 화면내 예측 부호화하는데 있어서 필요한 인접 블록의 메타 데이터 정보도 저장한다. 이를 위하여, 영상 부호화 유닛(112)은 부호화 과정에서 생성되는 P-Data 등을 제1 P-Data 메모리(114)로 보낸다. 이러한 제1 P-Data 메모리(114)는 영상 부호화 장치(110)의 다른 구성 요소, 예컨대 영상 부호화 유닛(112)에 구비되어 있는 프레임 메모리나 영상 비트스트림 저장용 메모리 등에 통합되어 있거나 또는 이들과 분리되어 독립적인 저장 유닛을 형성할 수도 있다.The first P-Data memory 114 is a means for storing P-Data, i.e., decoded pixel values of neighboring pixels used to predict pixel values of the current macroblock at the time of encoding in the image encoding unit 112. According to an exemplary embodiment, the first P-Data memory 114 also stores metadata information of neighboring blocks necessary for intra prediction encoding the current macroblock. To this end, the image encoding unit 112 sends the P-Data generated in the encoding process to the first P-Data memory 114. The first P-Data memory 114 may be integrated with or separated from other components of the image encoding apparatus 110, for example, a frame memory or an image bitstream storage memory included in the image encoding unit 112. May form an independent storage unit.

그리고 제1 P-Data 메모리(114)에 저장되는 P-Data의 종류 및/또는 구체적인 값은 영상 부호화 유닛(112)에서 지원하는 화면내 예측 부호화 방법에 따라서 달라질 수 있다. 예를 들어, 상기 영상 부호화 유닛(112)이 H.264/AVC 표준 규격에 부합하는 엔코더인 경우에, 상기 제1 P-Data 메모리(114)에 저장되는 P-Data는 현재 매크로블록에 인접한 픽셀의 재구성 화소값(재구성 화소 데이터)이 된다. 그리고 제1 P-Data 메모리(114)에는 현재 블록이 4×4 블록 모드인 경우에 이에 인접한 4 ×4 블록의 예측 모드를 지시하는 정보(예측 모드 데이터) 및/또는 현재 블록이 4×4 블록 모드인 경우에 이에 인접한 4×4 블록의 영이 아닌 계수의 개수(Number of Non-Zero coefficients, NNZ)를 지시하는 정보(NNZ 데이터) 등을 저장될 수 있는데, 이들에 대해서는 뒤에서 상세하게 설명한다.The type and / or specific value of the P-Data stored in the first P-Data memory 114 may vary according to the intra prediction encoding method supported by the image encoding unit 112. For example, when the image encoding unit 112 is an encoder conforming to the H.264 / AVC standard, the P-Data stored in the first P-Data memory 114 is a pixel adjacent to the current macroblock. Is a reconstructed pixel value (reconstructed pixel data). The first P-Data memory 114 may include information (prediction mode data) indicating a prediction mode of a 4 × 4 block adjacent to the current block in the 4 × 4 block mode and / or a 4 × 4 block in the current block. In the case of the mode, information (NNZ data) indicating the number of non-zero coefficients (NNZ) of 4x4 blocks adjacent thereto may be stored, which will be described in detail later.

영상 비트스트림 추출기(116)는 전체 영상에 대한 비트스트림 중에서 사용자가 원하는 대상 영상에 대한 비트스트림만을 추출하는 기능을 수행한다. 즉, 사용자가 전체 영상 중에서 시청하기를 원하는 부분의 위치(대상 영상)를 특정하여 데이터를 요청한 경우에, 영상 비트스트림 추출기(116)는 이 대상 영상을 구성하는 매크로블록들에 대한 영상 비트스트림을 추출한다.The image bitstream extractor 116 extracts only the bitstream of the target image desired by the user from among the bitstreams of the entire image. That is, when a user requests data by specifying a position (target image) of a portion that the user wants to watch among the entire images, the image bitstream extractor 116 generates an image bitstream for macroblocks constituting the target image. Extract.

그리고 P-Data 추출기(118)는 제1 P-Data 메모리(114)에 저장되어 있는 데이터들 중에서 사용자가 원하는 대상 영상에 대한 P-Data 등을 추출하는 기능을 수행한다. 전술한 바와 같이, 상기 P-Data는 대상 영상을 구성하는 매크로블록의 왼쪽 및/또는 위쪽으로 인접한 픽셀들의 복호화된 화소값일 수 있으며, 이들 복화화된 화소값은 각 매크로블록에 대한 데이터로써 제1 P-Data 메모리(114)에 저장되어 질 수 있다. 본 실시예에 의하면, P-Data 추출기(118)는 상기 대상 영상을 구성하는 매크로블록 전체에 대한 P-Data를 추출할 필요는 없으며, 상기 대상 영상의 최좌측, 최우측 및 최상측에 위치하는 매크로블록에 대한 P-Data만 추출하면 충분하다. 이에 대한 자세한 내용도 P-Data에 대한 상세한 설명과 함께 후술하기로 한다.The P-Data extractor 118 performs a function of extracting P-Data or the like for a target image desired by a user from among data stored in the first P-Data memory 114. As described above, the P-Data may be decoded pixel values of pixels adjacent to the left and / or upward of the macroblock constituting the target image, and the decoded pixel values are data for each macroblock as the first data. It may be stored in the P-Data memory 114. According to the present embodiment, the P-Data extractor 118 does not need to extract the P-Data for the entire macroblock constituting the target image, and is located at the leftmost, rightmost and topmost positions of the target image. It is enough to extract only the P-Data for the macroblock. Details thereof will be described later along with a detailed description of the P-Data.

계속해서, 영상 복호화 장치(120)에 대해서 설명한다.Subsequently, the video decoding apparatus 120 will be described.

영상 복호화 장치(120)의 영상 복호화 유닛(122)은 사용자가 시청하기를 원 하는 대상 영상을 복원하기 위한 수단이다. 이러한 영상 복호화 유닛(122)은 대상 영상에 대한 영상 비트스트림(즉, 영상 비트스트림 추출기(116)에서 추출된 비트스트림)과 대상 영상에 대한 P-Data(즉, P-Data 추출기(118)에서 추출된 P-Data) 등을 이용하여 대상 영상을 복원한다.The image decoding unit 122 of the image decoding apparatus 120 is a means for reconstructing a target image that the user wants to watch. The image decoding unit 122 may include an image bitstream (ie, a bitstream extracted by the image bitstream extractor 116) for the target image and a P-Data (ie, P-Data extractor 118) for the target image. The target image is reconstructed using the extracted P-Data).

영상 복호화 장치(120)의 제2 P-Data 메모리(124)는 대상 영상의 복원에 이용되는 P-Data 등을 수신하여 저장하기 위한 수단이다. 제2 P-Data 메모리(124)는 영상 복호화 장치(120)의 다른 구성 요소, 예컨대 영상 복호화 유닛(122)에 구비되어 있는 메모리 등에 통합되어 있거나 또는 영상 복호화 유닛(122) 등과는 분리되어 독립적인 유닛을 형성할 수도 있다.The second P-Data memory 124 of the image decoding apparatus 120 is a means for receiving and storing P-Data or the like used to restore the target image. The second P-Data memory 124 may be integrated with other components of the image decoding apparatus 120, for example, a memory included in the image decoding unit 122, or may be separated from the image decoding unit 122 and the like. It may form a unit.

계속해서, 전술한 제1 실시예에 따른 영상 부호화 유닛(112)과 영상 복호화 유닛(122)이 각각 H.264/AVC 표준 규격에 부합하는 엔코더 또는 디코더인 경우에, 상기 제1 P-Data 메모리(114)에 저장되는 데이터의 종류와 사용자가 소정의 대상 화면에 대한 데이터를 요청한 경우에 영상 복호화 장치(120)로 전송하기 위하여 P-Data 추출기(118)가 추출하는 데이터에 대하여 구체적으로 설명한다. H.264/AVC 표준 규격에 부합하는 엔코더와 디코더를 사용할 경우의 상기 제1 P-Data 메모리(114)에는 현재 매크로블록에 인접한 픽셀의 복호화된 화소값(P-Data)이 저장되며, 현재 블록이 4×4 블록 모드인 경우에 이에 인접한 주변 4×4 블록의 예측 모드 데이터와 현재 블록이 4×4 블록인 경우에 이에 인접한 주변 4×4 블록의 NNZ 데이터 등도 저장될 수 있다.Subsequently, when the image encoding unit 112 and the image decoding unit 122 according to the first embodiment described above are encoders or decoders conforming to the H.264 / AVC standard, respectively, the first P-Data memory. Types of data stored in 114 and data extracted by the P-Data extractor 118 for transmission to the image decoding apparatus 120 when the user requests data for a predetermined target screen will be described in detail. . In the case of using an encoder and a decoder conforming to the H.264 / AVC standard, the first P-Data memory 114 stores decoded pixel values (P-Data) of pixels adjacent to the current macroblock, and the current block In this 4x4 block mode, prediction mode data of a neighboring 4x4 block adjacent to this, and NNZ data of a neighboring 4x4 block adjacent thereto when the current block is a 4x4 block may also be stored.

(1) 재구성 화소 데이터(1) reconstructed pixel data

H.264/AVC 표준에 규정되어 있는 인트라 예측(Intra Prediction)은 휘도 성분에 대해서는 16×16 블록 모드와 4×4 블록 모드가 존재하고, 색차 성분(Chrominance Component)에 대해서는 8×8 블록 모드가 존재한다. 이하에서는 휘도 성분에 대한 재구성 화소 데이터에 대해서만 설명한다.Intra Prediction, defined in the H.264 / AVC standard, has 16x16 block mode and 4x4 block mode for luminance components, and 8x8 block mode for chrominance components. exist. Only reconstructed pixel data for the luminance component will be described below.

4×4 블록 모드인 경우에 9가지 예측 모드(예측 모드0 내지 예측 모드8)가 존재한다. 4×4 블록 모드를 적용하기 위하여, 우선 매크로블록을 16개의 4×4 블록으로 나눈다. 그리고 각각의 4×4 블록에 대하여 상기 9가지 예측 모드 중에서 최적의 예측 모드를 선택하여 부호화를 수행한다. 상기 9가지 예측 모드는 Vertical(예측 모드0), Horizontal(예측 모드1), DC(예측 모드2), Diagonal down-left(예측 모드3), Diagonal down-right(예측 모드4), Vertical-right(예측 모드5), Horizontal down(예측 모드6), Vertical left(예측 모드7), 및 Horizontal-up(예측 모드8)이다. 16개의 4×4 블록으로 구성되는 하나의 매크로블록에 대하여 상기 9가지 예측 모드 각각을 적용해보면, 하나의 예측 휘도 매크로블록을 생성하기 위해서는 최대 37개, 즉 위쪽 21개의 인접 화소 및 왼쪽 16개의 인접 화소의 복호화된 휘도값이 요구된다. In the case of the 4x4 block mode, there are nine prediction modes (prediction mode 0 to prediction mode 8). In order to apply the 4x4 block mode, the macroblock is first divided into 16 4x4 blocks. For each 4x4 block, encoding is performed by selecting an optimal prediction mode from the nine prediction modes. The nine prediction modes are Vertical (prediction mode 0), Horizontal (prediction mode 1), DC (prediction mode 2), Diagonal down-left (prediction mode 3), Diagonal down-right (prediction mode 4), Vertical-right (Prediction mode 5), Horizontal down (prediction mode 6), Vertical left (prediction mode 7), and Horizontal-up (prediction mode 8). Applying each of the nine prediction modes to one macroblock consisting of 16 4 × 4 blocks, in order to generate one predicted luminance macroblock, a maximum of 37, i.e., the top 21 neighboring pixels and the left 16 neighbors are generated. The decoded luminance value of the pixel is required.

그리고 16×16 블록 모드인 경우에는 4가지 예측 모드가 존재한다. 상기 4가지 예측 모드는 Vertical(예측 모드0), Horizontal(예측 모드1), DC(예측 모드2), 및 Plane(예측 모드3)이다. 상기 4가지 예측 모드 각각을 매크로블록에 대하여 적 용해보면, 하나의 예측 휘도 매크로블록을 생성하는데 있어서 최대 33개, 즉 위쪽 17개의 인접 화소와 왼쪽 16개의 인접 화소의 복호화된 휘도값이 이용된다.In case of 16 × 16 block mode, four prediction modes exist. The four prediction modes are Vertical (prediction mode 0), Horizontal (prediction mode 1), DC (prediction mode 2), and Plane (prediction mode 3). When each of the four prediction modes is applied to a macroblock, the decoded luminance values of up to 33, i.e., 17, upper and 16 adjacent pixels are used to generate one predicted luminance macroblock.

따라서 각 매크로블록에 대하여 이에 인접하는 최대 37개 화소들의 복호화된 휘도값이 P-Data로서 존재할 수 있으며, 해당 매크로블록을 예측하는데 있어서 실제로 몇 개의 픽셀에 대한 재구성 화소 데이터가 예측값으로 이용되는지 여부는 각 매크로블록의 블록 모드 및/또는 각 매크로블록을 구성하는 단위 블록의 예측 모드에 따라서 달라질 수 있다. Therefore, the decoded luminance value of up to 37 pixels adjacent to each macroblock may exist as P-Data, and whether or not the reconstructed pixel data of several pixels is actually used as a prediction value in predicting the macroblock. It may vary depending on the block mode of each macroblock and / or the prediction mode of the unit block constituting each macroblock.

(2) 예측 모드 데이터(2) prediction mode data

H.264/AVC에서는 인트라 4×4 블록 모드인 경우에 예측 모드를 지시하는 정보(예측 모드 정보)에 대한 비트수를 감소시키기 위하여, 현재 4×4 블록에 대한 상기 예측 모드 정보를 적응적으로 할당한다. 보다 구체적으로, 현재 4×4 블록의 왼쪽과 위쪽에 위치하는 4×4 블록의 예측 모드 중에서 크기가 작은 모드를 기준 모드로 선택하고, 선택된 기준 모드와 현재 4×4 블록의 예측 모드가 같은 경우에는 1비트만을 전송함으로써 같다는 것을 표시한다. 그리고 현재 4×4 블록의 예측 모드와 상기 기준 모드가 같지 않은 경우에는, 예측 모드가 다르다는 것을 나타내는 1비트 정보와 상기 기준 모드를 제외한 나머지 8가지 예측 모드를 3비트로 표시함으로써 현재 4×4 블록의 예측 모드를 지시하는 정보를 4비트로써 표시한다. 이러한 적응적 예측 모드 정보의 할당은 9가지 예측 모드를 일괄적으로 4비트로 표현하는 것에 비하여 적응적으로 1비트 또는 4비트로 표현함으로써, 부호화 효율을 향 상시킬 수가 있다.In H.264 / AVC, the prediction mode information for the current 4x4 block is adaptively applied in order to reduce the number of bits for the information (prediction mode information) indicating the prediction mode in the intra 4x4 block mode. Assign. More specifically, when a small mode is selected as a reference mode from among prediction modes of 4 × 4 blocks located on the left and top of the current 4 × 4 block, and the selected reference mode and the prediction mode of the current 4 × 4 block are the same. Indicates that they are the same by transmitting only one bit. If the prediction mode of the current 4x4 block and the reference mode are not the same, 1-bit information indicating that the prediction mode is different and the remaining 8 prediction modes except for the reference mode are displayed in 3 bits so as to display the current 4x4 block. Information indicating the prediction mode is displayed as 4 bits. The allocation of the adaptive prediction mode information can improve coding efficiency by adaptively expressing nine prediction modes in one bit or four bits as compared with collectively representing four bits.

도 4는 현재 매크로블록 내의 16개의 4×4 블록들(0번 내지 15번) 중에서 주변 4×4 블록의 예측 모드 정보가 요구되는 4×4 블록을 보여주기 위한 도면이다. 도 4를 참조하면, '0'번 4×4 블록은 왼쪽 4×4 블록(0A)과 위쪽 4×4 블록(0B)의 예측 모드 정보가 요구되며, '1', '4', 및 '5'번 4×4 블록은 위쪽 4×4 블록(1B, 2B, 3B)의 예측 모드 정보가 요구되며, '2', '8', 및 '10'번 4×4 블록은 왼쪽 4×4 블록(1A, 2A, 3A)의 예측 모드 정보가 요구된다. FIG. 4 is a diagram for showing a 4x4 block requiring prediction mode information of neighboring 4x4 blocks among 16 4x4 blocks (Nos. 0 to 15) within a current macroblock. Referring to FIG. 4, the 4x4 block '0' requires prediction mode information of the left 4x4 block (0A) and the upper 4x4 block (0B), and the '1', '4', and ' 4x4 block 5 'requires prediction mode information of the upper 4x4 blocks 1B, 2B, and 3B, and 4x4 blocks' 2', '8', and '10' are left 4x4. The prediction mode information of the blocks 1A, 2A, and 3A is required.

따라서 하나의 매크로블록을 복원하기 위하여 최대 8개의 주변 4×4 블록(0, 1, 2, 4, 5, 8, 10번)의 예측 모드 정보가 필요하며, 이들 예측 모드 정보도 P-Data와 함께 영상 복호화 장치로 전송되어야 한다.Therefore, in order to recover one macroblock, prediction mode information of up to eight neighboring 4x4 blocks (Nos. 0, 1, 2, 4, 5, 8, and 10) is required. Together with the video decoding apparatus.

(3) NNZ 데이터(3) NNZ data

H.264/AVC에서는 엔트로피 부호화를 위해 콘텍스트 적응 가변장 부호화(Context Adaptive Variable Length Coding, CAVLC)를 사용한다. CAVLC는 주변 블록의 상황에 따라 여러 개의 VLC 테이블 중에서 하나를 적응적으로 선택하여 엔트로피 부호화를 수행한다. 현재 4×4 블록의 VLC 테이블을 결정하기 위하여 주변의 0이 아닌 계수의 개수(NNZ)를 이용한다. 보다 구체적으로, 현재 4×4 블록의 왼쪽 및 위쪽에 위치하는 4×4 블록의 NNZ값의 평균값(nC로 표시)을 구하여, 이 평균값을 이용하여 현재 4×4 블록의 부호화에 적용할 VLC 테이블을 결정한다.H.264 / AVC uses Context Adaptive Variable Length Coding (CAVLC) for entropy coding. CAVLC performs entropy encoding by adaptively selecting one of several VLC tables according to the situation of neighboring blocks. The number of neighboring nonzero coefficients NNZ is used to determine the VLC table of the current 4x4 block. More specifically, the VLC table to be applied to the encoding of the current 4x4 block by obtaining the average value (denoted in nC) of the NNZ values of the 4x4 blocks located on the left and top of the current 4x4 block and using the average value Determine.

따라서 인접 4×4 블록의 NNZ 정보의 경우에도, 전술한 인접 4×4 블록의 예 측 모드 정보와 마찬가지로, 하나의 매크로블록을 복원하기 위하여 최대 8개의 주변 4×4 블록(0, 1, 2, 4, 5, 8, 10번)의 NNZ 정보가 필요하며, 이들 NNZ 정보도 P-Data와 함께 영상 복호화 장치로 전송되어야 한다.Therefore, even in the case of NNZ information of adjacent 4x4 blocks, similar to the prediction mode information of the adjacent 4x4 blocks described above, up to eight peripheral 4x4 blocks (0, 1, 2) in order to recover one macroblock. , Nos. 4, 5, 8 and 10) are required, and these NNZ information should also be transmitted to the video decoding apparatus together with the P-Data.

전술한 세 가지 종류의 데이터, 즉 P-Data, 예측 모드 정보, 및 NNZ 정보는 부호화 시에 각 매크로블록에 대하여 영상 부호화 유닛(112)로부터 제1 P-Data 메모리(114)로 보내져서 저장된다. 예를 들어, 상기 P-Data 등은 해당 매크로블록의 부가 정보로써 제1 P-Data 메모리(114)에 저장될 수 있다. 그리고 P-Data 추출기(118)는 대상 영상에서의 각 매크로블록의 위치 및/또는 블록 모드 등을 고려하여, 상기 제1 P-Data 메모리(114)에 저장되어 있는 각 매크로블록에 대한 데이터들 중에서 전부 또는 일부의 데이터를 추출하여 영상 복호화 장치(120)로 전송한다.The three types of data described above, namely P-Data, prediction mode information, and NNZ information, are sent from the image coding unit 112 to the first P-Data memory 114 for each macroblock at the time of encoding and stored. . For example, the P-Data may be stored in the first P-Data memory 114 as additional information of the corresponding macroblock. The P-Data extractor 118 considers the location and / or block mode of each macroblock in the target image, and among the data for each macroblock stored in the first P-Data memory 114. All or part of the data is extracted and transmitted to the image decoding apparatus 120.

그리고 도 5는 디스플레이 윈도우, 즉 대상 영상에서의 매크로블록 위치 및/또는 블록 모드 등에 따라서 P-Data 추출기(118)가 영상 복호화 장치(120)로 전송하는 데이터 종류에 대하여 구체적으로 설명하기 위한 도면이다. 본 실시예에 의하면, 대상 영상의 좌우측 가장자리와 위쪽 가장자리에 위치하는 매크로블록, 즉 매크로블록①, ②, ③, ⑦에 대해서는 상기 세 가지 종류의 데이터 중에서 하나 이상의 데이터를 전송해주어야 할 가능성이 높다. 그리고 나머지 매크로블록들에 대해서는 P-Data 등을 별도로 전송해줄 필요가 없다. 이하, 매크로블록①, ②, ③, ⑦의 경우에 전송해야 하는 데이터의 종류에 대해서 보다 구체적으로 예를 들어 설명 한다.FIG. 5 is a diagram for describing in detail a type of data transmitted by the P-Data extractor 118 to the image decoding apparatus 120 according to a display window, that is, a macroblock position and / or a block mode in a target image. . According to the present exemplary embodiment, one or more pieces of data of the three types of data are likely to be transmitted to the macroblocks positioned at the left and right edges and the upper edge of the target image, that is, the macroblocks ①, ②, ③ and ⑦. For the remaining macroblocks, there is no need to separately transmit P-Data. Hereinafter, the types of data to be transmitted in the case of the macroblocks ①, ②, ③ and ⑦ will be described in more detail with reference to examples.

본 실시예의 일례는 매크로블록①, ②, ③, ⑦에 해당하는 모든 매크로블록에 대하여 상기 세 가지 종류의 데이터 모두를 추출하여 전송하는 경우이다. 이 경우는 P-Data 등의 전송이 필요한 모든 매크로블록에 대하여 일률적으로 처리한다는 점에서, P-Data 추출기(118)의 구성 및 동작을 단순화시킬 수 있는 장점이 있다. 다만, 이 경우에는 해당 매크로블록을 복호화하는데 있어서 불필요한 데이터의 전송이 발생할 수도 있다는 점에서, 전송 효율을 극대화하는 데에는 일정한 한계가 있을 수 있다.An example of this embodiment is a case where all three types of data are extracted and transmitted for all macroblocks corresponding to macroblocks ①, ②, ③, and ⑦. This case has the advantage of simplifying the configuration and operation of the P-Data extractor 118 in that it uniformly processes all macroblocks requiring transmission of P-Data and the like. However, in this case, there may be a certain limit in maximizing the transmission efficiency in that unnecessary data transmission may occur in decoding the macroblock.

본 실시예의 다른 예는 매크로블록①, ②, ③, ⑦의 블록 모드 및 예측 모드 등을 고려하여, 해당 매크로블록을 복호화하는데 있어서 필요한 데이터만을 전송해주는 경우이다. 보다 구체적으로, 매크로블록①에 대해서는 전술한 세 가지 종류의 데이터를 모두 전송하고, 매크로블록②에 대해서는 전술한 세 가지 종류의 데이터 중에서 각 매크로블록에 대한 왼쪽 16개 픽셀에 대한 재구성 화소 데이터, 왼쪽 4개의 4×4 블록에 대한 예측 모드 데이터, 및/또는 왼쪽 4개의 4×4 블록에 대한 NNZ 데이터는 전송하지 않을 수도 있다. 그리고 매크로블록③에 대해서는 전술한 세 가지 종류의 데이터 중에서 각 매크로블록에 대한 위쪽 16개의 픽셀의 재구성 화소 데이터, 위쪽 4개의 4×4 블록에 대한 예측 모드 데이터, 및/또는 위쪽 4개의 4×4 블록에 대한 NNZ 데이터는 전송하지 않을 수도 있다. 또한, 매크로블록⑦의 경우에는 전술한 세 가지 종류의 데이터 중에서 각 매크로블록에 대하여 우상(right and upper) 방향으로 인접하게 위치하는 4개의 화소에 대한 재구성 화소 데이터만을 전송할 수 있다.Another example of the present embodiment is a case where only data necessary for decoding the macroblock is transmitted in consideration of the block mode and the prediction mode of the macroblocks ①, ②, ③ and ⑦. More specifically, all three types of data described above are transmitted for the macroblock ①, and reconstructed pixel data for the left 16 pixels for each macroblock among the three types of data described above for the macroblock ②, and the left side. Prediction mode data for four 4x4 blocks and / or NNZ data for the left four 4x4 blocks may not be transmitted. For the macroblock ③, the reconstructed pixel data of the top 16 pixels for each macroblock, the prediction mode data for the top 4 4 × 4 blocks, and / or the top 4 4 × 4 of the above three types of data. NNZ data for the block may not be transmitted. In the case of the macroblock ⑦, only reconstructed pixel data of four pixels positioned adjacent to each of the macroblocks in the right and upper directions can be transmitted.

본 실시예의 또 다른 측면에 의하면, 매크로블록①, ②, ③에 대하여 전술한 세 가지 데이터 모두를 전송하는 것을 기본으로 하면서, 아래에 해당되는 경우에는 그 내용을 선택적으로 적용하는 것이 가능하다.According to another aspect of the present embodiment, it is possible to selectively apply the contents in the following cases, based on the transmission of all three data described above for the macroblocks ①, ②, ③.

- 매크로블록②가 16×16 블록 모드이고 Horizontal 예측 모드인 경우, 재구성 화소 데이터는 전송하지 않는다.When the macroblock ② is in the 16 × 16 block mode and the horizontal prediction mode, reconstructed pixel data is not transmitted.

- 매크로블록③이 16×16 블록 모드이고 Vertical 예측 모드인 경우, 재구성 화소 데이터는 전송하지 않는다.When the macroblock ③ is in the 16 × 16 block mode and the vertical prediction mode, reconstructed pixel data is not transmitted.

- 매크로블록②의 경우, 좌측으로 인접한 16개 화소의 재구성 화소 데이터와 좌측 4개의 4×4 블록에 대한 NNZ 정보는 전송하지 않는다.In the case of the macroblock ②, the reconstructed pixel data of the 16 pixels adjacent to the left side and the NNZ information of the 4 4x4 blocks on the left side are not transmitted.

- 매크로블록③의 경우, 위쪽으로 인접한 16개 화소의 재구성 화소 데이터와 위쪽 4개의 4×4 블록에 대한 NNZ 정보는 전송하지 않는다.In the case of the macroblock ③, the reconstructed pixel data of the 16 adjacent pixels upward and the NNZ information of the 4 upper 4 × 4 blocks are not transmitted.

- 매크로블록⑦의 경우, 우상측으로 인접한 4개의 픽셀에 대한 재구성 화소 데이터만 전송한다.In the case of macroblock ⑦, only reconstructed pixel data for four pixels adjacent to the upper right side is transmitted.

- 인접 4×4 블록의 예측 모드 정보는 현재의 블록 모드와 주위의 블록 모드가 모두 4×4 블록 모드일 경우에만 전송한다.The prediction mode information of adjacent 4x4 blocks is transmitted only when both the current block mode and the neighboring block modes are 4x4 block modes.

이상에서 상세하게 설명한 본 발명의 제1 실시예에 따른 영상 코덱 시스템에 의하면, 영상 부호화 서버에서 인트라 모드의 전체 영상에 대하여 화면내 예측 부호화를 수행하여 종래와 마찬 가지 방법으로 영상 비트스트림을 생성하며, 또한 상 기 화면내 예측 부호화에 이용된 예측 화소 데이터 등은 별도의 메모리에 저장해둔다. 그리고 사용자가 상기 전체 영상의 특정 부분을 지정한 대상 영상에 대한 데이터를 요청한 경우에, 상기 영상 부호화 서버는 상기 대상 영상에 대한 영상 비트스트림과 함께 이 영상 비트스트림을 복호화하는데 있어서 필요한 P-Data 등을 영상 복호화 장치로 전송한다. 또한, 영상 복호화 장치는 수신된 영상 비트스트림과 P-Data 등을 이용하여 요청된 대상 영상을 복원한다.According to the image codec system according to the first embodiment of the present invention described in detail above, the image encoding server performs the intra prediction encoding on the entire image in the intra mode to generate the image bitstream in the same manner as the conventional method. In addition, the prediction pixel data used for the intra prediction encoding is stored in a separate memory. When the user requests data for a target video designating a specific part of the entire video, the video encoding server, together with the video bitstream for the target video, provides P-Data or the like necessary for decoding the video bitstream. Transmission to the video decoding apparatus. Also, the image decoding apparatus restores the requested target image by using the received image bitstream and P-Data.

이러한 본 발명의 제1 실시예에 따른 영상 코덱 시스템에 의하면, 전체 영상이 아닌 대상 영상에 대한 영상 비트스트림과 P-Data만을 전송하기 때문에 전송 효율을 향상시킬 수 있다. 아울러, 영상 복호화 장치에서는 불필요한 영상 부분(즉, 전체 영상 중에서 대상 영상에 해당되지 않는 부분)에 대해서는 복호화를 수행할 필요가 없다. 결국, 이러한 본 발명의 제1 실시예에 따른 영상 코덱 시스템은 공간임의접근을 지원함과 동시에 한정된 전송 채널을 효율적으로 이용할 수 있기 때문에, 사용자는 전체 영상의 일부인 대상 영상에 대해서도 고화질의 영상을 시청하는 것이 가능하다.According to the video codec system according to the first embodiment of the present invention, transmission efficiency can be improved because only the video bitstream and the P-Data for the target video are transmitted instead of the entire video. In addition, the image decoding apparatus does not need to perform decoding on an unnecessary image portion (that is, a portion of the entire image that does not correspond to the target image). As a result, since the image codec system according to the first embodiment of the present invention supports spatial random access and efficiently uses a limited transmission channel, a user can watch a high quality image even for a target image which is a part of the entire image. It is possible.

제2 실시예Second Embodiment

본 발명의 제2 실시예는 불일치 데이터(M-Data)를 이용하는 경우이다. 일반적으로'M-Data'는 화면내 예측 부호화 방법에 의하여 각 매크로블록을 부호화한 다음 복호화한 재구성 화소 데이터와 화면내 예측 부호화를 하지 않고 각 매크로블록을 독립적으로 부호화한 다음 복호화한 재구성 화소 데이터와의 차분값을 나타낸 다. 그리고 본 명세서에서는 상기 M-Data라는 용어가 상기 매크로블록 전체에 대한 차분값이 아니라 매크로블록 일부에 대한 차분값을 의미하는 것으로 사용되기도 한다. 예를 들어, 도 6에 도시되어 있는 것과 같이, 매크로블록의 좌하측 열과 좌우측 행에 대한 차분값을 나타내기 위하여 상기 M-Data라는 용어를 사용하기도 한다.The second embodiment of the present invention is a case of using mismatched data (M-Data). In general, 'M-Data' refers to reconstructed pixel data obtained by encoding and then decoding each macroblock by an intra prediction encoding method, and reconstructed pixel data obtained by separately encoding and then decoding each macroblock without performing intra prediction encoding. The difference value of. In the present specification, the term "M-Data" may be used to mean a difference value for a part of the macroblock, not a difference value for the entire macroblock. For example, as illustrated in FIG. 6, the term M-Data may be used to indicate a difference value between a lower left column and a left and right row of a macroblock.

도 7은 본 발명의 제2 실시예에 따른 영상 코덱 시스템(200)의 구성을 보여주는 블록도이다. 도 7을 참조하면, 영상 코덱 시스템(200)은 영상 부호화 서버(210)와 영상 복호화 장치(220)를 포함한다. 그리고 영상 부호화 서버(210)는 제1 영상 부호화 유닛(211), 제2 영상 부호화 유닛(212), 제1 M-Data 메모리(214), 영상 비트스트림 추출기(216), 및 M-Data 추출기(218)를 포함한다. 그리고 영상 복호화 장치(220)는 영상 복호화 유닛(222)과 제2 M-Data 메모리(224)를 포함한다.7 is a block diagram illustrating a configuration of an image codec system 200 according to a second embodiment of the present invention. Referring to FIG. 7, the image codec system 200 includes an image encoding server 210 and an image decoding apparatus 220. The image encoding server 210 may include a first image encoding unit 211, a second image encoding unit 212, a first M-Data memory 214, an image bitstream extractor 216, and an M-Data extractor ( 218). The image decoding apparatus 220 includes an image decoding unit 222 and a second M-Data memory 224.

우선 영상 부호화 서버(210)에 대하여 설명한다. 본 실시예에 따른 영상 부호화 서버(210)는 2가지 종류의 영상 부호화 유닛(211, 212), 제1 M-Data 메모리(214), 및 M-Data 추출기(218)를 포함한다는 점에서, 전술한 제1 실시예에 따른 영상 부호화 서버(110)와 차이가 있다.First, the video encoding server 210 will be described. The image encoding server 210 according to the present embodiment includes two types of image encoding units 211 and 212, a first M-Data memory 214, and an M-Data extractor 218. There is a difference from the image encoding server 110 according to the first embodiment.

제1 영상 부호화 유닛(211)과 제2 영상 부호화 유닛(212)은 모두 입력되는 전체 영상을 소정의 부호화 방법을 이용하여 압축함으로써 각각 제1 영상 비트스트림과 제2 영상 비트스트림을 생성하기 위한 수단이다. 이러한 제1 및 제2 영상 부호화 유닛(211, 212)은 H.264/AVC 또는 VC-1 등과 같은 영상 압축에 관한 표준 규격에 부합하는 엔코더(Encoder)인 것이 바람직하지만, 이에 한정되는 것은 아니다. 제1 영상 부호화 유닛(211)과 제2 영상 부호화 유닛(212)으로부터 생성된 영상 비트스트림은 소정의 저장 수단에 저장되거나 또는 실시간 전송의 경우에는 버퍼 메모리 등에 저장되어 있을 수 있다.Means for generating a first video bitstream and a second video bitstream, respectively, by compressing the entire video inputted by the first video encoding unit 211 and the second video encoding unit 212 using a predetermined encoding method. to be. The first and second image encoding units 211 and 212 are preferably encoders that conform to a standard for image compression such as H.264 / AVC or VC-1, but are not limited thereto. The video bitstreams generated from the first video encoding unit 211 and the second video encoding unit 212 may be stored in predetermined storage means or in a buffer memory in the case of real time transmission.

그리고 본 실시예에 의하면, 제1 영상 부호화 유닛(211)은 인트라 모드 영상에 대하여 소정 크기의 블록, 예컨대 매크로블록 단위로 독립적으로 부호화를 수행하여 제1 영상 비트스트림을 생성하는 엔코더이다. 즉, 제1 영상 부호화 유닛(211)은 인접 블록의 복호화된 화소 데이터 등을 이용하여 부호화하는 화면내 예측 부호화를 수행하지 않는다. 반면, 제2 영상 부호화 유닛(212)은 인트라 모드 영상에 대하여 화면내 예측 부호화를 수행하여 제2 영상 비트스트림을 생성하는 엔코더이다. 따라서 제2 영상 부호화 유닛(212)은 전술한 제1 실시예의 영상 부호화 유닛(112)과 동일하다.According to the present exemplary embodiment, the first image encoding unit 211 is an encoder that independently encodes an intra mode image in units of predetermined sizes, for example, macroblocks, to generate a first image bitstream. That is, the first image encoding unit 211 does not perform intra prediction encoding that is encoded by using decoded pixel data of an adjacent block. On the other hand, the second image encoding unit 212 is an encoder that performs intra prediction encoding on an intra mode image to generate a second image bitstream. Therefore, the second image encoding unit 212 is the same as the image encoding unit 112 of the first embodiment described above.

제1 M-Data 메모리(214)는 제1 영상 부호화 유닛(211)에서 생성되는 각 매크로블록에 대한 복호화된 화소값(제1 복호화된 화소값)과 제2 영상 부호화 유닛(212)에서 생성되는 각 매크로블록에 대한 복호화된 화소값(제2 복호화된 화소값)의 차분값(차분 화소 데이터)을 저장하기 위한 수단이다. 이러한 차분값을 구하기 위하여 제1 M-Data 메모리(214)의 입력측에는 감산기(215)가 더 구비되어 있을 수 있다. 이러한 제1 M-Data 메모리(214)에는 매크로블록 단위로 전체 화소(256개 화소)에 대한 차분 데이터가 저장되거나 또는 도 6에 도시된 바와 같이 각 블록의 최우측 제16행과 최하측 제16열에 위치하는 화소들, 즉 총 31개 화소에 대한 차분 데이터만 저장될 수 있는데, 후자의 경우가 더 바람직하다. 그리고 제1 M-Data 메 모리(214)에는 제2 영상 부호화 유닛(212)에서 생성되는 데이터의 일부, 보다 구체적으로는 예측 모드 데이터와 NNZ 데이터도 함께 저장될 수 있다.The first M-Data memory 214 is generated by the decoded pixel value (first decoded pixel value) for each macroblock generated by the first image encoding unit 211 and by the second image encoding unit 212. Means for storing the difference value (differential pixel data) of the decoded pixel value (second decoded pixel value) for each macroblock. In order to obtain such a difference value, a subtractor 215 may be further provided at an input side of the first M-Data memory 214. The first M-Data memory 214 stores difference data for all pixels (256 pixels) in units of macroblocks, or as shown in FIG. 6, the rightmost 16th row and the lowest 16th block of each block. Only difference data for pixels located in the column, that is, 31 pixels in total, can be stored, with the latter being more preferred. The first M-Data memory 214 may also store some of the data generated by the second image coding unit 212, more specifically, prediction mode data and NNZ data.

영상 비트스트림 추출기(216)는 전체 영상에 대한 비트스트림, 즉 제1 영상 비트스트림과 제2 영상 비트스트림 중에서 사용자가 원하는 대상 영상에 대한 비트스트림만을 추출하는 기능을 수행한다. 보다 구체적으로, 본 실시예에 의하면, 영상 비트스트림 추출기(216)는 상기 대상 영상을 구성하는 매크로블록들 중에서 양쪽 가장자리에 위치하는 매크로블록과 최상측에 위치하는 매크로블록(즉, 전술한 도 7의 매크로블록①, ②, ③에 해당하는 블록)에 대해서는 제1 영상 비트스트림으로부터 데이터를 추출한다. 반면, 상기 대상 영상을 구성하는 매크로블록들 중에서 제1 영상 비트스트림을 이용하지 않는 나머지 매크로블록들에 대해서는 제2 영상 비트스트림으로부터 데이터를 추출한다.The image bitstream extractor 216 extracts only a bitstream of a target image desired by a user from among a bitstream of the entire image, that is, a first image bitstream and a second image bitstream. More specifically, according to the present exemplary embodiment, the image bitstream extractor 216 may include macroblocks positioned at both edges and macroblocks positioned at the top of macroblocks constituting the target image (ie, FIG. 7 described above). Blocks corresponding to the macroblocks 1, 2, and 3) are extracted from the first video bitstream. On the other hand, data is extracted from the second image bitstream for the remaining macroblocks that do not use the first image bitstream among the macroblocks constituting the target image.

그리고 M-Data 추출기(218)는 제1 M-Data 메모리(214)에 저장되어 있는 M-Data 중에서 해당 매크로블록에 대한 M-Data인 차분 화소 데이터와 예측 모드 데이터, 및/또는 NNZ 데이터를 추출하는 기능을 수행한다. 보다 구체적으로, M-Data 추출기(218)는 상기 대상 영상을 구성하는 매크로블록들 중에서 양쪽 가장자리에 위치하는 매크로블록과 최상측에 위치하는 매크로블록(즉, 전술한 도 5의 매크로블록①, ②, ③에 해당하는 블록)에 대해서는 차분 화소 데이터를 추출한다. 그리고 나머지 매크로블록들에 대해서는 예측 모드 데이터와 NNZ 데이터를 추출한다.The M-Data extractor 218 extracts differential pixel data, prediction mode data, and / or NNZ data, which are M-Data for the corresponding macroblock, from the M-Data stored in the first M-Data memory 214. It performs the function. More specifically, the M-Data extractor 218 may include macroblocks positioned at both edges of the macroblocks constituting the target image and macroblocks positioned at the uppermost side (ie, macroblocks ① and ② of FIG. 5 described above). Difference pixel data) is extracted. The prediction mode data and the NNZ data are extracted for the remaining macroblocks.

계속해서, 영상 복호화 장치(220)에 대해서 설명한다.Subsequently, the video decoding apparatus 220 will be described.

영상 복호화 장치(220)의 영상 복호화 유닛(222)은 사용자가 시청하기를 원 하는 대상 영상을 복원하기 위한 수단이다. 이러한 영상 복호화 유닛(222)은 수신된 제1 영상 비트스트림과 차분 화소 데이터(M-Data)를 이용하여, 대상 영상을 구성하는 매크로블록들 중에서 양쪽 가장자리에 위치하는 매크로블록과 최상측에 위치하는 매크로블록(즉, 전술한 도 5의 매크로블록①, ②, ③에 해당하는 블록)을 복원한다. 그리고 대상 영상 중에서 매크로블록①, ②, ③에 해당하지 않는 나머지 매크로블록에 대해서는 수신된 제2 영상 비트스트림과 예측 모드 데이터 및/또는 NNZ 데이터를 이용하여 복호화를 수행한다.The image decoding unit 222 of the image decoding apparatus 220 is a means for reconstructing a target image that the user wants to watch. The image decoding unit 222 may be located at the top of the macroblocks located at both edges of the macroblocks constituting the target image by using the received first image bitstream and differential pixel data (M-Data). The macro blocks (that is, the blocks corresponding to the macro blocks ①, ②, and ③ of FIG. 5 described above) are restored. The remaining macroblocks not corresponding to macroblocks ①, ②, and ③ of the target image are decoded using the received second image bitstream, prediction mode data, and / or NNZ data.

그런데, 제1 영상 비트스트림은 매크로블록 단위로 독립적으로 부호화된 데이터들로 구성되고, 제2 영상 비트스트림은 매크로블록 단위로 화면내 예측 부호화에 의하여 부호화된 데이터들로 구성된다. 그런데, 도 5의 매크로블록④, ⑤, ⑥의 경우에는 매크로블록①, ②, ③의 픽셀값 등이 예측값으로 이용되는데, 영상 복호화 유닛(222)에서는 제1 영상 비스트스림을 이용하여 상기 매크로블록①, ②, ③의 픽셀값을 복원하기 때문에, 영상 부호화 유닛(212)에서 예측값으로 이용하는 매크로블록①, ②, ③의 픽셀값과 차이가 있다. 따라서 본 실시예에 따른 복호화 방법에서는, 제2 영상 비트스트림을 이용하여 복호화를 수행하기 이전에, 제2 M-Data 메모리(224)에 저장되어 있는 차분 화소 데이터를 이용하여 매크로블록①, ②, ③의 픽셀값, 보다 구체적으로는 각 매크로블록에서 제16번째행 및/또는 제16번째 열에 해당되는 픽셀값을 보상해준다.However, the first image bitstream is composed of data encoded independently in macroblock units, and the second image bitstream is composed of data encoded by intra prediction encoding in macroblock units. However, in the case of the macroblocks ④, ⑤, and ⑥ of FIG. 5, the pixel values of the macroblocks ①, ②, and ③ are used as prediction values. Since the pixel values of (1), (2), and (3) are reconstructed, there is a difference from the pixel values of macroblocks (1), (2), and (3) used as prediction values in the video encoding unit 212. Therefore, in the decoding method according to the present embodiment, before performing decoding using the second image bitstream, macroblocks ①, ②, using differential pixel data stored in the second M-Data memory 224. The pixel value corresponding to?, More specifically, the pixel value corresponding to the sixteenth row and / or the sixteenth column in each macroblock is compensated for.

영상 복호화 장치(220)의 제2 M-Data 메모리(224)는 대상 영상의 복원에 이용되는 차분 화소 데이터(M-Data)와 예측 모드 데이터 및/또는 NNZ 데이터 등을 저 장하기 위한 수단이다. 이러한 제2 M-Data 메모리(224)는 영상 복호화 장치(220)의 다른 구성 요소, 예컨대 영상 복호화 유닛(222)에 구비되어 있는 메모리 등에 통합되어 있거나 또는 영상 복호화 유닛(222) 등과는 분리되어 독립적인 유닛을 형성할 수도 있다.The second M-Data memory 224 of the image decoding apparatus 220 is a means for storing differential pixel data (M-Data), prediction mode data and / or NNZ data, etc. used for reconstruction of the target image. The second M-Data memory 224 may be integrated with other components of the image decoding apparatus 220, for example, a memory included in the image decoding unit 222, or may be separated from and independent of the image decoding unit 222. The phosphorus unit may be formed.

다음으로, 본 실시예에 따른 영상 코덱 시스템이 H.264/AVC 표준 규격에 부합하는 영상 코덱 시스템인 경우에, 상기 제1 M-Data 메모리(214)에 저장되어서 영상 복호화 장치(120)로 전송되는 전술한 데이터에 대하여 구체적으로 설명한다. 전술한 바와 같이, 상기 제1 M-Data 메모리(214)에는 각 매크로블록의 최하측 열과 최우측 행에 해당하는 35개의 픽셀에 대하여 제1 영상 부호화 유닛(211)에서 생성되는 복호화된 화소값과 제2 영상 부호화 유닛(212)에서 생성되는 복호화된 화소값 사이의 차분값(차분 화소 데이터)인 M-Data와 현재 매크로블록에 인접한 주변 4×4 블록의 예측 모드 정보(예측 모드 데이터) 및/또는 현재 매크로블록에 인접한 주변 4×4 블록의 NNZ 정보(NNZ 데이터) 등을 포함한다.Next, when the image codec system according to the present embodiment is an image codec system conforming to the H.264 / AVC standard, it is stored in the first M-Data memory 214 and transmitted to the image decoding apparatus 120. The above-described data will be described in detail. As described above, the first M-Data memory 214 includes decoded pixel values generated by the first image coding unit 211 for 35 pixels corresponding to the lowermost column and the uppermost row of each macroblock. Prediction mode information (prediction mode data) of M-Data, which is a difference value (differential pixel data) between the decoded pixel values generated by the second image encoding unit 212, and neighboring 4x4 blocks adjacent to the current macroblock, and / Or NNZ information (NNZ data) of a neighboring 4x4 block adjacent to the current macroblock.

(1) 차분 화소 데이터(1) differential pixel data

M-Data 메모리(214)에는 차분 화소 데이터로서 각 매크로블록에 대하여 최우측 행(제16행)과 최하측 열(제16열)에 위치하는 총 31개의 화소들에 대한 차분값이 저장된다. 그리고 영상 복호화 장치(220)로 전송되는 차분 화소 데이터는 대상 영상에서 각 매크로블록이 차지하는 위치에 따라서 달라질 수 있다. 예를 들어, 도 5 의 매크로블록①인 경우에는 상기 31개의 화소에 대한 차분 화소 데이터를 모두 전송한다. 그리고 도 5의 매크로블록②인 경우에는 최하측 열에 위치하는 16개의 화소에 대한 차분 화소 데이터만을 전송하고, 도 5의 매크로블록③인 경우에는 최우측 행에 위치하는 16개의 화소에 대한 차분 화소 데이터만을 전송할 수 있다. 다만, 다음과 같은 2가지 경우에는 일부 매크로블록에 대한 차분 화소 데이터는 전송하지 않아도 된다.The M-Data memory 214 stores difference values for 31 pixels located in the rightmost row (16th row) and the lowest column (16th column) for each macroblock as differential pixel data. The differential pixel data transmitted to the image decoding apparatus 220 may vary according to positions occupied by each macroblock in the target image. For example, in the case of the macroblock ① of FIG. 5, all differential pixel data for the 31 pixels is transmitted. In the case of macroblock ② of FIG. 5, only differential pixel data for 16 pixels located in the lowermost column is transmitted. In the case of macroblock ③ of FIG. 5, differential pixel data for 16 pixels located in the rightmost row is transmitted. Can only transmit. However, in the following two cases, differential pixel data for some macroblocks may not be transmitted.

- 도 5의 매크로블록⑤에 해당하는 블록이 16×16 블록 모드로로, Horizontal 예측 모드인 경우에는 매크로블록②의 최하측 열에 위치하는 16개의 화소에 대한 차분 화소 데이터.Differential pixel data for 16 pixels located in the lowermost column of the macroblock ② when the block corresponding to the macroblock? Of FIG. 5 is in the 16 × 16 block mode and is in the horizontal prediction mode.

- 도 5의 매크로블록⑥에 해당하는 블록이 16×16 블록 모드로써, Vertical 예측 모드인 경우에는 매크로블록③의 최우측 행에 위치하는 16개의 화소에 대한 차분 화소 데이터.Differential pixel data for the 16 pixels located in the rightmost row of the macroblock ③ when the block corresponding to the macroblock ⑥ in FIG. 5 is the 16x16 block mode and is in the vertical prediction mode.

(2) 예측 모드 데이터(2) prediction mode data

현재 매크로블록이 4×4 블록 모드일 경우에, 각 매크로블록에 대하여 도 4의 최우측 및/또는 최하측에 위치하는 4×4 블록의 예측 모드 정보를 M-Data 메모리(214)에 저장해둔다. 이를 도 4를 참조하여 설명하면, 매크로블록 5, 7, 10, 11, 13, 14, 및 15에 해당하는 7개의 4×4 블록의 예측 모드를 저장한다.When the current macroblock is in the 4x4 block mode, prediction mode information of 4x4 blocks located at the rightmost and / or bottommost side of FIG. 4 is stored in the M-Data memory 214 for each macroblock. . Referring to FIG. 4, prediction modes of seven 4 × 4 blocks corresponding to macroblocks 5, 7, 10, 11, 13, 14, and 15 are stored.

그리고 M-Data 추출기(218)에서 추출하여 영상 복호화 장치(220)로 전송하는 예측 모드 데이터는 대상 영상에서 현재 매크로블록이 해당되는 위치에 따라서 달 라질 수 있다. 예를 들어, 현재 매크로블록이 도 5의 매크로블록①에 해당할 경우에는 상기 7개의 4×4 블록의 예측 모드 모두를 전송한다. 그리고 현재 매크로블록이 도 5의 매크로블록②에 해당될 경우에는 상기 7개의 예측 모드 중에서 10, 11, 14, 및 15번 4×4 블록의 예측 모드를 전송하고, 현재 매크로블록이 도 5의 매크로블록③에 해당될 경우에는 상기 7개의 예측 모드 중에서 5, 7, 13, 및 15번 4×4 블록의 예측 모드를 전송한다. 단, 도 5의 매크로블록⑤에 해당되는 매크로블록이 16×16 블록 모드인 경우에는 도 5의 ②에 해당되는 매크로블록의 예측 모드와 도 5의 매크로블록⑥에 해당되는 매크로블록이 16×16 블록 모드인 경우에는 도 5의 ③에 해당되는 매크로블록의 예측 모드는 전송하지 않을 수 있다.The prediction mode data extracted by the M-Data extractor 218 and transmitted to the image decoding apparatus 220 may vary according to the position of the current macroblock in the target image. For example, when the current macroblock corresponds to macroblock ① of FIG. 5, all of the prediction modes of the seven 4 × 4 blocks are transmitted. When the current macroblock corresponds to the macroblock ② of FIG. 5, the prediction modes of the 4 × 4 blocks 10, 11, 14, and 15 of the seven prediction modes are transmitted, and the current macroblock is the macro of FIG. 5. In case of the block ③, the prediction modes of blocks 4, 4, 5, 7, 13 and 15 of the seven prediction modes are transmitted. However, when the macroblock corresponding to macroblock⑤ of FIG. 5 is the 16 × 16 block mode, the prediction mode of the macroblock corresponding to ② of FIG. 5 and the macroblock corresponding to the macroblock⑥ of FIG. 5 are 16 × 16. In the block mode, the prediction mode of the macroblock corresponding to 3 in FIG. 5 may not be transmitted.

(3) NNZ 데이터(3) NNZ data

현재 매크로블록이 4×4 블록 모드일 경우에, 각 매크로블록에 대하여 도 4의 최우측 및/또는 최하측에 위치하는 4×4 블록의 NNZ 데이터를 M-Data 메모리(214)에 저장해둔다. 이를 도 4를 참조하여 설명하면, 매크로블록 5, 7, 10, 11, 13, 14, 및 15에 해당하는 7개의 4×4 블록의 NNZ 데이터를 저장한다.When the current macroblock is in the 4x4 block mode, the NNZ data of the 4x4 block located at the rightmost and / or bottommost side of FIG. 4 is stored in the M-Data memory 214 for each macroblock. Referring to FIG. 4, NNZ data of seven 4 × 4 blocks corresponding to macroblocks 5, 7, 10, 11, 13, 14, and 15 is stored.

그리고 M-Data 추출기(218)에서 추출하여 영상 복호화 장치(220)로 전송하는 NNZ 데이터는 대상 영상에서 현재 매크로블록이 해당되는 위치에 따라서 달라질 수 있다. 예를 들어, 현재 매크로블록이 도 5의 매크로블록①에 해당할 경우에는 상기 7개의 4×4 블록의 NNZ 데이터 모두를 전송한다. 그리고 현재 매크로블록이 도 5의 매크로블록②에 해당될 경우에는 상기 7개의 예측 모드 중에서 10, 11, 14, 및 15 번 4×4 블록의 NNZ 데이터를 전송하고, 현재 매크로블록이 도 5의 매크로블록③에 해당될 경우에는 상기 7개의 예측 모드 중에서 5, 7, 13, 및 15번 4×4 블록의 NNZ 데이터를 전송한다.The NNZ data extracted by the M-Data extractor 218 and transmitted to the image decoding apparatus 220 may vary according to the position of the current macroblock in the target image. For example, when the current macroblock corresponds to macroblock ① of FIG. 5, all of the NNZ data of the seven 4 × 4 blocks are transmitted. When the current macroblock corresponds to macroblock ② of FIG. 5, NNZ data of 4 × 4 blocks 10, 11, 14, and 15 of the seven prediction modes are transmitted, and the current macroblock is the macro of FIG. 5. In the case of block ③, NNZ data of blocks 4, 4, 5, 7, 13, and 15 of the seven prediction modes are transmitted.

이상에서 상세하게 설명한 본 발명의 제2 실시예에 따른 영상 코덱 시스템에 의하면, 영상 부호화 서버에서 인트라 모드인 전체 영상에 대하여 화면내 예측 부호화 및 독립 부호화를 병행하여 수행함으로써, 서로 다른 값을 갖는 2개의 영상 비트스트림을 생성한다. 그리고 상기 화면내 예측 부호화에 이용된 예측 모드 데이터와 NNZ 데이터, 그리고 상기 서로 다른 2개의 부호화 방법에 따라 생성된 복호화된 화소값(차분 화소 데이터)의 차이가 M-Data로서 별도로 저장해둔다. 그리고 사용자가 상기 전체 영상의 특정 부분을 지정한 대상 영상에 대한 데이터를 요청한 경우에, 상기 영상 부호화 서버는 상기 대상 영상의 최좌측, 최상측, 및 최우측에 위치하는 매크로블록에 대해서는 제1 영상 비트스트림과 차분 화소 데이터를 전송하고, 상기 대상 영상의 나머지 매크로블록에 대해서는 제2 영상 비트스트림과 함께 예측 모드 데이터 및 NNZ 데이터를 영상 복호화 장치로 전송한다. 그리고 영상 복호화 장치는 수신된 제1 영상 비트스트림과 차분 화소 데이터를 이용하여 대상 영상의 최좌측, 최상측, 및 최우측에 위치하는 매크로블록을 복호화하고, 나머지 매크로블록에 대해서는 제2 영상 비트스트림, 예측 모드 데이터, 및 NNZ 데이터를 이용하여 복호화를 수행함으로써, 대상 영상 전체를 복원한다.According to the image codec system according to the second embodiment of the present invention described in detail above, an image encoding server performs an intra picture prediction encoding and an independent encoding on the entire image in parallel, thereby having two different values. Video bitstreams. The difference between the prediction mode data and the NNZ data used for the intra prediction encoding, and the decoded pixel values (differential pixel data) generated according to the two different encoding methods are separately stored as M-Data. When the user requests data for a target image in which a specific part of the entire image is specified, the image encoding server may include first image bits for macroblocks positioned on the leftmost, uppermost, and rightmost sides of the target image. The stream and the differential pixel data are transmitted, and the prediction mode data and the NNZ data are transmitted to the image decoding apparatus along with the second image bitstream for the remaining macroblocks of the target image. The apparatus for decoding an image decodes macroblocks located at the leftmost, uppermost, and rightmost sides of the target image by using the received first image bitstream and the differential pixel data, and performs a second image bitstream on the remaining macroblocks. , The prediction mode data, and the NNZ data are decoded to restore the entire target image.

이러한 본 발명의 제2 실시예에 따른 영상 코덱 시스템에 의하면, 대상 영상 에 대하여 제1 및 제2 영상 비트스트림과 M-Data만을 전송하기 때문에 전송 효율을 향상시킬 수 있다. 아울러, 영상 복호화 장치에서는 불필요한 영상 부분에 대해서는 복호화를 수행할 필요가 없다. 결국, 이러한 본 발명의 제2 실시예에 따른 영상 코덱 시스템은 공간임의접근을 지원하면서 한정된 전송 채널을 효율적으로 이용할 수 있기 때문에, 사용자는 전체 영상의 일부인 대상 영상에 대해서도 고화질의 영상을 시청하는 것이 가능하다.According to the video codec system according to the second embodiment of the present invention, since only the first and second video bitstreams and M-Data are transmitted with respect to the target video, transmission efficiency can be improved. In addition, the image decoding apparatus does not need to perform decoding on an unnecessary portion of the image. As a result, since the video codec system according to the second embodiment of the present invention can efficiently use a limited transmission channel while supporting a spatial random access, it is recommended that a user view a high-definition video even for a target video that is a part of the entire video. It is possible.

이상에서 설명한 바와 같이, 본 발명에 따른 공간임의접근을 지원하는 영상 코덱 시스템과 이를 구성하는 영상 부호화/복호화 장치 및 방법에 의하면, 사용자가 요청한 대상 영상에 대한 비트스트림과 이를 복호화하는데 있어서 필요한 부가 데이터(P-Data 또는 M-Data)를 최소화하여 복호화 장치로 전송하더라도, 사용자는 전체 영상에서 임의의 위치 및 크기를 갖는 대상 영상에 대하여 고화질의 영상을 시청하는 것이 가능하다. 따라서 본 발명에 의하면, 공간임의접근을 지원하면서도 전송 데이터의 양을 최소화하기 때문에 한정된 전송 채널을 효율적으로 이용할 수 있으며, 디스플레이 기기의 종류에 상관없이 3차원 영상이나 파노라마 영상 또는 일반 영상에 대하여 사용자가 원하는 특정 부분에 대하여 고해상도의 영상을 시청하는 것이 가능하다.As described above, according to the video codec system supporting the spatial random access and the video encoding / decoding apparatus and method constituting the same according to the present invention, a bitstream for the target video requested by the user and additional data necessary for decoding the same Even if the P-Data or the M-Data is minimized and transmitted to the decoding apparatus, the user can watch a high quality image with respect to a target image having an arbitrary position and size in the entire image. Therefore, according to the present invention, it is possible to efficiently use a limited transmission channel because it minimizes the amount of transmission data while supporting spatial random access. It is possible to watch high-definition images for the specific part desired.

Claims

An image encoding apparatus supporting spatial random access to a target image that is a part of an entire image,

An image encoding unit for generating an image bitstream by performing intra prediction encoding on the entire image;

A prediction data memory for storing prediction data (P-Data) used to generate a prediction block for the current block of the entire image in an intra prediction encoding process; And

And an extractor for extracting the image bitstream and the prediction data for the target image from the image bitstream for the entire image and the prediction data stored in the prediction data memory, respectively.

The image encoding apparatus of claim 1, wherein the prediction data includes a decoded pixel value of a pixel adjacent to the current block.

The data storage device of claim 2, wherein the prediction data memory stores prediction mode data of a block adjacent to the current block and / or number of non-zero coefficients (NNZ) data of a block adjacent to the current block. And a video encoding apparatus supporting a spatial random access.

The image extractor of claim 1, wherein the extractor comprises: an image bitstream extractor for extracting an image bitstream of the target image from among image bitstreams of the entire image, and prediction data of the target image among prediction data of the entire image; And a predictive data extractor for extracting the data.

The spatial random access method of claim 4, wherein the predictive data extractor extracts predicted data of the uppermost, leftmost, and / or rightmost macroblocks of the target image from the predictive data of the entire image. Image encoding apparatus that supports the.

delete

A first image encoding unit for generating a first image bitstream by independently encoding the entire image in units of blocks having a predetermined size;

A second image encoding unit for generating a second image bitstream by performing intra prediction encoding on the entire image;

Inconsistent data that is a difference value between the decoded first pixel value of the first image bitstream generated by the first image coding unit and the decoded second pixel value of the second image bitstream provided by the second image coding unit. A mismatch data memory for obtaining and storing (Mismatch Data, M-Data) in units of blocks;

For the uppermost, leftmost, and / or rightmost block of the target image, an image bitstream is extracted from the first image bitstream, and the uppermost, leftmost, and rightmost blocks among the blocks of the target image are extracted. A video bitstream extractor for extracting a bitstream from the second video bitstream for the non-blocks; And

And an inconsistency data extractor for extracting the inconsistency data for the uppermost, leftmost, and / or rightmost blocks of the target image.

10. The image encoding apparatus of claim 9, wherein the difference value for the pixels located in the lowermost column and the rightmost row of each block is stored in the mismatched data memory.

10. The method of claim 9, wherein the mismatched data memory further includes prediction mode data of an adjacent block and / or data of number of non-zero coefficients (NNZ) of an adjacent block. Image encoding apparatus that supports the.

An image decoding apparatus supporting spatial random access to a target image that is a part of an entire image,

The first region of the target image performs independent decoding in units of blocks having a predetermined size, and the second region of the target image performs intra prediction prediction decoding in units of blocks of a predetermined size, and the intra region of the second region is in-screen. If the decoded pixel value of the neighboring block used for predictive decoding is a pixel belonging to the first region, a space including an image decoding unit for performing intra prediction prediction decoding after adding mismatched data to the independent decoded pixel value Image decoding apparatus that supports random access.

The image decoding apparatus of claim 12, wherein the first area includes blocks positioned at the top, leftmost, and / or rightmost sides of the target image.

The image of claim 13, wherein the pixel value to which the mismatch data is added is a pixel value of pixels located in the rightmost row and / or the lowest column in each block belonging to the first region. Decryption device.

An image codec system comprising the image encoding device of claim 9 and the image decoding device of claim 12.

An image encoding method for supporting a spatial random access to a target image that is a part of an entire image,

Storing prediction data (Prediction Data, P-Data) used to generate a prediction block in the process of performing the intra prediction encoding by generating an image bitstream by performing intra prediction encoding on the entire image. Image encoding step; And

And extracting and transmitting the image bitstream and the prediction data of the target image from the image bitstream and the prediction data, respectively, when the user requests a transmission of the target image. Coding method.

An image decoding method for supporting a spatial random access to a target image that is a part of an entire image,

Receiving prediction data together with an image bitstream in the target image;

Generating a prediction block using the prediction data; And

And reconstructing the target image using the image bitstream and the prediction block.

Generating a first image bitstream by independently encoding the entire image in units of blocks having a predetermined size;

Generating a second image bitstream by performing intra prediction encoding on the entire image;

Obtaining and storing mismatch data (M-Data), which is a difference value between the decoded first pixel value of the first video bitstream and the decoded second pixel value of the second video bitstream, in block units;

When there is a request for transmission of the target video from a user, among the first video bitstream, the second video bitstream, and the mismatched data, the uppermost, leftmost, and / or rightmost block of the target video is the same. Extracting an image bitstream from a first image bitstream, extracting an image bitstream from the second image bitstream for blocks other than the uppermost, leftmost, and rightmost blocks among the blocks of the target image; And extracting and transmitting the mismatched data with respect to the uppermost, leftmost, and / or rightmost blocks of the target image.

Receiving a first video bitstream, a second video bitstream, and mismatch data in the target video;

Restoring pixel values of the first region of the target image by decoding the first image bitstream;

Correcting a pixel value for the restored first region by using the mismatched data;

Restoring the pixel value of the second region of the target image by performing intra prediction prediction decoding using the corrected pixel value of the first region and the second image bitstream. Supported video decoding method.