KR20140038319A - Device and method for inter-layer prediction of multi-layer video - Google Patents

Device and method for inter-layer prediction of multi-layer video Download PDF

Info

Publication number
KR20140038319A
KR20140038319A KR1020130110448A KR20130110448A KR20140038319A KR 20140038319 A KR20140038319 A KR 20140038319A KR 1020130110448 A KR1020130110448 A KR 1020130110448A KR 20130110448 A KR20130110448 A KR 20130110448A KR 20140038319 A KR20140038319 A KR 20140038319A
Authority
KR
South Korea
Prior art keywords
unit
layer
video
prediction
partition structure
Prior art date
Application number
KR1020130110448A
Other languages
Korean (ko)
Inventor
장형문
안용조
심동규
Original Assignee
광운대학교 산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 광운대학교 산학협력단 filed Critical 광운대학교 산학협력단
Publication of KR20140038319A publication Critical patent/KR20140038319A/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/34Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/53Multi-resolution motion estimation; Hierarchical motion estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to a decoder for a multi-layer video, an inter-layer prediction device for a multi-layer video and an inter-layer prediction method for a multi-layer video. The decoder for a multi-layer video according to the present invention comprises an inter-layer division structure prediction part for determining a division structure of an improvement layer by referring to the division information of a reference layer and an inter/intra prediction part for determining a prediction mode based on the division structure of the improvement layer. The inter-layer prediction method for a multi-layer video comprises: a division structure use decision step for determining the use of reference division information when the video of the improvement layer is divided based on the division information of the reference layer; a corresponding position structure selection step for determining a division unit of the reference layer of a position corresponding to the division unit of the improvement layer; and a division structure prediction level selection step for selecting the reference of the division information of the coding unit and converting unit of the reference layer and transmitting the same to a dequantization part. [Reference numerals] (101,201) Entropy decoding part; (102,202) Dequantization part; (103,203) Inversion part; (104,204) Motion compensation part; (105,205) Intra prediction part; (106,206) Deblocking filter part; (107,207) Sample adaptable offset part; (108,208) Restoration picture buffer; (300) Demultiplexer; (400) Inter-layer division structure prediction part; (AA) Bit stream

Description

Inter-layer prediction apparatus and method for multi-layer video {DEVICE AND METHOD FOR INTER-LAYER PREDICTION OF MULTI-LAYER VIDEO}

The present invention relates to video decoding, and more particularly, to a technique for decoding and compressing multi-layer video at high speed based on high efficiency video coding (HEVC), a next-generation video compression standard technology.

High Efficiency Video Coding (HEVC) is a next-generation compression standard technology developed by Joint Collaborative Team on Video Coding (JCT-VC) jointly organized by ISO / IEC MPEG and ITU-T VCEG. It is known that the HEVC main profile has about twice the compression performance compared to the conventional H.264 / AVC high profile. The standardization of HEVC was completed in February 2013, and a standard for effective encoding of multi-layer video based on HEVC will be additionally established.

In the standardization work for providing so-called scalability of video coding, various multimedia services are activated as the wireless network and the Internet are developed at a very high speed. In particular, when a compression coding technique is only developed in the emergence of a broadcasting communication convergence network (QoS) in various conditions of multimedia generation, transmission and consumption environment.

Scalable Video Coding (SVC) technology is a technique for converting an image having different kinds of resolution (Spatial), quality and frame rate in one compressed bitstream into various terminals and network environment So that it can be restored adaptively. SVC is a video codec that provides a hierarchical structure adaptable to various multimedia devices at a high compression ratio of H.264 / AVC. It is a Joint Video Team (JVT) as an amendment of H.264 / MPEG-4 PART 10, Standardization is underway. That is, standardization is proceeding with an extension version for HEVC.

HEVC uses Coding Unit (CU), Prediction Unit (PU) and Transform Unit (TU). Unlike existing video codecs, CU and TU have a hierarchical block structure based on QuadTree. And a long direction, thereby improving the coding efficiency. These HEVC quad-tree-based block structures and various shape PUs cause high complexity of HEVC coding. Therefore, a more efficient video compression technique is needed by using a method of eliminating high complexity in multi-layer video coding.

An object of the present invention is to improve decoding performance by using partition structure information used in decoding of a reference layer in an enhancement layer. It is an object of the present invention to provide a method and apparatus for predicting a partition structure between layers by using similarity between multi-layer video and speeding up video decoding of an enhancement layer by using partition structure information of a reference layer.

In general, not only temporal scalable video but also spatial scalable video and quality scalable video have similar characteristics, so that the degree or shape of segmentation has a similar shape. Instead of expressing the split information of the enhancement layer as a split flag using this similarity, the purpose of the present invention is to efficiently decode the bit by reducing the split information of the reference layer.

A decoder for multi-layer video according to an aspect of the present invention for solving the above problems is a layer that determines the partition structure of the current image of the enhancement layer by referring to the partition structure information of the reference image constituting the video of the reference layer. And a prediction unit configured to determine an enhancement prediction mode of the enhancement prediction unit of the enhancement layer based on the determined partition structure of the current image of the enhancement layer.

The decoder for the multi-layer video may further include an entropy decoder which transmits segmentation structure information of the reference image generated by entropy decoding the bitstream of the video of the reference layer to an interlayer partition structure prediction unit. .

Here, the decoder for the multi-layer video may include an enhancement transform unit constituting the video of the enhancement layer determined based on the enhancement prediction mode and the partition structure information of the reference transform unit constituting the video of the reference layer. The inverse quantization unit for performing inverse quantization according to the partition structure of may be configured to further include.

Here, the partition structure of the reference picture and the current picture may be a partition structure of a coding unit.

In addition, the coding unit, the prediction unit, and the transformation unit may each include a coding block, a prediction block, and a transform block, and blocks constituting the same unit may be divided into different division structures.

According to another aspect of the present invention for solving the above problems, the inter-layer partition structure prediction method for multi-layer video, in a method performed by the inter-layer partition structure prediction apparatus for multi-layer video, the video of the reference layer A partition structure determination step of determining whether to use the reference partition structure information when splitting the video of the enhancement layer based on the reference partition structure information for each unit to be split, and the video of the enhancement layer according to the determination that the reference partition structure information is used. A corresponding position structure selection step of determining a unit for dividing a video of a reference layer at a position corresponding to a unit for dividing the data; and selecting whether to refer to split structure information of a coding unit and a transformation unit among units for dividing a video of the reference layer Segmentation structure prediction level selection step sent to the inverse quantization unit or the prediction unit And the like.

Here, in the partition structure determination step, split flag information corresponding to the determination that reference partition structure information is not used may be transmitted to the inverse quantization unit.

Here, the unit for dividing the video of the enhancement layer and the unit for dividing the video of the reference layer include a prediction block and a transform block as the minimum unit and a sequence as the maximum unit. can do.

Here, in the step of selecting the partition structure prediction level, when selecting to refer to the partition structure information of the coding unit, the motion compensation unit and the intra predictor are prediction units of the enhancement layer based on the partition structure information of the coding unit. When adjusting to determine the prediction mode of the control unit and selecting to refer to the partition structure information of the transform unit, the inverse quantization unit may be adjusted to perform inverse quantization based on the partition structure information of the transform unit.

Here, when the reference depth use flag of the coding unit is 1, the split structure prediction level selection step divides the coding unit of the enhancement layer to the same depth as the reference depth of the coding unit, and the reference depth use flag of the coding unit is 0. If, the split flag information is transmitted to the inverse quantization unit, and the inverse quantization unit may be adjusted to perform inverse quantization based on the partition structure information of the transform unit.

Here, the coding unit and the transform unit may each include a coding block and a transform block, and blocks constituting the same unit may be divided into different division structures.

According to another aspect of the present invention, an apparatus for predicting an inter-layer partition structure for multilayer video according to another aspect of the present invention may be provided when partitioning a video of an enhancement layer based on reference partition structure information for each unit for splitting a video of a reference layer. A partition structure usage determining unit that determines whether to use the reference partition structure information, and a reference layer of a position corresponding to a unit for dividing a video of an enhancement layer according to a determination that reference partition structure information is used. A corresponding position structure selection unit for determining a unit for dividing a video, and a division structure for selecting whether to refer to split structure information of a coding unit and a transformation unit among units for dividing a video of a reference layer and transmitting the information to a dequantization unit or a prediction unit It may be configured to include a prediction level selector.

Here, the partition structure usage determining unit may transmit split flag information corresponding to the determination that reference partition structure information is not used, to the dequantization unit.

Here, the unit for dividing the video of the enhancement layer and the unit for dividing the video of the reference layer include a prediction block and a transform block as the minimum unit and a sequence as the maximum unit. can do.

Here, when the partition structure prediction level selector selects to refer to the partition structure information of the coding unit, the motion compensation unit and the intra predictor determine the prediction unit of the enhancement layer based on the partition structure information of the coding unit. When adjusting to determine the prediction mode and selecting to refer to the partition structure information of the transform unit, the inverse quantization unit may be adjusted to perform inverse quantization based on the partition structure information of the transform unit.

Here, if the reference depth use flag of the coding unit is 1, the split structure prediction level selector splits the coding unit of the enhancement layer to the same depth as the reference depth of the coding unit, and if the reference depth use flag of the coding unit is 0. In addition, the split flag information may be transmitted to the inverse quantizer to adjust the inverse quantizer to perform inverse quantization based on the partition structure information of the transform unit.

Here, the coding unit and the transformation unit may each include a coding block and a transformation block, and blocks constituting the same unit may be divided into different division structures.

By using the inter-layer decoding apparatus and method for multi-layer video according to the embodiment of the present invention as described above, it is possible to reduce the high decoder complexity that occurs during the decoding process using blocks of various sizes and shapes of HEVC. There is an advantage. In other words, decoding performance may be improved by using the partition structure of the coding unit and the transform unit of the reference layer as the partition structure of the enhancement layer by using the similarity of the images between the layers.

In general, not only temporal scalable video but also spatial scalable video and quality scalable video have similar characteristics, so that the degree or shape of segmentation has a similar shape. By using the similarity, instead of expressing the split information of the enhancement layer as a split flag, it is advantageous to efficiently decode by reducing the bit by decoding using the split information of the reference layer.

1 is a block diagram illustrating a decoder for multi-layer video for decoding in an enhancement layer using partition structure information of a reference layer in HEVC based SVC according to an embodiment of the present invention.
2 is a flowchart illustrating an operation of an inter-layer partition structure prediction apparatus for multi-layer video according to an embodiment of the present invention.
3 is a conceptual diagram illustrating a unit for dividing a video according to an embodiment of the present invention.
4 is an exemplary diagram for describing an example of an operation of an inter-layer partition structure prediction apparatus for multilayer video, according to an embodiment of the present invention.
5 is a conceptual diagram illustrating a corresponding position of an inter-layer image (an image having a different size) performed by the corresponding position structure selection unit 420 according to an embodiment of the present invention.
6 is a conceptual diagram illustrating a corresponding position of an inter-layer image (image of the same size) performed by the corresponding position structure selection unit 420 according to an embodiment of the present invention.
7 is a flowchart illustrating a method of determining whether to use a coding unit split structure of the split structure prediction level selector 430 according to an embodiment of the present invention.
8 is a flowchart illustrating a method of determining whether to use a transform unit split structure of the split structure prediction level selector 430 according to an embodiment of the present invention.
9 is a block diagram illustrating an apparatus for predicting inter-layer partitioning structure for multilayer video and its components according to an embodiment of the present invention.

While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail.

It should be understood, however, that the invention is not intended to be limited to the particular embodiments, but includes all modifications, equivalents, and alternatives falling within the spirit and scope of the invention.

The terms first, second, etc. may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as a second component, and similarly, the second component may also be referred to as a first component. And / or < / RTI > includes any combination of a plurality of related listed items or any of a plurality of related listed items.

It is to be understood that when an element is referred to as being "connected" or "connected" to another element, it may be directly connected or connected to the other element, . On the other hand, when an element is referred to as being "directly connected" or "directly connected" to another element, it should be understood that there are no other elements in between.

The terminology used in this application is used only to describe a specific embodiment and is not intended to limit the invention. The singular expressions include plural expressions unless the context clearly dictates otherwise. In the present application, the terms "comprises" or "having" and the like are used to specify that there is a feature, a number, a step, an operation, an element, a component or a combination thereof described in the specification, But do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, or combinations thereof.

Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Terms such as those defined in commonly used dictionaries should be interpreted as having a meaning consistent with the meaning in the context of the relevant art and are to be interpreted in an ideal or overly formal sense unless explicitly defined in the present application Do not.

First, the terms used in the present application will be briefly described as follows.

The decoder (Video Decoding Apparatus) to be described below is a personal computer (PC), a notebook computer, a personal digital assistant (PDA), a portable multimedia player (PMP), a PlayStation Portable ( It may be a device included in a server terminal such as a PSP, PlayStation Portable), a wireless communication terminal, a smart phone, a TV application server, and a service server, and communicates with a user terminal such as various devices or a wired / wireless communication network. A communication device such as a communication modem for performing the operation, a memory for storing various programs and data for inter- or intra-prediction for decoding or decoding an image, a microprocessor for executing and operating a program, and the like. It can mean a variety of devices.

In addition, the image encoded in the bitstream by the encoder is real-time or non-real-time through the wired or wireless communication network, such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network or the like, cable, universal serial bus (USB, It may be transmitted to a video decoder through various communication interfaces such as a universal serial bus), decoded, reconstructed, and played back.

The multi-layer video refers to a video in which a compressed bit stream is hierarchically structured so that it can be decoded at an arbitrary bit rate. A single layer decoder decodes only one bitstream that supports only one bit rate, frame rate, and image size, whereas a decoder for multi-layer video can support scalability for various bit rates, frame rates, and image sizes.

In the SVC standard, one bitstream is decoded into several video layers, and each layer has a respective bit rate, frame rate, image size, and image quality. That is, one bitstream may be composed of lower layers and upper layers that are scalable. In general, an upper layer can be encoded to have a higher image quality than a video made of previous lower layers.

As used in the present application, an enhancement layer may mean an upper layer described above, and a reference layer may mean a lower layer. In addition, the enhancement coding unit refers to a coding unit of a video of an enhancement layer that is currently decoded. The reference coding unit refers to a coding unit of a video of a reference layer that can be referred to when decoding an enhancement coding unit.

In general, a video may be composed of a series of pictures, and each picture may be divided into a predetermined area such as a block. In addition, a person having ordinary skill in the art to which the present invention belongs may use the term picture as described below to be replaced with another term having an equivalent meaning such as an image or a frame. I can understand.

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. In order to facilitate the understanding of the present invention, the same reference numerals are used for the same constituent elements in the drawings and redundant explanations for the same constituent elements are omitted.

1 is a block diagram illustrating a decoder for multi-layer video for decoding in an enhancement layer using partition structure information of a reference layer in HEVC based SVC according to an embodiment of the present invention.

Referring to FIG. 1, a decoder for a multilayer video may include an inter-layer split structure predictor configured to determine a split structure of a current video of an enhancement layer by referring to split structure information of a reference picture constituting a video of a reference layer. 400 and prediction units 204 and 205 for determining an enhancement prediction mode of an enhancement prediction unit of the enhancement layer based on the determined partition structure of the current image of the enhancement layer.

The decoder for multi-layer video further includes an entropy decoder 101 which transmits segmentation structure information of the reference image, which is generated by entropy decoding the bitstream of the video of the reference layer, to the interlayer partition structure prediction unit 400. Can be configured. Inverse quantization is performed according to the partition structure of the enhancement transform unit constituting the video of the enhancement layer determined based on the enhancement prediction mode and the partition structure information of the reference transform unit constituting the video of the reference layer. It may be configured to further include an inverse quantization unit 202 to perform.

The interlayer prediction apparatus for the multilayer video may be configured in a form included in the decoder for the multilayer video, but may be configured in a form independent of the decoder. The signal input to the demultiplexer 300 as a multiplexed bitstream may be divided into a reference layer and an enhancement layer, decoded by a video decoder of each layer, and reproduced as an image.

Although FIG. 1 illustrates an example of two layers as an embodiment of the present invention, it is to be understood that any interlayer prediction apparatus targeting two or more layers may be realized by those skilled in the art. It will be possible.

The entropy decoders 101 and 201 may entropy decode the quantized values of the bitstreams distributed by the demultiplexer 300. In addition, the quantization value may be entropy decoded using Context-Adaptive Variable Length Coding (CAVLC) or Context-Adaptive Binary Arithmetic Coding (CABAC), and may also decode other information encoded during entropy encoding. .

The inverse quantization units 102 and 202 may inverse quantize the entropy decoded quantization value. That is, the inverse quantization units 102 and 202 can restore the value (frequency coefficient) of the frequency domain from the quantization value.

The inverse transformers 103 and 203 reconstruct the residual image by converting the frequency domain values (frequency coefficients) provided from the inverse quantizers 102 and 202 from the frequency domain to the spatial domain, and then perform intra prediction or inter prediction. The reconstructed image of the input image is generated by adding the residual image reconstructed by the inverse transformers 103 and 203 to the generated prediction image.

The deblocking filter units 106 and 206 perform filtering to reduce block distortion generated when the image is encoded, on the reconstructed image. Reduces visible distortion

The sample adaptive offset (SAO) unit 107 or 207 divides an image into a plurality of SAO regions on a quadtree basis, and transmits an offset value for error correction in each region unit. The error can be corrected.

The reconstructed picture buffers 108 and 208 store the reconstructed image in which errors are corrected by the deblocking filter units 106 and 206 and the sample adaptive offset units 107 and 207. In addition, partition structure information for each partition unit of each layer may be stored.

The predictor may include an intra predictor 105 and 205 and a motion compensator 104 and 204. The intra predictors 105 and 205 perform intra prediction, and the motion compensators 104 and 204 compensate for motion vectors for inter prediction, that is, inter prediction. 300 receives the multiplexed bitstream and distributes the multiplexed bitstream into bitstreams of the reference layer and the enhancement layer.

The main components of the HEVC-based SVC video decoder proposed by the present invention will be described in more detail as follows. The inverse quantization unit 202 of the enhancement layer may perform inverse quantization on the residual signal according to the partition structure of the transform unit of the enhancement layer determined by referring to the partition structure of the transform unit of the reference layer for inverse quantization. The inverse transform unit 203 may inversely transform the residual signal subjected to inverse quantization. The interlayer partition structure prediction unit 400 may determine the coding unit partition structure of the enhancement layer by using the coding unit partition structure of the reference layer. The intra prediction unit 205 and the motion compensation unit 204 of the enhancement layer may generate prediction values through the mode of the prediction unit in the corresponding partition structure for the determined coding unit of the enhancement layer.

The partition structure of the reference picture and the current picture may be a partition structure of a coding unit. In addition, the coding unit, the prediction unit, and the transformation unit may each include a coding block, a prediction block, and a transform block, and blocks constituting the same unit may be divided into different division structures.

In an HEVC-based SVC, a so-called CU (Coding Unit) is referred to as including both a luminance component (luma sample) on one picture and a coding block of two chrominance components (chroma sample). Therefore, in the case of a monochrome picture having only a luminance component, the coding unit indicates an object that is substantially the same as the coding block of the luminance component.

The encoding block, the prediction block, and the transform block are blocks of a luminance component for a coding unit (CU), a prediction unit (PU), a transform unit (TU), and a transform unit (HEVC) Lt; / RTI > The enhancement block and the reference block may also be an encoding block, a prediction block, or a transform block.

2 is a flowchart illustrating an operation of an inter-layer partition structure prediction apparatus 400 for multi-layer video according to an embodiment of the present invention.

Referring to FIG. 2, in the method performed by the inter-layer partition structure prediction apparatus 400 for the multi-layer video, the inter-layer partition structure prediction method for the multi-layer video includes unit-by-unit reference division for splitting the video of the reference layer. A segmentation structure use determination step of determining whether to use the reference partition structure information when segmenting the video of the enhancement layer based on the structure information (S210) and segmenting the video of the enhancement layer according to the determination that the reference partition structure information is used. A corresponding position structure selection step (S230) for determining a unit for dividing the video of the reference layer at the position corresponding to the unit; and whether to refer to the partition structure information of the coding unit and the transformation unit among the units for dividing the video of the reference layer; And a segmentation structure prediction level selection step (S240) of selecting and transmitting to the dequantization unit 202 or the prediction units 204 and 205. Can. In the partition structure determination step (S210), split flag information corresponding to the determination that reference partition structure information is not used may be transmitted to the inverse quantization unit 202.

An operation of the inter-layer partition structure prediction apparatus 400 for the multilayer video will be described with reference to FIGS. 4 to 6, which will be described below with reference to one embodiment.

3 is a conceptual diagram illustrating a unit for dividing a video according to an embodiment of the present invention.

Referring to FIG. 3, a unit for dividing a video of an enhancement layer and a unit for dividing a video of a reference layer include a prediction block and a transform block as a minimum unit and a sequence as a maximum unit. You can do

A sequence refers to a series of bits forming a coded picture on a NAL unit stream. Therefore, the sequence unit may be a larger concept or a smaller concept than a picture which is one of temporally consecutive images. In the present invention, the sequence refers to a series of bits representing all or part of one layer and may be a unit larger or equal to a picture.

A unit for dividing a video according to an embodiment of the present invention has been described as a sequence, a slice, and a coding tree block (CTB) in the following description of FIG. 4. That is, how to define and operate a unit for dividing a video only affects decoding efficiency in decoding an encoded image, and is not limited to any particular division unit. In other words, a unit for dividing a video may be defined based on the inclusion relationship of each division unit listed in FIG. 3.

4 is an exemplary diagram for describing an example in which an inter-layer partition structure prediction apparatus 400 operates for multilayer video according to an embodiment of the present invention, and FIG. 5 is an inter-layer image performed by a corresponding position structure selector ( It is a conceptual diagram for explaining the corresponding position of the image of different size. 6 is a conceptual diagram illustrating a corresponding position of an inter-layer image (image of the same size) performed by the corresponding position structure selecting unit.

Referring to FIG. 4, the partition structure usage determining unit 410 may refer to an inter_layer_structure_prediction_sequence_enable_flag syntax element decoded in sequence units through the entropy decoding unit 101 (S410). When the decoded value of the syntax element is 1, inter-layer structure division prediction may be used in the sequence (S430). If the decoded value of the syntax element is 0, the sequence may be decoded by referring to the split flag syntax element in the enhancement layer without using inter-layer structured prediction (S420). When the Inter_layer_structure_prediction_sequence_enable_flag syntax element is 1, the syntax element may be descended to a slice unit, which is a unit constituting the sequence, to refer to the syntax element (S430). With reference to the inter_layer_structure_prediction_slice_enable_flag syntax element decoded in units of slices by the entropy decoding unit 101, when the syntax element is 1, the slice of the sequence may use inter-layer structure partition prediction (S450). If the syntax element is 0, the slice may be decoded by referring to the split flag syntax element without using inter-layer structure division prediction (S440). When all of the above-described syntax elements are 1, the inter_layer_structure_prediction_CTB_enable_flag syntax element decoded through the entropy decoding unit 101 may be referred to (S450). If the syntax element is 1, the CTB may perform structure division of the enhancement layer using structure division prediction between layers (S470). If the syntax element is 0, the CTB may decode using the split flag (S460).

Image sizes between layers are the same or different according to coding schemes such as temporal scalable video coding, quality scalable video coding, and spatial scalable video coding of SVC. 5 to 6 show corresponding positions according to differences in image sizes between layers. For example, it is necessary to calculate the corresponding position because images between layers supporting spatial scalable video are different. On the other hand, coding methods such as temporal scalable video and quality scalable video do not need to calculate the structure division according to each position because the images have the same size.

7 is a flowchart illustrating a method of determining whether to use a coding unit split structure of the split structure prediction level selector 430 according to an embodiment of the present invention, and FIG. A flowchart illustrating a method of determining whether to use a transform unit split structure of the split structure prediction level selector 430 is described.

7 to 8, in the segmentation prediction level selection step, when the segmentation structure information of the coding unit is selected to refer to the segmentation structure information, the motion compensation unit 204 and the intra prediction unit 205 divide the structure information of the coding unit. If it is adjusted to determine the prediction mode of the prediction unit (prediction unit) of the enhancement layer on the basis of, and select to refer to the partition structure information of the transform unit, the dequantization unit 202 is partitioned of the transform unit In the case of adjusting to perform inverse quantization based on the structure information and selecting to refer to split structure information of the coding unit and the transformation unit, the operation may be performed to perform both operations.

In the split structure prediction level selection step (S240), if the reference depth use flag of the coding unit is 1, the coding unit of the enhancement layer is split to the same depth as the reference depth of the coding unit (S730), and the reference depth use flag of the coding unit is used. If 0, split flag information is transmitted to the inverse quantization unit 202, and the inverse quantization unit 202 may be adjusted to perform inverse quantization based on the partition structure information of the transform unit (S720).

Referring to FIG. 7, the inter_layer_CU_depth_use_flag syntax element decoded by the entropy decoding unit 101 may be referred to (S710). If the syntax element is 1, the corresponding CTB may be divided into the same partitioning structure as the corresponding CTB of the reference layer (S730). On the other hand, if the syntax element is 0, the CTB splits the structure using the split flag, and the split structure of the Transform Unit may refer to the Transform Unit split structure of the reference layer (S720). When the syntax element is 0, the reason for using the transform unit split structure of the reference layer without decoding the inter_layer_TU_depth_use_flag for splitting the transform unit of the split structure prediction level selector 430 of FIG. Since inter-layer_styructure_prediction_CTB_enable_flag is 1, the current CTB can determine whether inter_layer_TU_detph_use_flag is 0 in the same manner as inter_layer_TU_depth_use_flag is 1.

8 illustrates a method of determining whether to use a transform unit partition structure of the partition structure prediction level selector 430 according to an embodiment of the present invention. Referring to FIG. 8, the inter_layer_TU_depth_use_flag syntax element decoded by the entropy decoding unit 101 may be referred to. If the syntax element is 1, the transform structure block may be split into the same structure as the corresponding CU of the reference layer. On the other hand, if the syntax element is 0, the CU can be split using the split flag.

9 is a block diagram illustrating an inter-layer partition structure prediction apparatus 400 and its components for multi-layer video according to an embodiment of the present invention.

Referring to FIG. 9, the inter-layer partition structure prediction apparatus 400 for the multilayer video may use the reference partition structure information when splitting the video of the enhancement layer based on the reference partition structure information for each unit that splits the video of the reference layer. The partition structure usage determining unit 410 for determining whether to use the reference partition structure information of the partition structure usage determining unit 410, or the reference layer at a position corresponding to a unit for dividing the video of the enhancement layer. The corresponding position structure selector 420 determines a unit for dividing the video, and the dequantization unit 202 or the prediction is performed by selecting whether to refer to the division structure information of the coding unit and the transformation unit among the units for dividing the video of the reference layer. And a divisional structure prediction level selection unit 430 for transmitting to the units 204 and 205.

The partition structure usage determining unit 410 may transmit split flag information corresponding to the determination that reference partition structure information is not used, to the dequantization unit 202.

Referring to FIG. 9, the inter-layer partition structure prediction apparatus 400 may include a partition structure use determiner 410, a corresponding position structure selector 420, and a partition structure prediction level selector 430. have. The partition structure determination unit 410 may determine whether to determine the partition structure of the enhancement layer by using the partition structure of the reference layer as a unit for dividing the video. If the split structure usage determining unit 410 is selected not to use the inter-layer split structure, the split unit may be used to split the coding unit and the transform unit.

The corresponding position structure selection unit may calculate and select a corresponding position of the reference layer corresponding to a unit for dividing the video of the enhancement layer when the partition structure use determination unit 410 selects a mode using inter-layer partition structure prediction. . The partition structure prediction level selector 430 determines whether to refer to a coding unit partition structure, a transform unit partition structure, or to refer to both a coding unit partition structure and a transform unit partition structure. If the partition structure determination unit decides to use only the coding unit partition structure, the coding unit partition structure is selected to determine the mode of the prediction unit for making the prediction values of the motion compensator 204 and the intra predictor 205 of FIG. 1. To pass. If it is decided to use only the transform unit partition structure, the selected transform unit partition structure is transmitted to decode the residual signal through the inverse quantization unit 202 and the inverse transform unit 203 of FIG. 1. In addition, if the coding unit and the transform unit partition structure are selected to use both, the coding unit partition structure and the transform unit partition structure are transferred to perform both of the above-described operations.

The unit for dividing the video of the enhancement layer and the unit for dividing the video of the reference layer may have a prediction block and a transform block as a minimum unit and a sequence as a maximum unit.

The sequence unit may be a larger concept or a smaller concept than a picture which is one of temporally consecutive images. In the present invention, the sequence refers to a series of bits representing all or part of one layer and may be a unit larger or equal to a picture. The unit for dividing the video has been described above in the description of FIG. 4 and thus will not be redundantly described.

When selecting to refer to the partition structure information of the coding unit, the motion compensator 204 and the intra prediction unit 205 predict the prediction mode of the prediction unit of the enhancement layer based on the partition structure information of the coding unit. mode), and when selecting to refer to the partition structure information of the transform unit, the inverse quantization unit 202 is adjusted to perform inverse quantization based on the partition structure information of the transform unit, and the coding unit and the transform unit In the case of selecting to refer to all the partition structure information of, it may be adjusted to perform both operations.

If the reference depth use flag of the coding unit is 1, the partition structure prediction level selector 430 splits the coding unit of the enhancement layer to the same depth as the reference depth of the coding unit, and if the reference depth use flag of the coding unit is 0. The split flag information may be transmitted to the inverse quantization unit 202 to adjust the inverse quantization unit 202 to perform inverse quantization based on the partition structure information of the transform unit.

The coding unit and the transformation unit each include a coding block and a transformation block, and blocks constituting the same unit may be divided into different division structures.

Although some aspects have been described in terms of methods, it is clear that these aspects represent descriptions of corresponding devices, where the devices correspond to the steps of the method. According to certain implementation requirements, embodiments of the invention may be implemented in hardware or software. Embodiments of the present invention may be performed as a computer program product having program code operative for performing one of the program codes, methods.

It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined in the appended claims. It will be possible.

101, 201: entropy decoding unit 102, 202: inverse quantization unit
103, 203: inverse transform unit 104, 204: motion compensation unit
105, 205: intra prediction unit 106, 206: deblocking filter unit
107 and 207 sample adaptive offsets 108 and 208 reconstructed picture buffer
300: demultiplexer 400: inter-layer partition structure prediction unit (device)
410: partition structure use determination unit 420: corresponding position structure selection unit
430: partition structure prediction level selection unit

Claims (17)

An inter-layer partition structure prediction unit for determining a partition structure of the current image of the enhancement layer by referring to the partition structure information of the reference picture configuring the video of the reference layer; And
And a predictor configured to determine an enhancement prediction mode of an enhancement prediction unit of the enhancement layer based on the determined division structure of the current image of the enhancement layer.
The method according to claim 1,
The decoder for the multilayer video
And an entropy decoder configured to transmit segmentation structure information of the reference image generated by entropy decoding the bitstream of the video of the reference layer to the interlayer partition structure prediction unit.
The method according to claim 1,
The decoder for the multilayer video
Inverse quantization is performed according to the partition structure of the enhancement transform unit constituting the video of the enhancement layer determined based on the enhancement prediction mode and the partition structure information of the reference transform unit constituting the video of the reference layer. A decoder for multi-layer video, characterized by further comprising an inverse quantization unit to perform.
The method according to claim 1,
And a partition structure of the reference picture and the current picture is a partition structure of a coding unit.
The method according to claim 3 or 4,
The coding unit, the prediction unit and the transformation unit
And a block comprising a coding block, a prediction block, and a transform block, wherein the blocks constituting the same unit may be divided into different partition structures, respectively.
In the method performed by the inter-layer partition structure prediction apparatus for multilayer video,
A partition structure usage determination step of determining whether to use the reference partition structure information when partitioning the video of the enhancement layer based on the reference partition structure information for each unit for splitting the video of the reference layer;
A corresponding position structure selection step of determining a unit of dividing video of the reference layer at a position corresponding to a unit of dividing video of the enhancement layer according to the determination that the reference division structure information is used; And
An inter-layer division for multi-layer video, comprising a step of selecting a partition structure prediction level of transmitting a dequantization unit or a prediction unit by selecting whether to refer to split structure information of a coding unit and a transform unit among the units of splitting the video of the reference layer. Structure prediction method.
The method of claim 6,
The use of the partition structure determination step
And split flag information corresponding to the determination that the reference split structure information is not used is transmitted to the inverse quantization unit.
The method of claim 6,
The unit for dividing the video of the enhancement layer and the unit for dividing the video of the reference layer are
A method for predicting inter-layer partitioning structure for multilayer video, characterized by using a prediction block and a transform block as a minimum unit and a sequence as a maximum unit.
The method of claim 6,
The partition structure prediction level selection step
When selecting to refer to the partition structure information of the coding unit, the motion compensator and the intra prediction unit determine a prediction mode of the prediction unit of the enhancement layer based on the partition structure information of the coding unit. Adjust it to
And selecting an inverse quantization unit to perform inverse quantization based on the divided structure information of the transform unit when selecting to refer to the divided structure information of the transform unit.
The method of claim 6,
The partition structure prediction level selection step
If the reference depth use flag of the coding unit is 1, the coding unit of the enhancement layer is divided into the same depth as the reference depth of the coding unit,
If the reference depth use flag of the coding unit is 0, split flag information is transmitted to the inverse quantizer and adjusted so that the inverse quantizer performs inverse quantization based on the partition structure information of the transform unit. An interlayer partition structure prediction method for multilayer video.
The method of claim 6,
The coding unit and the transformation unit
A method of predicting an inter-layer partitioning structure for a multilayer video, wherein blocks each including a coding block and a transform block and constituting the same unit may be divided into different partitioning structures.
In a decoder for multi-layer video,
A partition structure usage determination unit that determines whether to use the reference partition structure information when segmenting the video of the enhancement layer based on the reference partition structure information for each unit for splitting the video of the reference layer;
A corresponding position structure selection unit that determines a unit of dividing the video of the reference layer at a position corresponding to a unit of dividing the video of the enhancement layer according to the determination of using the reference division structure information of the division structure use determining unit; And
Inter-layer partitioning structure for multi-layer video including a partition structure prediction level selector which selects whether to refer to a coding unit and split structure information of a transform unit among the units for splitting the video of the reference layer and transmits it to the inverse quantizer or the prediction unit Prediction device.
The method of claim 12,
The division structure use determination unit
And split flag information corresponding to the determination that the reference split structure information is not used is transmitted to the inverse quantization unit.
The method of claim 12,
The unit for dividing the video of the enhancement layer and the unit for dividing the video of the reference layer are
An apparatus for predicting an inter-layer partitioning structure for multilayer video, characterized by using a prediction block and a transform block as a minimum unit and a sequence as a maximum unit.
The method of claim 12,
The partition structure prediction level selector
When selecting to refer to the partition structure information of the coding unit, the motion compensator and the intra prediction unit determine a prediction mode of the prediction unit of the enhancement layer based on the partition structure information of the coding unit. Adjust it to
The apparatus for predicting inter-layer partition structure for multilayer video according to claim 1, wherein the inverse quantization unit adjusts to perform inverse quantization based on the partition structure information of the transform unit.
The method of claim 12,
The partition structure prediction level selector
If the reference depth use flag of the coding unit is 1, the coding unit of the enhancement layer is divided into the same depth as the reference depth of the coding unit,
If the reference depth use flag of the coding unit is 0, split flag information is transmitted to the inverse quantizer and adjusted so that the inverse quantizer performs inverse quantization based on the partition structure information of the transform unit. An interlayer partition structure prediction apparatus for multilayer video.
The method of claim 12,
The coding unit and the transformation unit
An apparatus for predicting inter-layer partitioning structures for multilayer video, wherein the blocks each comprising a coding block and a transform block, and constituting the same unit may be divided into different partitioning structures.
KR1020130110448A 2012-09-17 2013-09-13 Device and method for inter-layer prediction of multi-layer video KR20140038319A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR20120102986 2012-09-17
KR1020120102986 2012-09-17

Publications (1)

Publication Number Publication Date
KR20140038319A true KR20140038319A (en) 2014-03-28

Family

ID=50646747

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020130110448A KR20140038319A (en) 2012-09-17 2013-09-13 Device and method for inter-layer prediction of multi-layer video

Country Status (1)

Country Link
KR (1) KR20140038319A (en)

Similar Documents

Publication Publication Date Title
US11388393B2 (en) Method for encoding video information and method for decoding video information, and apparatus using same
KR101962183B1 (en) Method for encoding/decoding an intra prediction mode and apparatus for the same
JP6874032B2 (en) Picture encoding / decoding method and equipment using this
AU2012267007B2 (en) Method and apparatus of scalable video coding
CA2909259C (en) Video encoding and decoding device and method in which the granularity of the quantization is controlled
US20150036743A1 (en) Interlayer prediction method and device for image signal
WO2013048033A1 (en) Method and apparatus for encoding/decoding intra prediction mode
US9167258B2 (en) Fast mode determining method and apparatus in scalable video coding
KR20140081681A (en) Video encoding method and apparatus using the same
KR102219841B1 (en) Method and Apparatus for Video Encoding and Video Decoding
KR20140038319A (en) Device and method for inter-layer prediction of multi-layer video
KR20140048806A (en) Apparatus and method for inter-layer prediction based on spatial resolution
KR101307406B1 (en) Encoding/decoding apparatus with reference frame compression
WO2014092434A2 (en) Video encoding method, video decoding method, and device using same
KR20140038316A (en) Apparatus and method for inter-layer prediction using deblocking filter
KR20140082915A (en) Devices and method for inter-layer encoding/decoding of scalable video
KR20140038323A (en) Apparatus and method for inter-layer reference of multi-layer video

Legal Events

Date Code Title Description
WITN Withdrawal due to no request for examination