WO2015072626A1 - Interlayer reference picture generation method and apparatus for multiple layer video coding - Google Patents


Info

Publication number
WO2015072626A1
Authority
WO
WIPO (PCT)
Prior art keywords
layer
reference layer
image
enhancement layer
inter
Prior art date
Application number
PCT/KR2014/001197
Other languages
French (fr)
Korean (ko)
Inventor
김경혜
조현호
심동규
유지우
Original Assignee
광운대학교 산학협력단
Priority date
Filing date
Publication date
Application filed by 광운대학교 산학협력단 (Kwangwoon University Industry-Academic Collaboration Foundation)
Publication of WO2015072626A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/187 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103 Selection of coding mode or of prediction mode
    • H04N19/105 Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136 Incoming video signal characteristics or properties
    • H04N19/137 Motion inside a coding unit, e.g. average field, frame or block difference
    • H04N19/139 Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H04N19/513 Processing of motion vectors

Definitions

  • the present invention relates to an image processing technique, and more particularly, to a method and apparatus for more effectively compressing an enhancement layer by using a reconstructed picture of a reference layer in inter-layer video coding.
  • Conventional video coding generally provides a service by encoding and decoding a single image at a resolution and bit rate suitable for the application.
  • Scalable Video Coding (SVC) can represent one or more temporal/spatial resolutions and quality levels, and Multi-view Video Coding (MVC) can express various viewpoints and depth information; standardization and related research on both are underway.
  • H.264/AVC, a video compression standard widely used in the market, also includes the SVC and MVC extension standards, and standardization of extensions to High Efficiency Video Coding (HEVC), which was finalized in January 2013, is underway.
  • In SVC, images having one or more temporal/spatial resolutions and image qualities may be coded with reference to one another, and in MVC, multiple images at different viewpoints may be coded with reference to one another.
  • Here, the coding of one such image is called a layer.
  • Conventional video coding performs encoding/decoding by referring to previously encoded/decoded information within a single image, whereas extended video coding performs encoding/decoding by referring not only to the current layer but also to other layers at different resolutions and/or different viewpoints.
  • Hierarchical or multi-view video data transmitted and decoded for various display environments should support compatibility with existing single layer and viewpoint systems as well as stereoscopic image display systems.
  • The concepts introduced for this purpose are the base layer (or reference layer) and the enhancement layer (or extended layer) in hierarchical video coding, and the base view (or reference view) and the enhancement view (or extended view) in multi-view video coding. If a bitstream is encoded using an HEVC-based hierarchical or multi-view video coding technique, at least one base layer/view or reference layer/view can be correctly decoded by an HEVC decoding apparatus in the decoding process of the corresponding bitstream.
  • An extended layer/view or enhancement layer/view is an image decoded by referring to information of another layer/view, so it can be correctly decoded only when the referenced layer/view is present and has already been decoded. Therefore, the decoding order must follow the coding order of the layer/view images.
  • The reason why the enhancement layer/view has a dependency on the reference layer/view is that encoding information or an image of the reference layer/view is used in the encoding process of the enhancement layer/view; this is referred to as inter-layer prediction in hierarchical video coding and as inter-view prediction in multi-view video coding.
  • By performing inter-layer/inter-view prediction, additional bit savings of about 20 to 30% can be achieved compared to general intra-picture prediction and inter-picture prediction.
  • Accordingly, research is in progress on how to use or refine the information of the reference layer/view that the enhancement layer/view references.
  • In other words, the enhancement layer may refer to a reconstructed picture of the reference layer, and if there is a difference in resolution between the reference layer and the enhancement layer, the reference layer picture may be upsampled before it is referenced.
  • An object of the present invention is to improve the up-sampled reference layer image by predicting inter-layer difference coefficients, in order to improve enhancement layer coding performance when the encoder/decoder of the enhancement layer refers to the reconstructed image of the reference layer.
  • Another object of the present invention is to provide a method and apparatus for predicting difference coefficients without applying an interpolation filter to the reconstructed images of the reference layer and the enhancement layer, by adjusting the motion information of the reference layer when predicting and encoding inter-layer difference coefficients.
  • The inter-layer reference picture generator includes an upsampling unit and an inter-layer reference picture enhancement unit.
  • The reference layer motion information limiter restricts the precision of the motion vector of the reference layer when predicting the inter-layer difference signal, thereby avoiding the application of an additional interpolation filter to the upsampled reference layer and enhancement layer pictures.
  • The reference layer motion information adjusting unit adjusts the precision of the motion vector of the reference layer when predicting the inter-layer difference signal for improving the inter-layer reference picture, so that inter-layer difference signal prediction can be performed without applying an additional interpolation filter to the reconstructed pictures of the reference layer and the enhancement layer.
  • FIG. 1 is a block diagram illustrating a configuration of a scalable video encoder.
  • FIG. 2 is a block diagram of an extended decoder according to an embodiment of the present invention.
  • FIG. 3 is a block diagram of an extension encoder according to an embodiment of the present invention.
  • FIG. 4 is a block diagram of an apparatus for upsampling a reconstructed frame of a reference layer in a scalable video encoder / decoder, enhancing an upsampled reference layer image, and using the same as a reference value of an enhancement layer.
  • FIG. 5 is a conceptual diagram illustrating a generalized residual prediction (GRP) for inter-layer difference coefficients according to an embodiment of the present invention.
  • FIG. 6 is a block diagram of an extended decoder according to an embodiment of the present invention.
  • FIG. 7 is a block diagram of an extension encoder according to an embodiment of the present invention.
  • FIG. 8A is a diagram illustrating the reference layer upsampling and enhancement operation of an extension encoder/decoder according to an embodiment of the present invention.
  • FIG. 8B is a diagram illustrating the operation of the motion information adjusting unit of an extension encoder/decoder according to an embodiment of the present invention.
  • FIG. 9 illustrates an example in which the motion information adjusting unit of an extension encoder/decoder according to an embodiment of the present invention maps a motion vector of the reference layer to integer pixels.
  • FIG. 10 is a diagram illustrating an example of a method of constructing an enhancement layer reference list in an extended encoder/decoder according to an embodiment of the present invention.
  • first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
  • the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
  • Each component shown in the embodiments of the present invention is shown independently to represent different characteristic functions; this does not mean that each component consists of separate hardware or a single software unit.
  • Each component is listed separately for convenience of description; at least two of the components may be combined into one component, or one component may be divided into a plurality of components, each performing part of the function.
  • Integrated and separated embodiments of the components are also included within the scope of the present invention without departing from the spirit of the invention.
  • Some components may not be essential components performing essential functions of the present invention, but optional components for improving performance.
  • The present invention can be implemented including only the components essential to its essence, excluding components used merely to improve performance, and a structure including only the essential components, excluding the optional performance-improving components, is also included in the scope of the present invention.
  • FIG. 1 is a block diagram illustrating a configuration of a scalable video encoder.
  • a scalable video encoder provides spatial scalability, temporal scalability, and SNR scalability.
  • For spatial scalability, multiple layers with upsampling are used, and for temporal scalability, a hierarchical B-picture structure is used.
  • For quality (SNR) scalability, either only the quantization parameter is changed, or a progressive coding method for the quantization error is used in the same manner as the spatial scalability technique.
  • Input video 110 is down sampled through spatial decimation 115.
  • the down-sampled image 120 is used as an input of the reference layer, and the coding blocks in the picture of the reference layer may be obtained through intra prediction using the intra prediction unit 135 or inter prediction using the motion compensation unit 130.
  • the difference coefficient which is a difference value between the original block to be encoded and the prediction block generated by the motion compensation unit 130 or the intra prediction unit 135, is discrete cosine transformed or integer transformed through the transform unit 140.
  • The transformed difference coefficients are quantized while passing through the quantization unit 145 and are then entropy coded by the entropy encoder 150.
  • the quantized transform difference coefficients are reconstructed back into differential coefficients through the inverse quantizer 152 and the inverse transform unit 154 to generate predicted values for use in adjacent blocks or adjacent pictures.
  • Due to the error introduced by the quantization unit 145, the reconstructed difference coefficient values may not match the difference coefficient values used as the input of the transform unit 140.
  • the reconstructed difference coefficient value is added to a prediction block previously generated by the motion compensator 130 or the intra predictor 135 to reconstruct the pixel value of the block currently encoded.
  • the reconstructed block passes through the in-loop filter 156. When all blocks in the picture are reconstructed, the reconstructed picture is input to the reconstructed picture buffer 158 and used for inter prediction in the reference layer.
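The predict/transform/quantize/reconstruct loop described above can be sketched in a few lines. This is only a toy illustration: the transform step is omitted, and `QSTEP` and all sample values are hypothetical, not taken from the patent. It shows why the reconstructed difference coefficients may not match the coefficients that entered the transform unit 140.

```python
# Toy sketch of the residual coding loop: form the difference coefficients,
# quantize/dequantize them, and reconstruct the block from the prediction.
# QSTEP and the sample values are illustrative assumptions only.

QSTEP = 8  # hypothetical quantization step size

def quantize(residual, qstep=QSTEP):
    return [round(r / qstep) for r in residual]

def dequantize(levels, qstep=QSTEP):
    return [l * qstep for l in levels]

original   = [23, -7, 14, 0, -31]   # original block (1-D for simplicity)
prediction = [20, -5, 9, 2, -28]    # from motion compensation / intra prediction

residual = [o - p for o, p in zip(original, prediction)]   # input to the transform
recon_residual = dequantize(quantize(residual))            # after inverse steps
reconstructed = [p + r for p, r in zip(prediction, recon_residual)]

print(residual)        # [3, -2, 5, -2, -3]
print(recon_residual)  # [0, 0, 8, 0, 0] -> quantization error
print(reconstructed)   # [20, -5, 17, 2, -28], not equal to the original block
```

The mismatch between `reconstructed` and `original` is exactly the quantization error the text refers to; the encoder uses the reconstructed values (not the originals) for later prediction so that it stays in sync with the decoder.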
  • For the enhancement layer, the input video 110 is used as the input and encoded.
  • As in the reference layer, prediction is performed by the motion compensator 172 or the intra predictor 170 in order to effectively encode the coding blocks in the picture.
  • Inter or intra prediction is performed and an optimal prediction block is generated.
  • The block to be encoded in the enhancement layer is predicted from the prediction block generated by the motion compensator 172 or the intra predictor 170, and as a result a difference coefficient is generated in the enhancement layer.
  • the difference coefficients of the enhancement layer are encoded through the transform unit, the quantization unit, and the entropy encoding unit like the reference layer.
  • Encoded bits are generated in each layer.
  • The multiplexer 180 combines the bitstreams of the layers into one single bitstream 185.
  • Although each of the multiple layers in FIG. 1 may be encoded independently, the input video of the lower layer is down-sampled from the video of the upper layer and therefore has very similar characteristics. Accordingly, when the reconstructed pixel values, motion vectors, and the like of the lower layer video are used in the enhancement layer, coding efficiency can be increased.
  • The motion compensation unit 172 of the enhancement layer may interpolate the image 164 reconstructed in the reference layer according to the image size of the enhancement layer and use it as a reference image.
  • a method of decoding the reference image in units of frames and a method of decoding in units of blocks may be used in consideration of a reduction in complexity.
  • the image 164 reconstructed in the reference layer is input to the motion compensation unit 172 of the enhancement layer, thereby improving the coding efficiency in the enhancement layer.
  • the motion information 162 of the reference layer may be upsampled through the upsampling unit 160 according to the enhancement layer resolution, and then referred to when motion information is encoded by the motion compensation unit 172 of the enhancement layer.
  • The extended decoder includes decoders for both the reference layer 200 and the enhancement layer 210.
  • the reference layer 200 and the enhancement layer 210 may be one or multiple depending on the number of layers of the SVC.
  • The decoder 200 of the reference layer has a structure similar to a general video decoder: an entropy decoder 201, an inverse quantizer 202, an inverse transformer 203, a motion compensator 204, an intra prediction unit 205, a loop filter unit 206, a reconstructed image buffer 207, and the like.
  • the entropy decoding unit 201 receives an extracted bitstream of the reference layer through the demultiplexer unit 224 and then performs an entropy decoding process.
  • the quantized coefficient values reconstructed through the entropy decoding process are inversely quantized by the inverse quantizer 202.
  • The inverse-quantized coefficient values are restored to difference coefficients through the inverse transform unit 203.
  • the decoder of the reference layer performs motion compensation through the motion compensation unit 204.
  • the reference layer motion compensation unit 204 performs motion compensation after performing interpolation according to the precision of a motion vector.
  • When the coding block of the reference layer has been encoded through intra prediction, the decoder generates a prediction value through the intra prediction unit 205.
  • the intra prediction unit 205 generates a prediction value from the reconstructed neighboring pixel values in the current frame according to the intra prediction mode.
  • the difference coefficient reconstructed in the reference layer and the predicted value are added to each other to generate a reconstructed value.
  • the reconstructed frame is stored in the reconstructed image buffer 207 after passing through the loop filter unit 206 and used as a predicted value in the inter prediction of the next frame.
  • The extended decoder including the reference layer and the enhancement layer decodes the image of the reference layer and uses it for prediction in the motion compensation unit 214 and the intra prediction unit 215 of the enhancement layer.
  • the upsampling unit 221 upsamples the picture and motion information 223 reconstructed in the reference layer according to the resolution of the enhancement layer.
  • the motion vector included in the motion information 223 may be used in the original form or in the compressed form.
  • the upsampled image 225 may be used as a reference image by the motion compensator 214 of the enhancement layer.
  • the enhanced inter-layer reference image 226 may be used as a reference image in the motion compensation unit 214 of the enhancement layer.
  • the bitstream input to the extended decoder is input to the entropy decoding unit 211 of the enhancement layer through the demultiplexer 224 to perform bitstream parsing according to the syntax structure of the enhancement layer.
  • a reconstructed differential image is generated through the inverse quantization unit 212 and the inverse transform unit 213, which is further added to the prediction image acquired by the motion compensation unit 214 or the intra prediction unit 215 of the enhancement layer.
  • the reconstructed image is stored in the reconstructed image buffer 217 via the loop filter 216 and used in the predictive image generation process by the motion compensator 214 of frames continuously positioned in the enhancement layer.
  • FIG. 3 is a block diagram of an extension encoder according to an embodiment of the present invention.
  • The scalable video encoder downsamples the input video 300 through spatial decimation 310 and then uses the downsampled video 320 as the input of the video encoder of the reference layer.
  • Video input to the reference layer video encoder is predicted in an intra or inter mode in units of coding blocks in the reference layer.
  • The difference image, which is the difference between the original block and the prediction block, is transformed and quantized through the transform unit 330 and the quantizer 335.
  • the quantized difference coefficients are expressed in bits in units of syntax elements through the entropy encoder 340.
  • the encoder for the enhancement layer uses input video 300 as input.
  • the input video is predicted through the intra predictor 360 or the motion compensator 370 in units of coding blocks in the enhancement layer.
  • The difference image, which is the difference between the original block and the prediction block, undergoes transform coding and quantization through the transformer 371 and the quantizer 372.
  • the quantized difference coefficients are expressed in bits in units of syntax elements through the entropy encoder 373.
  • the bitstreams encoded in the reference layer and the enhancement layer are composed of a single bitstream 385 through the multiplexer 380.
  • the motion compensation unit 370 of the enhancement layer encoder may generate a prediction value by using the reconstructed picture of the reference layer.
  • the reconstructed reference layer picture is upsampled by the upsampling unit 350 according to the resolution of the enhancement layer, and the upsampled reference layer image 355 is used by the motion compensator 370.
  • the motion compensation unit 370 of the enhancement layer may upsample the motion information 345 of the reference layer by the upsampling unit 350 to use the reference information when encoding the motion vector.
  • motion vector information compressed in the reference layer may be used.
  • the enhanced inter-layer reference image 395 may be used as a reference image in the motion compensator 370 of the enhancement layer.
  • FIG. 4 is a block diagram of an apparatus for upsampling and improving a reconstructed picture of a reference layer in a scalable video encoder / decoder.
  • The apparatus includes a reconstructed picture buffer 401 of the reference layer, an N-fold upsampling unit 402, an inter-layer reference picture enhancement unit 403, and an inter-layer reference picture buffer 404.
  • the reference layer reconstructed picture buffer 401 is a buffer that stores a reconstructed picture of the reference layer.
  • the reconstructed image of the reference layer should be upsampled to a size corresponding to the image size of the enhancement layer, and the upsampling is performed through the N-fold upsampling unit 402.
  • the upsampled image of the reference layer is enhanced by the inter-layer reference image enhancer 403 and then stored in the inter-layer reference image buffer 404 of the enhancement layer.
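As a rough illustration of the N-fold upsampling step (402), the sketch below performs nearest-neighbour upsampling of a small reconstructed reference-layer picture to the enhancement-layer size. Actual scalable codecs use normative interpolation filters for this step; the function, the factor, and the pixel values here are illustrative assumptions only.

```python
# Nearest-neighbour N-fold upsampling sketch (illustrative, not the normative
# SHVC resampling filter). The picture is a 2-D list of luma samples.

def upsample_nearest(picture, n):
    """Upsample a 2-D list of pixels by an integer factor n in each dimension."""
    out = []
    for row in picture:
        up_row = [px for px in row for _ in range(n)]      # repeat each pixel n times
        out.extend([list(up_row) for _ in range(n)])       # repeat each row n times
    return out

ref = [[10, 20],
       [30, 40]]            # tiny reconstructed reference-layer "picture"
enh_size_ref = upsample_nearest(ref, 2)
print(enh_size_ref)
# [[10, 10, 20, 20], [10, 10, 20, 20], [30, 30, 40, 40], [30, 30, 40, 40]]
```

The upsampled picture would then be passed to the inter-layer reference image enhancer 403 before being stored in the inter-layer reference picture buffer 404.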
  • FIG. 5 is a conceptual diagram illustrating a generalized residual prediction (GRP) technique for improving an inter-layer reference picture according to an embodiment of the present invention.
  • block 530 of a corresponding position of an upsampled reference layer may be selected as a prediction block.
  • A difference coefficient is predicted using the motion information 510 of the reference layer block 530 at the position corresponding to the enhancement layer block currently being coded, together with the reconstructed images of the enhancement layer and the reference layer.
  • In this way, the inter-layer reference picture is improved.
  • compressed motion vector information may be used in the reference layer, or uncompressed original motion information may be used.
  • The difference coefficient 560 is calculated as the difference between the prediction block 520 in the enhancement layer reconstructed image, generated using the upsampled motion information 510 of the reference layer, and the prediction block 550 in the upsampled reference layer reconstructed image.
  • the final prediction block 570 of the enhancement layer may be generated by adding the generated difference coefficient 560 and the reference layer block 530, and the difference coefficient 560 may be multiplied by a weight.
  • The weight may be selected as 0, 0.5, 1, or the like.
  • When the motion information of the reference layer indicates bidirectional prediction, the GRP calculates the prediction block 580 of the enhancement layer using a weighted sum of the reference layer block 530 and the average of the difference coefficient in the L0 direction and the difference coefficient in the L1 direction.
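The unidirectional GRP computation amounts to: final prediction = co-located upsampled reference-layer block (530) + weight * (motion-compensated block 520 in the enhancement-layer reconstruction - motion-compensated block 550 in the upsampled reference-layer reconstruction). A minimal sketch, with block fetching and motion compensation abstracted into plain lists and all sample values hypothetical:

```python
# GRP prediction sketch: the inter-layer difference coefficient (560) is the
# difference between the two motion-compensated blocks; it is weighted and
# added to the co-located upsampled reference-layer block (530).

def grp_predict(ref_block_530, enh_pred_520, refup_pred_550, weight=1.0):
    diff_560 = [e - r for e, r in zip(enh_pred_520, refup_pred_550)]
    return [c + weight * d for c, d in zip(ref_block_530, diff_560)]

ref_block  = [100, 102, 98, 101]  # co-located upsampled reference-layer block
enh_pred   = [104, 103, 99, 105]  # MC block in enhancement-layer reconstruction
refup_pred = [101, 101, 97, 102]  # MC block in upsampled ref-layer reconstruction

for w in (0, 0.5, 1):             # the weight may be chosen as 0, 0.5 or 1
    print(w, grp_predict(ref_block, enh_pred, refup_pred, w))
```

With weight 0 the prediction falls back to the plain upsampled reference-layer block; with weight 1 the full predicted difference coefficient is applied.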
  • FIG. 6 is a block diagram of an extended decoder according to an embodiment of the present invention.
  • a single bitstream input to the scalable video decoder configures a bitstream for each layer through the demultiplexer 624.
  • the bitstream for the reference layer is entropy decoded through the entropy decoding unit 601 of the reference layer.
  • the entropy decoded difference coefficient is decoded into a difference coefficient after passing through the inverse quantization unit 602 and the inverse transform unit 603.
  • the coding block decoded in the reference layer generates a predictive block through the motion compensator 604 or the intra predictor 605, which is added to the difference coefficient to decode the block.
  • the decoded image is filtered through the in-loop filter 606 and then stored in the reconstructed image buffer 607 of the reference layer.
  • the bitstream of the enhancement layer extracted through the demultiplexer 624 is entropy decoded by the entropy decoding unit 611 of the enhancement layer.
  • the entropy-decoded difference coefficient is decoded into the difference coefficient after passing through the inverse quantization unit 612 and the inverse transform unit 613.
  • the coding block decoded in the enhancement layer generates a prediction block through the motion compensation unit 614 or the intra prediction unit 615 of the enhancement layer, and the prediction block is added to the difference coefficient to decode the block.
  • the decoded image is filtered through the in-loop filter 616 and then stored in the reconstructed image buffer 617 of the enhancement layer.
  • The extended decoder upsamples the image and motion information of the reference layer, derives the difference coefficient from the reconstructed images of the reference layer and the enhancement layer using the motion vector of the reference layer, adds the derived difference coefficient value to the reference layer block, and uses the result as a prediction value.
  • motion vector information compressed in the reference layer may be used.
  • the upsampling unit 621 performs upsampling according to the resolution of the image of the enhancement layer by using the reconstructed image of the reference layer.
  • The motion information adjusting unit 625 adjusts the precision of the reference layer motion vector to integer pixels in order to use the motion vector information of the reference layer in the GRP.
  • The inter-layer reference image enhancer 622 receives the coding block 530 at the same position as the coding block 500 of the enhancement layer from the reconstructed picture buffer of the reference layer, and receives the motion vector adjusted to integer units through the motion information adjuster 625.
  • Using the motion vector adjusted to integer units, the blocks for generating the difference coefficients are compensated in the upsampled image and in the reconstructed image of the enhancement layer.
  • The difference between the two compensated prediction blocks is added to the co-located coding block 530 to generate the prediction image 627 to be used in the enhancement layer.
  • FIG. 7 is a block diagram of an extension encoder according to an embodiment of the present invention.
  • The scalable video encoder downsamples the input video 700 through spatial decimation 715 and then uses the downsampled video 710 as the input of the video encoder of the reference layer.
  • Video input to the reference layer video encoder is predicted in an intra or inter mode in units of coding blocks in the reference layer.
  • The difference image, which is the difference between the original block and the prediction block, undergoes transform coding and quantization through the transform unit 730 and the quantization unit 732.
  • the quantized difference coefficients are expressed in bits in units of syntax elements through the entropy encoder 734.
  • the encoder for the enhancement layer uses input video 700 as input.
  • the input video is predicted through the intra predictor 760 or the motion compensator 765 in units of coding blocks in the enhancement layer.
  • The difference image, which is the difference between the original block and the prediction block, is transformed and quantized through the transform unit 770 and the quantizer 772.
  • the quantized difference coefficients are expressed in bits in units of syntax elements through the entropy encoder 774.
  • the bitstreams encoded in the reference layer and the enhancement layer consist of a single bitstream 785 through the multiplexer 780.
  • The difference coefficient is derived from the reconstructed pictures of the reference layer and the enhancement layer by using the motion vector of the reference layer, and the derived difference coefficient value is added to the reference layer block and used as a prediction value of the enhancement layer.
  • motion vector information compressed in the reference layer may be used.
  • the upsampling unit 750 upsamples the reconstructed image of the reference layer to the resolution of the enhancement layer image.
  • the motion information adjusting unit 794 adjusts the precision of the upsampled motion vector to integer units in order to use the motion vector information of the reference layer in GRP.
  • the inter-layer reference image enhancer 790 receives, from the reconstructed picture buffer of the reference layer, the coding block 530 co-located with the coding block 500 of the enhancement layer, and receives a motion vector adjusted to integer units through the motion information adjuster 794.
  • using the reference layer motion vector adjusted to integer units, the blocks for generating the difference coefficient are motion-compensated in the upsampled reference layer image and the reconstructed image of the enhancement layer.
  • the inter-layer prediction image 792 to be used in the enhancement layer is generated by adding the difference between the two compensated prediction blocks to the coding block 530 of the reference layer co-located with the coding block 500 of the enhancement layer.
  • FIG. 8 is a diagram illustrating the operation of the motion information adjusting unit of an extended encoder/decoder according to an embodiment of the present invention.
  • the motion information adjusting units 625 and 794 of the extended encoder/decoder adjust the precision of the upsampled reference layer motion vector to an integer position for GRP.
  • GRP derives the difference coefficients from the reference layer and the enhancement layer using the motion vector of the reference layer.
  • in general, the reference picture must be interpolated according to the precision of the motion vector.
  • by adjusting the motion vector to an integer position, however, no interpolation needs to be performed on the reconstructed images of the reference layer and the enhancement layer.
  • the motion information adjusting units 625 and 794 determine whether the motion vector of the reference layer is already at an integer position (810). If it is, no additional motion vector adjustment is performed. If it is not at an integer position, it is mapped 811 to integer pixels so that the motion vector of the reference layer can be used in GRP.
  • FIG. 9 illustrates an example in which the motion information adjusting unit of an extended encoder/decoder according to an embodiment of the present invention maps the motion vector of the reference layer to integer pixels.
  • the motion vector of the reference layer may be located at integer positions 900, 905, 910, and 915 or at a non-integer position 920.
  • in GRP, by mapping the motion vector of the reference layer to integer pixels, the process of interpolating the reconstructed images of the reference layer and the enhancement layer when generating difference coefficients from them can be omitted. If the motion vector of the reference layer corresponds to the non-integer position 920, the motion vector is adjusted to the integer pixel position 900 located at the left-top side of the non-integer position, and the adjusted motion vector is then used for GRP.
  • FIG. 10 is a diagram illustrating a configuration of an enhancement layer reference list of an extended encoder / decoder according to an embodiment of the present invention.
  • the reference layer picture 1010 may be upsampled to fit the enhancement layer, producing reference layer picture A 1020, and enhanced by the inter-layer reference picture enhancer 622 or 790, producing reference layer picture B 1030; these pictures may be used to construct the reference picture list of the enhancement layer.
  • the reference lists L0 and L1 may be configured 1040 using only reference layer picture A 1020, or may be configured 1050 by adding reference layer picture A 1020 to L0 and reference layer picture B 1030 to L1.
  • the reference list configuration 1060 of the enhancement layer may also be formed by adding reference layer picture B 1030 to reference list L0 and reference layer picture A 1020 to reference list L1.
  • Reference layer picture A 1020 and reference layer picture B 1030 added to the reference list may be used to encode an enhancement layer.
  • the method according to the present invention described above may be produced as a program for execution on a computer and stored in a computer-readable recording medium; examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, floppy disks, and optical data storage devices, and also include implementations in the form of carrier waves (e.g., transmission over the Internet).
  • the computer readable recording medium can be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
  • functional programs, codes, and code segments for implementing the method can be easily inferred by programmers in the art to which the present invention belongs.
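The GRP flow described in the bullets above — map the reference layer motion vector to the integer pixel at its left-top side, motion-compensate blocks in both the upsampled reference layer image and the enhancement layer reconstruction without interpolation, and add the resulting difference to the co-located reference layer block — can be sketched as follows. This is an illustrative sketch, not the patented implementation: it assumes quarter-pel motion vector units, uses 1-D sample rows, and all function names are hypothetical.

```python
def to_integer_mv(mv_qpel):
    """Map a quarter-pel motion vector component to the integer position
    at its left-top side (arithmetic shift = floor division by 4)."""
    return mv_qpel >> 2

def compensate(image, pos, mv_int, size):
    """Copy `size` samples starting at pos + mv_int; no interpolation is
    needed because the motion vector was adjusted to an integer position."""
    start = pos + mv_int
    return image[start:start + size]

def grp_prediction(up_ref, enh_recon, colocated_ref_block, pos, mv_qpel, size):
    """Generalized residual prediction: derive the difference coefficient
    from the reference and enhancement layer reconstructions and add it
    to the co-located reference layer block."""
    mv_int = to_integer_mv(mv_qpel)
    ref_pred = compensate(up_ref, pos, mv_int, size)
    enh_pred = compensate(enh_recon, pos, mv_int, size)
    diff = [e - r for e, r in zip(enh_pred, ref_pred)]
    return [c + d for c, d in zip(colocated_ref_block, diff)]
```

Here `to_integer_mv` realizes the left-top mapping of FIG. 9: the arithmetic right shift floors toward the smaller integer position, so already-integer vectors (multiples of 4 in quarter-pel units) pass through unchanged.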

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention, when an enhancement layer in an SVC decoder refers to a reconstructed picture of a reference layer, adjusts and limits the motion vector of the reference layer to an integer pixel position when deriving the difference coefficients of the reference layer using that motion vector in the GRP process, thereby making it possible to generate difference coefficients without performing additional interpolation on the reference layer picture and the reconstructed picture of the enhancement layer.

Description

Method and apparatus for generating an inter-layer reference picture for multi-layer video coding
The present invention relates to image processing technology, and more particularly, to a method and apparatus for compressing an enhancement layer more effectively by using a reconstructed picture of a reference layer in inter-layer video coding.
Conventional video coding generally encodes and decodes a single picture size, resolution, and bit rate suited to the application. With the development of multimedia, standardization and related research have been conducted on scalable video coding (SVC), a video coding technology that supports various spatial and temporal resolutions and picture qualities for diverse application environments, and on multi-view video coding (MVC), which can represent multiple viewpoints and depth information. MVC and SVC are collectively referred to as extended video encoding/decoding.
H.264/AVC, the video compression standard most widely used in the market today, also includes the SVC and MVC extended video standards, and standardization of extended video technology is also under way for High Efficiency Video Coding (HEVC), whose standardization was completed in January 2013.
SVC can code pictures having one or more temporal/spatial resolutions and qualities with reference to one another, and MVC can code multiple pictures from several viewpoints with reference to one another. Here, the coding of one such video is called a layer. Conventional video coding can encode/decode only by referring to previously encoded/decoded information within a single video, whereas extended video encoding/decoding can be performed by referring not only to the current layer but also to other layers of different resolutions and/or different viewpoints.
Hierarchical or multi-view video data transmitted and decoded for various display environments should support compatibility with existing single-layer, single-view systems as well as stereoscopic display systems. The concepts introduced for this purpose are the base layer (or reference layer) and the enhancement layer (or extended layer) in hierarchical video coding, and the base view (or reference view) and the enhancement view (or extended view) in multi-view video coding. If a bitstream has been encoded with an HEVC-based hierarchical or multi-view video coding technique, at least one base/reference layer or view of that bitstream can be decoded correctly by an HEVC decoding apparatus. In contrast, an enhancement layer/view is decoded by referring to the information of another layer/view, and can be decoded correctly only after the referenced layer/view information is available and the corresponding picture has been decoded. Therefore, the decoding order must follow the coding order of each layer/view picture.
An enhancement layer/view depends on its reference layer/view because the coding information or pictures of the reference layer/view are used in encoding the enhancement layer/view; this is called inter-layer prediction in hierarchical video coding and inter-view prediction in multi-view video coding. Performing inter-layer/inter-view prediction enables additional bit savings of about 20-30% compared with ordinary intra and inter prediction alone, and research is in progress on how the enhancement layer/view should use or refine the reference layer/view information. In hierarchical video coding, when referencing between layers, the enhancement layer may refer to the reconstructed picture of the reference layer, and if the reference layer and the enhancement layer differ in resolution, the reference layer picture is upsampled before being referenced.
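As a toy illustration of the upsampling step mentioned above (the reference layer reconstruction resampled to the enhancement layer resolution before being referenced), the sketch below uses nearest-neighbor sampling. Standardized codecs use multi-tap interpolation filters for this; the function name and the 2-D list representation are assumptions for the example only.

```python
def upsample_nearest(ref_picture, scale):
    """Upsample a 2-D reference layer reconstruction by an integer factor
    using nearest-neighbor sampling (illustrative; real SVC/SHVC codecs
    use longer interpolation filters)."""
    height = len(ref_picture)
    width = len(ref_picture[0])
    return [[ref_picture[y // scale][x // scale]
             for x in range(width * scale)]
            for y in range(height * scale)]
```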
An object of the present invention is to enhance the upsampled reference layer picture through inter-layer difference coefficient prediction, in order to improve enhancement layer coding performance when the encoder/decoder of the enhancement layer refers to the reconstructed picture of the reference layer.
Another object of the present invention is to provide a method and apparatus for predicting difference coefficients without applying an interpolation filter to the reconstructed pictures of the reference layer and the enhancement layer, by adjusting the motion information of the reference layer when predictively coding the inter-layer difference coefficients.
An inter-layer reference picture generator according to an embodiment of the present invention includes an upsampling unit and an inter-layer reference picture enhancement unit.
A reference layer motion information limiter according to an embodiment of the present invention limits the precision of the reference layer motion vector when predicting the inter-layer difference signal, so that no additional interpolation filter is applied to the upsampled reference layer and enhancement layer pictures.
According to an embodiment of the present invention, the reference layer motion information adjusting unit adjusts the precision of the reference layer motion vector when predicting the inter-layer difference signal for inter-layer reference picture enhancement, so that the inter-layer difference signal can be predicted without applying an additional interpolation filter to the reconstructed pictures of the reference layer and the enhancement layer.
FIG. 1 is a block diagram illustrating the configuration of a scalable video encoder.
FIG. 2 is a block diagram of an extended decoder according to an embodiment of the present invention.
FIG. 3 is a block diagram of an extended encoder according to an embodiment of the present invention.
FIG. 4 is a block diagram of an apparatus that upsamples a reconstructed frame of the reference layer in a scalable video encoder/decoder, enhances the upsampled reference layer picture, and uses it as a reference for the enhancement layer.
FIG. 5 is a conceptual diagram illustrating generalized residual prediction (GRP) of inter-layer difference coefficients related to an embodiment of the present invention.
FIG. 6 is a block diagram of an extended decoder according to an embodiment of the present invention.
FIG. 7 is a block diagram of an extended encoder according to an embodiment of the present invention.
FIG. 8A is a diagram illustrating the reference layer upsampling and enhancement operation of an extended encoder/decoder according to an embodiment of the present invention.
FIG. 8B is a diagram illustrating the operation of the motion information adjusting unit of an extended encoder/decoder according to an embodiment of the present invention.
FIG. 9 illustrates an embodiment in which the motion information adjusting unit of an extended encoder/decoder according to an embodiment of the present invention maps a reference layer motion vector to integer pixels.
FIG. 10 is a diagram illustrating an example of a method of constructing an enhancement layer reference list in an extended encoder/decoder according to an embodiment of the present invention.
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In describing the embodiments herein, when it is determined that a detailed description of a related well-known configuration or function may obscure the gist of this specification, that detailed description is omitted.
When a component is said to be "connected" or "coupled" to another component, it may be directly connected or coupled to that other component, but it should be understood that another component may be present in between. In addition, the statement that the present invention "includes" a specific configuration does not exclude other configurations; additional configurations may be included within the practice of the present invention or the scope of its technical idea.
Terms such as first and second may be used to describe various components, but the components should not be limited by these terms. The terms are used only to distinguish one component from another. For example, without departing from the scope of the present invention, a first component may be named a second component, and similarly a second component may be named a first component.
The components shown in the embodiments of the present invention are depicted independently to represent distinct characteristic functions; this does not mean that each component consists of separate hardware or a single software unit. That is, the components are listed separately for convenience of description: at least two components may be combined into one, or one component may be divided into several components that perform its functions. Embodiments with such integrated or separated components are also included within the scope of the present invention without departing from its essence.
In addition, some components may not be essential for performing the essential functions of the present invention but may be optional components merely for improving performance. The present invention may be implemented with only the components essential to its essence, excluding those used merely for performance improvement, and a structure including only those essential components is also within the scope of the present invention.
FIG. 1 is a block diagram illustrating the configuration of a scalable video encoder.
Referring to FIG. 1, a scalable video encoder provides spatial scalability, temporal scalability, and SNR (quality) scalability. Spatial scalability uses a multi-layer scheme with upsampling, and temporal scalability uses a hierarchical B-picture structure. For quality scalability, either only the quantization parameter is changed in the same scheme used for spatial scalability, or progressive coding of the quantization error is used.
The input video 110 is downsampled through spatial decimation 115. The downsampled video 120 is used as the input of the reference layer, and the coding blocks in a reference layer picture are effectively encoded through intra prediction in the intra predictor 135 or inter prediction in the motion compensator 130. The difference coefficient, the difference between the original block to be encoded and the prediction block generated by the motion compensator 130 or the intra predictor 135, is discrete-cosine or integer transformed by the transform unit 140. The transformed difference coefficients are quantized by the quantizer 145, and the quantized transform coefficients are entropy-coded by the entropy encoder 150. To generate prediction values for use in adjacent blocks or adjacent pictures, the quantized transform coefficients are restored to difference coefficients through the inverse quantizer 152 and the inverse transform unit 154. Because of the error introduced by the quantizer 145, the restored difference coefficient values may not match the difference coefficient values that entered the transform unit 140. The restored difference coefficient values are added to the prediction block generated earlier by the motion compensator 130 or the intra predictor 135 to reconstruct the pixel values of the block being encoded. The reconstructed block passes through the in-loop filter 156, and when all blocks of the picture have been reconstructed, the reconstructed picture is stored in the reconstructed picture buffer 158 and used for inter prediction in the reference layer.
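The encode-and-reconstruct loop described above can be reduced to the following sketch. The transform step is omitted for brevity (identity transform), a scalar quantizer stands in for the quantizer 145 / inverse quantizer 152 pair, and all names and values are illustrative assumptions. The point is that the reconstruction is built from the dequantized residual, so it can differ from the original by exactly the quantization error, keeping encoder and decoder in sync.

```python
def quantize(residual, qstep):
    """Scalar quantization of residual samples (a real encoder applies a
    DCT/integer transform first; omitted here for brevity)."""
    return [round(r / qstep) for r in residual]

def dequantize(levels, qstep):
    """Inverse scalar quantization; generally lossy."""
    return [l * qstep for l in levels]

def encode_block(original, prediction, qstep):
    """One block of the encode/reconstruct loop: the quantized levels go
    to the entropy coder, the reconstruction feeds later prediction."""
    residual = [o - p for o, p in zip(original, prediction)]
    levels = quantize(residual, qstep)
    recon_residual = dequantize(levels, qstep)  # may differ from `residual`
    reconstruction = [p + r for p, r in zip(prediction, recon_residual)]
    return levels, reconstruction
```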
The enhancement layer encodes the input video 110 directly as its input. As in the reference layer, to effectively encode the coding blocks in a picture, inter or intra prediction is performed through the motion compensator 172 or the intra predictor 170 and an optimal prediction block is generated. A block to be encoded in the enhancement layer is predicted from the prediction block generated by the motion compensator 172 or the intra predictor 170, producing the enhancement layer difference coefficients. The difference coefficients of the enhancement layer, like those of the reference layer, are encoded through a transform unit, a quantizer, and an entropy encoder. In the multi-layer structure of FIG. 1, encoded bits are produced in each layer, and the multiplexer 180 assembles them into one single bitstream 185.
Although each of the layers in FIG. 1 can be encoded independently, the input video of the lower layer is downsampled from the video of the upper layer and therefore has very similar characteristics. Coding efficiency can thus be increased by using the reconstructed pixel values, motion vectors, and so on of the lower layer in the enhancement layer.
In FIG. 1, the inter prediction 172 of the enhancement layer may, after the reference layer picture has been reconstructed, interpolate the reconstructed picture 164 to the picture size of the enhancement layer and use it as a reference picture. When reconstructing the reference layer picture, either frame-level or block-level decoding of the reference picture may be used in consideration of complexity reduction. The picture 164 reconstructed in the reference layer is input to the motion compensator 172 of the enhancement layer, which improves coding efficiency in the enhancement layer.
In FIG. 1, the motion information 162 of the reference layer may be upsampled to the enhancement layer resolution by the upsampling unit 160 and then referenced when the motion compensator 172 of the enhancement layer encodes motion information.
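Upsampling a reference layer motion vector to the enhancement layer resolution amounts to scaling each component by the resolution ratio. The fixed-point arithmetic below is one common way to do this and is an illustrative assumption, not the scheme mandated by this patent:

```python
def scale_mv(mv, ref_size, enh_size):
    """Scale a reference layer motion vector (x, y) to the enhancement
    layer resolution using an 8-bit fixed-point scale factor; the
    rounding offset follows common practice, not a specific standard."""
    sx = (enh_size[0] << 8) // ref_size[0]  # horizontal scale, Q8 fixed point
    sy = (enh_size[1] << 8) // ref_size[1]  # vertical scale, Q8 fixed point
    return ((mv[0] * sx + 128) >> 8, (mv[1] * sy + 128) >> 8)
```

For example, with a 2x spatial ratio (960x540 reference, 1920x1080 enhancement), each motion vector component is doubled.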
FIG. 2 is a block diagram of an extended decoder according to an embodiment of the present invention. The extended decoder includes decoders for both the reference layer 200 and the enhancement layer 210. There may be one or several reference layers 200 and enhancement layers 210 depending on the number of SVC layers. The reference layer decoder 200 has the same structure as a general video decoder and may include an entropy decoder 201, an inverse quantizer 202, an inverse transform unit 203, a motion compensator 204, an intra predictor 205, a loop filter unit 206, a reconstructed picture buffer 207, and so on. The entropy decoder 201 receives the bitstream extracted for the reference layer through the demultiplexer 224 and performs entropy decoding. The quantized coefficient values restored by entropy decoding are inverse-quantized by the inverse quantizer 202. The inverse-quantized coefficient values are restored to difference coefficients (residuals) through the inverse transform unit 203. In generating the prediction value for a reference layer coding block, if the block was coded with inter prediction, the reference layer decoder performs motion compensation through the motion compensator 204. In general, the reference layer motion compensator 204 performs interpolation according to the precision of the motion vector and then performs motion compensation. If the reference layer coding block was coded with intra prediction, the decoder generates the prediction value through the intra predictor 205, which derives it from the reconstructed neighboring pixel values in the current frame according to the intra prediction mode. The difference coefficients and prediction values restored in the reference layer are added together to produce the reconstruction. The reconstructed frame passes through the loop filter unit 206, is stored in the reconstructed picture buffer 207, and is used as a prediction reference in inter prediction of the next frame.
The extended decoder including the reference layer and the enhancement layer decodes the reference layer picture and then uses it as a prediction reference in the motion compensator 214 and the intra predictor 215 of the enhancement layer. To this end, the upsampling unit 221 upsamples the picture and motion information 223 reconstructed in the reference layer to the resolution of the enhancement layer. The motion vectors included in the motion information 223 may be used in original or compressed form. The upsampled picture 225 may be used as a reference picture by the motion compensator 214 of the enhancement layer. Alternatively, the upsampled picture may first be enhanced by the inter-layer reference picture enhancement unit 222, and the enhanced inter-layer reference picture 226 may then be used as a reference picture by the motion compensator 214 of the enhancement layer.
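Conceptually, the upsampled picture 225 and the enhanced inter-layer reference picture 226 are simply additional entries appended to the enhancement layer's reference picture lists, as FIG. 10 later illustrates for lists L0 and L1. A minimal sketch, where the list layout and names are assumptions showing one possible configuration among several:

```python
def build_reference_lists(temporal_refs, ilr_a, ilr_b=None):
    """Append inter-layer reference pictures to the enhancement layer
    reference lists: picture A (upsampled) to L0 and picture B
    (enhanced) to L1 when available, else A to both (illustrative)."""
    list0 = temporal_refs + [ilr_a]
    list1 = temporal_refs + ([ilr_b] if ilr_b is not None else [ilr_a])
    return list0, list1
```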
The bitstream input to the extended decoder is fed through the demultiplexer 224 to the entropy decoder 211 of the enhancement layer, which parses the bitstream according to the enhancement layer syntax structure. A restored difference picture is then generated through the inverse quantizer 212 and the inverse transform unit 213 and is added to the prediction picture obtained from the motion compensator 214 or the intra predictor 215 of the enhancement layer. The reconstructed picture passes through the loop filter unit 216, is stored in the reconstructed picture buffer 217, and is used by the motion compensator 214 in generating prediction pictures for subsequent frames of the enhancement layer.
도 3은 본 발명의 일 실시 예에 따른 확장 부호화기의 블록도이다. 3 is a block diagram of an extension encoder according to an embodiment of the present invention.
도 3을 참조하면, 스케일러블 비디오 인코더는 입력 비디오(300)를 Spatial Decimation(310)을 통하여 다운 샘플링한 후 다운 샘플링된 비디오(320)를 참조 계층의 비디오 인코더의 입력으로 사용한다. 참조 계층 비디오 인코더에 입력된 비디오는 참조 계층에서 코딩 블록 단위로 인트라 또는 인터 모드로 예측된다. 원본 블록과 코딩 블록의 차이인 차분 영상은 변환부(330), 양자화부(335)를 거치면서 변환 부호화 및 양자화 과정을 거친다. 양자화된 차분 계수들은 엔트로피 부호화부(340)를 통해서 각 신택스 요소 단위로 비트로 표현된다. Referring to FIG. 3, the scalable video encoder downsamples the input video 300 through spatial decimation 310 and then uses the downsampled video 320 as the input to the video encoder of the reference layer. Video input to the reference layer video encoder is predicted in intra or inter mode in units of coding blocks in the reference layer. The difference image, which is the difference between the original block and the coding block, undergoes transform coding and quantization through the transform unit 330 and the quantization unit 335. The quantized difference coefficients are expressed as bits in units of syntax elements through the entropy encoding unit 340.
향상 계층을 위한 인코더는 입력 비디오(300)를 입력으로 사용한다. 입력 된 비디오는 향상 계층에서 코딩 블록 단위로 인트라 예측부(360) 또는 움직임 보상부(370)를 통해 예측된다. 원본 블록과 코딩 블록의 차이인 차분 영상은 변환부(371), 양자화부(372)를 거치면서 변환 부호화 및 양자화 과정을 거친다. 양자화된 차분 계수들은 엔트로피 부호화부(373)를 통해서 각 신택스 요소 단위로 비트로 표현된다. 참조 계층과 향상 계층에서 인코딩된 비트스트림은 멀티플렉서(380)를 통해서 단일의 비트스트림(385)으로 구성된다. The encoder for the enhancement layer uses the input video 300 as its input. The input video is predicted through the intra prediction unit 360 or the motion compensation unit 370 in units of coding blocks in the enhancement layer. The difference image, which is the difference between the original block and the coding block, undergoes transform coding and quantization through the transform unit 371 and the quantization unit 372. The quantized difference coefficients are expressed as bits in units of syntax elements through the entropy encoding unit 373. The bitstreams encoded in the reference layer and the enhancement layer are combined into a single bitstream 385 by the multiplexer 380.
향상 계층 인코더의 움직임 보상부(370)는 참조 계층의 복원된 픽쳐를 사용하여 예측 값을 생성할 수 있다. 이러한 경우에 복원된 참조 계층의 픽쳐를 업 샘플링 수행부(350)에서 향상 계층의 해상도에 맞춰 업 샘플링하고, 업 샘플링 된 참조 계층 영상(355)을 움직임 보상부(370)에서 사용한다. 또한 향상 계층의 움직임 보상부(370)는 움직임 벡터를 부호화 할 때 참조 계층의 움직임 정보(345)를 업 샘플링 수행부(350)에서 업 샘플링 하여 참조 정보로 사용할 수 있다. 참조 계층의 움직임 정보(345)를 사용할 때 참조 계층에서 압축된 움직임 벡터 정보를 사용할 수도 있다. 업 샘플링된 영상은 계층 간 참조 영상 향상부(390)를 통해 향상된 후, 향상된 계층 간 참조 영상(395)은 향상 계층의 움직임 보상부(370)에서 참조 영상으로 사용될 수도 있다. The motion compensation unit 370 of the enhancement layer encoder may generate a prediction value using the reconstructed picture of the reference layer. In this case, the reconstructed reference layer picture is upsampled by the upsampling unit 350 to match the resolution of the enhancement layer, and the upsampled reference layer image 355 is used by the motion compensation unit 370. In addition, when encoding a motion vector, the motion compensation unit 370 of the enhancement layer may upsample the motion information 345 of the reference layer in the upsampling unit 350 and use it as reference information. When the motion information 345 of the reference layer is used, the motion vector information compressed in the reference layer may also be used. After the upsampled image is enhanced by the inter-layer reference image enhancement unit 390, the enhanced inter-layer reference image 395 may be used as a reference image by the motion compensation unit 370 of the enhancement layer.
도 4는 스케일러블 비디오 부/복호화기에서 참조 계층의 복원 영상을 업 샘플링 하고 향상시키는 장치의 블록도이다.4 is a block diagram of an apparatus for upsampling and improving a reconstructed picture of a reference layer in a scalable video encoder / decoder.
도 4를 참조하면, 해당 장치는 참조 계층의 복원 영상 버퍼(401), N배 업 샘플링 수행부(402), 계층 간 참조 영상 향상부(403), 계층 간 참조 영상 버퍼(404)를 포함한다. Referring to FIG. 4, the apparatus includes a reconstructed picture buffer 401 of the reference layer, an N-fold upsampling unit 402, an inter-layer reference picture enhancement unit 403, and an inter-layer reference picture buffer 404.
참조 계층 복원 영상 버퍼(401)은 참조 계층의 복원 영상을 저장하는 버퍼이다. 향상 계층에서 참조 계층의 영상을 사용하기 위하여 참조 계층의 복원 영상은 향상 계층의 영상 크기에 준하는 크기로 업 샘플링 되어야 하는데, N배 업 샘플링 수행부(402)를 통해 업 샘플링이 수행된다. 업 샘플링 된 참조 계층의 영상은 계층 간 참조 영상 향상부(403)에서 향상 된 후 향상 계층의 계층 간 참조 영상 버퍼(404)에 저장된다.The reference layer reconstructed picture buffer 401 is a buffer that stores a reconstructed picture of the reference layer. In order to use the image of the reference layer in the enhancement layer, the reconstructed image of the reference layer should be upsampled to a size corresponding to the image size of the enhancement layer, and the upsampling is performed through the N-fold upsampling unit 402. The upsampled image of the reference layer is enhanced by the inter-layer reference image enhancer 403 and then stored in the inter-layer reference image buffer 404 of the enhancement layer.
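The upsampling step described above can be sketched as follows. This is a minimal illustration, not the codec's normative procedure: it uses nearest-neighbour replication for an integer factor N, whereas an actual scalable codec applies interpolation filters. The function and variable names are hypothetical.

```python
import numpy as np

def upsample_n(reference_picture, n):
    """Enlarge a reference-layer picture by an integer factor n using
    nearest-neighbour replication (a stand-in for the codec's filters)."""
    return np.repeat(np.repeat(reference_picture, n, axis=0), n, axis=1)

# A 2x2 reference-layer picture upsampled x2 to the enhancement-layer size,
# ready to be stored in the inter-layer reference picture buffer.
base = np.array([[10, 20],
                 [30, 40]], dtype=np.uint8)
up = upsample_n(base, 2)   # 4x4 picture at enhancement-layer resolution
```

Each reference-layer sample is simply repeated n times horizontally and vertically, so the output matches the enhancement layer's dimensions for integer scaling ratios.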
도 5는 본 발명의 일 실시 예와 관련된 계층 간 참조 영상 향상을 위한 계층 간 차분 계수 예측 기술 (generalized residual prediction; GRP)을 설명하기 위한 개념도이다.FIG. 5 is a conceptual diagram illustrating a generalized residual prediction (GRP) technique for improving an inter-layer reference picture according to an embodiment of the present invention.
도 5를 참조하면, 스케일러블 비디오 인코더에서 향상 계층의 블록(500)을 코딩할 때, 업 샘플링 된 참조 계층의 대응되는 위치의 블록(530)을 예측 블록으로 선택할 수 있다.Referring to FIG. 5, when coding a block 500 of an enhancement layer in a scalable video encoder, block 530 of a corresponding position of an upsampled reference layer may be selected as a prediction block.
GRP에서는 현재 코딩중인 향상 계층 블록에 대해 대응되는 위치에 존재하는 참조 계층 블록(530)의 움직임 정보(510)와 향상 계층 및 참조 계층의 복원 영상을 이용하여 차분 계수를 예측하고, 이를 참조 계층 블록(530)에 더해줌으로써 참조 계층 영상을 향상시킨다. 참조 계층의 움직임 정보(510)를 사용할 때 참조 계층에서 압축된 움직임 벡터 정보를 사용할 수도 있고, 압축되지 않은 원본 움직임 정보를 사용할 수도 있다. 차분 계수(560)는 참조 계층의 업 샘플링 된 움직임 정보(510)을 이용하여 생성된 향상 계층 복원 영상 내의 예측 블록(520)과 업 샘플링 된 참조 계층 복원 영상 내의 예측 블록(550)의 차로 계산한다. 향상 계층의 최종 예측 블록(570)은 생성된 차분 계수(560)와 참조 계층 블록(530)을 더함으로써 생성할 수 있으며, 차분 계수(560)에는 가중치가 곱해질 수 있다. 이때 가중치의 계수는 0, 0.5, 1 등이 선택될 수 있다. In GRP, a difference coefficient is predicted using the motion information 510 of the reference layer block 530 located at the position corresponding to the enhancement layer block currently being coded, together with the reconstructed images of the enhancement layer and the reference layer, and the predicted difference coefficient is added to the reference layer block 530, thereby enhancing the reference layer picture. When the motion information 510 of the reference layer is used, either the motion vector information compressed in the reference layer or the uncompressed original motion information may be used. The difference coefficient 560 is calculated as the difference between the prediction block 520 in the enhancement layer reconstructed image, generated using the upsampled motion information 510 of the reference layer, and the prediction block 550 in the upsampled reference layer reconstructed image. The final prediction block 570 of the enhancement layer may be generated by adding the generated difference coefficient 560 to the reference layer block 530, and the difference coefficient 560 may be multiplied by a weight. The weight coefficient may be selected from 0, 0.5, 1, and the like.
GRP에서 참조 계층의 움직임 정보가 양방향 예측인 경우에는 향상 계층의 예측 블록(580)을 계산하기 위하여 참조 계층의 블록(530)과 L0 방향으로의 차분 계수와 L1 방향으로의 차분 계수의 평균 값에 대한 가중치 합을 이용한다. In GRP, when the motion information of the reference layer is bidirectional prediction, the prediction block 580 of the enhancement layer is computed as a weighted sum of the reference layer block 530 and the average of the difference coefficient in the L0 direction and the difference coefficient in the L1 direction.
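The GRP computation in the two paragraphs above — the difference coefficient, its weighting, and the bidirectional average — can be sketched as follows. This is a simplified sketch under assumed inputs (blocks are small NumPy arrays that have already been motion-compensated and upsampled); the function names are hypothetical, and a real codec's rounding and clipping rules are omitted.

```python
import numpy as np

def grp_block(ref_block, el_pred, rl_pred, weight):
    """Uni-directional GRP: final prediction (570) = co-located block (530)
    + weight * (enhancement-layer prediction block (520)
                - up-sampled reference-layer prediction block (550))."""
    diff = el_pred.astype(np.int32) - rl_pred.astype(np.int32)        # 560
    return ref_block.astype(np.int32) + (weight * diff).astype(np.int32)

def grp_block_bi(ref_block, el_l0, rl_l0, el_l1, rl_l1, weight):
    """Bi-directional GRP: block (530) plus the weighted average of the
    L0- and L1-direction difference coefficients (prediction 580)."""
    diff_l0 = el_l0.astype(np.int32) - rl_l0.astype(np.int32)
    diff_l1 = el_l1.astype(np.int32) - rl_l1.astype(np.int32)
    avg = (diff_l0 + diff_l1) / 2.0
    return ref_block.astype(np.int32) + (weight * avg).astype(np.int32)
```

With weight 0 the prediction degenerates to the plain upsampled reference-layer block, while weights 0.5 and 1 blend in half or all of the predicted residual, mirroring the selectable weight coefficients mentioned above.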
도 6은 본 발명의 이 실시 예 에 따른 확장 복호화기의 블록도이다. 6 is a block diagram of an extended decoder according to this embodiment of the present invention.
도 6을 참조하면, 스케일러블 비디오 디코더로 입력된 단일 비트스트림은 디멀티플렉서(624)를 통해서 각 계층을 위한 비트스트림으로 구성된다. 참조 계층을 위한 비트스트림은 참조 계층의 엔트로피 복호화부(601)를 통해서 엔트로피 복호화된다. 엔트로피 복호화된 차분 계수는 역양자화부(602)와 역변환부(603)를 거친 후 차분 계수로 복호화된다. 참조 계층에서 복호화하는 코딩 블록은 움직임 보상부(604) 또는 인트라 예측부(605)를 통해 예측 블록을 생성하며 이 예측 블록은 차분 계수와 더해져 블록을 복호화한다. 복호된 영상은 인-루프 필터(606)를 통해 필터링 된 후 참조 계층의 복원 영상 버퍼(607)에 저장된다. Referring to FIG. 6, a single bitstream input to the scalable video decoder is demultiplexed into a bitstream for each layer through the demultiplexer 624. The bitstream for the reference layer is entropy decoded by the entropy decoding unit 601 of the reference layer. The entropy decoded data passes through the inverse quantization unit 602 and the inverse transform unit 603 and is decoded into difference coefficients. For a coding block decoded in the reference layer, a prediction block is generated through the motion compensation unit 604 or the intra prediction unit 605, and this prediction block is added to the difference coefficients to decode the block. The decoded image is filtered through the in-loop filter 606 and then stored in the reconstructed image buffer 607 of the reference layer.
디멀티플렉서(624)를 통해서 추출된 향상 계층의 비트스트림은 향상 계층의 엔트로피 복호화부(611)를 통해서 엔트로피 복호화된다. 엔트로피 복호화된 차분 계수는 역양자화부(612)와 역변환부(613)를 거친 후 차분 계수로 복호화된다. 향상 계층에서 복호화하는 코딩 블록은 향상 계층의 움직임 보상부(614) 또는 인트라 예측부(615)를 통해 예측 블록을 생성하며 이 예측 블록은 차분 계수와 더해져 블록을 복호화한다. 복호된 영상은 인-루프 필터(616)를 통해 필터링 된 후 향상 계층의 복원 영상 버퍼(617)에 저장된다.The bitstream of the enhancement layer extracted through the demultiplexer 624 is entropy decoded by the entropy decoding unit 611 of the enhancement layer. The entropy-decoded difference coefficient is decoded into the difference coefficient after passing through the inverse quantization unit 612 and the inverse transform unit 613. The coding block decoded in the enhancement layer generates a prediction block through the motion compensation unit 614 or the intra prediction unit 615 of the enhancement layer, and the prediction block is added to the difference coefficient to decode the block. The decoded image is filtered through the in-loop filter 616 and then stored in the reconstructed image buffer 617 of the enhancement layer.
향상 계층에서 GRP 기술을 사용하는 경우 참조 계층의 영상과 움직임 정보를 업 샘플링한 후 참조 계층의 움직임 벡터를 사용하여 참조 계층 및 향상 계층 복원 영상에서 차분 계수를 유도하고, 유도 된 차분 계수 값을 참조 계층에 더하여 이를 예측 값으로 사용한다. 참조 계층의 움직임 정보(623)를 사용할 때 참조 계층에서 압축된 움직임 벡터 정보를 사용할 수도 있다. 업 샘플링 수행부(621)에서는 참조 계층의 복원 영상을 사용하여 향상 계층의 영상의 해상도에 맞춰 업 샘플링을 수행한다. 움직임 정보 조정부(625)에서는 GRP에서 참조 계층의 움직임 벡터 정보를 사용하기 위하여 참조 계층 움직임 벡터의 정밀도를 정수 픽셀 단위로 조정한다. 계층 간 참조 영상 향상부(622)에서는 참조 계층의 복원 픽쳐 버퍼에서 향상 계층의 코딩 블록(500)과 동일 위치의 코딩 블록(530)을 입력 받고 움직임 정보 조정부(625)를 통해서 정수 단위로 조정된 움직임 벡터를 입력 받는다. 정수 단위로 조정된 움직임 벡터를 사용하여 업 샘플링 수행부(621)에서 업샘플링 된 영상과 향상 계층의 복원 영상에서 차분 계수 생성을 위한 블록을 보상한다. 보상 된 두 예측 블록의 차와 향상 계층의 코딩 블록(500)과 동일 위치의 코딩 블록(530)를 더해줌으로써 향상 계층에서 사용할 예측 영상(627)을 생성한다. When the GRP technique is used in the enhancement layer, the image and motion information of the reference layer are upsampled, difference coefficients are derived from the reconstructed images of the reference layer and the enhancement layer using the motion vector of the reference layer, and the derived difference coefficient values are added to the reference layer to be used as a prediction value. When the motion information 623 of the reference layer is used, the motion vector information compressed in the reference layer may be used. The upsampling unit 621 upsamples the reconstructed image of the reference layer to match the resolution of the enhancement layer image. The motion information adjustment unit 625 adjusts the precision of the reference layer motion vector to integer pixel units so that the motion vector information of the reference layer can be used in GRP. The inter-layer reference image enhancement unit 622 receives, from the reconstructed picture buffer of the reference layer, the coding block 530 co-located with the coding block 500 of the enhancement layer, and receives the motion vector adjusted to integer units through the motion information adjustment unit 625. Using the motion vector adjusted to integer units, blocks for generating the difference coefficients are compensated in the image upsampled by the upsampling unit 621 and in the reconstructed image of the enhancement layer. The prediction image 627 to be used in the enhancement layer is generated by adding the difference between the two compensated prediction blocks to the coding block 530 co-located with the coding block 500 of the enhancement layer.
도 7은 본 발명의 이 실시 예에 따른 확장 부호화기의 블록도이다. 7 is a block diagram of an extension encoder according to this embodiment of the present invention.
도 7을 참조하면, 스케일러블 비디오 인코더는 입력 비디오(700)를 Spatial Decimation(715)을 통하여 다운 샘플링한 후 다운 샘플링된 비디오(710)를 참조 계층의 비디오 인코더의 입력으로 사용한다. 참조 계층 비디오 인코더에 입력된 비디오는 참조 계층에서 코딩 블록 단위로 인트라 또는 인터 모드로 예측된다. 원본 블록과 코딩 블록의 차이인 차분 영상은 변환부(730), 양자화부(732)를 거치면서 변환 부호화 및 양자화 과정을 거친다. 양자화된 차분 계수들은 엔트로피 부호화부(734)를 통해서 각 신택스 요소 단위로 비트로 표현된다. Referring to FIG. 7, the scalable video encoder downsamples the input video 700 through spatial decimation 715 and then uses the downsampled video 710 as the input to the video encoder of the reference layer. Video input to the reference layer video encoder is predicted in intra or inter mode in units of coding blocks in the reference layer. The difference image, which is the difference between the original block and the coding block, undergoes transform coding and quantization through the transform unit 730 and the quantization unit 732. The quantized difference coefficients are expressed as bits in units of syntax elements through the entropy encoding unit 734.
향상 계층을 위한 인코더는 입력 비디오(700)를 입력으로 사용한다. 입력 된 비디오는 향상 계층에서 코딩 블록 단위로 인트라 예측부(760) 또는 움직임 보상부(765)를 통해 예측된다. 원본 블록과 코딩 블록의 차이인 차분 영상은 변환부(770), 양자화부(772)를 거치면서 변환 부호화 및 양자화 과정을 거친다. 양자화된 차분 계수들은 엔트로피 부호화부(774)를 통해서 각 신택스 요소 단위로 비트로 표현된다. 참조 계층과 향상 계층에서 인코딩된 비트스트림은 멀티플렉서(780)를 통해서 단일의 비트스트림(785)으로 구성된다. The encoder for the enhancement layer uses the input video 700 as its input. The input video is predicted through the intra prediction unit 760 or the motion compensation unit 765 in units of coding blocks in the enhancement layer. The difference image, which is the difference between the original block and the coding block, undergoes transform coding and quantization through the transform unit 770 and the quantization unit 772. The quantized difference coefficients are expressed as bits in units of syntax elements through the entropy encoding unit 774. The bitstreams encoded in the reference layer and the enhancement layer are combined into a single bitstream 785 by the multiplexer 780.
GRP 기술에서는 참조 계층의 복원 영상 및 움직임 정보(752)를 업 샘플링한 후 참조 계층의 움직임 벡터를 사용하여 참조 계층 및 향상 계층의 복원 영상에서 차분 계수를 유도하고, 유도 된 차분 계수 값을 참조 계층 블록에 더하여 향상 계층의 예측 값으로 사용한다. 참조 계층의 움직임 정보(752)를 사용할 때 참조 계층에서 압축된 움직임 벡터 정보를 사용할 수도 있다. 업 샘플링 수행부(750)에서는 참조 계층의 복원 영상을 사용하여 향상 계층의 영상의 해상도에 맞춰 업 샘플링을 수행한다. 움직임 정보 조정부(794)에서는 GRP에서 참조 계층의 움직임 벡터 정보를 사용하기 위하여 업 샘플링 된 움직임 벡터의 정밀도를 정수 픽셀 단위로 조정한다. 계층 간 참조 영상 향상부(790)에서는 참조 계층의 복원 픽쳐 버퍼에서 향상 계층의 코딩 블록(500)과 동일 위치의 코딩 블록(530)을 입력 받고 움직임 정보 조정부(794)를 통해서 정수 단위로 조정된 움직임 벡터를 입력 받는다. 정수 단위로 조정된 참조 계층의 움직임 벡터를 사용하여 업 샘플링 수행부(750)에서 업 샘플링 된 참조 계층 영상과 향상 계층의 복원 영상에서 차분 계수 생성을 위한 블록을 보상한다. 보상된 두 예측 블록의 차와 향상 계층의 코딩 블록(500)과 동일한 위치에 있는 참조 계층의 코딩 블록(530)을 더해줌으로써 향상 계층에서 사용할 계층 간 예측 영상(792)을 생성한다. In the GRP technique, the reconstructed image and motion information 752 of the reference layer are upsampled, difference coefficients are derived from the reconstructed images of the reference layer and the enhancement layer using the motion vector of the reference layer, and the derived difference coefficient values are added to the reference layer block to be used as the prediction value of the enhancement layer. When the motion information 752 of the reference layer is used, the motion vector information compressed in the reference layer may be used. The upsampling unit 750 upsamples the reconstructed image of the reference layer to match the resolution of the enhancement layer image. The motion information adjustment unit 794 adjusts the precision of the upsampled motion vector to integer pixel units so that the motion vector information of the reference layer can be used in GRP. The inter-layer reference image enhancement unit 790 receives, from the reconstructed picture buffer of the reference layer, the coding block 530 co-located with the coding block 500 of the enhancement layer, and receives the motion vector adjusted to integer units through the motion information adjustment unit 794. Using the reference layer motion vector adjusted to integer units, blocks for generating the difference coefficients are compensated in the reference layer image upsampled by the upsampling unit 750 and in the reconstructed image of the enhancement layer. The inter-layer prediction image 792 to be used in the enhancement layer is generated by adding the difference between the two compensated prediction blocks to the coding block 530 of the reference layer co-located with the coding block 500 of the enhancement layer.
도 8은 본 발명의 일 실시 예에 따른 확장 부/복호화기의 움직임 정보 조정부의 동작을 설명하는 도면이다. 8 is a diagram illustrating the operation of the motion information adjustment unit of an extended encoder/decoder according to an embodiment of the present invention.
도 8a를 참조하면, 본 발명의 이 실시 예에 따른 확장 부/복호화기의 움직임 정보 조정부(625, 794)는 GRP를 위해서 참조 계층의 업 샘플링 된 움직임 벡터의 정밀도를 정수 위치로 조정한다. GRP에서는 참조 계층의 움직임 벡터를 사용하여 참조 계층 및 향상 계층에서 차분 계수를 유도하는데 이러한 경우에 참조 영상은 움직임 벡터의 정밀도에 맞게 보간 되어야 한다. 본 발명의 일 실시 예에 따른 확장 부/복호화기에서는 GRP에서 참조 계층의 움직임 벡터를 사용할 때 움직임 벡터를 정수 위치로 조정함으로써 참조 계층 및 향상 계층의 복원 영상에서 보간을 수행하지 않도록 한다. Referring to FIG. 8A, the motion information adjustment units 625 and 794 of the extended encoder/decoder according to this embodiment adjust the precision of the upsampled motion vector of the reference layer to an integer position for GRP. GRP derives the difference coefficients in the reference layer and the enhancement layer using the motion vector of the reference layer; in this case, the reference picture would normally have to be interpolated to match the precision of the motion vector. In the extended encoder/decoder according to an embodiment of the present invention, when the motion vector of the reference layer is used in GRP, the motion vector is adjusted to an integer position so that no interpolation is performed on the reconstructed images of the reference layer and the enhancement layer.
도 8b를 참조하면, 움직임 정보 조정부(625, 794)는 참조 계층의 움직임 벡터가 이미 정수 위치에 있는지를 판단한다(810). 참조 계층의 움직임 벡터가 이미 정수 위치에 있는 경우에는 추가적인 움직임 벡터의 조정이 수행되지 않는다. 참조 계층의 움직임 벡터가 정수 위치가 아닌 경우에는 참조 계층의 움직임 벡터가 GRP에서 사용될 수 있도록 정수 화소로의 매핑(811)이 수행된다. Referring to FIG. 8B, the motion information adjustment units 625 and 794 determine whether the motion vector of the reference layer is already at an integer position (810). If the motion vector of the reference layer is already at an integer position, no additional motion vector adjustment is performed. If the motion vector of the reference layer is not at an integer position, mapping 811 to an integer pixel is performed so that the motion vector of the reference layer can be used in GRP.
도 9는 본 발명의 이 실시 예에 따른 확장 부/복호화기의 움직임 정보 조정부가 향상 계층의 움직임 벡터를 정수 화소로 매핑하는 실시 예에 대한 것이다. FIG. 9 illustrates an example in which a motion information adjusting unit of an extension / decoder according to an embodiment of the present invention maps a motion vector of an enhancement layer to integer pixels.
도 9를 참조하면, 향상 계층의 움직임 벡터는 정수 위치 (900, 905, 910, 915)에 위치하거나 비 정수 위치 (920)에 위치할 수 있다. GRP에서 참조 계층의 움직임 벡터를 사용하여 참조 계층 및 향상 계층의 복원 영상에서 차분 계수를 생성하고자 할 때 참조 계층의 움직임 벡터를 정수 화소로 매핑하여 사용함으로써 참조 계층 및 향상 계층 복원 영상을 보간하는 과정을 생략할 수 있다. 참조 계층의 움직임 벡터가 비 정수 위치(920)에 해당하는 경우 해당 비 정수 위치의 픽셀의 좌-상에 위치하는 정수 화소 위치 (900)로 움직임 벡터를 조정한 후 조정된 움직임 벡터를 GRP에 사용한다. Referring to FIG. 9, the motion vector of the enhancement layer may be located at an integer position 900, 905, 910, 915 or at a non-integer position 920. When GRP generates difference coefficients from the reconstructed images of the reference layer and the enhancement layer using the motion vector of the reference layer, the process of interpolating the reference layer and enhancement layer reconstructed images can be omitted by mapping the motion vector of the reference layer to an integer pixel. If the motion vector of the reference layer corresponds to the non-integer position 920, the motion vector is adjusted to the integer pixel position 900 located at the top-left of the pixel at the non-integer position, and the adjusted motion vector is then used for GRP.
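The mapping just described — snapping a fractional motion vector to the integer sample at its top-left — can be sketched as follows. This assumes motion vectors stored in quarter-sample units (as in HEVC-style codecs); the function name is hypothetical.

```python
def map_to_integer_pel(mv, frac_units=4):
    """Snap a fractional motion vector to the integer-pel sample at its
    top-left, so GRP needs no interpolation of the reconstructed images.
    Python's floor division rounds negative components down (left/up),
    which matches the 'top-left integer position' rule of Fig. 9."""
    mvx, mvy = mv
    return ((mvx // frac_units) * frac_units,
            (mvy // frac_units) * frac_units)

map_to_integer_pel((8, 4))    # already integer-pel: unchanged
map_to_integer_pel((5, -3))   # fractional: snapped to the top-left sample
```

A vector already on an integer position passes through unchanged, mirroring the decision 810/811 of Fig. 8B.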
도 10은 본 발명의 일 실시 예에 따른 확장 부/복호화기의 향상 계층 참조 리스트 구성을 설명하는 도면이다.FIG. 10 is a diagram illustrating a configuration of an enhancement layer reference list of an extended encoder / decoder according to an embodiment of the present invention.
도 10을 참조하면, 향상 계층에 맞게 업 샘플링 된 참조 계층 영상A(1020)와 계층 간 참조 영상 향상부(622, 790)를 통해 향상된 참조 계층 영상B(1030)는 참조 계층 영상(1010)으로부터 생성되며, 향상 계층의 참조 영상 리스트를 구성하는 데 사용될 수 있다. 참조 계층 영상A(1020)만을 이용하여 참조 리스트 L0, L1을 구성(1040)할 수 있으며, 참조 계층 영상A(1020)는 L0에, 참조 계층 영상B(1030)는 L1에 추가함으로써 참조 영상 리스트를 구성(1050)할 수 있다. 또한 참조 리스트 L0에 참조 계층 영상B(1030), 참조 리스트 L1에 참조 계층 영상A(1020)를 추가함으로써 향상 계층의 참조 리스트 구성(1060)을 수행할 수 있다. 참조 리스트에 추가된 참조 계층 영상A(1020)와 참조 계층 영상B(1030)는 향상 계층을 부호화 하는 데 사용될 수 있다. Referring to FIG. 10, the reference layer picture A 1020, obtained by upsampling the reference layer picture 1010 to fit the enhancement layer, and the reference layer picture B 1030, enhanced through the inter-layer reference picture enhancement unit 622 or 790, may be used to construct the reference picture list of the enhancement layer. The reference lists L0 and L1 may be constructed 1040 using only reference layer picture A 1020, or the reference picture lists may be constructed 1050 by adding reference layer picture A 1020 to L0 and reference layer picture B 1030 to L1. Alternatively, the reference list construction 1060 of the enhancement layer may be performed by adding reference layer picture B 1030 to reference list L0 and reference layer picture A 1020 to reference list L1. Reference layer picture A 1020 and reference layer picture B 1030 added to the reference lists may be used to encode the enhancement layer.
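The three list configurations of Fig. 10 can be sketched as follows. Pictures are represented here by plain strings for illustration, and the helper name and the use of the figure's reference numerals as mode selectors are hypothetical.

```python
def build_enh_layer_lists(temporal_refs, pic_a, pic_b, config):
    """Append the upsampled reference-layer picture A and the enhanced
    picture B to the enhancement layer's L0/L1 reference lists,
    following one of the configurations 1040/1050/1060 of Fig. 10."""
    l0, l1 = list(temporal_refs), list(temporal_refs)
    if config == 1040:        # only picture A, in both lists
        l0.append(pic_a); l1.append(pic_a)
    elif config == 1050:      # A in L0, B in L1
        l0.append(pic_a); l1.append(pic_b)
    elif config == 1060:      # B in L0, A in L1
        l0.append(pic_b); l1.append(pic_a)
    return l0, l1

l0, l1 = build_enh_layer_lists(["t0"], "ilrA", "ilrB", 1050)
```

The temporal reference pictures of the enhancement layer come first in each list; the inter-layer pictures are appended at the end, which is one common placement, though the patent text does not fix the insertion position.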
상술한 본 발명에 따른 방법은 컴퓨터에서 실행되기 위한 프로그램으로 제작되어 컴퓨터가 읽을 수 있는 기록 매체에 저장될 수 있으며, 컴퓨터가 읽을 수 있는 기록 매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플로피디스크, 광 데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. The method according to the present invention described above may be implemented as a program to be executed on a computer and stored in a computer-readable recording medium. Examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, floppy disks, and optical data storage devices, and also include implementations in the form of carrier waves (e.g., transmission over the Internet).
컴퓨터가 읽을 수 있는 기록 매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다. 그리고, 상기 방법을 구현하기 위한 기능적인(function) 프로그램, 코드 및 코드 세그먼트들은 본 발명이 속하는 기술분야의 프로그래머들에 의해 용이하게 추론될 수 있다.The computer readable recording medium can be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. In addition, functional programs, codes, and code segments for implementing the method can be easily inferred by programmers in the art to which the present invention belongs.
또한, 이상에서는 본 발명의 바람직한 실시 예에 대하여 도시하고 설명하였지만, 본 발명은 상술한 특정의 실시 예에 한정되지 아니하며, 청구범위에서 청구하는 본 발명의 요지를 벗어남이 없이 당해 발명이 속하는 기술분야에서 통상의 지식을 가진 자에 의해 다양한 변형 실시가 가능한 것은 물론이고, 이러한 변형 실시들은 본 발명의 기술적 사상이나 전망으로부터 개별적으로 이해 되어서는 안될 것이다. In addition, although preferred embodiments of the present invention have been shown and described above, the present invention is not limited to the specific embodiments described above; various modifications can of course be made by those of ordinary skill in the art to which the invention belongs without departing from the gist of the invention as claimed in the claims, and such modifications should not be understood separately from the technical spirit or prospect of the present invention.

Claims (2)

  1. 계층 간 참조 구조를 제공하는 부/복호화 방법에서, 계층 간 참조 영상을 생성함에 있어, 참조 계층의 움직임 정보를 이용한 계층 간 차분 계수 예측 시 움직임 벡터의 정밀도를 제한하여 계층 간 차분 계수 예측을 수행하는 방법. In an encoding/decoding method providing an inter-layer reference structure, a method of performing inter-layer difference coefficient prediction in generating an inter-layer reference picture, by limiting the precision of a motion vector when predicting inter-layer difference coefficients using motion information of a reference layer.
  2. 계층 간 참조 구조를 제공하는 부/복호화 방법에서, 계층 간 참조 영상을 참조하여 향상 계층의 참조 리스트를 구성함에 있어, 업 샘플링 된 참조 계층 복원 영상과 업 샘플링 및 향상 된 참조 계층 복원 영상을 이용해 향상 계층의 참조 리스트를 구성하는 방법. In an encoding/decoding method providing an inter-layer reference structure, a method of constructing a reference list of an enhancement layer with reference to inter-layer reference pictures, using an upsampled reference layer reconstructed picture and an upsampled and enhanced reference layer reconstructed picture.
PCT/KR2014/001197 2013-11-15 2014-02-13 Interlayer reference picture generation method and apparatus for multiple layer video coding WO2015072626A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2013-0138706 2013-11-15
KR1020130138706A KR20150056679A (en) 2013-11-15 2013-11-15 Apparatus and method for construction of inter-layer reference picture in multi-layer video coding

Publications (1)

Publication Number Publication Date
WO2015072626A1 true WO2015072626A1 (en) 2015-05-21

Family

ID=53057547

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2014/001197 WO2015072626A1 (en) 2013-11-15 2014-02-13 Interlayer reference picture generation method and apparatus for multiple layer video coding

Country Status (2)

Country Link
KR (1) KR20150056679A (en)
WO (1) WO2015072626A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109219958A (en) * 2016-08-22 2019-01-15 联发科技股份有限公司 The method for video coding and equipment of do not apply loop filtering to handle the reconstructed blocks for being located at picture material discontinuity edge and relevant video encoding/decoding method and equipment
CN114051137A (en) * 2021-10-13 2022-02-15 上海工程技术大学 Spatial scalable video coding method and decoding method

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10924747B2 (en) * 2017-02-27 2021-02-16 Apple Inc. Video coding techniques for multi-view video

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050190979A1 (en) * 2004-02-27 2005-09-01 Microsoft Corporation Barbell lifting for multi-layer wavelet coding
KR100703778B1 (en) * 2005-04-29 2007-04-06 삼성전자주식회사 Method and apparatus for coding video supporting fast FGS
JP5019054B2 (en) * 2005-04-27 2012-09-05 日本電気株式会社 Image decoding method, apparatus and program thereof
KR20130095282A (en) * 2010-09-24 2013-08-27 퀄컴 인코포레이티드 Coding stereo video data
WO2013160277A1 (en) * 2012-04-27 2013-10-31 Canon Kabushiki Kaisha A method, device, computer program, and information storage means for encoding and decoding an image comprising blocks of pixels


Also Published As

Publication number Publication date
KR20150056679A (en) 2015-05-27


Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14862795

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14862795

Country of ref document: EP

Kind code of ref document: A1