WO2015053525A1 - Procédé de codage et de décodage d'image et dispositif l'utilisant - Google Patents

Procédé de codage et de décodage d'image et dispositif l'utilisant Download PDF

Info

Publication number
WO2015053525A1
WO2015053525A1 PCT/KR2014/009409 KR2014009409W WO2015053525A1 WO 2015053525 A1 WO2015053525 A1 WO 2015053525A1 KR 2014009409 W KR2014009409 W KR 2014009409W WO 2015053525 A1 WO2015053525 A1 WO 2015053525A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
layer
output
target
decoding
Prior art date
Application number
PCT/KR2014/009409
Other languages
English (en)
Korean (ko)
Inventor
강정원
이하현
이진호
최진수
김진웅
Original Assignee
한국전자통신연구원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR20140031880A external-priority patent/KR20150042683A/ko
Priority claimed from KR1020140132543A external-priority patent/KR102246634B1/ko
Application filed by 한국전자통신연구원 filed Critical 한국전자통신연구원
Priority to US15/027,101 priority Critical patent/US10142638B2/en
Priority to CN201480064335.8A priority patent/CN105765978B/zh
Publication of WO2015053525A1 publication Critical patent/WO2015053525A1/fr
Priority to US16/157,217 priority patent/US10306244B2/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present invention relates to video encoding and decoding processing, and more particularly, to a video encoding / decoding method and apparatus for supporting bitstream random access.
  • an inter prediction technique for predicting pixel values included in a current picture from a previous and / or subsequent picture in time, and predicting pixel values included in a current picture using pixel information in the current picture.
  • An intra prediction technique an entropy encoding technique of allocating a short code to a symbol with a high frequency of appearance and a long code to a symbol with a low frequency of appearance may be used.
  • Video compression technology is a technology that provides a constant network bandwidth under a limited operating environment of hardware without considering a fluid network environment.
  • a new compression technique is required to compress image data applied to a network environment in which bandwidth changes frequently, and a scalable video encoding / decoding method may be used for this purpose.
  • the present invention provides a method for enabling decoding from an arbitrary point in a bitstream and an apparatus using the same.
  • an embodiment of the present invention includes an intra-random access point (IRAP) picture and a non-intra-random random access point picture (IRAP) picture or only an IRAP picture in an AU (access unit) at a random access point.
  • IRAP intra-random access point
  • IRAP non-intra-random random access point picture
  • the bitstream may be gradually decoded from a random access point.
  • a method of decoding a bitstream including a plurality of layers may include: setting a parameter for decoding the bitstream from a time point when the random access occurs when random access occurs; And decoding the picture to be decoded, and setting the parameter comprises setting an initialization flag indicating whether or not the picture to be decoded can be normally decoded and output when the random access occurs.
  • Steps Setting a flag variable (NoRaslOutputFlag) indicating whether the decoding target picture is used as a random access point; If the decoding target picture is an IRAP picture, the flag variable is 1, the initialization flag for the decoding target picture is 0, and the initialization flag for all the reference layers of the target layer is 1, for the target layer And resetting the initialization flag to 1.
  • NoRaslOutputFlag a flag variable indicating whether the decoding target picture is used as a random access point
  • the setting of the initialization flag may be set to 1 for a base layer of the access unit in which the random access has occurred and to 0 for a layer higher than the base layer.
  • the setting of the flag variable NoRaslOutputFlag may include setting the flag variable to 1 when the decoding target picture is an IDR picture or a BLA picture or an IRAP picture and is the first picture of the target layer of the corresponding bitstream.
  • the setting of the flag variable may include: an initialization flag set to a layer to which the target picture belongs, an initial flag set to 0, and an initialization flag for all reference layers of the layer to which the target picture belongs to 1
  • the flag variable may be set to one.
  • the method may further include performing motion prediction and motion compensation on the decoding target picture by using a reference picture in a reference picture list generated based on the reference picture set.
  • the determining of whether to output the decoding target picture may include determining that the decoding target picture is not output when the decoding target picture is an IRAP picture and the initialization flag value of the target layer is 0.
  • the determining of whether to output the decoding target picture includes determining that an output target picture belonging to the output layer is not output in an access unit to which the target layer belongs, rather than an output layer to which the target layer belongs. And the target layer is used as a reference of the output layer so that the decoded picture is used as a reference picture of the output target picture and the flag information indicating whether or not to output a picture signaled in the slice header is decoded with 1 is decoded. In the case of the highest picture among the pictures, it may be determined to output the decoding target picture.
  • the determining of whether to output the decoding target picture includes determining that an output target picture belonging to the output layer is not output in an access unit to which the target layer belongs, rather than an output layer to which the target layer belongs.
  • the target layer is used as a reference of the output layer such that the decoding target picture is used as a reference picture of the output target picture, the initialization flag value for the target layer is 1, and the decoding target picture is signaled in a slice header.
  • flag information indicating whether to output a picture is output is the highest picture among decoded pictures of 1, it may be determined that the decoding target picture is output.
  • the first access unit picture outputting step comprises: the bitstream being the highest layer to the highest layer. And output an output picture of the output layer when the base layer is an intra random access point (IRAP) picture from the base layer to an nth (0 ⁇ n) layer and an output layer belongs to the base layer to the nth layer.
  • the output layer is included in the top layer from the n + 1 th layer, the decoded picture of the n th layer may be output.
  • the output picture of the output layer is output.
  • a decoded picture of the k th layer or a picture determined to be output may be output.
  • the calculating of the POC may include setting the POC of the decoding target picture to 0 when the picture of the base layer of the same access unit is an IDR picture, and when the picture of the base layer is the BLA picture or the first CRA picture of the bitstream, POC_MSB of the decoding target picture may be set to zero.
  • a decoding apparatus of a bitstream when a random access occurs, includes setting parameters for decoding the bitstream from a time point when the random access occurs, and decoding a picture to be decoded.
  • a prediction unit which decodes a signal, and when the random access occurs, the prediction unit sets an initialization flag indicating whether a target picture to be decoded can be decoded and output normally, and the decoding target picture is randomly accessed.
  • NoRaslOutputFlag a flag variable indicating whether or not it is used as a point, wherein the decoding target picture is an IRAP picture, the flag variable is 1, the initialization flag for the decoding target picture is 0, and all of the target layers If the initialization flag for the reference layer is 1, the The initialization flag for the layer may be reset to one.
  • a method and an apparatus using the same are provided so that decoding is possible from any point in the bitstream.
  • an intra random access point (IRAP) picture and a non-intra random random access point picture (IRAP) are present in an access unit (AU) at a random access point, or an IRAP picture
  • AU access unit
  • IRAP picture a non-intra random random access point picture
  • FIG. 1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment.
  • FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment.
  • FIG. 3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using multiple layers to which the present invention can be applied.
  • FIG. 4 is a diagram for describing a method of gradually decoding a bitstream according to the present invention.
  • FIG. 5 is a diagram illustrating a bitstream structure including a randomly accessible point of time according to an embodiment of the present invention.
  • FIG. 6 is a diagram illustrating a bitstream structure including randomly accessible viewpoints according to another embodiment of the present invention.
  • FIG. 7 is a diagram illustrating a bitstream structure including randomly accessible viewpoints according to another embodiment of the present invention.
  • FIG. 8 is a diagram illustrating a bitstream structure including a randomly accessible time point according to another embodiment of the present invention.
  • FIG. 9 is a control flowchart illustrating a method of decoding an image according to an embodiment of the present invention.
  • FIG. 10 is a diagram for describing a method of setting a decoding parameter according to an embodiment of the present invention.
  • 11 is a control flowchart illustrating a decoding method of an inter-layer video according to an embodiment of the present invention.
  • first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
  • the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
  • each component shown in the embodiments of the present invention are shown independently to represent different characteristic functions, and do not mean that each component is made of separate hardware or one software component unit.
  • each component is included in each component for convenience of description, and at least two of the components may be combined into one component, or one component may be divided into a plurality of components to perform a function.
  • Integrated and separate embodiments of the components are also included within the scope of the present invention without departing from the spirit of the invention.
  • the components may not be essential components for performing essential functions in the present invention, but may be optional components for improving performance.
  • the present invention can be implemented including only the components essential for implementing the essentials of the present invention except for the components used for improving performance, and the structure including only the essential components except for the optional components used for improving performance. Also included in the scope of the present invention.
  • FIG. 1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment.
  • a scalable video encoding / decoding method or apparatus may be implemented by an extension of a general video encoding / decoding method or apparatus that does not provide scalability, and the block diagram of FIG. 1 is scalable.
  • An embodiment of an image encoding apparatus that may be the basis of a video encoding apparatus is illustrated.
  • the image encoding apparatus 100 may include a motion predictor 111, a motion compensator 112, an intra predictor 120, a switch 115, a subtractor 125, and a converter 130. And a quantization unit 140, an entropy encoding unit 150, an inverse quantization unit 160, an inverse transform unit 170, an adder 175, a filter unit 180, and a reference image buffer 190.
  • the image encoding apparatus 100 may perform encoding in an intra mode or an inter mode on an input image and output a bit stream.
  • Intra prediction means intra prediction and inter prediction means inter prediction.
  • the switch 115 is switched to intra, and in the inter mode, the switch 115 is switched to inter.
  • the image encoding apparatus 100 may generate a prediction block for an input block of an input image and then encode a difference between the input block and the prediction block.
  • the intra predictor 120 may generate a prediction block by performing spatial prediction using pixel values of blocks that are already encoded around the current block.
  • the motion predictor 111 may obtain a motion vector by searching for a region that best matches an input block in the reference image stored in the reference image buffer 190 during the motion prediction process.
  • the motion compensator 112 may generate a prediction block by performing motion compensation using the motion vector and the reference image stored in the reference image buffer 190.
  • the subtractor 125 may generate a residual block by the difference between the input block and the generated prediction block.
  • the transform unit 130 may output a transform coefficient by performing transform on the residual block.
  • the quantization unit 140 may output the quantized coefficient by quantizing the input transform coefficient according to the quantization parameter.
  • the entropy encoding unit 150 entropy encodes a symbol according to a probability distribution based on values calculated by the quantization unit 140 or encoding parameter values calculated in the encoding process, thereby generating a bit stream. You can print
  • the entropy encoding method is a method of receiving a symbol having various values and expressing it in a decodable column while removing statistical redundancy.
  • the symbol means a syntax element, a coding parameter, a residual signal value, or the like that is to be encoded / decoded.
  • the encoding parameter is a parameter necessary for encoding and decoding, and may include information that may be inferred in the encoding or decoding process as well as information encoded by the encoding apparatus and transmitted to the decoding apparatus, such as syntax elements. This means the information you need to do.
  • Coding parameters may be, for example, intra / inter prediction modes, moving / motion vectors, reference picture indexes, coding block patterns, presence or absence of residual signals, transform coefficients, quantized transform coefficients, quantization parameters, block sizes, block partitioning information, or the like. May include statistics.
  • the residual signal may mean a difference between the original signal and the prediction signal, and a signal in which the difference between the original signal and the prediction signal is transformed or a signal in which the difference between the original signal and the prediction signal is converted and quantized It may mean.
  • the residual signal may be referred to as a residual block in block units.
  • the entropy encoder 150 may store a table for performing entropy encoding, such as a variable length coding (VLC) table, and the entropy encoder 150 may store the stored variable length encoding. Entropy encoding may be performed using the (VLC) table. In addition, the entropy encoder 150 derives a binarization method of a target symbol and a probability model of a target symbol / bin, and then performs entropy encoding using the derived binarization method or a probability model. You may.
  • VLC variable length coding
  • CABAC context-adaptive binary arithmetic coding
  • the quantized coefficients may be inversely quantized by the inverse quantizer 160 and inversely transformed by the inverse transformer 170.
  • the inverse quantized and inverse transformed coefficients are added to the prediction block through the adder 175 and a reconstruction block can be generated.
  • the reconstruction block passes through the filter unit 180, and the filter unit 180 applies at least one or more of a deblocking filter, a sample adaptive offset (SAO), and an adaptive loop filter (ALF) to the reconstruction block or the reconstruction picture. can do.
  • the reconstructed block that has passed through the filter unit 180 may be stored in the reference image buffer 190.
  • FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment.
  • a scalable video encoding / decoding method or apparatus may be implemented by extension of a general video encoding / decoding method or apparatus that does not provide scalability
  • the block diagram of FIG. 2 is scalable video decoding.
  • An embodiment of an image decoding apparatus that may be the basis of an apparatus is shown.
  • the image decoding apparatus 200 may include an entropy decoder 210, an inverse quantizer 220, an inverse transformer 230, an intra predictor 240, a motion compensator 250, and a filter. 260 and a reference picture buffer 270.
  • the image decoding apparatus 200 may receive a bitstream output from the encoding apparatus, perform decoding in an intra mode or an inter mode, and output a reconstructed image, that is, a reconstructed image.
  • the switch In the intra mode, the switch may be switched to intra, and in the inter mode, the switch may be switched to inter.
  • the image decoding apparatus 200 may generate a reconstructed block, that is, a reconstructed block by obtaining a residual block reconstructed from the received bitstream, generating a prediction block, and adding the reconstructed residual block and the prediction block.
  • the entropy decoder 210 may entropy decode the input bitstream according to a probability distribution to generate symbols including symbols in the form of quantized coefficients.
  • the entropy decoding method is a method of generating each symbol by receiving a binary string.
  • the entropy decoding method is similar to the entropy coding method described above.
  • the quantized coefficients are inversely quantized by the inverse quantizer 220 and inversely transformed by the inverse transformer 230, and as a result of the inverse quantization / inverse transformation of the quantized coefficients, a reconstructed residual block may be generated.
  • the intra predictor 240 may generate a predictive block by performing spatial prediction using pixel values of an already encoded block around the current block.
  • the motion compensator 250 may generate a prediction block by performing motion compensation using the motion vector and the reference image stored in the reference image buffer 270.
  • the reconstructed residual block and the prediction block are added through the adder 255, and the added block passes through the filter unit 260.
  • the filter unit 260 may apply at least one or more of the deblocking filter, SAO, and ALF to the reconstructed block or the reconstructed picture.
  • the filter unit 260 outputs a reconstructed image, that is, a reconstructed image.
  • the reconstructed picture may be stored in the reference picture buffer 270 to be used for inter prediction.
  • components directly related to the decoding of an image in the reference image buffer 270 for example, an entropy decoder 210, an inverse quantizer 220, an inverse transformer 230, an intra predictor 240, and motion compensation.
  • the unit 250, the filter unit 260, and the like may be distinguished from other components and expressed as a decoder or a decoder.
  • the image decoding apparatus 200 may further include a parsing unit (not shown) which parses information related to an encoded image included in a bitstream.
  • the parser may include the entropy decoder 210 or may be included in the entropy decoder 210. Such a parser may also be implemented as one component of the decoder.
  • FIG. 3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using multiple layers to which the present invention can be applied.
  • a group of pictures represents a picture group, that is, a group of pictures.
  • a transmission medium In order to transmit image data, a transmission medium is required, and its performance varies depending on the transmission medium according to various network environments.
  • a scalable video coding method may be provided for application to such various transmission media or network environments.
  • the scalable video coding method is a coding method that improves encoding / decoding performance by removing inter-layer redundancy by using texture information, motion information, and residual signals between layers.
  • the scalable video coding method may provide various scalability in terms of spatial, temporal, and image quality according to ambient conditions such as a transmission bit rate, a transmission error rate, and a system resource.
  • Scalable video coding may be performed using multiple layers structure to provide a bitstream applicable to various network situations.
  • the scalable video coding structure may include a base layer that compresses and processes image data by using a general image encoding method, and compresses the image data by using the encoding information of the base layer and a general image encoding method together. May include an enhancement layer for processing.
  • a layer is an image and a bit divided based on spatial (eg, image size), temporal (eg, coding order, image output order, frame rate), image quality, complexity, and the like.
  • the base layer may mean a lower layer, a reference layer or a base layer
  • the enhancement layer may mean an upper layer and an enhancement layer.
  • the plurality of layers may have a dependency between each other.
  • the base layer may be defined as a standard definition (SD), a frame rate of 15 Hz, and a 1 Mbps bit rate
  • the first enhancement layer may be a high definition (HD), a frame rate of 30 Hz, and a 3.9 Mbps bit rate
  • the second enhancement layer may be defined as 4K-UHD (ultra high definition), a frame rate of 60 Hz, and a bit rate of 27.2 Mbps.
  • the format, frame rate, bit rate, etc. are exemplary and may be determined differently as necessary.
  • the number of hierarchies used is not limited to this embodiment and may be determined differently according to a situation.
  • the frame rate of the first enhancement layer HD may be reduced and transmitted at 15 Hz or less.
  • the scalable video coding method can provide temporal, spatial and image quality scalability by the method described above in the embodiment of FIG. 3.
  • encoding and decoding video supporting a plurality of layers in a bitstream that is, scalable coding
  • the prediction is performed by using the correlation.
  • Duplicate elements can be removed and the encoding performance of the image can be improved.
  • Performing prediction of the current layer to be predicted using information of another layer is referred to as inter-layer prediction in the following.
  • Scalable video coding has the same meaning as scalable video coding from a coding point of view and scalable video decoding from a decoding point of view.
  • At least one of a resolution, a frame rate, and a color format may be different from each other, and upsampling or downsampling of a layer may be performed to adjust resolution when inter-layer prediction is performed.
  • SVC scalable video coding / decoding
  • the bitstream may support at least one scalability, for example, spatial, image quality, and view scalability, and thus, a video having at least one or more hierarchical structures may be encoded / decoded to allow random access.
  • at least one scalability for example, spatial, image quality, and view scalability
  • the number of layers may be one or more, and the lowest layer may be referred to as a base layer.
  • a case where the number of layers is three will be described as an example.
  • the present invention can be applied without being limited to the number of layers.
  • the upper layer and the lower layer may be encoded / decoded in a single loop or multiple loops.
  • the decoding apparatus may decode the bitstream from the random access point generated in the encoding apparatus.
  • the base layer included in an access unit (AU) at random access time is encoded as an intra random access point (IRAP) picture, and upper layers are non-IRAP picture (hereinafter referred to as non-IRAP picture) or IRAP pictures. Can be encoded by.
  • An access unit refers to a set of NAL units or pictures that can be displayed at the same time.
  • the bitstream can be decoded from this time point.
  • a problem may occur that the bitstream cannot be decoded from that time.
  • the present invention proposes a method for enabling decoding of a bitstream from a corresponding point of time even when a picture of a layer corresponding to a random access point includes a picture that is not accessible randomly.
  • FIG. 4 is a diagram for describing a method of gradually decoding a bitstream according to the present invention.
  • each layer may include an IRAP picture and a non-IRAP picture.
  • the picture of the first layer (layer 0) among the pictures included in the AU A is an IRAP picture, but the pictures of the second layer (layer 1) and the third layer (layer 2) are non-IRAP pictures.
  • the picture of the second layer (layer 1) among the pictures included in the AU B is an IRAP picture, but the pictures of the first layer (layer 0) and the third layer (layer 2) are non-IRAP pictures.
  • the picture of the third layer (layer 2) of the pictures included in the AU C is an IRAP picture, but the pictures of the first layer (layer 0) and the second layer (layer 1) are non-IRAP pictures.
  • the bitstream may be encoded / decoded so as to be displayed gradually from the lower layer normally decoded to the final highest layer.
  • 5 to 8 illustrate a bitstream structure including a randomly accessible view point according to an embodiment of the present invention.
  • one IRAP picture is included in the AU as shown in FIG. 4, and the IRAP pictures are sequentially included in order from the lowest lower layer to the higher layer.
  • a first layer (layer 0) is decoded from AU A
  • a first layer (layer 0) and a second layer (layer 1) are decoded from AU B
  • all layers from AU C. (layer 0, layer 1, layer 2) may be decoded.
  • the pictures of the first layer (layer 0) and the second layer (layer 1) of the pictures included in the AU A are IRAP pictures, and in the case of the next AU B, the pictures of the second layer (layer 1) Only an IRAP picture.
  • the first layer (layer 0) and the second layer (layer 1) are decoded from AU A, and from AU C to the third layer (layer 2), that is, all layers (layer 0, layer 1, layer 2) may be decoded.
  • the picture of the first layer (layer 0) among the pictures included in the AU A is an IRAP picture
  • the pictures of the second layer (layer 1) and the third layer (layer 2) are non-IRAP pictures.
  • pictures of the first layer (layer 0) and the second layer (layer 1) of the pictures included in the AU B are IRAP pictures
  • the second layer (layer 1) and the third of the pictures included in the AU C are included.
  • the picture of the layer (layer 2) is an IRAP picture.
  • the first layer (layer 0) is decoded from AU A
  • the first layer (layer 0) and the second layer (layer 1) are decoded from AU B, as shown in FIG.
  • Layers (layer 0, layer 1, layer 2) may be decoded.
  • the first layer (layer 0) is decoded from AU A
  • the second layer (layer 1) is a picture because the picture of the second layer (layer 1) is a non-IRAP picture and is not an IRAP picture. Cannot be decrypted.
  • the pictures of the third layer (layer 2) referring to the second layer (layer 1) may not be normally output. Accordingly, the first layer (layer 0) and the second layer (layer1) can be decoded from AU C, and then all layers (layer 0) from AU D including the IRAP picture in the second layer (layer 1). , layer 1, layer 2) may be decoded.
  • the present invention provides an encoding and decoding method capable of random access from AU A in a bitstream having the structure of FIGS. 5 to 8.
  • the encoding and decoding method of the present invention is not limited to the bitstream structure of FIGS. 5 to 8.
  • the encoding / decoding process from the corresponding AU is described below when the base layer is an IRAP picture and the AU in which the upper layer is present is the first AU of the bitstream.
  • a method of decoding a bitstream from a randomly accessible point of time receives a bitstream up to nth layer, decodes nth layer, and then decodes the layer up to (n + k) th layer (where k> 0). The same may be applied to the case of starting input reception, that is, in case of layer-switching.
  • 9 is a control flowchart illustrating a method of decoding an image according to an embodiment of the present invention. 9 may be equally applied to an image encoding method as well as an image decoding method.
  • the apparatus for decoding an image may set parameters necessary for decoding (S910).
  • parameters necessary for decoding may be initialized or reset in the following order.
  • the order of (4) and (5) does not change. Specifically, the initialization process for (5) should be performed after (3) and (4), and the initialization process of (5) may be performed only when NoRaslOutputFlag is '1' in (3) or (4). have.
  • the flag value, FirstPicInLayerDecodedFlag, of all possible higher layers can be initialized to '0'. have.
  • an initialization flag (for example, named LayerInitialisedFlag) that indicates whether the layer is an IRAP picture and all reference layers in that layer are normally initialized so that the decoded picture of that layer can be used as the output of the decoder. It can be set as That is, the initialization flag indicates whether the target picture to be decoded can be normally decoded and output.
  • the value can be reset according to whether the flag variable and the reference layer referenced by the layer are normally decoded as described below.
  • NoRaslOutputFlag can be set to '1'.
  • NoRaslOutputFlag is a flag variable indicating whether the corresponding IRAP picture is used as a random access point.
  • NoRaslOutputFlag may be set to '1'.
  • the flag variable is set to 1.
  • LayerInitialisedFlag [n] may be set to '1'.
  • the flag variable is 1
  • the initialization flag set for the decoding target picture is 0, and the initialization flags for all the reference layers of the layer to which the decoding target picture belongs are 1, initialization for the target layer.
  • the flag is reset from 0 to 1.
  • the decoding apparatus calculates a POC, which is an identifier of a picture to be currently encoded / decoded, so that a value increases according to the output order (S920).
  • the Picture Order Counter is an identifier for identifying pictures in a layer having the same nuh_layer_id in a coded video stream.
  • the value may increase as the output order from the DPB becomes late. have. That is, the POC is a display order in which pictures may be output from the DPB and displayed, and the POC of the IDR picture may have a value of '0'.
  • All pictures in the same AU may have the same POC value.
  • the POC value may be calculated by the following conventional method.
  • POC_LSB value may be signaled in the slice header of the picture, and the MaxPOCLSB value indicating the maximum LSB may be signaled in the sequence parameter set.
  • POC_MSB is the POC_MSB (hereinafter referred to as prevPOCMSB) and POC_LSB (hereinafter referred to as prevPOCLSB) of the picture closest to the current picture among temporal_id '0' indicating an identifier of a temporal sublayer previously encoded / decoded. And POC_LSB of the picture to be currently encoded / decoded.
  • the POC value of an IDR picture can always be assumed to be '0', if the CRA picture is the first picture of the bitstream or for a BLA picture, the POC_MSB value of the picture is assumed to be '0' and the POC_LSB value is defined in the slice header. May be signaled. If the CRA picture is not the first picture of the bitstream, the POC value may be calculated in the same way as the non-IRAP picture.
  • the POC value may be the same.
  • flag information such as poc_reset_flag may be signaled in the slice header.
  • the decoding apparatus may set POC values of all pictures in the AU including the slice to '0'.
  • the decoding apparatus calculates an original POC value (POC 1) calculated by a conventional method of a current encoding / decoding target picture, and POC values of reference layers existing in the decoded picture buffer of the corresponding layer including the picture. It decreases by 1 and sets the POC value of the current target picture to '0'.
  • decoding of pictures included in the corresponding AU may be performed in the following order.
  • the initial POC value of the target picture to be currently encoded / decoded may be set as follows.
  • the POC value may be set to '0'.
  • the POC_MSB value is 0 and the POC value may be set to the POC_LSB value signaled in the slice header.
  • the general POC configuration method using the POC_LSB value signaled in the slice header (as described above, the current picture among previously encoded / decoded temporal_id '0')
  • the POC value ('POC1') may be set according to POC_MSB (named prevPOCMSB) and POC_LSB (named prevPOCLSB) and POC_LSB value of a picture to be currently encoded / decoded.
  • the POC value of the picture including the slice having poc_reset_flag of '1' may be set to '0'.
  • the picture of the base layer (layer 0) is an IRAP picture and is the first picture of the coded bitstream.
  • decoding of pictures included in a corresponding AU may be performed in the following order.
  • the initial POC value of the target picture to be currently encoded / decoded may be set as follows.
  • the POC value may be set to '0'.
  • the POC_MSB value is 0 and the POC value may be set to the POC_LSB value signaled in the slice header.
  • the POC value ('POC1') can be set using one of the following methods: Can be.
  • the POC value 'POC1' may be set according to the conventional POC setting method described above using the POC_LSB value signaled in the slice header.
  • both prevPOCMSB and prevPOCLSB values may be set to '0'.
  • the POC_MSB value of the current decoding target picture may be set to '0'.
  • the POC value ('POC1') of the picture to be currently decoded is set to the POC_LSB value signaled in the slice header of the current picture.
  • the POC value of the current decoding target picture may be set to '0' without being calculated.
  • the POC value of the picture including the slice having poc_reset_flag of '1' may be set to '0'.
  • the apparatus for decoding an image configures a reference picture set and performs reference picture marking for inter prediction of the current sub / decoding target picture (S930).
  • the POC value or the POC_LSB value of the slice reference pictures of the picture may be calculated as follows.
  • POC values of short-term reference pictures are calculated using a delta_POC value indicating each short-term reference picture signaled in the corresponding slice header and a 'POC1' value, which is a POC value of the current picture. do.
  • the delta_POC value may be a difference between the POC of the current picture and the i-th short-term reference picture or a difference between the (i + 1) th short-term reference picture and the i-th short-term reference picture.
  • a long-term reference picture is referred to using a POC_LSB value for each long-term reference picture, a value for calculating a POC_MSB value (delta_poc_msb_cycle_lt), and a 'POC1' value, which is a POC value of the current picture.
  • the POC_LSB or POC value of the pictures is calculated.
  • delta_poc_msb_cycle_lt may be signaled in the corresponding slice header.
  • the POC_LSB is signaled for the long-term reference picture, and only the POC_LSB value signaled identifies the long-term POC.
  • a value (delta_poc_msb_cycle_lt) for calculating the POC_MSB value is additionally signaled, thereby identifying the POC of each reference picture. Do it.
  • POC values or POC_LSB values of slice reference pictures of the picture may be calculated as follows.
  • POC values of short-term reference pictures are calculated using a delta_POC value indicating each short-term reference picture signaled in the corresponding slice header and a '0' value, which is a POC value of the current picture. do.
  • the delta_POC value may be a difference between the POC of the current picture and the i-th short-term reference picture or a difference between the (i + 1) th short-term reference picture and the i-th short-term reference picture.
  • a value for calculating a POC_LSB value and a POC_MSB value for each long-term reference picture (delta_poc_msb_cycle_lt), a '0' value that is a POC value of the current picture, and a slice header of the current picture
  • the POC_LSB value or the POC value of the long term reference pictures is calculated using the POC_LSB value signaled at.
  • delta_poc_msb_cycle_lt may be signaled in the slice header of the corresponding long-term reference picture.
  • the POC_LSB is signaled for the long-term reference picture, and only the POC_LSB value signaled identifies the long-term POC.
  • a value (delta_poc_msb_cycle_lt) for calculating the POC_MSB value is additionally signaled, thereby identifying the POC of each reference picture. Do it.
  • the POC value or POC_LSB value of the reference pictures signaled to the slice header of the picture including the slice having poc_reset_flag '1' matches the POC value of the pictures existing in the DPB. Can be adjusted using the 'POC1' value of the current picture.
  • a reference picture set may be configured and the shape of the reference picture may be displayed according to the presence or absence of the reference picture in the DPB.
  • the image decoding apparatus may generate a virtual reference picture for a picture included in the reference picture set but not present in the DPB (S940).
  • the POC values of the generated virtual pictures have the POC values of the respective reference pictures calculated in step S920.
  • the PicOutputFlag value indicating whether the virtual reference picture is output in the DPB is set to '0', and the virtual reference picture generated thereby is not output.
  • the generated virtual reference pictures may be represented as short-term reference pictures or long-term reference pictures as content signaled in the reference picture set. Specifically, the short-term reference picture referenced by the current picture and before the current picture in display order, the short-term reference picture referenced by the current picture and displayed in the display order after the current picture, the long-term reference picture referenced by the current picture, and pictures after the current picture. It may be indicated as a short-term reference picture referenced by, a long-term reference picture referenced by pictures after the current picture.
  • the image decoding apparatus may determine whether to output the current encoding / decoding target picture (S950).
  • PicOutputFlag which indicates whether to output the decoded picture, can be set in the following order. If the PicOutputFlag value is '1', the decoded picture buffer is exported to the decoder output, and if it is '0', the decoded picture buffer is output to the decoder output. You can not send.
  • the RASL picture is a leading picture in which the encoding / decoding order is later than the IRAP picture closest to the current picture among the IRAP pictures decoded / decoded before the current picture, but the display order is first, and before the IRAP picture.
  • the PicOutputFlag may be set to '0'.
  • the PicOutputFlag value may be set by one of the following methods.
  • the PicOutputFlag value may be set to a pic_output_flag value signaled in the slice header.
  • the PicOutputFlag value may be set to a pic_output_flag value signaled in the slice header when the corresponding layer is an output layer, and set to '0' when the corresponding layer is not an output layer.
  • PicOutputFlag may be set to zero.
  • the layer is not an output layer, and there is a layer whose immediate upper layer is the IRAP picture and among the lower reference layers of the upper layer, the layerInitialisedFlag value is '0', and the upper layer of the layer is the output layer.
  • PicOutputFlag of the layer may be set to a pic_output_flag value signaled in the slice header.
  • the PicOutputFlag value of the corresponding layer may be set to '0'.
  • the layer is not an output layer, and there is no picture corresponding to the output layer within the same AU, and the layer is either a direct reference picture (a picture of a layer directly referenced by the layer) or an indirect reference picture of the output layer picture.
  • a layer required for encoding / decoding a layer directly referenced by the corresponding layer), and a picture that is a direct or indirect reference picture of an output layer picture and a pic_output_flag value signaled in the slice header is '1'
  • the PicOutputFlag value of the picture may be set to '1'. Otherwise, PicOutputFlag value may be set to '0'.
  • the layer is not an output layer, and there is no picture corresponding to the output layer within the same AU, and the layer is either a direct reference picture (a picture of a layer directly referenced by the layer) or an indirect reference picture of the output layer picture.
  • Layer required when encoding / decoding a layer directly referenced by the corresponding layer) the picture is a direct or indirect reference picture of an output layer picture
  • the LayerInitializedFlag value is '1'
  • the pic_output_flag value signaled in the slice header is In the case of the highest picture among the decoded pictures that are '1' (ie, the picture having the highest nuh_layer_id value), the PicOutputFlag value of the picture may be set to '1'. Otherwise, PicOutputFlag value may be set to '0'.
  • the decoding apparatus may control the output of the pictures in the decoded picture buffer so that not only the set PicOutputFlag value but also the same result as described.
  • the image decoding apparatus When it is determined whether to output the picture, the image decoding apparatus performs motion prediction and motion compensation on the current sub / decoding target image by using the reference picture in the reference picture list (S960).
  • motion prediction and motion compensation may be performed on the current sub / decoded target image using a reference picture in the reference picture list using a conventional inter-picture prediction method.
  • the decoding apparatus may perform the decoding process for steps S910 to S970 for each layer. After the decoding process in the first AU is finished, pictures having a PicOutputFlag of 1 among the layers in the first AU may be output.
  • the decoded picture of the layer And decoded pictures of the n th layer if the n + 1 th layer to the top layer are included in the output layer.
  • the decoding apparatus may decode the bitstream by repeatedly performing the above process up to the AU where the picture of the highest layer is decoded.
  • FIG. 10 is a diagram for describing a method of setting a decoding parameter according to an embodiment of the present invention.
  • the decoding apparatus may set the initialization flag to 1 for the base layer and the initialization flag to 0 for the layer higher than the base layer (S1110).
  • the initialization flag (LayerInitialisedFlag) is used in the bitstream when the base layer included in the AU at the randomly accessible point is decoded from the randomly accessible point as the bitstream whose IRAP picture (ie, IDR picture, BLA picture, or CRA picture) is decoded from the randomly accessible point.
  • this layer is an IRAP picture, and all reference layers of the layer are normally initialized to indicate whether a decoded picture of the layer can be used as an output of the decoder.
  • the normalization of the reference layers may mean that the picture may be normally decoded and output from the decoder.
  • the decoding apparatus may set a flag variable (NoRaslOutputFlag) indicating whether the picture is used as a random access point.
  • a flag variable NoRaslOutputFlag
  • the decoding apparatus may set the flag variable to 1 when the target picture is an IDR picture or a BLA picture or an IRAP picture and the first picture of the corresponding layer (nuh_layer_id is the same layer as the current encoding and decoding target picture) in the corresponding bitstream. If the target picture is an IRAP picture, the initialization flag set in the layer to which the target picture belongs is 0, and the initialization flag for all reference layers of the layer to which the target picture belongs is 1, the flag variable may be set to 1 (S1020).
  • the decoding apparatus determines whether the initialization flag set for the target picture is 0 and the initialization flag for all reference layers of the layer to which the target picture belongs is 1. If it is determined that the initialization flag for the target picture is 0 and the initialization flag for all reference layers of the layer to which the target picture belongs is 1, the initialization flag for the target layer may be set from 0 to 1 (S1030).
  • step S1010 may be changed to 1.
  • decoding of the target picture may be performed as in the remaining steps of FIG. 9, and a picture of a higher layer may be gradually output depending on whether the decoded picture is output.
  • FIG. 11 is a control flowchart illustrating a decoding method of an inter-layer video according to an embodiment of the present invention. A method of finally decoding a bitstream by decoding an image for each access unit and applying the same to a plurality of layers will be described with reference to FIG. 11.
  • the decoding apparatus sets parameters for decoding as shown in steps S910 and 10 of FIG. 9 and decodes pictures included in the first access unit (S1110).
  • the decoding process of the picture may correspond to steps S910 to S970 of FIG. 9, and overlapping description is omitted.
  • the decoding apparatus may determine whether to individually output the pictures included in the first access unit (S1120).
  • the decoding apparatus determines whether to individually output the pictures, when the initialization flag value of the target layer to which the decoding target picture belongs is 0, the decoding apparatus may determine that the target picture is not output.
  • the decoding apparatus determines that the target layer to which the decoding target picture belongs is not the output layer, and there is no output target picture belonging to the output layer in the access unit to which the target layer belongs, that is, the output target picture is not output.
  • the target picture is the highest picture among decoded pictures having flag information indicating whether or not the picture signaled in the slice header is output is 1, It may be determined to output the target picture.
  • the decoding apparatus determines that the target layer to which the decoding target picture belongs is not the output layer, and there is no output target picture belonging to the output layer in the access unit to which the target layer belongs, that is, the output target picture is not output.
  • the layer is used as a reference of the output layer
  • the target picture is used as the reference picture of the output picture
  • the initialization flag value for the target layer is 1
  • the flag information indicating whether the target picture is outputted in the picture signaled in the slice header is 1
  • the target picture may be determined to be output.
  • the decoding apparatus may decode the pictures included in the second access unit, which is the next access unit (S1130), and determine whether to output each of them separately (S1140).
  • the method of determining whether to output pictures included in the second access unit is the same as the method of determining whether to output pictures included in the first access unit.
  • the decoding apparatus may output a picture determined to be output in correspondence with an output time point of each picture.
  • the decoding apparatus individually determines whether to output the pictures included in the access unit, and outputs pictures that can be output, and sequentially determines whether to output the pictures of the next access unit, and then decodes normally. Output pictures of a layer that can be output. Through this process, pictures of a higher layer may be gradually decoded and output.
  • an intra-random access point (IRAP) picture and a non-intra-random random access point picture (IRAP) picture or only an IRAP picture exist in an access unit (AU) at a random access point.
  • IRAP intra-random access point
  • AU access unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

La présente invention concerne un procédé de décodage d'un flux binaire comprenant plusieurs couches, faisant appel aux étapes consistant à : définir un paramètre de décodage du flux binaire à partir d'un point auquel un accès aléatoire s'est produit si l'accès aléatoire a eu lieu ; et décoder une image à décoder, l'étape de définition d'un paramètre pouvant comprendre les étapes consistant à : définir un drapeau d'initialisation indiquant si l'image à décoder a pu être émise en sortie puisque l'image à décoder est une image de point d'accès aléatoire intra (IRAP), et toutes les couches de référence d'une couche cible à laquelle l'image à décoder appartient sont normalement décodées, si l'accès aléatoire a eu lieu ; définir une variable de drapeau (NoRaslOutputFlag) indiquant si l'image à décoder a été utilisée en tant que point d'accès aléatoire ; et restaurer le drapeau d'initialisation si l'image à décoder est une image IRAP et si la variable de drapeau est 1.
PCT/KR2014/009409 2013-10-11 2014-10-07 Procédé de codage et de décodage d'image et dispositif l'utilisant WO2015053525A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US15/027,101 US10142638B2 (en) 2013-10-11 2014-10-07 Method for encoding/decoding image and device using same
CN201480064335.8A CN105765978B (zh) 2013-10-11 2014-10-07 用于编码/解码图像的方法和使用其的装置
US16/157,217 US10306244B2 (en) 2013-10-11 2018-10-11 Method for encoding/decoding image and device using same

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
KR20130121430 2013-10-11
KR10-2013-0121430 2013-10-11
KR20130121357 2013-10-11
KR10-2013-0121357 2013-10-11
KR20140031880A KR20150042683A (ko) 2013-10-11 2014-03-18 점진적 계층전환을 이용한 영상 부호화/복호화 방법 및 그 장치
KR10-2014-0031880 2014-03-18
KR10-2014-0132543 2014-10-01
KR1020140132543A KR102246634B1 (ko) 2013-10-11 2014-10-01 영상의 부호화/복호화 방법 및 이를 이용하는 장치
KR1020140134240A KR102257754B1 (ko) 2013-10-11 2014-10-06 영상의 부호화/복호화 방법 및 이를 이용하는 장치
KR10-2014-0134240 2014-10-06

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/027,101 A-371-Of-International US10142638B2 (en) 2013-10-11 2014-10-07 Method for encoding/decoding image and device using same
US16/157,217 Continuation US10306244B2 (en) 2013-10-11 2018-10-11 Method for encoding/decoding image and device using same

Publications (1)

Publication Number Publication Date
WO2015053525A1 true WO2015053525A1 (fr) 2015-04-16

Family

ID=52813303

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2014/009409 WO2015053525A1 (fr) 2013-10-11 2014-10-07 Procédé de codage et de décodage d'image et dispositif l'utilisant

Country Status (1)

Country Link
WO (1) WO2015053525A1 (fr)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003063500A1 (fr) * 2002-01-22 2003-07-31 Microsoft Corporation Procedes et systemes de codage et de decodage de donnees video permettant un acces aleatoire et un collage
JP2003319340A (ja) * 2002-04-26 2003-11-07 Sony Corp 符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム
US20050021814A1 (en) * 2003-06-19 2005-01-27 Nokia Corporation Stream switching based on gradual decoder refresh
KR20080114500A (ko) * 2007-06-25 2008-12-31 한국전자통신연구원 기본시점 복호 및 시점 임의 접근을 지원하기 위한 다시점비디오 부호화 시스템, 복호화 시스템 및 비트스트림 추출시스템
US20130077687A1 (en) * 2011-09-23 2013-03-28 Ye-Kui Wang Coding reference pictures for a reference picture set

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003063500A1 (fr) * 2002-01-22 2003-07-31 Microsoft Corporation Procedes et systemes de codage et de decodage de donnees video permettant un acces aleatoire et un collage
JP2003319340A (ja) * 2002-04-26 2003-11-07 Sony Corp 符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム
US20050021814A1 (en) * 2003-06-19 2005-01-27 Nokia Corporation Stream switching based on gradual decoder refresh
KR20080114500A (ko) * 2007-06-25 2008-12-31 한국전자통신연구원 기본시점 복호 및 시점 임의 접근을 지원하기 위한 다시점비디오 부호화 시스템, 복호화 시스템 및 비트스트림 추출시스템
US20130077687A1 (en) * 2011-09-23 2013-03-28 Ye-Kui Wang Coding reference pictures for a reference picture set

Similar Documents

Publication Publication Date Title
WO2020071830A1 (fr) Procédé de codage d'images utilisant des informations de mouvement basées sur l'historique, et dispositif associé
WO2013157826A1 (fr) Procédé de décodage de données d'image, procédé de décodage d'image et dispositif utilisant ces procédés
WO2015009036A1 (fr) Procédé et appareil de prédiction inter-couche en fonction d'informations de sous-couche temporelle
WO2014163241A1 (fr) Procédé et appareil permettant de traiter une vidéo
WO2013157797A1 (fr) Procédé et dispositif permettant de coder une vidéo multicouche, et procédé et dispositif permettant de décoder une vidéo multicouche
WO2014042460A1 (fr) Procédé et appareil pour le codage/décodage d'images
WO2014084656A1 (fr) Procédé et dispositif de codage/décodage d'image prenant en charge une pluralité de couches
WO2015002460A1 (fr) Procédé de codage et de décodage de vidéo comprenant une pluralité de couches
WO2013162249A1 (fr) Procédé de codage vidéo, procédé de décodage vidéo et appareil les mettant en œuvre
WO2015012600A1 (fr) Procédé et appareil de codage/décodage d'une image
WO2014007550A1 (fr) Procédé et appareil pour coder une vidéo ayant une échelonnabilité temporelle, et procédé et appareil pour décoder une vidéo ayant une échelonnabilité temporelle
WO2021201515A1 (fr) Procédé et dispositif de codage/décodage d'image pour signalisation hls, et support d'enregistrement lisible par ordinateur dans lequel est stocké un flux binaire
WO2014038905A2 (fr) Procédé de décodage d'image et appareil l'utilisant
WO2014051372A1 (fr) Procédé de décodage d'image, et appareil l'utilisant
WO2015194913A1 (fr) Procédé pour encoder et décoder une image et dispositif l'utilisant
WO2021246791A1 (fr) Procédé et appareil pour traiter une syntaxe de haut niveau dans un système de codage d'image/de vidéo
WO2021107622A1 (fr) Procédé et appareil de codage d'image/de vidéo
WO2015053525A1 (fr) Procédé de codage et de décodage d'image et dispositif l'utilisant
WO2013062175A1 (fr) Procédé et dispositif pour codage adaptable de couche d'enrichissement
WO2021201617A1 (fr) Appareil et procédé de codage/décodage d'image d'après des informations de sous-image alignées entre des couches, et support d'enregistrement stockant un flux binaire
WO2021201616A1 (fr) Procédé et appareil de codage/décodage d'image basés sur des informations de division d'image et des informations de sous-image, et support d'enregistrement stockant un flux binaire
WO2021201542A1 (fr) Procédé et appareil de codage/décodage vidéo permettant de signaler un paramètre dpb, et support d'enregistrement lisible par ordinateur stockant un flux binaire
WO2021201541A1 (fr) Procédé et appareil de codage/décodage d'image pour signaler des informations concernant le nombre de paramètres de dpb, et support d'enregistrement lisible par ordinateur stockant un train de bits
WO2023195819A1 (fr) Procédé et appareil de codage/décodage d'image basés sur un procédé de balayage de tranche adaptatif, et support d'enregistrement pour stocker un flux binaire
WO2021201514A1 (fr) Procédé et dispositif de codage/décodage d'image pour signaler des paramètres de hrd, et support d'enregistrement lisible par ordinateur sur lequel un flux binaire est stocké

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14852402

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 15027101

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14852402

Country of ref document: EP

Kind code of ref document: A1