WO2013155828A1 - Video image code stream processing method and device - Google Patents

Video image code stream processing method and device Download PDF

Info

Publication number
WO2013155828A1
WO2013155828A1 PCT/CN2012/083682 CN2012083682W WO2013155828A1 WO 2013155828 A1 WO2013155828 A1 WO 2013155828A1 CN 2012083682 W CN2012083682 W CN 2012083682W WO 2013155828 A1 WO2013155828 A1 WO 2013155828A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
syntax element
code stream
type
images
Prior art date
Application number
PCT/CN2012/083682
Other languages
French (fr)
Chinese (zh)
Inventor
李斌
李厚强
杨海涛
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2013155828A1 publication Critical patent/WO2013155828A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/65Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
    • H04N19/68Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving the insertion of resynchronisation markers into the bitstream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/107Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/179Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scene or a shot
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present invention relates to image processing technologies, and in particular, to a video image code stream processing method and apparatus. Background technique
  • High Efficiency Video Coding (hereafter referred to as HEVC) is a new generation of video coding standards to meet the needs of the growing demand for video applications, especially for high definition video.
  • the goal of HEVC design is to save 50% code rate compared to previous generation video coding standards, such as Advanced Video Coding (AVC), when subjective effects are similar.
  • AVC Advanced Video Coding
  • PSNR Peak Signal Noise Ratio
  • the compression performance of HEVC relative to AVC is as follows: Under the test conditions of all intra-frame images, the code rate is saved by about 23%; The code rate savings are about 33% under test conditions that support random access; the code rate savings are about 41% under low latency test conditions.
  • the subjective performance improvement of HEVC is more prominent. The main reason is that the improved interframe prediction and loop filtering techniques in HEVC have an obvious effect on improving subjective performance.
  • Embodiments of the present invention provide a video image code stream processing method and device.
  • An embodiment of the present invention provides a video image code stream processing method, including:
  • a code stream generated by the encoding device is extracted, the code stream containing pure random access CRA image information, the CRA image information satisfying a constraint condition of the CRA image and the constraint condition includes the CRA image must be a reference image.
  • an embodiment of the present invention provides a decoding device, including:
  • An extracting module configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, where the CRA image information satisfies a constraint condition of the CRA image and The constraints include that the CRA image must be a reference image.
  • the constraint condition of the CRA image is improved, that is, the CRA image is forced to be a reference image based on the existing constraints on the CRA image.
  • the decoding device may determine whether the code stream includes a CRA image, and if yes, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if The reference image may determine that the code stream is a legal code stream, and the CRA image may be used to assist POC derivation of subsequent images, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist POC derivation of subsequent images. , thereby avoiding POC calculation anomalies and decoding anomalies.
  • An embodiment of the present invention provides another video image code stream processing method, including:
  • the POCMsb of the current image is equal to the POCMsb of the previous image
  • the POC of the current image is determined according to the POCMsb of the current image and the POCLsb of the current image.
  • an embodiment of the present invention provides a decoding device, including:
  • a first determining module configured to determine whether a picture sequence number POC of the sequentially decoded images is arranged in a non-descending order
  • the processing module is configured to determine POCsb of the current image by using the following method if it is arranged in non-descending order:
  • the POCMsb of the current image is equal to the POCMsb of the previous image;
  • a second determining module configured to determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
  • the decoding device may first determine whether POCs of sequentially decoded images are arranged in non-descending order; if they are arranged in non-descending order, the improved derivation manner may be used to determine POCMsb of the current image, that is, from the code stream.
  • the decoding device can determine the output order based on the POC of the current image and the POC of each image in the DPB.
  • the POCs of the sequentially decoded images are arranged in a non-descending order, which simplifies the derivation of the POCMsb, improves the processing efficiency, and, based on the improved derivation, can remove any two of the activated images.
  • the size of the POC difference of the image is limited, so that the bit width of POCLsb in the code stream can be reduced, thereby reducing the transmission overhead.
  • the active image refers to the current image and all the images in the DPB that are marked for short-term reference and the images to be output.
  • An embodiment of the present invention provides a video image code stream processing method, including:
  • the first type of syntax element refers to a syntax element including time layer quantity information
  • the second type of syntax element refers to a syntax element that includes code stream feature information of each time layer
  • the third type of syntax element refers to a syntax element that contains information about the size of the storage space occupied by each image
  • At least two types of syntax elements are extracted from the neighboring locations of the codestream.
  • an embodiment of the present invention provides a processing device, including:
  • An extraction module configured to extract a code stream generated by the encoding device, where the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second class syntax element and the third class syntax element Placed in a neighboring position in the code stream, the first type of syntax element refers to a syntax element including time layer quantity information, and the second type of syntax element refers to a code stream characteristic information including each time layer.
  • a syntax element, the third type of syntax element is meant to contain each picture a syntax element like the size of the storage space occupied;
  • an extracting module configured to extract at least two types of grammatical elements from the neighboring locations of the code stream.
  • the decoding device or the media gateway may extract a code stream generated by the encoding device, where the first type of syntax element and the second type of syntax element are placed in a neighboring position in the code stream, or The second type of syntax element and the third type of syntax element are placed in the adjacent position in the code stream. Therefore, the decoding device or the media gateway does not need to additionally parse other irrelevant syntax elements when extracting each time layer information, but All the syntax elements of each time layer information can be continuously extracted, thereby improving the processing efficiency of the time layer information.
  • An embodiment of the present invention provides a video image code stream processing method, including:
  • Extracting syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device where the syntax element max_dec_pic_buffering_diff[i] indicates that the decoding time layer identifier does not exceed i
  • the size of the decoded image buffer required by the substream is different from the size of the decoded image buffer required by the decoding time layer identifier not exceeding i-1
  • the syntax element num_reorder_pics_diff[i] represents the time layer. The difference between the number of inverted images in the substream that does not exceed i and the number of inverted images in the substream that does not exceed i-1;
  • MaxDecPicBuffering[i] and NumReorderPics[i] are obtained in the following way:
  • MaxDecPicBuffering[i] max— dec— pic— buffering— diff[i];
  • MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
  • NumReorderPics[i] NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required for the iB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
  • an embodiment of the present invention provides a decoding device, including: An extracting module, configured to extract syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, wherein the syntax element max_dec_pic- Buffering_diff[i] indicates the difference between the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i and the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i-1 Value, syntax element num_reorder_pics_diff[i] indicates the difference between the number of inverted images in the subcode stream whose time layer identifier does not exceed i and the number of inverted images in the subcode stream whose time layer identifier does not exceed i-1 value;
  • Differential decoding module if i is 0, obtain MaxDecPicBuffering[i] and NumReorderPics[i] in the following way:
  • MaxDecPicBuffering[i] max— dec— pic— buffering— diff[i];
  • MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
  • NumReorderPics[i] NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required for the iB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
  • the value of the syntax element indicating the size of the decoded image buffer required for the sub-code stream not exceeding i of the decoding time layer identifier and the number of inverted images in the sub-code stream indicating that the time layer identifier does not exceed i are utilized.
  • the values of the syntax elements are non-decreasing, and the two syntax elements are differentially encoded to reduce the bits required to represent the syntax elements in the SPS, thereby improving compression efficiency.
  • Embodiment 1 is a flowchart of Embodiment 1 of a video image code stream processing method according to the present invention
  • Embodiment 2 is a flowchart of Embodiment 2 of a video image code stream processing method according to the present invention
  • Embodiment 3 is a flowchart of Embodiment 3 of a video image code stream processing method according to the present invention.
  • Embodiment 4 is a flowchart of Embodiment 4 of a video image code stream processing method according to the present invention.
  • FIG. 5 is a schematic structural diagram of Embodiment 2 of a decoding device according to the present invention.
  • FIG. 6 is a schematic structural diagram of an embodiment of a processing device according to the present invention.
  • FIG. 7 is a schematic structural diagram of Embodiment 3 of a decoding device according to the present invention.
  • the technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention.
  • the embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
  • An embodiment of the present invention provides a video image code stream processing method, where the method includes: extracting a code stream generated by an encoding device, where the code stream includes Clean Random Access (CRA) image information,
  • CRA Clean Random Access
  • the CRA image information satisfies the constraints of the CRA image and the constraints include that the CRA image must be a reference image.
  • the technical solution of the embodiment improves the constraint condition of the CRA image on the basis of the existing HEVC standard document, that is, forcibly constrains the CRA image into a reference image based on the existing constraints on the CRA image, thereby making the
  • the CRA image can be used to assist the picture order number (Picture Order Count, hereinafter referred to as POC) derivation of the subsequent image, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist the POC derivation of the subsequent image, thereby avoiding the POC. Calculate exceptions and decode exceptions.
  • Embodiment 1 is a flowchart of Embodiment 1 of a video image code stream processing method according to the present invention. As shown in FIG. 1, the method in this embodiment may include:
  • Step 101 Extract a code stream generated by the encoding device, where the code stream includes CRA image information.
  • Step 102 Determine, according to a constraint condition of the CRA image, whether the code stream is a legal code stream. This constraint includes that the CRA image must be a reference image.
  • HEVC supports random access using both Instantaneous Decoding Refresh (IDR) images and CRA images.
  • IDR Instantaneous Decoding Refresh
  • a CRA image For a CRA image, it can only be an I-frame image; for an image whose decoding order and output order are after the CRA image, it is not allowed to use the decoding order or the output order to predict the image before the CRA image, and the decoding order is The image before the CRA image, whose output order is also before the CRA image. Therefore, the CRA image can be normally decoded without relying on other images, and the decoding operation of the image after the CRA image does not depend on any image before the CRA image.
  • the CRA image can be used as a starting point to decode a video sequence; and before the CRA image The image does not affect the output of the image after the CRA image, so the CRA image can be used as a starting point to output a video sequence.
  • the CRA image can be decoded and output as the first image in a video sequence.
  • the first image of a video sequence specified in HEVC can only be an IDR image or a CRA image, so if an image is neither an IDR image nor a CRA image, it must not be the first image in the video sequence.
  • each image it has a corresponding POC, which is used to indicate the temporal relationship between an image and other images in a Decoded Picture Buffer (DPB).
  • DPB Decoded Picture Buffer
  • the timing relationship specifies the output order of all images in the DPB.
  • images with a small POC number should be output before the image with a large POC number. Therefore, in deriving the POC of the current image, it is necessary to first determine the previous image of the current image, then acquire the POC information of the previous image and apply the POC information of the previous image to derive the POC of the current image.
  • the previous image refers to the previous image in the decoding order that satisfies the following two conditions: Condition 1, the syntax element of the image temporal_id is 0, and the syntax element temporal_id can be from the network abstraction layer unit (Network The Abstract Layer Unit (hereinafter referred to as NALU) is obtained from the header field (hereinafter referred to as header); Condition 2, the image is a reference image, and the syntax element nal_ref_flag in the NALU header can be used to determine whether an image is a reference.
  • Image in the existing HEVC standard document, nal_ref_flag of 1 indicates that the image is a reference image, and nal_ref_flag of 0 indicates that the image is not a reference image.
  • the first decoded image in a video sequence whether it is an IDR image or a CRA image, must satisfy both of the above conditions to ensure subsequent frames in the video sequence. Any image can find its corresponding previous image.
  • the existing HEVC standard document has a temporal-id for the CRA image, which must be 0, otherwise the code stream containing the CRA image is an illegal code stream.
  • the existing HEVC standard document does not limit its nal-ref_flag must be 1. In this case, if a CRA image is used as the first image of a video sequence, and the nal_ref_flag of the CRA image is 0, that is, the CRA image is not a reference image, then each of the CRA images The image cannot find its corresponding previous image, causing the POC calculation to be abnormal and eventually causing the decoding to be abnormal.
  • the present embodiment improves the constraint condition of the CRA image based on the existing HEVC standard document, that is, the constraint CRA image is forced to be a reference image based on the existing constraints on the CRA image.
  • the decoding device may extract the code stream generated by the encoding device, and then determine whether the code stream includes CRA image information, and if so, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if Referring to the image, the code stream can be considered as a legal code stream, which can be used as a POC derivation of the subsequent image, otherwise the code stream is an illegal code stream.
  • the decoding device can forcibly constrain the CRA image into a reference image, thereby performing fault tolerance processing on the illegal code stream.
  • the entire video stream is encapsulated in multiple NALUs, and each NALU can contain all or part of an image. Therefore, in the specific implementation of the embodiment, for the NALU corresponding to the CRA image, the syntax element nal_unit_type in the NALU header may be used to indicate that the information contained in the NALU belongs to the CRA image, and the NALU header is in the NALU header.
  • Another syntax element nal_ref_flag is 1, which can indicate that the CRA image is a reference image.
  • the decoding device After the decoding device extracts the NALU generated by the encoding device, it may determine, according to the syntax element nal_unit_type in the NALU header, whether the information included in the NALU belongs to the CRA image, and if it belongs to the CRA image, according to the NALU header.
  • Another syntax element nal_ref_flag determines whether the CRA image is a reference image, and further determines whether the code stream containing the NALU is a legal code stream.
  • the improvement of the CRA image is improved.
  • the beam condition that is, based on the existing constraints for the CRA image, forces the CRA image to be a reference image.
  • the decoding device may determine whether the code stream includes a CRA image, and if yes, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if The reference image may determine that the code stream is a legal code stream, and the CRA image may be used to assist POC derivation of subsequent images, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist POC derivation of subsequent images. , thereby avoiding POC calculation anomalies and decoding anomalies.
  • Embodiment 2 is a flowchart of Embodiment 2 of a video image code stream processing method according to the present invention. As shown in FIG. 2, the method in this embodiment may include:
  • Step 201 Determine whether POCs of sequentially decoded images are arranged in non-descending order; Step 202, if yes, determine POCMsb of the current image by using: obtaining POCLsb of the current image from the code stream;
  • the POCMsb of the current image is equal to the POCMsb of the previous image
  • Step 203 Determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
  • each image it has a corresponding POC, which is used to indicate the timing relationship between an image and other images in the DPB, and the timing relationship defines all the images in the DPB.
  • the order of the output In general, images with a small POC number should be output before the image with a large POC number. Therefore, in deriving the POC of the current image, it is necessary to first determine the previous image of the current image, then acquire the POC information of the previous image and apply the POC information of the previous image to derive the POC of the current image.
  • the POC of an image is determined by a combination of a Major Significant Bit (MSB) and a Least Significant Bit (LSB).
  • MSB Major Significant Bit
  • LSB Least Significant Bit
  • the MSB can be referred to as POCMsb
  • the LSB can be referred to as POCLsb.
  • the POC of the current image can be calculated as follows:
  • PicOrderCntVal PicOrderCntMsb + pic—order— cnt—lsb
  • pic_order_cnt_lsb is the POCLsb of the current image obtained from the code stream
  • PicOrderCntMsb is the POCMsb of the current image derived by the decoding device.
  • PicOrderCntVal is the POC of the current image.
  • prevPicOrderCntLsb the POCLsb of the previous image
  • prevPicOrderCntMsb the POCMsb of the previous image
  • prevPicOrderCntLsb and prevPicOrderCntMsb are both set to 0;
  • the previous image of the current image is determined, and POCMsb and POCLsb of the previous image are obtained as prevPicOrderCntLsb and prevPicOrderCntMsb, respectively, wherein the definition of the previous image has been described in the previous embodiment, and will not be described herein.
  • PicOrderCntMsb can be implemented using the operations represented by the pseudo code below:
  • PicOrderCntMsb prevPicOrderCntMsb + MaxPicOrderCntLsb else if((pic_order_cnt_lsb>prevPicOrderCntLsb)&&&
  • PicOrderCntMsb prevPicOrderCntMsb-MaxPicOrderCntLsb else
  • PicOrderCntMsb prevPicOrderCntMsb
  • the meanings of the above three branch languages are:
  • PicOrderCntMsb prevPicOrderCntMsb + MaxPicOrderCntLsb;
  • PicOrderCntMsb prevPicOrderCntMsb - MaxPicOrderCntLsb;
  • PicOrderCntMsb prevPicOrderCntMsb.
  • the effect of deriving PicOrderCntMsb according to the prior art is that the POC of the current image is larger than POC-MaxPicOrderCntLsb/2 of the previous image and less than or equal to POC+MaxPicOrderCntLsb/2 of the previous image.
  • POCLsb is represented by 4 bits, that is, MaxPicOrderCntLsb is 16, and adjacent three images A, B, and C arranged in order of output timing in the video bitstream are reference images and temporal_id is 0, these three
  • the decoding device determines the output order of the three images as follows:
  • the POCMsb of the image A is referred to as POCMsb(A), and the POCMsb(A) can be determined during the process of decoding the image A.
  • the decoding device determines the order in which the two images are output as follows:
  • the POCMsb of the image A is referred to as POCMsb(A), and the POCMsb(A) can be determined during the process of decoding the image A.
  • the POC difference between any two images in the existing HEVC limit activation image must be smaller than MaxPicOrderCntLsb/2.
  • the existing HEVC-defined POC derivation process applies to any coding structure, that is, allows both the POC of the current image to be larger than the POC of the previous image, and allows the POC of the current image to be POC of the previous image. small.
  • the second conditional branch corresponds to the case where the current image POCMsb is smaller than the previous image POCMsb.
  • the present embodiment improves the derivation process of PicOrderCntMsb for the POC of sequentially decoded images in non-descending order, and the specific pseudo code is expressed as follows: If( pic— order— cnt—lsb ⁇ prevPicOrderCntLsb )
  • PicOrderCntMsb pre vP ic OrderCntMsb + MaxPicOrderCntLsb else
  • PicOrderCntMsb pre vP ic OrderCntMsb
  • PicOrderCntMsb prevPicOrderCntMsb + MaxPicOrderCntLsb;
  • PicOrderCntMsb prevPicOrderCntMsb.
  • the value of the syntax element num_reorder_pics indicating the maximum number of timing inverted images in the Sequence Parameter Set (SPS) may be used to indicate whether the POC of the sequentially decoded image is not Descending order
  • the timing inverted image refers to an image in which the decoding order is before a certain image and the output order is after that image. For example, using num_reorder_pics to 0 means that the POCs of sequentially decoded images are arranged in non-descending order, and using num_reorder_pics greater than 0 means that the POCs of sequentially decoded images are not arranged in non-descending order.
  • the encoding of the syntax element num_reorder_pics has been modified in the fourth embodiment of the subsequent method.
  • the modified variable MaxDecPicBuffering has the same physical meaning as the syntax element num_reorder_pics before tampering. Therefore, under the condition of the fourth embodiment, it is determined whether the POCs of the sequentially decoded images are arranged in non-descending order according to the value of the variable MaxDecPicBuffering.
  • the decoding device may first determine whether the POCs of the sequentially decoded images are arranged in non-descending order according to the value of num_reorder_pics in the SPS, if it is determined that the images are sequentially decoded.
  • the POCs are arranged in non-descending order, and the PicOrderCntMsb of the current image can be obtained by using the improved derivation method of the embodiment, and the current current pic_order_cnt-lsb of the current image obtained from the code stream is obtained according to PicOrderCntMsb.
  • the PicOrderCntVal of the image thus based on the current image's PicOrderCntVal And the POC value of each image in the DPB to determine the output order of the current image and the DPB images; if it is determined that the POCs of the sequentially decoded images are not arranged in non-descending order, then , the actual implementation of the examples will not be described again. .
  • the present embodiment can also be modified to improve the existing HHEE VVCC standard document.
  • the PPOOCC for the image image of the image image of the successive de-decoded code is arranged according to the non-non-descending descending sequence, and the large and small values of the PPOOCC difference value are removed.
  • the constraint bundle its good advantage lies in the fact that it can reduce the width of the picicc - oorrddeerr - ccnntt - llssbb, from which it can be reduced, transmitted, transmitted Lose the overhead. .
  • the de-code decoding device may be configured to first determine whether the PPOOCC of the image image of the de-decoded code is determined according to the non-non-descending descending order. Arrange the columns; if the results are arranged in a non-descending descending order, then the derivation method can be used to determine the 1155 when the current PPOOCCMMssbb of the image of the previous image, that is, the PPOOCC that is obtained from the code stream is taken as the image of the current image of the previous image is not more important than the bit bit PPOOCCLLssbb, as described
  • the PPOOCCLLssbb of the current image of the previous image is smaller than the PPOOCCLLssbb of the previous image of the previous image, then the PPOOCCMMssbb of the current image of the previous image is equal to the PPOOCCMMssbb of the image of the previous image before and after The sum of the largest PPOOCCLLssbb
  • the deciphering code setting 2200 can be determined to be determined according to the PPOOCC according to the image of the current front image and the PPOOCC of each image image of the DDPPBB. In order. .
  • the embodiment of the present embodiment is simplified for the case where the PPOOCCs of the image images of the successively decoded codes are arranged in a non-descending descending order.
  • the push-guided mode of PPOOCCMMssbb is improved, and the rate of efficiency of processing is improved.
  • it may be In order to remove the limit system of the PPOOCC difference value of the image of any two or two images of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image.
  • the bit width and width of the PPOOCCLLssbb in the less code stream are reduced, and then the small transmission transmission and the overhead pin are reduced. .
  • FIG. 33 is a flow chart of the third embodiment of the embodiment of the method for processing the video stream of the video of the clear video video image of the present invention, as shown in FIG. 33. It should be noted that the method of the embodiment of the present invention may be included in the package:
  • Step 330011 extracting the code stream of the code-coding device, and among them, the first class of grammatical elements and the second class
  • the grammatical element element placement is placed adjacent to the nearest neighbor position in the code stream, or the second second class grammatical element element and And the third third class grammatical element element placement is placed in the adjacent neighboring near position in the code stream, and the first class is
  • the grammatical element element refers to the grammatical element element of the packet 3300 containing the amount of information of the number of layers in the time period
  • the second categorical grammatical element of the second class Elemental means that the package contains layers of time between each a syntax element of the code stream feature information, where the third class syntax element refers to a syntax element containing information about the size of the storage space occupied by each image;
  • Step 302 Extract at least two types of syntax elements from the neighboring locations of the code stream.
  • the capabilities of the decoding device are limited.
  • the decoding device needs to determine whether the decoding capability required by the code stream is within the range of decoding capabilities supported by the code stream. If yes, the decoding device can normally receive the code stream and decode it. h The decoding device will perform exception handling.
  • the decoding capability can include restrictions on the DPB. Such restrictions may include: Limitations on the number of images stored in the DPB, and limitations on the storage space occupied by all images in the DPB.
  • the storage space occupied by all the images in the DPB can be calculated by the relevant parameters indicating the number of images stored in the DPB and the relevant parameters of the storage space occupied by each image, wherein the syntax elements indicating the storage space occupied by each image can be Including: related parameters indicating the image size, such as the width and height of the image; image cropping parameters; the bit width of each chroma component of each sample value, and the like.
  • the code is the entire code stream, and the other is to extract and decode only part of the time layer code stream that can be supported while discarding the time layer code stream that the decoding device cannot support, in the case where the code stream contains several different time layers.
  • the above-mentioned operation of extracting the code stream by the time layer can also be performed in the media gateway.
  • the decoding device such as the decoder in the client, can communicate with the media gateway to inform the media gateway of its own decoding capabilities.
  • the media gateway can select not to transmit the code stream to the decoder by parsing part of the information in the SPS to find that the decoding capability of the current code stream exceeds the decoding capability supported by the decoding device.
  • the media gateway parses part of the information in the SPS, it is found that although the decoding capability required for the complete stream exceeds the decoding capability supported by the decoding device, the decoding capability required for the partial time layer stream is supported by the decoding device.
  • the media gateway may choose to discard part of the time layer code stream, and only send the remaining part of the layer layer code stream to the decoding device to save transmission bandwidth.
  • the premise of the above-mentioned decoding device or media gateway performing the temporal layer stream extraction operation is to obtain relevant information by decoding some syntax elements in the SPS, thereby determining the time layer code stream that the decoding device can support.
  • the syntax elements related to the temporal layer stream extraction operation can be divided into three categories.
  • the first type of syntax element, the second type of syntax element, and the third type of syntax element are respectively recorded.
  • the first type of syntax element refers to a syntax element that includes time-level quantity information, such as a syntax element max_temporal-layer-minus 1;
  • the second type of syntax element refers to a code stream characteristic information including each time layer.
  • syntax elements such as the syntax elements max-dec-pic-buffering[i], num-reorder_pics[i], and max-latency-increment [i], where the syntax element max_dec_pic_buffering[i] represents decoding all Temporal—the size of the decoded image buffer required for the substream that does not exceed i, the syntax element num_reorder_pics[i] is expressed as the number of inverted images in the substream of temporal_id not exceeding i, i is the time layer
  • the third type of syntax element refers to a syntax element that contains information about the size of the storage space occupied by each image, such as the syntax element pic_width_in-luma-samples representing the width of the image and the pic-height representing the height of the image.
  • the syntax element for image cropping information pic- cropping_flag, pic-crop-left-offset, pic-crop-right- Offset , pic — crop — top — offset and pic — crop — bottom — offset , and the syntax element for each chrominance component bit width of each sample value — depth — luma — minus 8 and bit — depth — chroma — minus .
  • each time layer has its own parameters, which are passed in the SPS.
  • the SPS can include the following syntax elements:
  • the first type of syntax elements, the second type of syntax elements, and the third type of syntax elements are located far apart in the SPS.
  • the syntax element max_tempo_layer- minus 1 indicating the number of time layers and the syntax elements max_dec_pic_buffering[i], num_reorder_pics [i] indicating the code stream feature information of each time layer are Max- latency-increase [i] is far apart, and there are more than 4 syntax elements in the middle that are independent of the decoding device or media gateway to extract the time-layer code stream operation, such as pcm-enabled-flag.
  • the decoding device or the media gateway extracts each time layer information to complete the time layer stream extraction operation, and needs to additionally parse these irrelevant syntax elements, which reduces the processing efficiency of the decoding device or the media gateway.
  • the encoding device may place the first type of syntax element and the second type of syntax element in a neighboring position in the code stream, or place the second type of syntax element and the third type of syntax element in a vicinity of the code stream on.
  • the decoding device or the media gateway generates the extracted encoding device After the code stream, at least two types of syntax elements can be extracted from a neighboring position in the code stream.
  • the manner in which the above syntax elements are placed in the code stream can be implemented in at least the following four ways:
  • the first way is to keep the position of the third type of syntax element unchanged, and adjust the position of the first type of syntax element in the existing SPS, that is, the first type of syntax element is placed before the second type of syntax element;
  • the positional relationship of each syntax element in the SPS obtained by the method is as follows:
  • Seq parameter set rbsp( ) ⁇ pic— width— in— luma— samples
  • the second way is to keep the position of the third type of syntax element and adjust the position of the second type of syntax element, that is, the second type of syntax element is placed after the first type of syntax element.
  • the third way is to keep the position of the third type of syntax element unchanged, and adjust the first type of syntax element and the second type of syntax element to the middle position, such as adjusting the first type of syntax element and the second type of syntax element to After the third type of syntax element.
  • Seq parameter set rbsp( ) ⁇ pic— width— in— luma— samples
  • the fourth way is to place the third type of syntax element after the first type of syntax element and the second type of syntax element after the third type of syntax element.
  • the decoding device or the media gateway may extract a code stream generated by the encoding device, where the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second The class syntax element and the third class syntax element are placed in the adjacent position in the code stream. Therefore, the decoding device or the media gateway does not need to additionally parse other irrelevant syntax elements when extracting each time layer information, but may continuously All the syntax elements of each time layer information are extracted, thereby improving the processing efficiency of the time layer information.
  • Embodiment 4 is a flowchart of Embodiment 4 of a video image code stream processing method according to the present invention. As shown in FIG. 4, the method in this embodiment may include:
  • Step 401 Extract syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, where the syntax element max_dec_pic_buffering- Diff[i] represents the difference between the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i and the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i-1, The syntax element num_reorder_pics_diff[i] represents the difference between the number of inverted images in the subcode stream whose time layer identifier does not exceed i and the number of inverted images in the subcode stream whose time layer identifier does not exceed i-1;
  • Step 403 If i is greater than 0, obtain MaxDecPicBuffering[i] and NumReorderPics[i] in the following manner:
  • NumReorderPics[i] NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] indicates that the DPB of the Hypothetical Reference Decoder (HRD) is for the i-th time-layer code.
  • HRD Hypothetical Reference Decoder
  • the size of the storage space required by the stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer stream.
  • the i-th time-layer code stream has the same meaning as the sub-code stream whose time-layer identifier does not exceed i, and refers to the sub-code stream obtained by removing all NALUs whose temporal-id is greater than i in the original code stream.
  • the syntax element max_dec_pic_buffering[i] represents the size of the decoded image buffer required to decode all sub-streams whose temporal-id does not exceed i, and therefore, the syntax element max_dec The value of – pic—buffering ⁇ ] is non-decreasing, ie max—dec_pic—buffering[i] ⁇ max—dec—pic—buffering[il].
  • the syntax element num_reorder_pics [i] indicates that the temporal_id does not exceed the number of inverted images in the substream of i, so the value of the syntax element num_reorder_pics[i] is also non-decreasing.
  • the value of the syntax element max_dec_pic_buffering [i] and the value of the syntax element num_reorder_pics[i] are both non-subtractive, and the two syntax elements are differentially encoded to reduce the representation.
  • the syntax elements max- dec_pic- buffering in the SPS and the bits required by the syntax element num_reorder_pics improve the compression efficiency.
  • the specific implementation may be:
  • MaxDecPicBuffering[i] Use the variable MaxDecPicBuffering[i] to indicate the size of the storage space required by the DPB of the HRD for the i-th time-layer code stream.
  • the size of the storage space is based on the size of the image storage space.
  • the unit calculation, for example, the MaxDecPicBuffering[i] 2 indicates the amount of storage space required for the DPB storage space to be 2 images.
  • the variable NumReorderPics[i] is used to represent the maximum number of time-series inverted images in the i-th temporal layer code stream.
  • MaxDecPicBuffering[i] max— dec_pic— buffering— diff[i]
  • NumReorderPics[i] NumReorderPics[i-1]+num_reorder_pics_diff [i]
  • the value and the presentation time of the syntax element indicating the size of the decoded image buffer required for the sub-code stream not exceeding i are represented by the decoding time layer identifier.
  • the value of the syntax element whose layer identifier does not exceed the number of inverted images in the sub-stream of i is non-decreasing, and the two syntax elements are differentially encoded to reduce the bits required to represent the syntax elements in the SPS. Thereby improving the compression efficiency.
  • the decoding device may include: an extracting module, configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, and the CRA image information satisfies CRA Constraints of the image and the constraints include that the CRA image must be a reference image.
  • an extracting module configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, and the CRA image information satisfies CRA Constraints of the image and the constraints include that the CRA image must be a reference image.
  • the extracting module is specifically configured to extract a network abstraction layer unit NALU generated by the encoding device, where a syntax element nal_unit_type in the header field of the NALU indicates that the NALU includes CRA image information, where The constraint includes: the syntax element nal_ref_flag in the header field of the NALU must be 1.
  • the decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 1.
  • the principle and technical effects are similar, and details are not described herein again.
  • FIG. 5 is a schematic structural diagram of Embodiment 2 of a decoding device according to the present invention.
  • the decoding device in this embodiment may include: a first determining module 21, a processing module 22, and a second determining module 23, where a determining module 21, configured to determine whether a picture sequence number POC of the sequentially decoded images in the active image is arranged in a non-descending order, the active image including the current image and all images in the decoded image buffer DPB that are marked for short-term reference and waiting for output of An image processing module 22, configured to determine a POC important bit POCMsb of the current image by using a method of: following a non-descending order: obtaining a POC non-significant bit POCLsb of the current image from the code stream; If the POCLsb of the image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of the POCMsb of the previous image and the maximum POCLsb, wherein the maximum POCLsb
  • the first determining module 21 is specifically configured to determine, according to the value of the syntax element num_reorder_pics indicating the maximum number of time-stamped images in the sequence parameter set SPS, whether the POC of the sequentially decoded image in the activated image is Arranged in non-descending order.
  • the decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 2, and the principle and the technical effect are similar, and details are not described herein again.
  • FIG. 6 is a schematic structural diagram of an embodiment of a processing device according to the present invention.
  • the device in this embodiment may include: an extracting module 31, and an extracting module 32, where the extracting module 31 is configured to extract a code generated by the encoding device.
  • the first type of syntax element refers to a syntax element that includes time layer quantity information
  • the second type of syntax element refers to a syntax element that includes code stream feature information of each time layer
  • the third type of syntax element is Refers to a syntax element that contains information on the size of the storage space occupied by each image; an extraction module 32 is configured to extract at least two types of syntax elements from the neighboring locations of the code stream.
  • the positions of the second type of syntax element and the third type of syntax element in the code stream are unchanged, and the first type of syntax element is placed before the second type of syntax element; or, the The position of a class of syntax elements and a class of class 3 syntax elements in the code stream is unchanged, the second class of syntax elements are placed after the first class of syntax elements; or the third class of syntax elements are The position in the code stream is unchanged, the first class syntax element and the second class syntax element are placed in an intermediate position of the code stream; or the third class syntax element is placed in the first class syntax element Thereafter, the second type of syntax element is placed after the third type of syntax element.
  • the processing device of this embodiment may be a decoding device, such as decoding in a terminal device. It can also be a media gateway.
  • the processing device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 3, and the principle and the technical effect are similar, and details are not described herein again.
  • FIG. 7 is a schematic structural diagram of Embodiment 3 of a decoding device according to the present invention.
  • the device in this embodiment may include: an extracting module 41 and a differential decoding module 42, where the extracting module 41 is configured to extract an encoding device.
  • syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream where the syntax element max_dec_pic_buffering_diff[i] represents the decoding time
  • syntax element num_reorder_pics – diff[i] represents the difference between the number of inverted pictures in the sub-code stream whose time layer identifier does not exceed i and the number of inverted pictures in the sub-code stream whose time layer identifier does not exceed i-1
  • differential decoding module 42 for If i is 0, Max get MaxDecPicBuffering[i] and NumReorderPics[i] in the following way:
  • MaxDecPicBuffering[i] max— dec— pic— buffering— diff[i];
  • MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
  • NumReorderPics[i] NumReorderPics [i-1] + num_reorder_p ic s_diff [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required by the DPB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics[ i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
  • the i-th time-layer code stream has the same meaning as the sub-code stream whose time-layer identifier does not exceed i, and refers to the sub-code stream obtained by removing all NALUs whose temporal-id is greater than i in the original code stream.
  • the decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 4, and the principle and the technical effect are similar, and details are not described herein again.
  • the aforementioned program can be stored in a computer readable storage medium.
  • the foregoing storage medium includes: a medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Embodiments of the present invention provide a video image code stream processing method and device. The video image code stream processing method comprises: extracting a code stream generated by an encoding device. The code stream comprises clean random access CRA image information. The CRA image information meets a constraint condition of a CRA image, and the constraint condition comprises that the CRA image has to be a reference image. According to the embodiments of the present invention, on the basis of an HEVC standard draft in the prior art, the constraint condition of the CRA image is improved, that is, on the basis of the constraint condition for the CRA image in the prior art, the CRA image is compulsively constrained as the reference image, so as to avoid calculation abnormity and decoding abnormity of the POC.

Description

视频图像码流处理方法和设备 技术领域 本发明实施例涉及图像处理技术, 尤其涉及一种视频图像码流处理方 法和设备。 背景技术  The present invention relates to image processing technologies, and in particular, to a video image code stream processing method and apparatus. Background technique
高性能视频编码 ( High Efficiency Video Coding, 以下简称: HEVC ) , 是为满足日益增长的视频应用需求, 特别是高清视频的应用需求而制定的 新一代视频编码标准。  High Efficiency Video Coding (hereafter referred to as HEVC) is a new generation of video coding standards to meet the needs of the growing demand for video applications, especially for high definition video.
HEVC设计的目标是在主观效果相似的情况下, 相比较于上一代视频 编码标准, 如先进视频编码(Advanced Video Coding, 以下简称: AVC ) , 节省 50%的码率。 当釆用基于峰值信噪比 ( Peak Signal Noise Ratio, 以下 简称: PSNR ) 的客观评价准则时, HEVC相对于 AVC压缩性能如下: 在 全部帧内图像的测试条件下, 码率节省约 23%; 在支持随机访问的测试条 件下, 码率节省约 33%; 在低延迟的测试条件下, 码率节省约 41%。 相比 于 HEVC的客观性能提升, HEVC的主观性能提升更加突出, 主要原因是 HEVC中改进的帧间预测和环路滤波技术对提升主观性能的作用明显。  The goal of HEVC design is to save 50% code rate compared to previous generation video coding standards, such as Advanced Video Coding (AVC), when subjective effects are similar. When using the objective evaluation criteria based on Peak Signal Noise Ratio (PSNR), the compression performance of HEVC relative to AVC is as follows: Under the test conditions of all intra-frame images, the code rate is saved by about 23%; The code rate savings are about 33% under test conditions that support random access; the code rate savings are about 41% under low latency test conditions. Compared with the objective performance improvement of HEVC, the subjective performance improvement of HEVC is more prominent. The main reason is that the improved interframe prediction and loop filtering techniques in HEVC have an obvious effect on improving subjective performance.
由于 HEVC优越的性能, 可以预见, 在不远的将来, HEVC会成为新 一代非常成功的视频编码标准。 发明内容  Due to the superior performance of HEVC, it is foreseeable that HEVC will become a very successful video coding standard for the new generation in the near future. Summary of the invention
本发明实施例提供一种视频图像码流处理方法和设备。  Embodiments of the present invention provide a video image code stream processing method and device.
本发明实施例提供一种视频图像码流处理方法, 包括:  An embodiment of the present invention provides a video image code stream processing method, including:
提取编码设备产生的码流, 所述码流中包含纯净随机访问 CRA 图像 信息, 所述 CRA图像信息满足 CRA图像的约束条件且所述约束条件包括 CRA图像必须为参考图像。  A code stream generated by the encoding device is extracted, the code stream containing pure random access CRA image information, the CRA image information satisfying a constraint condition of the CRA image and the constraint condition includes the CRA image must be a reference image.
相应地, 本发明实施例提供一种解码设备, 包括:  Correspondingly, an embodiment of the present invention provides a decoding device, including:
提取模块, 用于提取编码设备产生的码流, 所述码流中包含纯净随机 访问 CRA图像信息, 所述 CRA图像信息满足 CRA图像的约束条件且所 述约束条件包括 CRA图像必须为参考图像。 An extracting module, configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, where the CRA image information satisfies a constraint condition of the CRA image and The constraints include that the CRA image must be a reference image.
本发明上述实施例, 在现有 HEVC标准文稿的基础上, 改进了 CRA 图像的约束条件, 即在现有针对 CRA 图像的约束条件的基础上, 强制约 束 CRA 图像为参考图像。 解码设备在提取编码设备生成的码流后, 可以 确定该码流是否包含 CRA图像, 若包含, 则可以根据 CRA图像的约束条 件判断该码流中该 CRA图像的属性是否为参考图像, 如果是参考图像, 则可以确定该码流为合法码流, 而该 CRA图像可以用于辅助后续图像的 POC推导, 否则该码流为非法码流, 而该 CRA图像不能用于辅助后续图 像的 POC推导, 从而避免 POC计算异常以及解码异常。  According to the above embodiment of the present invention, on the basis of the existing HEVC standard document, the constraint condition of the CRA image is improved, that is, the CRA image is forced to be a reference image based on the existing constraints on the CRA image. After extracting the code stream generated by the encoding device, the decoding device may determine whether the code stream includes a CRA image, and if yes, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if The reference image may determine that the code stream is a legal code stream, and the CRA image may be used to assist POC derivation of subsequent images, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist POC derivation of subsequent images. , thereby avoiding POC calculation anomalies and decoding anomalies.
本发明实施例提供另一种视频图像码流处理方法, 包括:  An embodiment of the present invention provides another video image code stream processing method, including:
确定依次解码的图像的图片顺序编号 POC是否按照非降序排列; 若是, 则釆用如下方法确定所述当前图像的 POC 重要比特位 POCMsb:  Determining whether the picture sequence number POC of the sequentially decoded pictures is arranged in non-descending order; if so, determining the POC important bits POCMsb of the current picture by the following method:
从码流中获取当前图像的 POC非重要比特位 POCLsb;  Obtaining a POC non-significant bit of the current image from the code stream POCLsb;
如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像 的 POCMsb 等于之前图像的 POCMsb 与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽;  If the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of POCMsb and the maximum POCLsb of the previous image, wherein the maximum POCLsb=2N, where N is the bit width of the POCLsb;
否则, 所述当前图像的 POCMsb等于之前图像的 POCMsb;  Otherwise, the POCMsb of the current image is equal to the POCMsb of the previous image;
根据所述当前图像的 POCMsb以及所述当前图像的 POCLsb, 确定所 述当前图像的 POC。  The POC of the current image is determined according to the POCMsb of the current image and the POCLsb of the current image.
相应地, 本发明实施例提供一种解码设备, 包括:  Correspondingly, an embodiment of the present invention provides a decoding device, including:
第一确定模块, 用于确定依次解码的图像的图片顺序编号 POC是否 按照非降序排列;  a first determining module, configured to determine whether a picture sequence number POC of the sequentially decoded images is arranged in a non-descending order;
处理模块, 用于若是按照非降序排列, 则釆用如下方法确定所述当前 图像的 POC重要比特位 POCMsb:  The processing module is configured to determine POCsb of the current image by using the following method if it is arranged in non-descending order:
从码流中获取当前图像的 POC非重要比特位 POCLsb;  Obtaining a POC non-significant bit of the current image from the code stream POCLsb;
如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像 的 POCMsb 等于之前图像的 POCMsb 与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽;  If the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of POCMsb and the maximum POCLsb of the previous image, wherein the maximum POCLsb=2N, where N is the bit width of the POCLsb;
否则, 所述当前图像的 POCMsb等于之前图像的 POCMsb; 第二确定模块,用于根据所述当前图像的 POCMsb以及所述当前图像 的 POCLsb, 确定所述当前图像的 POC。 Otherwise, the POCMsb of the current image is equal to the POCMsb of the previous image; And a second determining module, configured to determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
本发明上述实施例中, 解码设备可以先确定依次解码的图像的 POC 是否按照非降序排列; 如果是按照非降序排列的, 则可以釆用改进的推导 方式确定当前图像的 POCMsb, 即从码流中获取当前图像的 POC非重要 比特位 POCLsb, 如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像的 POCMsb等于之前图像的 POCMsb与最大 POCLsb之和,否 则, 当前图像的 POCMsb等于 prevPicOrderCntMsb; 最后再根据当前图像 的 POCMsb以及当前图像的 POCLsb, 确定当前图像的 POC。 因此, 解码 设备可以根据当前图像的 POC以及 DPB 中各幅图像的 POC确定输出顺 序。 因此, 本发明实施例针对依次解码的图像的 POC按照非降序排列的 情况, 简化了 POCMsb的推导方式, 提高了处理效率, 而且, 基于该改进 的推导方式, 可以去除对激活图像中任意两幅图像的 POC 差值的大小的 限制, 从而可以减少码流中 POCLsb的位宽, 进而减小传输开销。 其中, 激活图像是指当前图像以及 DPB 中所有被标记用作短期参考的图像和等 待输出的图像。  In the above embodiment of the present invention, the decoding device may first determine whether POCs of sequentially decoded images are arranged in non-descending order; if they are arranged in non-descending order, the improved derivation manner may be used to determine POCMsb of the current image, that is, from the code stream. Obtaining a POC non-significant bit POCLsb of the current image, if the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of the POCMsb of the previous image and the maximum POCLsb, otherwise, the POCMsb of the current image is equal to the prevPicOrderCntMsb; Finally, the POC of the current image is determined according to the POCMsb of the current image and the POCLsb of the current image. Therefore, the decoding device can determine the output order based on the POC of the current image and the POC of each image in the DPB. Therefore, in the embodiment of the present invention, the POCs of the sequentially decoded images are arranged in a non-descending order, which simplifies the derivation of the POCMsb, improves the processing efficiency, and, based on the improved derivation, can remove any two of the activated images. The size of the POC difference of the image is limited, so that the bit width of POCLsb in the code stream can be reduced, thereby reducing the transmission overhead. Among them, the active image refers to the current image and all the images in the DPB that are marked for short-term reference and the images to be output.
本发明实施例提供再一种视频图像码流处理方法, 包括:  An embodiment of the present invention provides a video image code stream processing method, including:
提取编码设备产生的码流, 其中, 第一类语法元素与第二类语法元素 放置在所述码流中的邻近位置上, 或者第二类语法元素与第三类语法元素 放置在所述码流中的邻近位置上, 所述第一类语法元素是指包含时间层数 量信息的语法元素, 所述第二类语法元素是指包含每个时间层的码流特征 信息的语法元素, 所述第三类语法元素是指包含每一幅图像所占用存储空 间大小信息的语法元素;  Extracting a code stream generated by the encoding device, wherein the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second class syntax element and the third class syntax element are placed in the code In a neighboring position in the stream, the first type of syntax element refers to a syntax element including time layer quantity information, and the second type of syntax element refers to a syntax element that includes code stream feature information of each time layer, The third type of syntax element refers to a syntax element that contains information about the size of the storage space occupied by each image;
从所述码流的所述邻近位置上抽取至少两类语法元素。  At least two types of syntax elements are extracted from the neighboring locations of the codestream.
相应地, 本发明实施例提供一种处理设备, 包括:  Correspondingly, an embodiment of the present invention provides a processing device, including:
提取模块, 用于提取编码设备产生的码流, 其中, 第一类语法元素与 第二类语法元素放置在所述码流中的邻近位置上, 或者第二类语法元素与 第三类语法元素放置在所述码流中的邻近位置上, 所述第一类语法元素是 指包含时间层数量信息的语法元素, 所述第二类语法元素是指包含每个时 间层的码流特征信息的语法元素, 所述第三类语法元素是指包含每一幅图 像所占用存储空间大小信息的语法元素; An extraction module, configured to extract a code stream generated by the encoding device, where the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second class syntax element and the third class syntax element Placed in a neighboring position in the code stream, the first type of syntax element refers to a syntax element including time layer quantity information, and the second type of syntax element refers to a code stream characteristic information including each time layer. a syntax element, the third type of syntax element is meant to contain each picture a syntax element like the size of the storage space occupied;
抽取模块, 用于从所述码流的所述邻近位置上抽取至少两类语法元 素。  And an extracting module, configured to extract at least two types of grammatical elements from the neighboring locations of the code stream.
本发明上述实施例中, 解码设备或者媒体网关可以提取编码设备生成 的码流, 该码流中, 第一类语法元素与第二类语法元素放置在所述码流中 的邻近位置上, 或者第二类语法元素与第三类语法元素放置在所述码流中 的邻近位置上, 因此, 解码设备或者媒体网关在抽取每个时间层信息时, 无需额外解析其他无关的语法元素, 而是可以连续地将每个时间层信息的 全部语法元素都抽取出来, 从而提高了时间层信息的处理效率。  In the foregoing embodiment of the present invention, the decoding device or the media gateway may extract a code stream generated by the encoding device, where the first type of syntax element and the second type of syntax element are placed in a neighboring position in the code stream, or The second type of syntax element and the third type of syntax element are placed in the adjacent position in the code stream. Therefore, the decoding device or the media gateway does not need to additionally parse other irrelevant syntax elements when extracting each time layer information, but All the syntax elements of each time layer information can be continuously extracted, thereby improving the processing efficiency of the time layer information.
本发明实施例提供又一种视频图像码流处理方法, 包括:  An embodiment of the present invention provides a video image code stream processing method, including:
提取编码设备产生的码流中的语法元素 max_dec_pic_buffering_diff[i] 和 num— reorder_pics— diff[i] , 其中,语法元素 max— dec— pic— buffering— diff[i] 表示解码时间层标识不超过 i的子码流所需要的解码图像緩存的大小与解 码时间层标识不超过 i-1 的子码流所需要的解码图像緩存的大小的差值, 语法元素 num— reorder_pics— diff[i]表示时间层标识不超过 i 的子码流中倒 置图像的数量与时间层标识不超过 i-1 的子码流中倒置图像的数量的差 值;  Extracting syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, where the syntax element max_dec_pic_buffering_diff[i] indicates that the decoding time layer identifier does not exceed i The size of the decoded image buffer required by the substream is different from the size of the decoded image buffer required by the decoding time layer identifier not exceeding i-1, and the syntax element num_reorder_pics_diff[i] represents the time layer. The difference between the number of inverted images in the substream that does not exceed i and the number of inverted images in the substream that does not exceed i-1;
若 i 为 0 , 釆用 下 述方 式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is 0, then MaxDecPicBuffering[i] and NumReorderPics[i] are obtained in the following way:
MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];  MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];
NumReorderPics[i] = num_reorder_pics_diff [i];  NumReorderPics[i] = num_reorder_pics_diff [i];
若 i 大于 0 , 釆用 下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is greater than 0, use MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff[i];  +max— dec— pic— buffering— diff[i];
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_p ic s_dif f [i] ; 其中, MaxDecPicBuffering[i]表示假设参考解码器 HRD的 DPB针对 第 i个时间层码流所要求的存储空间大小, NumReorderPics[i]表示第 i个 时间层码流中的时序倒置图像的最大数量。  NumReorderPics[i] = NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required for the iB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
相应地, 本发明实施例提供一种解码设备, 包括: 提取模块, 用 于提取编码设备产生的码流中 的语法元素 max— dec— pic— buffering— diff[i]和 num— reorder— pics— diff[i] , 其中, 语法元素 max— dec— pic— buffering— diff[i]表示解码时间层标识不超过 i 的子码流所需 要的解码图像緩存的大小与解码时间层标识不超过 i-1 的子码流所需要的 解码图像緩存的大小的差值, 语法元素 num— reorder— pics— diff[i]表示时间 层标识不超过 i的子码流中倒置图像的数量与时间层标识不超过 i-1 的子 码流中倒置图像的数量的差值; Correspondingly, an embodiment of the present invention provides a decoding device, including: An extracting module, configured to extract syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, wherein the syntax element max_dec_pic- Buffering_diff[i] indicates the difference between the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i and the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i-1 Value, syntax element num_reorder_pics_diff[i] indicates the difference between the number of inverted images in the subcode stream whose time layer identifier does not exceed i and the number of inverted images in the subcode stream whose time layer identifier does not exceed i-1 value;
差分解码模块,用于若 i为 0,釆用下述方式得到 MaxDecPicBuffering[i] 和 NumReorderPics[i]:  Differential decoding module, if i is 0, obtain MaxDecPicBuffering[i] and NumReorderPics[i] in the following way:
MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];  MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];
NumReorderPics[i] = num_reorder_pics_diff [i];  NumReorderPics[i] = num_reorder_pics_diff [i];
若 i 大于 0 , 釆用 下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is greater than 0, use MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff[i];  +max— dec— pic— buffering— diff[i];
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_p ic s_dif f [i] ; 其中, MaxDecPicBuffering[i]表示假设参考解码器 HRD的 DPB针对 第 i个时间层码流所要求的存储空间大小, NumReorderPics[i]表示第 i个 时间层码流中的时序倒置图像的最大数量。  NumReorderPics[i] = NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required for the iB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
本发明上述实施例中, 利用表示解码时间层标识不超过 i的子码流所 需要的解码图像緩存的大小的语法元素的值和表示时间层标识不超过 i的 子码流中倒置图像的数量的语法元素的值均是非减的这一特性, 对这两个 语法元素进行差分编码, 以减少表示 SPS中的语法元素所需要的比特, 从 而提高压缩效率。 附图说明  In the above embodiment of the present invention, the value of the syntax element indicating the size of the decoded image buffer required for the sub-code stream not exceeding i of the decoding time layer identifier and the number of inverted images in the sub-code stream indicating that the time layer identifier does not exceed i are utilized. The values of the syntax elements are non-decreasing, and the two syntax elements are differentially encoded to reduce the bits required to represent the syntax elements in the SPS, thereby improving compression efficiency. DRAWINGS
实施例或现有技术描述中所需要使用的附图作一简单地介绍, 显而易见 地, 下面描述中的附图是本发明的一些实施例, 对于本领域普通技术人员 来讲, 在不付出创造性劳动性的前提下, 还可以根据这些附图获得其他的 附图。 The drawings used in the embodiments or the description of the prior art are briefly described. It is obvious that the drawings in the following description are some embodiments of the present invention, and are not creative to those skilled in the art. Under the premise of labor, you can also obtain other The figure.
图 1为本发明视频图像码流处理方法实施例一的流程图;  1 is a flowchart of Embodiment 1 of a video image code stream processing method according to the present invention;
图 2为本发明视频图像码流处理方法实施例二的流程图;  2 is a flowchart of Embodiment 2 of a video image code stream processing method according to the present invention;
图 3为本发明视频图像码流处理方法实施例三的流程图;  3 is a flowchart of Embodiment 3 of a video image code stream processing method according to the present invention;
图 4为本发明视频图像码流处理方法实施例四的流程图;  4 is a flowchart of Embodiment 4 of a video image code stream processing method according to the present invention;
图 5为本发明解码设备实施例二的结构示意图;  FIG. 5 is a schematic structural diagram of Embodiment 2 of a decoding device according to the present invention;
图 6为本发明处理设备实施例的结构示意图;  6 is a schematic structural diagram of an embodiment of a processing device according to the present invention;
图 7为本发明解码设备实施例三的结构示意图。 具体实施方式 为使本发明实施例的目的、 技术方案和优点更加清楚, 下面将结合本 发明实施例中的附图, 对本发明实施例中的技术方案进行清楚、 完整地描 述,显然, 所描述的实施例是本发明一部分实施例, 而不是全部的实施例。 基于本发明中的实施例, 本领域普通技术人员在没有作出创造性劳动前提 下所获得的所有其他实施例, 都属于本发明保护的范围。  FIG. 7 is a schematic structural diagram of Embodiment 3 of a decoding device according to the present invention. The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. The embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
本发明实施例提供一种视频图像码流处理方法, 该方法包括: 提取编码设备产生的码流, 所述码流中包含纯净随机访问 (Clean Random Access, 以下简称: CRA ) 图像信息, 所述 CRA 图像信息满足 CRA图像的约束条件且所述约束条件包括 CRA图像必须为参考图像。  An embodiment of the present invention provides a video image code stream processing method, where the method includes: extracting a code stream generated by an encoding device, where the code stream includes Clean Random Access (CRA) image information, The CRA image information satisfies the constraints of the CRA image and the constraints include that the CRA image must be a reference image.
本实施例的技术方案,在现有 HEVC标准文稿的基础上,改进了 CRA 图像的约束条件, 即在现有针对 CRA 图像的约束条件的基础上, 强制约 束 CRA图像为参考图像,从而使得该 CRA图像可以用于辅助后续图像的 图片顺序编号 (Picture Order Count, 以下简称: POC )推导, 否则该码流 为非法码流, 而该 CRA图像不能用于辅助后续图像的 POC推导, 从而避 免 POC计算异常以及解码异常。  The technical solution of the embodiment improves the constraint condition of the CRA image on the basis of the existing HEVC standard document, that is, forcibly constrains the CRA image into a reference image based on the existing constraints on the CRA image, thereby making the The CRA image can be used to assist the picture order number (Picture Order Count, hereinafter referred to as POC) derivation of the subsequent image, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist the POC derivation of the subsequent image, thereby avoiding the POC. Calculate exceptions and decode exceptions.
下面釆用一个具体的实施例, 对上述技术方案进行详细说明。  The above technical solution will be described in detail below using a specific embodiment.
图 1为本发明视频图像码流处理方法实施例一的流程图,如图 1所示, 本实施例的方法可以包括:  1 is a flowchart of Embodiment 1 of a video image code stream processing method according to the present invention. As shown in FIG. 1, the method in this embodiment may include:
步骤 101、 提取编码设备产生的码流, 码流中包含 CRA图像信息; 步骤 102、 根据 CRA图像的约束条件, 确定该码流是否是合法码流; 该约束条件包括 CRA图像必须为参考图像。 Step 101: Extract a code stream generated by the encoding device, where the code stream includes CRA image information. Step 102: Determine, according to a constraint condition of the CRA image, whether the code stream is a legal code stream. This constraint includes that the CRA image must be a reference image.
具体来说, HEVC使用即时解码刷新( Instantaneous Decoding Refresh, 以下简称: IDR ) 图像和 CRA图像两种图像类型支持随机访问。  Specifically, HEVC supports random access using both Instantaneous Decoding Refresh (IDR) images and CRA images.
对于 CRA图像来说, 其只能是 I帧图像; 对于解码顺序和输出顺序 在 CRA图像之后的图像, 不允许使用解码顺序或输出顺序在 CRA图像之 前的图像进行帧间预测, 对解码顺序在 CRA 图像之前的图像, 其输出顺 序也在 CRA图像之前。 因此, CRA图像可以不依赖于其他图像进行正常 解码, CRA图像之后的图像的解码操作不依赖于任何 CRA图像之前的图 像, 因此, CRA图像可以作为起始点解码一个视频序列; 而 CRA图像之 前的图像不会影响 CRA图像之后图像的输出, 因此, CRA图像可以作为 起始点输出一个视频序列。  For a CRA image, it can only be an I-frame image; for an image whose decoding order and output order are after the CRA image, it is not allowed to use the decoding order or the output order to predict the image before the CRA image, and the decoding order is The image before the CRA image, whose output order is also before the CRA image. Therefore, the CRA image can be normally decoded without relying on other images, and the decoding operation of the image after the CRA image does not depend on any image before the CRA image. Therefore, the CRA image can be used as a starting point to decode a video sequence; and before the CRA image The image does not affect the output of the image after the CRA image, so the CRA image can be used as a starting point to output a video sequence.
综上, CRA图像可以作为一个视频序列中的第一幅图像进行解码与输 出操作。 HEVC中规定一个视频序列的第一幅图像只能是 IDR图像或 CRA 图像, 因此若一幅图像既不是 IDR图像也不是 CRA图像, 则其一定不是 所在视频序列中的第一幅图像。  In summary, the CRA image can be decoded and output as the first image in a video sequence. The first image of a video sequence specified in HEVC can only be an IDR image or a CRA image, so if an image is neither an IDR image nor a CRA image, it must not be the first image in the video sequence.
进一步地, 对于每一幅图像来说, 其都有一个对应的 POC, 该 POC 是用于指示一幅图像与解码图像緩存( Decoded Picture Buffer, 以下简称: DPB ) 中其它图像的时序先后关系, 而该时序先后关系规定 DPB 中所有 图像的输出顺序。 一般来说, POC号码较小的图像应当在 POC号码较大 的图像之前输出。 因此, 在推导当前图像的 POC 的过程中需要首先确定 当前图像的之前图像, 然后获取之前图像的 POC信息并应用之前图像的 POC 信息推导当前图像的 POC。 其中, 之前图像, 是指满足下面两个条 件的按照解码顺序的前一幅图像: 条件一, 该图像的语法元素 temporal— id 为 0,该语法元素 temporal— id可从网络抽象层单元( Network Abstract Layer Unit, 以下简称: NALU )的头字段(以下简称: header )中获取; 条件二, 该图像是参考图像, 可从 NALU header中的语法元素 nal— ref— flag判断一 幅图像是否为参考图像, 现有 HEVC标准文稿中, nal— ref— flag为 1表示 该图像是参考图像, nal— ref— flag为 0表示该图像不是参考图像。  Further, for each image, it has a corresponding POC, which is used to indicate the temporal relationship between an image and other images in a Decoded Picture Buffer (DPB). The timing relationship specifies the output order of all images in the DPB. In general, images with a small POC number should be output before the image with a large POC number. Therefore, in deriving the POC of the current image, it is necessary to first determine the previous image of the current image, then acquire the POC information of the previous image and apply the POC information of the previous image to derive the POC of the current image. Wherein, the previous image refers to the previous image in the decoding order that satisfies the following two conditions: Condition 1, the syntax element of the image temporal_id is 0, and the syntax element temporal_id can be from the network abstraction layer unit (Network The Abstract Layer Unit (hereinafter referred to as NALU) is obtained from the header field (hereinafter referred to as header); Condition 2, the image is a reference image, and the syntax element nal_ref_flag in the NALU header can be used to determine whether an image is a reference. Image, in the existing HEVC standard document, nal_ref_flag of 1 indicates that the image is a reference image, and nal_ref_flag of 0 indicates that the image is not a reference image.
因此,一个视频序列中的第一幅解码图像,无论是 IDR图像还是 CRA 图像, 都必须同时满足上述两个条件, 才能够保证所在视频序列中后续的 任何一幅图像都能够找到其对应的之前图像。 Therefore, the first decoded image in a video sequence, whether it is an IDR image or a CRA image, must satisfy both of the above conditions to ensure subsequent frames in the video sequence. Any image can find its corresponding previous image.
但是,在实践过程中发现 ,如果一个视频序列中的第一幅图像是 CRA, 则时常出现后续图像无法找到其对应的之前图像的问题。  However, it has been found in practice that if the first image in a video sequence is a CRA, it often happens that subsequent images cannot find their corresponding previous images.
经过深入研究发现, 现有 HEVC 标准文稿对于 CRA 图像的 temporal— id, 限制其必须为 0, 否则包含该 CRA图像的码流为非法码流。 但对于 CRA图像的 nal— ref— flag来说, 现有 HEVC标准文稿并未限制其 nal— ref— flag必须为 1。 在该情况下, 若一幅 CRA图像作为一个视频序列 的第一幅图像, 且该 CRA图像的 nal— ref— flag为 0, 也即表示该 CRA图像 不是参考图像, 那么该 CRA图像后的各幅图像就无法找到其对应的之前 图像, 从而引起 POC计算异常并最终导致解码异常。  After in-depth research, it is found that the existing HEVC standard document has a temporal-id for the CRA image, which must be 0, otherwise the code stream containing the CRA image is an illegal code stream. However, for the nal_ref_flag of the CRA image, the existing HEVC standard document does not limit its nal-ref_flag must be 1. In this case, if a CRA image is used as the first image of a video sequence, and the nal_ref_flag of the CRA image is 0, that is, the CRA image is not a reference image, then each of the CRA images The image cannot find its corresponding previous image, causing the POC calculation to be abnormal and eventually causing the decoding to be abnormal.
鉴于上述问题, 本实施例在现有 HEVC 标准文稿的基础上, 改进了 CRA图像的约束条件, 即在现有针对 CRA图像的约束条件的基础上, 强 制约束 CRA图像为参考图像。 解码设备可以提取编码设备生成的码流, 然后可以确定该码流是否包含 CRA图像信息, 若包含, 则根据 CRA图像 的约束条件判断该码流中该 CRA 图像的属性是否为参考图像, 如果是参 考图像, 则可以认为该码流为合法码流, 其可以用作后续图像的 POC推 导, 否则该码流为非法码流。 在实际应用中, 解码设备可以强制约束 CRA 图像为参考图像, 从而对该非法码流进行容错处理。  In view of the above problems, the present embodiment improves the constraint condition of the CRA image based on the existing HEVC standard document, that is, the constraint CRA image is forced to be a reference image based on the existing constraints on the CRA image. The decoding device may extract the code stream generated by the encoding device, and then determine whether the code stream includes CRA image information, and if so, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if Referring to the image, the code stream can be considered as a legal code stream, which can be used as a POC derivation of the subsequent image, otherwise the code stream is an illegal code stream. In practical applications, the decoding device can forcibly constrain the CRA image into a reference image, thereby performing fault tolerance processing on the illegal code stream.
整个视频码流是封装在多个 NALU内进行传输的, 每个 NALU可以 包含一幅图像的全部信息或者部分信息。 因此, 本实施例在具体实现时, 对于 CRA图像对应的 NALU来说, 可以釆用 NALU header中的语法元素 nal— unit— type指示该 NALU 中包含的信息属于 CRA 图像, 且该 NALU header中的另一语法元素 nal— ref— flag为 1 , 则可以表示该 CRA图像为参 考图像。  The entire video stream is encapsulated in multiple NALUs, and each NALU can contain all or part of an image. Therefore, in the specific implementation of the embodiment, for the NALU corresponding to the CRA image, the syntax element nal_unit_type in the NALU header may be used to indicate that the information contained in the NALU belongs to the CRA image, and the NALU header is in the NALU header. Another syntax element nal_ref_flag is 1, which can indicate that the CRA image is a reference image.
在解码设备提取编码设备生成的 NALU后,可以根据该 NALU header 中的语法元素 nal— unit— type确定该 NALU中包含的信息是否属于 CRA图 像, 如果属于 CRA图像, 则可以根据该 NALU header中的另一语法元素 nal— ref— flag来确定该 CRA图像是否为参考图像, 进而确定包含该 NALU 的码流是否为合法码流。  After the decoding device extracts the NALU generated by the encoding device, it may determine, according to the syntax element nal_unit_type in the NALU header, whether the information included in the NALU belongs to the CRA image, and if it belongs to the CRA image, according to the NALU header. Another syntax element nal_ref_flag determines whether the CRA image is a reference image, and further determines whether the code stream containing the NALU is a legal code stream.
本实施例, 在现有 HEVC标准文稿的基础上, 改进了 CRA图像的约 束条件, 即在现有针对 CRA图像的约束条件的基础上, 强制约束 CRA图 像为参考图像。 解码设备在提取编码设备生成的码流后, 可以确定该码流 是否包含 CRA图像, 若包含, 则可以根据 CRA图像的约束条件判断该码 流中该 CRA图像的属性是否为参考图像, 如果是参考图像, 则可以确定 该码流为合法码流, 而该 CRA图像可以用于辅助后续图像的 POC推导, 否则该码流为非法码流, 而该 CRA图像不能用于辅助后续图像的 POC推 导, 从而避免 POC计算异常以及解码异常。 In this embodiment, on the basis of the existing HEVC standard document, the improvement of the CRA image is improved. The beam condition, that is, based on the existing constraints for the CRA image, forces the CRA image to be a reference image. After extracting the code stream generated by the encoding device, the decoding device may determine whether the code stream includes a CRA image, and if yes, determine whether the attribute of the CRA image in the code stream is a reference image according to a constraint condition of the CRA image, if The reference image may determine that the code stream is a legal code stream, and the CRA image may be used to assist POC derivation of subsequent images, otherwise the code stream is an illegal code stream, and the CRA image cannot be used to assist POC derivation of subsequent images. , thereby avoiding POC calculation anomalies and decoding anomalies.
图 2为本发明视频图像码流处理方法实施例二的流程图,如图 2所示, 本实施例的方法可以包括:  2 is a flowchart of Embodiment 2 of a video image code stream processing method according to the present invention. As shown in FIG. 2, the method in this embodiment may include:
步骤 201、 确定依次解码的图像的 POC是否按照非降序排列; 步骤 202、 若是, 则釆用如下方法确定所述当前图像的 POCMsb: 从码流中获取当前图像的 POCLsb;  Step 201: Determine whether POCs of sequentially decoded images are arranged in non-descending order; Step 202, if yes, determine POCMsb of the current image by using: obtaining POCLsb of the current image from the code stream;
如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像 的 POCMsb 等于之前图像的 POCMsb 与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽; If the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of the POCMsb of the previous image and the maximum POCLsb, wherein the maximum POCLsb=2 N , where N is the bit width of the POCLsb;
否则, 所述当前图像的 POCMsb等于之前图像的 POCMsb;  Otherwise, the POCMsb of the current image is equal to the POCMsb of the previous image;
步骤 203、 根据所述当前图像的 POCMsb 以及所述当前图像的 POCLsb, 确定所述当前图像的 POC。  Step 203: Determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
如前所述, 对于每一幅图像来说, 其都有一个对应的 POC, 该 POC 是用于指示一幅图像与 DPB 中其它图像的时序先后关系, 而该时序先后 关系规定 DPB中所有图像的输出顺序。 一般来说, POC号码较小的图像 应当在 POC号码较大的图像之前输出。 因此, 在推导当前图像的 POC的 过程中需要首先确定当前图像的之前图像, 然后获取之前图像的 POC信 息并应用之前图像的 POC信息推导当前图像的 POC。  As mentioned before, for each image, it has a corresponding POC, which is used to indicate the timing relationship between an image and other images in the DPB, and the timing relationship defines all the images in the DPB. The order of the output. In general, images with a small POC number should be output before the image with a large POC number. Therefore, in deriving the POC of the current image, it is necessary to first determine the previous image of the current image, then acquire the POC information of the previous image and apply the POC information of the previous image to derive the POC of the current image.
具体来说, 一幅图像的 POC是由其重要比特位( Most Significant Bit, 以下简称: MSB )和非重要比特位(Least Significant Bit以下简称: LSB ) 共同决定的。在本实施例中, MSB可记作 POCMsb, LSB可记作 POCLsb。  Specifically, the POC of an image is determined by a combination of a Major Significant Bit (MSB) and a Least Significant Bit (LSB). In this embodiment, the MSB can be referred to as POCMsb, and the LSB can be referred to as POCLsb.
当前图像的 POC可以按照下式计算得到:  The POC of the current image can be calculated as follows:
PicOrderCntVal = PicOrderCntMsb + pic— order— cnt—lsb  PicOrderCntVal = PicOrderCntMsb + pic—order— cnt—lsb
其中, pic— order— cnt—lsb 是从码流中获取的当前图像的 POCLsb, 而 PicOrderCntMsb 是解码设备推导获得的 当前图像的 POCMsb ,Where pic_order_cnt_lsb is the POCLsb of the current image obtained from the code stream, and PicOrderCntMsb is the POCMsb of the current image derived by the decoding device.
PicOrderCntVal是当前图像的 POC。 PicOrderCntVal is the POC of the current image.
现有 HEVC标准文稿釆用如下方式推导获得 PicOrderCntMsb:  Existing HEVC standard documents are derived using the following method to obtain PicOrderCntMsb:
1、 获取之前图像的 POCLsb (以下简称: prevPicOrderCntLsb ) 和之 前图像的 POCMsb (以下简称: prevPicOrderCntMsb )  1. Obtain the POCLsb of the previous image (hereinafter referred to as: prevPicOrderCntLsb) and the POCMsb of the previous image (hereinafter referred to as: prevPicOrderCntMsb)
如果当前图像是 IDR图像,或者当前图像是整个序列中的第一幅图像 且为 CRA图像, 则将 prevPicOrderCntLsb和 prevPicOrderCntMsb均设置 为 0;  If the current image is an IDR image, or if the current image is the first image in the entire sequence and is a CRA image, then prevPicOrderCntLsb and prevPicOrderCntMsb are both set to 0;
否则, 确定当前图像的之前图像, 并获取之前图像的 POCMsb 与 POCLsb , 分别作为 prevPicOrderCntLsb和 prevPicOrderCntMsb, 其中, 之 前图像的定义在前一实施例中已经说明, 此处不再赘述。  Otherwise, the previous image of the current image is determined, and POCMsb and POCLsb of the previous image are obtained as prevPicOrderCntLsb and prevPicOrderCntMsb, respectively, wherein the definition of the previous image has been described in the previous embodiment, and will not be described herein.
2 、 才艮据 prevPicOrderCntLsb 和 prevPicOrderCntMsb , 确 定 PicOrderCntMsb  2, according to prevPicOrderCntLsb and prevPicOrderCntMsb, determine PicOrderCntMsb
具体地,确定 PicOrderCntMsb的过程可以釆用如下伪代码所表示的操 作实现:  Specifically, the process of determining PicOrderCntMsb can be implemented using the operations represented by the pseudo code below:
if((pic_order_cnt_lsb<prevPicOrderCntLsb)&&  If((pic_order_cnt_lsb<prevPicOrderCntLsb)&&
((prevPicOrderCntLsb-pic_order_cnt_lsb)>=(MaxPicOrderCntLsb/2))) PicOrderCntMsb = prevPicOrderCntMsb + MaxPicOrderCntLsb else if((pic_order_cnt_lsb>prevPicOrderCntLsb)&&  ((prevPicOrderCntLsb-pic_order_cnt_lsb)>=(MaxPicOrderCntLsb/2))) PicOrderCntMsb = prevPicOrderCntMsb + MaxPicOrderCntLsb else if((pic_order_cnt_lsb>prevPicOrderCntLsb)&&
((pic_order_cnt_lsb-prevPicOrderCntLsb)>(MaxPicOrderCntLsb/2 ((pic_order_cnt_lsb-prevPicOrderCntLsb)>(MaxPicOrderCntLsb/2
))) )))
PicOrderCntMsb=prevPicOrderCntMsb-MaxPicOrderCntLsb else  PicOrderCntMsb=prevPicOrderCntMsb-MaxPicOrderCntLsb else
PicOrderCntMsb = prevPicOrderCntMsb  PicOrderCntMsb = prevPicOrderCntMsb
其中, 最大 POCLsb (以下简称: MaxPicOrderCntLsb )是由 POCLsb 的位宽 N决定的, MaxPicOrderCntLsb=2N。 举例来说, POCLsb的位宽为 4 , 即 POCLsb釆用 4bit表示, 则 MaxPicOrderCntLsb=24=16, POCLsb釆 用 8比特表示时, MaxPicOrderCntLsb=28=64。 上述三条分支语言的含义分别为: Among them, the maximum POCLsb (hereinafter referred to as: MaxPicOrderCntLsb) is determined by the bit width N of POCLsb, and MaxPicOrderCntLsb=2 N . For example, POCLsb has a bit width of 4, that is, POCLsb釆 is represented by 4 bits, then MaxPicOrderCntLsb=2 4 =16, and POCLsb釆 is represented by 8 bits, and MaxPicOrderCntLsb=2 8 =64. The meanings of the above three branch languages are:
如果从码流中获得的当前图像的 pic— order— cnt—lsb 小于之前图像的 prevPicOrderCntLsb , 并且 prevPicOrderCntLsb与 pic— order— cnt—lsb的差值 大于等于 MaxPicOrderCntLsb/2,则 PicOrderCntMsb = prevPicOrderCntMsb + MaxPicOrderCntLsb;  If the pic_order_cnt_lsb of the current image obtained from the code stream is smaller than the prevPicOrderCntLsb of the previous image, and the difference between prevPicOrderCntLsb and pic_order_cnt_lsb is greater than or equal to MaxPicOrderCntLsb/2, then PicOrderCntMsb = prevPicOrderCntMsb + MaxPicOrderCntLsb;
如果从码流中获得的当前图像的 pic— order— cnt—lsb 大于之前图像的 prevPicOrderCntLsb , 并且 pic— order— cnt—lsb与 revPicOrderCntLsb的差值 大于 MaxPicOrderCntLsb/2, 则 PicOrderCntMsb = prevPicOrderCntMsb - MaxPicOrderCntLsb;  If the pic_order_cnt_lsb of the current image obtained from the code stream is greater than the prevPicOrderCntLsb of the previous image, and the difference between pic_order_cnt_lsb and revPicOrderCntLsb is greater than MaxPicOrderCntLsb/2, then PicOrderCntMsb = prevPicOrderCntMsb - MaxPicOrderCntLsb;
否则, PicOrderCntMsb = prevPicOrderCntMsb。  Otherwise, PicOrderCntMsb = prevPicOrderCntMsb.
因此,按照现有技术推导 PicOrderCntMsb的效果是, 当前图像的 POC 大于之前图像的 POC-MaxPicOrderCntLsb/2 , 且小于等于之前图像的 POC+MaxPicOrderCntLsb/2。  Therefore, the effect of deriving PicOrderCntMsb according to the prior art is that the POC of the current image is larger than POC-MaxPicOrderCntLsb/2 of the previous image and less than or equal to POC+MaxPicOrderCntLsb/2 of the previous image.
举例来说,假设用 4比特表示 POCLsb, 即 MaxPicOrderCntLsb为 16, 而视频码流中按照输出时序顺序排列的相邻三幅图像 A、 B与 C均为参考 图像且 temporal— id为 0, 这三幅图像的 POCLsb分别为: POCLsb(A)=2, POCLsb(B)=7, POCLsb(C)=l l。  For example, suppose that POCLsb is represented by 4 bits, that is, MaxPicOrderCntLsb is 16, and adjacent three images A, B, and C arranged in order of output timing in the video bitstream are reference images and temporal_id is 0, these three The POCLsb of the image are: POCLsb(A)=2, POCLsb(B)=7, POCLsb(C)=ll.
假设三幅图像均被解码设备正确接收、 解码并放置在 DPB 中等待输 出, 则解码设备会按照下面方法确定这三幅图像输出顺序:  Assuming that all three images are correctly received, decoded, and placed in the DPB for output, the decoding device determines the output order of the three images as follows:
1、 确定图像 A的 POC  1. Determine the POC of image A
将图像 A的 POCMsb记作 POCMsb(A) , POCMsb(A)可以在解码图像 A的过程中确定。  The POCMsb of the image A is referred to as POCMsb(A), and the POCMsb(A) can be determined during the process of decoding the image A.
因此, 图像 A的 POC为:  Therefore, the POC of image A is:
POC(A)=POCMsb(A) + POCLsb(A) = POCMsb(A)+2  POC(A)=POCMsb(A) + POCLsb(A) = POCMsb(A)+2
2、 确定图像 B的 POC  2. Determine the PO of the image B
选择 B作为当前图像, 因此 pic— order— cnt—lsb = POCLsb(B) = 7; 确定图像 B 的之前图像为图像 A, 因此, prevPicOrderCntLsb =2, prevPicOrderCntMsb = POCMsb(A)。  Select B as the current image, so pic_order- cnt-lsb = POCLsb(B) = 7; Determine that the previous image of image B is image A, therefore, prevPicOrderCntLsb = 2, prevPicOrderCntMsb = POCMsb(A).
由 于 pic— order— cnt—lsb > prevPicOrderCntLsb 且 pic— order— cnt—lsb— prevPicOrderCntLsb=5 < MaxPicOrderCntLsb/2 , 因 JH , 按照上述第三条件分支计算 PicOrderCntMsb=prevPicOrderCntMsb ,则可推 断 POCMsb(B)=POCMsb(A); Since pic_order- cnt-lsb > prevPicOrderCntLsb and pic_order- cnt-lsb- prevPicOrderCntLsb=5 < MaxPicOrderCntLsb/2 , because JH, Calculating PicOrderCntMsb=prevPicOrderCntMsb according to the third conditional branch above, then inferring POCMsb(B)=POCMsb(A);
因此, 图像 B的 POC为:  Therefore, the POC of image B is:
POC(B) = POCMsb(B) + pic— order— cnt—lsb = POCMsb(A)+7。  POC(B) = POCMsb(B) + pic-order- cnt-lsb = POCMsb(A)+7.
3、 确定图像 C的 POC  3. Determine the PO of the image C
选择 C作为当前图像, 因此 pic— order— cnt—lsb = POCLsb(C) = 1 1 ; 确定图像 C的之前图像为图像 B , 因此, prevPicOrderCntLsb = 7 , prevPicOrderCntMsb = POCMsb(B) = POCMsb(A)  Select C as the current image, so pic_order—cnt—lsb = POCLsb(C) = 1 1 ; Determine that the previous image of image C is image B, therefore, prevPicOrderCntLsb = 7, prevPicOrderCntMsb = POCMsb(B) = POCMsb(A)
由 于 pic— order— cnt—lsb > pre vP ic OrderCntLsb 且 pic— order— cnt—lsb— prevPicOrderCntLsb=4 < MaxPicOrderCntLsb/2 , 因 JH , 按照上述第三条件分支计算 PicOrderCntMsb=prevPicOrderCntMsb ,则可推 断 POCMsb(C)=POCMsb(B)=POCMsb(A) ;  Since pic_order_cnt_lsb > pre vP ic OrderCntLsb and pic_order_cnt_lsb- prevPicOrderCntLsb=4 < MaxPicOrderCntLsb/2 , because JH, PicOrderCntMsb=prevPicOrderCntMsb is calculated according to the third conditional branch above, then POCMsb (C) can be inferred. ) = POCMsb (B) = POCMsb (A);
因此, 图像 C的 POC为:  Therefore, the POC of image C is:
POC(C) = POCMsb(C) + pic— order— cnt—lsb = POCMsb(A)+l 1。  POC(C) = POCMsb(C) + pic-order- cnt-lsb = POCMsb(A)+l 1.
因为, 解码设备计算得到的这三幅图像的 POC值分别是 POC(A) = Because the POC values of the three images calculated by the decoding device are POC(A) =
POCMsb(A) + 2 , POC(B) = POCMsb(A) + 7 , POC(C) = POCMsb(A) + 1 1 , 所以这三幅图像的输出顺序是 A->B->C。 POCMsb(A) + 2 , POC(B) = POCMsb(A) + 7 , POC(C) = POCMsb(A) + 1 1 , so the output order of these three images is A->B->C.
但是, 如果 B没有被解码设备正确接收, 则只有图像 A与 C被解码 设备正确接收、 解码并被放置在 DPB 中等待输出。 解码设备则会按照下 面的方法确定这两幅图像输出顺序:  However, if B is not correctly received by the decoding device, only images A and C are correctly received, decoded by the decoding device and placed in the DPB for output. The decoding device determines the order in which the two images are output as follows:
1、 确定图像 A的 POC  1. Determine the POC of image A
将图像 A的 POCMsb记作 POCMsb(A) , POCMsb(A)可以在解码图像 A的过程中确定。  The POCMsb of the image A is referred to as POCMsb(A), and the POCMsb(A) can be determined during the process of decoding the image A.
因此, 图像 A的 POC为:  Therefore, the POC of image A is:
POC(A)=POCMsb(A) + POCLsb(A) = POCMsb(A)+2  POC(A)=POCMsb(A) + POCLsb(A) = POCMsb(A)+2
2、 确定图像 C的 POC  2. Determine the PO of the image C
选择 C作为当前图像, 因此 pic— order— cnt—lsb = POCLsb(C) = 1 1 ; 确定图像 C的之前图像为图像 A, 因此, prevPicOrderCntLsb = 2 , prevPicOrderCntMsb = POCMsb(A) ;  Select C as the current image, so pic_order- cnt-lsb = POCLsb(C) = 1 1 ; Determine that the previous image of image C is image A, therefore, prevPicOrderCntLsb = 2, prevPicOrderCntMsb = POCMsb(A);
由 于 pic— order— cnt—lsb > pre vP ic OrderCntLsb 且 pic_order_cnt_lsb-prevPicOrderCntLsb=9 > MaxPicOrderCntLsb/2 , 因 JH , 按照上述第二条件分支计算 PicOrderCntMsb = prevPicOrderCntMsb - MaxPicOrderCntLsb , 则可推断 POCMsb(C) = POCMsb(A) - 16 ; Since pic_order- cnt-lsb > pre vP ic OrderCntLsb and pic_order_cnt_lsb-prevPicOrderCntLsb=9 > MaxPicOrderCntLsb/2 , because JH, according to the second conditional branch calculation PicOrderCntMsb = prevPicOrderCntMsb - MaxPicOrderCntLsb, then POCMsb(C) = POCMsb(A) - 16 can be inferred;
因此, 图像 C的 POC:  Therefore, the POC of image C:
POC(C) = POCMsb(C) + pic— order— cnt—lsb = POCMsb(A) - 16 + 1 1 = POC(C) = POCMsb(C) + pic-order- cnt-lsb = POCMsb(A) - 16 + 1 1 =
POCMsb(A) _ 5。 POCMsb(A) _ 5.
因为计算得到的图像 A与 C的 POC分别是 POC(A) = POCMsb(A) + 2 , POC(C) = POCMsb(A) - 5 , 所以这两幅图像的输出顺序是 C->A。 显然, 在图像 B丟失的情况下, 解码设备按照给定方法无法推导图像 A与 C的 正确输出顺序。  Since the calculated POCs of images A and C are POC(A) = POCMsb(A) + 2 and POC(C) = POCMsb(A) - 5 , respectively, the output order of the two images is C->A. Obviously, in the case where the image B is lost, the decoding device cannot derive the correct output order of the images A and C according to a given method.
为了解决上述由于图像丟失而导致的图像乱序输出问题, 现有 HEVC 限制激活图像中 的任意两幅图像之间 的 POC 差值必须小于 MaxPicOrderCntLsb/2。  In order to solve the above problem of image out-of-order output due to image loss, the POC difference between any two images in the existing HEVC limit activation image must be smaller than MaxPicOrderCntLsb/2.
以上述丟失图像 B 的例子举例来说, 如果 POCLsb(C)与 POCLsb(A) 的差值小于 MaxPicOrderCntLsb/2 , 则 POCMsb(C) = POCMsb(A) , 从而不 会出现图像 A与 C输出顺序颠倒的情况。  For example, in the above example of missing image B, if the difference between POCLsb(C) and POCLsb(A) is less than MaxPicOrderCntLsb/2, POCMsb(C) = POCMsb(A), so that the image A and C output order does not occur. Reversed situation.
但是, 在实践过程中发现, 现有 HEVC规定的 POC推导过程, 适用 于任何编码结构, 也即, 既允许当前图像的 POC比之前图像的 POC大, 又允许当前图像的 POC 比之前图像的 POC 小。 例如, 在推导 PicOrderCntMsb的计算式中,第二个条件分支对应当前图像 POCMsb小于 之前图像 POCMsb 的情况。 此时, 考虑 POCLsb 的取值范围在 [0 , MaxPicOrderCntLsb)范围内,比较最后计算得到的当前图像的 POC与之前 图像的 POC , 显然当前图像的 POC小于之前图像的 POC。  However, it has been found in practice that the existing HEVC-defined POC derivation process applies to any coding structure, that is, allows both the POC of the current image to be larger than the POC of the previous image, and allows the POC of the current image to be POC of the previous image. small. For example, in the calculation formula for deriving PicOrderCntMsb, the second conditional branch corresponds to the case where the current image POCMsb is smaller than the previous image POCMsb. At this time, considering the value range of POCLsb in the range of [0, MaxPicOrderCntLsb), compare the POC of the last calculated current image with the POC of the previous image, obviously the POC of the current image is smaller than the POC of the previous image.
但是, 当不存在当前图像的 POC小于之前图像的 POC的情况, 即所 有依次解码的图像的 POC 是按照非降序排列的, 此时, 上述推导 PicOrderCntMsb的第二个条件分支是多余的,其无需限制激活图像中的任 意两幅图像之间的 POC差值必须小于 MaxPicOrderCntLsb/2。  However, when there is no POC of the current image smaller than the POC of the previous image, that is, the POCs of all sequentially decoded images are arranged in non-descending order, at this time, the second conditional branch of the above-mentioned derivation PicOrderCntMsb is redundant, which does not need The POC difference between any two images in the restricted activation image must be less than MaxPicOrderCntLsb/2.
鉴于上述分析, 本实施例针对依次解码的图像的 POC按照非降序排 列的情况,对 PicOrderCntMsb推导过程进行了改进,具体的伪代码所表达 如下: if( pic— order— cnt—lsb < prevPicOrderCntLsb ) In view of the above analysis, the present embodiment improves the derivation process of PicOrderCntMsb for the POC of sequentially decoded images in non-descending order, and the specific pseudo code is expressed as follows: If( pic— order— cnt—lsb < prevPicOrderCntLsb )
PicOrderCntMsb = pre vP ic OrderCntMsb + MaxPicOrderCntLsb else  PicOrderCntMsb = pre vP ic OrderCntMsb + MaxPicOrderCntLsb else
PicOrderCntMsb = pre vP ic OrderCntMsb  PicOrderCntMsb = pre vP ic OrderCntMsb
上述语法的含义为:  The meaning of the above grammar is:
如果从码流中获取的当前图像的 pic— order— cnt—lsb 小于之前图像的 prevPicOrderCntLsb , 贝1] PicOrderCntMsb= prevPicOrderCntMsb + MaxPicOrderCntLsb; If the pic_order_cnt_lsb of the current image obtained from the code stream is smaller than the prevPicOrderCntLsb of the previous image, Bay 1 ] PicOrderCntMsb= prevPicOrderCntMsb + MaxPicOrderCntLsb;
否贝1 J , PicOrderCntMsb = prevPicOrderCntMsb。 No Bay 1 J , PicOrderCntMsb = prevPicOrderCntMsb.
在具体实现时, 可以利用当序列参数集 ( Sequence Parameter Set, 以 下简称: SPS ) 中表示时序倒置图像的最大数量的语法元素 num— reorder— pics 的值来指示依次解码的图像的 POC 是否按照非降序排 歹 |J。 所述时序倒置图像指解码顺序在某一幅图像之前而输出顺序在那一幅 图像之后的图像。 例如, 使用 num— reorder— pics为 0表示依次解码的图像 的 POC按照非降序排列,使用 num— reorder_pics大于 0表示依次解码的图 像的 POC 没有按照非降序排列。 需注意, 在后续方法实施例四中对语法 元素 num— reorder_pics 的编码方式进行了修改。 修改后的变量 MaxDecPicBuffering与爹改前的语法元素 num_reorder_pics具有相同的物 理含义。 因此, 在实施例四的条件下 , 可根据变量 MaxDecPicBuffering的 值确定依次解码的图像的 POC是否按照非降序排列。  In a specific implementation, the value of the syntax element num_reorder_pics indicating the maximum number of timing inverted images in the Sequence Parameter Set (SPS) may be used to indicate whether the POC of the sequentially decoded image is not Descending order | J. The timing inverted image refers to an image in which the decoding order is before a certain image and the output order is after that image. For example, using num_reorder_pics to 0 means that the POCs of sequentially decoded images are arranged in non-descending order, and using num_reorder_pics greater than 0 means that the POCs of sequentially decoded images are not arranged in non-descending order. It should be noted that the encoding of the syntax element num_reorder_pics has been modified in the fourth embodiment of the subsequent method. The modified variable MaxDecPicBuffering has the same physical meaning as the syntax element num_reorder_pics before tampering. Therefore, under the condition of the fourth embodiment, it is determined whether the POCs of the sequentially decoded images are arranged in non-descending order according to the value of the variable MaxDecPicBuffering.
需要说明的是, 由于 num— reorder— pics是按照不同时间层传递的, 本 实施例中不带数组组标的变量均表示当前解码时间层的参数信息。 如当前 解码第 1层, 贝1 J num— reorder_pics表示 num— reorder— pics[l] , 其余变量类 似。 It should be noted that, since num_reorder_pics is transmitted according to different time layers, the variables without the array group label in this embodiment all represent the parameter information of the current decoding time layer. If the current layer 1 is decoded, Bay 1 J num — reorder_pics represents num — reorder — pics [l] , and the remaining variables are similar.
解码设备在确定当前图像与 DPB 各幅图像的输出顺序时, 可以先根 据 SPS中的 num— reorder— pics的取值来确定依次解码的图像的 POC是否 按照非降序排列, 如果确定依次解码的图像的 POC是按照非降序排列的, 则可以釆用本实施例改进的推导方式获得当前图像的 PicOrderCntMsb , 并 依据 PicOrderCntMsb和从码流中获取的当前图像的 pic— order— cnt—lsb , 来 得到当前图像的 PicOrderCntVal , 从而根据当前图像的 PicOrderCntVal以 及 DPB 中各幅图像的 POC值来确定当前图像与 DPB各幅图像的输出顺 序; 如果确定依次解码的图像的 POC 不是按照非降序排列的, 则可以釆
Figure imgf000016_0001
,, 本本实实施施例例不不再再赘赘述述。。
When determining the output order of the current image and the DPB images, the decoding device may first determine whether the POCs of the sequentially decoded images are arranged in non-descending order according to the value of num_reorder_pics in the SPS, if it is determined that the images are sequentially decoded. The POCs are arranged in non-descending order, and the PicOrderCntMsb of the current image can be obtained by using the improved derivation method of the embodiment, and the current current pic_order_cnt-lsb of the current image obtained from the code stream is obtained according to PicOrderCntMsb. The PicOrderCntVal of the image, thus based on the current image's PicOrderCntVal And the POC value of each image in the DPB to determine the output order of the current image and the DPB images; if it is determined that the POCs of the sequentially decoded images are not arranged in non-descending order, then
Figure imgf000016_0001
, the actual implementation of the examples will not be described again. .
可可以以理理解解的的是是,, 在在依依次次解解码码的的图图像像的的 PPOOCC按按照照非非降降序序排排列列的的情情况况下下,, 55 即即使使对对激激活活图图像像中中任任意意两两幅幅图图像像的的 PPOOCC 差差值值的的大大小小不不做做限限制制,, 也也不不会会影影 响响解解码码的的鲁鲁棒棒性性,, 即即不不会会出出现现解解码码图图像像相相对对位位置置错错乱乱的的情情况况。。 因因此此,, 基基于于 本本实实施施例例上上述述处处理理过过程程,,本本实实施施例例还还可可以以改改进进现现有有 HHEE VVCC标标准准文文稿稿中中对对激激 活活图图像像中中任任意意两两幅幅图图像像的的 PPOOCC 差差值值的的大大小小的的约约束束条条件件,, 即即,, 针针对对依依次次解解 码码的的图图像像的的 PPOOCC按按照照非非降降序序排排列列的的情情况况,, 无无需需釆釆用用约约束束条条件件对对激激活活图图像像 1100 中中任任意意两两幅幅图图像像的的 PPOOCC 差差值值的的大大小小进进行行限限制制。。 而而针针对对依依次次解解码码的的图图像像的的 PPOOCC按按照照非非降降序序排排列列的的情情况况,, 去去除除 PPOOCC差差值值的的大大小小的的约约束束,, 其其好好处处在在于于 可可以以减减少少 ppiicc—— oorrddeerr—— ccnntt——llssbb的的位位宽宽,, 从从而而可可以以减减 、、传传输输开开销销。。  It can be understood that, under the condition that the PPOOCC of the image image of the image of the decoded code is arranged in the non-non-descending descending order, 55 Even if the size of the PPOOCC difference value of any two or two image images in the image of the active activation image is not limited, it will not be affected. It affects the robustness of the ringing decoding code, that is, it does not cause the situation that the image of the current decoded coded image is misplaced relative to the position of the alignment. . Therefore, based on the above-mentioned processing and processing procedures in the present embodiment, the present embodiment can also be modified to improve the existing HHEE VVCC standard document. The large and small constrained bundle conditional condition of the PPOOCC difference value of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image of the image For the case where the PPOOCC of the image image of the image of the successive de-decoded code is arranged according to the non-non-descending descending sequence, there is no need to use the constraint constraint to bind the conditional component. The live limit image is activated by a large and small limit of the PPOOCC difference value of any two or two image images in the 1100. . However, the PPOOCC for the image image of the image image of the successive de-decoded code is arranged according to the non-non-descending descending sequence, and the large and small values of the PPOOCC difference value are removed. The constraint bundle, its good advantage lies in the fact that it can reduce the width of the picicc - oorrddeerr - ccnntt - llssbb, from which it can be reduced, transmitted, transmitted Lose the overhead. .
本本实实施施例例中中,, 解解码码设设备备可可以以先先确确定定依依次次解解码码的的图图像像的的 PPOOCC是是否否按按照照 非非降降序序排排列列;; 如如果果是是按按照照非非降降序序排排列列的的,, 则则可可以以釆釆用用改改进进的的推推导导方方式式确确定定 1155 当当前前图图像像的的 PPOOCCMMssbb ,, 即即从从码码流流中中获获取取当当前前图图像像的的 PPOOCC 非非重重要要比比特特位位 PPOOCCLLssbb,, 如如果果所所述述当当前前图图像像的的 PPOOCCLLssbb小小于于之之前前图图像像的的 PPOOCCLLssbb,, 则则当当前前 图图像像的的 PPOOCCMMssbb等等于于之之前前图图像像的的 PPOOCCMMssbb与与最最大大 PPOOCCLLssbb之之和和,, 否否则则,, 当当 前前图图像像的的 PPOOCCMMssbb 等等于于 pprreevvPPiiccOOrrddeerrCCnnttMMssbb;; 最最后后再再根根据据当当前前图图像像的的 PPOOCCMMssbb以以及及当当前前图图像像的的 PPOOCCLLssbb,, 确确定定当当前前图图像像的的 PPOOCC。。 因因此此,, 解解码码设设 2200 备备可可以以根根据据当当前前图图像像的的 PPOOCC以以及及 DDPPBB中中各各幅幅图图像像的的 PPOOCC确确定定输输出出顺顺序序。。  In the embodiment of the present embodiment, the de-code decoding device may be configured to first determine whether the PPOOCC of the image image of the de-decoded code is determined according to the non-non-descending descending order. Arrange the columns; if the results are arranged in a non-descending descending order, then the derivation method can be used to determine the 1155 when the current PPOOCCMMssbb of the image of the previous image, that is, the PPOOCC that is obtained from the code stream is taken as the image of the current image of the previous image is not more important than the bit bit PPOOCCLLssbb, as described When the PPOOCCLLssbb of the current image of the previous image is smaller than the PPOOCCLLssbb of the previous image of the previous image, then the PPOOCCMMssbb of the current image of the previous image is equal to the PPOOCCMMssbb of the image of the previous image before and after The sum of the largest PPOOCCLLssbb and, if not, then, when the current image of the previous image is PPOOCCMMssbb, etc. is equal to pprreevvPPiiccOOrrddeerrCCnnttMMssbb;; the last and then the root image according to the current image The PPOOCCMMssbb and the PPOOCCLLssbb of the current image of the previous image are indeed determined to be the PPOOCC of the current image of the previous image. . Therefore, the deciphering code setting 2200 can be determined to be determined according to the PPOOCC according to the image of the current front image and the PPOOCC of each image image of the DDPPBB. In order. .
因因此此,, 本本实实施施例例针针对对依依次次解解码码的的图图像像的的 PPOOCC按按照照非非降降序序排排列列的的情情况况,, 简简 化化了了 PPOOCCMMssbb的的推推导导方方式式,, 提提高高了了处处理理效效率率,, 而而且且,, 基基于于该该改改进进的的推推导导方方 式式,, 可可以以去去除除对对激激活活图图像像中中任任意意两两幅幅图图像像的的 PPOOCC 差差值值的的大大小小的的限限制制,, 从从 而而可可以以减减少少码码流流中中 PPOOCCLLssbb的的位位宽宽,, 进进而而减减小小传传输输开开销销。。  Therefore, in the present embodiment, the embodiment of the present embodiment is simplified for the case where the PPOOCCs of the image images of the successively decoded codes are arranged in a non-descending descending order. The push-guided mode of PPOOCCMMssbb is improved, and the rate of efficiency of processing is improved. Moreover, based on the push-guide method of the improvement, it may be In order to remove the limit system of the PPOOCC difference value of the image of any two or two images of the image of the image of the image of the image of the image of the image The bit width and width of the PPOOCCLLssbb in the less code stream are reduced, and then the small transmission transmission and the overhead pin are reduced. .
2255 图图 33为为本本发发明明视视频频图图像像码码流流处处理理方方法法实实施施例例三三的的流流程程图图,,如如图图 33所所示示,, 本本实实施施例例的的方方法法可可以以包包括括:: 2255 FIG. 33 is a flow chart of the third embodiment of the embodiment of the method for processing the video stream of the video of the clear video video image of the present invention, as shown in FIG. 33. It should be noted that the method of the embodiment of the present invention may be included in the package:
步步骤骤 330011、、 提提取取编编码码设设备备产产生生的的码码流流,, 其其中中,, 第第一一类类语语法法元元素素与与第第二二 类类语语法法元元素素放放置置在在所所述述码码流流中中的的邻邻近近位位置置上上,, 或或者者第第二二类类语语法法元元素素与与第第三三 类类语语法法元元素素放放置置在在所所述述码码流流中中的的邻邻近近位位置置上上,, 所所述述第第一一类类语语法法元元素素是是指指包包 3300 含含时时间间层层数数量量信信息息的的语语法法元元素素,, 所所述述第第二二类类语语法法元元素素是是指指包包含含每每个个时时间间层层 的码流特征信息的语法元素, 所述第三类语法元素是指包含每一幅图像所 占用存储空间大小信息的语法元素; Step 330011, extracting the code stream of the code-coding device, and among them, the first class of grammatical elements and the second class The grammatical element element placement is placed adjacent to the nearest neighbor position in the code stream, or the second second class grammatical element element and And the third third class grammatical element element placement is placed in the adjacent neighboring near position in the code stream, and the first class is The grammatical element element refers to the grammatical element element of the packet 3300 containing the amount of information of the number of layers in the time period, and the second categorical grammatical element of the second class Elemental means that the package contains layers of time between each a syntax element of the code stream feature information, where the third class syntax element refers to a syntax element containing information about the size of the storage space occupied by each image;
步骤 302、 从所述码流的所述邻近位置上抽取至少两类语法元素。 具体来说, 由于解码设备的能力是有限的。 为了保证解码设备能够正 常解码码流, 解码设备需要判断该码流所要求的解码能力是否在其所支持 的解码能力范围内, 若是, 则解码设备可以正常接收码流并进行解码, 否 贝' h 解码设备将进行异常处理。  Step 302: Extract at least two types of syntax elements from the neighboring locations of the code stream. In particular, the capabilities of the decoding device are limited. In order to ensure that the decoding device can decode the code stream normally, the decoding device needs to determine whether the decoding capability required by the code stream is within the range of decoding capabilities supported by the code stream. If yes, the decoding device can normally receive the code stream and decode it. h The decoding device will perform exception handling.
其中, 解码能力可以包括对 DPB 的限制。 这种限制可以包括: DPB 中存储的图像数量的限制, DPB中所有图像占用的存储空间的限制。 DPB 中所有图像占用的存储空间可由表示 DPB 中存储的图像数量的相关参数 以及每一幅图像所占用的存储空间的相关参数计算得到, 其中, 表示每一 幅图像所占用存储空间的语法元素可以包括: 表示图像大小的相关参数, 例如图像的宽和高; 图像裁剪参数; 每个釆样值各个色度分量的位宽等。 码整个码流, 另一种是在码流中包含若干不同的时间层的情况下, 只抽取 并解码所能支持的部分时间层码流而丟弃解码设备不能支持的时间层码 流。  Among them, the decoding capability can include restrictions on the DPB. Such restrictions may include: Limitations on the number of images stored in the DPB, and limitations on the storage space occupied by all images in the DPB. The storage space occupied by all the images in the DPB can be calculated by the relevant parameters indicating the number of images stored in the DPB and the relevant parameters of the storage space occupied by each image, wherein the syntax elements indicating the storage space occupied by each image can be Including: related parameters indicating the image size, such as the width and height of the image; image cropping parameters; the bit width of each chroma component of each sample value, and the like. The code is the entire code stream, and the other is to extract and decode only part of the time layer code stream that can be supported while discarding the time layer code stream that the decoding device cannot support, in the case where the code stream contains several different time layers.
上述按时间层对码流进行抽取的操作还可以在媒体网关中进行。 在视 频的传输过程中, 解码设备, 例如客户端中的解码器, 可以与媒体网关进 行通信, 告知媒体网关自己的解码能力。 一种情况下, 媒体网关通过解析 SPS中的部分信息发现当前码流对解码能力的要求超过解码设备所能支持 的解码能力, 则可以选择不将码流发送给解码器。 另一种情况下, 如果媒 体网关通过解析 SPS 中的部分信息发现尽管完整码流所要求的解码能力 超过解码设备所支持的解码能力但部分时间层码流所要求的解码能力在 解码设备所支持的解码能力范围之内, 则媒体网关可以选择丟弃部分时间 层码流, 而只将余下部分时间层码流发送给解码设备, 以节省传输带宽。  The above-mentioned operation of extracting the code stream by the time layer can also be performed in the media gateway. During the transmission of the video, the decoding device, such as the decoder in the client, can communicate with the media gateway to inform the media gateway of its own decoding capabilities. In one case, the media gateway can select not to transmit the code stream to the decoder by parsing part of the information in the SPS to find that the decoding capability of the current code stream exceeds the decoding capability supported by the decoding device. In another case, if the media gateway parses part of the information in the SPS, it is found that although the decoding capability required for the complete stream exceeds the decoding capability supported by the decoding device, the decoding capability required for the partial time layer stream is supported by the decoding device. Within the decoding capability range, the media gateway may choose to discard part of the time layer code stream, and only send the remaining part of the layer layer code stream to the decoding device to save transmission bandwidth.
上述解码设备或媒体网关进行时间层码流抽取操作的前提是通过解 码 SPS中部分语法元素获得相关信息,从而确定解码设备所能够支持的时 间层码流。  The premise of the above-mentioned decoding device or media gateway performing the temporal layer stream extraction operation is to obtain relevant information by decoding some syntax elements in the SPS, thereby determining the time layer code stream that the decoding device can support.
在 SPS中, 与时间层码流抽取操作相关的语法元素可以分为三类, 在 本实施例中分别记为第一类语法元素、 第二类语法元素以及第三类语法元 素。 其中, 第一类语法元素, 是指包含时间层数量信息的语法元素, 例如 语法元素 max— temporal— layers— minus 1 ; 第二类语法元素, 是指包含每个 时 间 层 的 码 流 特 征信 息 的 语 法 元 素 , 例 如 语 法 元 素 max— dec— pic— buffering[i]、 num— reorder_pics[i]和 max— latency— increase [i] , 其中, 语法元素 max— dec— pic— buffering[i] 表示解码所有 temporal— id不超 过 i的子码流所需要的解码图像緩存的大小,语法元素 num— reorder_pics[i] 表示为 temporal— id不超过 i的子码流中倒置图像的数量, i为时间层的标 识; 第三类语法元素, 是指包含每一幅图像所占用存储空间大小信息的语 法元素, 例如表示图像宽度的语法元素 pic— width— in— luma— samples和表示 图像高度的 pic— height— in— luma— samples , 表示图像裁剪信息的语法元素 pic— cropping— flag 、 pic— crop— left— offset 、 pic— crop— right— offset 、 pic— crop— top— offset和 pic— crop— bottom— offset,以及表示每个釆样值各个色 度 分 量 位 宽 的 语 法 元 素 bit— depth— luma— minus 8 和 bit— depth— chroma— minus。 In SPS, the syntax elements related to the temporal layer stream extraction operation can be divided into three categories. In this embodiment, the first type of syntax element, the second type of syntax element, and the third type of syntax element are respectively recorded. The first type of syntax element refers to a syntax element that includes time-level quantity information, such as a syntax element max_temporal-layer-minus 1; the second type of syntax element refers to a code stream characteristic information including each time layer. Syntax elements, such as the syntax elements max-dec-pic-buffering[i], num-reorder_pics[i], and max-latency-increment [i], where the syntax element max_dec_pic_buffering[i] represents decoding all Temporal—the size of the decoded image buffer required for the substream that does not exceed i, the syntax element num_reorder_pics[i] is expressed as the number of inverted images in the substream of temporal_id not exceeding i, i is the time layer The third type of syntax element refers to a syntax element that contains information about the size of the storage space occupied by each image, such as the syntax element pic_width_in-luma-samples representing the width of the image and the pic-height representing the height of the image. In- luma-sample , the syntax element for image cropping information pic- cropping_flag, pic-crop-left-offset, pic-crop-right- Offset , pic — crop — top — offset and pic — crop — bottom — offset , and the syntax element for each chrominance component bit width of each sample value — depth — luma — minus 8 and bit — depth — chroma — minus .
在现有 HEVC 中, 各个时间层均有各自的参数, 这些参数在 SPS 中 传递。 举例来说, 该 SPS可以包括以下语法元素:  In existing HEVC, each time layer has its own parameters, which are passed in the SPS. For example, the SPS can include the following syntax elements:
seq parameter set rbsp( ) { max— temporal— layers— minus 1  Seq parameter set rbsp( ) { max — temporal — layers — minus 1
p ic— width— in— luma— samp le s  p ic— width— in— luma— samp le s
pic— height— in— luma— samples  Pic— height— in— luma— samples
pic— cropping— flag  Pic— cropping — flag
if( pic— cropping— flag ) {  If( pic- cropping-flag ) {
pic— crop— left— offset  Pic— crop—left—offset
pic— crop— right— offset  Pic— crop— right—offset
pic— crop— top— offset  Pic— crop — top—offset
pic— crop— bottom— offset  Pic— crop—bottom—offset
}  }
b it— depth— luma— minus 8 bit— depth— chroma— minus8 b it—depth— luma— minus 8 Bit—depth—chrome— minus8
pcm— enabled— flag  Pcm— enabled— flag
if( pcm— enabled— flag ) {  If( pcm— enabled— flag ) {
pcm— bit— depth— luma— minus 1  Pcm — bit — depth — luma — minus 1
pcm— bit— depth— chroma— minus 1  Pcm — bit — depth — chroma — minus 1
}  }
qpprime—y— zero— transquant— bypass— flag  Qpprime—y— zero— transquant— bypass—flag
log2_max_pic_order_cnt_lsb_minus4  Log2_max_pic_order_cnt_lsb_minus4
for( i = 0; i <= max— temporal— layers— minus 1 ; i++ )  For( i = 0; i <= max—temporal—layer— minus 1 ; i++ )
{  {
max— dec_pic— buffering [ i ]  Max— dec_pic— buffering [ i ]
num— reorder— pics [ i ]  Num— reorder— pics [ i ]
max— latency— increase [ i ]  Max- latency- increase [ i ]
} }  } }
然而, 在现有技术中, 第一类语法元素、 第二类语法元素以及第三类 语法元素在 SPS中的位置相隔较远。 以上述内容举例来说, 表示时间层数 量的语法元素 max— temporal— layers— minus 1和表示每个时间层的码流特征 信息的语法元素 max_dec_pic_buffering[i] , num— reorder— pics [i]以及 max— latency— increase [i]相隔较远, 中间有 4艮多与解码设备或者媒体网关抽 取时间层码流操作无关的语法元素, 例如 pcm— enabled— flag等。 此时, 解 码设备或者媒体网关为抽取每个时间层信息以完成时间层码流抽取操作, 需要额外解析这些无关语法元素, 会降低解码设备或者媒体网关的处理效 率。  However, in the prior art, the first type of syntax elements, the second type of syntax elements, and the third type of syntax elements are located far apart in the SPS. For example, the syntax element max_tempo_layer- minus 1 indicating the number of time layers and the syntax elements max_dec_pic_buffering[i], num_reorder_pics [i] indicating the code stream feature information of each time layer are Max- latency-increase [i] is far apart, and there are more than 4 syntax elements in the middle that are independent of the decoding device or media gateway to extract the time-layer code stream operation, such as pcm-enabled-flag. At this time, the decoding device or the media gateway extracts each time layer information to complete the time layer stream extraction operation, and needs to additionally parse these irrelevant syntax elements, which reduces the processing efficiency of the decoding device or the media gateway.
为解决上述问题, 提高解码设备或者媒体网关抽取每个时间层信息的 处理效率, 本实施例提供如下技术方案:  In order to solve the above problem, the processing efficiency of extracting each time layer information by the decoding device or the media gateway is improved. This embodiment provides the following technical solutions:
编码设备可以将第一类语法元素与第二类语法元素放置在所述码流 中的邻近位置上, 或者将第二类语法元素与第三类语法元素放置在所述码 流中的邻近位置上。 相应地, 解码设备或者媒体网关在提取编码设备生成 的码流后, 可以从该码流中一邻近位置上抽取至少两类语法元素。 The encoding device may place the first type of syntax element and the second type of syntax element in a neighboring position in the code stream, or place the second type of syntax element and the third type of syntax element in a vicinity of the code stream on. Correspondingly, the decoding device or the media gateway generates the extracted encoding device After the code stream, at least two types of syntax elements can be extracted from a neighboring position in the code stream.
在具体实现时, 上述语法元素在码流中的放置方式至少可以釆用下述 四种方式实现:  In a specific implementation, the manner in which the above syntax elements are placed in the code stream can be implemented in at least the following four ways:
第一种方式是, 保持第三类语法元素的位置不动, 调整现有 SPS中的 第一类语法元素位置, 即将第一类语法元素放置在第二类语法元素之前; 釆用第一种方式得到的 SPS中各语法元素位置关系如下所示:  The first way is to keep the position of the third type of syntax element unchanged, and adjust the position of the first type of syntax element in the existing SPS, that is, the first type of syntax element is placed before the second type of syntax element; The positional relationship of each syntax element in the SPS obtained by the method is as follows:
seq parameter set rbsp( ) { pic— width— in— luma— samples  Seq parameter set rbsp( ) { pic— width— in— luma— samples
pic— height— in— luma— samples  Pic— height— in— luma— samples
pic— cropping— flag  Pic— cropping — flag
if( pic— cropping— flag ) {  If( pic- cropping-flag ) {
pic— crop— left— offset  Pic— crop—left—offset
pic— crop— right— offset  Pic— crop— right—offset
pic— crop— top— offset  Pic— crop — top—offset
pic— crop— bottom— offset  Pic— crop—bottom—offset
}  }
bit— depth— luma— minus 8  Bit—depth— luma— minus 8
bit— depth— chroma— minus 8  Bit-depth—chrome— minus 8
pcm— enabled— flag  Pcm— enabled— flag
if( pcm— enabled— flag ) {  If( pcm— enabled— flag ) {
pcm— bit— depth— luma— minus 1  Pcm — bit — depth — luma — minus 1
pcm— bit— depth— chroma— minus 1  Pcm — bit — depth — chroma — minus 1
}  }
qpprime—y— zero— transquant— bypass— flag log2— max— pic— order— cnt—lsb—minus4  Qpprime—y—zero— transquant—bypass—flag log2— max— pic—order— cnt—lsb—minus4
max— temporal— layers— minus 1  Max—tempo—layer— minus 1
for( i = 0; i <= max— temporal— layers— minus 1 ;  For( i = 0; i <= max-temporal- layers- minus 1 ;
i++ ) {  i++ ) {
max_dec_pic_buffering[ i ] num_reorder_p ic s [ i ] Max_dec_pic_buffering[ i ] Num_reorder_p ic s [ i ]
max— latency— increase [ i ]  Max- latency- increase [ i ]
}  }
第二种方式是, 保持第三类语法元素的位置不动, 调整第二类语法元 素位置, 即将第二类语法元素放置在第一类语法元素之后。 The second way is to keep the position of the third type of syntax element and adjust the position of the second type of syntax element, that is, the second type of syntax element is placed after the first type of syntax element.
釆用第二种方式得到的 SPS中各语法元素位置关系如下所示:  The positional relationship of each syntax element in the SPS obtained in the second way is as follows:
seq parameter set rbsp( ) { max— temporal— layers— minus 1  Seq parameter set rbsp( ) { max — temporal — layers — minus 1
for( i = 0; i <= max— temporal— layers— minus 1 ;  For( i = 0; i <= max-temporal- layers- minus 1 ;
i++ ) {  i++ ) {
max— dec— pic— buffering [ i ]  Max—dec—pic— buffering [ i ]
num_reorder_pics[ i ]  Num_reorder_pics[ i ]
max— latency— increase [ i ]  Max- latency- increase [ i ]
}  }
p ic— width— in— luma— samp le s  p ic— width— in— luma— samp le s
pic— height— in— luma— samples  Pic— height— in— luma— samples
pic— cropping— flag  Pic— cropping — flag
if( pic— cropping— flag ) {  If( pic- cropping-flag ) {
pic— crop— left— offset  Pic— crop—left—offset
pic— crop— right— offset  Pic— crop— right—offset
pic— crop— top— offset  Pic— crop — top—offset
pic— crop— bottom— offset  Pic— crop—bottom—offset
}  }
b it— depth— luma— minus 8  b it—depth— luma— minus 8
bit— depth— chroma— minus8  Bit—depth—chrome— minus8
pcm— enabled— flag if( pcm— enabled— flag ) { Pcm— enabled— flag If( pcm— enabled— flag ) {
pcm— bit— depth— luma— minus 1  Pcm — bit — depth — luma — minus 1
pcm— bit— depth— chroma— minus 1  Pcm — bit — depth — chroma — minus 1
}  }
qpprime—y— zero— transquant— bypass— flag  Qpprime—y— zero— transquant— bypass—flag
log2_max_pic_order_cnt_lsb_minus4  Log2_max_pic_order_cnt_lsb_minus4
} }
第三种方式, 保持第三类语法元素的位置不动, 将第一类语法元素和 第二类语法元素均调整到中间位置, 如将第一类语法元素和第二类语法元 素均调整到第三类语法元素之后。  The third way is to keep the position of the third type of syntax element unchanged, and adjust the first type of syntax element and the second type of syntax element to the middle position, such as adjusting the first type of syntax element and the second type of syntax element to After the third type of syntax element.
釆用第三种方式得到的 SPS中各语法元素位置关系如下所示:  The positional relationship of each syntax element in the SPS obtained in the third way is as follows:
seq parameter set rbsp( ) { pic— width— in— luma— samples  Seq parameter set rbsp( ) { pic— width— in— luma— samples
pic— height— in— luma— samples  Pic— height— in— luma— samples
pic— cropping— flag  Pic— cropping — flag
if( pic— cropping— flag ) {  If( pic- cropping-flag ) {
pic— crop— left— offset  Pic— crop—left—offset
pic— crop— right— offset  Pic— crop— right—offset
pic— crop— top— offset  Pic— crop — top—offset
pic— crop— bottom— offset  Pic— crop—bottom—offset
}  }
bit— depth— luma— minus 8  Bit—depth— luma— minus 8
bit— depth— chroma— minus 8  Bit-depth—chrome— minus 8
max— temporal— layers— minus 1  Max—tempo—layer— minus 1
for( i = 0; i <= max— temporal— layers— minus 1 ;  For( i = 0; i <= max-temporal- layers- minus 1 ;
i++ ) {  i++ ) {
max_dec_pic_buffering[ i ]  Max_dec_pic_buffering[ i ]
num_reorder_p ic s [ i ] max— latency— increase [ i ] Num_reorder_p ic s [ i ] Max- latency- increase [ i ]
}  }
pcm— enabled— flag  Pcm— enabled— flag
if( pcm— enabled— flag ) {  If( pcm— enabled— flag ) {
pcm— bit— depth— luma— minus 1  Pcm — bit — depth — luma — minus 1
pcm— bit— depth— chroma— minus 1  Pcm — bit — depth — chroma — minus 1
}  }
qpprime—y— zero— transquant— bypass— flag log2— max— pic— order— cnt—lsb—minus4  Qpprime—y—zero— transquant—bypass—flag log2— max— pic—order— cnt—lsb—minus4
} }
第四种方式, 是将第三类语法元素放置在第一类语法元素之后, 而将 第二类语法元素放置在第三类语法元素之后。  The fourth way is to place the third type of syntax element after the first type of syntax element and the second type of syntax element after the third type of syntax element.
釆用第四种方式得到的 SPS中各语法元素位置关系如下所示:  The positional relationship of each syntax element in the SPS obtained in the fourth way is as follows:
seq parameter set rbsp( ) { max— temporal— layers— minus 1  Seq parameter set rbsp( ) { max — temporal — layers — minus 1
pic— width— in— luma— samples  Pic— width—in— luma— samples
pic— height— in— luma— samples  Pic— height— in— luma— samples
pic— cropping— flag  Pic— cropping — flag
if( pic— cropping— flag ) {  If( pic- cropping-flag ) {
pic— crop— left— offset  Pic— crop—left—offset
pic— crop— right— offset  Pic— crop— right—offset
pic— crop— top— offset  Pic— crop — top—offset
pic— crop— bottom— offset  Pic— crop—bottom—offset
}  }
bit— depth— luma— minus 8  Bit—depth— luma— minus 8
bit— depth— chroma— minus 8  Bit-depth—chrome— minus 8
for( i = 0; i <= max— temporal— layers— minus 1 ;  For( i = 0; i <= max-temporal- layers- minus 1 ;
i++ ) { max— dec_pic— buffering [ i ] i++ ) { Max— dec_pic— buffering [ i ]
num— reorder— pics [ i ]  Num— reorder— pics [ i ]
max— latency— increase [ i ]  Max- latency- increase [ i ]
}  }
pcm— enabled— flag  Pcm— enabled— flag
if( pcm— enabled— flag ) {  If( pcm— enabled— flag ) {
pcm— bit— depth— luma— minus 1  Pcm — bit — depth — luma — minus 1
pcm— bit— depth— chroma— minus 1  Pcm — bit — depth — chroma — minus 1
}  }
qpprime—y— zero— transquant— bypass— flag log2— max— pic— order— cnt—lsb—minus4  Qpprime—y—zero— transquant—bypass—flag log2— max— pic—order— cnt—lsb—minus4
} }
本实施例中, 解码设备或者媒体网关可以提取编码设备生成的码流, 该码流中, 第一类语法元素与第二类语法元素放置在所述码流中的邻近位 置上, 或者第二类语法元素与第三类语法元素放置在所述码流中的邻近位 置上, 因此, 解码设备或者媒体网关在抽取每个时间层信息时, 无需额外 解析其他无关的语法元素, 而是可以连续地将每个时间层信息的全部语法 元素都抽取出来, 从而提高了时间层信息的处理效率。  In this embodiment, the decoding device or the media gateway may extract a code stream generated by the encoding device, where the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second The class syntax element and the third class syntax element are placed in the adjacent position in the code stream. Therefore, the decoding device or the media gateway does not need to additionally parse other irrelevant syntax elements when extracting each time layer information, but may continuously All the syntax elements of each time layer information are extracted, thereby improving the processing efficiency of the time layer information.
图 4为本发明视频图像码流处理方法实施例四的流程图,如图 4所示, 本实施例的方法可以包括:  4 is a flowchart of Embodiment 4 of a video image code stream processing method according to the present invention. As shown in FIG. 4, the method in this embodiment may include:
步骤 401 、 提取编码设备产 生 的 码 流 中 的 语 法元素 max— dec— pic— buffering— diff[i]和 num— reorder— pics— diff[i] , 其中, 语法元素 max— dec— pic— buffering— diff[i]表示解码时间层标识不超过 i 的子码流所需 要的解码图像緩存的大小与解码时间层标识不超过 i-1 的子码流所需要的 解码图像緩存的大小的差值, 语法元素 num— reorder— pics— diff[i]表示时间 层标识不超过 i的子码流中倒置图像的数量与时间层标识不超过 i-1 的子 码流中倒置图像的数量的差值;  Step 401: Extract syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, where the syntax element max_dec_pic_buffering- Diff[i] represents the difference between the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i and the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i-1, The syntax element num_reorder_pics_diff[i] represents the difference between the number of inverted images in the subcode stream whose time layer identifier does not exceed i and the number of inverted images in the subcode stream whose time layer identifier does not exceed i-1;
步骤 402、 若 i 为 0 , 釆用下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]: MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i] ; Step 402: If i is 0, obtain MaxDecPicBuffering[i] and NumReorderPics[i] in the following manner: MaxDecPicBuffering[i] = max— dec—pic—buffering—diff[i] ;
NumReorderPics[i] = num_reorder_pics_diff [i];  NumReorderPics[i] = num_reorder_pics_diff [i];
步骤 403、 若 i大于 0, 釆用下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  Step 403: If i is greater than 0, obtain MaxDecPicBuffering[i] and NumReorderPics[i] in the following manner:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff[i];  +max— dec— pic— buffering— diff[i];
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_p ic s_dif f [i] ; 其中, MaxDecPicBuffering[i]表示假设参考解码器 ( Hypothetical Reference Decoder, 以下简称: HRD ) 的 DPB针对第 i个时间层码流所要 求的存储空间大小, NumReorderPics [i]表示第 i个时间层码流中的时序倒 置图像的最大数量。  NumReorderPics[i] = NumReorderPics [i-1] + num_reorder_p ic s_dif f [i] ; where MaxDecPicBuffering[i] indicates that the DPB of the Hypothetical Reference Decoder (HRD) is for the i-th time-layer code. The size of the storage space required by the stream, NumReorderPics [i] represents the maximum number of time-series inverted images in the i-th temporal layer stream.
所述第 i个时间层码流与所述时间层标识不超过 i的子码流含义相同, 指去除原码流中所有 temporal— id大于 i的 NALU所得到的子码流。  The i-th time-layer code stream has the same meaning as the sub-code stream whose time-layer identifier does not exceed i, and refers to the sub-code stream obtained by removing all NALUs whose temporal-id is greater than i in the original code stream.
由上述图 3 所示方法实施例三中的描述可知, 语法元素 max_dec_pic_buffering[i] 表示解码所有 temporal— id不超过 i的子码流所需 要的解码图像緩存的大小, 因此, 语法元素 max— dec— pic— buffering^]的值 是非减的, 即 max— dec_pic— buffering[i]≥max— dec— pic— buffering[i-l]。 也即, 不会出现解码 temporal— id不超过 i的图像所需要的解码图像緩存大小,小 于解码 temporal— id不超过 i- 1的图像所需要的解码图像緩存大小的情况。 同理,语法元素 num_reorder_pics [i] 表示 temporal— id不超过 i的子码流中 倒置图像的数量, 因此, 语法元素 num— reorder— pics[i]的值也是非减的。  As can be seen from the description in the third embodiment of the method shown in FIG. 3 above, the syntax element max_dec_pic_buffering[i] represents the size of the decoded image buffer required to decode all sub-streams whose temporal-id does not exceed i, and therefore, the syntax element max_dec The value of – pic—buffering^] is non-decreasing, ie max—dec_pic—buffering[i]≥max—dec—pic—buffering[il]. That is, there is no case where the decoded image buffer size required to decode an image whose temporal_id does not exceed i is smaller than the decoded image buffer size required to decode an image whose temporal_id does not exceed i-1. Similarly, the syntax element num_reorder_pics [i] indicates that the temporal_id does not exceed the number of inverted images in the substream of i, so the value of the syntax element num_reorder_pics[i] is also non-decreasing.
因此, 本实施例利用语法元素 max— dec_pic— buffering [i]的值和语法元 素 num— reorder_pics[i]的值均是非减的这一特性, 对这两个语法元素进行 差分编码,以减少表示 SPS中的语法元素 max— dec_pic— buffering和语法元 素 num_reorder_pics所需要的比特, 从而提高压缩效率。  Therefore, in this embodiment, the value of the syntax element max_dec_pic_buffering [i] and the value of the syntax element num_reorder_pics[i] are both non-subtractive, and the two syntax elements are differentially encoded to reduce the representation. The syntax elements max- dec_pic- buffering in the SPS and the bits required by the syntax element num_reorder_pics improve the compression efficiency.
具体地实施方式可以为:  The specific implementation may be:
使用语法元素 max— dec_pic— buffering— diff[i]和 num_reorder_pics_diff[i] 分另 ll替代现有的语法元素 max— dec_pic— buffering [i]和 num— reorder— pics[i]。  Use the syntax elements max_dec_pic-buffering-diff[i] and num_reorder_pics_diff[i] to replace the existing syntax elements max- dec_pic-buffering [i] and num-reorder-pics[i].
使用变量 MaxDecPicBuffering[i]表示 HRD的 DPB针对第 i个时间层 码流所要求的存储空间大小, 该存储空间大小以图像存储空间大小为基本 单元计算, 例如该 MaxDecPicBuffering[i]=2表示 DPB存储空间大小为 2 幅图像所需的存储空间大小。 使用变量 NumReorderPics[i]表示第 i个时间 层码流中的时序倒置图像的最大数量。 Use the variable MaxDecPicBuffering[i] to indicate the size of the storage space required by the DPB of the HRD for the i-th time-layer code stream. The size of the storage space is based on the size of the image storage space. The unit calculation, for example, the MaxDecPicBuffering[i]=2 indicates the amount of storage space required for the DPB storage space to be 2 images. The variable NumReorderPics[i] is used to represent the maximum number of time-series inverted images in the i-th temporal layer code stream.
当 i为 0时:  When i is 0:
MaxDecPicBuffering[i]= max— dec_pic— buffering— diff[i] MaxDecPicBuffering[i]= max— dec_pic— buffering— diff[i]
NumReorderPics[i] = num_reorder_pics_diff [i]; NumReorderPics[i] = num_reorder_pics_diff [i];
当 i>0时:  When i>0:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff [i]  +max— dec— pic— buffering— diff [i]
NumReorderPics[i]=NumReorderPics[i- 1 ]+num_reorder_pics_diff [i] 本实施例中, 利用表示解码时间层标识不超过 i的子码流所需要的解 码图像緩存的大小的语法元素的值和表示时间层标识不超过 i的子码流中 倒置图像的数量的语法元素的值均是非减的这一特性, 对这两个语法元素 进行差分编码, 以减少表示 SPS中的语法元素所需要的比特, 从而提高压 缩效率。  NumReorderPics[i]=NumReorderPics[i-1]+num_reorder_pics_diff [i] In this embodiment, the value and the presentation time of the syntax element indicating the size of the decoded image buffer required for the sub-code stream not exceeding i are represented by the decoding time layer identifier. The value of the syntax element whose layer identifier does not exceed the number of inverted images in the sub-stream of i is non-decreasing, and the two syntax elements are differentially encoded to reduce the bits required to represent the syntax elements in the SPS. Thereby improving the compression efficiency.
本发明解码设备实施例一中, 解码设备可以包括: 提取模块, 该提取 模块, 用于提取编码设备产生的码流, 所述码流中包含纯净随机访问 CRA 图像信息所述 CRA图像信息满足 CRA图像的约束条件且所述约束条件包 括 CRA图像必须为参考图像。  In the first embodiment of the decoding device of the present invention, the decoding device may include: an extracting module, configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, and the CRA image information satisfies CRA Constraints of the image and the constraints include that the CRA image must be a reference image.
在具体实现时, 提取模块, 具体用于提取所述编码设备产生的网络抽 象层单元 NALU,所述 NALU的头字段中的语法元素 nal— unit— type指示该 NALU中包含 CRA图像信息, 所述约束条件, 包括: 所述 NALU的头字 段中的语法元素 nal— ref— flag必须为 1。  In a specific implementation, the extracting module is specifically configured to extract a network abstraction layer unit NALU generated by the encoding device, where a syntax element nal_unit_type in the header field of the NALU indicates that the NALU includes CRA image information, where The constraint includes: the syntax element nal_ref_flag in the header field of the NALU must be 1.
本实施例的解码设备用于执行图 1所示方法实施例的技术方案, 其实 现原理和技术效果类似, 此处不再赘述。  The decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 1. The principle and technical effects are similar, and details are not described herein again.
图 5为本发明解码设备实施例二的结构示意图, 如图 5所示, 本实施 例的解码设备, 可以包括: 第一确定模块 21、 处理模块 22以及第二确定 模块 23 , 其中, 第一确定模块 21 , 用于确定激活图像中依次解码的图像 的图片顺序编号 POC是否按照非降序排列, 所述激活图像包括当前图像 以及解码图像緩存 DPB 中所有被标记用作短期参考的图像和等待输出的 图像; 处理模块 22, 用于若是按照非降序排列, 则釆用如下方法确定所述 当前图像的 POC重要比特位 POCMsb: 从码流中获取当前图像的 POC非 重要比特位 POCLsb; 如果所述当前图像的 POCLsb 小于之前图像的 POCLsb,则当前图像的 POCMsb等于之前图像的 POCMsb与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽; 否则, 所述当前图像 的 POCMsb等于 prevPicOrderCntMsb; 第二确定模块 23 ,用于根据所述当 前图像的 POCMsb 以及所述当前图像的 POCLsb, 确定所述当前图像的 POC。 FIG. 5 is a schematic structural diagram of Embodiment 2 of a decoding device according to the present invention. As shown in FIG. 5, the decoding device in this embodiment may include: a first determining module 21, a processing module 22, and a second determining module 23, where a determining module 21, configured to determine whether a picture sequence number POC of the sequentially decoded images in the active image is arranged in a non-descending order, the active image including the current image and all images in the decoded image buffer DPB that are marked for short-term reference and waiting for output of An image processing module 22, configured to determine a POC important bit POCMsb of the current image by using a method of: following a non-descending order: obtaining a POC non-significant bit POCLsb of the current image from the code stream; If the POCLsb of the image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of the POCMsb of the previous image and the maximum POCLsb, wherein the maximum POCLsb=2 N , N is the bit width of the POCLsb; otherwise, the POCMsb of the current image is equal to prevPicOrderCntMsb; The second determining module 23 is configured to determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
在具体实现时, 第一确定模块 21 , 具体用于根据序列参数集 SPS 中 表示时序倒置图像的最大数量的语法元素 num— reorder_pics的取值来确定 所述激活图像中依次解码的图像的 POC是否按照非降序排列。  In a specific implementation, the first determining module 21 is specifically configured to determine, according to the value of the syntax element num_reorder_pics indicating the maximum number of time-stamped images in the sequence parameter set SPS, whether the POC of the sequentially decoded image in the activated image is Arranged in non-descending order.
本实施例的解码设备用于执行图 2所示方法实施例的技术方案, 其实 现原理和技术效果类似, 此处不再赘述。  The decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 2, and the principle and the technical effect are similar, and details are not described herein again.
图 6为本发明处理设备实施例的结构示意图, 如图 6所示, 本实施例 的设备, 可以包括: 提取模块 31、 抽取模块 32, 其中, 提取模块 31 , 用 于提取编码设备产生的码流, 其中, 第一类语法元素与第二类语法元素放 置在所述码流中的邻近位置上, 或者第二类语法元素与第三类语法元素放 置在所述码流中的邻近位置上, 所述第一类语法元素是指包含时间层数量 信息的语法元素, 所述第二类语法元素是指包含每个时间层的码流特征信 息的语法元素, 所述第三类语法元素是指包含每一幅图像所占用存储空间 大小信息的语法元素; 抽取模块 32, 用于从所述码流的所述邻近位置上抽 取至少两类语法元素。  FIG. 6 is a schematic structural diagram of an embodiment of a processing device according to the present invention. As shown in FIG. 6, the device in this embodiment may include: an extracting module 31, and an extracting module 32, where the extracting module 31 is configured to extract a code generated by the encoding device. a stream, wherein the first type of syntax element and the second type of syntax element are placed in a neighboring position in the code stream, or the second type of syntax element and the third type of syntax element are placed in a neighboring position in the code stream The first type of syntax element refers to a syntax element that includes time layer quantity information, and the second type of syntax element refers to a syntax element that includes code stream feature information of each time layer, and the third type of syntax element is Refers to a syntax element that contains information on the size of the storage space occupied by each image; an extraction module 32 is configured to extract at least two types of syntax elements from the neighboring locations of the code stream.
在具体实现时, 第二类语法元素和第三类语法元素在所述码流中的位 置不变, 所述第一类语法元素放置在所述第二类语法元素之前; 或者, 所 述第一类语法元素和第三类语法元素在所述码流中的位置不变, 所述第二 类语法元素放置在所述第一类语法元素之后; 或者, 所述第三类语法元素 在所述码流中的位置不变, 所述第一类语法元素和第二类语法元素放置在 所述码流的中间位置; 或者, 所述第三类语法元素放置在所述第一类语法 元素之后, 所述第二类语法元素放置在所述第三类语法元素之后。  In a specific implementation, the positions of the second type of syntax element and the third type of syntax element in the code stream are unchanged, and the first type of syntax element is placed before the second type of syntax element; or, the The position of a class of syntax elements and a class of class 3 syntax elements in the code stream is unchanged, the second class of syntax elements are placed after the first class of syntax elements; or the third class of syntax elements are The position in the code stream is unchanged, the first class syntax element and the second class syntax element are placed in an intermediate position of the code stream; or the third class syntax element is placed in the first class syntax element Thereafter, the second type of syntax element is placed after the third type of syntax element.
而且, 本实施例的处理设备可以是解码设备, 例如终端设备中的解码 器, 也可以是媒体网关。 Moreover, the processing device of this embodiment may be a decoding device, such as decoding in a terminal device. It can also be a media gateway.
本实施例的处理设备用于执行图 3所示方法实施例的技术方案, 其实 现原理和技术效果类似, 此处不再赘述。  The processing device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 3, and the principle and the technical effect are similar, and details are not described herein again.
图 7为本发明解码设备实施例三的结构示意图, 如图 7所示, 本实施 例的设备, 可以包括: 提取模块 41、 差分解码模块 42, 其中, 提取模块 41 , 用 于 提 取 编 码 设 备 产 生 的 码 流 中 的 语 法 元 素 max— dec— pic— buffering— diff[i]和 num— reorder— pics— diff[i] , 其中, 语法元素 max— dec— pic— buffering— diff[i]表示解码时间层标识不超过 i 的子码流所需 要的解码图像緩存的大小与解码时间层标识不超过 i-1 的子码流所需要的 解码图像緩存的大小的差值, 语法元素 num— reorder— pics— diff[i]表示时间 层标识不超过 i的子码流中倒置图像的数量与时间层标识不超过 i-1 的子 码流中倒置图像的数量的差值; 差分解码模块 42, 用于若 i为 0, 釆用下 述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  FIG. 7 is a schematic structural diagram of Embodiment 3 of a decoding device according to the present invention. As shown in FIG. 7, the device in this embodiment may include: an extracting module 41 and a differential decoding module 42, where the extracting module 41 is configured to extract an encoding device. The syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream, where the syntax element max_dec_pic_buffering_diff[i] represents the decoding time The difference between the size of the decoded image buffer required for the sub-code stream whose layer identifier does not exceed i and the size of the decoded picture buffer required for the sub-code stream whose decoding time layer identifier does not exceed i-1, syntax element num_reorder_pics – diff[i] represents the difference between the number of inverted pictures in the sub-code stream whose time layer identifier does not exceed i and the number of inverted pictures in the sub-code stream whose time layer identifier does not exceed i-1; differential decoding module 42, for If i is 0, Max get MaxDecPicBuffering[i] and NumReorderPics[i] in the following way:
MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];  MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];
NumReorderPics[i] = num_reorder_pics_diff[i];  NumReorderPics[i] = num_reorder_pics_diff[i];
若 i 大于 0 , 釆用 下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is greater than 0, use MaxDecPicBuffering[i] and NumReorderPics[i] as follows:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff[i];  +max— dec— pic— buffering— diff[i];
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_p ic s_diff [i] ; 其中, MaxDecPicBuffering[i]表示假设参考解码器 HRD的 DPB针对 第 i个时间层码流所要求的存储空间大小, NumReorderPics[i]表示第 i个 时间层码流中的时序倒置图像的最大数量。 所述第 i个时间层码流与所述 时间层标识不超过 i的子码流含义相同, 指去除原码流中所有 temporal— id 大于 i的 NALU所得到的子码流。  NumReorderPics[i] = NumReorderPics [i-1] + num_reorder_p ic s_diff [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required by the DPB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics[ i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream. The i-th time-layer code stream has the same meaning as the sub-code stream whose time-layer identifier does not exceed i, and refers to the sub-code stream obtained by removing all NALUs whose temporal-id is greater than i in the original code stream.
本实施例的解码设备用于执行图 4所示方法实施例的技术方案, 其实 现原理和技术效果类似, 此处不再赘述。  The decoding device of this embodiment is used to perform the technical solution of the method embodiment shown in FIG. 4, and the principle and the technical effect are similar, and details are not described herein again.
本领域普通技术人员可以理解: 实现上述各方法实施例的全部或部分 步骤可以通过程序指令相关的硬件来完成。 前述的程序可以存储于一计算 机可读取存储介质中。 该程序在执行时, 执行包括上述各方法实施例的步 骤; 而前述的存储介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存 储程序代码的介质。 It will be understood by those skilled in the art that all or part of the steps of implementing the above method embodiments may be performed by hardware related to the program instructions. The aforementioned program can be stored in a computer readable storage medium. When the program is executed, the steps including the above method embodiments are performed. The foregoing storage medium includes: a medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.
最后应说明的是: 以上各实施例仅用以说明本发明的技术方案, 而非 对其限制; 尽管参照前述各实施例对本发明进行了详细的说明, 本领域的 普通技术人员应当理解: 其依然可以对前述各实施例所记载的技术方案进 行修改, 或者对其中部分或者全部技术特征进行等同替换; 而这些修改或 者替换, 并不使相应技术方案的本质脱离本发明各实施例技术方案的范 围。  It should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art The technical solutions described in the foregoing embodiments may be modified, or some or all of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the technical solutions of the embodiments of the present invention. range.

Claims

权 利 要 求 书 Claim
1、 一种视频图像码流处理方法, 其特征在于, 包括: A video image code stream processing method, comprising:
提取编码设备产生的码流, 所述码流中包含纯净随机访问 CRA 图像 信息, 所述 CRA图像信息满足 CRA图像的约束条件且所述约束条件包括 CRA图像必须为参考图像。  A code stream generated by the encoding device is extracted, the code stream containing pure random access CRA image information, the CRA image information satisfying a constraint condition of the CRA image and the constraint condition includes the CRA image must be a reference image.
2、 根据权利要求 1 所述的方法, 其特征在于, 所述提取编码设备产 生的码流, 所述码流中包含 CRA图像信息, 包括:  The method according to claim 1, wherein the extracting the code stream generated by the encoding device, and the code stream includes CRA image information, including:
提取所述编码设备产生的网络抽象层单元 NALU,所述 NALU的头字 段中的语法元素 nal— unit— type指示该 NALU中包含 CRA图像信息;  Extracting a network abstraction layer unit NALU generated by the encoding device, and a syntax element nal_unit_type in the header field of the NALU indicates that the NALU includes CRA image information;
所述约束条件, 包括:  The constraints include:
所述 NALU的头字段中的语法元素 nal— ref— flag必须为 1。  The syntax element nal_ref_flag in the header field of the NALU must be 1.
3、 一种视频图像码流处理方法, 其特征在于, 包括:  3. A video image code stream processing method, comprising:
确定依次解码的图像的图片顺序编号 POC是否按照非降序排列; 若是, 则釆用如下方法确定所述当前图像的 POC 重要比特位 POCMsb:  Determining whether the picture sequence number POC of the sequentially decoded pictures is arranged in non-descending order; if so, determining the POC important bits POCMsb of the current picture by the following method:
从码流中获取当前图像的 POC非重要比特位 POCLsb;  Obtaining a POC non-significant bit of the current image from the code stream POCLsb;
如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像 的 POCMsb 等于之前图像的 POCMsb 与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽;  If the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of POCMsb and the maximum POCLsb of the previous image, wherein the maximum POCLsb=2N, where N is the bit width of the POCLsb;
否则, 所述当前图像的 POCMsb等于之前图像的 POCMsb;  Otherwise, the POCMsb of the current image is equal to the POCMsb of the previous image;
根据所述当前图像的 POCMsb以及所述当前图像的 POCLsb, 确定所 述当前图像的 POC。  The POC of the current image is determined according to the POCMsb of the current image and the POCLsb of the current image.
4、 根据权利要求 3 所述的方法, 其特征在于, 所述确定依次解码的 图像的 POC是否按照非降序排列, 包括:  The method according to claim 3, wherein the determining whether the POCs of the sequentially decoded images are arranged in non-descending order comprises:
根据序列参数集 SPS 中表示时序倒置图像的最大数量的语法元素 num— reorder— pics的取值来确定依次解码的图像的 POC是否按照非降序排 歹 |J。  According to the value of the syntax element num_reorder_pics indicating the maximum number of time-stamped images in the sequence parameter set SPS, it is determined whether the POCs of the sequentially decoded images are arranged in non-descending order 歹 |J.
5、 根据权利要求 3或 4所述的方法, 其特征在于, 激活图像中任意 两幅图像的 POC 差值的大小没有约束条件, 所述激活图像包括所述当前 图像以及解码图像緩存 DPB 中所有被标记用作短期参考的图像和等待输 出的图像。 The method according to claim 3 or 4, wherein the size of the POC difference value of any two images in the activated image has no constraint, and the activated image includes all of the current image and the decoded image buffer DPB. Images marked for short-term reference and waiting for input Out of the image.
6、 一种视频图像码流处理方法, 其特征在于, 包括:  6. A video image code stream processing method, comprising:
提取编码设备产生的码流, 其中, 第一类语法元素与第二类语法元素 放置在所述码流中的邻近位置上, 或者第二类语法元素与第三类语法元素 放置在所述码流中的邻近位置上, 所述第一类语法元素是指包含时间层数 量信息的语法元素, 所述第二类语法元素是指包含每个时间层的码流特征 信息的语法元素, 所述第三类语法元素是指包含每一幅图像所占用存储空 间大小信息的语法元素;  Extracting a code stream generated by the encoding device, wherein the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second class syntax element and the third class syntax element are placed in the code In a neighboring position in the stream, the first type of syntax element refers to a syntax element including time layer quantity information, and the second type of syntax element refers to a syntax element that includes code stream feature information of each time layer, The third type of syntax element refers to a syntax element that contains information about the size of the storage space occupied by each image;
从所述码流的所述邻近位置上抽取至少两类语法元素。  At least two types of syntax elements are extracted from the neighboring locations of the codestream.
7、 根据权利要求 6所述的方法, 其特征在于, 所述第二类语法元素 和第三类语法元素在所述码流中的位置不变, 所述第一类语法元素放置在 所述第二类语法元素之前; 或者,  The method according to claim 6, wherein the positions of the second class syntax element and the third class syntax element in the code stream are unchanged, and the first class syntax element is placed in the Before the second type of syntax element; or,
所述第一类语法元素和第三类语法元素在所述码流中的位置不变, 所 述第二类语法元素放置在所述第一类语法元素之后; 或者,  The position of the first type of syntax element and the third type of syntax element in the code stream is unchanged, and the second type of syntax element is placed after the first type of syntax element; or
所述第三类语法元素在所述码流中的位置不变, 所述第一类语法元素 和第二类语法元素放置在所述码流的中间位置; 或者,  The position of the third type of syntax element in the code stream is unchanged, and the first type of syntax element and the second type of syntax element are placed in an intermediate position of the code stream; or
所述第三类语法元素放置在所述第一类语法元素之后, 所述第二类语 法元素放置在所述第三类语法元素之后。  The third type of syntax element is placed after the first type of syntax element, and the second type of syntax element is placed after the third type of syntax element.
8、 一种视频图像码流处理方法, 其特征在于, 包括:  8. A video image code stream processing method, comprising:
提取编码设备产生的码流中的语法元素 max_dec_pic_buffering_diff[i] 和 num— reorder_pics— diff[i] , 其中,语法元素 max— dec— pic— buffering— diff[i] 表示解码时间层标识不超过 i的子码流所需要的解码图像緩存的大小与解 码时间层标识不超过 i-1 的子码流所需要的解码图像緩存的大小的差值, 语法元素 num— reorder_pics— diff[i]表示时间层标识不超过 i 的子码流中倒 置图像的数量与时间层标识不超过 i-1 的子码流中倒置图像的数量的差 值;  Extracting syntax elements max_dec_pic_buffering_diff[i] and num_reorder_pics_diff[i] in the code stream generated by the encoding device, where the syntax element max_dec_pic_buffering_diff[i] indicates that the decoding time layer identifier does not exceed i The size of the decoded image buffer required by the substream is different from the size of the decoded image buffer required by the decoding time layer identifier not exceeding i-1, and the syntax element num_reorder_pics_diff[i] represents the time layer. The difference between the number of inverted images in the substream that does not exceed i and the number of inverted images in the substream that does not exceed i-1;
若 i 为 0 , 釆用 下 述方 式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is 0, then MaxDecPicBuffering[i] and NumReorderPics[i] are obtained in the following way:
MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];  MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];
NumReorderPics[i] = num_reorder_pics_diff[i]; 若 i 大于 0 , 釆用 下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]: NumReorderPics[i] = num_reorder_pics_diff[i]; If i is greater than 0, get MaxDecPicBuffering[i] and NumReorderPics[i] in the following way:
MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]  MaxDecPicBuffering[i]=MaxDecPicBuffering[i-l]
+max— dec— pic— buffering— diff[i];  +max— dec— pic— buffering— diff[i];
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_p ic s_diff [i]; 其中, MaxDecPicBuffering[i]表示假设参考解码器 HRD的 DPB针对 第 i个时间层码流所要求的存储空间大小, NumReorderPics[i]表示第 i个 时间层码流中的时序倒置图像的最大数量。  NumReorderPics[i] = NumReorderPics [i-1] + num_reorder_p ic s_diff [i]; where MaxDecPicBuffering[i] represents the size of the storage space required by the DPB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics[ i] represents the maximum number of time-series inverted images in the i-th temporal layer code stream.
9、 一种解码设备, 其特征在于, 包括:  9. A decoding device, comprising:
提取模块, 用于提取编码设备产生的码流, 所述码流中包含纯净随机 访问 CRA图像信息所述 CRA图像信息满足 CRA图像的约束条件且所述 约束条件包括 CRA图像必须为参考图像。  And an extracting module, configured to extract a code stream generated by the encoding device, where the code stream includes pure random access CRA image information, the CRA image information satisfies a constraint condition of the CRA image, and the constraint includes the CRA image must be a reference image.
10、 根据权利要求 9所述的设备, 其特征在于, 所述提取模块, 具体 用于提取所述编码设备产生的网络抽象层单元 NALU,所述 NALU的头字 段中的语法元素 nal— unit— type指示该 NALU中包含 CRA图像信息;  The device according to claim 9, wherein the extracting module is specifically configured to extract a network abstraction layer unit NALU generated by the encoding device, and a syntax element nal_unit in a header field of the NALU. Type indicates that the NALU contains CRA image information;
所述约束条件, 包括: 所述 NALU的头字段中的语法元素 nal— ref— flag 必须为 1。  The constraint includes: the syntax element nal_ref_flag in the header field of the NALU must be 1.
11、 一种解码设备, 其特征在于, 包括:  A decoding device, comprising:
第一确定模块, 用于确定依次解码的图像的图片顺序编号 POC是否 按照非降序排列;  a first determining module, configured to determine whether a picture sequence number POC of the sequentially decoded images is arranged in a non-descending order;
处理模块, 用于若是按照非降序排列, 则釆用如下方法确定所述当前 图像的 POC重要比特位 POCMsb:  The processing module is configured to determine POCsb of the current image by using the following method if it is arranged in non-descending order:
从码流中获取当前图像的 POC非重要比特位 POCLsb;  Obtaining a POC non-significant bit of the current image from the code stream POCLsb;
如果所述当前图像的 POCLsb小于之前图像的 POCLsb, 则当前图像 的 POCMsb 等于之前图像的 POCMsb 与最大 POCLsb 之和, 其中最大 POCLsb=2N, N为 POCLsb的位宽;  If the POCLsb of the current image is smaller than the POCLsb of the previous image, the POCMsb of the current image is equal to the sum of POCMsb and the maximum POCLsb of the previous image, wherein the maximum POCLsb=2N, where N is the bit width of the POCLsb;
否则, 所述当前图像的 POCMsb等于之前图像的 POCMsb;  Otherwise, the POCMsb of the current image is equal to the POCMsb of the previous image;
第二确定模块,用于根据所述当前图像的 POCMsb以及所述当前图像 的 POCLsb, 确定所述当前图像的 POC。  And a second determining module, configured to determine a POC of the current image according to the POCMsb of the current image and the POCLsb of the current image.
12、 根据权利要求 11所述的设备, 其特征在于, 所述第一确定模块, 具体用于根据序列参数集 SPS 中表示时序倒置图像的最大数量的语法元 素 num— reorder_pics的取值来确定所述激活图像中图像的 POC是否按照非 降序排列。 The device according to claim 11, wherein the first determining module, Specifically, it is determined whether the POC of the image in the activated image is arranged in a non-descending order according to the value of the syntax element num_reorder_pics indicating the maximum number of time-stamped images in the sequence parameter set SPS.
13、 根据权利要求 1 1或 12所述的设备, 其特征在于, 所述激活图像 中任意两幅图像的 POC 差值的大小没有约束条件, 所述激活图像包括所 述当前图像以及解码图像緩存 DPB 中所有被标记用作短期参考的图像和 等待输出的图像。  The device according to claim 1 or 12, wherein the size of the POC difference value of any two images in the activated image has no constraint, and the activated image includes the current image and the decoded image buffer. All images in the DPB that are marked for short-term reference and images that are waiting for output.
14、 一种处理设备, 其特征在于, 包括:  14. A processing device, comprising:
提取模块, 用于提取编码设备产生的码流, 其中, 第一类语法元素与 第二类语法元素放置在所述码流中的邻近位置上, 或者第二类语法元素与 第三类语法元素放置在所述码流中的邻近位置上, 所述第一类语法元素是 指包含时间层数量信息的语法元素, 所述第二类语法元素是指包含每个时 间层的码流特征信息的语法元素, 所述第三类语法元素是指包含每一幅图 像所占用存储空间大小信息的语法元素;  An extraction module, configured to extract a code stream generated by the encoding device, where the first class syntax element and the second class syntax element are placed in a neighboring position in the code stream, or the second class syntax element and the third class syntax element Placed in a neighboring position in the code stream, the first type of syntax element refers to a syntax element including time layer quantity information, and the second type of syntax element refers to a code stream characteristic information including each time layer. a syntax element, the third type of syntax element is a syntax element that includes information about the size of the storage space occupied by each image;
抽取模块, 用于从所述码流的所述邻近位置上抽取至少两类语法元 素。  And an extracting module, configured to extract at least two types of grammatical elements from the neighboring locations of the code stream.
15、 根据权利要求 14所述的设备, 其特征在于, 所述第二类语法元 素和第三类语法元素在所述码流中的位置不变, 所述第一类语法元素放置 在所述第二类语法元素之前; 或者,  The device according to claim 14, wherein the positions of the second class syntax element and the third class syntax element in the code stream are unchanged, and the first class syntax element is placed in the Before the second type of syntax element; or,
所述第一类语法元素和第三类语法元素在所述码流中的位置不变, 所 述第二类语法元素放置在所述第一类语法元素之后; 或者,  The position of the first type of syntax element and the third type of syntax element in the code stream is unchanged, and the second type of syntax element is placed after the first type of syntax element; or
所述第三类语法元素在所述码流中的位置不变, 所述第一类语法元素 和第二类语法元素放置在所述码流的中间位置; 或者,  The position of the third type of syntax element in the code stream is unchanged, and the first type of syntax element and the second type of syntax element are placed in an intermediate position of the code stream; or
所述第三类语法元素放置在所述第一类语法元素之后, 所述第二类语 法元素放置在所述第三类语法元素之后。  The third type of syntax element is placed after the first type of syntax element, and the second type of syntax element is placed after the third type of syntax element.
16、 根据权利要求 14或 15所述的设备, 其特征在于, 所述设备为解 码设备或者媒体网关。  The device according to claim 14 or 15, wherein the device is a decoding device or a media gateway.
17、 一种解码设备, 其特征在于, 包括:  17. A decoding device, comprising:
提取模块, 用 于提取编码设备产生的码流中 的语法元素 max— dec— pic— buffering— diff[i]和 num— reorder— pics— diff[i] , 其中, 语法元素 max— dec— pic— buffering_diff[i]表示解码时间层标识不超过 i 的子码流所需 要的解码图像緩存的大小与解码时间层标识不超过 i-1 的子码流所需要的 解码图像緩存的大小的差值, 语法元素 num_reorder_pics_diff[i]表示时间 层标识不超过 i的子码流中倒置图像的数量与时间层标识不超过 i-1 的子 码流中倒置图像的数量的差值; An extraction module, configured to extract syntax elements max_dec_pic_buffering-diff[i] and num-reorder-pics-diff[i] in the code stream generated by the encoding device, where the syntax element Max_dec_pic_buffering_diff[i] indicates the size of the decoded image buffer required for the decoding of the sub-stream of the decoding time layer not exceeding i and the decoded image buffer required for the sub-stream of the decoding time layer not exceeding i-1 The difference in size, the syntax element num_reorder_pics_diff[i] represents the difference between the number of inverted images in the subcode stream whose time layer identifier does not exceed i and the number of inverted images in the subcode stream whose time layer identifier does not exceed i-1;
差分解码模块,用于若 i为 0,釆用下述方式得到 MaxDecPicBuffering[i] 和 Num eorderPics[i]:  Differential decoding module, if i is 0, obtain MaxDecPicBuffering[i] and Num eorderPics[i] in the following way:
MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];  MaxDecPicBuffering[i] = max— dec— pic— buffering— diff[i];
NumReorderPics[i] = num_reorder_pics_diff [i];  NumReorderPics[i] = num_reorder_pics_diff [i];
若 i 大于 0 , 采用 下述方式得到 MaxDecPicBuffering[i]和 NumReorderPics[i]:  If i is greater than 0, MaxDecPicBuffering[i] and NumReorderPics[i] are obtained as follows:
MaxDecPicBuffering[i]=MaxDecPicBufferin i- 1 ] MaxDecPicBuffering[i]=MaxDecPicBufferin i- 1 ]
Figure imgf000034_0001
Figure imgf000034_0001
NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_pics_diff [i] ; 其中, MaxDecPicBuffering[i]表示假设参考解码器 HRD的 DPB针对 第 i个时间层码流所要求的存储空间大小, NumReorderPics[i]表示第 i个 时间层码流中的时序倒置图像的最大数量。  NumReorderPics[i] = NumReorderPics [i- 1 ] + num_reorder_pics_diff [i] ; where MaxDecPicBuffering[i] represents the size of the storage space required for the iB of the reference decoder HRD for the i-th temporal layer stream, NumReorderPics[i] Represents the maximum number of time-series inverted images in the i-th temporal layer stream.
PCT/CN2012/083682 2012-04-16 2012-10-29 Video image code stream processing method and device WO2013155828A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210110893.6A CN103379320B (en) 2012-04-16 2012-04-16 Video image code flow processing method and equipment
CN201210110893.6 2012-04-16

Publications (1)

Publication Number Publication Date
WO2013155828A1 true WO2013155828A1 (en) 2013-10-24

Family

ID=49382863

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/083682 WO2013155828A1 (en) 2012-04-16 2012-10-29 Video image code stream processing method and device

Country Status (2)

Country Link
CN (1) CN103379320B (en)
WO (1) WO2013155828A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104754347B (en) 2013-12-26 2019-05-17 中兴通讯股份有限公司 Coding, coding/decoding method and device, the electronic equipment of video image serial number
CN104754358B (en) * 2013-12-27 2019-02-19 中兴通讯股份有限公司 The generation of code stream and processing method, apparatus and system
US20150264404A1 (en) * 2014-03-17 2015-09-17 Nokia Technologies Oy Method and apparatus for video coding and decoding
CN108243339B (en) 2016-12-27 2021-02-26 浙江大学 Image coding and decoding method and device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1751517A (en) * 2003-02-18 2006-03-22 诺基亚有限公司 Picture decoding method
CN1871854A (en) * 2003-09-05 2006-11-29 康奈克森特系统公司 Hypothetical reference decoder with low start-up delays for compressed image and video
CN101141641A (en) * 2006-09-05 2008-03-12 索尼株式会社 Information processing apparatus and information processing method
CN101185333A (en) * 2005-05-26 2008-05-21 Lg电子株式会社 Method of transmitting picture information when encoding video signal and method of using the same when decoding video signal
CN101222630A (en) * 2007-11-30 2008-07-16 武汉大学 Time-domain gradable video encoding method for implementing real-time double-frame reference
CN101272495A (en) * 2007-03-23 2008-09-24 三星电子株式会社 Method and apparatus for transmitting packet-based image frame
US20110064146A1 (en) * 2009-09-16 2011-03-17 Qualcomm Incorporated Media extractor tracks for file format track selection
CN102065289A (en) * 2010-09-09 2011-05-18 西安电子科技大学 Reliable video transmission method and device based on network coding
CN102131081A (en) * 2010-01-13 2011-07-20 华为技术有限公司 Dimension-mixed coding/decoding method and device
CN102215422A (en) * 2010-04-01 2011-10-12 炬力集成电路设计有限公司 Method, device and system for generating verification code stream of video processing integrated circuit

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1751517A (en) * 2003-02-18 2006-03-22 诺基亚有限公司 Picture decoding method
CN1871854A (en) * 2003-09-05 2006-11-29 康奈克森特系统公司 Hypothetical reference decoder with low start-up delays for compressed image and video
CN101185333A (en) * 2005-05-26 2008-05-21 Lg电子株式会社 Method of transmitting picture information when encoding video signal and method of using the same when decoding video signal
CN101141641A (en) * 2006-09-05 2008-03-12 索尼株式会社 Information processing apparatus and information processing method
CN101272495A (en) * 2007-03-23 2008-09-24 三星电子株式会社 Method and apparatus for transmitting packet-based image frame
CN101222630A (en) * 2007-11-30 2008-07-16 武汉大学 Time-domain gradable video encoding method for implementing real-time double-frame reference
US20110064146A1 (en) * 2009-09-16 2011-03-17 Qualcomm Incorporated Media extractor tracks for file format track selection
CN102131081A (en) * 2010-01-13 2011-07-20 华为技术有限公司 Dimension-mixed coding/decoding method and device
CN102215422A (en) * 2010-04-01 2011-10-12 炬力集成电路设计有限公司 Method, device and system for generating verification code stream of video processing integrated circuit
CN102065289A (en) * 2010-09-09 2011-05-18 西安电子科技大学 Reliable video transmission method and device based on network coding

Also Published As

Publication number Publication date
CN103379320B (en) 2016-11-23
CN103379320A (en) 2013-10-30

Similar Documents

Publication Publication Date Title
US11477488B2 (en) Method and apparatus for encoding/decoding images
US8638847B2 (en) Video coding, decoding and hypothetical reference decoder
US20180152712A1 (en) Devices and methods for identifying a leading picture
US20160234522A1 (en) Video Decoding
US20150085938A1 (en) Image decoding method and apparatus using same
TW200427335A (en) Picture coding method
US20130279601A1 (en) Image signal decoding apparatus and image signal decoding method
TW200531555A (en) Picture decoding method
KR102574868B1 (en) Video-encoding method, video-decoding method, and apparatus implementing same
US8401077B2 (en) Video processing apparatus and method
US20130272619A1 (en) Devices for identifying a leading picture
KR102115323B1 (en) Method for storing image information, method for parsing image information and apparatus using same
WO2013155828A1 (en) Video image code stream processing method and device
JP2023526370A (en) Picture type restrictions in video bitstream processing
US20170272787A1 (en) Signal transmission/reception device and signal transmission/reception method for providing trick play service
JP4408801B2 (en) Codec conversion apparatus and method for image data
TW202130187A (en) Video data stream, video encoder, apparatus and methods for a hypothetical reference decoder and for output layer sets
CN112073822B (en) Media change method and system in broadband trunking communication
US20100027681A1 (en) Method and apparatus for video error concealment in multi-view coded video using high level syntax

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12874656

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12874656

Country of ref document: EP

Kind code of ref document: A1