WO2008067761A1 - Method and apparatus of video coding and decoding - Google Patents

Method and apparatus of video coding and decoding Download PDF

Info

Publication number
WO2008067761A1
WO2008067761A1 PCT/CN2007/071167 CN2007071167W WO2008067761A1 WO 2008067761 A1 WO2008067761 A1 WO 2008067761A1 CN 2007071167 W CN2007071167 W CN 2007071167W WO 2008067761 A1 WO2008067761 A1 WO 2008067761A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
random access
video sequence
decoding
encoding
Prior art date
Application number
PCT/CN2007/071167
Other languages
French (fr)
Chinese (zh)
Inventor
Yannan Wu
Yun He
Jianwen Chen
Xiaozhen Zheng
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2008067761A1 publication Critical patent/WO2008067761A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/107Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field

Definitions

  • the present invention relates to the field of video signal processing, and in particular, to a video encoding method, a decoding method, and a device thereof.
  • the prediction modes of video image sequences used for encoding can be classified into the following three types:
  • the first type of prediction is to encode the current pixel by using the encoded pixel data in the same image;
  • the current image is encoded by using chronological order, that is, playing or displaying reference image data in front of the current image;
  • the third type of prediction is reference image data and time after chronological order after the current image.
  • the current image is encoded by reference image data preceding the current image.
  • the bitstream data obtained by the three coding methods has a flag code for identifying the nature of an image, and the code indicates that the graph uses the first type of prediction mode for encoding.
  • the current image referred to above is the image being encoded or decoded.
  • the one image can be a frame image or a field image.
  • the first type of prediction method has the lowest coding efficiency because it uses only the data in the image.
  • the second type of prediction method uses the reference image data before the current image time to predict the current image, and its compression efficiency.
  • the compression efficiency is higher than that of the first type of prediction method;
  • the third type of prediction method predicts the current image by using both the reference image data after the current image time and the reference image data before the current image time. Its compression efficiency is The highest of the above three encoding methods.
  • only the first type can support random access operations.
  • the existing MPEG-2 video codec standard and the H.264 video codec standard use the above three types of prediction techniques in combination.
  • the image encoded by the first type of prediction mode is referred to as an A picture
  • the image B image encoded by the second type of prediction mode, and the image coded by the third type of prediction mode are C pictures.
  • MPEG-2 a video sequence is divided into a series of image groups, each image group having a picture group header, that is, a description of the image group features, by finding the starting identification code of the image header. To access random access to the group of images.
  • the A picture is followed by the picture group header to support random access, and then the B picture or C picture is used as much as possible to improve compression efficiency.
  • MPEG-2 when a C picture is introduced in encoding, it is likely that a C picture belonging to the previous picture group in the code stream follows the A picture of the next picture group.
  • MPEG-2 reserves a picture group enclosing flag in the picture group header to indicate whether the picture group can be randomly accessed.
  • the method of using the closed flag for video random access operation is as follows:
  • the closed flag is set to 1, otherwise set to 0.
  • the closed flag is 1, if random access occurs, even if the image before the current image group is not available, the image in the current image group can still be decoded normally; if the closed flag is 0, the current image is discarded. The group performs a random access and continues to search backward for a new group of images.
  • the B picture and the C picture use a multi-reference frame technique, that is, the number of reference pictures at the time of predictive coding is more than one.
  • a C picture using the third type of prediction mode or a B picture of the second type of prediction mode having multiple reference pictures is introduced into the coding, it is possible to have a C picture which belongs to the A picture before the A picture in the code stream.
  • the image before the current A picture may not be available, and the B picture or C picture in which the above occurs may not be decoded normally.
  • H.264 introduces an instant decoding update image identifier to support random access. If the current image is an instant update image, all the reference images before the current image will be marked as unavailable, that is, in the encoding order, the encoding order is the time sequence of the image encoding process, and the image is in the instant update map. For the image after the image, the image before the image is updated as the reference image when the second type of prediction mode is encoded. By using an instant update image, H.264 can also achieve good random access to video.
  • an object of the present invention is to provide a video encoding method that reduces random access to a video stream while reducing the efficiency of video sequence encoding due to random access techniques.
  • the encoding method includes the following steps:
  • the random access cut-in image in the uncoded video sequence is coded by the first type of prediction
  • n images having a coding order located after the random access cut-in image are encoded as a reference image by using the random access cut-in image as a reference image, and n is a predetermined positive integer;
  • the randomized access cut-in image or/and the randomized access-cut image after the cut-in image are encoded as reference pictures.
  • Still another object of the present invention is to provide a decoding method corresponding to the above encoding method, comprising the following steps:
  • n is a predetermined positive integer
  • the encoding device includes the following modules:
  • a random access judging module configured to receive an uncoded video sequence, determine whether the received current image has a random access punctured image, and if yes, send the current image and the subsequent image to the random access coding module, otherwise
  • the encoded video sequence is sent to the normal encoding module;
  • a random access coding module configured to encode the received random access access image and subsequent images according to a random access coding rule, and output the encoded video sequence;
  • a normal coding module configured to receive the received video
  • the current image and subsequent images are encoded according to normal encoding rules, and the encoded video sequence is output.
  • the decoding device includes the following modules:
  • a random access judging module configured to receive a video sequence, determine whether the received current decoded image is a random access punctured image, and if yes, send the video sequence to a random access decoding module, otherwise send the video sequence to normal decoding Module
  • a random access decoding module configured to decode a random access access image and subsequent images in the received video sequence according to a random access decoding rule, and output the decoded video sequence
  • the normal encoding module is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
  • the present invention also proposes a codec device composed of the above-described encoding device and decoding device.
  • a codec device composed of the above-described encoding device and decoding device.
  • at the time of encoding at least one image after random access to the cut image is encoded by the fourth type of prediction mode; thus, for random access to other images after the image is cut, the image can be used.
  • the randomly accessed punctured image and/or the image encoded by the fourth type of prediction mode described above is encoded as a reference image. Since the random access of the cut-in image and the reference image of the image after it does not include random access to the image before the cut-in image, random access with the random access of the cut-in image as a point of entry can be realized at the time of decoding.
  • the scheme of the present invention does not need to impose a mandatory limitation on the selection of the reference image at the time of decoding, and the code rate is also significantly improved.
  • FIG. 3 is a block diagram of an encoding apparatus according to an embodiment of the present invention.
  • Fig. 4 is a block diagram of a decoding apparatus according to an embodiment of the present invention.
  • the core idea of the present invention is that the image after the random access point is not encoded with the image before the random access point as a reference image, so that random access to the video sequence according to the random access point can be realized.
  • This random access point is the point of entry for random access of video.
  • the code stream it can be an image with a random access identifier, which is called a random access cut-in image.
  • the fourth type of prediction mode is used to assist the random access of the video image, and the fourth type of prediction mode uses the reference image data after the current image in the display order to encode the current image.
  • an image encoded by the first type of prediction mode is referred to as an A image
  • an image encoded by the second type of prediction mode is referred to as a B image
  • the image obtained by the mode encoding is called a C image
  • the image encoded by the fourth type of prediction is called a D image.
  • the compression efficiency of the A image is low, but the random access is supported; the compression efficiency of the B image and the C image is sequentially improved, but since they need to decode the image located before them as the reference image, the random image is not supported. Access; D images have good compression efficiency and do not require decoding of images that are in sequence before the A image as reference images.
  • the present invention will be further described in detail below with reference to the accompanying drawings.
  • the encoding method of a video sequence is such that there are two C images between the A image and the B image or between the B image and the B image, there is a relationship between the A image and the A image. 4 B images, ie the coding order is as follows: ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇
  • the video sequence can be expressed in the order of time:
  • the subscript number is the serial number of the image in the display order.
  • the image c 14 , the image c 15 and the image can be forward referenced to the image.
  • the image 6 is set as a random access cut-in image of the segment of video, then according to the inventive arrangement, after the image A 16 is encoded, any C image after the image A 16 and before the image B 19 is changed.
  • the fourth type of prediction for example, in this example, the first image whose encoding order is immediately followed by the image A 16 is encoded by the fourth type of prediction, so that the original image C 14 becomes an image.
  • D 14 whose reference image is image A 6 .
  • the image D 14 and the image B 14 are identical in display order and encoding order. Then the coding sequence of the video is ⁇ ⁇ B 4 C 2 CB 7 C 5 C 6 5 10 C 8 C 9 B l C l ⁇ C l2 A l6 D l4 C 15 B l9 C 17 C 18
  • the image D 14 can be used as the encoded reference image of the image c 15 and the image ⁇ 19 , and the image c 15 and the image cannot use the image before the A 6 as the encoded reference image. .
  • decoding when a video segment is set to the random access cut image 6 appears, decoding the first image 46 a first type prediction mode, then the image of a decoding order after 6 The image encoded by the fourth type of prediction mode is decoded, and the image is used as a reference image of the first C image and the first B image immediately following the image. According to the above operation to complete the decoding, the video can achieve random access with 4 6 as the cut image.
  • the first type of prediction mode is used for coding
  • B One or more images in the coding order after random access to the cut-in image are encoded in a fourth type of prediction mode, and the reference image is a random access cut-in image
  • the randomized access cut image or/and the random access cut image after the encoded image is used as a reference image.
  • coding Specifically, the image encoded by the first type of prediction mode and/or the image of the fourth type of coding mode are used as reference images.
  • the encoding processing flow of the second embodiment of the present invention includes the following steps: Step 101: Determine whether the current image is set to randomly access the cut image, and if yes, execute step 102, otherwise perform processing according to the normal encoding process.
  • Step 102 encoding the random access cut-in image by using the first type of prediction manner to obtain
  • Step 103 The one or more images whose encoding order is located after the random access cut-in image are encoded by the fourth type of prediction mode, and the reference image is the A image to obtain a corresponding D image.
  • an image whose encoding order follows the random access cut-in image is encoded in a fourth type of prediction.
  • Step 104 Perform a second type of prediction mode or a third type of prediction mode coding on the current image by using the encoded image in the coding order after the A image and the A image and before the current image as a reference image. , get B image or C image.
  • Step 105 Determine whether there is still an uncoded image. If yes, execute step 106. Otherwise, end the encoding process.
  • Step 106 Determine whether a new random access cut-in image appears, and if yes, go to step 102, otherwise go to step 104.
  • Step 201 If it is determined that the currently decoded image in the video sequence is a random access cut-in image, step 202 is performed; otherwise, processing is performed according to a normal decoding process.
  • Step 202 Decode the A image corresponding to the random access image into the image.
  • Step 203 Decode the D picture immediately after the random access cut image A picture in the decoding order, and the reference picture is an A picture.
  • Step 204 Decode the current decoded image by using the image after the random access to the image and before the current decoded image as a reference image.
  • Step 205 Determine whether there is still an undecoded video image, and if yes, execute step 206, otherwise, end the encoding process.
  • Step 206 Determine whether a new random access cut-in image appears, and if yes, go to step 202, otherwise go to step 204.
  • the image encoded by the fourth type of prediction in the second embodiment is the next image randomly accessed in the coding order.
  • the number of C pictures between the A picture and the first B picture after the A picture, or between two adjacent B pictures is usually a fixed value, which is set to m. Any one or more of the m C pictures can be changed to be encoded in the fourth type of prediction mode in the scheme of the present invention to obtain a D picture.
  • Embodiment 3 of the present invention is a random access coding apparatus and a decoding apparatus for implementing the scheme of the present invention.
  • the random access coding device is as shown in FIG. 3, and includes:
  • the random access judging module 301 is configured to receive an uncoded video sequence, determine whether the received current image is a random access punctured image, and if yes, send the current image and the subsequent image to the random access encoding module 302, Otherwise, the current image and its subsequent image are sent to the normal encoding module 303.
  • the random access coding module 302 is configured to encode the received random access hand-in image and subsequent images according to a random access coding rule, and send the encoded video sequence to the storage module 304.
  • the random access coding rule satisfies the following process in principle:
  • the current random access cut-in image is encoded by the first type of prediction mode
  • the coding sequence is in the random access cut-in image
  • the subsequent n images are encoded by the random access cut-in image as a reference image, and encoded by a fourth type of prediction, where n is a predetermined positive integer;
  • the specific content of the random access encoding rule depends on the specific format of the video sequence.
  • the code stream according to the coding mode of the first embodiment can be coded by referring to the coding mode of the first embodiment of the present invention.
  • the normal encoding module 303 is configured to encode the received current image and subsequent images according to normal encoding rules, and send the encoded video sequence to the storage module 304.
  • the normal coding rule may be any video sequence coding method that does not need to implement random access, for example, a coding method in a second type of prediction mode or a third type of prediction mode.
  • the storage module 304 is configured to store the received video sequence and output the stored video sequence.
  • the random access decoding apparatus of the embodiment of the present invention includes:
  • the random access judging module 401 is configured to receive a video sequence, determine whether the received image is a random access punctured image, and if yes, send the video sequence to the random access decoding module 402, otherwise send the video sequence to normal.
  • the random access decoding module 402 is configured to decode the image corresponding to the random access identifier in the received video sequence and the subsequent image according to a random access decoding rule, and output the decoded video sequence.
  • the random access decoding rule corresponds to the random access coding rule in the above coding apparatus, and follows the following process:
  • n images encoded by the fourth type of encoding are decoded, n is a preset positive integer; C. cutting the random access into the image, and/or decoding the decoded image encoded by the fourth type of encoding after the random access of the cut image as a decoded reference image, and cutting the random access in the decoding order The other undecoded pictures after the picture are decoded.
  • the normal encoding module 403 is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
  • the display module 404 is configured to receive the decoded video sequence from the random access decoding module 402 or the normal encoding module 403, and display the video code sequence as a video image.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method of video coding includes following steps: use the first prediction mode to code a random accessing entry image in a non-coded video sequence; take the said random accessing entry image as reference image and the fourth prediction mode is used to code n images which follow the said random accessing entry image in the coding order, and n is a pre-establishing positive integer; the said random accessing entry image or/and coded images after the said random accessing entry image are used as reference image to code other non-coding images which follow the said random accessing entry image in the coding order. And a method of video decoding, an apparatus for coding, a apparatus for decoding and an apparatus for coding and decoding. The subject matter is economized code rate than a technology of video random access in prior art.

Description

一种视频编码方法、 解码方法及其装置 技术领域  Video coding method, decoding method and device thereof
本发明涉及视频信号处理领域, 特别涉及一种视频编码方法、 解码 方法及其装置。  The present invention relates to the field of video signal processing, and in particular, to a video encoding method, a decoding method, and a device thereof.
发明背景 Background of the invention
良好的视频编解码方法, 需要在保证尽可能高的压缩效率的前提 下, 能够有效地支持随机访问操作, 即当把带有随机访问点的图象地址 传给解码器后, 解码器能正确解码所述带有随机访问点的图象以及该图 象之后的图象。 目前, 视频图象序列用于编码的预测方式可分为如下三 类: 第一类预测方式为, 利用同一幅图象内已编码的象素数据对当前象 素进行编码; 第二类预测方式为, 利用时间顺序即播放或显示顺序在当 前图象之前的参考图象数据, 来对当前图象进行编码; 第三类预测方式 为利用时间顺序在当前图象之后的参考图象数据和时间顺序在当前图 象之前的参考图象数据来对当前图象进行编码。 这三种编码方式所得到 的比特流数据中, 均有用于标识一幅图象性质的标志码, 该码说明该图 使用了第几类预测方式进行编码。 上述所说的当前图象即是正在进行编 码或解码的图象。 所说的一幅图象可以是一帧图像或者是一场图象。  A good video encoding and decoding method needs to support the random access operation effectively under the premise of ensuring the highest possible compression efficiency, that is, when the image address with the random access point is transmitted to the decoder, the decoder can be correct. The image with random access points and the image after the image are decoded. At present, the prediction modes of video image sequences used for encoding can be classified into the following three types: The first type of prediction is to encode the current pixel by using the encoded pixel data in the same image; For example, the current image is encoded by using chronological order, that is, playing or displaying reference image data in front of the current image; the third type of prediction is reference image data and time after chronological order after the current image. The current image is encoded by reference image data preceding the current image. The bitstream data obtained by the three coding methods has a flag code for identifying the nature of an image, and the code indicates that the graph uses the first type of prediction mode for encoding. The current image referred to above is the image being encoded or decoded. The one image can be a frame image or a field image.
第一类预测方式由于只使用本图象内的数据进行处理, 其编码效率 最低; 第二类预测方式由于利用了当前图象所在时刻之前的参考图象数 据来预测当前图象, 其压缩效率比第一类预测方式的压缩效率高; 第三 类预测方式由于既利用当前图象所在时刻之后的参考图象数据, 又利用 当前图象所在时刻之前的参考图象数据来预测当前图象, 其压缩效率在 上述三种编码方式中最高。 但是, 上述三类编码技术中, 只有第一类可 以支持随机访问操作。 The first type of prediction method has the lowest coding efficiency because it uses only the data in the image. The second type of prediction method uses the reference image data before the current image time to predict the current image, and its compression efficiency. The compression efficiency is higher than that of the first type of prediction method; the third type of prediction method predicts the current image by using both the reference image data after the current image time and the reference image data before the current image time. Its compression efficiency is The highest of the above three encoding methods. However, of the above three types of coding techniques, only the first type can support random access operations.
现有的 MPEG - 2视频编解码标准及 H.264视频编解码标准都混合使 用了上述三类预测技术, 为便于描述, 以下称用第一类预测方式编码的 图象为 A图象, 第二类预测方式编码的图象 B图象, 第三类预测方式编码 的图象为 C图象。 在 MPEG - 2中, 一个视频序列分成一系列图象组, 每 个图象组都有一个图象组头, 即该图象组特征的描述, 通过寻找图象头 的起始标识码来实现以进入该图象组的随机访问。 在每个图象组中, A 图象紧跟着图象组头以便支持随机访问,之后则尽可能地采用 B图象或 C 图象来提高压缩效率。  The existing MPEG-2 video codec standard and the H.264 video codec standard use the above three types of prediction techniques in combination. For convenience of description, the image encoded by the first type of prediction mode is referred to as an A picture, The image B image encoded by the second type of prediction mode, and the image coded by the third type of prediction mode are C pictures. In MPEG-2, a video sequence is divided into a series of image groups, each image group having a picture group header, that is, a description of the image group features, by finding the starting identification code of the image header. To access random access to the group of images. In each group of pictures, the A picture is followed by the picture group header to support random access, and then the B picture or C picture is used as much as possible to improve compression efficiency.
在 MPEG - 2中, 当编码中引入采用 C图象时, 将有可能出现码流中 本属于前一个图象组的 C图象跟在下一个图象组的 A图象之后的情况。当 发生随机访问时, 由于无法获得随机访问点所对应的当前图象组之前的 图象, 紧跟在当前图象组的 A图象之后、 所述 A图象后续的第一个非 B图 象之间的 C图象就有可能不能正常解码。 为了解决这个问题, MPEG - 2 在图象组头中预留了图象组封闭标志来标志该图象组能否发生随机访 问。  In MPEG-2, when a C picture is introduced in encoding, it is likely that a C picture belonging to the previous picture group in the code stream follows the A picture of the next picture group. When random access occurs, since the image before the current image group corresponding to the random access point cannot be obtained, immediately after the A image of the current image group, the first non-B image subsequent to the A image It is possible that the C picture between the images cannot be decoded normally. To solve this problem, MPEG-2 reserves a picture group enclosing flag in the picture group header to indicate whether the picture group can be randomly accessed.
利用封闭标志进行视频随机访问操作的方法如下:  The method of using the closed flag for video random access operation is as follows:
判断码流中图象组头之后第一个 A图象之后的 C图象(如果有)是否 仅仅采用了第四类预测方式, 该方式利用了按时间顺序当前图象之后的 参考图象数据来预测当前图象, 或者采用了第一类预测方式编码, 如果 是, 那么封闭标志设为 1 , 否则设为 0。 当封闭标志为 1时, 若发生随机 访问, 即便当前图象组之前的图象已不可获得, 当前图象组内的图象仍 然能够正常解码; 若封闭标志为 0, 则放弃对当前图象组进行随机访问, 继续向后搜索新的图象组。 在 H.264中, B图象和 C图象使用了多参考帧技术, 即其预测编码时 的参考图象数目多于一幅。 当编码中引入采用第三类预测方式的 C图象 或具有多参考图象的第二类预测方式的 B图象时, 将有可能出现码流中 本属于 A图象之前的 C图象跟在所述 A图象之后的情况, 或者紧跟在 A图 象之后的 B图象向前参考所述 A图象之前的图象的情况。当发生随机访问 时, 当前 A图象之前的图象可能已无法获得, 出现上述情况的 B图象或 C 图象就有可能不能正常解码。 为了解决这个问题, H.264中引入即时解 码更新图象标识符来支持随机访问。 若当前图象为即时更新图象时, 则 当前图象之前所有的参考图象都会被标记为不可用, 即对于在编码顺序 上, 编码顺序为图象编码处理的时间顺序, 处于即时更新图象之后的图 象, 对其进行第二类预测方式编码时均不能使用即时更新图象之前的图 象作为参考图象。 通过使用即时更新图象, H.264中亦能够很好的实现 视频随机访问。 Determining whether the C picture (if any) after the first A picture after the picture group header in the code stream uses only the fourth type of prediction mode, which utilizes the reference picture data after the current picture in chronological order To predict the current image, or use the first type of prediction mode encoding, if it is, then the closed flag is set to 1, otherwise set to 0. When the closed flag is 1, if random access occurs, even if the image before the current image group is not available, the image in the current image group can still be decoded normally; if the closed flag is 0, the current image is discarded. The group performs a random access and continues to search backward for a new group of images. In H.264, the B picture and the C picture use a multi-reference frame technique, that is, the number of reference pictures at the time of predictive coding is more than one. When a C picture using the third type of prediction mode or a B picture of the second type of prediction mode having multiple reference pictures is introduced into the coding, it is possible to have a C picture which belongs to the A picture before the A picture in the code stream. In the case after the A picture, or the case where the B picture immediately after the A picture is referred to the image before the A picture. When random access occurs, the image before the current A picture may not be available, and the B picture or C picture in which the above occurs may not be decoded normally. To solve this problem, H.264 introduces an instant decoding update image identifier to support random access. If the current image is an instant update image, all the reference images before the current image will be marked as unavailable, that is, in the encoding order, the encoding order is the time sequence of the image encoding process, and the image is in the instant update map. For the image after the image, the image before the image is updated as the reference image when the second type of prediction mode is encoded. By using an instant update image, H.264 can also achieve good random access to video.
上述现有技术虽然解决了视频随机访问的问题, 但又带来新的问 题, 那就是使得编码效率降低。 因为按照 MPEG - 2或 H.264标准中解决 随机访问的方法, 紧跟在 A图象之后的 C图象或 B图象的预测方向及可选 参考图象数目上会受到一定的限制,这样就会导致这组 C图象或 B图象的 编码效率降低。 具体地说, 在 A图象之后、 并且所述 A图象之后的 B图象 之前的一组 C图象只能进行第四类预测方式编码, 而无法进行第三类预 测方式编码; 而紧跟在 A图象之后的第一个 B图象只能向前参考一幅图 象, 而其后的 B图象的可参考图象亦会受到数目限制。 对于随机访问点 比较多的视频序列,例如广播视频序列要求平均 0.5秒就有一个随机访问 点, 现有的视频随机访问方法带来的编码效率降低的问题就更加突出。 发明内容 Although the above prior art solves the problem of random access of video, it brings a new problem, which is to make the coding efficiency lower. Since the random access method is solved in accordance with the MPEG-2 or H.264 standard, the prediction direction of the C picture or the B picture immediately following the A picture and the number of optional reference pictures are limited. This results in a reduction in the coding efficiency of the set of C pictures or B pictures. Specifically, a group of C pictures after the A picture and before the B picture after the A picture can only perform the fourth type of prediction mode coding, and cannot perform the third type of prediction mode coding; The first B picture following the A picture can only be referenced one picture forward, and the subsequent reference picture of the B picture is also limited by the number. For a video sequence with a large number of random access points, for example, a broadcast video sequence requires an average of 0.5 seconds to have a random access point, and the problem of reduced coding efficiency caused by the existing video random access method is more prominent. Summary of the invention
有鉴于此, 本发明的目的在于, 提出一种视频编码方法, 在实现对 视频流随机访问的同时, 又减少因随机访问技术而带来的视频序列编码 效率的降低。 所述编码方法包括如下步骤:  In view of the above, an object of the present invention is to provide a video encoding method that reduces random access to a video stream while reducing the efficiency of video sequence encoding due to random access techniques. The encoding method includes the following steps:
对未编码视频序列中的随机访问切入图象采用第一类预测方式编 码;  The random access cut-in image in the uncoded video sequence is coded by the first type of prediction;
将编码顺序位于所述随机访问切入图象之后的 n幅图象以所述随机 访问切入图象作为参考图象以第四类预测方式进行编码, n为预先设定 的正整数;  n images having a coding order located after the random access cut-in image are encoded as a reference image by using the random access cut-in image as a reference image, and n is a predetermined positive integer;
对于编码顺序在所述随机访问切入图象之后的其它未编码图象, 以 所述随机访问切入图象或 /和所述随机访问切入图象之后的已编码图象 作为参考图象进行编码。  For the other uncoded pictures whose encoding order is after the random access cut-in image, the randomized access cut-in image or/and the randomized access-cut image after the cut-in image are encoded as reference pictures.
本发明的又一目的在于, 提出与上述编码方法对应的解码方法, 包 括如下步骤:  Still another object of the present invention is to provide a decoding method corresponding to the above encoding method, comprising the following steps:
将未解码视频序列中的随机访问切入图象解码;  Decoding the random access in the undecoded video sequence into the image;
对于解码顺序在所述随机访问切入图象之后的 n幅以第四类编码方 式编码的图象解码, n为预先设定的正整数;  For the decoding of the image encoded by the fourth type of encoding after the random access cut-in image, n is a predetermined positive integer;
将所述随机访问切入图象, 以及所述随机访问切入图象之后的以第 四类编码方式编码的已解码图象作为解码参考图象, 对解码顺序在所述 随机访问切入图象之后其它未解码图象进行解码。  Inserting the random access into the image, and the decoded image encoded by the fourth type of encoding after the random access of the cut image as a decoded reference image, and the decoding order after the random access of the cut image The undecoded image is decoded.
本发明的目的还在于, 提出一种视频编码装置和解码装置。 所述编 码装置包括如下模块:  It is also an object of the present invention to provide a video encoding apparatus and decoding apparatus. The encoding device includes the following modules:
随机访问判断模块, 用于接收未编码视频序列, 判断所接收的当前 图象是否出现随机访问切入图象, 若是则将当前图象及后续图象送至随 机访问编码模块, 否则将所述未编码视频序列发送至正常编码模块; 随机访问编码模块, 用于对所收到的当前随机访问切入图象及其之 后的图象按照随机访问编码规则进行编码, 输出编码后的视频序列; 正常编码模块, 用于对所收到的当前图象及其之后的图象按照正常 编码规则进行编码, 输出编码后的视频序列。 a random access judging module, configured to receive an uncoded video sequence, determine whether the received current image has a random access punctured image, and if yes, send the current image and the subsequent image to the random access coding module, otherwise The encoded video sequence is sent to the normal encoding module; a random access coding module, configured to encode the received random access access image and subsequent images according to a random access coding rule, and output the encoded video sequence; a normal coding module, configured to receive the received video The current image and subsequent images are encoded according to normal encoding rules, and the encoded video sequence is output.
所述解码装置包括如下模块:  The decoding device includes the following modules:
随机访问判断模块, 用于接收视频序列, 判断所接收当前解码图象 是否为随机访问切入图象, 若是则将所述视频序列发送至随机访问解码 模块, 否则将所述视频序列发送至正常解码模块;  a random access judging module, configured to receive a video sequence, determine whether the received current decoded image is a random access punctured image, and if yes, send the video sequence to a random access decoding module, otherwise send the video sequence to normal decoding Module
随机访问解码模块, 用于对所收到的视频序列中随机访问切入图象 及其之后的图象按照随机访问解码规则进行解码, 输出解码后的视频序 列;  a random access decoding module, configured to decode a random access access image and subsequent images in the received video sequence according to a random access decoding rule, and output the decoded video sequence;
正常编码模块, 用于对所收到的视频序列按照正常解码规则进行解 码, 并输出解码后的视频序列。  The normal encoding module is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
本发明还提出了由上述编码装置和解码装置组成的编解码装置。 从以上技术方案可以看出, 在编码时, 对于随机访问切入图象之后 的至少一幅图象采用第四类预测方式进行编码; 这样对于随机访问切入 图象之后的其它图象,就可以用随机访问切入图象和 /或上述第四类预测 方式编码的图象作为参考图象进行编码。 由于随机访问切入图象及其之 后的图象的参考图象不会包括随机访问切入图象之前的图象, 在解码 时, 就可以实现以随机访问切入图象为切入点的随机访问。 与现有技术 比较, 本发明方案在解码时无需对参考图象的选择进行强制限制, 其码 率也有明显地改善。 附图简要说明 The present invention also proposes a codec device composed of the above-described encoding device and decoding device. As can be seen from the above technical solution, at the time of encoding, at least one image after random access to the cut image is encoded by the fourth type of prediction mode; thus, for random access to other images after the image is cut, the image can be used. The randomly accessed punctured image and/or the image encoded by the fourth type of prediction mode described above is encoded as a reference image. Since the random access of the cut-in image and the reference image of the image after it does not include random access to the image before the cut-in image, random access with the random access of the cut-in image as a point of entry can be realized at the time of decoding. Compared with the prior art, the scheme of the present invention does not need to impose a mandatory limitation on the selection of the reference image at the time of decoding, and the code rate is also significantly improved. BRIEF DESCRIPTION OF THE DRAWINGS
图 1为本发明实施例 的编码流程图;  1 is a code flow chart of an embodiment of the present invention;
图 2为本发明实施例 的解码流程图;  2 is a flowchart of decoding according to an embodiment of the present invention;
图 3为本发明实施例 的编码装置框图;  3 is a block diagram of an encoding apparatus according to an embodiment of the present invention;
图 4为本发明实施例 的解码装置框图。  Fig. 4 is a block diagram of a decoding apparatus according to an embodiment of the present invention.
实施本发明的方式 Mode for carrying out the invention
本发明的核心思想为: 使随机访问点之后的图象都不用随机访问点 之前的图象作为参考图象进行编码, 这样就可以实现依照随机访问点对 视频序列进行随机访问。 这个随机访问点即是视频随机访问的切入点, 在码流中可以是一幅带有随机访问标识符的图象, 筒称为随机访问切入 图象。  The core idea of the present invention is that the image after the random access point is not encoded with the image before the random access point as a reference image, so that random access to the video sequence according to the random access point can be realized. This random access point is the point of entry for random access of video. In the code stream, it can be an image with a random access identifier, which is called a random access cut-in image.
本发明实施例合理使用第四类预测方式协助视频图象随机访问, 第 四类预测方式即利用显示顺序上当前图象之后的参考图象数据来对当 前图象进行编码。 以下为说明方便并与现有技术对应, 将由第一类预测 方式编码得到的图象称为 A图象, 由第二类预测方式编码得到的图象称 为 B图象, 由第三类预测方式编码得到的图象称为 C图象, 由第四类预 测方式编码得到的图象称为 D图象。 其中, A图象的压缩效率较低, 但 支持随机访问; B图象和 C图象的压缩效率依次提高, 但由于它们需要 解码顺序上位于它们之前的图象作为参考图象, 不支持随机访问; D图 象具有不错的压缩效率,并且不需要解码顺序上位于 A图象之前的图象 作为参考图象。  In the embodiment of the present invention, the fourth type of prediction mode is used to assist the random access of the video image, and the fourth type of prediction mode uses the reference image data after the current image in the display order to encode the current image. Hereinafter, for convenience of description and corresponding to the prior art, an image encoded by the first type of prediction mode is referred to as an A image, and an image encoded by the second type of prediction mode is referred to as a B image, and is predicted by the third type. The image obtained by the mode encoding is called a C image, and the image encoded by the fourth type of prediction is called a D image. Among them, the compression efficiency of the A image is low, but the random access is supported; the compression efficiency of the B image and the C image is sequentially improved, but since they need to decode the image located before them as the reference image, the random image is not supported. Access; D images have good compression efficiency and do not require decoding of images that are in sequence before the A image as reference images.
为使本发明的目的、 技术方案和优点更加清楚, 下面结合附图对本 发明作进一步的详细阐述。 实施例一,若设在一段视频序列的编码方法为, A图象与 B图象之间 或 B图象与 B图象之间有两个 C图象, A图象与 A图象之间有 4个 B图象, 即编码顺序如下:
Figure imgf000009_0001
· · · · · . 同时, 该段视频序列在显示的时间先后顺序上可表示为:
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail below with reference to the accompanying drawings. In the first embodiment, if the encoding method of a video sequence is such that there are two C images between the A image and the B image or between the B image and the B image, there is a relationship between the A image and the A image. 4 B images, ie the coding order is as follows:
Figure imgf000009_0001
· · · · · · At the same time, the video sequence can be expressed in the order of time:
A^C.B.C.C^C.C^C^B^C^A^C^ ...... , 以下将显示的时间先 后顺序筒称为显示顺序。 A^C.B.C.C^C.C^C^B^C^A^C^ ...... , The time sequence shown below will be referred to as the display order.
其中下标数字为该幅图象在显示顺序上的序号。 在正常编码时, 图 象 c14、 图象 c15及图象 可向前参考图象 。若图象 6被设置为该段视 频的随机访问切入图象, 则按照本发明方案, 在图象 A16被编码后, 将图 象 A16后且图象 B19之前的任意 C图象改为用第四类预测方式进行编码, 例如本例中将编码顺序紧跟在图象 A16的第一个图象用第四类预测方式 进行编码, 这样原先的图象 C14变为图象 D14 , 其参考图象为图象 A6。 图 象 D14与图象 B14在显示顺序及编码顺序上相同。 则该段视频的编码顺序 为 . Αλ B4 C2 C B7 C5 C6510 C8 C9 Bl Cl λ Cl2 Al6 Dl4 C15 Bl9 C17 C18 The subscript number is the serial number of the image in the display order. At the time of normal encoding, the image c 14 , the image c 15 and the image can be forward referenced to the image. If the image 6 is set as a random access cut-in image of the segment of video, then according to the inventive arrangement, after the image A 16 is encoded, any C image after the image A 16 and before the image B 19 is changed. For encoding in the fourth type of prediction, for example, in this example, the first image whose encoding order is immediately followed by the image A 16 is encoded by the fourth type of prediction, so that the original image C 14 becomes an image. D 14 , whose reference image is image A 6 . The image D 14 and the image B 14 are identical in display order and encoding order. Then the coding sequence of the video is Α λ B 4 C 2 CB 7 C 5 C 6 5 10 C 8 C 9 B l C l λ C l2 A l6 D l4 C 15 B l9 C 17 C 18
依据本发明方案,图象 D14可作为图象 c15及图象 β19的编码参考图象, 并且图象 c15及图象 不能使用编码顺序在 A6之前的图象作为编码参考 图象。 在解码时, 当被设置为该段视频的随机访问切入图象 6出现时, 首先用第一类预测方式对图象 46进行解码, 然后将解码顺序在图象 6 后的第一幅以第四类预测方式编码的图象进行解码, 并让该幅图象作为 紧跟该幅图象之后的第一个 C图象及第一个 B图象的参考图象。按照上 述操作完成解码, 该段视频即可实现以 46为切入图象的随机访问。 According to the solution of the present invention, the image D 14 can be used as the encoded reference image of the image c 15 and the image β 19 , and the image c 15 and the image cannot use the image before the A 6 as the encoded reference image. . Upon decoding, when a video segment is set to the random access cut image 6 appears, decoding the first image 46 a first type prediction mode, then the image of a decoding order after 6 The image encoded by the fourth type of prediction mode is decoded, and the image is used as a reference image of the first C image and the first B image immediately following the image. According to the above operation to complete the decoding, the video can achieve random access with 4 6 as the cut image.
从上述本发明方案的具体应用中, 可以看出, 本发明的编码方案有 如下关键点:  From the specific application of the above solution of the present invention, it can be seen that the coding scheme of the present invention has the following key points:
A、 对于随机访问切入图象采用第一类预测方式编码; B、 编码顺序上在随机访问切入图象之后的一幅或多幅图象用第四 类预测方式进行编码, 其参考图象为随机访问切入图象; A. For the random access cut-in image, the first type of prediction mode is used for coding; B. One or more images in the coding order after random access to the cut-in image are encoded in a fourth type of prediction mode, and the reference image is a random access cut-in image;
C、 对于编码顺序在所述随机访问切入图象之后的其它未编码图象, 以所述随机访问切入图象或 /和所述随机访问切入图象之后的已编码图 象作为参考图象进行编码。 特别地, 将上述第一类预测方式编码的图象 和 /或第四类编码方式编码的图象作为参考图象。  C. For the other uncoded image whose encoding order is after the random access cut image, the randomized access cut image or/and the random access cut image after the encoded image is used as a reference image. coding. Specifically, the image encoded by the first type of prediction mode and/or the image of the fourth type of coding mode are used as reference images.
本发明实施例二的编码处理流程如图 1所示, 包括如下步骤: 步骤 101 : 判断当前图象是否设置为随机访问切入图象, 若是, 则 执行步骤 102, 否则按照正常编码流程进行处理。  As shown in FIG. 1 , the encoding processing flow of the second embodiment of the present invention includes the following steps: Step 101: Determine whether the current image is set to randomly access the cut image, and if yes, execute step 102, otherwise perform processing according to the normal encoding process.
步骤 102: 将随机访问切入图象用第一类预测方式进行编码, 得到 Step 102: encoding the random access cut-in image by using the first type of prediction manner to obtain
A图象。 A image.
步骤 103: 将编码顺序位于所述随机访问切入图象之后的一幅或多 幅图象用第四类预测方式进行编码, 参考图象为所述 A图象, 得到相应 的 D图象。本实施例中将编码顺序紧跟所述随机访问切入图象的一幅图 象用第四类预测方式进行编码。  Step 103: The one or more images whose encoding order is located after the random access cut-in image are encoded by the fourth type of prediction mode, and the reference image is the A image to obtain a corresponding D image. In the present embodiment, an image whose encoding order follows the random access cut-in image is encoded in a fourth type of prediction.
步骤 104: 以所述在编码顺序上 A图象以及 A图象之后、 当前图象 之前的已编码图象作为参考图象, 对当前图象进行第二类预测方式或第 三类预测方式编码, 得到 B图象或 C图象。  Step 104: Perform a second type of prediction mode or a third type of prediction mode coding on the current image by using the encoded image in the coding order after the A image and the A image and before the current image as a reference image. , get B image or C image.
步骤 105: 判断是否还有未编码的图象, 若是则执行步骤 106, 否则 结束本次编码流程。  Step 105: Determine whether there is still an uncoded image. If yes, execute step 106. Otherwise, end the encoding process.
步骤 106: 判断是否出现新的随机访问切入图象, 若是则转至步骤 102, 否则转至步骤 104。  Step 106: Determine whether a new random access cut-in image appears, and if yes, go to step 102, otherwise go to step 104.
相应的解码处理流程如图 2所示, 包括如下步骤:  The corresponding decoding process is shown in Figure 2, including the following steps:
步骤 201: 判断视频序列中当前被解码图象如果为随机访问切入图 象, 则执行步骤 202, 否则按照正常解码流程进行处理。 步骤 202: 将随机访问切入图象对应的 A图象进行解码。 Step 201: If it is determined that the currently decoded image in the video sequence is a random access cut-in image, step 202 is performed; otherwise, processing is performed according to a normal decoding process. Step 202: Decode the A image corresponding to the random access image into the image.
步骤 203: 将在解码顺序上紧跟所述随机访问切入图象 A图象之后 的 D图象进行解码, 参考图象为 A图象。  Step 203: Decode the D picture immediately after the random access cut image A picture in the decoding order, and the reference picture is an A picture.
步骤 204: 以所述随机访问切入图象之后、 当前解码图象之前的图 象作为参考图象, 对当前解码图象进行解码。  Step 204: Decode the current decoded image by using the image after the random access to the image and before the current decoded image as a reference image.
步骤 205: 判断是否还有未解码视频图象, 若是则执行步骤 206, 否 则结束本次编码流程。  Step 205: Determine whether there is still an undecoded video image, and if yes, execute step 206, otherwise, end the encoding process.
步骤 206: 判断是否出现新的随机访问切入图象, 若是则转至步骤 202, 否则转至步骤 204。  Step 206: Determine whether a new random access cut-in image appears, and if yes, go to step 202, otherwise go to step 204.
实施例二中采用第四类预测方式进行编码的图象为编码顺序上随机 访问切入图象的下一幅图象。 现有技术中, A图象与 A图象之后的第一 个 B图象, 或者相邻两个 B图象之间的 C图象的数目通常为一固定值, 设为 m。 可以将这 m个 C图象中的任意一个或多个改为用本发明方案 中按第四类预测方式进行编码从而得到 D图象。  The image encoded by the fourth type of prediction in the second embodiment is the next image randomly accessed in the coding order. In the prior art, the number of C pictures between the A picture and the first B picture after the A picture, or between two adjacent B pictures is usually a fixed value, which is set to m. Any one or more of the m C pictures can be changed to be encoded in the fourth type of prediction mode in the scheme of the present invention to obtain a D picture.
本发明实施例三为用于实现本发明方案的随机访问编码装置和解码 装置。 其中, 随机访问编码装置如图 3所示, 包括:  Embodiment 3 of the present invention is a random access coding apparatus and a decoding apparatus for implementing the scheme of the present invention. The random access coding device is as shown in FIG. 3, and includes:
随机访问判断模块 301 , 用于接收未编码视频序列, 判断所接收的 当前图象是否为随机访问切入图象, 若是则将所述当前图象及其后图象 发送至随机访问编码模块 302, 否则将所述当前图象及其后图象发送至 正常编码模块 303。  The random access judging module 301 is configured to receive an uncoded video sequence, determine whether the received current image is a random access punctured image, and if yes, send the current image and the subsequent image to the random access encoding module 302, Otherwise, the current image and its subsequent image are sent to the normal encoding module 303.
随机访问编码模块 302, 用于对所收到的随机访问切入图象及其之 后的图象按照随机访问编码规则进行编码, 并将编码后的视频序列发送 至存储模块 304。 其中, 随机访问编码规则原则上满足如下流程:  The random access coding module 302 is configured to encode the received random access hand-in image and subsequent images according to a random access coding rule, and send the encoded video sequence to the storage module 304. The random access coding rule satisfies the following process in principle:
A、 将当前随机访问切入图象用第一类预测方式进行编码;  A. The current random access cut-in image is encoded by the first type of prediction mode;
B、 将所述未编码视频序列中, 编码顺序在所述随机访问切入图象 之后的 n幅图象以所述随机访问切入图象作为参考图象, 用第四类预测 方式进行编码, n为预先设定的正整数; B. In the uncoded video sequence, the coding sequence is in the random access cut-in image The subsequent n images are encoded by the random access cut-in image as a reference image, and encoded by a fourth type of prediction, where n is a predetermined positive integer;
c、 对于编码顺序在所述随机访问切入图象之后其它未编码图象, 以所述随机访问切入图象或 /和所述随机访问切入图象之后的已编码图 象作为参考图象进行编码。  c. encoding, for the encoded sequence, the other uncoded image after the random access of the cut image, the encoded image after the random access cut image or/and the random access cut image as a reference image .
随机访问编码规则的具体内容要依照视频序列的具体格式而定。 例 如采用实施例一所述编码方式的码流可参照本发明实施例一的编码方 式进行编码。  The specific content of the random access encoding rule depends on the specific format of the video sequence. For example, the code stream according to the coding mode of the first embodiment can be coded by referring to the coding mode of the first embodiment of the present invention.
正常编码模块 303 , 用于对所收到的当前图象及其后图象按照正常 编码规则进行编码, 并将编码后的视频序列发送至存储模块 304。 所述 正常编码规则可以是任何无需实现随机访问的视频序列编码方法, 例如 以第二类预测方式或第三类预测方式进行编码方法等。  The normal encoding module 303 is configured to encode the received current image and subsequent images according to normal encoding rules, and send the encoded video sequence to the storage module 304. The normal coding rule may be any video sequence coding method that does not need to implement random access, for example, a coding method in a second type of prediction mode or a third type of prediction mode.
存储模块 304, 用于存储所收到的视频序列, 并将所存储的视频序 列输出。  The storage module 304 is configured to store the received video sequence and output the stored video sequence.
本发明实施例的随机访问解码装置如图 4所示, 包括:  As shown in FIG. 4, the random access decoding apparatus of the embodiment of the present invention includes:
随机访问判断模块 401 , 用于接收视频序列, 判断所接收的图象是 否为随机访问切入图象, 若是则将所述视频序列发送至随机访问解码模 块 402, 否则将所述视频序列发送至正常解码模块 403;  The random access judging module 401 is configured to receive a video sequence, determine whether the received image is a random access punctured image, and if yes, send the video sequence to the random access decoding module 402, otherwise send the video sequence to normal. Decoding module 403;
随机访问解码模块 402, 用于对所收到的视频序列中随机访问标识 对应的图象及其之后的图象按照随机访问解码规则进行解码, 输出解码 后的视频序列。 所述随机访问解码规则与上述编码装置中的随机访问编 码规则相互对应, 遵循如下流程:  The random access decoding module 402 is configured to decode the image corresponding to the random access identifier in the received video sequence and the subsequent image according to a random access decoding rule, and output the decoded video sequence. The random access decoding rule corresponds to the random access coding rule in the above coding apparatus, and follows the following process:
a、 将未解码视频序列中的随机访问切入图象解码;  a, decoding random access in the undecoded video sequence into the image;
b、对于解码顺序在所述随机访问切入图象之后的 n幅以第四类编码 方式编码的图象解码, n为预先设定的正整数; C、 将所述随机访问切入图象, 和 /或所述随机访问切入图象之后的 以第四类编码方式编码的已解码图象作为解码参考图象, 对解码顺序在 所述随机访问切入图象之后其它未解码图象进行解码。 b, for the decoding sequence, after the random access cut-in image, n images encoded by the fourth type of encoding are decoded, n is a preset positive integer; C. cutting the random access into the image, and/or decoding the decoded image encoded by the fourth type of encoding after the random access of the cut image as a decoded reference image, and cutting the random access in the decoding order The other undecoded pictures after the picture are decoded.
正常编码模块 403, 用于对所收到的视频序列按照正常解码规则进 行解码, 并输出解码后的视频序列。  The normal encoding module 403 is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
显示模块 404, 用于接收来自随机访问解码模块 402或正常编码模 块 403的解码后的视频序列, 并将所述视频码序列显示为视频图象。  The display module 404 is configured to receive the decoded video sequence from the random access decoding module 402 or the normal encoding module 403, and display the video code sequence as a video image.
通过大量实验表明, 在同样支持随机访问的情况下, 本发明方案和 MPEG - 2及 H.264随机访问的方法相比, 能显著节省码率。  It has been shown by a large number of experiments that the scheme of the present invention can significantly save the code rate as compared with the MPEG-2 and H.264 random access methods in the case of also supporting random access.
以上所述仅为本发明的较佳实施例而已, 并不用以限制本发明, 凡 在本发明的精神和原则之内所作的任何修改、 等同替换和改进等, 均应 包含在本发明的保护范围之内。  The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims

权利要求书 Claim
1、 一种视频编码方法, 其特征在于, 包括如下步骤: A video encoding method, comprising the steps of:
对未编码视频序列中的随机访问切入图象采用第一类预测方式编 码;  The random access cut-in image in the uncoded video sequence is coded by the first type of prediction;
将编码顺序位于所述随机访问切入图象之后的 n幅未编码图象以所 述随机访问切入图象作为参考图象按照第四类预测方式进行编码, n为 预先设定的正整数;  And n uncoded images whose encoding order is located after the random access cut-in image are encoded by the random access cut-in image as a reference image according to a fourth type of prediction manner, where n is a preset positive integer;
对于编码顺序在所述随机访问切入图象之后的其它未编码图象, 以 所述随机访问切入图象或 /和所述随机访问切入图象之后的已编码图象 作为参考图象进行编码。  For the other uncoded pictures whose encoding order is after the random access cut-in image, the randomized access cut-in image or/and the randomized access-cut image after the cut-in image are encoded as reference pictures.
2、根据权利要求 1所述的编码方法, 其特征在于, 所述对未编码视 频序列中的随机访问切入图象采用第一类预测方式编码之前, 进一步包 括: 判断所述未编码视频序列中是否出现随机访问切入图象, 若是则执 行所述对未编码视频序列中的随机访问切入图象采用第一类预测方式 编码的步骤, 否则执行现有技术的视频序列编码流程。  The encoding method according to claim 1, wherein before the random access hand-in image in the uncoded video sequence is encoded by the first type of prediction, the method further comprises: determining the uncoded video sequence Whether a random access punctured image occurs, and if so, the step of encoding the random access punctured image in the uncoded video sequence using the first type of prediction mode is performed, otherwise the prior art video sequence encoding process is performed.
3、根据权利要求 1所述的编码方法, 其特征在于, 所述 n幅未编码 图象为编码顺序在所述随机访问切入图象之后的第 k幅未编码图象, k 为预先设定的正整数。  The encoding method according to claim 1, wherein the n uncoded images are k-th uncoded images whose encoding order is after the random access cut-in image, and k is preset Positive integer.
4、 根据权利要求 1、 2或 3任一项所述的编码方法, 其特征在于, 所述以所述随机访问切入图象或 /和所述随机访问切入图象之后的已编 码图象作为参考图象为:以所述第一类预测方式得到的图象和 /或第四类 预测方式得到的图象作为参考图象。  The encoding method according to any one of claims 1, 2 or 3, wherein said encoded image after said random access cut image or/and said random access cut image is used as The reference image is an image obtained by the first type of prediction and/or an image obtained by the fourth type of prediction as a reference image.
5、根据权利要求 4所述的编码方法, 其特征在于, 所述以所述随机 访问切入图象或 /和所述随机访问切入图象之后的已编码图象作为参考 图象进行编码为: 采用第二类预测方式或第三类预测方式, 以所述随机 访问切入图象或 /和所述随机访问切入图象之后的已编码图象作为参考 图象进行编码。 The encoding method according to claim 4, wherein said encoded image after said random access cut image or/and said random access cut image is used as a reference The image is encoded as: using the second type of prediction mode or the third type of prediction mode, encoding the encoded image after the random access cut image or/and the random access cut image as a reference image.
6、 一种视频解码方法, 其特征在于, 包括如下步骤:  6. A video decoding method, comprising: the following steps:
将未解码视频序列中的随机访问切入图象解码;  Decoding the random access in the undecoded video sequence into the image;
对于解码顺序在所述随机访问切入图象之后的 n幅以第四类预测方 式编码的未解码图象解码, n为预先设定的正整数;  Decoding the undecoded image encoded in the fourth type of prediction mode after the random access plunge image, and n is a predetermined positive integer;
将所述随机访问切入图象,和 /或所述随机访问切入图象之后的以第 四类预测方式编码的已解码图象作为解码参考图象, 对解码顺序在所述 随机访问切入图象之后其它未解码图象进行解码。  Placing the random access into the image, and/or the decoded image encoded by the fourth type of prediction after the random access of the cut image as a decoded reference image, and the random access cut image for the decoding order Other undecoded pictures are then decoded.
7、根据权利要求 6所述的解码方法, 其特征在于, 所述将未解码视 频序列中的随机访问切入图象解码之前, 进一步包括: 判断当前图象是 否为随机访问切入图象, 若是则执行所述将未解码视频序列中的随机访 问切入图象解码的步骤, 否则执行现有技术的视频序列解码流程。  The decoding method according to claim 6, wherein the decoding of the random access punctured image in the undecoded video sequence further comprises: determining whether the current image is a random access punctured image, and if so The step of decoding the random access in the undecoded video sequence into the image is performed, otherwise the prior art video sequence decoding process is performed.
8、根据权利要求 6或 7任一项所述的解码方法, 其特征在于, 所述 n幅以第四类编码方式编码的未解码图象为: 解码顺序在所述随机访问 切入图象之后的第 k幅未解码图象, k为预先设定的正整数。  The decoding method according to any one of claims 6 or 7, wherein the n uncoded images encoded in the fourth type of encoding are: decoding order after the random access punctured image The kth undecoded image, k is a pre-set positive integer.
9、根据权利要求 8所述的解码方法, 其特征在于, 所述对解码顺序 在所述随机访问切入图象之后其它未解码图象进行解码为: 采用第二类 预测方式或第三类预测方式, 对解码顺序在所述随机访问切入图象之后 其它未解码图象进行解码。  The decoding method according to claim 8, wherein the decoding of the other undecoded image after the random access of the image is decoded into: using a second type of prediction method or a third type of prediction In the manner, the decoding sequence is decoded after the random access of the cut-in image.
10、 一种视频编码装置, 其特征在于, 包括:  10. A video encoding apparatus, comprising:
随机访问判断模块, 用于接收未编码视频序列, 判断所接收的当前 图象是否出现随机访问切入图象, 若是则将当前图象及后续图象送至随 机访问编码模块, 否则将所述未编码视频序列发送至正常编码模块; 随机访问编码模块, 用于对所收到的当前随机访问切入图象及其之 后的图象按照随机访问编码规则进行编码, 输出编码后的视频序列; 正常编码模块, 用于对所收到的当前图象及其之后的图象按照正常 编码规则进行编码, 输出编码后的视频序列。 a random access judging module, configured to receive an uncoded video sequence, determine whether the received current image has a random access punctured image, and if yes, send the current image and the subsequent image to the random access coding module, otherwise The encoded video sequence is sent to the normal encoding module; a random access coding module, configured to encode the received random access access image and subsequent images according to a random access coding rule, and output the encoded video sequence; a normal coding module, configured to receive the received video The current image and subsequent images are encoded according to normal encoding rules, and the encoded video sequence is output.
11、根据权利要求 10所述的编码装置, 其特征在于, 所述编码装置 进一步包括: 存储模块, 用于存储来自随机访问编码模块或正常编码模 块的视频序列, 并将所存储的视频序列输出。  The encoding device according to claim 10, wherein the encoding device further comprises: a storage module, configured to store a video sequence from a random access encoding module or a normal encoding module, and output the stored video sequence .
12、 一种视频解码装置, 其特征在于, 包括:  12. A video decoding device, comprising:
随机访问判断模块, 用于接收视频序列, 判断所接收当前解码图象 是否为随机访问切入图象, 若是则将所述视频序列发送至随机访问解码 模块, 否则将所述视频序列发送至正常解码模块;  a random access judging module, configured to receive a video sequence, determine whether the received current decoded image is a random access punctured image, and if yes, send the video sequence to a random access decoding module, otherwise send the video sequence to normal decoding Module
随机访问解码模块, 用于对所收到的视频序列中随机访问切入图象 及其之后的图象按照随机访问解码规则进行解码, 输出解码后的视频序 列;  a random access decoding module, configured to decode a random access access image and subsequent images in the received video sequence according to a random access decoding rule, and output the decoded video sequence;
正常编码模块, 用于对所收到的视频序列按照正常解码规则进行解 码, 并输出解码后的视频序列。  The normal encoding module is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
13、根据权利要求 12所述的解码装置, 其特征在于, 所述解码装置 还包括显示模块, 用于接收解码后的视频序列并显示为视频图象。  The decoding apparatus according to claim 12, wherein said decoding means further comprises a display module for receiving the decoded video sequence and displaying it as a video image.
14、 一种视频编解码装置, 包括编码器和解码器, 其特征在于, 所 述解码器包括:  A video encoding and decoding apparatus, comprising an encoder and a decoder, wherein the decoder comprises:
随机访问判断模块, 用于接收未编码视频序列, 判断所接收的当前 图象是否出现随机访问切入图象, 若是则将当前图象及后续图象送至随 机访问编码模块, 否则将所述未编码视频序列发送至正常编码模块; 随机访问编码模块, 用于对所收到的当前随机访问切入图象及其之 后的图象按照随机访问编码规则进行编码, 输出编码后的视频序列; 正常编码模块, 用于对所收到的当前图象及其之后的图象按照正常 编码规则进行编码, 输出编码后的视频序列; a random access judging module, configured to receive an uncoded video sequence, determine whether the received current image has a random access punctured image, and if yes, send the current image and the subsequent image to the random access coding module, otherwise The encoded video sequence is sent to the normal encoding module; the random access encoding module is configured to encode the received random access access image and subsequent images according to a random access encoding rule, and output the encoded video sequence; a normal encoding module, configured to encode the received current image and subsequent images according to a normal encoding rule, and output the encoded video sequence;
所述解码器包括:  The decoder includes:
随机访问判断模块, 用于接收视频序列, 判断所接收当前解码图象 是否为随机访问切入图象, 若是则将所述视频序列发送至随机访问解码 模块, 否则将所述视频序列发送至正常解码模块;  a random access judging module, configured to receive a video sequence, determine whether the received current decoded image is a random access punctured image, and if yes, send the video sequence to a random access decoding module, otherwise send the video sequence to normal decoding Module
随机访问解码模块, 用于对所收到的视频序列中随机访问切入图象 及其之后的图象按照随机访问解码规则进行解码, 输出解码后的视频序 歹ΐ ;  a random access decoding module, configured to decode the cut-in image and the subsequent image in the received video sequence according to a random access decoding rule, and output the decoded video sequence;
正常编码模块, 用于对所收到的视频序列按照正常解码规则进行解 码, 并输出解码后的视频序列。  The normal encoding module is configured to decode the received video sequence according to a normal decoding rule, and output the decoded video sequence.
PCT/CN2007/071167 2006-12-04 2007-12-04 Method and apparatus of video coding and decoding WO2008067761A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200610161041.4 2006-12-04
CN 200610161041 CN101198052B (en) 2006-12-04 2006-12-04 Video encoding method, decoding method and device

Publications (1)

Publication Number Publication Date
WO2008067761A1 true WO2008067761A1 (en) 2008-06-12

Family

ID=39491681

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/071167 WO2008067761A1 (en) 2006-12-04 2007-12-04 Method and apparatus of video coding and decoding

Country Status (2)

Country Link
CN (1) CN101198052B (en)
WO (1) WO2008067761A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103765908A (en) * 2011-07-02 2014-04-30 三星电子株式会社 Method and apparatus for multiplexing and demultiplexing video data to identify reproducing state of video data.

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9706227B2 (en) * 2011-03-10 2017-07-11 Qualcomm Incorporated Video coding techniques for coding dependent pictures after random access
JP6045222B2 (en) * 2012-06-28 2016-12-14 株式会社Nttドコモ Moving picture predictive decoding apparatus, method and program
US10062416B2 (en) * 2012-07-10 2018-08-28 Sony Corporation Image decoding device, and image decoding method, image encoding device, and image encoding method
CN105516729B (en) * 2014-09-23 2019-12-31 浙江大学 Video coding and decoding method and device and generated video code stream

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5543847A (en) * 1992-12-14 1996-08-06 Sony Corporation Picture coding and decoding method for random accessing
CN1390045A (en) * 2001-06-01 2003-01-08 三洋电机株式会社 Image processing apparatus realized in integrated circuit chip
CN1445989A (en) * 2002-03-14 2003-10-01 松下电器产业株式会社 Regeneration device
US20050169371A1 (en) * 2004-01-30 2005-08-04 Samsung Electronics Co., Ltd. Video coding apparatus and method for inserting key frame adaptively

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3263807B2 (en) * 1996-09-09 2002-03-11 ソニー株式会社 Image encoding apparatus and image encoding method
JP3380763B2 (en) * 1998-01-23 2003-02-24 松下電器産業株式会社 Image processing method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5543847A (en) * 1992-12-14 1996-08-06 Sony Corporation Picture coding and decoding method for random accessing
CN1390045A (en) * 2001-06-01 2003-01-08 三洋电机株式会社 Image processing apparatus realized in integrated circuit chip
CN1445989A (en) * 2002-03-14 2003-10-01 松下电器产业株式会社 Regeneration device
US20050169371A1 (en) * 2004-01-30 2005-08-04 Samsung Electronics Co., Ltd. Video coding apparatus and method for inserting key frame adaptively

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103765908A (en) * 2011-07-02 2014-04-30 三星电子株式会社 Method and apparatus for multiplexing and demultiplexing video data to identify reproducing state of video data.
US9788003B2 (en) 2011-07-02 2017-10-10 Samsung Electronics Co., Ltd. Method and apparatus for multiplexing and demultiplexing video data to identify reproducing state of video data

Also Published As

Publication number Publication date
CN101198052B (en) 2010-05-19
CN101198052A (en) 2008-06-11

Similar Documents

Publication Publication Date Title
US10390049B2 (en) Electronic devices for sending a message and buffering a bitstream
US9262986B2 (en) Reference frame management for screen content video coding using hash or checksum functions
US9503729B2 (en) Encoding or decoding method and apparatus
EP2191650B1 (en) Architecture for multi-stage decoding of a cabac bitstream
CN103907351B (en) Reference picture is marked
WO2016131223A1 (en) Frame loss method for video frame and video sending apparatus
EP3484155B1 (en) Method and device for processing video data
WO2010037317A1 (en) Video communication system, device and method based on feedback reference frames
US8953671B2 (en) Codec capability negotiation method and terminal thereof
WO2013113217A1 (en) Decoding method and device
US8660191B2 (en) Software video decoder display buffer underflow prediction and recovery
TW200952498A (en) CABAC decoding unit and decoding method
WO2013091391A1 (en) Video encoding method and device
WO2008067761A1 (en) Method and apparatus of video coding and decoding
TW201206202A (en) Moving image prediction encoding device, moving image prediction encoding method, moving image prediction encoding program, moving image prediction decoding device, moving image prediction decoding method, and moving image prediction decoding program
WO2012119463A1 (en) Coding method, decoding method, and equipments for transform coefficients
CN106791875B (en) Video data decoding method, video data encoding method and related devices
US9426460B2 (en) Electronic devices for signaling multiple initial buffering parameters
CN111654724B (en) Low-bit-rate coding transmission method of video conference system
WO2013155828A1 (en) Video image code stream processing method and device
WO2010083743A1 (en) Encoding/decoding method and device based on double prediction
CN110300305B (en) Video encoding apparatus and method
KR20060068254A (en) Video encoding method, video decoding method, and video decoder
WO2005039190A1 (en) Method for video clip assisted by clip identifier
KR20060099805A (en) Apparatus for bit stream encoding / decoding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07817357

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07817357

Country of ref document: EP

Kind code of ref document: A1