CN101841704B

CN101841704B - Method and device for compressing and decompressing video bit stream

Info

Publication number: CN101841704B
Application number: CN2010100042542A
Authority: CN
Inventors: 卢京辉; 王浩; 邱嵩; 杨晓东
Original assignee: Vimicro Corp
Current assignee: FUZHOU ZHONGXING ELECTRONICS Co Ltd
Priority date: 2009-01-14
Filing date: 2010-01-14
Publication date: 2013-12-11
Anticipated expiration: 2030-01-14
Also published as: CN101841704A

Abstract

The invention provides a method for compressing a video bit stream, which comprises the following steps: dividing one or more images in collected a video bit stream into a first regional image and a second regional image, and marking a discardable identification on the second region of the one or more images; dividing the first regional image and the second regional image into different stripes for coding; decoding the first regional image and the second regional image; using a first regional image in the decoded restructured image as a reference image for interframe movement prediction and compensation of other images; and adopting the restructured image, which has the same position as the second regional image, of the existing reference image to replace the decoded restructured image of the second regional image with the discardable identification as a reference image for the interframe movement prediction and compensation of other images. Thus, a receiving terminal does not loss the action details in the first region in the process of decoding, thereby overcoming the defect of the non-consistent actions of a monitoring image.

Description

Video code flow compression and the method and apparatus decompressed

Technical field

The present invention relates to the image technique field, particularly the method and apparatus of a kind of video code flow compression and decompression.

Background technology

Common digital video coding sequence is comprised of a series of continuous images usually, and image may be comprised of one or more band, and a band can comprise the information of the part of whole image or image.In the process of being encoded at the piece in an image, can utilize the information of other piece in same band to carry out infra-frame prediction, but or the piece in the decoded reconstructed image of the reference picture that utilizes other to encode carry out inter prediction and compensation.

The code stream that video compression obtains generally need to carry out sending and receiving by transmission mediums such as networks, comprises the various situations such as private network, the Internet, ADSL, mobile radio networks.According to the difference of the network bandwidth, likely need code stream is adjusted, so that sent under lower bandwidth.Existing technology has the mode of multiple adjustment code stream, comprises and reduces spatial resolution, reduction frame per second (temporal resolution), reduction image quality etc., to realize the reduction to the code stream size.

For example, in more common field of video monitoring, the video camera of front monitoring front-end, with the picture of the speed output 720x576 pixel of per second 25 frames, after video compressing module, is kept at code stream for example, in local storage medium (disk array).During code stream under certain staff far wants to record by this CCTV camera of ADSL access to netwoks, due to 720 * 576 pixels, after 25 frames/compression second, code check surpasses the bandwidth of ADSL, therefore in the situation that make current ADSL network can't transmit complete code stream.In this case, in prior art, fairly simple method is frame losing.As shown in Figure 1, be the schematic diagram of P frame in prior art, as can be seen from the figure can will can not abandon to reduce frame per second with reference to the P frame, to realize the purpose of downscaled video code check.But, what abandon can not be with reference to comprising the comparatively active subregion of motion of paying close attention in video monitoring scene in the P frame, the perhaps part key area in monitoring scene, the detail of information of the image that these are regional should be used for saying to possess higher importance for monitoring, but, along with can not abandoning with reference to the P frame, the important information of the key area comprised in this P frame also abandons thereupon, make these important informations lose value in interframe movement prediction and compensation, will make the action of monitored object discontinuous.

Summary of the invention

One of purpose of the present invention is that the important information of key area is fully used in interframe movement prediction and compensation, thereby overcomes the incoherent technological deficiency of action of monitored object.

For achieving the above object, the invention provides a kind of video code flow compression method, comprise the following steps:

A frame in the video code flow of collection or multiple image are divided into to first area image and second area image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded;

Described first area image and described second area image are decoded, first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image, adopt the reconstructed image after but the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign, predict and compensation with the interframe movement for other image as the reference image.

As one embodiment of the present of invention, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, but and with the reference picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.

As one embodiment of the present of invention, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, and with the in-frame encoding picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.

As one embodiment of the present of invention, the method further comprises: after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.

As one embodiment of the present of invention, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned, specifically comprise:

Judge whether to need the downscaled video code stream; If judgement needs the downscaled video code stream, the encoding strip thereof of the second area of the discardable sign of described mark is deleted from code stream, and exported all the other code streams; If judgement does not need the downscaled video code stream, by all coded image output.

As one embodiment of the present of invention, adopting image level parameter or slice level parameters is the discardable sign of second area image tagged.

As one embodiment of the present of invention, increase to mean the whether discardable image level parameter of content of described second area image, and mean at least one parameter in image level parameter that whether encoding strip thereof of described second area be dropped.

The present invention also provides a kind of video code flow compression set, and this device comprises:

Dividing mark unit: the frame in the video code flow of collection or multiple image are divided into to first area image and second area image, by the discardable sign of second area mark;

Coding unit: described first area image and described second area image are divided into to different bands and are encoded;

Decoding unit: described first area image and described second area image are decoded;

Reference picture determining unit: predict and compensation for the interframe movement of other image the first area image in decoded reconstructed image as the reference image; And but the reconstructed image identical with the second area picture position that adopts existing reference picture replace the reconstructed image after the second area image decoding of discardable sign, as the reference image with the prediction of the interframe movement for other image and compensation.

As one embodiment of the present of invention, this device further comprises:

Code stream reduction unit: for after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.

The present invention also provides a kind of video code flow decompression method, and the video code flow after the compression received is decompressed, and comprises the following steps:

First area image and second area image after the compression received are decoded, first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.

The present invention also provides a kind of video code flow decompressing device, and this device comprises:

Decoding unit: the first area image and the second area image that receive are decoded;

The reference picture determining unit: the first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.

The present invention is divided into first area image and second area image by the frame in the video code flow by collection or multiple image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded; Described first area image and described second area image are decoded, first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image, adopt the reconstructed image after but the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign, predict and compensation with the interframe movement for other image as the reference image.Can make like this receiving terminal the time not lose the action details in first area in decoding, and then overcome monitoring image and move incoherent defect.

The aspect that the present invention is additional and advantage part in the following description provide, and part will become obviously from the following description, or recognize by practice of the present invention.

The accompanying drawing explanation

Above-mentioned and/or the additional aspect of the present invention and advantage will become from the following description of the accompanying drawings of embodiments and obviously and easily understand, wherein:

The schematic diagram that Fig. 1 is P frame in prior art;

The flow chart that Fig. 2 is video code flow compression method of the present invention;

The schematic diagram that Fig. 3 is the embodiment of the present invention;

The schematic diagram of the inter prediction process that Fig. 4-1 is the embodiment of the present invention;

The schematic diagram of the inter prediction process that Fig. 4-2 are the embodiment of the present invention;

Fig. 5 is bit stream cellular construction figure of the present invention;

The structure chart that Fig. 6 is video code flow compression set of the present invention;

The preferred structure figure that Fig. 7 is video code flow compression set of the present invention.

Embodiment

Below describe embodiments of the invention in detail, the example of described embodiment is shown in the drawings, and wherein same or similar label means same or similar element or the element with identical or similar functions from start to finish.Be exemplary below by the embodiment be described with reference to the drawings, only for explaining the present invention, and can not be interpreted as limitation of the present invention.

The flow chart that Fig. 2 is video code flow compression method of the present invention; As shown in Figure 2, the method comprises the following steps:

S201, be divided into first area image and second area image by the frame in the video code flow of collection or multiple image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded;

S202, decoded to described first area image and described second area image, and the first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image

S203, but the reconstructed image after the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign adopted, as the reference image, with the interframe movement for other image, predict and compensation.

In S203, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, but and with the reference picture of the immediate discardable sign of unmarked second area of having encoded of current frame image, but reference picture is I two field picture or P two field picture.

Or, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, and with the in-frame encoding picture of the immediate discardable sign of unmarked second area of having encoded of current frame image, in-frame encoding picture is the I two field picture.

The method further comprises: after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.

As the preferred embodiments of the present invention, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned, specifically comprise:

The specific implementation process of the inventive method is described below by Fig. 3 and Fig. 4, as shown in Figure 3, schematic diagram for one embodiment of the invention, whole frame image is divided into to first area (as shown in solid box) and second area (as shown in the dotted line frame), when headend equipment is encoded, k, k+2, k+4 frame second area is marked as not discardable; K+1, the discardable sign of k+3 frame second area mark.P _k+1,2the second area image that means the k+1 frame, P _k+3,2the second area image that means the k+3 frame, P _k+1,1the first area image that means the k+1 frame, P _k+3,1the first area image that means the k+3 frame.

When coding, the image of every two field picture first area and second area is divided into different bands and is encoded.

When coding, P _k+1,2with P _k+3,2the coded image information in zone is not used in P _k+1,1with P _k+3,1infra-frame prediction process in the area image coding.

When needs reduction encoding code stream, P _k+1,2with P _k+3,2the encoding strip thereof in zone can be dropped.

And, as shown in Figure 3, when coding, P _k+1,2with P _k+3,2the reconstructed image in zone is not used in the inter prediction in other Image Coding and compensation.

Fig. 4-1 and Fig. 4-2 be embodiment illustrated in fig. 3 in the schematic diagram of two kinds of possible inter prediction processes, as shown in Fig. 4-1, can be with the reconstructed image P of the second area of k+2 frame _k+2,2replace the reconstructed image P of the second area of k+3 frame _k+3,2, P ' is arranged _k+3,2equal P _k+2,2.

As shown in Fig. 4-2, can be with the reconstructed image I of the second area of k frame _{k, 2}replace the reconstructed image P of the second area of k+3 frame _k+3,2, P ' is arranged _k+3,2equal I _{k, 2}.

But reference picture when the k+3 frame is as other reference picture of coding, while in coding k+4 two field picture process, carrying out motion prediction and compensation, when reference picture index is pointed to the k+3 frame, if the prediction piece of motion vectors point is positioned at first area, export P _k+3,1middle relevant block is as the prediction piece, if the prediction piece of motion vectors point is positioned at second area, with P ' _k+3,2the piece of middle relevant position is as the output of prediction piece.

As one embodiment of the present of invention, described discardable image level parameter or the slice level parameters of being designated.

As preferred embodiment, the first area image of above statement can be area-of-interest (ROI) image, and the second area image can be the Background regional image picture.That is, by the discardable sign of background area mark, and then carry out determining of reference picture.In some other application, described first area image can be open zone, and the second area image, for the zone of maintaining secrecy, only has specific user just can see, by the discardable sign of closed security zone field mark, the reconstructed image after this regional decoding is not used in inter prediction and the compensation of other image.

Below will take area-of-interest and background area is described as example.

When coding, image is divided into to area-of-interest (ROI) and background area, background area is labeled as discardable, the encoding strip thereof that will be labeled as discardable background area when needing the downscaled video code check abandons, thereby, when guaranteeing the downscaled video code check, can also make receiving terminal not lose the action details in the ROI zone when decoding.

Described in this embodiment a kind of application scenarios, comprised video server and provide at least one headend equipment of video image and videoconference client for video server.Each headend equipment sends to video server after gathering image, after being gathered by video server, offers videoconference client again.Headend equipment is divided into area-of-interest (ROI) by image and ，Jiang background area, background area is labeled as discardable in when coding in this embodiment.Thereby when headend equipment compresses for carrying out video code flow, can the encoding strip thereof of corresponding background area be abandoned according to transmission network condition and discardable sign, can not affect the decoding of other frame in code stream simultaneously.Certainly, the process abandoned can complete equally in video server.

The present invention also can increase by two parameters in picture parameter set, means that respectively whether the content of background area is discardable, and whether the content that reaches background area is dropped, and for example whether whether these two parameters be " discardable " and " abandoning ".When for the background area image, making discardable sign, if whether " discardable " is labeled as 1, the discardable sign that meaned this background area mark, its content can abandon when needed, does not affect the subsequent frame decoding.If whether " discardable " is labeled as 0, the not discardable sign that meaned this background area mark, its content cannot abandon, otherwise the subsequent frame decoding can make mistakes.Be labeled as 1 if " whether abandon ", mean that this frame background area content is dropped, does not need to separate the background area content during decoding.Be labeled as 0 if " whether abandon ", mean that this frame background area content also in code stream, need to decode during decoding and obtain the background area image.The so as above described schematic diagram of Fig. 3, after having abandoned part background area image, k, k+2, whether " discardable " of the background area in the k+4 frame is labeled as 0, and whether " abandoning " is labeled as 0; K+1, whether " discardable " of the background area in the k+3 frame is labeled as 1, and whether " abandoning " is labeled as 1.

More specifically, the present invention proposes a specific embodiment that above-mentioned parameter is set, in the syntactic definition at the image parameter collection, newly-increased as given a definition:

Picture parameter set ()	Descriptor

background_discardable
		1 bit
if(background_discardable){
	non_roi_skip_flag	1 bit
}
	Other parameter
}

Wherein, the semantic description of background_discardable and non_roi_skip_flag syntactic element is as follows: background_discardable: whether the background area that means present frame can abandon, if this value is 1, should be with reference to the background area in present frame when subsequent frame is encoded.Non_roi_skip_flag: mean whether the background area in present frame is dropped, if this value is 1, mean not comprise the background area part in present frame in code stream.

As one embodiment of the present of invention, discardable sign can adopt slice level parameters.As a kind of possible bit stream unit (NAL) as shown in Figure 5.Unit loads is the encoding strip thereof data, is useful on the whether discardable NAL cell type of the encoding strip thereof sign nal_unit_type of sign second area image in unit header.As unit loads be mark the second area image of discardable sign (as the P in Fig. 3 _k+3,2) encoding strip thereof, in the corresponding unit head, nal_unit_type equals 2, unit loads is to be labeled as not discardable second area image (as the P in Fig. 3 _k+3,1) encoding strip thereof, in the corresponding unit head, nal_unit_type equals 1.

The present invention, when guaranteeing Video coding efficiency, can also make receiving terminal not lose the action details in the ROI zone when decoding.Corresponding with the video code flow compression method, the present invention also provides a kind of video code flow compression set, the structure chart that Fig. 6 is video code flow compression set of the present invention; As shown in Figure 6, this device comprises: dividing mark unit 601, and coding unit 602, decoding unit 603, reference picture determining unit 604, wherein:

Dividing mark unit 601: the frame in the video code flow of collection or multiple image are divided into to first area image and second area image, by the discardable sign of second area mark;

Coding unit 602: described first area image and described second area image are divided into to different bands and are encoded;

Decoding unit 603: described first area image and described second area image are decoded;

Reference picture determining unit 604: predict and compensation for the interframe movement of other image the first area image in decoded reconstructed image as the reference image; And but the reconstructed image identical with the second area picture position that adopts existing reference picture replace the reconstructed image after the second area image decoding of discardable sign, as the reference image with the prediction of the interframe movement for other image and compensation.

The preferred structure figure that Fig. 7 is video code flow compression set of the present invention, as shown in Figure 7, this device further comprises:

Code stream reduction unit 605: for after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.

Corresponding with decompression method, the present invention also provides a kind of video code flow decompressing device, and this device comprises:

Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification to these embodiment, scope of the present invention is by claims and be equal to and limit.

Claims

1. the video code flow compression method, is characterized in that, comprises the following steps:

2. method according to claim 1, is characterized in that, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, but and with the reference picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.

3. method according to claim 1, it is characterized in that, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, and with the in-frame encoding picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.

4. method according to claim 1, it is characterized in that, the method further comprises: after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.

5. method according to claim 4, is characterized in that, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark abandoned, and specifically comprises:

6. method according to claim 1, is characterized in that, adopting image level parameter or slice level parameters is the discardable sign of second area image tagged.

7. method according to claim 6, is characterized in that, increase to mean the whether discardable image level parameter of content of described second area image, and mean at least one parameter in image level parameter that whether encoding strip thereof of described second area be dropped.

8. the video code flow compression set, is characterized in that, this device comprises:

9. device according to claim 8, is characterized in that, this device further comprises:

10. the video code flow decompression method, is characterized in that, the video code flow after the compression received is decompressed, and comprises the following steps:

11. the video code flow decompressing device, is characterized in that, this device comprises:

Decoding unit: first area image and second area image after the compression received are decoded;