Summary of the invention
One of purpose of the present invention is that the important information of key area is fully used in interframe movement prediction and compensation, thereby overcomes the incoherent technological deficiency of action of monitored object.
For achieving the above object, the invention provides a kind of video code flow compression method, comprise the following steps:
A frame in the video code flow of collection or multiple image are divided into to first area image and second area image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded;
Described first area image and described second area image are decoded, first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image, adopt the reconstructed image after but the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign, predict and compensation with the interframe movement for other image as the reference image.
As one embodiment of the present of invention, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, but and with the reference picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.
As one embodiment of the present of invention, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, and with the in-frame encoding picture of the immediate discardable sign of unmarked second area of having encoded of current frame image.
As one embodiment of the present of invention, the method further comprises: after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.
As one embodiment of the present of invention, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned, specifically comprise:
Judge whether to need the downscaled video code stream; If judgement needs the downscaled video code stream, the encoding strip thereof of the second area of the discardable sign of described mark is deleted from code stream, and exported all the other code streams; If judgement does not need the downscaled video code stream, by all coded image output.
As one embodiment of the present of invention, adopting image level parameter or slice level parameters is the discardable sign of second area image tagged.
As one embodiment of the present of invention, increase to mean the whether discardable image level parameter of content of described second area image, and mean at least one parameter in image level parameter that whether encoding strip thereof of described second area be dropped.
The present invention also provides a kind of video code flow compression set, and this device comprises:
Dividing mark unit: the frame in the video code flow of collection or multiple image are divided into to first area image and second area image, by the discardable sign of second area mark;
Coding unit: described first area image and described second area image are divided into to different bands and are encoded;
Decoding unit: described first area image and described second area image are decoded;
Reference picture determining unit: predict and compensation for the interframe movement of other image the first area image in decoded reconstructed image as the reference image; And but the reconstructed image identical with the second area picture position that adopts existing reference picture replace the reconstructed image after the second area image decoding of discardable sign, as the reference image with the prediction of the interframe movement for other image and compensation.
As one embodiment of the present of invention, this device further comprises:
Code stream reduction unit: for after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.
The present invention also provides a kind of video code flow decompression method, and the video code flow after the compression received is decompressed, and comprises the following steps:
First area image and second area image after the compression received are decoded, first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.
The present invention also provides a kind of video code flow decompressing device, and this device comprises:
Decoding unit: the first area image and the second area image that receive are decoded;
The reference picture determining unit: the first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.
The present invention is divided into first area image and second area image by the frame in the video code flow by collection or multiple image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded; Described first area image and described second area image are decoded, first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image, adopt the reconstructed image after but the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign, predict and compensation with the interframe movement for other image as the reference image.Can make like this receiving terminal the time not lose the action details in first area in decoding, and then overcome monitoring image and move incoherent defect.
The aspect that the present invention is additional and advantage part in the following description provide, and part will become obviously from the following description, or recognize by practice of the present invention.
Embodiment
Below describe embodiments of the invention in detail, the example of described embodiment is shown in the drawings, and wherein same or similar label means same or similar element or the element with identical or similar functions from start to finish.Be exemplary below by the embodiment be described with reference to the drawings, only for explaining the present invention, and can not be interpreted as limitation of the present invention.
The flow chart that Fig. 2 is video code flow compression method of the present invention; As shown in Figure 2, the method comprises the following steps:
S201, be divided into first area image and second area image by the frame in the video code flow of collection or multiple image, and a frame that will be wherein or the discardable sign of second area mark of multiple image; Described first area image and described second area image are divided into to different bands is encoded;
S202, decoded to described first area image and described second area image, and the first area image in decoded reconstructed image is interframe movement prediction and the compensation for other image as the reference image
S203, but the reconstructed image after the reconstructed image identical with the second area picture position that has reference picture replaced the second area image decoding of discardable sign adopted, as the reference image, with the interframe movement for other image, predict and compensation.
In S203, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, but and with the reference picture of the immediate discardable sign of unmarked second area of having encoded of current frame image, but reference picture is I two field picture or P two field picture.
Or, but described existing reference picture be positioned at current frame image on DISPLAY ORDER before, and with the in-frame encoding picture of the immediate discardable sign of unmarked second area of having encoded of current frame image, in-frame encoding picture is the I two field picture.
The method further comprises: after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.
As the preferred embodiments of the present invention, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned, specifically comprise:
Judge whether to need the downscaled video code stream; If judgement needs the downscaled video code stream, the encoding strip thereof of the second area of the discardable sign of described mark is deleted from code stream, and exported all the other code streams; If judgement does not need the downscaled video code stream, by all coded image output.
The specific implementation process of the inventive method is described below by Fig. 3 and Fig. 4, as shown in Figure 3, schematic diagram for one embodiment of the invention, whole frame image is divided into to first area (as shown in solid box) and second area (as shown in the dotted line frame), when headend equipment is encoded, k, k+2, k+4 frame second area is marked as not discardable; K+1, the discardable sign of k+3 frame second area mark.P
k+1,2the second area image that means the k+1 frame, P
k+3,2the second area image that means the k+3 frame, P
k+1,1the first area image that means the k+1 frame, P
k+3,1the first area image that means the k+3 frame.
When coding, the image of every two field picture first area and second area is divided into different bands and is encoded.
When coding, P
k+1,2with P
k+3,2the coded image information in zone is not used in P
k+1,1with P
k+3,1infra-frame prediction process in the area image coding.
When needs reduction encoding code stream, P
k+1,2with P
k+3,2the encoding strip thereof in zone can be dropped.
And, as shown in Figure 3, when coding, P
k+1,2with P
k+3,2the reconstructed image in zone is not used in the inter prediction in other Image Coding and compensation.
Fig. 4-1 and Fig. 4-2 be embodiment illustrated in fig. 3 in the schematic diagram of two kinds of possible inter prediction processes, as shown in Fig. 4-1, can be with the reconstructed image P of the second area of k+2 frame
k+2,2replace the reconstructed image P of the second area of k+3 frame
k+3,2, P ' is arranged
k+3,2equal P
k+2,2.
As shown in Fig. 4-2, can be with the reconstructed image I of the second area of k frame
k, 2replace the reconstructed image P of the second area of k+3 frame
k+3,2, P ' is arranged
k+3,2equal I
k, 2.
But reference picture when the k+3 frame is as other reference picture of coding, while in coding k+4 two field picture process, carrying out motion prediction and compensation, when reference picture index is pointed to the k+3 frame, if the prediction piece of motion vectors point is positioned at first area, export P
k+3,1middle relevant block is as the prediction piece, if the prediction piece of motion vectors point is positioned at second area, with P '
k+3,2the piece of middle relevant position is as the output of prediction piece.
As one embodiment of the present of invention, described discardable image level parameter or the slice level parameters of being designated.
As one embodiment of the present of invention, increase to mean the whether discardable image level parameter of content of described second area image, and mean at least one parameter in image level parameter that whether encoding strip thereof of described second area be dropped.
As preferred embodiment, the first area image of above statement can be area-of-interest (ROI) image, and the second area image can be the Background regional image picture.That is, by the discardable sign of background area mark, and then carry out determining of reference picture.In some other application, described first area image can be open zone, and the second area image, for the zone of maintaining secrecy, only has specific user just can see, by the discardable sign of closed security zone field mark, the reconstructed image after this regional decoding is not used in inter prediction and the compensation of other image.
Below will take area-of-interest and background area is described as example.
When coding, image is divided into to area-of-interest (ROI) and background area, background area is labeled as discardable, the encoding strip thereof that will be labeled as discardable background area when needing the downscaled video code check abandons, thereby, when guaranteeing the downscaled video code check, can also make receiving terminal not lose the action details in the ROI zone when decoding.
Described in this embodiment a kind of application scenarios, comprised video server and provide at least one headend equipment of video image and videoconference client for video server.Each headend equipment sends to video server after gathering image, after being gathered by video server, offers videoconference client again.Headend equipment is divided into area-of-interest (ROI) by image and ,Jiang background area, background area is labeled as discardable in when coding in this embodiment.Thereby when headend equipment compresses for carrying out video code flow, can the encoding strip thereof of corresponding background area be abandoned according to transmission network condition and discardable sign, can not affect the decoding of other frame in code stream simultaneously.Certainly, the process abandoned can complete equally in video server.
The present invention also can increase by two parameters in picture parameter set, means that respectively whether the content of background area is discardable, and whether the content that reaches background area is dropped, and for example whether whether these two parameters be " discardable " and " abandoning ".When for the background area image, making discardable sign, if whether " discardable " is labeled as 1, the discardable sign that meaned this background area mark, its content can abandon when needed, does not affect the subsequent frame decoding.If whether " discardable " is labeled as 0, the not discardable sign that meaned this background area mark, its content cannot abandon, otherwise the subsequent frame decoding can make mistakes.Be labeled as 1 if " whether abandon ", mean that this frame background area content is dropped, does not need to separate the background area content during decoding.Be labeled as 0 if " whether abandon ", mean that this frame background area content also in code stream, need to decode during decoding and obtain the background area image.The so as above described schematic diagram of Fig. 3, after having abandoned part background area image, k, k+2, whether " discardable " of the background area in the k+4 frame is labeled as 0, and whether " abandoning " is labeled as 0; K+1, whether " discardable " of the background area in the k+3 frame is labeled as 1, and whether " abandoning " is labeled as 1.
More specifically, the present invention proposes a specific embodiment that above-mentioned parameter is set, in the syntactic definition at the image parameter collection, newly-increased as given a definition:
Picture parameter set () |
Descriptor |
|
|
background_discardable |
|
1 bit |
if(background_discardable){ |
|
non_roi_skip_flag |
1 bit |
} |
|
Other parameter |
|
} |
|
Wherein, the semantic description of background_discardable and non_roi_skip_flag syntactic element is as follows: background_discardable: whether the background area that means present frame can abandon, if this value is 1, should be with reference to the background area in present frame when subsequent frame is encoded.Non_roi_skip_flag: mean whether the background area in present frame is dropped, if this value is 1, mean not comprise the background area part in present frame in code stream.
As one embodiment of the present of invention, discardable sign can adopt slice level parameters.As a kind of possible bit stream unit (NAL) as shown in Figure 5.Unit loads is the encoding strip thereof data, is useful on the whether discardable NAL cell type of the encoding strip thereof sign nal_unit_type of sign second area image in unit header.As unit loads be mark the second area image of discardable sign (as the P in Fig. 3
k+3,2) encoding strip thereof, in the corresponding unit head, nal_unit_type equals 2, unit loads is to be labeled as not discardable second area image (as the P in Fig. 3
k+3,1) encoding strip thereof, in the corresponding unit head, nal_unit_type equals 1.
The present invention, when guaranteeing Video coding efficiency, can also make receiving terminal not lose the action details in the ROI zone when decoding.Corresponding with the video code flow compression method, the present invention also provides a kind of video code flow compression set, the structure chart that Fig. 6 is video code flow compression set of the present invention; As shown in Figure 6, this device comprises: dividing mark unit 601, and coding unit 602, decoding unit 603, reference picture determining unit 604, wherein:
Dividing mark unit 601: the frame in the video code flow of collection or multiple image are divided into to first area image and second area image, by the discardable sign of second area mark;
Coding unit 602: described first area image and described second area image are divided into to different bands and are encoded;
Decoding unit 603: described first area image and described second area image are decoded;
Reference picture determining unit 604: predict and compensation for the interframe movement of other image the first area image in decoded reconstructed image as the reference image; And but the reconstructed image identical with the second area picture position that adopts existing reference picture replace the reconstructed image after the second area image decoding of discardable sign, as the reference image with the prediction of the interframe movement for other image and compensation.
The preferred structure figure that Fig. 7 is video code flow compression set of the present invention, as shown in Figure 7, this device further comprises:
Code stream reduction unit 605: for after described first area image and described second area Image Coding, while needing the reduction code stream, the encoding strip thereof of the second area of the discardable sign of described mark is abandoned.
The present invention also provides a kind of video code flow decompression method, and the video code flow after the compression received is decompressed, and comprises the following steps:
First area image and second area image after the compression received are decoded, first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.
Corresponding with decompression method, the present invention also provides a kind of video code flow decompressing device, and this device comprises:
Decoding unit: the first area image and the second area image that receive are decoded;
The reference picture determining unit: the first area image in decoded reconstructed image is predicted and compensation for image decompression compression process interframe movement as the reference picture of other image, but the reconstructed image identical with the second area picture position that adopts existing reference picture replaced the reconstructed image after the second area image decoding of discardable sign, as the reference picture of other image with for the prediction of image decompression compression process interframe movement and compensation.
Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification to these embodiment, scope of the present invention is by claims and be equal to and limit.