CN106162190A - A kind of video image compression and the method for reading, Apparatus and system - Google Patents

A kind of video image compression and the method for reading, Apparatus and system Download PDF

Info

Publication number
CN106162190A
CN106162190A CN201510164120.XA CN201510164120A CN106162190A CN 106162190 A CN106162190 A CN 106162190A CN 201510164120 A CN201510164120 A CN 201510164120A CN 106162190 A CN106162190 A CN 106162190A
Authority
CN
China
Prior art keywords
layer
target
code stream
background
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510164120.XA
Other languages
Chinese (zh)
Inventor
武晓阳
浦世亮
沈林杰
俞海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Hikvision Digital Technology Co Ltd
Original Assignee
Hangzhou Hikvision Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Hikvision Digital Technology Co Ltd filed Critical Hangzhou Hikvision Digital Technology Co Ltd
Priority to CN201510164120.XA priority Critical patent/CN106162190A/en
Priority to PCT/CN2015/077729 priority patent/WO2016161674A1/en
Publication of CN106162190A publication Critical patent/CN106162190A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • H04N19/23Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding with coding of regions that are present throughout a whole video segment, e.g. sprites, background or mosaic

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a kind of video image compression and the method for reading, Apparatus and system.The method of this video image compression, including: going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;Respectively target figure layer and Background From Layer are encoded and each generate code stream;The code stream of target figure layer and the code stream of Background From Layer are combined.By going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, then by bit stream complex, during decoding, composite bit stream is decoded, the directly retrieval image containing destination object, improves the utilization rate calculating resource.

Description

A kind of video image compression and the method for reading, Apparatus and system
Technical field
The present invention relates to image processing field, particularly relate to a kind of video image compression and the method for reading, Apparatus and system.
Background technology
Digital video compression standard starts from the eighties in 20th century, through the development of more than 30 years, existing standard have ITU-T series H.261, H.263, MPEG-1, MPEG-4 of ISO, and the common MPEG-2/H.262, H.264/AVC formulated of two tissues, HEVC (up-to-date issue in 2013).Also have the standard of its hetero-organization, the most domestic AVS, the VC-1 of Microsoft, the VP8 etc. of Google.Identical, these standards all use block-based hybrid encoding frame, fusion forecasting coding, transition coding and the big coding techniques of entropy code 3.
Block-based hybrid encoding frame coding flow process is as shown in Figure 1, the image that will encode first carries out piecemeal process, and (block size of HEVC can change to be divided into the block of 16x16, referred to as macro block (Macroblock), from 8x8 to 64x64, it is called maximum coding unit LCU).As it is shown on figure 3, macro block encodes according to scanning sequency from left to right, from top to bottom.First each macro block is predicted coding, utilizes an above frame to rebuild encoded portions around image or macro block and makes reference, it is thus achieved that the residual error data after prediction;Residual error data carries out the transition coding in space, uses DCT or ICT to convert supplemental characteristic by different size block, obtains the conversion coefficient in frequency domain;Conversion coefficient, after quantifying, is delivered in entropy code, it is thus achieved that final code stream.In order to effectively next frame image be encoded, data after current quantisation need through reverse process, namely inverse quantization, inverse transformation, it is added with prediction data again and obtains the image decoding out, namely rebuild image, rebuild image to be placed in reference buffer storage, as the reference picture of next frame picture coding.The decoding process of block-based hybrid encoding frame is as in figure 2 it is shown, encoding code stream is after entropy decoding, inverse quantization, inverse transformation, then carries out being added the image (video signal) just obtaining understanding code with the image of prediction.Decoding image out needs to store, to do the reference picture use of lower frame decoding.
Predictive coding is the important coding techniques of video compress, according to prediction data source difference, coded image can be divided into I frame (infra-frame prediction frame, Intra), P frame (MB of prediction frame, Prediction), B frame (bi-directional predicted frames, Bi-Prediction).As shown in Figure 4, when I frame is predicted coding, the data only with this frame are predicted, and can independently decode the when of decoding, are independent of other frames.When P frame is predicted coding, the reconstruction image of an employing above frame encoded image is as reference, the when of P frame decoding, it is necessary to wait the picture decoding of reference frame to complete to decode.When B frame is predicted coding, previous frame and subsequent frames can be used simultaneously to make reference, become bi-directionally referenced frame, B frame decoding is to need above could decode after reference frame and later reference frame all successfully decodeds.P frame, B frame when coding except making reference with other frames, it is also possible to as I frame sample makes reference with these frame data, select both optimal situation just.I frame can independently decode, and is typically used as radom insertion and uses, and such as DTV requires 1~1.5 second to insert I frame, it is ensured that during user's switching channels, it is possible to see image as early as possible.But I frame compression efficiency is low, and code check is bigger, it is common that 4~10 times of P frame, even tens times.For compression efficiency, under normal circumstances I frame < P frame < and B frame, for computational complexity, I frame < P frame < B frame under normal circumstances.
Between conducting frame during prediction, the most multiple reconstruction image can be used to make reference frame, as it is shown in figure 5, it is P frame multi-frame-reference situation, when encoding the 2nd P frame, use above two two field pictures to make reference;As shown in Figure 6, it is B frame multi-frame-reference situation, and the forward reference frame of B frame has two frames, backward reference frame to be a frame.Multi-frame-reference can improve compression efficiency, also can increase the complexity of computing simultaneously.
In actual applications, particularly in video surveillance applications, user is often interested in target specific in picture, people in such as picture, car, gate area etc., wishing that these regional display quality are clear, coding interested, there are 3 area-of-interests in image shown in Fig. 7.Further, since monitor video point position is many, the time is long, causing data volume big, user wishes by the way of retrieval quickly positioning target rather than checks whole video.
Processing coding interested in existing video uses the different quantization parameter of distribution of the encoding block to area-of-interest class to realize, and generally quantifies other regions of coefficient ratio little, and image quality is high.But, dependence and the dependence of block and reference image block between the order of code stream, block and adjacent block do not become.At this moment, if user needs to retrieve video, need to decode all of picture, the picture of area-of-interest could be obtained.Normal conditions, the moving object of monitored picture is the most few, and the period containing moving object is also minority, solves all of image completely and retrieves, and the waste calculating resource is serious.
Summary of the invention
It is an object of the invention to provide the methods, devices and systems of a kind of video image compression and reading, it is by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, the directly retrieval image containing destination object, improves the utilization rate calculating resource.
For achieving the above object, the most by the following technical solutions:
First aspect uses a kind of video image compressing method, including:
Going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;
Respectively target figure layer and Background From Layer are encoded and each generate code stream;
The code stream of target figure layer and the code stream of Background From Layer are combined.
Wherein, described the code stream of target figure layer and the code stream of Background From Layer are combined, particularly as follows:
Header is added, the code stream of record object figure layer and the code stream of Background From Layer after header for code stream corresponding to image to be encoded.
Wherein, described coding target figure layer and Background From Layer respectively each generates code stream, including:
Region fixed value outside target in described target figure layer is filled;
The region fixed value at target place in described target figure layer corresponding in described Background From Layer is filled;
Respectively the target figure layer after filling and Background From Layer coding are each generated code stream.
Wherein, in described header, record has the positional information of the target in target figure layer.
Wherein, when extracting target from images figure layer failure to be encoded, in described header, the positional information of the target in target figure layer is recorded as sky.
Wherein, it is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.
Second aspect uses a kind of video image compression device, including:
Figure layer extraction unit, for going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;
Figure layer coding unit, each generates code stream for encoding target figure layer and Background From Layer respectively;
Bit stream complex unit, for being combined the code stream of target figure layer and the code stream of Background From Layer.
Wherein, described bit stream complex unit, specifically for:
Header is added, the code stream of record object figure layer and the code stream of Background From Layer after header for code stream corresponding to image to be encoded.
Wherein, described figure layer coding unit, including:
First packing module, for filling the region fixed value outside target in described target figure layer;
Second packing module, for filling the region fixed value at target place in described target figure layer corresponding in described Background From Layer;
Figure layer coding module, for each generating code stream to the target figure layer after filling and Background From Layer coding respectively.
Wherein, in described header, record has the positional information of the target in target figure layer.
Wherein, when extracting target from images figure layer failure to be encoded, in described header, the positional information of the target in target figure layer is recorded as sky.
Wherein, it is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.
The third aspect uses a kind of video image read method, including:
Obtaining video code flow, described video code flow is formed by the code stream of target figure layer and the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is the interest region part in image;
Confirm the target video frame at decoding target place;
Start associated video code stream decoding from described target video frame.
Wherein, described video code flow is added with header, and described header record has the positional information of the target in target figure layer;
Start associated video code stream decoding from described target video frame, particularly as follows:
From described target video frame, decode interest region part, described interest region part is compound to described Background From Layer according to described positional information.
Wherein, it is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.
Fourth aspect uses a kind of video image reading device, including:
Code stream acquiring unit, is used for obtaining video code flow, and described video code flow is formed by the code stream of target figure layer and the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is the interest region part in image;
Goal verification unit, for confirming to decode the target video frame at target place;
Code stream decoding unit, for starting associated video code stream decoding from described target video frame.
Wherein, described video code flow is added with header, and described header record has the positional information of the target in target figure layer;
Described code stream decoding unit, specifically for:
From described target video frame, decode interest region part, described interest region part is compound to described Background From Layer according to described positional information.
Wherein, it is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.
5th aspect uses a kind of video image processing system, including the video image compression device described in above-mentioned any one and the video image reading device described in above-mentioned any one.
The beneficial effects of the present invention is: by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, the directly retrieval image containing destination object, improves the utilization rate calculating resource.
Accompanying drawing explanation
Fig. 1 is the schematic flow sheet of block-based hybrid encoding frame coding in prior art;
Fig. 2 is the schematic flow sheet of block-based hybrid encoding frame decoding in prior art;
Fig. 3 is the scanning sequency schematic diagram of macro block in block-based hybrid coding in prior art;
Fig. 4 is the schematic diagram of inter-reference relation in block-based hybrid coding in prior art;
Fig. 5 is the schematic diagram of the referring-to relation of P frame multi-frame-reference in block-based hybrid coding in prior art;
Fig. 6 is the schematic diagram of the referring-to relation of B frame multi-frame-reference in block-based hybrid coding in prior art;
Fig. 7 is the schematic diagram of the area-of-interest in prior art in image;
Fig. 8 is the method flow diagram of the first embodiment of the method for a kind of video image compression provided in the specific embodiment of the invention;
Fig. 9 is the method flow diagram of the second embodiment of the method for a kind of video image compression provided in the specific embodiment of the invention;
Figure 10 be in the specific embodiment of the invention provide a kind of video image compression method the second embodiment in image layer and the schematic diagram of background layer;
Figure 11 be the method for a kind of video image compression provided in the specific embodiment of the invention the second embodiment in the schematic diagram of organizational form of code stream;
Figure 12 is the structural representation of the first embodiment of the device of a kind of video image compression provided in the specific embodiment of the invention
Figure 13 is the block diagram of the second embodiment of the device of a kind of video image compression provided in the specific embodiment of the invention.
Figure 14 is the method flow diagram of the first embodiment of the method that a kind of video image provided in the specific embodiment of the invention reads.
Figure 15 is the block diagram of the first embodiment of the device that a kind of video image provided in the specific embodiment of the invention reads.
Figure 16 is the block diagram of the first embodiment of the system of a kind of Computer Vision provided in the specific embodiment of the invention.
Detailed description of the invention
For making the object, technical solutions and advantages of the present invention of greater clarity, below in conjunction with detailed description of the invention and referring to the drawings, the present invention is described in more detail.It should be understood that these describe the most exemplary, and it is not intended to limit the scope of the present invention.Additionally, in the following description, the description to known features and technology is eliminated, to avoid unnecessarily obscuring idea of the invention.
Refer to Fig. 8, it is the method flow diagram of the first embodiment of the method for a kind of video image compression of offer in the specific embodiment of the invention.Method in the present embodiment, is mainly used in the storage of various video, particularly monitor video.As it can be seen, the method, including:
Step S101: go out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded.
In this programme, for being particularly directed to monitor video, specific target in picture, position and the regions such as such as people, car, gateway, owing to monitor video point position is many, the time is long, it is generally desirable to these regions can quickly position, it is not necessary to whole video is carried out complete observation.So image to be encoded to be divided into Background From Layer and target figure layer, the target area in video being assigned in target figure layer, when checking video, directly target figure layer being retrieved, it is achieved the quick-searching to the target needing retrieval, improve operation efficiency.
Step S102: target figure layer and Background From Layer are encoded each generate code stream respectively.
Can decode the most respectively during for making and decoding, when coding, target figure layer and Background From Layer are separately encoded the respective code stream of formation, specifically can complete coding for the coding standard used.
Step S103: the code stream of target figure layer and the code stream of Background From Layer are combined.
The code stream of target figure layer and the code stream of Background From Layer are combined, and compared with the scheme of prior art, the code stream after being combined can position more accurately, directly accesses the image at the target place determined.Decoding efficiency improves.
In sum, by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, directly the retrieval image containing destination object, improve the utilization rate calculating resource.
Refer to Fig. 9, it is the method flow diagram of the second embodiment of the method for a kind of video image compression of offer in invention detailed description of the invention, as it can be seen, the method includes:
Step S201: go out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded.
The extraction of Background From Layer and target figure layer is realized by image recognition or graphical analysis, selectes also by the scope that picture pick-up device is provided with target figure layer.Concrete technical scheme has realized, and is not described further at this.
Step S202: the region fixed value outside target in described target figure layer is filled.
Step S203: the region fixed value at target place in described target figure layer corresponding in described Background From Layer is filled.
So that the original position that during decoding, target figure layer is in image, region fixed value outside target in destination layer is filled, the region at target place in described target figure layer corresponding in Background From Layer is also filled by fixed value, when coding, target figure layer and Background From Layer possess identical image size and resolution, and follow-up composition operation is more accurate.As shown in Figure 10, two extract two figure layers from image to concrete filling mode, are filled with, in respective figure layer, the position that another figure layer is corresponding, are equivalent to obtain two sub image frame with identical resolution, then are separately encoded two figure layers.
Step S204: respectively the target figure layer after filling and Background From Layer coding are each generated code stream.
When encoding target figure layer and Background From Layer, still according to the scan mode from left to right, under upper, encoding each piecemeal, only running into filling part timesharing, directly skipping and need not process, the bit stream complex that each figure layer produces is together.
Step S205: add header for code stream corresponding to image to be encoded, the code stream of record object figure layer and the code stream of Background From Layer after header.
In described header, record has the positional information of the target in target figure layer.It is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header;Specifically separating identifier can be to identify by dividing initial code, and the code stream of the most each figure layer is both provided with initial code mark, in order to during decoding, the original position to code stream makes a distinction.The organizational form of concrete code stream is as shown in figure 11, header was added before video flowing, target in record object figure layer position, when video is retrieved, directly it is accurately positioned by header, improve the treatment effeciency of data, the position relationship of the code stream of concrete target figure layer and the code stream of Background From Layer is not restricted, figure layer 1 code stream in Figure 11 and figure layer 2 code stream each corresponding.
In sum, by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, directly the retrieval image containing destination object, improve the utilization rate calculating resource.It is also provided with header simultaneously, the code stream of record object figure layer and the code stream of Background From Layer after header, the positional information of the target in record object figure layer in header, between code stream and the code stream of Background From Layer of target figure layer, insert separation identifier the two is made a distinction, it is achieved that storage and the quick-searching in order of two code streams.
The embodiment of the device of a kind of video image compression provided in the specific embodiment of the invention is provided, the embodiment of the embodiment of the device of video image compression method based on above-mentioned video image compression realizes, elaboration the most most in the embodiment of the device of several video image compression, refer to the embodiment of the method for above-mentioned video image compression.
Refer to Figure 12, it is the block diagram of the first embodiment of the device of a kind of video image compression of offer in the specific embodiment of the invention, as it can be seen, this device, including:
Figure layer extraction unit 310, for going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;
By image to be encoded being divided into Background From Layer and target figure layer, the target area in video being assigned in target figure layer, when checking video, directly target figure layer being retrieved, it is achieved the quick-searching to the target needing retrieval, improve operation efficiency.
Figure layer coding unit 320, each generates code stream for encoding target figure layer and Background From Layer respectively;
Bit stream complex unit 330, for being combined the code stream of target figure layer and the code stream of Background From Layer.
In sum, the collaborative work of above-mentioned each unit, by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, directly the retrieval image containing destination object, improve the utilization rate calculating resource.
Refer to Figure 13, it is the block diagram of the second embodiment of the device of a kind of video image compression of offer in the specific embodiment of the invention, as it can be seen, this device, including:
Figure layer extraction unit 310, for going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;
Figure layer coding unit 320, each generates code stream for encoding target figure layer and Background From Layer respectively;
Bit stream complex unit 330, for being combined the code stream of target figure layer and the code stream of Background From Layer.
Wherein, described bit stream complex unit 330, specifically for:
Header is added, the code stream of record object figure layer and the code stream of Background From Layer after header for code stream corresponding to image to be encoded.
Wherein, described figure layer coding unit 320, including:
First packing module 321, for filling the region fixed value outside target in described target figure layer;
Second packing module 322, for filling the region fixed value at target place in described target figure layer corresponding in described Background From Layer;
Figure layer coding module 323, for each generating code stream to the target figure layer after filling and Background From Layer coding respectively.
When encoding target figure layer and Background From Layer, still according to the scan mode from left to right, under upper, encoding each piecemeal, only running into filling part timesharing, directly skipping and need not process, the bit stream complex that each figure layer produces is together.
Wherein, in described header, record has the positional information of the target in target figure layer.
Wherein, when extracting target from images figure layer failure to be encoded, in described header, the positional information of the target in target figure layer is recorded as sky.
Wherein, being inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header, specifically, the code stream of each figure layer is both provided with initial code mark, in order to during decoding, the original position to code stream makes a distinction..
In sum, the cooperative cooperating of above-mentioned functions module, by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, directly the retrieval image containing destination object, improve the utilization rate calculating resource.It is also provided with header simultaneously, the code stream of record object figure layer and the code stream of Background From Layer after header, the positional information of the target in record object figure layer in header, between code stream and the code stream of Background From Layer of target figure layer, insert separation identifier the two is made a distinction, it is achieved that storage and the quick-searching in order of two code streams.
The embodiment of the method that a kind of video image that the following is provides in the specific embodiment of the invention reads, the scheme in the present embodiment is for being read out the video code flow obtained in previous embodiment.Refer to Figure 14, it is the method flow diagram of the first embodiment of the method that a kind of video image of offer reads in the specific embodiment of the invention, as it can be seen, the method includes:
Step S401: obtaining video code flow, described video code flow is formed by the code stream of target figure layer and the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is the interest region part in image.
Video code flow is to have the code stream of target figure layer and the bit stream complex of Background From Layer to form, when reading, targetedly to wanting the target information read to position.
Step S402: confirm that in header, record has the frame of the positional information of decoding target to be target video frame.
Described video code flow is added with header, and described header record has the positional information of the target in target figure layer.Can directly be conducted interviews by the position of header record.
Confirm that the target video frame at decoding target place also can realize according to other scheme, such as, be not provided with header, directly video accessed frame by frame.
Step S403: decode interest region part from described target video frame, is compound to described Background From Layer by described interest region part according to described positional information.
It is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.When video image is conducted interviews, directly according to the boundary of separation identifier, target figure layer place code stream is conducted interviews.
In sum, by the reading of the video code flow to composition generation, it is achieved the quick access to target figure layer, improve the efficiency of decoding, reduce the complexity of computing.
The embodiment of the device that a kind of video image that the following is provides in the specific embodiment of the invention reads, the embodiment of the method that the embodiment of the device that video image reads reads based on above-mentioned video image realizes, elaboration the most most in the embodiment of the device read at several video images, refer to the embodiment of the method that above-mentioned video image reads.
Refer to Figure 15, it is the block diagram of the device that a kind of video image provided in the specific embodiment of the invention reads, as it can be seen, this device includes:
Code stream acquiring unit 510, is used for obtaining video code flow, and described video code flow is formed by the code stream of target figure layer and the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is the interest region part in image;
Goal verification unit 520, for confirming to decode the target video frame at target place;
Code stream decoding unit 530, for starting associated video code stream decoding from described target video frame.
Wherein, described video code flow is added with header, and described header record has the positional information of the target in target figure layer;
Described code stream decoding unit 530, specifically for:
From described target video frame, decode interest region part, described interest region part is compound to described Background From Layer according to described positional information.
Wherein, it is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after described header.
In sum, the collaborative work of above-mentioned each unit, by the reading of the video code flow to composition generation, it is achieved the quick access to target figure layer, improves the efficiency of decoding, reduces the complexity of computing.
Additionally providing the embodiment of a kind of video image processing system in the last specific embodiment of the invention, video image processing system comprises device 30 and device 50 two parts of video image reading of above-mentioned video image compression.The most as shown in figure 16, the device 30 of video image compression includes:
Figure layer extraction unit 310, for going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is the interest region part in image to be encoded;
Figure layer coding unit 320, each generates code stream for encoding target figure layer and Background From Layer respectively;
Bit stream complex unit 330, for being combined the code stream of target figure layer and the code stream of Background From Layer.
The device 50 that video image reads includes:
Code stream acquiring unit 510, is used for obtaining video code flow, and described video code flow is formed by the code stream of target figure layer and the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is the interest region part in image;
Goal verification unit 520, for confirming to decode the target video frame at target place;
Code stream decoding unit 530, for starting associated video code stream decoding from described target video frame.
In sum, the collaborative work of above-mentioned each unit, by going out Background From Layer and target figure layer from image zooming-out to be encoded, carry out respectively encoding each generating code stream by Background From Layer and target figure layer, again by bit stream complex, during decoding, composite bit stream is decoded, directly the retrieval image containing destination object, improve the utilization rate calculating resource.Reading by the video code flow to composition generation, it is achieved the quick access to target figure layer, improves the efficiency of decoding, reduces the complexity of computing.
It should be appreciated that the above-mentioned detailed description of the invention of the present invention is used only for exemplary illustration or explains the principle of the present invention, and it is not construed as limiting the invention.Therefore, any modification, equivalent substitution and improvement etc. done in the case of without departing from the spirit and scope of the present invention, should be included within the scope of the present invention.Additionally, claims of the present invention be intended to fall in the equivalents on scope and border or this scope and border whole change and modifications example.
Although embodiments of the present invention are described in detail it should be understood that, in the case of without departing from the spirit and scope of the present invention, embodiments of the present invention can be made various change, replace and change.

Claims (19)

1. a video image compressing method, it is characterised in that including:
Going out Background From Layer and target figure layer from image zooming-out to be encoded, the target in described target figure layer is Interest region part in image to be encoded;
Respectively target figure layer and Background From Layer are encoded and each generate code stream;
The code stream of target figure layer and the code stream of Background From Layer are combined.
A kind of video image compressing method the most according to claim 1, it is characterised in that described The code stream of target figure layer and the code stream of Background From Layer are combined, particularly as follows:
The code stream corresponding for image to be encoded adds header, the code of record object figure layer after header Stream and the code stream of Background From Layer.
A kind of video image compressing method the most according to claim 1, it is characterised in that described Respectively target figure layer and Background From Layer are encoded and each generate code stream, including:
Region fixed value outside target in described target figure layer is filled;
The region fixed value at target place in described target figure layer corresponding in described Background From Layer is filled out Fill;
Respectively the target figure layer after filling and Background From Layer coding are each generated code stream.
A kind of video image compressing method the most according to claim 2, it is characterised in that described In header, record has the positional information of the target in target figure layer.
A kind of video image compressing method the most according to claim 4, it is characterised in that when treating When coded image extracts target figure layer failure, the position of the target in target figure layer in described header Information is recorded as sky.
A kind of video image compressing method the most according to claim 2, it is characterised in that described It is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after header.
7. a video image compression device, it is characterised in that including:
Figure layer extraction unit, for going out Background From Layer and target figure layer from image zooming-out to be encoded, described Target in target figure layer is the interest region part in image to be encoded;
Figure layer coding unit, each generates code stream for encoding target figure layer and Background From Layer respectively;
Bit stream complex unit, for being combined the code stream of target figure layer and the code stream of Background From Layer.
A kind of video image compression device the most according to claim 7, it is characterised in that described Bit stream complex unit, specifically for:
The code stream corresponding for image to be encoded adds header, the code of record object figure layer after header Stream and the code stream of Background From Layer.
A kind of video image compression device the most according to claim 7, it is characterised in that described Figure layer coding unit, including:
First packing module, for filling out the region fixed value outside target in described target figure layer Fill;
Second packing module, for by target place in described target figure layer corresponding in described Background From Layer Region fixed value fill;
Figure layer coding module, for each spontaneous to the target figure layer after filling and Background From Layer coding respectively Become code stream.
A kind of video image compression device the most according to claim 8, it is characterised in that described In header, record has the positional information of the target in target figure layer.
11. a kind of video image compressing methods according to claim 10, it is characterised in that when During extracting target from images figure layer failure to be encoded, the position of the target in target figure layer in described header Confidence breath is recorded as sky.
12. a kind of video image compressing methods according to claim 8, it is characterised in that described It is inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after header.
13. 1 kinds of video image read methods, it is characterised in that including:
Obtaining video code flow, described video code flow is multiple by the code stream of target figure layer and the code stream of Background From Layer Conjunction forms;Wherein, the target in described target figure layer is the interest region part in image;
Confirm the target video frame at decoding target place;
Start associated video code stream decoding from described target video frame.
14. a kind of video image read methods according to claim 13, it is characterised in that institute Stating video code flow and be added with header, described header record has the position of the target in target figure layer to believe Breath;
Start associated video code stream decoding from described target video frame, particularly as follows:
From described target video frame, decode interest region part, by described interest region part according to Described positional information is compound to described Background From Layer.
15. a kind of video image read methods according to claim 13, it is characterised in that institute State and be inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after header.
16. 1 kinds of video image reading devices, it is characterised in that including:
Code stream acquiring unit, is used for obtaining video code flow, and described video code flow is by the code stream of target figure layer Form with the bit stream complex of Background From Layer;Wherein, the target in described target figure layer is emerging in image Interest region part;
Goal verification unit, for confirming to decode the target video frame at target place;
Code stream decoding unit, for starting associated video code stream decoding from described target video frame.
17. a kind of video image reading devices according to claim 16, it is characterised in that institute Stating video code flow and be added with header, described header record has the position of the target in target figure layer Letter;
Described code stream decoding unit, specifically for:
From described target video frame, decode interest region part, by described interest region part according to Described positional information is compound to described Background From Layer.
18. a kind of video image reading devices according to claim 16, it is characterised in that institute State and be inserted with separation identifier between code stream and the code stream of Background From Layer of the target figure layer after header.
19. 1 kinds of video image processing systems, it is characterised in that include that claim 7~12 is any one Video image compression device described in Xiang and the video image described in claim 16~18 any one are read Fetching is put.
CN201510164120.XA 2015-04-08 2015-04-08 A kind of video image compression and the method for reading, Apparatus and system Pending CN106162190A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510164120.XA CN106162190A (en) 2015-04-08 2015-04-08 A kind of video image compression and the method for reading, Apparatus and system
PCT/CN2015/077729 WO2016161674A1 (en) 2015-04-08 2015-04-28 Method, device, and system for video image compression and reading

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510164120.XA CN106162190A (en) 2015-04-08 2015-04-08 A kind of video image compression and the method for reading, Apparatus and system

Publications (1)

Publication Number Publication Date
CN106162190A true CN106162190A (en) 2016-11-23

Family

ID=57071735

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510164120.XA Pending CN106162190A (en) 2015-04-08 2015-04-08 A kind of video image compression and the method for reading, Apparatus and system

Country Status (2)

Country Link
CN (1) CN106162190A (en)
WO (1) WO2016161674A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108668130A (en) * 2017-03-31 2018-10-16 晨星半导体股份有限公司 The method for recombinating image file
CN108924557A (en) * 2018-06-11 2018-11-30 青岛海信电器股份有限公司 Image processing method, device, electronic equipment and storage medium
CN110460855A (en) * 2019-07-22 2019-11-15 西安万像电子科技有限公司 Image processing method and system
CN113012657A (en) * 2019-12-19 2021-06-22 北京嗨动视觉科技有限公司 Layer processing method and device, video processing equipment and computer readable storage medium
CN113660495A (en) * 2021-08-11 2021-11-16 易谷网络科技股份有限公司 Real-time video stream compression method and device, electronic equipment and storage medium
US11223662B2 (en) 2017-08-29 2022-01-11 Line Corporation Method, system, and non-transitory computer readable record medium for enhancing video quality of video call

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120269267A1 (en) * 2011-04-19 2012-10-25 Samsung Electronics Co., Ltd. Method and apparatus for unified scalable video encoding for multi-view video and method and apparatus for unified scalable video decoding for multi-view video
CN103402087A (en) * 2013-07-23 2013-11-20 北京大学 Video encoding and decoding method based on gradable bit streams

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009049979A (en) * 2007-07-20 2009-03-05 Fujifilm Corp Image processing device, image processing method, image processing system, and program
EP2870768A4 (en) * 2012-07-04 2016-03-09 Intel Corp A region of interest based framework for 3d video coding
US9013536B2 (en) * 2013-03-13 2015-04-21 Futurewei Technologies, Inc. Augmented video calls on mobile devices

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120269267A1 (en) * 2011-04-19 2012-10-25 Samsung Electronics Co., Ltd. Method and apparatus for unified scalable video encoding for multi-view video and method and apparatus for unified scalable video decoding for multi-view video
CN103402087A (en) * 2013-07-23 2013-11-20 北京大学 Video encoding and decoding method based on gradable bit streams

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108668130A (en) * 2017-03-31 2018-10-16 晨星半导体股份有限公司 The method for recombinating image file
US11223662B2 (en) 2017-08-29 2022-01-11 Line Corporation Method, system, and non-transitory computer readable record medium for enhancing video quality of video call
CN108924557A (en) * 2018-06-11 2018-11-30 青岛海信电器股份有限公司 Image processing method, device, electronic equipment and storage medium
CN110460855A (en) * 2019-07-22 2019-11-15 西安万像电子科技有限公司 Image processing method and system
CN113012657A (en) * 2019-12-19 2021-06-22 北京嗨动视觉科技有限公司 Layer processing method and device, video processing equipment and computer readable storage medium
CN113660495A (en) * 2021-08-11 2021-11-16 易谷网络科技股份有限公司 Real-time video stream compression method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
WO2016161674A1 (en) 2016-10-13

Similar Documents

Publication Publication Date Title
CN112823521B (en) Image encoding method using history-based motion information and apparatus therefor
CN106162190A (en) A kind of video image compression and the method for reading, Apparatus and system
CN102388616B (en) Image signal decoding device, image signal decoding method, image signal encoding device, and image signal encoding method
CN104704835B (en) The apparatus and method of movable information management in Video coding
CN102396228B (en) Image processing equipment and method
CN106063271B (en) For executing cross-layer video coding/decoding method and its equipment and the cross-layer video coding method and its equipment for executing the prediction based on sub-block of the prediction based on sub-block
CN101540926B (en) Stereo video coding-decoding method based on H.264
CN110944185B (en) Video decoding method and device, computer equipment and storage medium
CN105075269B (en) Picture decoding apparatus, picture coding device and computer-readable recording medium
CN104641652A (en) Indication of frame-packed stereoscopic 3d video data for video coding
CN105580365A (en) Sub-prediction unit (pu) based temporal motion vector prediction in hevc and sub-pu design in 3d-hevc
CN105580364A (en) Sub-prediction unit (PU) based temporal motion vector prediction in HEVC and sub-PU design in 3D-HEVC
US11743475B2 (en) Advanced video coding method, system, apparatus, and storage medium
CN106464917A (en) Signaling hrd parameters for bitstream partitions
CN101053258A (en) Method and device for processing coded video data
US9503751B2 (en) Method and apparatus for simplified depth coding with extended prediction modes
CN103329535B (en) Video encoder, video decoding apparatus, method for video coding, video encoding/decoding method and program
CN105144713A (en) Method for encoding video for decoder setting and device therefor, and method for decoding video on basis of decoder setting and device therefor
US20130039412A1 (en) Predictive coding with block shapes derived from a prediction error
CN106162191A (en) A kind of method for video coding based on target and system
CN105144716A (en) Inter-layer picture signaling and related processes
US11902505B2 (en) Video decoding device and video decoding method
CN105379272A (en) Signaling of Bit Rate Information and Picture Rate Information in VPS
CN105765977A (en) POC value design for multi-layer video coding
CN102742274A (en) Image processing device and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20161123