CN109451318A - Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium - Google Patents
Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN109451318A CN109451318A CN201910022693.7A CN201910022693A CN109451318A CN 109451318 A CN109451318 A CN 109451318A CN 201910022693 A CN201910022693 A CN 201910022693A CN 109451318 A CN109451318 A CN 109451318A
- Authority
- CN
- China
- Prior art keywords
- block
- pixels
- image
- adapter
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
The present invention provides a kind of convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium, includes the block of pixels that the first image and the second image are divided into default ranks number by VR video image after getting VR video image to be encoded;Then for each of the first image block of pixels, determine that a block of pixels as adapter block, calculates the block of pixels and arrives the distance between adapter block in the second image, wherein the similarity difference value between the block of pixels and adapter block is minimum;Distance is then based on to be calculated and the depth information apart from corresponding block of pixels;Before being encoded, according to depth information, each block of pixels is grouped, so that VR video image is at least divided into two groups, so in subsequent progress VR Video coding, compared with the video image of low quality group, relatively more code rates can be distributed to the video image of high quality group, so as to improve compression efficiency, bandwidth is saved.
Description
Technical field
The present invention relates to field of video encoding, in particular to a kind of method, apparatus convenient for VR Video coding, electricity
Sub- equipment and storage medium.
Background technique
In current Video Coding Scheme, mainly have following three according to the algorithm that video content does adaptive layered coding
Class: image is done and divides, analysis of complexity is carried out to encoding block, picture material is done and divides or identifies.The master of above-mentioned algorithm
Wanting problem is that calculation amount is generally excessive, and the segmentation and identification to image do not have real-time usually, so that very strong in real-time
Above-mentioned algorithm can not be applied in net cast application.
The sports tournament of panorama VR is broadcast live, and the resolution ratio and frame rate index request to video are all very high.If directly adopted
With common coding method, video stream bit rate is excessively high, can make the difficulty of network direct broadcasting greatly.
Summary of the invention
In view of this, the embodiment of the present invention is designed to provide a kind of method, apparatus convenient for VR Video coding, electronics
Equipment and storage medium, to alleviate the above problem.
In a first aspect, the embodiment of the invention provides a kind of methods convenient for VR Video coding, which comprises obtain
VR video image to be encoded, the VR video image include the first image and the second image;By the first image and
Second image is divided into the block of pixels of default ranks number;For each of the first image block of pixels,
A block of pixels is determined in second image as adapter block, calculates the block of pixels the distance between to the adapter block,
Wherein, the similarity difference value between the block of pixels and the adapter block is minimum;Based on the distance be calculated with it is described away from
Depth information from corresponding block of pixels;Before being encoded, according to the depth information, each block of pixels is divided
Group, so that the VR video image is at least divided into two groups.
Second aspect, the embodiment of the invention provides a kind of devices convenient for VR Video coding, module are obtained, for obtaining
VR video image to be encoded, the VR video image include the first image and the second image;Division module, being used for will be described
First image and second image are divided into the block of pixels of default ranks number;Computing module, for being directed to described first
Each of image block of pixels determines that a block of pixels as adapter block, calculates the pixel in second image
Block is the distance between to the adapter block, wherein the similarity difference value between the block of pixels and the adapter block is minimum;It is described
Computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels;Grouping module,
For according to the depth information, each block of pixels being grouped before being encoded, so that the VR video image
At least it is divided into two groups.
The third aspect, the embodiment of the present invention provide a kind of electronic equipment, including memory interconnected, processor, institute
It states and stores computer program in memory, when the computer program is executed by the processor, so that the electronic equipment
Execute method described in first aspect.
Fourth aspect, the embodiment of the present invention provide a kind of computer readable storage medium, the computer-readable storage medium
Computer program is stored in matter, when the computer program is run on computers, so that the computer executes first
Method described in aspect.
Compared with prior art, the method, apparatus convenient for VR Video coding of various embodiments of the present invention proposition, electronic equipment
And the beneficial effect of storage medium is: including first the first figure by VR video image after getting VR video image to be encoded
Picture and the second image are divided into the block of pixels of default ranks number;Then for each of the first image pixel
Block determines a block of pixels as adapter block in second image, calculates the block of pixels between the adapter block
Distance, wherein the similarity difference value between the block of pixels and the adapter block is minimum;The distance is then based on to be calculated
With the depth information apart from corresponding block of pixels;Before being encoded, according to the depth information, by each pixel
Block is grouped, so that the VR video image is at least divided into two groups, then in subsequent progress VR Video coding, with low-quality
The video content of amount group is compared, and distributes relatively more code rates to the video content of high quality group, so as to improve compression effect
Rate saves bandwidth.
To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate
Appended attached drawing, is described in detail below.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached
Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair
The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this
A little attached drawings obtain other relevant attached drawings.
Fig. 1 is the structural block diagram of electronic equipment provided in an embodiment of the present invention;
Fig. 2 is one of the flow chart of method convenient for VR Video coding that first embodiment of the invention provides;
Fig. 3 is schematic diagram of the block of pixels that provides of first embodiment of the invention to the distance between adapter block;
Fig. 4 is the schematic diagram for the field block of pixels that first embodiment of the invention provides;
Fig. 5 is the determination schematic diagram for the adapter block that first embodiment of the invention provides;
Fig. 6 is the calculating schematic diagram of constant a, b that first embodiment of the invention provides;
Fig. 7 is the structural block diagram for the device convenient for VR Video coding that second embodiment of the invention provides.
Specific embodiment
Below in conjunction with attached drawing in the embodiment of the present invention, technical solution in the embodiment of the present invention carries out clear, complete
Ground description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Usually exist
The component of the embodiment of the present invention described and illustrated in attached drawing can be arranged and be designed with a variety of different configurations herein.Cause
This, is not intended to limit claimed invention to the detailed description of the embodiment of the present invention provided in the accompanying drawings below
Range, but it is merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art are not doing
Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.
It should also be noted that similar label and letter indicate similar terms in following attached drawing, therefore, once a certain Xiang Yi
It is defined in a attached drawing, does not then need that it is further defined and explained in subsequent attached drawing.Meanwhile of the invention
In description, term " first ", " second " etc. are only used for distinguishing description, are not understood to indicate or imply relative importance.
As shown in Figure 1, being the block diagram of electronic equipment 100.The electronic equipment 100 may include: convenient for VR view
Device, the memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound of frequency coding
Frequency unit 160, display unit 170.Wherein, the electronic equipment 100 can be user terminal, such as PC
(personal computer, PC), tablet computer, smart phone, personal digital assistant (personal digital
Assistant, PDA) etc., it is also possible to server.
The memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound
Frequency unit 160 and each element of display unit 170 are directly or indirectly electrically connected between each other, with realize data transmission or
Interaction.It is electrically connected for example, these elements can be realized between each other by one or more communication bus or signal wire.It is described just
It include that at least one can be stored in the memory in the form of software or firmware (firmware) in the device of VR Video coding
In 110 or the software function module that is solidificated in the operating system (operating system, OS) of electronic equipment.The processing
Device 130 for executing the executable module stored in memory 110, such as the device convenient for VR Video coding include it is soft
Part functional module or computer program.
Wherein, memory 110 may be, but not limited to, random access memory (Random Access Memory,
RAM), read-only memory (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only
Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM),
Electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc..
Wherein, memory 110 is for storing program, and the processor 130 executes described program after receiving and executing instruction, aforementioned
The method for the flow definition that any embodiment of the embodiment of the present invention discloses can be applied in processor 130, or by processor
130 realize.
Processor 130 may be a kind of IC chip, the processing capacity with signal.Above-mentioned processor 130 can
To be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit
(Network Processor, abbreviation NP) etc.;Can also be digital signal processor (DSP), specific integrated circuit (ASIC),
Field programmable gate array (FPGA) either other programmable logic device, discrete gate or transistor logic, discrete hard
Part component.It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present invention.General processor
It can be microprocessor or the processor be also possible to any conventional processor etc..
Various input/output devices are couple processor 130 and memory 110 by the Peripheral Interface 140.Some
In embodiment, Peripheral Interface 140, processor 130 and storage control 120 can be realized in one single chip.Other one
In a little examples, they can be realized by independent chip respectively.
Input-output unit 150 is used to be supplied to the interaction that user input data realizes user and electronic equipment 100.It is described
Input-output unit 150 may be, but not limited to, mouse and keyboard etc..
Audio unit 160 provides a user audio interface, may include one or more microphones, one or more raises
Sound device and voicefrequency circuit.
Display unit 170 provides an interactive interface (such as user interface) between electronic equipment 100 and user
Or it is referred to for display image data to user.In the present embodiment, the display unit 170 can be liquid crystal display or touching
Control display.It can be the touching of the capacitance type touch control screen or resistance-type of support single-point and multi-point touch operation if touch control display
Control screen etc..Single-point and multi-point touch operation is supported to refer to that touch control display can sense on the touch control display one or more
The touch control operation generated simultaneously at a position, and the touch control operation that this is sensed transfers to processor 130 to be calculated and handled.
First embodiment
Referring to figure 2., Fig. 2 is a kind of process for method convenient for VR Video coding that first embodiment of the invention provides
Figure, the method are applied to electronic equipment.Process shown in Fig. 2 will be described in detail below, which comprises
Step S110: obtaining VR video image to be encoded, and the VR video image includes the first image and the second figure
Picture.
Since VR video image has left and right both of which, the VR video image that electronic equipment 100 is got can wrap
Include left image and right image.Wherein, there are visual angle differences between left image and right image.
Optionally, the first image can be left image, correspondingly, second image is right image;Optionally, institute
Stating the first image can be right image, correspondingly, second image is left image.
Step S120: the first image and second image are divided into the block of pixels of default ranks number.
Wherein, the default ranks number can arrange for 8 rows 8, i.e., the first image and second image are drawn
It is divided into 8 × 8 block of pixels, the size of each block of pixels is identical.It certainly, as an alternative embodiment, can also be with
Use the piecemeal principle of bigger (such as 9 × 9) or smaller (such as 6 × 6).
Certainly, as an alternative embodiment, being divided by the first image and second image
It, can also be by the first image and second image point in order to improve computational efficiency before the block of pixels of preset quantity
Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image
Second gradient image.Wherein, image binaryzation processing is exactly to set 0 or 255 for the gray value of the pixel on image, also
It is that whole image is showed to apparent black and white effect, wherein 0 indicates white, and 255 indicate black.By 256 brightness degrees
Gray level image is obtained by selection threshold value appropriate still can reflect the whole binary image with local feature of image.Example
Such as, the gray value that gray value is greater than or equal to the pixel of threshold value resets to 255, and gray value is less than to the pixel of preset threshold
The gray value of point resets to 0.
The binaryzation of image is conducive to being further processed for image, becomes image simply, and data volume reduces, can not only be convex
The profile of interested target is showed, and computation complexity can be reduced.
Wherein it is possible to be carried out at binaryzation by SOBEL or CANNEY algorithm to the first image and second image
Reason, to obtain and the corresponding first gradient image of the first image and the second gradient map corresponding with second image
Picture.
Step S130: for each of the first image block of pixels, one is determined in second image
A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block
Similarity difference value between block is minimum.
Since the relationship between the first image and the second image is to belong to the same VR video image, the first image
Diversity factor is mainly as caused by the difference of visual angle between the second image.In this case, for every in the first image
A block of pixels S can find an adapter block D corresponding with block of pixels S in the second image.Wherein, by block of pixels S
When carrying out the calculating of similarity difference value with each block of pixels in the second image, adapter block D and block of pixels S as block of pixels S
Between similarity difference value it is minimum.Make the SAD minimum when searching a block of pixels D in second image
When, wherein SAD be the similarity difference value, determine block of pixels D for institute corresponding to the block of pixels S in the first image
State adapter block, wherein S can be referred to as source pixel block, and D can be referred to as to be purpose block of pixels.
When default ranks number be 8 × 8, and carry out similarity difference value calculate when, optionally, for the first image
Each of the block of pixels, using the block of pixels as source pixel block S, the dimension of the corresponding pixel block matrix of each block of pixels is
8 × 8, wherein the element value in the pixel block matrix is the pixel value of the pixel in the block of pixels, can be based on formulaCalculate the similarity difference between each block of pixels in the block of pixels and second image
Value, wherein SAD is the similarity difference value, SijFor the pixel value for the pixel that the i-th row jth in the block of pixels arranges, dijFor
The pixel value for the pixel that the i-th row jth arranges in some block of pixels in second image.Wherein, i, j are pictures in pixel block matrix
The subscript of prime element.
Corresponding adapter block is being found for each of the first image block of pixels incorporated by reference to Fig. 3
Afterwards, the block of pixels (i.e. source pixel block S) and corresponding adapter block (i.e. purpose picture can be calculated by window search
Plain block D) between moving displacement, i.e. distance mv.
In order to calculate mv, therefore, either source pixel block or purpose block of pixels, coordinate (x, y) are based respectively on pixel
Block respectively where the upper left corner of 1/2 image be that origin determines, wherein the coordinate and mesh of the top left corner pixel point of source pixel block
The difference of coordinate of block of pixels top left corner pixel point be exactly motion vector mv.Since the first image and the second image are left and right figures
Picture, therefore, the water between the coordinate of the top left corner pixel point of the coordinate and purpose block of pixels of the top left corner pixel point of source pixel block
Flat distance is motion vector mv.
Assuming that the coordinate of source pixel block is (x=8, y=6), S can be expressed as(8,16), the coordinate of purpose block of pixels is (x
=25, y=16), D can be expressed as(25,16), then the numerical value of mv is exactly 25-8=17;It certainly is also likely to be negative, such as mesh
The coordinate of block of pixels be (x=-2, y=16), then motion vector mv is exactly -10.
As another optional embodiment, it for the source pixel block S in the first image, is searched in second image
It, therefore, will be with the source image in order to determine the adapter block of source pixel block S when rope makes the SAD minimum to multiple block of pixels
Plain block S adjacent multiple block of pixels are determined as field block of pixels, wherein please refer to Fig. 4, field block of pixels can be the first image
In with 8 block of pixels at the center S.
It is included every in the available field block of pixels of electronic equipment 100 after field block of pixels has been determined
The band of position of the corresponding adapter block of a block of pixels, and count the position point of the adapter block of each block of pixels in the block of pixels of field
Cloth, then using the highest region of the frequency of occurrences in position distribution as target area.Due to field block of pixels and source pixel block S it
Between there are certain similarities, therefore, the adapter block D of source pixel block S also very likely appears in target area.It therefore, can be with
It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source pixel
The corresponding adapter block of block S.Please refer to Fig. 5, the frequency highest that A point occurs as adapter block in figure, therefore using A point as with source image
The corresponding adapter block of plain block S.
Step S140: it is calculated and the depth information apart from corresponding block of pixels based on the distance.
Optionally, formula can be based onThe depth information is calculated, wherein depth information is shooting
The distance between object and video camera, Z are the depth information, and a and b are constant, and mv is the source pixel block S and the adaptation
The distance between block.
Incorporated by reference to Fig. 6,Derivation process can refer to following process:
In order to establish the relationship between depth information and mv, therefore, according to shooting the distance between object and video camera
(d), the similitude of the parallax between right and left eyes image (mv) and triangle, it is available: d/dt=dv/mv, wherein dv
For the distance between video camera and right and left eyes image, dt is interpupillary distance, since the distance between video camera and right and left eyes image are to become
Amount is not constant, and therefore, it is necessary to eliminate the dv in d/dt=dv/mv.
Since camera focus is fk, according to camera imaging principle, the linear relationship of dv and d are established, it is then available: 1/
D+1/dv=1/fk please refers to following formula so that dv is indicated by d.
Dv=1/ (1/fk-1/d)
It is then available by bringing the expression formula of dv into d/dt=dv/mv:
D/dt=1/ ((1/fk-1/d) * mv), so that the expression formula is not influenced by dv.
It is then available by the way that d/dt=1/ ((1/fk-1/d) * mv) is made further deformation:
D*mv=dt/ (1/fk-1/d)
D*mv=dt/ ((d-fk)/(fk*d))
D*mv=dt*fk*d/ (d-fk)
(d-fk) * mv=dt*fk
Further, equation is obtained: d=dt*fk/mv+fk.
Since interpupillary distance and focal length are all constants, above formula can be further converted into:
Since there are two unknown parameters a and b in above formula, therefore, it is necessary to construct two equations to come simultaneous solution a and b,
In, the two parameters can shoot image by two groups of material objects and obtain:
Wherein, d0 is in kind at a distance from video camera, the mv0 in first group of material object shooting image
The motion vector that the left eye in kind in image is overlapped between image and eye image is shot for first group of material object, d1 is second group
For material object in material object shooting image at a distance from video camera, mv1 is the left eye overlapping in kind in second group of material object shooting image
Motion vector between image and eye image, d0, d1, mv0 and mv1 are that can be substituted by measuring obtained data
Above-mentioned equation group can solve a, the numerical value of b.
Step S150: before being encoded, according to the depth information, each block of pixels being grouped, so that
The VR video image is at least divided into two groups.
In video camera when shooting before object, when being encoded to image, need with the image distribution to some regions
The image of high code rate, some regions distributes low bit- rate, so as to improve compression efficiency, saves bandwidth, does not have in subjective quality
Under the premise of significant difference, code rate is saved, therefore, it is necessary to take region to video camera to divide, as a kind of embodiment party
Four markers can artificially be arranged under shooting environmental, so that constant a and b be calculated, and obtain the depth of marker for formula
Degree is according to the edge of window dividing value cut as depth, and the edge of window dividing value cut according to depth is to take region to video camera
It is divided, judges whether the block of pixels belongs to according to the depth data of block of pixels and determined according to the edge of window dividing value that depth is cut
Window area in, by be located at window area in block of pixels be divided into high quality group, by be located at window area outside block of pixels
Be divided into low quality group, when being encoded to VR video image, based on the first code rate to belong to the block of pixels of high quality group into
Row coding, encodes the block of pixels for belonging to low quality group based on the second code rate, wherein the value of the first code rate is greater than second code
The value of rate.
Therefore, by the way that VR video image to be layered, in subsequent progress VR Video coding, it is based on above-mentioned Data Rate Distribution
Mode is distributed relatively large number of code rate to the video image of high quality group, be can be improved compared with the video image of low quality group
Compression efficiency, code rate can be saved under the premise of subjective quality does not have significant difference by saving bandwidth.
A kind of method convenient for VR Video coding that first embodiment of the invention provides, is getting VR video to be encoded
It include first block of pixels that the first image and the second image are divided into default ranks number by VR video image after image;Then needle
To each of the first image block of pixels, determined in second image block of pixels as adapter block,
The block of pixels is calculated the distance between to the adapter block, wherein the similarity difference between the block of pixels and the adapter block
Value is minimum;The distance is then based on to be calculated and the depth information apart from corresponding block of pixels;Before being encoded,
According to the depth information, each block of pixels is grouped, so that the VR video image is at least divided into two groups, that
In subsequent progress VR Video coding, compared with the video image of low quality group, phase is distributed to the video image of high quality group
Bandwidth is saved so as to improve compression efficiency to more code rates.
Second embodiment
Fig. 7 is please referred to, Fig. 7 is a kind of knot for device 400 convenient for VR Video coding that second embodiment of the invention provides
Structure block diagram.Structural block diagram shown in Fig. 7 will be illustrated below, shown device includes:
Obtain module 410, for obtaining VR video image to be encoded, the VR video image include the first image and
Second image;
Division module 420, for the first image and second image to be divided into the picture of default ranks number
Plain block;
Computing module 430, for being directed to each of the first image block of pixels, in second image really
A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute
The similarity difference value stated between adapter block is minimum;
The computing module 430 is also used to be calculated based on the distance and the depth apart from corresponding block of pixels
Spend information;
Grouping module 440, for according to the depth information, each block of pixels being divided before being encoded
Group, so that the VR video image is at least divided into two groups.
Optionally, described device further include: preprocessing module, for the first image and second image to be divided
Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image
Second gradient image.
Optionally, the computing module 430, for being based on public affairs for each of the first image block of pixels
FormulaThe similarity calculated between each block of pixels in the block of pixels and second image is poor
Different value, wherein SAD is the similarity difference value, SijFor the block of pixels that the i-th row jth in the first image arranges, dijFor institute
State the block of pixels that the i-th row jth arranges in the second image;Make the SAD when searching a block of pixels in second image
When minimum, determine that the block of pixels is and the SijThe corresponding adapter block.
Optionally, the computing module 430, multiple block of pixels ought be searched in second image by, which being also used to, makes institute
It, will be with the S when stating SAD minimumijAdjacent multiple block of pixels are determined as field block of pixels;The field block of pixels is obtained to be wrapped
The band of position of the corresponding adapter block of each block of pixels included determines the highest region of the adapter block frequency of occurrences as target area
Domain;It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the Sij
Corresponding adapter block.
Optionally, the computing module 430 is also used to based on formulaThe depth information is calculated,
Wherein, Z is the depth information, and a and b are constant, and mv is the distance.
The present embodiment please join the process of the respective function of each Implement of Function Module of the device 400 convenient for VR Video coding
See content described in above-mentioned Fig. 1 to embodiment illustrated in fig. 5, details are not described herein again.
In addition, schematic diagram can be as shown in Figure 1, include mutual the embodiment of the invention also provides a kind of electronic equipment
Memory 110, the processor 120 of connection, the memory 110 is interior to store computer program, when the computer program is by institute
When stating the execution of processor 120, so that the electronic equipment 100 executes provided by embodiment any one of of the invention convenient for VR view
The method of frequency coding.
In addition, the embodiment of the invention also provides a kind of computer readable storage medium, in the computer-readable storage medium
Computer program is stored in matter, when the computer program is run on computers, so that the computer executes this hair
It is convenient for the method for VR Video coding provided by any one of bright embodiment.
In addition, the embodiment of the invention also provides a kind of computer program, the computer program can store beyond the clouds or
On the storage medium of person local, when the computer program is run on computers, so that the computer executes the present invention
It is convenient for the method for VR Video coding provided by any one embodiment.
In conclusion proposition of the embodiment of the present invention is situated between convenient for the method, apparatus of VR Video coding, electronic equipment and storage
VR video image is included first that the first image and the second image are divided into after getting VR video image to be encoded by matter
The block of pixels of default ranks number;Then for each of the first image block of pixels, in second image really
A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute
The similarity difference value stated between adapter block is minimum;The distance is then based on to be calculated with described apart from corresponding block of pixels
Depth information;Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR
Video image is at least divided into two groups, then in subsequent progress VR Video coding, it is right compared with the video image of low quality group
The video image of high quality group distributes relatively more code rates, so as to improve compression efficiency, saves bandwidth.
In several embodiments provided herein, it should be understood that disclosed device and method can also pass through
Other modes are realized.The apparatus embodiments described above are merely exemplary, for example, flow chart and block diagram in attached drawing
Show the device of multiple embodiments according to the present invention, the architectural framework in the cards of method and computer program product,
Function and operation.In this regard, each box in flowchart or block diagram can represent the one of a module, section or code
Part, a part of the module, section or code, which includes that one or more is for implementing the specified logical function, to be held
Row instruction.It should also be noted that function marked in the box can also be to be different from some implementations as replacement
The sequence marked in attached drawing occurs.For example, two continuous boxes can actually be basically executed in parallel, they are sometimes
It can execute in the opposite order, this depends on the function involved.It is also noted that every in block diagram and or flow chart
The combination of box in a box and block diagram and or flow chart can use the dedicated base for executing defined function or movement
It realizes, or can realize using a combination of dedicated hardware and computer instructions in the system of hardware.
In addition, each functional module in each embodiment of the present invention can integrate one independent portion of formation together
Point, it is also possible to modules individualism, an independent part can also be integrated to form with two or more modules.
It, can be with if the function is realized and when sold or used as an independent product in the form of software function module
It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words
The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a
People's computer, server or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention.
And storage medium above-mentioned includes: that USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), arbitrary access are deposited
The various media that can store program code such as reservoir (RAM, Random Access Memory), magnetic or disk.It needs
Illustrate, herein, relational terms such as first and second and the like be used merely to by an entity or operation with
Another entity or operation distinguish, and without necessarily requiring or implying between these entities or operation, there are any this realities
The relationship or sequence on border.Moreover, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability
Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including
Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device.
In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element
Process, method, article or equipment in there is also other identical elements.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field
For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair
Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.It should also be noted that similar label and letter exist
Similar terms are indicated in following attached drawing, therefore, once being defined in a certain Xiang Yi attached drawing, are then not required in subsequent attached drawing
It is further defined and explained.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any
Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain
Lid is within protection scope of the present invention.Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (10)
1. a kind of method convenient for VR Video coding, which is characterized in that the described method includes:
VR video image to be encoded is obtained, the VR video image includes the first image and the second image;
The first image and second image are divided into the block of pixels of default ranks number;
For each of the first image block of pixels, determine a block of pixels as suitable in second image
With block, the block of pixels is calculated the distance between to the adapter block, wherein the similarity between the block of pixels and the adapter block
Difference value is minimum;
Based on the distance, it is calculated and the depth information apart from corresponding block of pixels;
Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR video image
At least it is divided into two groups.
2. the method according to claim 1, wherein being drawn by the first image and second image
It is divided into before the block of pixels of preset quantity, the method also includes:
The first image and second image are subjected to binary conversion treatment respectively, obtained corresponding with the first image
First gradient image and the second gradient image corresponding with second image.
3. the method according to claim 1, wherein the default ranks number is 8 × 8, for first figure
Each of the picture block of pixels, determines a block of pixels as adapter block in second image, comprising:
Formula is based on using the block of pixels as source pixel block for each of the first image block of pixelsCalculate the similarity difference between each block of pixels in the block of pixels and second image
Value, wherein SAD is the similarity difference value, SijFor the pixel that the i-th row jth in the block of pixels arranges, dijFor second figure
The pixel that the i-th row jth arranges in some block of pixels as in;
When searching a block of pixels in second image and making the SAD minimum, determine in second image
The block of pixels is purpose block of pixels, and the purpose block of pixels is the corresponding adapter block of the source pixel block.
4. according to the method described in claim 3, it is characterized in that, the method also includes:
It, will be adjacent with the source pixel block when searching multiple block of pixels in second image and making the SAD minimum
Multiple block of pixels be determined as field block of pixels;
The band of position for obtaining the corresponding adapter block of each block of pixels included by the field block of pixels, determines that adapter block goes out
The existing highest region of frequency is as target area;
It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source
The corresponding adapter block of block of pixels.
5. the method according to claim 1, wherein being calculated with described based on the distance apart from corresponding
The depth information of block of pixels, comprising:
Based on formulaThe depth information is calculated, wherein Z is the depth information, and a and b are constant, mv
For the distance.
6. a kind of device convenient for VR Video coding, which is characterized in that described device includes:
Module is obtained, for obtaining VR video image to be encoded, the VR video image includes the first image and the second figure
Picture;
Division module, for the first image and second image to be divided into the block of pixels of default ranks number;
Computing module, for determining one in second image for each of the first image block of pixels
A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block
Similarity difference value between block is minimum;
The computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels;
Grouping module, for according to the depth information, each block of pixels being grouped before being encoded, so that
The VR video image is at least divided into two groups.
7. device according to claim 6, which is characterized in that described device further include:
Preprocessing module obtains and institute for the first image and second image to be carried out binary conversion treatment respectively
State the corresponding first gradient image of the first image and the second gradient image corresponding with second image.
8. device according to claim 6, which is characterized in that the computing module, for in the first image
Each of the block of pixels be based on formula using the block of pixels as source pixel blockCalculate the picture
The similarity difference value between each block of pixels in plain block and second image, wherein SAD is the similarity difference
Value, SijFor the pixel that the i-th row jth in the block of pixels arranges, dijIt is arranged for the i-th row jth in some block of pixels in second image
Pixel;
When searching a block of pixels in second image and making the SAD minimum, determine in second image
The block of pixels is purpose block of pixels, and the purpose block of pixels is the corresponding adapter block of the source pixel block.
9. a kind of electronic equipment, which is characterized in that including memory interconnected, processor, storage meter in the memory
Calculation machine program, when the computer program is executed by the processor, so that the electronic equipment perform claim requires in 1-5
Method described in any one.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer in the computer readable storage medium
Program, when the computer program is run on computers, so that the computer is executed as any one in claim 1-5
Method described in.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910022693.7A CN109451318B (en) | 2019-01-09 | 2019-01-09 | Method, apparatus, electronic device and storage medium for facilitating VR video encoding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910022693.7A CN109451318B (en) | 2019-01-09 | 2019-01-09 | Method, apparatus, electronic device and storage medium for facilitating VR video encoding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109451318A true CN109451318A (en) | 2019-03-08 |
CN109451318B CN109451318B (en) | 2022-11-01 |
Family
ID=65543945
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910022693.7A Active CN109451318B (en) | 2019-01-09 | 2019-01-09 | Method, apparatus, electronic device and storage medium for facilitating VR video encoding |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109451318B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111954085A (en) * | 2020-08-06 | 2020-11-17 | 咪咕文化科技有限公司 | VR video display method, device, network equipment and storage medium |
CN114786037A (en) * | 2022-03-17 | 2022-07-22 | 青岛虚拟现实研究院有限公司 | Self-adaptive coding compression method facing VR projection |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101965733A (en) * | 2008-03-09 | 2011-02-02 | Lg电子株式会社 | Be used to encode or the method and apparatus of decoded video signal |
CN104427345A (en) * | 2013-09-11 | 2015-03-18 | 华为技术有限公司 | Motion vector acquisition method, acquisition device, video codec and method thereof |
CN104702954A (en) * | 2013-12-05 | 2015-06-10 | 华为技术有限公司 | Video coding method and device |
WO2015200820A1 (en) * | 2014-06-26 | 2015-12-30 | Huawei Technologies Co., Ltd. | Method and device for providing depth based block partitioning in high efficiency video coding |
CN102970529B (en) * | 2012-10-22 | 2016-02-17 | 北京航空航天大学 | A kind of object-based multi-view point video fractal image compression & decompression method |
-
2019
- 2019-01-09 CN CN201910022693.7A patent/CN109451318B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101965733A (en) * | 2008-03-09 | 2011-02-02 | Lg电子株式会社 | Be used to encode or the method and apparatus of decoded video signal |
CN102970529B (en) * | 2012-10-22 | 2016-02-17 | 北京航空航天大学 | A kind of object-based multi-view point video fractal image compression & decompression method |
CN104427345A (en) * | 2013-09-11 | 2015-03-18 | 华为技术有限公司 | Motion vector acquisition method, acquisition device, video codec and method thereof |
CN104702954A (en) * | 2013-12-05 | 2015-06-10 | 华为技术有限公司 | Video coding method and device |
WO2015200820A1 (en) * | 2014-06-26 | 2015-12-30 | Huawei Technologies Co., Ltd. | Method and device for providing depth based block partitioning in high efficiency video coding |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111954085A (en) * | 2020-08-06 | 2020-11-17 | 咪咕文化科技有限公司 | VR video display method, device, network equipment and storage medium |
CN114786037A (en) * | 2022-03-17 | 2022-07-22 | 青岛虚拟现实研究院有限公司 | Self-adaptive coding compression method facing VR projection |
CN114786037B (en) * | 2022-03-17 | 2024-04-12 | 青岛虚拟现实研究院有限公司 | VR projection-oriented adaptive coding compression method |
Also Published As
Publication number | Publication date |
---|---|
CN109451318B (en) | 2022-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Liu et al. | PQA-Net: Deep no reference point cloud quality assessment via multi-view projection | |
US10198623B2 (en) | Three-dimensional facial recognition method and system | |
CN110458805B (en) | Plane detection method, computing device and circuit system | |
Ju et al. | Depth-aware salient object detection using anisotropic center-surround difference | |
Zhou et al. | Omnidirectional image quality assessment by distortion discrimination assisted multi-stream network | |
US9704066B2 (en) | Multi-stage image classification | |
US20180018503A1 (en) | Method, terminal, and storage medium for tracking facial critical area | |
CN104574342B (en) | The noise recognizing method and Noise Identification device of parallax depth image | |
US10848746B2 (en) | Apparatus including multiple cameras and image processing method | |
CN111340866A (en) | Depth image generation method, device and storage medium | |
Tu et al. | V-PCC projection based blind point cloud quality assessment for compression distortion | |
WO2017095543A1 (en) | Object detection with adaptive channel features | |
CN114627244A (en) | Three-dimensional reconstruction method and device, electronic equipment and computer readable medium | |
CN109451318A (en) | Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium | |
CN113139540A (en) | Backboard detection method and equipment | |
CN113920023B (en) | Image processing method and device, computer readable medium and electronic equipment | |
CN111291611A (en) | Pedestrian re-identification method and device based on Bayesian query expansion | |
Wang et al. | Salient video object detection using a virtual border and guided filter | |
Wang et al. | Deep intensity guidance based compression artifacts reduction for depth map | |
Li et al. | Graph-based saliency fusion with superpixel-level belief propagation for 3D fixation prediction | |
WO2023273515A1 (en) | Target detection method, apparatus, electronic device and storage medium | |
Yang et al. | User models of subjective image quality assessment on virtual viewpoint in free-viewpoint video system | |
Tsai et al. | A novel method for 2D-to-3D video conversion based on boundary information | |
CN110264431A (en) | Video beautification method, device and electronic equipment | |
CN116506627A (en) | Encoding method and device for searching by constructing hash table by multi-feature hash value |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |