CN109451318A

CN109451318A - Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium

Info

Publication number: CN109451318A
Application number: CN201910022693.7A
Authority: CN
Inventors: 鲍金龙
Original assignee: Individual
Current assignee: Individual
Priority date: 2019-01-09
Filing date: 2019-01-09
Publication date: 2019-03-08
Anticipated expiration: 2039-01-09
Also published as: CN109451318B

Abstract

The present invention provides a kind of convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium, includes the block of pixels that the first image and the second image are divided into default ranks number by VR video image after getting VR video image to be encoded；Then for each of the first image block of pixels, determine that a block of pixels as adapter block, calculates the block of pixels and arrives the distance between adapter block in the second image, wherein the similarity difference value between the block of pixels and adapter block is minimum；Distance is then based on to be calculated and the depth information apart from corresponding block of pixels；Before being encoded, according to depth information, each block of pixels is grouped, so that VR video image is at least divided into two groups, so in subsequent progress VR Video coding, compared with the video image of low quality group, relatively more code rates can be distributed to the video image of high quality group, so as to improve compression efficiency, bandwidth is saved.

Description

Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium

Technical field

The present invention relates to field of video encoding, in particular to a kind of method, apparatus convenient for VR Video coding, electricity Sub- equipment and storage medium.

Background technique

In current Video Coding Scheme, mainly have following three according to the algorithm that video content does adaptive layered coding Class: image is done and divides, analysis of complexity is carried out to encoding block, picture material is done and divides or identifies.The master of above-mentioned algorithm Wanting problem is that calculation amount is generally excessive, and the segmentation and identification to image do not have real-time usually, so that very strong in real-time Above-mentioned algorithm can not be applied in net cast application.

The sports tournament of panorama VR is broadcast live, and the resolution ratio and frame rate index request to video are all very high.If directly adopted With common coding method, video stream bit rate is excessively high, can make the difficulty of network direct broadcasting greatly.

Summary of the invention

In view of this, the embodiment of the present invention is designed to provide a kind of method, apparatus convenient for VR Video coding, electronics Equipment and storage medium, to alleviate the above problem.

In a first aspect, the embodiment of the invention provides a kind of methods convenient for VR Video coding, which comprises obtain VR video image to be encoded, the VR video image include the first image and the second image；By the first image and Second image is divided into the block of pixels of default ranks number；For each of the first image block of pixels, A block of pixels is determined in second image as adapter block, calculates the block of pixels the distance between to the adapter block, Wherein, the similarity difference value between the block of pixels and the adapter block is minimum；Based on the distance be calculated with it is described away from Depth information from corresponding block of pixels；Before being encoded, according to the depth information, each block of pixels is divided Group, so that the VR video image is at least divided into two groups.

Second aspect, the embodiment of the invention provides a kind of devices convenient for VR Video coding, module are obtained, for obtaining VR video image to be encoded, the VR video image include the first image and the second image；Division module, being used for will be described First image and second image are divided into the block of pixels of default ranks number；Computing module, for being directed to described first Each of image block of pixels determines that a block of pixels as adapter block, calculates the pixel in second image Block is the distance between to the adapter block, wherein the similarity difference value between the block of pixels and the adapter block is minimum；It is described Computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels；Grouping module, For according to the depth information, each block of pixels being grouped before being encoded, so that the VR video image At least it is divided into two groups.

The third aspect, the embodiment of the present invention provide a kind of electronic equipment, including memory interconnected, processor, institute It states and stores computer program in memory, when the computer program is executed by the processor, so that the electronic equipment Execute method described in first aspect.

Fourth aspect, the embodiment of the present invention provide a kind of computer readable storage medium, the computer-readable storage medium Computer program is stored in matter, when the computer program is run on computers, so that the computer executes first Method described in aspect.

Compared with prior art, the method, apparatus convenient for VR Video coding of various embodiments of the present invention proposition, electronic equipment And the beneficial effect of storage medium is: including first the first figure by VR video image after getting VR video image to be encoded Picture and the second image are divided into the block of pixels of default ranks number；Then for each of the first image pixel Block determines a block of pixels as adapter block in second image, calculates the block of pixels between the adapter block Distance, wherein the similarity difference value between the block of pixels and the adapter block is minimum；The distance is then based on to be calculated With the depth information apart from corresponding block of pixels；Before being encoded, according to the depth information, by each pixel Block is grouped, so that the VR video image is at least divided into two groups, then in subsequent progress VR Video coding, with low-quality The video content of amount group is compared, and distributes relatively more code rates to the video content of high quality group, so as to improve compression effect Rate saves bandwidth.

To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate Appended attached drawing, is described in detail below.

Detailed description of the invention

In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.

Fig. 1 is the structural block diagram of electronic equipment provided in an embodiment of the present invention；

Fig. 2 is one of the flow chart of method convenient for VR Video coding that first embodiment of the invention provides；

Fig. 3 is schematic diagram of the block of pixels that provides of first embodiment of the invention to the distance between adapter block；

Fig. 4 is the schematic diagram for the field block of pixels that first embodiment of the invention provides；

Fig. 5 is the determination schematic diagram for the adapter block that first embodiment of the invention provides；

Fig. 6 is the calculating schematic diagram of constant a, b that first embodiment of the invention provides；

Fig. 7 is the structural block diagram for the device convenient for VR Video coding that second embodiment of the invention provides.

Specific embodiment

Below in conjunction with attached drawing in the embodiment of the present invention, technical solution in the embodiment of the present invention carries out clear, complete Ground description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Usually exist The component of the embodiment of the present invention described and illustrated in attached drawing can be arranged and be designed with a variety of different configurations herein.Cause This, is not intended to limit claimed invention to the detailed description of the embodiment of the present invention provided in the accompanying drawings below Range, but it is merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art are not doing Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.

It should also be noted that similar label and letter indicate similar terms in following attached drawing, therefore, once a certain Xiang Yi It is defined in a attached drawing, does not then need that it is further defined and explained in subsequent attached drawing.Meanwhile of the invention In description, term " first ", " second " etc. are only used for distinguishing description, are not understood to indicate or imply relative importance.

As shown in Figure 1, being the block diagram of electronic equipment 100.The electronic equipment 100 may include: convenient for VR view Device, the memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound of frequency coding Frequency unit 160, display unit 170.Wherein, the electronic equipment 100 can be user terminal, such as PC (personal computer, PC), tablet computer, smart phone, personal digital assistant (personal digital Assistant, PDA) etc., it is also possible to server.

The memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound Frequency unit 160 and each element of display unit 170 are directly or indirectly electrically connected between each other, with realize data transmission or Interaction.It is electrically connected for example, these elements can be realized between each other by one or more communication bus or signal wire.It is described just It include that at least one can be stored in the memory in the form of software or firmware (firmware) in the device of VR Video coding In 110 or the software function module that is solidificated in the operating system (operating system, OS) of electronic equipment.The processing Device 130 for executing the executable module stored in memory 110, such as the device convenient for VR Video coding include it is soft Part functional module or computer program.

Wherein, memory 110 may be, but not limited to, random access memory (Random Access Memory, RAM), read-only memory (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM), Electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc.. Wherein, memory 110 is for storing program, and the processor 130 executes described program after receiving and executing instruction, aforementioned The method for the flow definition that any embodiment of the embodiment of the present invention discloses can be applied in processor 130, or by processor 130 realize.

Processor 130 may be a kind of IC chip, the processing capacity with signal.Above-mentioned processor 130 can To be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit (Network Processor, abbreviation NP) etc.；Can also be digital signal processor (DSP), specific integrated circuit (ASIC), Field programmable gate array (FPGA) either other programmable logic device, discrete gate or transistor logic, discrete hard Part component.It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present invention.General processor It can be microprocessor or the processor be also possible to any conventional processor etc..

Various input/output devices are couple processor 130 and memory 110 by the Peripheral Interface 140.Some In embodiment, Peripheral Interface 140, processor 130 and storage control 120 can be realized in one single chip.Other one In a little examples, they can be realized by independent chip respectively.

Input-output unit 150 is used to be supplied to the interaction that user input data realizes user and electronic equipment 100.It is described Input-output unit 150 may be, but not limited to, mouse and keyboard etc..

Audio unit 160 provides a user audio interface, may include one or more microphones, one or more raises Sound device and voicefrequency circuit.

Display unit 170 provides an interactive interface (such as user interface) between electronic equipment 100 and user Or it is referred to for display image data to user.In the present embodiment, the display unit 170 can be liquid crystal display or touching Control display.It can be the touching of the capacitance type touch control screen or resistance-type of support single-point and multi-point touch operation if touch control display Control screen etc..Single-point and multi-point touch operation is supported to refer to that touch control display can sense on the touch control display one or more The touch control operation generated simultaneously at a position, and the touch control operation that this is sensed transfers to processor 130 to be calculated and handled.

First embodiment

Referring to figure 2., Fig. 2 is a kind of process for method convenient for VR Video coding that first embodiment of the invention provides Figure, the method are applied to electronic equipment.Process shown in Fig. 2 will be described in detail below, which comprises

Step S110: obtaining VR video image to be encoded, and the VR video image includes the first image and the second figure Picture.

Since VR video image has left and right both of which, the VR video image that electronic equipment 100 is got can wrap Include left image and right image.Wherein, there are visual angle differences between left image and right image.

Optionally, the first image can be left image, correspondingly, second image is right image；Optionally, institute Stating the first image can be right image, correspondingly, second image is left image.

Step S120: the first image and second image are divided into the block of pixels of default ranks number.

Wherein, the default ranks number can arrange for 8 rows 8, i.e., the first image and second image are drawn It is divided into 8 × 8 block of pixels, the size of each block of pixels is identical.It certainly, as an alternative embodiment, can also be with Use the piecemeal principle of bigger (such as 9 × 9) or smaller (such as 6 × 6).

Certainly, as an alternative embodiment, being divided by the first image and second image It, can also be by the first image and second image point in order to improve computational efficiency before the block of pixels of preset quantity Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image Second gradient image.Wherein, image binaryzation processing is exactly to set 0 or 255 for the gray value of the pixel on image, also It is that whole image is showed to apparent black and white effect, wherein 0 indicates white, and 255 indicate black.By 256 brightness degrees Gray level image is obtained by selection threshold value appropriate still can reflect the whole binary image with local feature of image.Example Such as, the gray value that gray value is greater than or equal to the pixel of threshold value resets to 255, and gray value is less than to the pixel of preset threshold The gray value of point resets to 0.

The binaryzation of image is conducive to being further processed for image, becomes image simply, and data volume reduces, can not only be convex The profile of interested target is showed, and computation complexity can be reduced.

Wherein it is possible to be carried out at binaryzation by SOBEL or CANNEY algorithm to the first image and second image Reason, to obtain and the corresponding first gradient image of the first image and the second gradient map corresponding with second image Picture.

Step S130: for each of the first image block of pixels, one is determined in second image A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block Similarity difference value between block is minimum.

Since the relationship between the first image and the second image is to belong to the same VR video image, the first image Diversity factor is mainly as caused by the difference of visual angle between the second image.In this case, for every in the first image A block of pixels S can find an adapter block D corresponding with block of pixels S in the second image.Wherein, by block of pixels S When carrying out the calculating of similarity difference value with each block of pixels in the second image, adapter block D and block of pixels S as block of pixels S Between similarity difference value it is minimum.Make the SAD minimum when searching a block of pixels D in second image When, wherein SAD be the similarity difference value, determine block of pixels D for institute corresponding to the block of pixels S in the first image State adapter block, wherein S can be referred to as source pixel block, and D can be referred to as to be purpose block of pixels.

When default ranks number be 8 × 8, and carry out similarity difference value calculate when, optionally, for the first image Each of the block of pixels, using the block of pixels as source pixel block S, the dimension of the corresponding pixel block matrix of each block of pixels is 8 × 8, wherein the element value in the pixel block matrix is the pixel value of the pixel in the block of pixels, can be based on formulaCalculate the similarity difference between each block of pixels in the block of pixels and second image Value, wherein SAD is the similarity difference value, S_ijFor the pixel value for the pixel that the i-th row jth in the block of pixels arranges, d_ijFor The pixel value for the pixel that the i-th row jth arranges in some block of pixels in second image.Wherein, i, j are pictures in pixel block matrix The subscript of prime element.

Corresponding adapter block is being found for each of the first image block of pixels incorporated by reference to Fig. 3 Afterwards, the block of pixels (i.e. source pixel block S) and corresponding adapter block (i.e. purpose picture can be calculated by window search Plain block D) between moving displacement, i.e. distance mv.

In order to calculate mv, therefore, either source pixel block or purpose block of pixels, coordinate (x, y) are based respectively on pixel Block respectively where the upper left corner of 1/2 image be that origin determines, wherein the coordinate and mesh of the top left corner pixel point of source pixel block The difference of coordinate of block of pixels top left corner pixel point be exactly motion vector mv.Since the first image and the second image are left and right figures Picture, therefore, the water between the coordinate of the top left corner pixel point of the coordinate and purpose block of pixels of the top left corner pixel point of source pixel block Flat distance is motion vector mv.

Assuming that the coordinate of source pixel block is (x=8, y=6), S can be expressed as_(8,16), the coordinate of purpose block of pixels is (x =25, y=16), D can be expressed as_(25,16), then the numerical value of mv is exactly 25-8=17；It certainly is also likely to be negative, such as mesh The coordinate of block of pixels be (x=-2, y=16), then motion vector mv is exactly -10.

As another optional embodiment, it for the source pixel block S in the first image, is searched in second image It, therefore, will be with the source image in order to determine the adapter block of source pixel block S when rope makes the SAD minimum to multiple block of pixels Plain block S adjacent multiple block of pixels are determined as field block of pixels, wherein please refer to Fig. 4, field block of pixels can be the first image In with 8 block of pixels at the center S.

It is included every in the available field block of pixels of electronic equipment 100 after field block of pixels has been determined The band of position of the corresponding adapter block of a block of pixels, and count the position point of the adapter block of each block of pixels in the block of pixels of field Cloth, then using the highest region of the frequency of occurrences in position distribution as target area.Due to field block of pixels and source pixel block S it Between there are certain similarities, therefore, the adapter block D of source pixel block S also very likely appears in target area.It therefore, can be with It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source pixel The corresponding adapter block of block S.Please refer to Fig. 5, the frequency highest that A point occurs as adapter block in figure, therefore using A point as with source image The corresponding adapter block of plain block S.

Step S140: it is calculated and the depth information apart from corresponding block of pixels based on the distance.

Optionally, formula can be based onThe depth information is calculated, wherein depth information is shooting The distance between object and video camera, Z are the depth information, and a and b are constant, and mv is the source pixel block S and the adaptation The distance between block.

Incorporated by reference to Fig. 6,Derivation process can refer to following process:

In order to establish the relationship between depth information and mv, therefore, according to shooting the distance between object and video camera (d), the similitude of the parallax between right and left eyes image (mv) and triangle, it is available: d/dt=dv/mv, wherein dv For the distance between video camera and right and left eyes image, dt is interpupillary distance, since the distance between video camera and right and left eyes image are to become Amount is not constant, and therefore, it is necessary to eliminate the dv in d/dt=dv/mv.

Since camera focus is fk, according to camera imaging principle, the linear relationship of dv and d are established, it is then available: 1/ D+1/dv=1/fk please refers to following formula so that dv is indicated by d.

Dv=1/ (1/fk-1/d)

It is then available by bringing the expression formula of dv into d/dt=dv/mv:

D/dt=1/ ((1/fk-1/d) * mv), so that the expression formula is not influenced by dv.

It is then available by the way that d/dt=1/ ((1/fk-1/d) * mv) is made further deformation:

D*mv=dt/ (1/fk-1/d)

D*mv=dt/ ((d-fk)/(fk*d))

D*mv=dt*fk*d/ (d-fk)

(d-fk) * mv=dt*fk

Further, equation is obtained: d=dt*fk/mv+fk.

Since interpupillary distance and focal length are all constants, above formula can be further converted into:

Since there are two unknown parameters a and b in above formula, therefore, it is necessary to construct two equations to come simultaneous solution a and b, In, the two parameters can shoot image by two groups of material objects and obtain:

Wherein, d0 is in kind at a distance from video camera, the mv0 in first group of material object shooting image The motion vector that the left eye in kind in image is overlapped between image and eye image is shot for first group of material object, d1 is second group For material object in material object shooting image at a distance from video camera, mv1 is the left eye overlapping in kind in second group of material object shooting image Motion vector between image and eye image, d0, d1, mv0 and mv1 are that can be substituted by measuring obtained data Above-mentioned equation group can solve a, the numerical value of b.

Step S150: before being encoded, according to the depth information, each block of pixels being grouped, so that The VR video image is at least divided into two groups.

In video camera when shooting before object, when being encoded to image, need with the image distribution to some regions The image of high code rate, some regions distributes low bit- rate, so as to improve compression efficiency, saves bandwidth, does not have in subjective quality Under the premise of significant difference, code rate is saved, therefore, it is necessary to take region to video camera to divide, as a kind of embodiment party Four markers can artificially be arranged under shooting environmental, so that constant a and b be calculated, and obtain the depth of marker for formula Degree is according to the edge of window dividing value cut as depth, and the edge of window dividing value cut according to depth is to take region to video camera It is divided, judges whether the block of pixels belongs to according to the depth data of block of pixels and determined according to the edge of window dividing value that depth is cut Window area in, by be located at window area in block of pixels be divided into high quality group, by be located at window area outside block of pixels Be divided into low quality group, when being encoded to VR video image, based on the first code rate to belong to the block of pixels of high quality group into Row coding, encodes the block of pixels for belonging to low quality group based on the second code rate, wherein the value of the first code rate is greater than second code The value of rate.

Therefore, by the way that VR video image to be layered, in subsequent progress VR Video coding, it is based on above-mentioned Data Rate Distribution Mode is distributed relatively large number of code rate to the video image of high quality group, be can be improved compared with the video image of low quality group Compression efficiency, code rate can be saved under the premise of subjective quality does not have significant difference by saving bandwidth.

A kind of method convenient for VR Video coding that first embodiment of the invention provides, is getting VR video to be encoded It include first block of pixels that the first image and the second image are divided into default ranks number by VR video image after image；Then needle To each of the first image block of pixels, determined in second image block of pixels as adapter block, The block of pixels is calculated the distance between to the adapter block, wherein the similarity difference between the block of pixels and the adapter block Value is minimum；The distance is then based on to be calculated and the depth information apart from corresponding block of pixels；Before being encoded, According to the depth information, each block of pixels is grouped, so that the VR video image is at least divided into two groups, that In subsequent progress VR Video coding, compared with the video image of low quality group, phase is distributed to the video image of high quality group Bandwidth is saved so as to improve compression efficiency to more code rates.

Second embodiment

Fig. 7 is please referred to, Fig. 7 is a kind of knot for device 400 convenient for VR Video coding that second embodiment of the invention provides Structure block diagram.Structural block diagram shown in Fig. 7 will be illustrated below, shown device includes:

Obtain module 410, for obtaining VR video image to be encoded, the VR video image include the first image and Second image；

Division module 420, for the first image and second image to be divided into the picture of default ranks number Plain block；

Computing module 430, for being directed to each of the first image block of pixels, in second image really A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute The similarity difference value stated between adapter block is minimum；

The computing module 430 is also used to be calculated based on the distance and the depth apart from corresponding block of pixels Spend information；

Grouping module 440, for according to the depth information, each block of pixels being divided before being encoded Group, so that the VR video image is at least divided into two groups.

Optionally, described device further include: preprocessing module, for the first image and second image to be divided Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image Second gradient image.

Optionally, the computing module 430, for being based on public affairs for each of the first image block of pixels FormulaThe similarity calculated between each block of pixels in the block of pixels and second image is poor Different value, wherein SAD is the similarity difference value, S_ijFor the block of pixels that the i-th row jth in the first image arranges, d_ijFor institute State the block of pixels that the i-th row jth arranges in the second image；Make the SAD when searching a block of pixels in second image When minimum, determine that the block of pixels is and the S_ijThe corresponding adapter block.

Optionally, the computing module 430, multiple block of pixels ought be searched in second image by, which being also used to, makes institute It, will be with the S when stating SAD minimum_ijAdjacent multiple block of pixels are determined as field block of pixels；The field block of pixels is obtained to be wrapped The band of position of the corresponding adapter block of each block of pixels included determines the highest region of the adapter block frequency of occurrences as target area Domain；It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the S_ij Corresponding adapter block.

Optionally, the computing module 430 is also used to based on formulaThe depth information is calculated, Wherein, Z is the depth information, and a and b are constant, and mv is the distance.

The present embodiment please join the process of the respective function of each Implement of Function Module of the device 400 convenient for VR Video coding See content described in above-mentioned Fig. 1 to embodiment illustrated in fig. 5, details are not described herein again.

In addition, schematic diagram can be as shown in Figure 1, include mutual the embodiment of the invention also provides a kind of electronic equipment Memory 110, the processor 120 of connection, the memory 110 is interior to store computer program, when the computer program is by institute When stating the execution of processor 120, so that the electronic equipment 100 executes provided by embodiment any one of of the invention convenient for VR view The method of frequency coding.

In addition, the embodiment of the invention also provides a kind of computer readable storage medium, in the computer-readable storage medium Computer program is stored in matter, when the computer program is run on computers, so that the computer executes this hair It is convenient for the method for VR Video coding provided by any one of bright embodiment.

In addition, the embodiment of the invention also provides a kind of computer program, the computer program can store beyond the clouds or On the storage medium of person local, when the computer program is run on computers, so that the computer executes the present invention It is convenient for the method for VR Video coding provided by any one embodiment.

In conclusion proposition of the embodiment of the present invention is situated between convenient for the method, apparatus of VR Video coding, electronic equipment and storage VR video image is included first that the first image and the second image are divided into after getting VR video image to be encoded by matter The block of pixels of default ranks number；Then for each of the first image block of pixels, in second image really A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute The similarity difference value stated between adapter block is minimum；The distance is then based on to be calculated with described apart from corresponding block of pixels Depth information；Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR Video image is at least divided into two groups, then in subsequent progress VR Video coding, it is right compared with the video image of low quality group The video image of high quality group distributes relatively more code rates, so as to improve compression efficiency, saves bandwidth.

In several embodiments provided herein, it should be understood that disclosed device and method can also pass through Other modes are realized.The apparatus embodiments described above are merely exemplary, for example, flow chart and block diagram in attached drawing Show the device of multiple embodiments according to the present invention, the architectural framework in the cards of method and computer program product, Function and operation.In this regard, each box in flowchart or block diagram can represent the one of a module, section or code Part, a part of the module, section or code, which includes that one or more is for implementing the specified logical function, to be held Row instruction.It should also be noted that function marked in the box can also be to be different from some implementations as replacement The sequence marked in attached drawing occurs.For example, two continuous boxes can actually be basically executed in parallel, they are sometimes It can execute in the opposite order, this depends on the function involved.It is also noted that every in block diagram and or flow chart The combination of box in a box and block diagram and or flow chart can use the dedicated base for executing defined function or movement It realizes, or can realize using a combination of dedicated hardware and computer instructions in the system of hardware.

In addition, each functional module in each embodiment of the present invention can integrate one independent portion of formation together Point, it is also possible to modules individualism, an independent part can also be integrated to form with two or more modules.

It, can be with if the function is realized and when sold or used as an independent product in the form of software function module It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a People's computer, server or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention. And storage medium above-mentioned includes: that USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), arbitrary access are deposited The various media that can store program code such as reservoir (RAM, Random Access Memory), magnetic or disk.It needs Illustrate, herein, relational terms such as first and second and the like be used merely to by an entity or operation with Another entity or operation distinguish, and without necessarily requiring or implying between these entities or operation, there are any this realities The relationship or sequence on border.Moreover, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device. In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element Process, method, article or equipment in there is also other identical elements.

The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.It should also be noted that similar label and letter exist Similar terms are indicated in following attached drawing, therefore, once being defined in a certain Xiang Yi attached drawing, are then not required in subsequent attached drawing It is further defined and explained.

The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain Lid is within protection scope of the present invention.Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. a kind of method convenient for VR Video coding, which is characterized in that the described method includes:

VR video image to be encoded is obtained, the VR video image includes the first image and the second image；

The first image and second image are divided into the block of pixels of default ranks number；

For each of the first image block of pixels, determine a block of pixels as suitable in second image With block, the block of pixels is calculated the distance between to the adapter block, wherein the similarity between the block of pixels and the adapter block Difference value is minimum；

Based on the distance, it is calculated and the depth information apart from corresponding block of pixels；

Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR video image At least it is divided into two groups.

2. the method according to claim 1, wherein being drawn by the first image and second image It is divided into before the block of pixels of preset quantity, the method also includes:

The first image and second image are subjected to binary conversion treatment respectively, obtained corresponding with the first image First gradient image and the second gradient image corresponding with second image.

3. the method according to claim 1, wherein the default ranks number is 8 × 8, for first figure Each of the picture block of pixels, determines a block of pixels as adapter block in second image, comprising:

Formula is based on using the block of pixels as source pixel block for each of the first image block of pixelsCalculate the similarity difference between each block of pixels in the block of pixels and second image Value, wherein SAD is the similarity difference value, S_ijFor the pixel that the i-th row jth in the block of pixels arranges, d_ijFor second figure The pixel that the i-th row jth arranges in some block of pixels as in；

When searching a block of pixels in second image and making the SAD minimum, determine in second image The block of pixels is purpose block of pixels, and the purpose block of pixels is the corresponding adapter block of the source pixel block.

4. according to the method described in claim 3, it is characterized in that, the method also includes:

It, will be adjacent with the source pixel block when searching multiple block of pixels in second image and making the SAD minimum Multiple block of pixels be determined as field block of pixels；

The band of position for obtaining the corresponding adapter block of each block of pixels included by the field block of pixels, determines that adapter block goes out The existing highest region of frequency is as target area；

It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source The corresponding adapter block of block of pixels.

5. the method according to claim 1, wherein being calculated with described based on the distance apart from corresponding The depth information of block of pixels, comprising:

Based on formulaThe depth information is calculated, wherein Z is the depth information, and a and b are constant, mv For the distance.

6. a kind of device convenient for VR Video coding, which is characterized in that described device includes:

Module is obtained, for obtaining VR video image to be encoded, the VR video image includes the first image and the second figure Picture；

Division module, for the first image and second image to be divided into the block of pixels of default ranks number；

Computing module, for determining one in second image for each of the first image block of pixels A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block Similarity difference value between block is minimum；

The computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels；

Grouping module, for according to the depth information, each block of pixels being grouped before being encoded, so that The VR video image is at least divided into two groups.

7. device according to claim 6, which is characterized in that described device further include:

Preprocessing module obtains and institute for the first image and second image to be carried out binary conversion treatment respectively State the corresponding first gradient image of the first image and the second gradient image corresponding with second image.

8. device according to claim 6, which is characterized in that the computing module, for in the first image Each of the block of pixels be based on formula using the block of pixels as source pixel blockCalculate the picture The similarity difference value between each block of pixels in plain block and second image, wherein SAD is the similarity difference Value, S_ijFor the pixel that the i-th row jth in the block of pixels arranges, d_ijIt is arranged for the i-th row jth in some block of pixels in second image Pixel；

9. a kind of electronic equipment, which is characterized in that including memory interconnected, processor, storage meter in the memory Calculation machine program, when the computer program is executed by the processor, so that the electronic equipment perform claim requires in 1-5 Method described in any one.

10. a kind of computer readable storage medium, which is characterized in that be stored with computer in the computer readable storage medium Program, when the computer program is run on computers, so that the computer is executed as any one in claim 1-5 Method described in.