CN109451318A - Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium - Google Patents

Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium Download PDF

Info

Publication number
CN109451318A
CN109451318A CN201910022693.7A CN201910022693A CN109451318A CN 109451318 A CN109451318 A CN 109451318A CN 201910022693 A CN201910022693 A CN 201910022693A CN 109451318 A CN109451318 A CN 109451318A
Authority
CN
China
Prior art keywords
block
pixels
image
adapter
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910022693.7A
Other languages
Chinese (zh)
Other versions
CN109451318B (en
Inventor
鲍金龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201910022693.7A priority Critical patent/CN109451318B/en
Publication of CN109451318A publication Critical patent/CN109451318A/en
Application granted granted Critical
Publication of CN109451318B publication Critical patent/CN109451318B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention provides a kind of convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium, includes the block of pixels that the first image and the second image are divided into default ranks number by VR video image after getting VR video image to be encoded;Then for each of the first image block of pixels, determine that a block of pixels as adapter block, calculates the block of pixels and arrives the distance between adapter block in the second image, wherein the similarity difference value between the block of pixels and adapter block is minimum;Distance is then based on to be calculated and the depth information apart from corresponding block of pixels;Before being encoded, according to depth information, each block of pixels is grouped, so that VR video image is at least divided into two groups, so in subsequent progress VR Video coding, compared with the video image of low quality group, relatively more code rates can be distributed to the video image of high quality group, so as to improve compression efficiency, bandwidth is saved.

Description

Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium
Technical field
The present invention relates to field of video encoding, in particular to a kind of method, apparatus convenient for VR Video coding, electricity Sub- equipment and storage medium.
Background technique
In current Video Coding Scheme, mainly have following three according to the algorithm that video content does adaptive layered coding Class: image is done and divides, analysis of complexity is carried out to encoding block, picture material is done and divides or identifies.The master of above-mentioned algorithm Wanting problem is that calculation amount is generally excessive, and the segmentation and identification to image do not have real-time usually, so that very strong in real-time Above-mentioned algorithm can not be applied in net cast application.
The sports tournament of panorama VR is broadcast live, and the resolution ratio and frame rate index request to video are all very high.If directly adopted With common coding method, video stream bit rate is excessively high, can make the difficulty of network direct broadcasting greatly.
Summary of the invention
In view of this, the embodiment of the present invention is designed to provide a kind of method, apparatus convenient for VR Video coding, electronics Equipment and storage medium, to alleviate the above problem.
In a first aspect, the embodiment of the invention provides a kind of methods convenient for VR Video coding, which comprises obtain VR video image to be encoded, the VR video image include the first image and the second image;By the first image and Second image is divided into the block of pixels of default ranks number;For each of the first image block of pixels, A block of pixels is determined in second image as adapter block, calculates the block of pixels the distance between to the adapter block, Wherein, the similarity difference value between the block of pixels and the adapter block is minimum;Based on the distance be calculated with it is described away from Depth information from corresponding block of pixels;Before being encoded, according to the depth information, each block of pixels is divided Group, so that the VR video image is at least divided into two groups.
Second aspect, the embodiment of the invention provides a kind of devices convenient for VR Video coding, module are obtained, for obtaining VR video image to be encoded, the VR video image include the first image and the second image;Division module, being used for will be described First image and second image are divided into the block of pixels of default ranks number;Computing module, for being directed to described first Each of image block of pixels determines that a block of pixels as adapter block, calculates the pixel in second image Block is the distance between to the adapter block, wherein the similarity difference value between the block of pixels and the adapter block is minimum;It is described Computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels;Grouping module, For according to the depth information, each block of pixels being grouped before being encoded, so that the VR video image At least it is divided into two groups.
The third aspect, the embodiment of the present invention provide a kind of electronic equipment, including memory interconnected, processor, institute It states and stores computer program in memory, when the computer program is executed by the processor, so that the electronic equipment Execute method described in first aspect.
Fourth aspect, the embodiment of the present invention provide a kind of computer readable storage medium, the computer-readable storage medium Computer program is stored in matter, when the computer program is run on computers, so that the computer executes first Method described in aspect.
Compared with prior art, the method, apparatus convenient for VR Video coding of various embodiments of the present invention proposition, electronic equipment And the beneficial effect of storage medium is: including first the first figure by VR video image after getting VR video image to be encoded Picture and the second image are divided into the block of pixels of default ranks number;Then for each of the first image pixel Block determines a block of pixels as adapter block in second image, calculates the block of pixels between the adapter block Distance, wherein the similarity difference value between the block of pixels and the adapter block is minimum;The distance is then based on to be calculated With the depth information apart from corresponding block of pixels;Before being encoded, according to the depth information, by each pixel Block is grouped, so that the VR video image is at least divided into two groups, then in subsequent progress VR Video coding, with low-quality The video content of amount group is compared, and distributes relatively more code rates to the video content of high quality group, so as to improve compression effect Rate saves bandwidth.
To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate Appended attached drawing, is described in detail below.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.
Fig. 1 is the structural block diagram of electronic equipment provided in an embodiment of the present invention;
Fig. 2 is one of the flow chart of method convenient for VR Video coding that first embodiment of the invention provides;
Fig. 3 is schematic diagram of the block of pixels that provides of first embodiment of the invention to the distance between adapter block;
Fig. 4 is the schematic diagram for the field block of pixels that first embodiment of the invention provides;
Fig. 5 is the determination schematic diagram for the adapter block that first embodiment of the invention provides;
Fig. 6 is the calculating schematic diagram of constant a, b that first embodiment of the invention provides;
Fig. 7 is the structural block diagram for the device convenient for VR Video coding that second embodiment of the invention provides.
Specific embodiment
Below in conjunction with attached drawing in the embodiment of the present invention, technical solution in the embodiment of the present invention carries out clear, complete Ground description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Usually exist The component of the embodiment of the present invention described and illustrated in attached drawing can be arranged and be designed with a variety of different configurations herein.Cause This, is not intended to limit claimed invention to the detailed description of the embodiment of the present invention provided in the accompanying drawings below Range, but it is merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art are not doing Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.
It should also be noted that similar label and letter indicate similar terms in following attached drawing, therefore, once a certain Xiang Yi It is defined in a attached drawing, does not then need that it is further defined and explained in subsequent attached drawing.Meanwhile of the invention In description, term " first ", " second " etc. are only used for distinguishing description, are not understood to indicate or imply relative importance.
As shown in Figure 1, being the block diagram of electronic equipment 100.The electronic equipment 100 may include: convenient for VR view Device, the memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound of frequency coding Frequency unit 160, display unit 170.Wherein, the electronic equipment 100 can be user terminal, such as PC (personal computer, PC), tablet computer, smart phone, personal digital assistant (personal digital Assistant, PDA) etc., it is also possible to server.
The memory 110, storage control 120, processor 130, Peripheral Interface 140, input-output unit 150, sound Frequency unit 160 and each element of display unit 170 are directly or indirectly electrically connected between each other, with realize data transmission or Interaction.It is electrically connected for example, these elements can be realized between each other by one or more communication bus or signal wire.It is described just It include that at least one can be stored in the memory in the form of software or firmware (firmware) in the device of VR Video coding In 110 or the software function module that is solidificated in the operating system (operating system, OS) of electronic equipment.The processing Device 130 for executing the executable module stored in memory 110, such as the device convenient for VR Video coding include it is soft Part functional module or computer program.
Wherein, memory 110 may be, but not limited to, random access memory (Random Access Memory, RAM), read-only memory (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM), Electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc.. Wherein, memory 110 is for storing program, and the processor 130 executes described program after receiving and executing instruction, aforementioned The method for the flow definition that any embodiment of the embodiment of the present invention discloses can be applied in processor 130, or by processor 130 realize.
Processor 130 may be a kind of IC chip, the processing capacity with signal.Above-mentioned processor 130 can To be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit (Network Processor, abbreviation NP) etc.;Can also be digital signal processor (DSP), specific integrated circuit (ASIC), Field programmable gate array (FPGA) either other programmable logic device, discrete gate or transistor logic, discrete hard Part component.It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present invention.General processor It can be microprocessor or the processor be also possible to any conventional processor etc..
Various input/output devices are couple processor 130 and memory 110 by the Peripheral Interface 140.Some In embodiment, Peripheral Interface 140, processor 130 and storage control 120 can be realized in one single chip.Other one In a little examples, they can be realized by independent chip respectively.
Input-output unit 150 is used to be supplied to the interaction that user input data realizes user and electronic equipment 100.It is described Input-output unit 150 may be, but not limited to, mouse and keyboard etc..
Audio unit 160 provides a user audio interface, may include one or more microphones, one or more raises Sound device and voicefrequency circuit.
Display unit 170 provides an interactive interface (such as user interface) between electronic equipment 100 and user Or it is referred to for display image data to user.In the present embodiment, the display unit 170 can be liquid crystal display or touching Control display.It can be the touching of the capacitance type touch control screen or resistance-type of support single-point and multi-point touch operation if touch control display Control screen etc..Single-point and multi-point touch operation is supported to refer to that touch control display can sense on the touch control display one or more The touch control operation generated simultaneously at a position, and the touch control operation that this is sensed transfers to processor 130 to be calculated and handled.
First embodiment
Referring to figure 2., Fig. 2 is a kind of process for method convenient for VR Video coding that first embodiment of the invention provides Figure, the method are applied to electronic equipment.Process shown in Fig. 2 will be described in detail below, which comprises
Step S110: obtaining VR video image to be encoded, and the VR video image includes the first image and the second figure Picture.
Since VR video image has left and right both of which, the VR video image that electronic equipment 100 is got can wrap Include left image and right image.Wherein, there are visual angle differences between left image and right image.
Optionally, the first image can be left image, correspondingly, second image is right image;Optionally, institute Stating the first image can be right image, correspondingly, second image is left image.
Step S120: the first image and second image are divided into the block of pixels of default ranks number.
Wherein, the default ranks number can arrange for 8 rows 8, i.e., the first image and second image are drawn It is divided into 8 × 8 block of pixels, the size of each block of pixels is identical.It certainly, as an alternative embodiment, can also be with Use the piecemeal principle of bigger (such as 9 × 9) or smaller (such as 6 × 6).
Certainly, as an alternative embodiment, being divided by the first image and second image It, can also be by the first image and second image point in order to improve computational efficiency before the block of pixels of preset quantity Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image Second gradient image.Wherein, image binaryzation processing is exactly to set 0 or 255 for the gray value of the pixel on image, also It is that whole image is showed to apparent black and white effect, wherein 0 indicates white, and 255 indicate black.By 256 brightness degrees Gray level image is obtained by selection threshold value appropriate still can reflect the whole binary image with local feature of image.Example Such as, the gray value that gray value is greater than or equal to the pixel of threshold value resets to 255, and gray value is less than to the pixel of preset threshold The gray value of point resets to 0.
The binaryzation of image is conducive to being further processed for image, becomes image simply, and data volume reduces, can not only be convex The profile of interested target is showed, and computation complexity can be reduced.
Wherein it is possible to be carried out at binaryzation by SOBEL or CANNEY algorithm to the first image and second image Reason, to obtain and the corresponding first gradient image of the first image and the second gradient map corresponding with second image Picture.
Step S130: for each of the first image block of pixels, one is determined in second image A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block Similarity difference value between block is minimum.
Since the relationship between the first image and the second image is to belong to the same VR video image, the first image Diversity factor is mainly as caused by the difference of visual angle between the second image.In this case, for every in the first image A block of pixels S can find an adapter block D corresponding with block of pixels S in the second image.Wherein, by block of pixels S When carrying out the calculating of similarity difference value with each block of pixels in the second image, adapter block D and block of pixels S as block of pixels S Between similarity difference value it is minimum.Make the SAD minimum when searching a block of pixels D in second image When, wherein SAD be the similarity difference value, determine block of pixels D for institute corresponding to the block of pixels S in the first image State adapter block, wherein S can be referred to as source pixel block, and D can be referred to as to be purpose block of pixels.
When default ranks number be 8 × 8, and carry out similarity difference value calculate when, optionally, for the first image Each of the block of pixels, using the block of pixels as source pixel block S, the dimension of the corresponding pixel block matrix of each block of pixels is 8 × 8, wherein the element value in the pixel block matrix is the pixel value of the pixel in the block of pixels, can be based on formulaCalculate the similarity difference between each block of pixels in the block of pixels and second image Value, wherein SAD is the similarity difference value, SijFor the pixel value for the pixel that the i-th row jth in the block of pixels arranges, dijFor The pixel value for the pixel that the i-th row jth arranges in some block of pixels in second image.Wherein, i, j are pictures in pixel block matrix The subscript of prime element.
Corresponding adapter block is being found for each of the first image block of pixels incorporated by reference to Fig. 3 Afterwards, the block of pixels (i.e. source pixel block S) and corresponding adapter block (i.e. purpose picture can be calculated by window search Plain block D) between moving displacement, i.e. distance mv.
In order to calculate mv, therefore, either source pixel block or purpose block of pixels, coordinate (x, y) are based respectively on pixel Block respectively where the upper left corner of 1/2 image be that origin determines, wherein the coordinate and mesh of the top left corner pixel point of source pixel block The difference of coordinate of block of pixels top left corner pixel point be exactly motion vector mv.Since the first image and the second image are left and right figures Picture, therefore, the water between the coordinate of the top left corner pixel point of the coordinate and purpose block of pixels of the top left corner pixel point of source pixel block Flat distance is motion vector mv.
Assuming that the coordinate of source pixel block is (x=8, y=6), S can be expressed as(8,16), the coordinate of purpose block of pixels is (x =25, y=16), D can be expressed as(25,16), then the numerical value of mv is exactly 25-8=17;It certainly is also likely to be negative, such as mesh The coordinate of block of pixels be (x=-2, y=16), then motion vector mv is exactly -10.
As another optional embodiment, it for the source pixel block S in the first image, is searched in second image It, therefore, will be with the source image in order to determine the adapter block of source pixel block S when rope makes the SAD minimum to multiple block of pixels Plain block S adjacent multiple block of pixels are determined as field block of pixels, wherein please refer to Fig. 4, field block of pixels can be the first image In with 8 block of pixels at the center S.
It is included every in the available field block of pixels of electronic equipment 100 after field block of pixels has been determined The band of position of the corresponding adapter block of a block of pixels, and count the position point of the adapter block of each block of pixels in the block of pixels of field Cloth, then using the highest region of the frequency of occurrences in position distribution as target area.Due to field block of pixels and source pixel block S it Between there are certain similarities, therefore, the adapter block D of source pixel block S also very likely appears in target area.It therefore, can be with It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source pixel The corresponding adapter block of block S.Please refer to Fig. 5, the frequency highest that A point occurs as adapter block in figure, therefore using A point as with source image The corresponding adapter block of plain block S.
Step S140: it is calculated and the depth information apart from corresponding block of pixels based on the distance.
Optionally, formula can be based onThe depth information is calculated, wherein depth information is shooting The distance between object and video camera, Z are the depth information, and a and b are constant, and mv is the source pixel block S and the adaptation The distance between block.
Incorporated by reference to Fig. 6,Derivation process can refer to following process:
In order to establish the relationship between depth information and mv, therefore, according to shooting the distance between object and video camera (d), the similitude of the parallax between right and left eyes image (mv) and triangle, it is available: d/dt=dv/mv, wherein dv For the distance between video camera and right and left eyes image, dt is interpupillary distance, since the distance between video camera and right and left eyes image are to become Amount is not constant, and therefore, it is necessary to eliminate the dv in d/dt=dv/mv.
Since camera focus is fk, according to camera imaging principle, the linear relationship of dv and d are established, it is then available: 1/ D+1/dv=1/fk please refers to following formula so that dv is indicated by d.
Dv=1/ (1/fk-1/d)
It is then available by bringing the expression formula of dv into d/dt=dv/mv:
D/dt=1/ ((1/fk-1/d) * mv), so that the expression formula is not influenced by dv.
It is then available by the way that d/dt=1/ ((1/fk-1/d) * mv) is made further deformation:
D*mv=dt/ (1/fk-1/d)
D*mv=dt/ ((d-fk)/(fk*d))
D*mv=dt*fk*d/ (d-fk)
(d-fk) * mv=dt*fk
Further, equation is obtained: d=dt*fk/mv+fk.
Since interpupillary distance and focal length are all constants, above formula can be further converted into:
Since there are two unknown parameters a and b in above formula, therefore, it is necessary to construct two equations to come simultaneous solution a and b, In, the two parameters can shoot image by two groups of material objects and obtain:
Wherein, d0 is in kind at a distance from video camera, the mv0 in first group of material object shooting image The motion vector that the left eye in kind in image is overlapped between image and eye image is shot for first group of material object, d1 is second group For material object in material object shooting image at a distance from video camera, mv1 is the left eye overlapping in kind in second group of material object shooting image Motion vector between image and eye image, d0, d1, mv0 and mv1 are that can be substituted by measuring obtained data Above-mentioned equation group can solve a, the numerical value of b.
Step S150: before being encoded, according to the depth information, each block of pixels being grouped, so that The VR video image is at least divided into two groups.
In video camera when shooting before object, when being encoded to image, need with the image distribution to some regions The image of high code rate, some regions distributes low bit- rate, so as to improve compression efficiency, saves bandwidth, does not have in subjective quality Under the premise of significant difference, code rate is saved, therefore, it is necessary to take region to video camera to divide, as a kind of embodiment party Four markers can artificially be arranged under shooting environmental, so that constant a and b be calculated, and obtain the depth of marker for formula Degree is according to the edge of window dividing value cut as depth, and the edge of window dividing value cut according to depth is to take region to video camera It is divided, judges whether the block of pixels belongs to according to the depth data of block of pixels and determined according to the edge of window dividing value that depth is cut Window area in, by be located at window area in block of pixels be divided into high quality group, by be located at window area outside block of pixels Be divided into low quality group, when being encoded to VR video image, based on the first code rate to belong to the block of pixels of high quality group into Row coding, encodes the block of pixels for belonging to low quality group based on the second code rate, wherein the value of the first code rate is greater than second code The value of rate.
Therefore, by the way that VR video image to be layered, in subsequent progress VR Video coding, it is based on above-mentioned Data Rate Distribution Mode is distributed relatively large number of code rate to the video image of high quality group, be can be improved compared with the video image of low quality group Compression efficiency, code rate can be saved under the premise of subjective quality does not have significant difference by saving bandwidth.
A kind of method convenient for VR Video coding that first embodiment of the invention provides, is getting VR video to be encoded It include first block of pixels that the first image and the second image are divided into default ranks number by VR video image after image;Then needle To each of the first image block of pixels, determined in second image block of pixels as adapter block, The block of pixels is calculated the distance between to the adapter block, wherein the similarity difference between the block of pixels and the adapter block Value is minimum;The distance is then based on to be calculated and the depth information apart from corresponding block of pixels;Before being encoded, According to the depth information, each block of pixels is grouped, so that the VR video image is at least divided into two groups, that In subsequent progress VR Video coding, compared with the video image of low quality group, phase is distributed to the video image of high quality group Bandwidth is saved so as to improve compression efficiency to more code rates.
Second embodiment
Fig. 7 is please referred to, Fig. 7 is a kind of knot for device 400 convenient for VR Video coding that second embodiment of the invention provides Structure block diagram.Structural block diagram shown in Fig. 7 will be illustrated below, shown device includes:
Obtain module 410, for obtaining VR video image to be encoded, the VR video image include the first image and Second image;
Division module 420, for the first image and second image to be divided into the picture of default ranks number Plain block;
Computing module 430, for being directed to each of the first image block of pixels, in second image really A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute The similarity difference value stated between adapter block is minimum;
The computing module 430 is also used to be calculated based on the distance and the depth apart from corresponding block of pixels Spend information;
Grouping module 440, for according to the depth information, each block of pixels being divided before being encoded Group, so that the VR video image is at least divided into two groups.
Optionally, described device further include: preprocessing module, for the first image and second image to be divided Not carry out binary conversion treatment, obtain and the corresponding first gradient image of the first image and corresponding with second image Second gradient image.
Optionally, the computing module 430, for being based on public affairs for each of the first image block of pixels FormulaThe similarity calculated between each block of pixels in the block of pixels and second image is poor Different value, wherein SAD is the similarity difference value, SijFor the block of pixels that the i-th row jth in the first image arranges, dijFor institute State the block of pixels that the i-th row jth arranges in the second image;Make the SAD when searching a block of pixels in second image When minimum, determine that the block of pixels is and the SijThe corresponding adapter block.
Optionally, the computing module 430, multiple block of pixels ought be searched in second image by, which being also used to, makes institute It, will be with the S when stating SAD minimumijAdjacent multiple block of pixels are determined as field block of pixels;The field block of pixels is obtained to be wrapped The band of position of the corresponding adapter block of each block of pixels included determines the highest region of the adapter block frequency of occurrences as target area Domain;It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the Sij Corresponding adapter block.
Optionally, the computing module 430 is also used to based on formulaThe depth information is calculated, Wherein, Z is the depth information, and a and b are constant, and mv is the distance.
The present embodiment please join the process of the respective function of each Implement of Function Module of the device 400 convenient for VR Video coding See content described in above-mentioned Fig. 1 to embodiment illustrated in fig. 5, details are not described herein again.
In addition, schematic diagram can be as shown in Figure 1, include mutual the embodiment of the invention also provides a kind of electronic equipment Memory 110, the processor 120 of connection, the memory 110 is interior to store computer program, when the computer program is by institute When stating the execution of processor 120, so that the electronic equipment 100 executes provided by embodiment any one of of the invention convenient for VR view The method of frequency coding.
In addition, the embodiment of the invention also provides a kind of computer readable storage medium, in the computer-readable storage medium Computer program is stored in matter, when the computer program is run on computers, so that the computer executes this hair It is convenient for the method for VR Video coding provided by any one of bright embodiment.
In addition, the embodiment of the invention also provides a kind of computer program, the computer program can store beyond the clouds or On the storage medium of person local, when the computer program is run on computers, so that the computer executes the present invention It is convenient for the method for VR Video coding provided by any one embodiment.
In conclusion proposition of the embodiment of the present invention is situated between convenient for the method, apparatus of VR Video coding, electronic equipment and storage VR video image is included first that the first image and the second image are divided into after getting VR video image to be encoded by matter The block of pixels of default ranks number;Then for each of the first image block of pixels, in second image really A block of pixels is made as adapter block, calculates the block of pixels the distance between to the adapter block, wherein the block of pixels and institute The similarity difference value stated between adapter block is minimum;The distance is then based on to be calculated with described apart from corresponding block of pixels Depth information;Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR Video image is at least divided into two groups, then in subsequent progress VR Video coding, it is right compared with the video image of low quality group The video image of high quality group distributes relatively more code rates, so as to improve compression efficiency, saves bandwidth.
In several embodiments provided herein, it should be understood that disclosed device and method can also pass through Other modes are realized.The apparatus embodiments described above are merely exemplary, for example, flow chart and block diagram in attached drawing Show the device of multiple embodiments according to the present invention, the architectural framework in the cards of method and computer program product, Function and operation.In this regard, each box in flowchart or block diagram can represent the one of a module, section or code Part, a part of the module, section or code, which includes that one or more is for implementing the specified logical function, to be held Row instruction.It should also be noted that function marked in the box can also be to be different from some implementations as replacement The sequence marked in attached drawing occurs.For example, two continuous boxes can actually be basically executed in parallel, they are sometimes It can execute in the opposite order, this depends on the function involved.It is also noted that every in block diagram and or flow chart The combination of box in a box and block diagram and or flow chart can use the dedicated base for executing defined function or movement It realizes, or can realize using a combination of dedicated hardware and computer instructions in the system of hardware.
In addition, each functional module in each embodiment of the present invention can integrate one independent portion of formation together Point, it is also possible to modules individualism, an independent part can also be integrated to form with two or more modules.
It, can be with if the function is realized and when sold or used as an independent product in the form of software function module It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a People's computer, server or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention. And storage medium above-mentioned includes: that USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), arbitrary access are deposited The various media that can store program code such as reservoir (RAM, Random Access Memory), magnetic or disk.It needs Illustrate, herein, relational terms such as first and second and the like be used merely to by an entity or operation with Another entity or operation distinguish, and without necessarily requiring or implying between these entities or operation, there are any this realities The relationship or sequence on border.Moreover, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device. In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element Process, method, article or equipment in there is also other identical elements.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.It should also be noted that similar label and letter exist Similar terms are indicated in following attached drawing, therefore, once being defined in a certain Xiang Yi attached drawing, are then not required in subsequent attached drawing It is further defined and explained.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain Lid is within protection scope of the present invention.Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. a kind of method convenient for VR Video coding, which is characterized in that the described method includes:
VR video image to be encoded is obtained, the VR video image includes the first image and the second image;
The first image and second image are divided into the block of pixels of default ranks number;
For each of the first image block of pixels, determine a block of pixels as suitable in second image With block, the block of pixels is calculated the distance between to the adapter block, wherein the similarity between the block of pixels and the adapter block Difference value is minimum;
Based on the distance, it is calculated and the depth information apart from corresponding block of pixels;
Before being encoded, according to the depth information, each block of pixels is grouped, so that the VR video image At least it is divided into two groups.
2. the method according to claim 1, wherein being drawn by the first image and second image It is divided into before the block of pixels of preset quantity, the method also includes:
The first image and second image are subjected to binary conversion treatment respectively, obtained corresponding with the first image First gradient image and the second gradient image corresponding with second image.
3. the method according to claim 1, wherein the default ranks number is 8 × 8, for first figure Each of the picture block of pixels, determines a block of pixels as adapter block in second image, comprising:
Formula is based on using the block of pixels as source pixel block for each of the first image block of pixelsCalculate the similarity difference between each block of pixels in the block of pixels and second image Value, wherein SAD is the similarity difference value, SijFor the pixel that the i-th row jth in the block of pixels arranges, dijFor second figure The pixel that the i-th row jth arranges in some block of pixels as in;
When searching a block of pixels in second image and making the SAD minimum, determine in second image The block of pixels is purpose block of pixels, and the purpose block of pixels is the corresponding adapter block of the source pixel block.
4. according to the method described in claim 3, it is characterized in that, the method also includes:
It, will be adjacent with the source pixel block when searching multiple block of pixels in second image and making the SAD minimum Multiple block of pixels be determined as field block of pixels;
The band of position for obtaining the corresponding adapter block of each block of pixels included by the field block of pixels, determines that adapter block goes out The existing highest region of frequency is as target area;
It is determined as the block of pixels for belonging to the target area in the smallest the multiple block of pixels of the SAD and the source The corresponding adapter block of block of pixels.
5. the method according to claim 1, wherein being calculated with described based on the distance apart from corresponding The depth information of block of pixels, comprising:
Based on formulaThe depth information is calculated, wherein Z is the depth information, and a and b are constant, mv For the distance.
6. a kind of device convenient for VR Video coding, which is characterized in that described device includes:
Module is obtained, for obtaining VR video image to be encoded, the VR video image includes the first image and the second figure Picture;
Division module, for the first image and second image to be divided into the block of pixels of default ranks number;
Computing module, for determining one in second image for each of the first image block of pixels A block of pixels calculates the block of pixels the distance between to the adapter block, wherein the block of pixels is adapted to described as adapter block Similarity difference value between block is minimum;
The computing module is also used to be calculated based on the distance and the depth information apart from corresponding block of pixels;
Grouping module, for according to the depth information, each block of pixels being grouped before being encoded, so that The VR video image is at least divided into two groups.
7. device according to claim 6, which is characterized in that described device further include:
Preprocessing module obtains and institute for the first image and second image to be carried out binary conversion treatment respectively State the corresponding first gradient image of the first image and the second gradient image corresponding with second image.
8. device according to claim 6, which is characterized in that the computing module, for in the first image Each of the block of pixels be based on formula using the block of pixels as source pixel blockCalculate the picture The similarity difference value between each block of pixels in plain block and second image, wherein SAD is the similarity difference Value, SijFor the pixel that the i-th row jth in the block of pixels arranges, dijIt is arranged for the i-th row jth in some block of pixels in second image Pixel;
When searching a block of pixels in second image and making the SAD minimum, determine in second image The block of pixels is purpose block of pixels, and the purpose block of pixels is the corresponding adapter block of the source pixel block.
9. a kind of electronic equipment, which is characterized in that including memory interconnected, processor, storage meter in the memory Calculation machine program, when the computer program is executed by the processor, so that the electronic equipment perform claim requires in 1-5 Method described in any one.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer in the computer readable storage medium Program, when the computer program is run on computers, so that the computer is executed as any one in claim 1-5 Method described in.
CN201910022693.7A 2019-01-09 2019-01-09 Method, apparatus, electronic device and storage medium for facilitating VR video encoding Active CN109451318B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910022693.7A CN109451318B (en) 2019-01-09 2019-01-09 Method, apparatus, electronic device and storage medium for facilitating VR video encoding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910022693.7A CN109451318B (en) 2019-01-09 2019-01-09 Method, apparatus, electronic device and storage medium for facilitating VR video encoding

Publications (2)

Publication Number Publication Date
CN109451318A true CN109451318A (en) 2019-03-08
CN109451318B CN109451318B (en) 2022-11-01

Family

ID=65543945

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910022693.7A Active CN109451318B (en) 2019-01-09 2019-01-09 Method, apparatus, electronic device and storage medium for facilitating VR video encoding

Country Status (1)

Country Link
CN (1) CN109451318B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111954085A (en) * 2020-08-06 2020-11-17 咪咕文化科技有限公司 VR video display method, device, network equipment and storage medium
CN114786037A (en) * 2022-03-17 2022-07-22 青岛虚拟现实研究院有限公司 Self-adaptive coding compression method facing VR projection

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101965733A (en) * 2008-03-09 2011-02-02 Lg电子株式会社 Be used to encode or the method and apparatus of decoded video signal
CN104427345A (en) * 2013-09-11 2015-03-18 华为技术有限公司 Motion vector acquisition method, acquisition device, video codec and method thereof
CN104702954A (en) * 2013-12-05 2015-06-10 华为技术有限公司 Video coding method and device
WO2015200820A1 (en) * 2014-06-26 2015-12-30 Huawei Technologies Co., Ltd. Method and device for providing depth based block partitioning in high efficiency video coding
CN102970529B (en) * 2012-10-22 2016-02-17 北京航空航天大学 A kind of object-based multi-view point video fractal image compression & decompression method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101965733A (en) * 2008-03-09 2011-02-02 Lg电子株式会社 Be used to encode or the method and apparatus of decoded video signal
CN102970529B (en) * 2012-10-22 2016-02-17 北京航空航天大学 A kind of object-based multi-view point video fractal image compression & decompression method
CN104427345A (en) * 2013-09-11 2015-03-18 华为技术有限公司 Motion vector acquisition method, acquisition device, video codec and method thereof
CN104702954A (en) * 2013-12-05 2015-06-10 华为技术有限公司 Video coding method and device
WO2015200820A1 (en) * 2014-06-26 2015-12-30 Huawei Technologies Co., Ltd. Method and device for providing depth based block partitioning in high efficiency video coding

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111954085A (en) * 2020-08-06 2020-11-17 咪咕文化科技有限公司 VR video display method, device, network equipment and storage medium
CN114786037A (en) * 2022-03-17 2022-07-22 青岛虚拟现实研究院有限公司 Self-adaptive coding compression method facing VR projection
CN114786037B (en) * 2022-03-17 2024-04-12 青岛虚拟现实研究院有限公司 VR projection-oriented adaptive coding compression method

Also Published As

Publication number Publication date
CN109451318B (en) 2022-11-01

Similar Documents

Publication Publication Date Title
Liu et al. PQA-Net: Deep no reference point cloud quality assessment via multi-view projection
US10198623B2 (en) Three-dimensional facial recognition method and system
CN110458805B (en) Plane detection method, computing device and circuit system
Ju et al. Depth-aware salient object detection using anisotropic center-surround difference
Zhou et al. Omnidirectional image quality assessment by distortion discrimination assisted multi-stream network
US9704066B2 (en) Multi-stage image classification
US20180018503A1 (en) Method, terminal, and storage medium for tracking facial critical area
CN104574342B (en) The noise recognizing method and Noise Identification device of parallax depth image
US10848746B2 (en) Apparatus including multiple cameras and image processing method
CN111340866A (en) Depth image generation method, device and storage medium
Tu et al. V-PCC projection based blind point cloud quality assessment for compression distortion
WO2017095543A1 (en) Object detection with adaptive channel features
CN114627244A (en) Three-dimensional reconstruction method and device, electronic equipment and computer readable medium
CN109451318A (en) Convenient for the method, apparatus of VR Video coding, electronic equipment and storage medium
CN113139540A (en) Backboard detection method and equipment
CN113920023B (en) Image processing method and device, computer readable medium and electronic equipment
CN111291611A (en) Pedestrian re-identification method and device based on Bayesian query expansion
Wang et al. Salient video object detection using a virtual border and guided filter
Wang et al. Deep intensity guidance based compression artifacts reduction for depth map
Li et al. Graph-based saliency fusion with superpixel-level belief propagation for 3D fixation prediction
WO2023273515A1 (en) Target detection method, apparatus, electronic device and storage medium
Yang et al. User models of subjective image quality assessment on virtual viewpoint in free-viewpoint video system
Tsai et al. A novel method for 2D-to-3D video conversion based on boundary information
CN110264431A (en) Video beautification method, device and electronic equipment
CN116506627A (en) Encoding method and device for searching by constructing hash table by multi-feature hash value

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant