CN109561311A - A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ - Google Patents

A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ Download PDF

Info

Publication number
CN109561311A
CN109561311A CN201811491526.9A CN201811491526A CN109561311A CN 109561311 A CN109561311 A CN 109561311A CN 201811491526 A CN201811491526 A CN 201811491526A CN 109561311 A CN109561311 A CN 109561311A
Authority
CN
China
Prior art keywords
bit
image group
viewpoint
frame
rate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811491526.9A
Other languages
Chinese (zh)
Inventor
严涛
黄金火
闻辉
徐航
陈建辉
车艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Putian University
Original Assignee
Putian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Putian University filed Critical Putian University
Priority to CN201811491526.9A priority Critical patent/CN109561311A/en
Publication of CN109561311A publication Critical patent/CN109561311A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention belongs to 3 d video encoding technical field, in particular to a kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ.A kind of 3 d video encoding bit rate control method based on the domain ρ, comprising steps of establishing the domain ρ code rate model;Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;Frame-layer, basic unit layer bit-allocation and rate-control are carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame.By establishing the domain ρ code rate model, and on the basis of being encoded based on high efficiency viewpoint video, bit distributes and (that is: gives each viewpoint to carry out reasonable bit distribution in turn using correlation analysis between viewpoint) between carrying out viewpoint according to the image similarity between viewpoint, and frame-layer, basic unit layer bit-allocation and rate-control are carried out according to the activity time domain complexity of frame per second, actual buffering area size and the frame, overcome the limitation of target bit allocation in the prior art.

Description

A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ
Technical field
The invention belongs to 3 d video encoding technical field, in particular to a kind of 3 d video encoding code rate based on the domain ρ Control method and storage equipment.
Background technique
Code rate control is all one of very important technology in video encoding standard all the time, in the world all video pressures Contracting standard leave code rate control its application can all be restricted.In the world common video compression standard such as MPEG-4, H.264, HEVC (High efficiency video coding), MVC (Multi-view video coding) etc. have provided standard code Rate Controlling model, and its rate control techniques is highly developed.And at present announcement newest in the world 3D video encoding standard it One multiple views HD video encodes (Multi-view high efficiency video coding, MV-HEVC), not yet gives Effective rate control algorithm out.
Currently, rate control techniques are less before the research for being engaged in the control of MV-HEVC code rate both at home and abroad is opposite, largely grind Study carefully the correlative study for being engaged in the control of the code rates such as MVC, HEVC, H264.U.S. Lim etc. proposes the MVC code based on binomial model All video frames are divided into several difference using the geometrical relationship of parallax prediction and motion prediction by rate control algolithm, the algorithm Type of coding frame, however the parallax prediction characteristic between each viewpoint has biggish difference.Though South Korea Seanae Park et al. Time encoding layer uses the influence of hierarchical B-frame in right consideration multiple view video coding, the experimental results showed that this algorithm is also able to maintain Efficient code efficiency, but the experimental result average bit rate control error that cycle tests provides is difficult to meet practical 1% or more The demand of application.Lei J et al. establishes the multiple view video coding rate control algorithm of R- λ model, and experimental result is ideal, But correlation between viewpoint is not accounted for.Vizzotto B B et al. only considers in the rate control algorithm of stereo scopic video coding The case where two viewpoints, and since this algorithm is in multiple view video coding, it is such with increasing for coded image type of coding The accuracy rate of target bit distribution based on TM5 can be deteriorated, and cause code rate control error very high, and bit distribution is difficult to control System.
Also many domestic and foreign scholars are engaged in the research of the multiple view video coding code rate control based on HEVC.Shao F etc. People proposes to use fixed allocation proportion to distribute code rate between texture depth, but the method can not be such that different sequences all obtains most Good code efficiency.Xiao et al. proposes the gradable Data Rate Distribution algorithm for being applied to different bandwidth.Fang et al. proposes one The analysis model that distortion is virtually regarded in kind estimation 3D video, is lost using the analysis method estimating depth graph code of association frequency domain time domain Virtual view distortion caused by very.Although the estimation model is accurate, complexity is higher.
Pan G et al. proposes the 3D-HEVC rate control algorithm based on depth, using fixed colored and depth code rate ratio Example is 4:1, but does so and cannot obtain optimal virtual viewpoint rendering quality.Xiao J M et al. proposes depth and texture point Level bit-rate control algolithm.Wang X et al. proposes the 3D-HEVC rate control algorithm based on binomial R-D model, due to straight It connects relatively low using H.264 middle code rate Controlling model, rate control accuracy.
Summary of the invention
For this reason, it may be necessary to a kind of multistage code rate control method of MV-HEVC multiple view video coding be provided, to solve tradition The problem of code rate Controlling model and high efficiency multiple view video coding bit distribution inaccuracy.
Specific technical solution is as follows:
A kind of 3 d video encoding bit rate control method based on the domain ρ, comprising steps of
Establish the domain ρ code rate model;
Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;
Frame is carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame Layer, basic unit layer bit-allocation and rate-control.
Further, described " establishing the domain ρ code rate model ", further comprises the steps of:
Using multiple regression technology, model parameter is calculated.
Further, described " using multiple regression technology, model parameter is calculated ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123It is returned for unitary Return coefficient;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11, k21,r1),(k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
It is further, described that " bit distributes between carrying out viewpoint according to the image similarity between viewpoint;According to frame per second, target The activity time domain complexity of buffer pool size, actual buffering area size and the frame carries out frame-layer, the distribution of basic unit layer bit Controlled with code rate ", it further comprises the steps of:
Step 1: bit-allocation and rate-control being carried out to multi-view image group, current multi-view image group is calculated and answers The bit number being assigned to;
Step 2: bit-allocation and rate-control being carried out to single viewpoint image group, each one-view image group is calculated Weight factor Wk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step 3: bit-allocation and rate-control being carried out to frame-layer in single viewpoint image group, current encoded frame is calculated Target bit;
Step 4: bit-allocation and rate-control being carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, is then led to Cross the quantization parameter Q that ρ value further calculates current macromb
Step 5: according to: quantization parameter QmbCurrent macro is encoded;
Step 6: judging whether macro block all in present frame encodes, if encoding, go to step 7;If not encoding, It then repeats to go to step 4 and then goes to step 7 until encoding to step 5;
Step 7: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step 8; If not encoding, repeat to go to step 3 to 6, until compiling all frames of present image group;
Step 8: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if all Coding is completed, then goes to step 9;If completing without all codings, previous step 2 to 7 is repeated, until current multi-view image group In all single viewpoint image groups all coding is completed;
Step 9: judge whether current multi-view image group is the last one multi-view image group of entire multiview sequence, if It is the last one multi-view image group, then encodes completion, otherwise, repeats previous step 1 to step 8, until entire multiple views sequence Column coding is completed.
Further, the step 2, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then The viewpoint of K distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewPresentation code viewpoint Number, S (Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
In order to solve the above technical problems, additionally providing a kind of storage equipment, specific technical solution is as follows;
A kind of storage equipment, wherein being stored with instruction set, described instruction collection is for executing:
Establish the domain ρ code rate model;
Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;
Frame is carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame Layer, basic unit layer bit-allocation and rate-control.
Further, described instruction collection is also used to execute:
Described " establishing the domain ρ code rate model ", further comprises the steps of:
Using multiple regression technology, model parameter is calculated.
Further, described instruction collection is also used to execute:
Described " using multiple regression technology, model parameter is calculated ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123It is returned for unitary Return coefficient;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11, k21,r1),(k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
Further, described instruction collection is also used to execute:
It is described that " bit distributes between carrying out viewpoint according to the image similarity between viewpoint;Held according to frame per second, destination buffer The activity time domain complexity of amount, actual buffering area size and the frame carries out frame-layer, the distribution of basic unit layer bit and code rate control System ", further comprises the steps of:
Step 1: bit-allocation and rate-control being carried out to multi-view image group, current multi-view image group is calculated and answers The bit number being assigned to;
Step 2: bit-allocation and rate-control being carried out to single viewpoint image group, each one-view image group is calculated Weight factor Wk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step 3: bit-allocation and rate-control being carried out to frame-layer in single viewpoint image group, current encoded frame is calculated Target bit;
Step 4: bit-allocation and rate-control being carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, is then led to Cross the quantization parameter Q that ρ value further calculates current macromb
Step 5: according to: quantization parameter QmbCurrent macro is encoded;
Step 6: judging whether macro block all in present frame encodes, if encoding, go to step 7;If not encoding, It then repeats to go to step 4 and then goes to step 7 until encoding to step 5;
Step 7: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step 8; If not encoding, repeat to go to step 3 to 6, until compiling all frames of present image group;
Step 8: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if all Coding is completed, then goes to step 9;If completing without all codings, previous step 2 to 7 is repeated, until current multi-view image group In all single viewpoint image groups all coding is completed;
Step 9: judge whether current multi-view image group is the last one multi-view image group of entire multiview sequence, if It is the last one multi-view image group, then encodes completion, otherwise, repeats previous step 1 to step 8, until entire multiple views sequence Column coding is completed.
Further, described instruction collection is also used to execute:
The step 2, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then The viewpoint of K distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewPresentation code viewpoint Number, S (Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
The beneficial effects of the present invention are: by establishing the domain ρ code rate model, and the basis based on high efficiency viewpoint video coding On, it is distributed and (that is: is given in turn using correlation analysis between viewpoint each according to bit between the image similarity progress viewpoint between viewpoint Viewpoint carries out reasonable bit distribution), it is negative to frame target bit allocation bring to effectively prevent buffer area " liquid level " fluctuation It influences, and frame is carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame Layer, basic unit layer bit-allocation and rate-control, overcome the limitation of target bit allocation in the prior art, can be effectively The code rate of multiple view video coding is controlled, rate control accuracy reaches 99% or more, and Y-PSNR averagely improves 0.38dB More than.
Detailed description of the invention
Fig. 1 is a kind of flow chart of the 3 d video encoding bit rate control method based on the domain ρ described in specific embodiment;
Fig. 2 is the process for carrying out bit distribution between viewpoint described in specific embodiment according to the image similarity between viewpoint Figure;
Fig. 3 is the module diagram that equipment is stored described in specific embodiment.
Description of symbols:
300, equipment is stored.
Specific embodiment
Technology contents, construction feature, the objects and the effects for detailed description technical solution, below in conjunction with specific reality It applies example and attached drawing is cooperated to be explained in detail.
Fig. 1 to Fig. 2 is please referred to, in the present embodiment, a kind of 3 d video encoding bit rate control method based on the domain ρ can Using on a storage device, the storage equipment includes but is not limited to: personal computer, server, general purpose computer, dedicated meter Calculation machine, the network equipment, embedded device, programmable device, intelligent mobile terminal etc..Specific embodiment is as follows:
Explanation explained below is done to some nouns in present embodiment first:
Code rate control: a kind of optimization algorithm of coding, for realizing the control to video code flow size;
Video code rate: the data bits of unit time transmission when data are transmitted;
Frame per second: continuously there is frequency over the display in the bitmap images for being known as frame;
MV-HEVC (Multiview high efficiency video coding): high efficiency multiple view video coding;
HEVC (High efficiency video coding): high efficiency viewpoint video coding;
MV-GOP (Multi-View of Group of picture): multi-view image group;
SV-GOP (Signal-View of Group of picture): single viewpoint image group.
Specific embodiment is as follows:
Step S101: the domain ρ code rate model is established.In the present embodiment, described " establishing the domain ρ code rate model " further includes Step: multiple regression technology is used, model parameter is calculated.It is described " to use multiple regression technology, model ginseng is calculated Number ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3 (1)
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123It is returned for unitary Return coefficient;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ) (3)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11, k21,r1),(k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
Step S102: bit distributes between carrying out viewpoint according to the image similarity between viewpoint.
Step S103: complicated according to the activity time domain of frame per second, destination buffer capacity, actual buffering area size and the frame Degree carries out frame-layer, basic unit layer bit-allocation and rate-control.
It should be noted that bit is difficult to distribute between high efficiency multiple view video coding code rate controlling difficulties are viewpoint, Can not reasonable bit distribution accurately be carried out to each viewpoint.Therefore according to the image phase between viewpoint in step S102 and step S103 Like property and encoded information to the different reasonable bit distribution of viewpoint.In the present embodiment, with weight factor WkIndicating should Shared ratio, W in all viewpoints of viewpoint kkShow that the viewpoint is main viewpoint more greatly, it, should as other view reference viewpoints Viewpoint coding benefit is directly related to the result of entire Video coding, it is therefore desirable to distribute more bit.
Referring to Fig. 2, step S102 and step S103 are specifically further comprised the steps of:
Step S201: bit-allocation and rate-control is carried out to multi-view image group, current multi-view image is calculated The bit number that group should be assigned to;
Step S202: bit-allocation and rate-control is carried out to single viewpoint image group, each haplopia point diagram is calculated As the weight factor W of groupk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step S203: bit-allocation and rate-control is carried out to frame-layer in single viewpoint image group, current volume is calculated Code frame target bit;
Step S204: bit-allocation and rate-control is carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, then The quantization parameter Q of current macro is further calculated by ρ valuemb
Step S205: according to quantization parameter QmbCurrent macro is encoded;
Step S206: judging whether macro block all in present frame encodes, if encoding, goes to step S207;If not It encodes, then repeats to go to step S204 to step S205 and then go to step S207 until encoding;
Step S207: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step S208;If not encoding, repeat to go to step S203 to S206, until compiling all frames of present image group;
Step S208: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if All coding is completed, then goes to step S209;If completing without all codings, previous step S202 to S207 is repeated, until working as All single viewpoint image groups all complete by coding in preceding multi-view image group;
Step S209: judge whether current multi-view image group is the last one multi-view image of entire multiview sequence Otherwise group, repeats previous step S201 to step S208 if the last one multi-view image group, then encode completion, until whole A multiview sequence coding is completed.
The wherein step S202, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then The viewpoint of K distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk (6)
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewPresentation code viewpoint Number, S (Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
Wherein,The respectively feature vector of two sub-pictures.It is related after encoding each MV-GOP image group Parameter needs are updated, and can be constantly updated according to encoded information before.AGGOP(sni,0) it is i-th of MV- of coding The actually required bit number of GOP image group, AGOP(nk-1,0) i-th of MV-GOP image group of presentation code -1 GOP reality of kth Required bit number, then wk-1It is provided by formula (9)
For bit needed for preferably predicting present encoding viewpoint, the encoded information of multiple view video coding is made full use of i.e. Previous group MV-GOP, using correlation between viewpoint, to predict the code weight of current view point.wkLinear prediction model are as follows:
WhereinWithSimple regression coefficient is respectively indicated, initial value is set as 1 and 0, is encoding each MV-GOP image group knot Shu Hou, need in encoded refreshed by the stage.
The wherein bit distribution of frame-layer is in the present embodiment the geometrical relationship predicted using parallax with motion prediction, All video frames are divided into several different coding type frame, however the parallax prediction characteristic between each viewpoint is with biggish Difference, therefore the interview prediction relationship or identical coded image of projected relationship may have different coding special between the time Property, otherwise the target bit that at this moment cannot be obtained using identical model parameter calculation will appear certain deviation.Therefore it mentions Go out such as following formula frame-layer target bit allocation algorithm:
In above formula, T is the sum of coding M frame institute consumes bit number;MADaIndicate the average value of all frame MAD;MADjIndicate the The MAD of j frame;CjAnd CmRespectively jth frame and m frame frame originating point information occupies bit.From above formula (11) it is found that MADjAnd CjMore The target bits of big picture frame distribution are more.
According to HEV frame-layer target bit allocation method, in MV-HEVC, -1 frame target bit allocation of jth is as follows:
In above formula, CaThe average value of the consumed bit of head information of encoded frame in the current GOP of presentation code.
In general, video sequence moves more violent, activity time domain, that is, picture material field of every frame to multiple view video coding Scape transformation is bigger, and the bit that coding when just needs is more;Otherwise activity time domain, that is, picture material scene change of every frame is got over Small, the bit that coding when needs is fewer.In order to which the control of MV-HEVC code rate is more accurate, by the code rate control of above formula (12) Method processed is further improved, and present frame target bits are calculated by formula (13):
In above formula, TjBit is consumed by jth frame frame originating point information.N represents the time horizon where present frame, and W (l) represents every The weight of frame complexity, WB(l) weight of B frame is represented.
In the present embodiment, basic unit layer bit distribution can be as follows:
In order to compatible with HEVC, multiple view video coding basic unit layer bit distribution algorithm is similar to HEVC.By HEVC Basic unit layer bit rate control algolithm is it is found that algorithm basic unit layer bit rate distribution is compared that simply every frame is distributed Bit averagely gives each basic unit layer of the frame again, then in same basic unit layer all difference macro blocks using identical Quantization parameter QP is encoded.However in fact, even the macro block in same basic unit is in picture material, texture, activity Also there is very big difference in the complexities such as time domain.Therefore, in order to MV-HEVC code rate control it is more accurate, according to its picture material, The complexities such as texture, activity time domain use different quantized values, are calculated by formula (14):
In above formula, Ttotal,kRemaining bits sum and remaining basic unit number, T are respectively represented with Nhead,kIndicate that kth is macro The consumed bit of build information, FD (k) indicate the activity time domain degree of kth coded macroblocks.
In formula, X, Y are respectively the pixel points of macro block in the horizontal and vertical directions, and x, y are respectively horizontal, vertical direction On coordinate, Ij(x,y)、Ij-1(x, y) is respectively the brightness value at current macro and previous macro block position (x, y).
Method of the invention is provided as follows and the existing contrast table there are three bit rate control method is adopted in the present embodiment With the cycle tests of two kinds of different-formats, wherein Vassar, Flamenco2, Exit, Ballroom be VGA format and PoznanHall2, GT Fly are HD format.The resolution ratio of sequence includes 640 × 480 pixels and 1920 × 1088 pixels.Test The MV-HEVC system platform that platform uses JCT-3V to provide.
Upper table gives the experimental result of multiple view video coding code rate control.From upper table as it can be seen that first three code rate controls It is respectively 2.95%, 2.51%, 1.98% that algorithm code rate, which controls error, and code rate control error is all bigger, present embodiment The rate control algorithm proposed is compared with other three kinds of algorithms, and code rate is more accurate, and code rate deviation is smaller, average bit rate error Less than 1%, it is able to satisfy application request.
Referring to Fig. 3, in the present embodiment, a kind of specific embodiment storing equipment 300 is as follows:
A kind of storage equipment 300, wherein being stored with instruction set, described instruction collection is for executing:
Establish the domain ρ code rate model;
Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;
Frame is carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame Layer, basic unit layer bit-allocation and rate-control.
Further, described instruction collection is also used to execute:
Described " establishing the domain ρ code rate model ", further comprises the steps of:
Using multiple regression technology, model parameter is calculated.
Further, described instruction collection is also used to execute:
Described " using multiple regression technology, model parameter is calculated ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123It is returned for unitary Return coefficient;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11, k21,r1),(k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
Further, described instruction collection is also used to execute:
It is described that " bit distributes between carrying out viewpoint according to the image similarity between viewpoint;Held according to frame per second, destination buffer The activity time domain complexity of amount, actual buffering area size and the frame carries out frame-layer, the distribution of basic unit layer bit and code rate control System ", further comprises the steps of:
Step 1: bit-allocation and rate-control being carried out to multi-view image group, current multi-view image group is calculated and answers The bit number being assigned to;
Step 2: bit-allocation and rate-control being carried out to single viewpoint image group, each one-view image group is calculated Weight factor Wk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step 3: bit-allocation and rate-control being carried out to frame-layer in single viewpoint image group, current encoded frame is calculated Target bit;
Step 4: bit-allocation and rate-control being carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, is then led to Cross the quantization parameter Q that ρ value further calculates current macromb
Step 5: according to: quantization parameter QmbCurrent macro is encoded;
Step 6: judging whether macro block all in present frame encodes, if encoding, go to step 7;If not encoding, It then repeats to go to step 4 and then goes to step 7 until encoding to step 5;
Step 7: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step 8; If not encoding, repeat to go to step 3 to 6, until compiling all frames of present image group;
Step 8: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if all Coding is completed, then goes to step 9;If completing without all codings, previous step 2 to 7 is repeated, until current multi-view image group In all single viewpoint image groups all coding is completed;
Step 9: judge whether current multi-view image group is the last one multi-view image group of entire multiview sequence, if It is the last one multi-view image group, then encodes completion, otherwise, repeats previous step 1 to step 8, until entire multiple views sequence Column coding is completed.
Further, described instruction collection is also used to execute:
The step 2, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then The viewpoint of K distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewPresentation code viewpoint Number, S (Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
Following steps are executed by the instruction set in storage equipment 300: by establishing the domain ρ code rate model, and based on efficient On the basis of rate viewpoint video coding, bit distribution is (that is: using between viewpoint between carrying out viewpoint according to the image similarity between viewpoint Correlation analysis gives each viewpoint to carry out reasonable bit distribution in turn), buffer area " liquid level " fluctuation is effectively prevented to frame mesh Bit distribution bring negative effect is marked, and according to the work of frame per second, destination buffer capacity, actual buffering area size and the frame Dynamic time domain complexity carries out frame-layer, basic unit layer bit-allocation and rate-control, overcomes target bits point in the prior art The limitation matched can efficiently control the code rate of multiple view video coding, and rate control accuracy reaches 99% or more, and peak value is believed It makes an uproar than average raising 0.38dB or more.
It should be noted that being not intended to limit although the various embodiments described above have been described herein Scope of patent protection of the invention.Therefore, it based on innovative idea of the invention, change that embodiment described herein is carried out and is repaired Change, or using equivalent structure or equivalent flow shift made by description of the invention and accompanying drawing content, it directly or indirectly will be with Upper technical solution is used in other related technical areas, is included within scope of patent protection of the invention.

Claims (10)

1. a kind of 3 d video encoding bit rate control method based on the domain ρ, which is characterized in that comprising steps of
Establish the domain ρ code rate model;
Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;
Frame-layer, base are carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame This elementary layer bit-allocation and rate-control.
2. a kind of 3 d video encoding bit rate control method based on the domain ρ according to claim 1, which is characterized in that
Described " establishing the domain ρ code rate model ", further comprises the steps of:
Using multiple regression technology, model parameter is calculated.
3. a kind of 3 d video encoding bit rate control method based on the domain ρ according to claim 2, which is characterized in that
Described " using multiple regression technology, model parameter is calculated ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123For simple regression system Number;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11,k21,r1), (k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
4. a kind of 3 d video encoding bit rate control method based on the domain ρ according to claim 1, which is characterized in that
It is described that " bit distributes between carrying out viewpoint according to the image similarity between viewpoint;According to frame per second, destination buffer capacity, reality Border buffer size and the activity time domain complexity of the frame carry out frame-layer, basic unit layer bit-allocation and rate-control ", also Comprising steps of
Step 1: bit-allocation and rate-control being carried out to multi-view image group, current multi-view image group, which is calculated, to distribute The bit number arrived;
Step 2: bit-allocation and rate-control being carried out to single viewpoint image group, the power of each one-view image group is calculated Repeated factor Wk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step 3: bit-allocation and rate-control being carried out to frame-layer in single viewpoint image group, current encoded frame target is calculated Bit number;
Step 4: bit-allocation and rate-control being carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, then passes through ρ value Further calculate the quantization parameter Q of current macromb
Step 5: according to quantization parameter QmbCurrent macro is encoded;
Step 6: judging whether macro block all in present frame encodes, if encoding, go to step 7;If not encoding, weigh 4 are gone to step again then goes to step 7 until encoding to step 5;
Step 7: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step 8;If not It encodes, then repeats to go to step 3 to 6, until compiling all frames of present image group;
Step 8: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if all codings It completes, then goes to step 9;If completing without all codings, previous step 2 to 7 is repeated, until institute in current multi-view image group Having single viewpoint image group, all coding is completed;
Step 9: judging whether current multi-view image group is the last one multi-view image group of entire multiview sequence, if most The latter multi-view image group, then encode completion, otherwise, repeats previous step 1 to step 8, until entire multiview sequence is compiled Code is completed.
5. a kind of 3 d video encoding bit rate control method based on the domain ρ according to claim 4, which is characterized in that
The step 2, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then K Viewpoint distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewThe number of presentation code viewpoint, S(Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
6. a kind of storage equipment, wherein being stored with instruction set, which is characterized in that described instruction collection is for executing:
Establish the domain ρ code rate model;
Bit distributes between carrying out viewpoint according to the image similarity between viewpoint;
Frame-layer, base are carried out according to the activity time domain complexity of frame per second, destination buffer capacity, actual buffering area size and the frame This elementary layer bit-allocation and rate-control.
7. a kind of storage equipment according to claim 6, which is characterized in that described instruction collection is also used to execute:
Described " establishing the domain ρ code rate model ", further comprises the steps of:
Using multiple regression technology, model parameter is calculated.
8. a kind of storage equipment according to claim 7, which is characterized in that described instruction collection is also used to execute:
Described " using multiple regression technology, model parameter is calculated ", further comprises the steps of:
Assuming that ρ and texture part coding bit rate R (ρ) has following secondary relationship:
R (ρ)=θ1·(1-ρ)22·(1-ρ)+θ3
Wherein ρ indicates that the number of zero coefficient after quantization of transform coefficients accounts for the percentage of all coefficients, θ123For simple regression system Number;
Following R- ρ model is calculated:
R (ρ)=θ1·(1-ρ)22·(1-ρ)
Wherein, θ12It is provided by following statistical analysis technique: enabling x1(ρ)=(1- ρ)2, x2(ρ)=1- ρ, if (k11,k21,r1), (k12,k22,r2),…,(k1n,k2n,rn) it is already present n sample value, it enables
Using multiple regression technology, model parameter N is calculated are as follows:
Wherein, KTIt is the transposed matrix of K, (KTK)-1It is KTThe inverse matrix of K.
9. a kind of storage equipment according to claim 6, which is characterized in that described instruction collection is also used to execute:
It is described that " bit distributes between carrying out viewpoint according to the image similarity between viewpoint;According to frame per second, destination buffer capacity, reality Border buffer size and the activity time domain complexity of the frame carry out frame-layer, basic unit layer bit-allocation and rate-control ", also Comprising steps of
Step 1: bit-allocation and rate-control being carried out to multi-view image group, current multi-view image group, which is calculated, to distribute The bit number arrived;
Step 2: bit-allocation and rate-control being carried out to single viewpoint image group, the power of each one-view image group is calculated Repeated factor Wk, according to the weight factor WkThe bit number that current one-view image group should be assigned to is calculated;
Step 3: bit-allocation and rate-control being carried out to frame-layer in single viewpoint image group, current encoded frame target is calculated Bit number;
Step 4: bit-allocation and rate-control being carried out to macroblock layer, ρ value is calculated according to the domain ρ code rate model, then passes through ρ value Further calculate the quantization parameter Q of current macromb
Step 5: according to: quantization parameter QmbCurrent macro is encoded;
Step 6: judging whether macro block all in present frame encodes, if encoding, go to step 7;If not encoding, weigh 4 are gone to step again then goes to step 7 until encoding to step 5;
Step 7: whether all frames encode in the current single viewpoint image group of judgement, if encoding, go to step 8;If not It encodes, then repeats to go to step 3 to 6, until compiling all frames of present image group;
Step 8: whether all single viewpoint image groups all encode in the current single viewpoint image group of judgement, if all codings It completes, then goes to step 9;If completing without all codings, previous step 2 to 7 is repeated, until institute in current multi-view image group Having single viewpoint image group, all coding is completed;
Step 9: judging whether current multi-view image group is the last one multi-view image group of entire multiview sequence, if most The latter multi-view image group, then encode completion, otherwise, repeats previous step 1 to step 8, until entire multiview sequence is compiled Code is completed.
10. a kind of storage equipment according to claim 9, which is characterized in that described instruction collection is also used to execute:
The step 2, further comprises the steps of:
It is T that current multi-view image group, which is assigned to total bit,MVGOP, the weight factor W of each single viewpoint image groupk, then K Viewpoint distributes target bits are as follows:
TSVGOP,k=TMVGOP·Wk
Wherein Wk(k=1,2 ..., Nview) initial value obtains according to Similarity measures between viewpoint:
Wherein, S1Basic viewpoint or main view point are indicated, as the reference view of other viewpoints, NviewThe number of presentation code viewpoint, S(Vj,Vk) indicate vision pointj,VkSimilarity,
Wherein,The respectively feature vector of two sub-pictures.
CN201811491526.9A 2018-12-07 2018-12-07 A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ Pending CN109561311A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811491526.9A CN109561311A (en) 2018-12-07 2018-12-07 A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811491526.9A CN109561311A (en) 2018-12-07 2018-12-07 A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ

Publications (1)

Publication Number Publication Date
CN109561311A true CN109561311A (en) 2019-04-02

Family

ID=65869158

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811491526.9A Pending CN109561311A (en) 2018-12-07 2018-12-07 A kind of 3 d video encoding bit rate control method and storage equipment based on the domain ρ

Country Status (1)

Country Link
CN (1) CN109561311A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110139089A (en) * 2019-05-09 2019-08-16 莆田学院 A kind of the 3 d video encoding bit rate control method and storage equipment of combination scene detection

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101242532A (en) * 2007-12-12 2008-08-13 浙江万里学院 A code rate control method oriented to multi-view point video
CN101287123A (en) * 2008-05-23 2008-10-15 清华大学 Code rate controlling method for video coding based on Rho domain
WO2010005691A1 (en) * 2008-06-16 2010-01-14 Dolby Laboratories Licensing Corporation Rate control model adaptation based on slice dependencies for video coding
CN107220881A (en) * 2017-05-27 2017-09-29 莆田学院 A kind of method and apparatus of the electric business temperature ranking based on time and space

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101242532A (en) * 2007-12-12 2008-08-13 浙江万里学院 A code rate control method oriented to multi-view point video
CN101287123A (en) * 2008-05-23 2008-10-15 清华大学 Code rate controlling method for video coding based on Rho domain
WO2010005691A1 (en) * 2008-06-16 2010-01-14 Dolby Laboratories Licensing Corporation Rate control model adaptation based on slice dependencies for video coding
CN107220881A (en) * 2017-05-27 2017-09-29 莆田学院 A kind of method and apparatus of the electric business temperature ranking based on time and space

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110139089A (en) * 2019-05-09 2019-08-16 莆田学院 A kind of the 3 d video encoding bit rate control method and storage equipment of combination scene detection

Similar Documents

Publication Publication Date Title
Ahmad et al. Interpreting plenoptic images as multi-view sequences for improved compression
CN102209243B (en) Depth map intra prediction method based on linear model
CN106851293B (en) For the device of the decoding image of intra prediction
CN104301724B (en) Method for processing video frequency, encoding device and decoding device
CN101461248B (en) Method and apparatus for adaptively determining a bit budget for encoding video pictures
CN100562116C (en) A kind of bit rate control method towards multi-view point video
CN101888566B (en) Estimation method of distortion performance of stereo video encoding rate
CN102685532B (en) Coding method for free view point four-dimensional space video coding system
CN102413353B (en) Method for allocating code rates of multi-view video and depth graph in stereo video encoding process
CN105049850A (en) HEVC (High Efficiency Video Coding) code rate control method based on region-of-interest
CN101674472A (en) Multistage code rate control method of video code with a plurality of visual points
JPWO2007077942A1 (en) Video encoding method and decoding method, apparatus thereof, program thereof, and storage medium storing program
CN102970529B (en) A kind of object-based multi-view point video fractal image compression & decompression method
Yuan et al. Rate distortion optimized inter-view frame level bit allocation method for MV-HEVC
CN101404766B (en) Multi-view point video signal encoding method
CN101572806B (en) Frame I code rate control method based on H264
CN103037214A (en) Video compression method
CN102281446B (en) Visual-perception-characteristic-based quantification method in distributed video coding
CN103581647A (en) Depth map sequence fractal coding method based on motion vectors of color video
CN103327327B (en) For the inter prediction encoding unit selection method of high-performance video coding HEVC
CN107005711A (en) Sample-by-sample predictive coding apparatus and method
CN104159095B (en) A kind of multiple views texture video and the bit rate control method of depth map encoding
CN111263144B (en) Motion information determination method and equipment
CN103596003B (en) Interframe predication quick mode selecting method for high-performance video coding
CN104038769A (en) Rate control method for intra-frame coding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190402

WD01 Invention patent application deemed withdrawn after publication