CN102572439B - Method for determining optimal multi-viewpoint video coding mode for coding - Google Patents

Method for determining optimal multi-viewpoint video coding mode for coding Download PDF

Info

Publication number
CN102572439B
CN102572439B CN201210066744.4A CN201210066744A CN102572439B CN 102572439 B CN102572439 B CN 102572439B CN 201210066744 A CN201210066744 A CN 201210066744A CN 102572439 B CN102572439 B CN 102572439B
Authority
CN
China
Prior art keywords
value
higher limit
complexity
current macro
coding mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210066744.4A
Other languages
Chinese (zh)
Other versions
CN102572439A (en
Inventor
王好谦
杜成立
张永兵
戴琼海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Graduate School Tsinghua University
Original Assignee
Shenzhen Graduate School Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Graduate School Tsinghua University filed Critical Shenzhen Graduate School Tsinghua University
Priority to CN201210066744.4A priority Critical patent/CN102572439B/en
Publication of CN102572439A publication Critical patent/CN102572439A/en
Priority to HK12108845.6A priority patent/HK1168218A1/en
Application granted granted Critical
Publication of CN102572439B publication Critical patent/CN102572439B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a method for determining an optimal multi-viewpoint video coding mode for coding. The method comprises the following steps of: setting different complexity values for each coding mode according to coding complexities; calculating coding complexity values of a current macro block according to the time-domain relevancy of the same viewpoint, inter-viewpoint relevancy and intra-frame relevancy respectively; determining a coding mode set selectable for the current macro block by adopting a mode transition method according to three calculation results; and selecting a coding mode with minimum rate distortion as a coding mode for the coding of the current macro block. By the method, multi-viewpoint video coding complexity can be effectively reduced on the basis of no reduction in coding efficiency.

Description

A kind of method that definite multiple view video coding optimal mode is encoded
Technical field
The present invention relates to three-dimensional video-frequency and process and communication field, be specifically related to the method for determining that multiple view video coding optimal mode is encoded.
Background technology
Three-dimensional video-frequency is with its abundanter content, more real visual experience, just constantly in daily life, applied, but because three-dimensional video-frequency contains than the more amount of information of ordinary video, basic expressions is that three-dimensional video-frequency all contains two-way or above video sequence conventionally, thereby takies more storage and transmission space.
In recent years, development along with various three-dimensional video-frequency technology, people have been not content with the solid impression that simple binocular video brings, to watching the demand of the degree of freedom of three-dimensional video-frequency also day by day to strengthen, corresponding application is also increasingly extensive, for example, and free viewpoint video, 3D TVs etc., the key technology of these application is exactly multi-view point video.
Multi-view point video comprises the multi-channel video sequence that the different video camera in a plurality of positions is taken simultaneously, increase due to video camera number, the amount of information that multi-view point video sequence comprises is larger than common three-dimensional video-frequency, therefore make its further application receive great restriction, for example broadcast, streaming media service etc.Given this, how effectively to compress multi-view point video sequence and can guarantee that its quality becomes a difficult problem urgently to be resolved hurrily.
Multiple view video coding (multiview video coding) technology is just with solving this difficult problem, and there is certain development, multiple view video coding not only has the time domain redundancy of similar ordinary video, simultaneously due to it, take and between camera, conventionally only have minimum fixed displacement, then between its multi-channel video sequence of taking out, also there is very strong spatial coherence.In video coding process except utilizing estimation to reduce time domain redundancy, the disparity estimation of simultaneously introducing between viewpoint reduces spatial redundancy, has so but brought greatly computational complexity, although compression efficiency improves, but the scramble time is also long, limited the application of multi-vision-point encoding.
In order to reduce the coding complexity of multi-view point video sequence, there is a lot of relevant research at present, conventionally can aspect following three, do Optimization Work, the first, construct suitable reference frame structure, improve code efficiency; The second, aspect the mode decision based on piece coupling, by predicting accurately the pattern of current block and then reducing computational complexity; The 3rd, the block matching algorithm of more optimizing, by predicted motion or disparity vector accurately, reduces hunting zone to reduce operand.
Above three aspects, for parallax structure, the B frame predict structure (HBP) of layering is widely used, and what in multiple view video coding platform JMVC, adopt is exactly this predict.
In multi-vision-point encoding scheme, adopt Multi-encoding pattern, requirement with balance encoder complexity and compression efficiency two aspects, inter-frame forecast mode (SKIP defined 8 in multi-vision-point encoding scheme in, Inter 16 * 16, Inter 16 * 8, Inter 8 * 16, Inter 8 * 8, Inter 8 * 4, Inter 4 * 8, with Inter 4 * 4), (Intra 4 * 4 for 9 kinds of intra-frame encoding modes, Intra 8 * 8, with four kinds of Intra 16 * 16), in coding, calculate successively the rate distortion costs (rate-distortion cost (RDcost)) of various coding modes, select coding mode corresponding to minimum rate distortion costs (RDcost) as best coding mode.This mode can farthest improve code efficiency, but computational complexity is high, has limited multiple view video coding development in actual applications.
For the reduction encoder complexity of trying one's best when not reducing code efficiency, correlative study person has proposed the method for model prediction, before calculating the rate distortion costs of various possible coding modes, the most probable forced coding pattern of adaptive selection present encoding piece.This just need to be to coding mode Accurate Prediction, but generally speaking, complexity due to real scene, no matter be simply based on relativity of time domain, spatial coherence, correlation between viewpoint, or comprehensive consideration temporal correlation, all can not obtain model prediction scheme scene changes to robustness.For various object juncture areas in video, the selection of pattern is more difficult simultaneously.Therefore, the model prediction precision obtaining is also extremely limited, does not still meet to a certain extent the demand that further reduces the operand of multi-vision-point encoding, and the range of choice that how to obtain pattern in predictive coding is more accurately a problem demanding prompt solution.
For disparity estimation, the algorithm of employing based on piece coupling, conventionally in larger region of search, can obtain matching result more accurately, but for the less region of parallax, excessive region of search can increase computational complexity, adaptive adjustment region of search how, make to reduce when guaranteeing disparity estimation precision as far as possible computational complexity become multi-vision-point encoding in another crucial direction of optimizing.
Summary of the invention
Reduce encoder complexity when guaranteeing code efficiency, the invention provides a kind of method that definite multiple view video coding optimal mode is encoded.
Determine the method that multiple view video coding optimal mode is encoded, comprise the steps:
A1, according to coding complexity various multi-vision-point encoding patterns are arranged to different complexity value;
A2, with the complexity value of reference macroblock, according to same viewpoint relativity of time domain, calculate first numerical value of complexity value of the coding mode of current macro, this complexity value as the corresponding complexity of current macro coding mode to be chosen the first higher limit;
A3, with the complexity value of reference macroblock according to the second value of the complexity value of the coding mode of correlation calculations current macro between viewpoint, this complexity value as the corresponding complexity of current macro coding mode to be chosen the second higher limit;
A4, with the complexity value of reference macroblock, according to the in-frame correlation of present frame, calculate the third value of complexity value of the coding mode of current macro, this complexity value as the corresponding complexity of current macro coding mode to be chosen the 3rd higher limit;
If the absolute value of A5 the first higher limit and the second upper limit value difference is less than threshold value, according to the first higher limit, the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity value of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and on the first higher limit and the 3rd, the poor absolute value of restriction is less than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the first higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is greater than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen;
A6, in complexity value, be less than in all coding modes of the definite higher limit of steps A 5, the coding mode of selection rate distortion rate minimum is encoded to current macro.
Preferably, in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is less than threshold value, according to the complexity value of 1/3rd coding modes as current macro of the first higher limit, the second higher limit and the 3rd higher limit sum, i.e. the higher limit of the corresponding complexity of current macro coding mode to be chosen.
Preferably, in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and on the first higher limit and the 3rd, the poor absolute value of restriction is less than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the complexity value of 1/2nd coding modes as current macro of the first higher limit and the 3rd higher limit sum, i.e. the higher limit of the corresponding complexity of current macro coding mode to be chosen.
Preferably, in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is greater than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the complexity value of 1/2nd coding modes as current macro of the second higher limit and the 3rd higher limit sum, i.e. the higher limit of the corresponding complexity of current macro coding mode to be chosen.
Preferably, the first numerical value ma obtains by following algorithm:
Figure 2012100667444100002DEST_PATH_IMAGE001
Wherein, the i of the T two field picture of current macro in V viewpoint is capable, j row, and reference frame is V viewpoint T-1 frame, and the i of benchmark macro block in reference frame is capable, j row, represent benchmark macro block the complexity value of coding mode,
Figure 2012100667444100002DEST_PATH_IMAGE003
represent the macro block of the capable j+b row of reference frame i+a and the coefficient correlation of current macro.
Preferably, described second value mb obtains by following algorithm:
Figure 318465DEST_PATH_IMAGE004
Wherein, current macro is in V viewpoint T two field picture, and reference frame is in V+1 viewpoint T two field picture, be illustrated in the T two field picture of V+1 viewpoint the macro block maximum with current macro overlaid pixel after Global Vector translation the complexity value of coding mode,
Figure 933991DEST_PATH_IMAGE003
represent the macro block of the capable j+b row of the T two field picture i+a of V+1 viewpoint and the coefficient correlation of current macro.
Preferably, described third value mc obtains by following algorithm:
Figure 491006DEST_PATH_IMAGE006
Wherein, the i of the T two field picture of current macro in V viewpoint is capable, j row,
Figure DEST_PATH_IMAGE007
the i of T two field picture that represents V viewpoint is capable, the complexity value of the coding mode of the macro block of j-1 row,
Figure 727952DEST_PATH_IMAGE008
the i-1 of T two field picture that represents V viewpoint is capable, the complexity value of the coding mode of the macro block of j row.
Preferably, if the absolute value of the difference of the higher limit of complexity value of coding mode to be selected of current macro that process steps A 5 is definite and the higher limit of the complexity value of the coding mode to be selected of the right side adjacent macroblocks of current macro is greater than set point, the higher limit of the complexity value of the coding mode to be selected of current macro is set to half of the higher limit of complexity value of coding mode to be selected of current macro and the higher limit sum of the complexity value of the coding mode to be selected of right side adjacent macroblocks.
Preferably, by following algorithm, obtain region of search SR corresponding to current macro:
,
Wherein, diff represents the absolute value of the difference of the first numerical value ma and second value mb,
Figure 437020DEST_PATH_IMAGE010
represent minimum setting search region,
Figure DEST_PATH_IMAGE011
represent maximum setting search region.
The invention has the beneficial effects as follows:
1) in multiple view video coding, utilize the mode of model selection can farthest improve code efficiency, but model selection process need calculates the Rdcost of various patterns, and then select minimum Rdcost as best coding mode, amount of calculation greatly causes coding and very complicated, therefore before utilizing calculating Rdcost selection optimal mode, first predict the coding mode of current macro, can guarantee on the basis of code efficiency, reduce coding computational complexity.Adopt multi-modal method, make full use of the temporal correlation of multi-view point video sequence, can take different strategies for different content again, can fully guarantee the precision of estimation range, and then more effectively reduce encoder complexity.
2) by multi-modal strategy, tentatively obtain on the basis of coding mode range of choice, adopt the method Optimized Coding Based model selection scope of feedback, further guarantee precision of prediction, when making at utmost to reduce operand, can not reduce again compression efficiency (cannot effectively compress because the coding mode that prediction obtains is inaccurate).
3) according to the relevant parameter in multi-modal strategy, characterize the precision of coding mode range of choice, adaptive adjustment disparity estimation region of search accordingly, reduce the operand of disparity estimation, and then further reduce the integral operation complexity of multiple view video coding.
4) according to the relevant parameter self adaptation calculating in mode decision process, adjust the region of search in disparity estimation, in the situation that additionally not increasing amount of calculation, reduce the amount of calculation of bringing due to search in piece coupling as far as possible.
5) disparity estimation result is adopted to the mode of Kalman filtering, disparity vector is optimized to mark rank by integer rank, when improving precision of prediction, do not increase again too much operand.
Accompanying drawing explanation
Fig. 1 is the flow chart that the present invention determines a kind of embodiment of the method that multiple view video coding optimal mode is encoded;
Fig. 2 is the flow chart that the present invention determines another embodiment of the method that multiple view video coding optimal mode is encoded;
Fig. 3 is the particular flow sheet of a kind of execution mode of step 201 in Fig. 2;
Fig. 4 is the particular flow sheet of a kind of execution mode of step 202 in Fig. 2;
Fig. 5 is the particular flow sheet of a kind of execution mode of step 203 in Fig. 2;
Fig. 6 is the particular flow sheet of a kind of execution mode of step 204 in Fig. 2;
Fig. 7 is the flow chart that the present invention determines another embodiment of the method that multiple view video coding optimal mode is encoded.
Embodiment
Below with reference to as figure, specific embodiments of the invention are described in further detail.
As shown in Figure 1, determine the method that multiple view video coding optimal mode is encoded, comprise the steps:
A1, according to coding complexity various multi-vision-point encoding patterns are arranged to different complexity value;
A2, with the complexity value of reference macroblock, according to same viewpoint relativity of time domain, calculate first numerical value of complexity value of the coding mode of current macro, this complexity value as the corresponding complexity of current macro coding mode to be chosen the first higher limit;
A3, with the complexity value of reference macroblock according to the second value of the complexity value of the coding mode of correlation calculations current macro between viewpoint, this complexity value as the corresponding complexity of current macro coding mode to be chosen the second higher limit;
A4, with the complexity value of reference macroblock, according to the in-frame correlation of present frame, calculate the third value of complexity value of the coding mode of current macro, this complexity value is as the 3rd higher limit of the corresponding complexity of current macro coding mode to be chosen;
If the absolute value of A5 the first higher limit and the second upper limit value difference is less than threshold value, according to the first higher limit, the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and on the first higher limit and the 3rd, the poor absolute value of restriction is less than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the first higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is greater than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen;
A6, in complexity value, be less than in all coding modes of the definite higher limit of steps A 5, the coding mode of selection rate distortion rate minimum is encoded to current macro.
In specific embodiments of the invention, the corresponding method of determining that multiple view video coding optimal mode is encoded can be divided into four treatment steps, as shown in Fig. 2-7, comprising:
201: video image preliminary treatment
202: the multi-modal coding mode range of choice (i.e. the higher limit of the numerical value range of choice of the complexity of the coding mode of definite current macro) of obtaining
203: based on feedback optimized coding mode range of choice
204: disparity estimation
Wherein, video image pre-treatment step 201 comprises:
301: from the video sequence reading in, extract frame to be encoded, and according to Parameter File, determine the reference frame lists of coding.Calculate the global disparity vector of present frame
Figure 978991DEST_PATH_IMAGE012
.
302: frame to be encoded is divided into the macro block of fixed size (16x16 pixel), treatment step below will be processed each macro block successively.Particularly, for multi-view point video sequence, we adopt
Figure DEST_PATH_IMAGE013
represent the T frame of V viewpoint in time domain direction, namely current frame to be encoded.
Figure 900548DEST_PATH_IMAGE014
represent, the i of the T two field picture of V viewpoint in time domain direction is capable, and j is listed as corresponding macro block.The top left corner pixel of this macro block is
Figure DEST_PATH_IMAGE015
.
The multi-modal coding mode range of choice step 202 of obtaining is specially:
Various patterns for the ease of in quantitative analysis coding, represent various patterns according to its complexity with a pattern variable (complexity value) M, the larger M value of complexity is larger, specific as follows:
MODE(coding mode) Skip(dancing mode) (inter-frame mode) 16X16 (inter-frame mode) 16X8 (inter-frame mode) 8X16 (inter-frame mode) 8X8 INTRA(frame mode)
M value 0 1 3 3 5 6
Wherein, pattern INTRA has comprised all intra-frame encoding modes, and (intra-frame encoding mode is comparatively complicated, unification represents by M value 6), those skilled in the art can select different numeric representation complexities according to specific needs, to distinguish the complexity of different coding pattern.In order to represent conveniently, macro block
Figure 460843DEST_PATH_IMAGE016
corresponding M value is denoted as
Figure DEST_PATH_IMAGE017
.
401: the first numerical value ma based on relativity of time domain prediction current macro coding mode complexity value in viewpoint, determine that range of choice A(determines the first numerical value ma, as the first higher limit of coding mode complexity).For current macro , with time domain reference frame
Figure DEST_PATH_IMAGE019
for example, note macro block do benchmark macro block
Figure DEST_PATH_IMAGE021
, according to benchmark macro block
Figure 857778DEST_PATH_IMAGE022
the first numerical value ma of the M value computation complexity of 9 reference macroblocks centered by (being included), the first higher limit as the corresponding complexity of current macro coding mode to be selected, according to 9 reference macroblocks, follow that degree of correlation from current macro is different gives different weights, computational methods are as follows:
With benchmark macro block
Figure DEST_PATH_IMAGE023
centered by 9 reference macroblocks as figure below, it is as follows from the weights (coefficient correlation) of current macro: those skilled in the art can select different numeric representation coefficient correlations according to specific needs, larger the closer to the coefficient correlation of current macro.
Figure 23311DEST_PATH_IMAGE024
?
Figure 28176DEST_PATH_IMAGE001
According to above-mentioned coding mode range of choice A corresponding to current macro that calculate, the preset range of the coding mode of current macro is the coding mode that all M values are less than ma.
402: the second value mb based on dependency prediction current macro coding mode complexity value between viewpoint, determines range of choice B.For current macro
Figure DEST_PATH_IMAGE025
, according to global disparity vector
Figure 777695DEST_PATH_IMAGE012
select reference macroblock between its viewpoint, with reference frame in the right viewpoint of present frame place viewpoint
Figure 260629DEST_PATH_IMAGE026
for example, at reference frame the fixed size macro block being divided into, selects wherein and macro block
Figure DEST_PATH_IMAGE027
(according to the macro block after current macro translation global disparity vector) has the macro block of maximum coincidence pixels, and note is done benchmark macro block , according to benchmark macro block centered by the M value of 9 reference macroblocks calculate second value mb, according to 9 reference macroblocks are different from the degree of correlation of benchmark macro block respectively, give different (coefficient correlation) weights, computational methods are as follows:
With benchmark macro block
Figure 183138DEST_PATH_IMAGE028
centered by 9 reference macroblocks as figure below, its (coefficient correlation) weights are allocated as follows, in like manner, those skilled in the art can select different numeric representation coefficient correlations according to specific needs, larger the closer to the coefficient correlation of current macro.
Figure 775793DEST_PATH_IMAGE024
?
Figure 591303DEST_PATH_IMAGE004
According to the above-mentioned range of choice B (having determined second value mb) that calculates the complexity of coding mode corresponding to current macro, when the coding mode of front module is predefined for the coding mode that all M values are less than second value mb.
403: the range of choice C(of the complexity of the coding mode based on in-frame correlation prediction current macro has determined third value mc).For current macro
Figure DEST_PATH_IMAGE029
, select the interior time domain of its viewpoint with reference to the reference macroblock in present frame
Figure 348037DEST_PATH_IMAGE030
with
Figure DEST_PATH_IMAGE031
, according to spatial correlation, calculate third value mc,
According to the above-mentioned range of choice C that calculates the complexity of coding mode corresponding to current macro, the coding mode when front module is predefined for the coding mode that all M values are less than third value mc.
404: multi-modal Forecasting Methodology is selected
Calculate A (relevant to the first numerical value ma), two kinds of close degree that predict the outcome of B (mb is relevant to second value)
Figure 931520DEST_PATH_IMAGE032
if,
Figure DEST_PATH_IMAGE033
(1 represents threshold value, also can get different values according to the difference of application), show that the two predicts the outcome close, also shown current macro region of living in, now select mode three, make full use of time domain, the numerical value range of choice (as described in step 405) of the complexity of the further predictive coding pattern of correlation between spatial domain and viewpoint.
If A, what two kinds of B predicted the outcome differs larger, , illustrate that time domain or spatial domain prediction result are inconsistent, now, according to the feature of infra-frame prediction result C (mc is relevant to third value) and other two kinds of prediction mode, which kind of is more reliable to calculate time domain prediction and interview prediction result, if
Figure DEST_PATH_IMAGE035
think the more approaching reality of time domain prediction (showing that region described in current macro now belongs to the mild or uniform region of scene of motion), now select mode one, further utilize the numerical value range of choice of complexity of the coding mode of time domain and in-frame correlation prediction current macro.
If and
Figure 305235DEST_PATH_IMAGE036
, illustrate that interview prediction, than time domain prediction accurate (showing that the residing region of current macro is the violent or complicated region of scene of motion), now selects mode two, further utilize between viewpoint and the numerical value range of choice of in-frame correlation predictive coding pattern.
405: according to the numerical value range of choice D of the complexity of the prediction modal calculation coding mode of selecting
D (x) expression, M value is less than all coding modes of x.
For mode one: D (md)=D ((ma+mc)/2);
For mode two: D (md)=D ((mb+mc)/2);
For mode three: D (md)=D ((ma+nb+mc)/3).
The numerical value range of choice of the complexity of step 203 based on feedback optimized coding mode is specially:
In order to guarantee borderline region code efficiency, adopt the mode of feedback, upgrade coding mode range of choice, specific as follows:
501: temporarily preserve the numerical value range of choice D (md) of complexity of the coding mode of current macro, calculate the numerical value range of choice E (me) (higher limit of the numerical value range of choice of the complexity of the coding mode that me is this right side macro block) of complexity of the macro-block coding pattern on this macro block right side
502: feedback is upgraded coding mode range of choice
If , illustrating that now larger variation occurs image, borderline region normally, adjusts the numerical value range of choice of the complexity of present encoding pattern: D (md)=D ((md+me)/2)
If
Figure 951111DEST_PATH_IMAGE038
, the numerical value range of choice D (md) of the complexity of present encoding pattern is constant.
Step 204 disparity estimation is specially:
601: the numerical value range of choice D (md) of the complexity of the coding mode calculating according to step 405 and 502, adopt rate-distortion optimization method to select forced coding pattern, successively each M value is less than the rate distortion costs (Rdcost) of each mode computation current macro of md, select pattern that minimum Rdcost is corresponding as the forced coding pattern of current macro, can encode to current macro.
602: according to what calculate in 404
Figure DEST_PATH_IMAGE039
, self adaptation is adjusted region of search, default maximum search region MaxRS=27 (pixel), minimum region of search MinSR=17.Calculate the corresponding region of search of current macro as follows:
Figure 374003DEST_PATH_IMAGE009
603: according to 601 definite coding modes, each the sub-macro block to current macro, in the hunting zone of arriving of 602 calculating, the method based on piece coupling is calculated disparity vector.
604: utilize the optical parallax field calculating in kalman filtering optimization 603.

Claims (9)

1. determine the method that multiple view video coding optimal mode is encoded, it is characterized in that: comprise the steps:
A1, according to coding complexity various multi-vision-point encoding patterns are arranged to different complexity value;
A2, with the complexity value of reference macroblock, according to same viewpoint relativity of time domain, calculate first numerical value of complexity value of the coding mode of current macro, this complexity value is as the first higher limit of the corresponding complexity of current macro coding mode to be chosen;
A3, with the complexity value of reference macroblock according to the second value of the complexity value of the coding mode of correlation calculations current macro between viewpoint, this complexity value is as the second higher limit of the corresponding complexity of current macro coding mode to be chosen;
A4, with the complexity value of reference macroblock, according to the in-frame correlation of present frame, calculate the third value of complexity value of the coding mode of current macro, this complexity value is as the 3rd higher limit of the corresponding complexity of current macro coding mode to be chosen;
If the absolute value of A5 the first higher limit and the second upper limit value difference is less than threshold value, according to the first higher limit, the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is less than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the first higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen; If the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is greater than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the second higher limit and the 3rd higher limit sum, determine the complexity value of the coding mode of current macro, as the higher limit of the corresponding complexity of current macro coding mode to be chosen;
A6, in complexity value, be less than in all coding modes of the definite higher limit of steps A 5, the coding mode of selection rate distortion rate minimum is encoded to current macro.
2. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, it is characterized in that: in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is less than threshold value, according to the higher limit of 1/3rd complexity value as current macro coding mode to be selected of the first higher limit, the second higher limit and the 3rd higher limit sum.
3. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, it is characterized in that: in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is less than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the higher limit of 1/2nd complexity value as current macro coding mode to be selected of the first higher limit and the 3rd higher limit sum.
4. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, it is characterized in that: in described steps A 5, if the absolute value of the first higher limit and the second upper limit value difference is greater than threshold value, and the absolute value of the first higher limit and the 3rd upper limit value difference is greater than the absolute value of the second higher limit and the 3rd upper limit value difference, according to the higher limit of 1/2nd complexity value as current macro coding mode to be selected of the second higher limit and the 3rd higher limit sum.
5. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, is characterized in that: the first numerical value ma obtains by following algorithm:
mz = Σ - 1 ≤ a ≤ 1 , - 1 ≤ b ≤ 1 w ( i + a , j + b ) * m V , T - 1 ( i , j ) Σ - 1 ≤ a ≤ 1 , - 1 ≤ b ≤ 1 w ( i + a , j + b ) ,
Wherein, the i of the T two field picture of current macro in V viewpoint is capable, j row, and reference frame is V viewpoint T-1 frame, and the i of benchmark macro block in reference frame is capable, j row, m v, T-1(i, j) represents the complexity value of the coding mode of benchmark macro block, and w (i+a, j+b) represents the macro block of the capable j+b row of reference frame i+a and the coefficient correlation of current macro.
6. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, is characterized in that: described second value mb obtains by following algorithm:
mb = Σ - 1 ≤ a ≤ 1 , - 1 ≤ b ≤ 1 w ( i + a , j + b ) * m V + 1 , T ( i , j ) Σ - 1 ≤ a ≤ 1 , - 1 ≤ b ≤ 1 w ( i + a , j + b ) ,
Wherein, current macro is in V viewpoint T two field picture, and reference frame is in V+1 viewpoint T two field picture, m v+1, T(i, j) be illustrated in the T two field picture of V+1 viewpoint the complexity value of the coding mode of the macro block maximum with current macro overlaid pixel after Global Vector translation, w (i+a, j+b) represents the macro block of the capable j+b row of the T two field picture i+a of V+1 viewpoint and the coefficient correlation of current macro.
7. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, is characterized in that: described third value mc obtains by following algorithm:
mc = m V , T ( i - 1 , j ) + m V , T ( i , j - 1 ) 2 , Wherein, the i of the T two field picture of current macro in V viewpoint is capable, j row, m v,T(i, j-1) represents that the i of T two field picture of V viewpoint is capable, the complexity value of the coding mode of the macro block of j-1 row, m v,T(i-1, j) represents that the i-1 of T two field picture of V viewpoint is capable, the complexity value of the coding mode of the macro block of j row.
8. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, it is characterized in that: if be greater than set point through the absolute value of the difference of the higher limit of the definite higher limit of complexity value of current macro coding mode to be selected of steps A 5 and the complexity value of the right side adjacent macroblocks of current macro, the complexity value higher limit of current macro coding mode to be selected is set to the higher limit of complexity value of current macro coding mode to be selected and half of the complexity value higher limit sum of right side adjacent macroblocks coding mode to be selected.
9. the method that definite multiple view video coding optimal mode as claimed in claim 1 is encoded, is characterized in that: by following algorithm, obtain region of search SR corresponding to current macro:
SR = MinSR + diff min ( ma , mb ) ( MaxSR - MinSR ) , Wherein, diff represents the absolute value of the difference of the first numerical value ma and second value mb, and MinSR represents minimum setting search region, and MaxSR represents maximum setting search region.
CN201210066744.4A 2012-03-14 2012-03-14 Method for determining optimal multi-viewpoint video coding mode for coding Expired - Fee Related CN102572439B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201210066744.4A CN102572439B (en) 2012-03-14 2012-03-14 Method for determining optimal multi-viewpoint video coding mode for coding
HK12108845.6A HK1168218A1 (en) 2012-03-14 2012-09-10 A method for determining the best mode of multi-view video coding to code

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210066744.4A CN102572439B (en) 2012-03-14 2012-03-14 Method for determining optimal multi-viewpoint video coding mode for coding

Publications (2)

Publication Number Publication Date
CN102572439A CN102572439A (en) 2012-07-11
CN102572439B true CN102572439B (en) 2014-02-12

Family

ID=46416746

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210066744.4A Expired - Fee Related CN102572439B (en) 2012-03-14 2012-03-14 Method for determining optimal multi-viewpoint video coding mode for coding

Country Status (2)

Country Link
CN (1) CN102572439B (en)
HK (1) HK1168218A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102957932A (en) * 2012-11-02 2013-03-06 同济大学 Coding mode quickly and automatically selecting method based on optimum stopping theory
CN103338370B (en) * 2013-06-05 2016-06-08 宁波大学 A kind of multi-view depth video fast encoding method
CN103533330B (en) * 2013-10-15 2016-01-06 华侨大学 A kind of multiple views multi-description video coding method based on data reusing
CN104618725A (en) * 2015-01-15 2015-05-13 华侨大学 Multi-view video coding algorithm combining quick search and mode optimization
CN105141956B (en) * 2015-08-03 2018-05-29 西安电子科技大学 A kind of gradual Rate-distortion optimization method of free viewpoint video depth map encoding
CN105872543B (en) * 2016-03-30 2018-10-23 上海大学 A kind of fast schema selection method in MVD viewpoints graduated encoding
CN106231303B (en) * 2016-07-22 2020-06-12 上海交通大学 Method for controlling complexity by using prediction mode in HEVC (high efficiency video coding)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101404766A (en) * 2008-11-05 2009-04-08 宁波大学 Multi-view point video signal encoding method
CN101888566A (en) * 2010-06-30 2010-11-17 清华大学 Estimation method of distortion performance of stereo video encoding rate
CN101917618A (en) * 2010-08-20 2010-12-15 浙江大学 Calculation complexity control method for multi-viewpoint video coding and layering B frame predicting structure

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101404766A (en) * 2008-11-05 2009-04-08 宁波大学 Multi-view point video signal encoding method
CN101888566A (en) * 2010-06-30 2010-11-17 清华大学 Estimation method of distortion performance of stereo video encoding rate
CN101917618A (en) * 2010-08-20 2010-12-15 浙江大学 Calculation complexity control method for multi-viewpoint video coding and layering B frame predicting structure

Also Published As

Publication number Publication date
HK1168218A1 (en) 2012-12-21
CN102572439A (en) 2012-07-11

Similar Documents

Publication Publication Date Title
CN102572439B (en) Method for determining optimal multi-viewpoint video coding mode for coding
CN101179728B (en) Method and apparatus for determining interframe encoding mode
CN102067601B (en) Methods and apparatus for template matching prediction (TMP) in video encoding and decoding
CN104412597B (en) The method and device that unified difference vector for 3D Video codings is derived
CN104247432A (en) Efficient multi-view coding using depth-map estimate update
CN103248895B (en) A kind of quick mode method of estimation for HEVC intraframe coding
CN100592798C (en) An implementation method for quick conversion and quantification of video coding
CN107071422B (en) Low complex degree HEVC rate adaption transformation coding method based on image correlation model
CN106210721B (en) A kind of quick code check code-transferring methods of HEVC
CN101375593A (en) Processing multiview video
CN101404774A (en) Macro-block partition mode selection method in movement search
CN107040782B (en) Video coding overall situation Rate-distortion optimization method based on Lagrangian method
CN101159871A (en) Macro block group video code rate control method
CN101986716A (en) Quick depth video coding method
CN101710985B (en) Image brightness compensation method for image coding
CN103533359A (en) H.264 code rate control method
CN103024381B (en) A kind of macro block mode fast selecting method based on proper discernable distortion
CN101895739B (en) Block statistical characteristic-based block encoding method
CN103024395B (en) Multiple view video coding mode selecting method and device
CN108012149A (en) A kind of method of Rate Control in Video Coding
KR100947447B1 (en) Method and its apparatus for fast mode decision in multi-view video coding
CN104702959B (en) A kind of intra-frame prediction method and system of Video coding
CN104159095A (en) Code rate control method for multi-view texture video and depth map coding
CN103391439B (en) A kind of H.264/AVC bit rate control method hidden based on active macro block
CN104618714B (en) A kind of stereo video frame importance appraisal procedure

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1168218

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1168218

Country of ref document: HK

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140212

CF01 Termination of patent right due to non-payment of annual fee