CN104602019A - Video coding method and device - Google Patents

Video coding method and device Download PDF

Info

Publication number
CN104602019A
CN104602019A CN201410855788.4A CN201410855788A CN104602019A CN 104602019 A CN104602019 A CN 104602019A CN 201410855788 A CN201410855788 A CN 201410855788A CN 104602019 A CN104602019 A CN 104602019A
Authority
CN
China
Prior art keywords
reference frame
current
video image
sampling
decision
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410855788.4A
Other languages
Chinese (zh)
Inventor
蔡砚刚
魏伟
白茂生
刘阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LeTV Information Technology Beijing Co Ltd
Original Assignee
LeTV Information Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LeTV Information Technology Beijing Co Ltd filed Critical LeTV Information Technology Beijing Co Ltd
Priority to CN201410855788.4A priority Critical patent/CN104602019A/en
Publication of CN104602019A publication Critical patent/CN104602019A/en
Pending legal-status Critical Current

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a video coding method and a video coding device. The video coding method comprises the following steps: performing downsampling on a to-be-coded initial resolution video image to generate a downsampling video image with one or more resolution stages; performing reference frame decision on the downsampling video image of each resolution stage to obtain the optimal reference frame of each image frame in the downsampling video image of each resolution stage; while performing reference frame decision on the initial resolution video image, adding a downsampling reference frame set of the existing PU and a reference frame in an existing reference list into a reference frame candidate set of the existing PU, and selecting the optimal reference frame of the existing PU from the reference frame candidate set. According to the method and the device disclosed by the invention, 50% reference frame decision time can be saved on an existing x265 coding framework.

Description

A kind of method for video coding and device
Technical field
The application relates to field of video encoding, particularly relates to a kind of method for video coding and device.
Background technology
When carrying out inter prediction encoding to video image, because the scenery in live image contiguous frames also exists certain correlation, therefore live image can be divided into some pieces or macro block, and manage to search out each piece or the position of macro block in contiguous frames image, and draw the relative displacement of locus between the two, the relative displacement obtained is exactly the motion vector of usual indication, and the process obtaining motion vector is called as estimation.
Motion vector and the predicated error that obtains after motion match are sent to decoding end jointly, in the position that decoding end indicates according to motion vector, from the contiguous reference frame image of having decoded, find corresponding block or macro block, and just obtain block or macro block position in the current frame after predicated error addition.
The interframe redundancy of video image can be removed by estimation, the bit number of transmission of video is greatly reduced.Therefore, estimation is an important component part in video coding system.General method for estimating is as follows: set the two field picture of t as present frame f (x, y), the two field picture of moment t ' is reference frame f ' (x, y), reference frame in advance or can lag behind present frame in time, as shown in Figure 1, as t ' <t, be referred to as forward motion estimation, as t ' >t, be referred to as backward motion estimation.When searching the optimum Match of the block in present frame t in reference frame t ', corresponding sports ground d (x can be obtained; T, t ± △ t), the motion vector of present frame can be obtained.
In order to improve precision of prediction, H.264 the reference picture (also reference frame can be called) of one or two image mated most with present frame as interframe encode can be selected in from before one group or below encoded image, this greatly increases making the complexity of inter prediction, but the result repeatedly compared, makes the precision of prediction after coupling significantly improve.H.264 can select at most best matching image in from 15 reference pictures, and be divided into two reference picture lists (also can be described as reference frame lists) with reference to image, be denoted as respectively: List0 and List1.
Prediction for inter-coded macroblocks and macroblock partition in P sheet can select reference picture from reference frame lists " List0 "; For the prediction of the inter-coded macroblocks in B sheet and macroblock partition, reference picture can be selected from reference frame lists " List0 " and " List1 ".
In List0: will based on POC (Picture Order Count, image sequence number) nearest forward direction image be designated as index0, then be all the other forward direction images (with POC incremental order), and backward image (with POC incremental order).
In List1: nearest backward image is designated as index0, be then all the other backward images (with POC incremental order), and forward direction image (with POC incremental order).
The prediction mode of P, B sheet as shown in Figure 2.
Such as, as shown in table 1, one H.264 decoder store 6 width short-term reference picture, its POC is respectively: 123,125,126,128,129 and 130.The POC of present image is 127.All 6 width short-term reference picture are all designated as " with for referencial use " in list0 and list1.
Index List0 List1
0 126 128
1 125 129
2 123 130
3 128 126
4 129 125
5 130 123
Table 1 short-term buffering index
HEVC (High Efficiency Video Coding, efficient video coding) inherit multi-reference frame scheme H.264 equally, select before one group or below the reference picture of one or two image mated most with present frame as interframe encode encoded image, this greatly increases making the complexity of inter prediction.
In recent years, along with the development of the Internet and hardware device, the cost that people make video is more and more low, to video share and requirements for access strengthens day by day.The resolution of video is more and more large, and HD video (resolution is 1920 × 1080, is denoted as 1080p) even ultra high-definition video (resolution is 3840 × 2160, is denoted as 4K) is made more and more, propagates and plays.This trend will accelerate the application of the field of video encoding of HEVC.Because HEVC adopts the mode of recurrence to divide encoding block, if continue to continue to use the reference frame decision scheme of H.264 high complexity, the exponential decline of coding rate can be brought.
Summary of the invention
Technical problems to be solved in this application are, overcome the deficiencies in the prior art, provide a kind of method for video coding and the device that can improve HEVC Video coding speed.
In order to solve the problem, the application provides a kind of method for video coding, and the method comprises:
Down-sampling is carried out to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratio;
For the down-sampling video image of each stage resolution ratio carries out reference frame decision-making respectively, obtain the optimum reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio;
When reference frame decision-making is carried out to initial resolution video image, the reference frame jointly comprised in the down-sampling reference frame set of current PU and current reference frame list is included in the reference frame candidate collection of current PU, and from reference frame candidate collection, selects the optimum reference frame of current PU;
Wherein, the down-sampling reference frame set of described current PU comprises: the optimum reference frame of the PU in the down-sampled images of each stage resolution ratio that current PU is corresponding;
The PU of the down-sampled images of each stage resolution ratio that described current PU is corresponding comprises: in each down-sampling video image, the PU overlapping with the image-region shared by current PU or partially overlap.
In addition, when reference frame decision-making is carried out to initial resolution video image, also the reference frame of the adjacent block of current PU is included in the reference frame candidate collection of described current PU.
In addition, when down-sampling video image for each stage resolution ratio carries out reference frame decision-making respectively, take absolute length chang as the coding cost value of the match block of criterion calculation PU in each candidate reference frame, and candidate reference frame corresponding to the match block selecting coding cost value minimum is as the optimum reference frame of corresponding PU.
In addition, for initial resolution video image carry out reference frame decision-making time, take rate distortion costs as the coding cost value of the match block of criterion calculation PU in reference frame candidate collection in each candidate reference frame, and the candidate reference frame corresponding to match block selecting coding cost value minimum with λ × R (ref) sum is as the optimum reference frame of corresponding PU;
Wherein, λ is Lagrangian constant, and R (ref) represents the bit number that reference frame number ref takies.
In addition, the initial resolution video image carrying out described down-sampling is the primitive frame of video image.
The present invention also provides a kind of method for video coding, and the method comprises:
For video image to be encoded carry out reference frame decision-making time, by the reference frame of the adjacent block of current PU alternatively reference frame, and therefrom select the optimum reference frame of current PU.
In addition, for video image to be encoded carry out reference frame decision-making time, take rate distortion costs as the coding cost value of the match block of the current PU of criterion calculation in each candidate reference frame, and select coding cost value and the minimum candidate reference frame corresponding to match block of λ × R (ref) sum as the optimum reference frame of current PU;
Wherein, λ is Lagrangian constant, and R (ref) represents the bit number that reference frame number takies.
The present invention also provides a kind of video coding apparatus, comprises in this device: downsampling unit, predicting unit; Wherein:
Described downsampling unit, for carrying out down-sampling to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratio;
Described predicting unit, carries out reference frame decision-making respectively for the down-sampling video image for each stage resolution ratio, obtains the optimum reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio;
Described predicting unit, also for when carrying out reference frame decision-making to initial resolution video image, the reference frame jointly comprised in the down-sampling reference frame set of current PU and current reference frame list is included in the reference frame candidate collection of current PU, and from reference frame candidate collection, selects the optimum reference frame of current PU;
Wherein, the down-sampling reference frame set of described current PU comprises: the optimum reference frame of the PU in the down-sampled images of each stage resolution ratio that current PU is corresponding;
The PU of the down-sampled images of each stage resolution ratio that described current PU is corresponding comprises: in each down-sampling video image, the PU overlapping with the image-region shared by current PU or partially overlap.
In addition, the reference frame of the adjacent block of current PU, when carrying out reference frame decision-making to initial resolution video image, is also included in the reference frame candidate collection of described current PU by described predicting unit.
The present invention also provides a kind of video coding apparatus, comprises: predicting unit in this device; Wherein:
Described predicting unit, for when carrying out reference frame decision-making for video image to be encoded, by the reference frame of the adjacent block of current PU alternatively reference frame, and therefrom selects the optimum reference frame of current PU.
In sum, the present invention proposes a kind of quick multi-reference frame decision scheme that significantly can reduce operand, when the program is applied to video coding system, the candidate reference frame of present encoding block is obtained by the coded data of down-sampling frame, under the prerequisite of not sacrificing inter prediction effect, decrease the quantity of candidate reference frame, improve the speed of reference frame decision-making; In addition, the present invention can also obtain the candidate reference frame of present encoding block by the neighboring prediction block of present encoding block, further increase the prediction accuracy of reference frame decision-making.Adopt method of the present invention and device, the reference frame decision-making time of 50% can be saved on existing x265 coding framework.
Accompanying drawing explanation
Fig. 1 is forward and backward estimation schematic diagram in prior art;
Fig. 2 is the prediction mode schematic diagram of H.264 P sheet and B sheet in agreement;
Fig. 3 is the method for video coding flow chart of the present invention first embodiment of the method;
Fig. 4 is the schematic diagram adopting the mode of decimation to carry out down-sampling;
Fig. 5 is the spatial domain position candidate schematic diagram of the movable information of HEVC;
Fig. 6 is the method for video coding flow chart of the present invention second embodiment of the method;
Fig. 7 is the structural representation of video coding apparatus first embodiment of the present invention;
Fig. 8 is the structural representation of video coding apparatus second embodiment of the present invention.
Embodiment
The present invention mainly comprises two aspects: the decision-making of level reference frame, the decision-making of adjacent block reference frame.
The decision-making of above-mentioned level reference frame refers to, carries out down-sampling to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratios; For the PU (Prediction Unit, predicting unit) in the down-sampled images of each stage resolution ratio carries out reference frame decision-making respectively, obtain the reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio; When carrying out reference frame decision-making to initial resolution video image, using the set of optimum reference frame of PU in the down-sampled images of each stage resolution ratio that overlaps with the image-region corresponding to current PU or partially overlap in the down-sampled images of each stage resolution ratio and the common factor of the reference frame lists (List0 and List1) of current PU as with reference to frame candidate collection, carry out the decision-making of reference frame.
The decision-making of above-mentioned adjacent block reference frame refers to, when carrying out reference frame decision-making to video image, being reference frame candidate collection, carrying out the decision-making of reference frame by the set of reference frames cooperation of the adjacent block of current PU.
Below in conjunction with accompanying drawing, embodiments of the present invention is described in detail.
first embodiment of the method
Fig. 3 is the method for video coding flow chart of the present invention first embodiment of the method; As shown in Figure 3, the method comprises:
Step 301: to initial resolution video image (L to be encoded 0level video image) sequence carries out down-sampling, generates the down-sampling video image of each stage resolution ratio: L 1..., L dlevel video image; D be more than or equal to 1 integer;
In this step, different down-sampling ratio (i.e. the ratio of source resolution and target resolution) can be adopted to carry out down-sampling.When down-sampling ratio is Ds:1, represent every Ds pixel sampling 1 pixel, Ds be greater than 1 integer.
In the present embodiment, in order to retain the primary characteristic (such as, the textural characteristics of video) of video better, making Ds=2, namely adopting the mode of decimation to carry out down-sampling, as shown in Figure 4.
In addition, in order to accelerate the speed of computing, only down-sampling process being done to primitive frame in the present embodiment, reconstructed frame is not dealt with.
Step 302: to down-sampling video image (L at different levels 1~ L dlevel video image) carry out reference frame decision-making, obtain the reference frame of each PU (can be described as down-sampled images PU) in down-sampling video image at different levels;
Specifically, for L deach PU in (1≤d≤D) level video image, can carry out reference frame decision-making in the following way:
L is obtained by estimation dmatch block in current PU in level video image each reference frame in reference frame lists, and calculate the coding cost value (Cost) of each match block, the reference frame corresponding to match block selecting coding cost value minimum is as the optimum reference frame of this PU;
If the reference frame in current reference frame list comprises: ref 0, ref 1, ref 2... .ref n;
The coding cost value of the match block in each reference frame is respectively: Cost 0, Cost 1, Cost 2... .Cost n;
Then L din level video image, the reference frame of current PU is:
ref = ArgMin ref = 0,1,2 . . . n ( Cost ref ) ; (formula 1)
That is, the reference frame corresponding to match block (blocks and optimal matching blocks) minimum for coding cost value is defined as optimum reference frame.
In the present embodiment, for L d(1≤d≤D) level video image, with the coding cost value (Cost) of absolute length chang (SAD) for the above-mentioned each match block of criterion calculation:
SAD ( s , c ( MV ) ) = &Sigma; x = 1 , y = 1 Bx , By | s [ x , y ] - c [ x - MV x , y - MV y ] | ; (formula 2)
Wherein, s is current initial data of carrying out encoding, and c is the data of the reference frame for carrying out motion compensation of coding and rebuilding; Bx, By are the wide high information of current prediction block; MV is the motion vector of candidate; MVx is the x coordinate of motion vector; MVy is the y coordinate of motion vector.
That is, for L deach PU in (1≤d≤D) level video image, adopts absolute length chang (SAD) minimum principle to select the blocks and optimal matching blocks of current PU, and then selects the optimum reference frame of current PU.
Step 303: to initial resolution video image (L 0level video image) in each PU carry out reference frame decision-making before, first determine the corresponding PU of each PU in each down-sampling video image, and the reference frame of each corresponding PU;
Specifically, by L 0current PU in level video image is denoted as PU 0, the corresponding PU of PU0 in each down-sampling video image comprises: in down-sampling video image, with area coincidence shared by PU0 or the down-sampling video image PU that partially overlaps.
PU 0multiple corresponding PU can be comprised in each down-sampling video image, be denoted as down-sampling PU and gather P b:
P b={ PU b1..., PU bn; Set P bin each PU bican from identical or different down-sampling stage resolution ratio, 1≤i≤n, n be more than or equal to 1 integer.
Set P bmiddle different PU bican corresponding identical or different reference frame, can will gather P bcorresponding reference frame set is denoted as down-sampling reference frame set R b:
R b={ R b1..., R bm, m be more than or equal to 1 and be less than or equal to 15 integer.
Step 304: by initial resolution video image (L 0level video image) in the down-sampling reference frame set of each PU include in the reference frame candidate collection of corresponding PU with the reference frame jointly comprised in reference frame lists;
Specifically, by L 0current PU in level video image is denoted as PU 0, PU 0down-sampling set of reference frames be combined into R b={ R b1..., R bm; Reference frame lists List0={ref 01... ref 0k, reference frame lists List1={ref 11... ref 1j, k and j is positive integer, and k+j is less than or equal to 15; Then, PU 0reference frame candidate collection C 0:
C 0=R b∩ (List0 ∪ List1); Wherein, ∩ represents common factor, and ∪ represents union.
That is both R will be included in bin be included in again reference frame in List0, and be both included in R bin the reference frame be included in again in List1 include PU in 0reference frame candidate collection C 0in.
During specific implementation, also can first all be labeled as need not search for reference to each reference frame in frame list List0 and List1, then use R bin the reference frame that comprises in the reference frame that comprises and reference frame lists List0 and List1 contrast, if the reference frame comprised in reference frame lists List0 and List1 is included in R simultaneously bin, then the flag update with reference to reference frame corresponding in frame list is needs search.
Step 305: to initial resolution video image (L 0level video image) in each PU, use the reference frame of its adjacent block to upgrade the reference frame candidate collection of this PU, the reference frame by each PU adjacent block also includes the reference frame candidate collection of corresponding PU in;
Fig. 5 is the spatial domain position candidate schematic diagram of the movable information of HEVC.
{ a as shown in Figure 5 can be adopted in the present embodiment 0, a 1, b 0, b 1, b 2as the adjacent block of PU, and the reference frame of each adjacent block is included in the reference frame candidate collection of corresponding PU.
Specifically, by L 0current PU in level video image is denoted as PU 0, PU 0reference frame candidate collection C 0=R b∩ (List0 ∪ List1); By PU 0the set corresponding to reference frame of adjacent block be denoted as adjacent block reference frame set R n={ R n1..., R ns, s be less than or equal to 5 integer;
Then: C 0={ R b∩ (List0 ∪ List1) } ∪ R n;
That is, on the basis of step 304, R will be included in further nin reference frame include PU in 0reference frame candidate collection C 0in.
During specific implementation, the reference frame comprised in the reference frame of the adjacent block of PU and reference frame lists List0 and List1 can be used to contrast, if the reference frame comprised in reference frame lists List0 and List1 is the reference frame of the adjacent block of PU simultaneously, then the flag update with reference to reference frame corresponding in frame list is needs search.
The reference frame of adjacent block is included in reference frame candidate collection and can reduce the probability losing optimum reference frame.
This step is optional step.
Step 306: use initial resolution video image (L 0level video image) in the reference frame candidate collection of each PU carry out reference frame decision-making, obtain the optimum reference frame of each PU;
In this step, for L 0level video image adopts rate distortion costs value to calculate the coding cost value of each match block, namely selects blocks and optimal matching blocks with rate distortion costs minimum principle; And the coding cost value of each match block is selected optimum reference frame in conjunction with bit number shared by reference frame number.
Specifically, by L 0current PU in level video image is denoted as PU 0, PU 0reference frame candidate collection C 0={ ref 0... ref n, n be less than 15 integer;
PU is obtained by estimation 0at reference frame candidate collection C 0in each reference frame in match block, and calculate the coding cost value (Cost) of each match block: Cost 0, Cost 1, Cost 2... .Cost n;
Then PU 0reference frame be:
ref = ArgMin ref = 0,1,2 . . . n ( Cost ref + &lambda; &times; R ( ref ) ) ; (formula 3)
Wherein, R (ref) represents the bit number that reference frame number takies; λ is Lagrangian constant.
That is, coding cost value and the minimum reference frame of λ × R (ref) sum are defined as optimum reference frame.
In the present embodiment, for L 0level video image, with the coding cost value (Cost) of rate distortion costs value J for the above-mentioned each match block of criterion calculation:
J (MV, λ)=SAD (s, c (MV))+λ × R (MV-PMV); (formula 4)
Wherein, MV is the motion vector of candidate, and λ is Lagrangian constant, and PMV is median prediction vector, and R (MV-PMV) represents the bit number that motion vector differential coding may expend.
Step 307: use initial resolution video image (L 0level video image) in the reference frame of each PU carry out interframe encode, and carry out follow-up Video coding process.
second embodiment of the method
Fig. 6 is the method for video coding flow chart of the present invention second embodiment of the method; As shown in Figure 6, the method comprises:
Step 601: treat each PU in encode video image, use the reference frame of its adjacent block to determine the reference frame candidate collection of this PU, the reference frame by each PU adjacent block includes the reference frame candidate collection of corresponding PU in;
{ a as shown in Figure 5 can be adopted in the present embodiment 0, a 1, b 0, b 1, b 2as the adjacent block of PU, and the reference frame of each adjacent block is included in the reference frame candidate collection of corresponding PU.
During specific implementation, can first all be labeled as need not search for reference to each reference frame in frame list List0 and List1, then the reference frame comprised in the reference frame of the adjacent block of PU and reference frame lists List0 and List1 is used to contrast, if the reference frame comprised in reference frame lists List0 and List1 is the reference frame of the adjacent block of PU simultaneously, then the flag update with reference to reference frame corresponding in frame list is needs search.
Step 602: use the reference frame candidate collection of each PU in video image to be encoded to carry out reference frame decision-making, obtain the optimum reference frame of each PU;
In this step, adopt rate distortion costs value to calculate the coding cost value of each match block, namely select blocks and optimal matching blocks with rate distortion costs minimum principle; And the coding cost value of each match block is selected optimum reference frame in conjunction with bit number shared by reference frame number.
Specifically, the current PU in video image to be encoded is denoted as PU 0, PU 0reference frame candidate collection C 0={ ref 0... ref n, n be less than 15 integer;
PU is obtained by estimation 0at reference frame candidate collection C 0in the match block of each reference frame, and calculate the coding cost value (Cost) of each match block: Cost 0, Cost 1, Cost 2... .Cost n;
Then PU 0reference frame be:
ref = ArgMin ref = 0,1,2 . . . n ( Cost ref + &lambda; &times; R ( ref ) ) ;
Wherein, R (ref) represents the bit number that reference frame number takies; λ is Lagrangian constant.
That is, coding cost value and the minimum reference frame of λ × R (ref) sum are defined as optimum reference frame.
In the present embodiment, with the coding cost value (Cost) of rate distortion costs value J for the above-mentioned each match block of criterion calculation:
J(MV,λ)=SAD(s,c(MV))+λ×R(MV-PMV);
Wherein, MV is the motion vector of candidate, and λ is Lagrangian constant, and PMV is median prediction vector, and R (MV-PMV) represents the bit number that motion vector differential coding may expend.
Step 603: use the reference frame of each PU in video image to be encoded to carry out interframe encode, and carry out follow-up Video coding process.
device first embodiment
Fig. 7 is the structural representation of video coding apparatus first embodiment of the present invention; As shown in Figure 7, comprise in this device: downsampling unit, predicting unit; Wherein:
Downsampling unit, for carrying out down-sampling to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratio;
Predicting unit, carries out reference frame decision-making respectively for the down-sampling video image for each stage resolution ratio, obtains the optimum reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio;
Predicting unit, also for when carrying out reference frame decision-making to initial resolution video image, the reference frame jointly comprised in the down-sampling reference frame set of current PU and current reference frame list is included in the reference frame candidate collection of current PU, and from reference frame candidate collection, selects the optimum reference frame of current PU;
Wherein, the down-sampling reference frame set of described current PU comprises: the optimum reference frame of the PU in the down-sampled images of each stage resolution ratio that current PU is corresponding;
The PU of the down-sampled images of each stage resolution ratio that described current PU is corresponding comprises: in each down-sampling video image, the PU overlapping with the image-region shared by current PU or partially overlap.
In addition, the reference frame of the adjacent block of current PU, when carrying out reference frame decision-making to initial resolution video image, is also included in the reference frame candidate collection of described current PU by predicting unit.
device second embodiment
Fig. 8 is the structural representation of video coding apparatus second embodiment of the present invention; As shown in Figure 8, comprise in this device: predicting unit; Wherein:
Predicting unit, for when carrying out reference frame decision-making for video image to be encoded, by the reference frame of the adjacent block of current PU alternatively reference frame, and therefrom selects the optimum reference frame of current PU.
In sum, the present invention proposes a kind of quick multi-reference frame decision scheme that significantly can reduce operand, when the program is applied to video coding system, the candidate reference frame of present encoding block is obtained by the coded data of down-sampling frame, under the prerequisite of not sacrificing inter prediction effect, decrease the quantity of candidate reference frame, improve the speed of reference frame decision-making; In addition, the present invention can also obtain the candidate reference frame of present encoding block by the neighboring prediction block of present encoding block, further increase the prediction accuracy of reference frame decision-making.Adopt method of the present invention and device, the reference frame decision-making time of 50% can be saved on existing x265 coding framework.

Claims (10)

1. a method for video coding, is characterized in that, the method comprises:
Down-sampling is carried out to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratio;
For the down-sampling video image of each stage resolution ratio carries out reference frame decision-making respectively, obtain the optimum reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio;
When reference frame decision-making is carried out to initial resolution video image, the reference frame jointly comprised in the down-sampling reference frame set of current PU and current reference frame list is included in the reference frame candidate collection of current PU, and from reference frame candidate collection, selects the optimum reference frame of current PU;
Wherein, the down-sampling reference frame set of described current PU comprises: the optimum reference frame of the PU in the down-sampled images of each stage resolution ratio that current PU is corresponding;
The PU of the down-sampled images of each stage resolution ratio that described current PU is corresponding comprises: in each down-sampling video image, the PU overlapping with the image-region shared by current PU or partially overlap.
2. method according to claim 1, is characterized in that,
When reference frame decision-making is carried out to initial resolution video image, also the reference frame of the adjacent block of current PU is included in the reference frame candidate collection of described current PU.
3. method according to claim 1, is characterized in that,
When down-sampling video image for each stage resolution ratio carries out reference frame decision-making respectively, take absolute length chang as the coding cost value of the match block of criterion calculation PU in each candidate reference frame, and candidate reference frame corresponding to the match block selecting coding cost value minimum is as the optimum reference frame of corresponding PU.
4. method according to claim 1, is characterized in that,
For initial resolution video image carry out reference frame decision-making time, take rate distortion costs as the coding cost value of the match block of criterion calculation PU in reference frame candidate collection in each candidate reference frame, and the candidate reference frame corresponding to match block selecting coding cost value minimum with λ × R (ref) sum is as the optimum reference frame of corresponding PU;
Wherein, λ is Lagrangian constant, and R (ref) represents the bit number that reference frame number ref takies.
5. method according to claim 1, is characterized in that,
The initial resolution video image carrying out described down-sampling is the primitive frame of video image.
6. a method for video coding, is characterized in that, the method comprises:
For video image to be encoded carry out reference frame decision-making time, by the reference frame of the adjacent block of current PU alternatively reference frame, and therefrom select the optimum reference frame of current PU.
7. method according to claim 6, is characterized in that,
For video image to be encoded carry out reference frame decision-making time, take rate distortion costs as the coding cost value of the match block of the current PU of criterion calculation in each candidate reference frame, and select coding cost value and the minimum candidate reference frame corresponding to match block of λ × R (ref) sum as the optimum reference frame of current PU;
Wherein, λ is Lagrangian constant, and R (ref) represents the bit number that reference frame number takies.
8. a video coding apparatus, is characterized in that, comprises in this device: downsampling unit and predicting unit; Wherein:
Described downsampling unit, for carrying out down-sampling to initial resolution video image to be encoded, generates the down-sampling video image of one or more stage resolution ratio;
Described predicting unit, carries out reference frame decision-making respectively for the down-sampling video image for each stage resolution ratio, obtains the optimum reference frame of the PU of each picture frame in the down-sampled images of each stage resolution ratio;
Described predicting unit, also for when carrying out reference frame decision-making to initial resolution video image, the reference frame jointly comprised in the down-sampling reference frame set of current PU and current reference frame list is included in the reference frame candidate collection of current PU, and from reference frame candidate collection, selects the optimum reference frame of current PU;
Wherein, the down-sampling reference frame set of described current PU comprises: the optimum reference frame of the PU in the down-sampled images of each stage resolution ratio that current PU is corresponding;
The PU of the down-sampled images of each stage resolution ratio that described current PU is corresponding comprises: in each down-sampling video image, the PU overlapping with the image-region shared by current PU or partially overlap.
9. device according to claim 8, is characterized in that,
The reference frame of the adjacent block of current PU, when carrying out reference frame decision-making to initial resolution video image, is also included in the reference frame candidate collection of described current PU by described predicting unit.
10. a video coding apparatus, is characterized in that, comprises: predicting unit in this device; Wherein:
Described predicting unit, for when carrying out reference frame decision-making for video image to be encoded, by the reference frame of the adjacent block of current PU alternatively reference frame, and therefrom selects the optimum reference frame of current PU.
CN201410855788.4A 2014-12-31 2014-12-31 Video coding method and device Pending CN104602019A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410855788.4A CN104602019A (en) 2014-12-31 2014-12-31 Video coding method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410855788.4A CN104602019A (en) 2014-12-31 2014-12-31 Video coding method and device

Publications (1)

Publication Number Publication Date
CN104602019A true CN104602019A (en) 2015-05-06

Family

ID=53127458

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410855788.4A Pending CN104602019A (en) 2014-12-31 2014-12-31 Video coding method and device

Country Status (1)

Country Link
CN (1) CN104602019A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106454349A (en) * 2016-10-18 2017-02-22 哈尔滨工业大学 Motion estimation block matching method based on H.265 video coding
CN106817585A (en) * 2015-12-02 2017-06-09 掌赢信息科技(上海)有限公司 A kind of method for video coding of utilization long term reference frame, electronic equipment and system
CN107483953A (en) * 2017-10-10 2017-12-15 司马大大(北京)智能系统有限公司 Inter frame motion estimation method, apparatus and electronic equipment
CN107920254A (en) * 2016-10-11 2018-04-17 北京金山云网络技术有限公司 A kind of method for estimating, device and video encoder for B frames
CN108848376A (en) * 2018-06-20 2018-11-20 腾讯科技(深圳)有限公司 Video coding, coding/decoding method, device and computer equipment
WO2020034921A1 (en) * 2018-08-17 2020-02-20 北京金山云网络技术有限公司 Motion estimation method, device, electronic apparatus, and computer readable storage medium
CN111263151A (en) * 2020-04-26 2020-06-09 腾讯科技(深圳)有限公司 Video encoding method, video encoding device, electronic device, and computer-readable storage medium
CN111642141A (en) * 2018-12-31 2020-09-08 阿里巴巴集团控股有限公司 Resolution adaptive video coding
WO2020207451A1 (en) * 2019-04-11 2020-10-15 福州瑞芯微电子股份有限公司 H.265 encoding method and apparatus
WO2020253728A1 (en) * 2019-06-20 2020-12-24 杭州海康威视数字技术股份有限公司 Encoding method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009152805A (en) * 2007-12-19 2009-07-09 Canon Inc Video coder, control method of video coder, and computer program
CN102447902A (en) * 2011-09-30 2012-05-09 曾志华 Method for selecting reference field and acquiring time-domain motion vector
CN103561270A (en) * 2013-11-08 2014-02-05 武汉大学 Coding control method and device for HEVC

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009152805A (en) * 2007-12-19 2009-07-09 Canon Inc Video coder, control method of video coder, and computer program
CN102447902A (en) * 2011-09-30 2012-05-09 曾志华 Method for selecting reference field and acquiring time-domain motion vector
CN103561270A (en) * 2013-11-08 2014-02-05 武汉大学 Coding control method and device for HEVC

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106817585A (en) * 2015-12-02 2017-06-09 掌赢信息科技(上海)有限公司 A kind of method for video coding of utilization long term reference frame, electronic equipment and system
CN107920254A (en) * 2016-10-11 2018-04-17 北京金山云网络技术有限公司 A kind of method for estimating, device and video encoder for B frames
CN107920254B (en) * 2016-10-11 2019-08-30 北京金山云网络技术有限公司 A kind of method for estimating, device and video encoder for B frame
CN106454349B (en) * 2016-10-18 2019-07-16 哈尔滨工业大学 A kind of estimation block matching method based on H.265 Video coding
CN106454349A (en) * 2016-10-18 2017-02-22 哈尔滨工业大学 Motion estimation block matching method based on H.265 video coding
CN107483953A (en) * 2017-10-10 2017-12-15 司马大大(北京)智能系统有限公司 Inter frame motion estimation method, apparatus and electronic equipment
CN107483953B (en) * 2017-10-10 2019-11-29 司马大大(北京)智能系统有限公司 Inter frame motion estimation method, apparatus and electronic equipment
CN108848376B (en) * 2018-06-20 2022-03-01 腾讯科技(深圳)有限公司 Video encoding method, video decoding method, video encoding device, video decoding device and computer equipment
CN108848376A (en) * 2018-06-20 2018-11-20 腾讯科技(深圳)有限公司 Video coding, coding/decoding method, device and computer equipment
WO2020034921A1 (en) * 2018-08-17 2020-02-20 北京金山云网络技术有限公司 Motion estimation method, device, electronic apparatus, and computer readable storage medium
CN110839155B (en) * 2018-08-17 2021-12-03 北京金山云网络技术有限公司 Method and device for motion estimation, electronic equipment and computer-readable storage medium
CN110839155A (en) * 2018-08-17 2020-02-25 北京金山云网络技术有限公司 Method and device for motion estimation, electronic equipment and computer-readable storage medium
CN111642141A (en) * 2018-12-31 2020-09-08 阿里巴巴集团控股有限公司 Resolution adaptive video coding
WO2020207451A1 (en) * 2019-04-11 2020-10-15 福州瑞芯微电子股份有限公司 H.265 encoding method and apparatus
WO2020253728A1 (en) * 2019-06-20 2020-12-24 杭州海康威视数字技术股份有限公司 Encoding method and device
CN111263151A (en) * 2020-04-26 2020-06-09 腾讯科技(深圳)有限公司 Video encoding method, video encoding device, electronic device, and computer-readable storage medium
CN111263151B (en) * 2020-04-26 2020-08-25 腾讯科技(深圳)有限公司 Video encoding method, video encoding device, electronic device, and computer-readable storage medium

Similar Documents

Publication Publication Date Title
CN104602019A (en) Video coding method and device
US11350124B2 (en) Image processing method and image processing device
CN113016185B (en) Control of Merge in motion vector differential mode
TWI827681B (en) Syntax reuse for affine mode with adaptive motion vector resolution
TWI750494B (en) Constraints for usage of updated motion information
CN113196773B (en) Motion vector accuracy in Merge mode with motion vector difference
US11070834B2 (en) Low-complexity method for generating synthetic reference frames in video coding
CN113039787A (en) Decoder-side motion vector refinement
CN110741640B (en) Optical flow estimation for motion compensated prediction in video coding
CN113678452A (en) Constraint on decoder-side motion vector refinement
CN101867813B (en) Multi-view video coding method oriented for interactive application
CN110740321B (en) Motion prediction based on updated motion vectors
CN110312132A (en) A kind of decoding method, device and its equipment
CN110545425B (en) Inter-frame prediction method, terminal equipment and computer storage medium
CN104811726B (en) The candidate motion vector selection method of estimation in frame per second conversion
CN111818342A (en) Inter-frame prediction method and prediction device
CN112004097B (en) Inter-frame prediction method, image processing apparatus, and computer-readable storage medium
CN101895749B (en) Quick parallax estimation and motion estimation method
CN104581180A (en) Video coding method and device
KR101544158B1 (en) Method for searching bidirectional motion using multiple frame and image apparatus with the same technique
CN103152566B (en) A kind of video frame rate method for improving
CN104185993A (en) Image processing device and method, and recording medium
TW202021346A (en) Mv precision in bio
CN103796026A (en) Motion estimation method based on double reference frames
CN111510726B (en) Coding and decoding method and equipment thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned
AD01 Patent right deemed abandoned

Effective date of abandoning: 20181116