CN104796623B - Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method - Google Patents

Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method Download PDF

Info

Publication number
CN104796623B
CN104796623B CN201510054621.2A CN201510054621A CN104796623B CN 104796623 B CN104796623 B CN 104796623B CN 201510054621 A CN201510054621 A CN 201510054621A CN 104796623 B CN104796623 B CN 104796623B
Authority
CN
China
Prior art keywords
image
displacement
matching
block
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510054621.2A
Other languages
Chinese (zh)
Other versions
CN104796623A (en
Inventor
王炜
李靖
张茂军
熊志辉
徐玮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN201510054621.2A priority Critical patent/CN104796623B/en
Publication of CN104796623A publication Critical patent/CN104796623A/en
Application granted granted Critical
Publication of CN104796623B publication Critical patent/CN104796623B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention belongs to Image Information Processing field, provide a kind of splicing video based on pyramid Block-matching and functional optimization and go structural deviation method, on the basis of global registration, for the overlapping region structural deviation problem that parallax in video-splicing causes, pyramid block matching method is utilized to obtain the corresponding relation of two width images at some image blocks of overlapping region, and represent its position deviation with the displacement of two dimension, then minimized by energy functional and discrete displacement is diffused into whole image range, the displacement that last foundation obtains does deformation operation to image, obtain the frame of video of overlapping region non-structure deviation.Meanwhile, consider in the frame that deformation operates in the process of establishing of energy functional expression formula and interframe constraint, the video spliced there will not be abnormal distortion and shake when playing simultaneously.

Description

Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method
Technical field:
The invention belongs to Image Information Processing field, relate to video-splicing method, the splicing video particularly related to based on pyramid Block-matching and functional optimization goes structural deviation method.
Background technology
Image mosaic, as an important research direction of computer vision subject, receives and pays close attention to widely and deep research.Up to the present, existing a series of Theories and methods for different application demand is proposed by scholars, and achieves good experiment effect, and the algorithm of some of them maturation has been integrated in the business software of image processing field.Therefore, can think that image mosaic has been a more ripe research field.And for video-splicing problem, although just handling object is replaced with the video of multichannel video camera sync pulse jamming by image, but the scale of problem and complexity all sharply promote, be limited to the disposal ability of theory basis and hardware device, be also in the starting stage to the research of video-splicing.Existing video-splicing method is the simple expansion of certain image split-joint method on time dimension mostly, and the new features bring video-splicing problem and constraint lack to be discussed fully, causes splicing effect not good.
Because the photocentre of each video camera not exclusively overlaps, there is certain dislocation and distortion in the imaging of Same Scene in different cameras shooting, claims this difference to be parallax.The existence of parallax makes each road picture being mapped to same viewing plane by projective transformation not overlap completely, namely there is structural deviation.Therefore, in order to improve the joining quality of each frame video pictures, the impact of structural deviation must be eliminated.
Conventional images joining method utilizes optimum splicing seams technology to address this problem usually, and namely avoiding structural deviation affects serious region, and carries out mixing operation near optimum piece, obtains the good stitching image of quality.But the scene for video-splicing, in most cases, in overlapping region is dynamic, the i.e. optimum piece position of consecutive frame inconsistent, due to the impact of parallax, if the image in same region is always alternately got from different original image, the exception of video pictures can be caused to glimmer.Even if increase space-time restriction, make the position of front and back frame piece relatively stable, when there being moving object to pass from piece both sides, always there is piece only cannot adjust the situation just can avoiding structural deviation by a small margin.To sum up, by the structural deviation problem of avoiding in single image splicing, in video-splicing saliency out, joining quality can be affected.
Summary of the invention:
The structural deviation that the present invention is directed to overlapping region in video-splicing can affect the reality of joining quality, propose a kind of splicing video based on pyramid Block-matching and functional optimization and go structural deviation method, keeping the impact eliminating structural deviation between frame of video under conforming prerequisite, high-quality video-splicing can be completed.Strategy of the present invention is operated by image deformation, attempts to eliminate instead of avoid structural deviation, and concrete grammar is after global registration, and according to the result of overlapping region Block-matching, counterweight projected image carries out local directed complete set.In addition, owing to introducing deformation operation, for guaranteeing that the splicing video exported there will not be abnormal distortion when playing and rocks, the present invention adds interframe constraint when calculating deformation.
First the present invention utilizes pyramid block matching method to obtain the corresponding relation of two width images at some image blocks of overlapping region, and represents its position deviation with the displacement of two dimension.Then minimized by energy functional, solve the displacement obtaining whole image range.The last displacement according to each position carries out deformation operation to image, obtains the frame of video of overlapping region non-structure deviation.
Particularly, the technical solution used in the present invention is:
Splicing video based on pyramid Block-matching and functional optimization goes a structural deviation method, it is characterized in that comprising the following steps:
S1. after extracting re-projection, the lap of two width images carries out pyramid Block-matching, obtains discrete structure corresponding relation and the motion vector of discrete point;
S2., on the discrete displacement vector basis that step S1 obtains, each item constraint of comprehensive video splicing, sets up the energy functional of deformation diffusion, obtains the displacement in whole image range by minimization of energy functional;
S3. the displacement obtained according to step S2 carries out deformation operation to input picture, carries out re-projection and fusion, obtain the stitching image of current each frame of video to the image after deformation.
Further, the concrete grammar of described step S1 comprises:
S11. the image pyramid of two width image laps after re-projection is set up;
With I s, I tthe lap of two width images after expression re-projection, first to I sand I tcarry out multiple dimensioned convergent-divergent:
I s , k + 1 = g σ ⊗ I s , k k = 0,1 , . . . L - 1 I s , k + 1 ( i , j ) = I s , k + 1 ′ ( 2 i - 1,2 j - 1 ) i = 1,2 , . . . H k , j = 1,2 , . . . W k
Wherein k is current layer number, and L is convolution operation the total degree performed, and I s, 0=I s, g σrepresent that variance is the gaussian kernel function of σ, H kwith W kbe respectively the height and the width of kth tomographic image; To I tdo same operation, set up pyramid I t, 0, I t, 1... I t,L;
S12. pass through successively Block-matching, obtain the corresponding relation between current viewing plane epigraph block;
Descending with yardstick, namely number of plies k is by the order of L to 0, to one group of correspondence image I of every one deck s,kwith I s,koperation block matching algorithm, obtains the matching relationship of current layer, and with the displacement of two dimension represent the position deviation obtained, wherein V k,x() and V k,y() is vector component in level and vertical both direction, numbering m=1,2 ... M kwith n=1,2 ... N kbe respectively the Position Number of match block vertical and horizontal, M kwith N kbe respectively the maximum numbering of kth layer vertical and horizontal; With for as lower one deck (2m-1,2n-1), (2m-1,2n), (2m, 2n-1) and (2m, 2n) initial value of the four pairs of match block displacement search in place, successively searches for, finally obtains the original image i.e. Block-matching displacement of the 0th tomographic image and have:
x ^ t ( m , n ) = x ^ s ( m , n ) + V x ( m , n ) y ^ t ( m , n ) = y ^ s ( m , n ) + V y ( m , n )
Wherein, with be respectively the coordinate at each match block center under the re-projection image place viewing plane coordinate system of source images and target image; The result of Block-matching is discrete, the result of Block-matching is represented with following one dimension numbering form:
x ^ t , p = x ^ s , p + V x , p y ^ t , p = y ^ s , p + V y , p p = 1,2 , . . . P m
Wherein, P mfor the quantity that match block is right;
S13. under each match block center being transformed to original image coordinate system to be deformed according to known global mapping relation, and displacement discrete under calculating original image coordinate;
For obtaining I scorresponding original image I s, orgin the deformation quantity of each pixel, need viewing plane coordinate i is mapped to according to the mapping relations that the global registration stage obtains s, orgimage coordinate on, obtain corresponding coordinates of original image coordinates (x s,p, y s,p) and (x t,p, y t,p), now, I s, orgmiddle target location (x t,p, y t,p) corresponding displacement can be obtained by following formula:
U x , p = x t , p - x s , p U y , p = y t , p - y s , p .
Further, the concrete grammar of described step S2 comprises:
S21. combined block mates in frame that deformation action need in the discrete prior information and video-splicing obtained meets and interframe retrain, and setting up need minimized energy functional expression formula;
Deformation operation is carried out to input picture, needs to obtain I s, orgdisplacement function in full figure scope Ω wherein the separate and computational process of two components consistent, unifiedly represent one of them with scalar function U at this; Three constraints below function U demand fulfillment:
(1) U is at (x t,p, y t,p) place value should with U pclose as much as possible;
(2) in image, the U value of adjacent position should be close as much as possible;
(3) the U value of consecutive frame should be close as much as possible;
Wherein retrain (1) and represent the discrete displacement prior information obtained by Block-matching in step S1, the continuity constraint that constraint (2) is frame intrinsic displacement amount, the continuity constraint that constraint (3) is interframe displacement;
The energy functional component of constraint (1) correspondence is:
E D = ∫ ∫ Ω ( U - U ~ ) 2 dxdy
Wherein, for be defined on Ω by the reference displacement function that gridding interpolation obtains;
The energy functional component of constraint (2) correspondence is:
E S = ∫ ∫ Ω | ▿ U | 2 dxdy = ∫ ∫ Ω ( ( dU dx ) 2 + ( dU dy ) 2 ) dxdy
The energy functional component of constraint (3) correspondence is:
E C = ∫ ∫ Ω ( U - U pre ) 2 dxdy
Comprehensively above-mentioned three constraints, need minimized total energy functional to be:
E = E D + λ 1 E S + λ 2 E C = ∫ ∫ Ω ( ( U - U ~ ) 2 + λ 1 | ▿ U | 2 + λ 2 ( U - U pre ) 2 ) dxdy
S22. according to variation principle, energy functional minimization problem is converted to solving of partial differential equation;
According to variation principle, according to Euler-Lagrange (Euler-Language) formula, can be the Solve problems of following partial differential equation by the problem equivalent minimizing E:
Above-mentioned partial differential equation are ellipse partial differential equation of second order;
S23. the partial differential equation that difference method numerical solution obtains in step S22 are utilized
The partial differential equation obtained in step S22 are conitnuous forms, carry out discretization, and solve numerical solution to it;
Used first-order difference form is as follows:
∂ U ∂ x = U ( x + h , . . . ) - U ( x , . . . ) h + O ( h )
Finite Difference Scheme of Second Order is:
∂ 2 U ∂ x 2 = U ( x + h , . . . ) - 2 U ( x , . . . ) + U ( x - h , . . . ) 2 h + O ( h 2 )
Especially, the difference scheme of Laplacian Δ is:
ΔU = U ( x + h , y ) + U ( x - h , y ) + U ( x , y + h ) + U ( x , y - h ) - 4 U ( x , y ) 2 h + O ( h 2 )
Wherein, difference interval h gets the width of a pixel, h=1 in image coordinate system, by substituting into the difference scheme of above-mentioned differential operator, the partial differential equation obtained in S22 is converted to difference equation; This difference equation is linear, and it solves and realizes by overrelaxation alternative manner;
Displacement vector components U is solved respectively according to above-mentioned steps xand U ythe partial differential equation obtained in corresponding S22, can obtain original image I s, orgthe motion vector of each pixel
Further, the concrete grammar of described step S3 comprises:
S31. according to the displacement obtained in step S2, deformation operation is carried out to input picture;
According to I s, orgdisplacement function within the scope of full images can to I s, orgcarry out deformation operation, image I ' after distortion s, orgmiddle position (x ' s,p, y ' s,p) pixel value at place can from I s, orgas upper/lower positions value:
x s , p = x s , p ′ - U x , p y s , p = y s , p - U y , p
Usually (x s,p, y s,p) be non-integer position, the value of this position needs to do bilinear interpolation by neighbouring pixel and obtains;
S32. demarcated the projective parameter obtained according to video camera itself, the image projection after distortion has been merged to unified viewing plane;
By the image I ' after distortion s, orgwith target image I t, orgaccording to projecting to same viewing plane, now, the overlapping region of two images is non-structure deviations, merges, namely obtain the stitching image of present frame to two width re-projection image applications Multiscale Fusions or gradient field fusion method; Step S1 to S3 is suitable for each group synchronization frame of video, the video spliced can be obtained.
The method of the invention is on the basis of global registration, for the overlapping region structural deviation problem that parallax in video-splicing causes, utilize pyramid Block-matching and energy functional to minimize and certain deformation operation is carried out to input picture, the non-structure deviation after re-projection to viewing plane again of the image after distortion can be made, finally coordinate re-projection and mixing operation, complete high-quality video-splicing.Meanwhile, consider in the frame that deformation operates in the process of establishing of energy functional expression formula and interframe constraint, the video spliced there will not be abnormal distortion and shake when playing simultaneously.
Accompanying drawing illustrates:
Fig. 1 is overview flow chart of the present invention.
Fig. 2 is the matching result schematic diagram of each layer of pyramid Block-matching.
Embodiment
Below in conjunction with accompanying drawing and example, the specific embodiment of the present invention is described in further detail.
Video-splicing is divided into global registration and two stages of local directed complete set by the present invention.In the global registration stage, extract and the result of mating based on SIFT feature, calculate the projective parameter of each road video camera, accordingly can by each road image projection to unified viewing plane, this one-phase is existing ripe solution now; The present invention is mainly devoted to second stage, the i.e. local directed complete set stage, pyramid block matching method is utilized to obtain the corresponding relation of two width images at some image blocks of overlapping region, and represent its position deviation with the displacement of two dimension, then minimized by energy functional and discrete displacement is diffused into whole image range, the displacement that last foundation obtains does deformation operation to image, and obtain the frame of video of overlapping region non-structure deviation, the overview flow chart in local directed complete set stage as shown in Figure 1.
Splicing video based on pyramid Block-matching and functional optimization provided by the invention goes structural deviation method to be realized by following steps:
S1. demarcated the projective parameter obtained according to video camera itself, two width source images are projected to same viewing plane, and after extracting re-projection, the lap of two width images carries out pyramid Block-matching, obtain discrete match block and corresponding motion vector.
S2. on the discrete displacement vector basis obtained in step sl, with interframe constraint in the frame of comprehensive video splicing, set up the energy functional of deformation diffusion, and minimization of energy functional is converted into the Solve problems of partial differential equation, solve by difference the deformation quantity that partial differential equation obtain in whole image range.
S3. the deformation quantity in the whole image range obtained according to step S2 carries out deformation operation to input picture, then the image after deformation carried out re-projection to unified viewing plane and do to merge, and obtains the stitching image of present frame.
Further, the concrete grammar of described step S1 comprises:
S11. the image pyramid of two width image laps after re-projection is set up.
With I s, I tthe lap of two width images after expression re-projection, first to I sand I tcarry out multiple dimensioned convergent-divergent:
I s , k + 1 = g σ ⊗ I s , k k = 0,1 , . . . L - 1 I s , k + 1 ( i , j ) = I s , k + 1 ′ ( 2 i - 1,2 j - 1 ) i = 1,2 , . . . H k , j = 1,2 , . . . W k
Wherein k is current layer number, and L is convolution operation the total degree performed, and I s, 0=I s, g σrepresent that variance is the gaussian kernel function of σ, H kwith W kbe respectively the height and the width of kth tomographic image.To I tdo same operation, set up pyramid I t, 0, I t, 1... I t,L.
S12. pass through successively Block-matching, obtain the corresponding relation between current viewing plane epigraph block.
Descending with yardstick, namely number of plies k is by the order of L to 0, to one group of correspondence image I of every one deck s,kwith I s,koperation block matching algorithm, obtains the matching relationship of current layer, and with the displacement of two dimension represent the position deviation obtained, wherein V k,x() and V k,y() is vector component in level and vertical both direction, numbering m=1,2 ... M kwith n=1,2 ... N kbe respectively the Position Number of match block vertical and horizontal, M kwith N kbe respectively the maximum numbering of kth layer vertical and horizontal.With for as lower one deck (2m-1,2n-1), (2m-1,2n), the initial value that (2m, 2n-1) and four pairs, (2m, 2n) place match block displacement are searched for, successively search for, the matching result of each layer as shown in Figure 2.Finally obtain the original image i.e. Block-matching displacement of the 0th tomographic image and have:
x ^ t ( m , n ) = x ^ s ( m , n ) + V x ( m , n ) y ^ t ( m , n ) = y ^ s ( m , n ) + V y ( m , n )
Wherein, with be respectively the coordinate at each match block center under the re-projection image place viewing plane coordinate system of source images and target image.The result of Block-matching is discrete, therefore for process is convenient, the result of Block-matching can be represented with following one dimension numbering form:
x ^ t , p = x ^ s , p + V x , p y ^ t , p = y ^ s , p + V y , p p = 1,2 , . . . P m
Wherein, P mfor the quantity that match block is right.
S13. under each match block center being transformed to original image coordinate system to be deformed according to known global mapping relation, and deformation quantity discrete under calculating original image coordinate.
For obtaining I scorresponding original image I s, orgin the deformation quantity of each pixel, need viewing plane coordinate with i is mapped to according to the mapping relations that the global registration stage obtains s, orgimage coordinate on, obtain corresponding coordinates of original image coordinates (x s,p, y s,p) and (x t,p, y t,p), now, I s, orgmiddle target location (x t,p, y t,p) corresponding displacement can be obtained by following formula:
U x , p = x t , p - x s , p U y , p = y t , p - y s , p
Further, the concrete grammar of described step S2 comprises:
S21. combined block mates in frame that deformation action need in the discrete prior information and video-splicing obtained meets and interframe retrain, and setting up need minimized energy functional expression formula.
Deformation operation is carried out to input picture, needs to obtain I s, orgdisplacement function in full figure scope Ω in the present invention the separate and computational process of two components consistent, therefore for simplifying statement, unifiedly represent one of them with scalar function U.Meanwhile, right and the two-dimension displacement vector of follow-up appearance does similar simplification.Three constraints below function U demand fulfillment:
(1) U is at (x t,p, y t,p) place value should with U iclose as much as possible;
(2) in image, the U value of adjacent position should be close as much as possible;
(3) the U value of consecutive frame should be close as much as possible.
Wherein retrain (1) and represent the discrete displacement prior information obtained by Block-matching in step S1, the continuity constraint that constraint (2) is deformation quantity in frame, the continuity constraint that constraint (3) is interframe deformation quantity.
The energy functional component of constraint (1) correspondence is:
E D = ∫ ∫ Ω ( U - U ~ ) 2 dxdy
Wherein, for be defined on Ω by the reference displacement function that gridding interpolation obtains.
The energy functional component of constraint (2) correspondence is:
E S = ∫ ∫ Ω | ▿ U | 2 dxdy = ∫ ∫ Ω ( ( dU dx ) 2 + ( dU dy ) 2 ) dxdy
The energy functional component of constraint (3) correspondence is:
E C = ∫ ∫ Ω ( U - U pre ) 2 dxdy
Comprehensively above-mentioned three constraints, need minimized total energy functional to be:
E = E D + λ 1 E S + λ 2 E C = ∫ ∫ Ω ( ( U - U ~ ) 2 + λ 1 | ▿ U | 2 + λ 2 ( U - U pre ) 2 ) dxdy
S22. according to variation principle, energy functional minimization problem is converted to solving of partial differential equation.
If with the energy functional obtained in optimal method direct iteration solution procedure S21, because unknown function U is defined in image range, by U according to after image resolution ratio discretization, the displacement of each pixel position is the component of the unknown state variable of demand solution.Existing optimal method is when solving large-scale optimization problem like this, and convergence rate cannot meet the requirement of video-splicing to processing speed.Therefore, need to transform problem, to accelerate to solve.
According to variation principle, according to Euler-Language formula, can be the Solve problems of following partial differential equation by the problem equivalent minimizing E:
Above-mentioned partial differential equation are ellipse partial differential equation of second order.
S23. the partial differential equation that difference method numerical solution obtains in step S22 are utilized.
The partial differential equation obtained in step S22 are conitnuous forms, and its analytic solutions normal conditions cannot be tried to achieve, and the present invention carries out discretization when implementing to it, and solves numerical solution.Used first-order difference form is as follows:
∂ U ∂ x = U ( x + h , . . . ) - U ( x , . . . ) h + O ( h )
Finite Difference Scheme of Second Order is:
∂ 2 U ∂ x 2 = U ( x + h , . . . ) - 2 U ( x , . . . ) + U ( x - h , . . . ) 2 h + O ( h 2 )
Especially, the difference scheme of Laplacian Δ is:
ΔU = U ( x + h , y ) + U ( x - h , y ) + U ( x , y + h ) + U ( x , y - h ) - 4 U ( x , y ) 2 h + O ( h 2 )
In the present invention, difference interval h gets the width of a pixel, h=1 in image coordinate system, by substituting into the difference scheme of above-mentioned differential operator, the partial differential equation obtained in S22 is converted to difference equation.Solving of difference equation is realized by overrelaxation (SOR, SuccessiveOverRelaxation) alternative manner.
Displacement vector components U is solved respectively according to above-mentioned steps xand U ythe partial differential equation obtained in corresponding S22, can obtain original image I s, orgthe motion vector of each pixel
Further, the concrete grammar of described step S3 comprises:
S31. according to the deformation quantity obtained in step S2, deformation operation is carried out to input picture.
According to I s, orgdeformation quantity within the scope of full images can to I s, orgcarry out deformation operation, image I ' after distortion s, orgmiddle position (x ' s,p, y ' s,p) pixel value at place can from I s, orgas upper/lower positions value:
x s , p = x s , p ′ - U x , p y s , p = y s , p - U y , p
Usually (x s,p, y s,p) be non-integer position, the value of this position needs to do bilinear interpolation by neighbouring pixel and obtains.
S32. demarcated the projective parameter obtained according to video camera itself, the image projection after distortion has been merged to unified viewing plane.
By the image I ' after distortion s, orgwith target image I t, orgaccording to projecting to same viewing plane, now, the overlapping region of two images is non-structure deviations.Two width re-projection image applications Multiscale Fusions or gradient field fusion method are merged, namely obtains the stitching image of present frame.Step S1 to S3 is suitable for each group synchronization frame of video, the video spliced can be obtained.

Claims (3)

1. the splicing video based on pyramid Block-matching and functional optimization goes a structural deviation method, it is characterized in that comprising the following steps:
S1. after extracting re-projection, the lap of two width images carries out pyramid Block-matching, obtains discrete structure corresponding relation and the motion vector of discrete point;
S11. the image pyramid of two width image laps after re-projection is set up;
With I s, I tthe lap of two width images after expression re-projection, first to I sand I tcarry out multiple dimensioned convergent-divergent:
I s , k + 1 = g σ ⊗ I s , k k = 0 , 1 , ... L - 1 I s , k + 1 ( i , j ) = I s , k + 1 ′ ( 2 i - 1 , 2 j - 1 ) i = 1 , 2 , ... H k j = 1 , 2 , ... W k
Wherein k is current layer number, and L is convolution operation the total degree performed, and I s, 0=I s, g σrepresent that variance is the gaussian kernel function of σ, H kwith W kbe respectively the height and the width of kth tomographic image; To I tdo same operation, set up pyramid I t, 0, I t, 1... I t,L;
S12. pass through successively Block-matching, obtain the corresponding relation between current viewing plane epigraph block;
Descending with yardstick, namely number of plies k is by the order of L to 0, to one group of correspondence image I of every one deck s,kwith I s,koperation block matching algorithm, obtains the matching relationship of current layer, and with the displacement of two dimension represent the position deviation obtained, wherein V k,x() and V k,y() is vector component in level and vertical both direction, numbering m=1,2 ... M kwith n=1,2 ... N kbe respectively the Position Number of match block vertical and horizontal, M kwith N kbe respectively the maximum numbering of kth layer vertical and horizontal; With for as lower one deck (2m-1,2n-1), (2m-1,2n), (2m, 2n-1) and (2m, 2n) initial value of the four pairs of match block displacement search in place, successively searches for, finally obtains the original image i.e. Block-matching displacement of the 0th tomographic image and have:
x ^ t ( m , n ) = x ^ s ( m , n ) + V x ( m , n ) y ^ t ( m , n ) = y ^ s ( m , n ) + V y ( m , n )
Wherein, with be respectively the coordinate at each match block center under the re-projection image place viewing plane coordinate system of source images and target image; The result of Block-matching is discrete, the result of Block-matching is represented with following one dimension numbering form:
x ^ t , p = x ^ s , p + V x , p y ^ t , p = y ^ s , p + V y , p , p = 1 , 2 , ... P m
Wherein, P mfor the quantity that match block is right;
S13. under each match block center being transformed to original image coordinate system to be deformed according to known global mapping relation, and displacement discrete under calculating original image coordinate;
For obtaining I scorresponding original image I s, orgin the deformation quantity of each pixel, need viewing plane coordinate with i is mapped to according to the mapping relations that the global registration stage obtains s, orgimage coordinate on, obtain corresponding coordinates of original image coordinates (x s,p, y s,p) and (x t,p, y t,p), now, I s, orgmiddle target location (x t,p, y t,p) corresponding displacement can be obtained by following formula:
U x , p = x t , p - x s , p U y , p = y t , p - y s , p
S2., on the discrete displacement vector basis that step S1 obtains, each item constraint of comprehensive video splicing, sets up the energy functional of deformation diffusion, obtains the displacement in whole image range by minimization of energy functional;
S3. the displacement obtained according to step S2 carries out deformation operation to input picture, carries out re-projection and fusion, obtain the stitching image of current each frame of video to the image after deformation.
2. the splicing video based on pyramid Block-matching and functional optimization according to claim 1 goes structural deviation method, it is characterized in that: the concrete grammar of described step S2 comprises:
S21. combined block mates in frame that deformation action need in the discrete prior information and video-splicing obtained meets and interframe retrain, and setting up need minimized energy functional expression formula;
Deformation operation is carried out to input picture, needs to obtain I s, orgdisplacement function in full figure scope Ω wherein the separate and computational process of two components consistent, unifiedly represent one of them with scalar function U at this; Three constraints below function U demand fulfillment:
(1) U is at (x t,p, y t,p) place value should with U pclose as much as possible;
(2) in image, the U value of adjacent position should be close as much as possible;
(3) the U value of consecutive frame should be close as much as possible;
Wherein retrain (1) and represent the discrete displacement prior information obtained by Block-matching in step S1, the continuity constraint that constraint (2) is frame intrinsic displacement amount, the continuity constraint that constraint (3) is interframe displacement;
The energy functional component of constraint (1) correspondence is:
E D = ∫ ∫ Ω ( U - U ~ ) 2 d x d y
Wherein, for be defined on Ω by the reference displacement function that gridding interpolation obtains;
The energy functional component of constraint (2) correspondence is:
E S = ∫ ∫ Ω | ▿ U | 2 d x d y = ∫ ∫ Ω ( ( d U d x ) 2 + ( d U d y ) 2 ) d x d y
The energy functional component of constraint (3) correspondence is:
E C = ∫ ∫ Ω ( U - U p r e ) 2 d x d y
Comprehensively above-mentioned three constraints, need minimized total energy functional to be:
E = E D + λ 1 E S + λ 2 E C = ∫ ∫ Ω ( ( U - U ~ ) 2 + λ 1 | ▿ U | 2 + λ 2 ( U - U p r e ) 2 ) d x d y
S22. according to variation principle, energy functional minimization problem is converted to solving of partial differential equation;
According to variation principle, according to Euler-Language formula, can be the Solve problems of following partial differential equation by the problem equivalent minimizing E:
Above-mentioned partial differential equation are ellipse partial differential equation of second order;
S23. the partial differential equation that difference method numerical solution obtains in step S22 are utilized
The partial differential equation obtained in step S22 are conitnuous forms, carry out discretization, and solve numerical solution to it;
Used first-order difference form is as follows:
∂ U ∂ x = U ( x + h , ... ) - U ( x , ... ) h + O ( h )
Finite Difference Scheme of Second Order is:
∂ 2 U ∂ x 2 = U ( x + h , ... ) - 2 U ( x , ... ) + U ( x - h , ... ) 2 h + O ( h 2 )
Especially, the difference scheme of Laplacian Δ is:
Δ U = U ( x + h , y ) + U ( x - h , y ) + U ( x , y + h ) + U ( x , y - h ) - 4 U ( x , y ) 2 h + O ( h 2 )
Wherein, difference interval h gets the width of a pixel, h=1 in image coordinate system, by substituting into the difference scheme of above-mentioned differential operator, the partial differential equation obtained in S22 is converted to difference equation; This difference equation is linear, and it solves and realizes by overrelaxation alternative manner;
Displacement vector components U is solved respectively according to above-mentioned steps xand U ythe partial differential equation obtained in corresponding S22, can obtain original image I s, orgthe motion vector of each pixel
3. the splicing video based on pyramid Block-matching and functional optimization according to claim 2 goes structural deviation method, it is characterized in that: the concrete grammar of described step S3 comprises:
S31. according to the displacement obtained in step S2, deformation operation is carried out to input picture;
According to I s, orgdisplacement function within the scope of full images can to I s, orgcarry out deformation operation, image I ' after distortion s, orgmiddle position (x ' s,p, y ' s,p) pixel value at place can from I s, orgas upper/lower positions value:
x s , p = x s , p ′ - U x , p y s , p = y s , p - U y , p
Usually (x s,p, y s,p) be non-integer position, the value of this position needs to do bilinear interpolation by neighbouring pixel and obtains;
S32. demarcated the projective parameter obtained according to video camera itself, the image projection after distortion has been merged to unified viewing plane;
By the image I ' after distortion s, orgwith target image I t, orgaccording to projecting to same viewing plane, now, the overlapping region of two images is non-structure deviations, merges, namely obtain the stitching image of present frame to two width re-projection image applications Multiscale Fusions or gradient field fusion method; Step S1 to S3 is suitable for each group synchronization frame of video, the video spliced can be obtained.
CN201510054621.2A 2015-02-03 2015-02-03 Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method Active CN104796623B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510054621.2A CN104796623B (en) 2015-02-03 2015-02-03 Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510054621.2A CN104796623B (en) 2015-02-03 2015-02-03 Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method

Publications (2)

Publication Number Publication Date
CN104796623A CN104796623A (en) 2015-07-22
CN104796623B true CN104796623B (en) 2016-02-24

Family

ID=53561110

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510054621.2A Active CN104796623B (en) 2015-02-03 2015-02-03 Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method

Country Status (1)

Country Link
CN (1) CN104796623B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107491034B (en) * 2017-08-28 2020-02-21 华中科技大学 Cutter axis vector optimization method based on covariant field functional
CN108492275B (en) * 2018-01-24 2020-08-18 浙江科技学院 No-reference stereo image quality evaluation method based on deep neural network
CN110544204A (en) * 2019-07-31 2019-12-06 华南理工大学 image splicing method based on block matching
CN112884806B (en) * 2021-01-12 2022-09-02 昆明理工大学 Video stream measuring method and device based on combination of block matching and intensive reverse search
CN117152221B (en) * 2023-10-26 2024-01-16 山东科技大学 Image non-rigid registration method, system, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101959008A (en) * 2009-03-03 2011-01-26 索尼株式会社 The method and apparatus that is used for image and Video processing
CN102088569A (en) * 2010-10-13 2011-06-08 首都师范大学 Sequence image splicing method and system of low-altitude unmanned vehicle
CN103856727A (en) * 2014-03-24 2014-06-11 北京工业大学 Multichannel real-time video splicing processing system
CN104090972A (en) * 2014-07-18 2014-10-08 北京师范大学 Image feature extraction and similarity measurement method used for three-dimensional city model retrieval
CN104301677A (en) * 2014-10-16 2015-01-21 北京十方慧通科技有限公司 Panoramic video monitoring method and device orienting large-scale scenes

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101959008A (en) * 2009-03-03 2011-01-26 索尼株式会社 The method and apparatus that is used for image and Video processing
CN102088569A (en) * 2010-10-13 2011-06-08 首都师范大学 Sequence image splicing method and system of low-altitude unmanned vehicle
CN103856727A (en) * 2014-03-24 2014-06-11 北京工业大学 Multichannel real-time video splicing processing system
CN104090972A (en) * 2014-07-18 2014-10-08 北京师范大学 Image feature extraction and similarity measurement method used for three-dimensional city model retrieval
CN104301677A (en) * 2014-10-16 2015-01-21 北京十方慧通科技有限公司 Panoramic video monitoring method and device orienting large-scale scenes

Also Published As

Publication number Publication date
CN104796623A (en) 2015-07-22

Similar Documents

Publication Publication Date Title
CN104796623B (en) Splicing video based on pyramid Block-matching and functional optimization goes structural deviation method
CN107959805B (en) Light field video imaging system and method for processing video frequency based on Hybrid camera array
CN105245841B (en) A kind of panoramic video monitoring system based on CUDA
CN102984453B (en) Single camera is utilized to generate the method and system of hemisphere full-view video image in real time
CN101276465B (en) Method for automatically split-jointing wide-angle image
CN102254348B (en) Virtual viewpoint mapping method based o adaptive disparity estimation
Zhang et al. Framebreak: Dramatic image extrapolation by guided shift-maps
CN110300292B (en) Projection distortion correction method, device, system and storage medium
CN101000461B (en) Method for generating stereoscopic panorama by fish eye image
CN106157304A (en) A kind of Panoramagram montage method based on multiple cameras and system
CN104935909B (en) Multi-image super-resolution method based on depth information
US11656722B1 (en) Method and apparatus for creating an adaptive bayer pattern
CN104794683B (en) Based on the video-splicing method scanned around gradual change piece area planar
CN103856727A (en) Multichannel real-time video splicing processing system
JP2007257287A (en) Image registration method
CN104732482A (en) Multi-resolution image stitching method based on control points
CN104463859B (en) A kind of real-time video joining method based on tracking specified point
CN101877140A (en) Panorama-based panoramic virtual tour method
CN104160422A (en) Method and apparatus for performing super-resolution
CN101840570A (en) Fast image splicing method
CN104809719A (en) Virtual view synthesis method based on homographic matrix partition
CN104616286A (en) Fast semi-automatic multi-view depth restoring method
CN107451952A (en) A kind of splicing and amalgamation method of panoramic video, equipment and system
CN105488777A (en) System and method for generating panoramic picture in real time based on moving foreground
CN107580203A (en) Immersion active stereo projective perspective transformation matrix solving method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant