CN103310464B - A kind of method of the direct estimation camera self moving parameter based on normal direction stream - Google Patents

A kind of method of the direct estimation camera self moving parameter based on normal direction stream Download PDF

Info

Publication number
CN103310464B
CN103310464B CN201310257841.6A CN201310257841A CN103310464B CN 103310464 B CN103310464 B CN 103310464B CN 201310257841 A CN201310257841 A CN 201310257841A CN 103310464 B CN103310464 B CN 103310464B
Authority
CN
China
Prior art keywords
foe
pure
normal direction
camera
rotation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310257841.6A
Other languages
Chinese (zh)
Other versions
CN103310464A (en
Inventor
袁丁
刘淼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University filed Critical Beihang University
Priority to CN201310257841.6A priority Critical patent/CN103310464B/en
Publication of CN103310464A publication Critical patent/CN103310464A/en
Application granted granted Critical
Publication of CN103310464B publication Critical patent/CN103310464B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

Based on a direct method for normal direction flow field estimation camera self moving parameter, be based upon on the basis of geometric model, the parameter of camera autokinesis can be estimated.The method comprises six large steps, step one: the normal direction flow field calculating input sequential frame image; Step 2: ballot obtains possible FOE point, selects the pure rotation amount of its correspondence and tentatively rejects FOE distracter; Step 3: use RANSAC algorithm to estimate rotation parameter ω, the reliability of inspection estimation result also rejects FOE distracter further; Step 4: utilize rotation parameter to choose pure translational movement, and again reject FOE distracter according to half-plane constraint; Step 5: the reliability that back forecasting is estimated; Step 6: K means clustering algorithm finds out the optimum solution of estimation.Instant invention overcomes the aperture problem in existing self moving parameter method of estimation, and need imaging scene to meet the shortcomings such as some constraints, estimate the camera self moving parameter good reliability obtained, be applicable to all monocular system, there is wide application background.

Description

A kind of method of the direct estimation camera self moving parameter based on normal direction stream
(1) technical field
The present invention relates to a kind of method of the direct estimation camera self moving parameter based on normal direction stream, to imaging scene without any constraint, be namely applicable to the action reference variable of any monocular system, belong to computer vision field.
(2) background technology
Namely camera ego-motion estimation is estimate relative motion between camera and scenery from the sequential frame image before and after camera motion, and this is computer vision field, one of gordian technique of especially vision guided navigation aspect application.In addition, the self moving parameter of camera estimates there is important application in a lot of field such as automatic Pilot, mobile robot's self-navigation and electronic image stabilizing, is the hot research problem of computer vision field in decades.At present, along with the raising of computer processor arithmetic capability, carry out real-time estimation become possibility to the self moving parameter of camera, researchist has taken up to be applied in autonomous vision guided navigation technology, such as, application etc. in moon exploration.
(scene is such as needed to comprise specific, Special complex background etc.) when not having artificial restraint, how accurately and estimate that the self moving parameter of camera is a difficult problem for computer vision field always efficiently.Although in the past few decades, research work personnel have carried out a large amount of correlative study work, and the estimation of camera autokinesis remains a very challenging technical barrier.
The traditional implementation of the self moving parameter of estimation camera has two schemes.The first, characteristic matching method, namely sets up the relation of characteristic matching from sequential frame image, and in then setting up according to matching relationship, epipolar geom etry constraint realizes action reference variable.But accurately and fast matching characteristic amount is a difficult problem for computer vision field all the time, not yet solves so far.And, feature choose with matching process in often need to comprise a large amount of notable feature in scene, this has just had very large constraint to imaging scene.The second, optical flow method.Its realization means utilizes the optical flow field of each pixel motion in token image to estimate, but due to the existence of aperture problem, light stream cannot directly be obtained by sequential frame image.Usually through artificial restraint conditions such as introducing imaging scene are level and smooth, continuous in traditional implementation method, calculate optical flow field.But these additional constraint conditions are difficult to meet in actual applications, and therefore classic method is very limited in the application of reality.
Different from traditional implementation, first the eighties Horn in latter stage proposes direct method (DirectMethod)---and directly utilize normal direction stream to solve the self moving parameter of camera.The feature of direct method directly utilizes sequential frame image to calculate normal direction flow field, and without any need for context restrictions condition, thus avoid the interference of the error of calculation and aperture problem in classic method.The more classical vector field model being Ferm ü ller and proposing in direct method, this model utilizes the direction of normal direction stream to be divided into "+" and " ﹣ " two parts, estimates kinematic parameter according to this two-part critical line (being called " zero line ").But the accurate location of " zero line " is very difficult in algorithm realization process, this also brings larger error directly to the estimation result of algorithm.Sinclair directly locates FOE(FocusofExpansion by normal direction stream) estimate the translational motion information of camera, but this method can not estimate rotary motion, does not have versatility.Silva proposes by setting up linear search subspace to locate the kinematic parameter finding out camera, but this method is easily subject to the interference of external condition, and very responsive to error.At present, the technological achievement of direct method is also little, and Existing methods is also difficult to realize self moving parameter comparatively accurately and estimates.
(3) summary of the invention
1, object: a kind of method that the object of this invention is to provide direct estimation camera self moving parameter based on normal direction stream, which overcome the deficiencies in the prior art, estimation error is less, and to imaging scene without any constraint, can be applied to any monocular system.
2, technical scheme:
The present invention is intended to the relative movement parameters estimated between camera and scene.Its input quantity is the normal direction flow field directly calculated by sequential frame image, exports the estimated value into camera self moving parameter.First the motion model that the present invention relates to is introduced.
Normal direction flow field is input quantity of the present invention, but include translational component and the rotational component of motion in normal direction flow vector, the scenery degree of depth and gradient information, and these information are all mixed in together, therefore flow into method inherently technological difficulties of hand estimation camera self moving parameter from normal direction.The present invention flows into hand from normal direction, is estimated the self moving parameter of camera by the vector carrying out deep analysis to filter out meeting particular geometric characteristic to its geometrical property.First motion model is set up as follows:
Suppose that scene is motionless, and camera carries out general motion, namely comprise translation motion t=[UVW] twith rotary motion ω=[ω 1ω 2ω 3] t, then normal direction stream size Vn (x) at each pixel place can be expressed as vector form:
Vn ( x ) = Vn trans ( x ) + Vn rot ( x )
= W Z ( x ) n ( x ) ( x - FOE ) + n T ( x ) R ( x ) ω - - - ( 1 )
Wherein, Vn trans(x) and Vn rotx () is translational component and the rotational component of normal direction stream respectively; Z (x) is pixel x=(x, y) tthe scenery degree of depth at place; N (x) is its gradient direction; FOE (FocusofExpansion) is the translation motion axle of camera and the intersection point of imaging plane, can represent the direction of translatory motion of camera; The coefficients R (x) of rotary motion can be expressed as:
R ( x ) = xy / f - ( x 2 / f + f ) y ( y 2 / f + f ) - ( xy / f ) - x - - - ( 2 )
Wherein, f is the focal length of imaging system, (x, y) tit is the coordinate of arbitrary pixel in picture plane.
The present invention mainly utilizes the geometrical property of normal direction flow vector in formula (1), by choosing the normal direction stream of particular orientation, some blending constituents in normal direction flow vector are made to be zero, namely the normal direction stream characterizing certain peculair motion is found out, such as, pure flatly to move or pure rotary process Xiang Liu etc., then utilize these jus singulars to estimate the self moving parameter of camera to stream.The present invention that Here it is realizes the motion model that self moving parameter is estimated.
Based on above-mentioned motion model, introduce the concrete technical scheme that the present invention estimates camera self moving parameter below.The method of a kind of direct estimation camera self moving parameter based on normal direction stream of the present invention, its step is as follows:
Step one: read in sequential frame image I 1(x), I 2x (), computing method is to flow field u n
Normal direction flow field is input quantity of the present invention, first introduces the acquisition methods in normal direction flow field.Be different from optical flow field, normal direction flow field directly to be calculated by the gray scale-temporal information analyzing sequential frame image before and after motion, and its computation process does not need there is any artificial constraint condition to imaging scene, and therefore the present invention has certain reliability and versatility.
From the angle of practical application, the normal direction stream size of each pixel is all the projection of light stream on this gradient direction.Light stream representative as the motion of each pixel in plane, but due to the existence of aperture problem, is difficult to accurately to calculate when not introducing artificial restraint condition the light stream vector that each pixel is pointed out.And normal direction stream is only the component of light stream on gradient direction, it can directly be calculated by sequential frame image, and it comprises a large amount of movable information equally.
First light stream represents the motion of pixel, and its equation of constraint is:
E xu+E yv+E t=0(3)
Wherein, [uv] is the light stream vector at pixel (x, y) place, [E xe y] be shade of gray direction at pixel (x, y) place, E tit is the gray scale-time rate of change at point (x, y) place.E x, E yand E tdirectly can be calculated by the half-tone information of image sequence:
E x = I 2 ( i + 1 , j ) - I 2 ( i , j ) E y = I 2 ( i , j + 1 ) - I 2 ( i , j ) E t = I 2 ( i , j ) - I 1 ( i , j ) - - - ( 4 )
Wherein I 1, I 2be the gray matrix of front and back two two field picture, I (i, j) represents the gray-scale value of the i-th row jth row pixel.Normal direction stream is the projection of light stream on gradient direction, and therefore, normal direction flow vector may be defined as:
u n = ( u v T · n ) n | | n | | = [ - E x E t E x 2 + E y 2 - E y E t E x 2 + E y 2 ] T - - - ( 5 )
Wherein, vector u nmethod of representatives to flow vector, it is the unit gradient direction at pixel (x, y) place.Known by formula (4) and formula (5), by means of any artificial constraint condition, need directly can not calculate the normal direction flow vector of any pixel from the image sequence before and after motion.
Step 2: " ballot " obtains possible FOE point, finds out corresponding pure rotation amount and preliminary rejecting FOE distracter
Started with by the normal direction flow field obtained, the present invention adopts the mode of " ballot " to filter out jus singular to flow vector, then by analyzing these jus singulars to the geometric relationship between stream and camera motion parameter, estimates the self moving parameter of camera.
Known by formula (1), the translational component Vn in normal direction stream transx () can be considered the inner product of vector n (x) and (x-FOE).And if at certain pixel place two vector normal, then their inner product is zero.If but want to choose the point meeting this orthogonal property, determine that the particular location of FOE is condition precedent.Therefore, adopt the mode of " ballot " to select possible FOE point in the present invention, its constraint condition is:
n(x)(x-FOE)=0(6)
Wherein, n (x) is pixel x=(x, y) tthe gradient direction at place, FOE represents the direction of camera translation motion.Meeting translational component in the normal direction stream at the pixel place of formula (6) pure rotation condition is zero, and therefore this point is " the pure point of rotation ", and its normal direction stream is " pure rotation amount ".Known by pure rotation condition, the judgement of each pure point of rotation is based on a specific FOE, if this point is the pure point of rotation, then FOE must be positioned at through this point and on straight line perpendicular to its normal direction stream, and this pixel is also this FOE " backer ", straight line is " ballot line ".Also be like this for other pure point of rotation, therefore FOE should be positioned at the point of intersection of all ballot lines.The principle of ballot that Here it is.
The ballot adopted in the present invention carries out when pure rotation amount and FOE are all unknown.Therefore, first suppose that all pixels are the pure point of rotation, so after voting in each pixel place, the FOE that " backer " is maximum has the pure rotation number of maximum quantity, and reliability is relatively high.Therefore, the present invention chooses possible FOE according to " backer " order from many to few and comes further to check.First for selected FOE, the pure rotation amount satisfied condition is chosen according to the pure rotation condition in formula (6).Compare with formula (1), translational component Vn in the normal direction stream at pure point of rotation place trans(x)=0, therefore pure rotation amount meets:
Vn(x)=n T(x)R(x)ω(7)
That is, there is a kind of linear relationship between pure rotation amount and the pure point of rotation, and the rotary motion ω of camera is the coefficient of this linear relationship.Wherein, Vn (x) is the size of this x place normal direction stream, and n (x) is its gradient direction, and definition in the same formula of R (x) (2), ω is the rotational motion parameter of camera.Therefore, if the pure point of rotation and the normal direction stream thereof of more than three can be found, then the rotation parameter of camera can be estimated according to above formula.Otherwise if the pure rotation amount chosen according to constraint formulations (6) is less than three, this FOE will be regarded as distracter and disallowable.
Step 3: estimate rotation parameter ω with RANSAC algorithm, inspection estimation result reliability go forward side by side one step card FOE
The present invention adopts consistent (RANdomSAmpleConsensus, the RANSAC) algorithm of random sampling to carry out estimation equation (7) neutral line model, namely simulates the rotation parameter of camera.Because FOE in the present invention is obtained by the mode of ballot, therefore portion interfering terms may be included in this FOE and pure rotation amount thereof, and RANSAC algorithm can exclusive PCR effectively.Therefore the present invention adopts RANSAC algorithm to estimate the rotation parameter of camera, and whether reliably the possibility P that this algorithm can be obtained useful consequence as this FOE of measurement standard.P can be expressed as
P=1-(1-ρ n) m(8)
Wherein, m is iterations, and n is the minimum number that appraising model needs random selecting data.If certain pixel is applicable to the model estimated, think that it is " intra-office point ".ρ is that intra-office point accounts for all number percent corresponding to the pure point of rotation of this FOE.By P<P in the present invention thresthe FOE of (threshold value) is considered as distracter and rejects.
Now, the kinematic parameter FOE-ω one_to_one corresponding of camera, and most FOE distracter is disallowable.Whether kinematic parameter reliably will be verified by pure translational constraints further to FOE-ω.
Step 4: choose pure translational movement and again reject FOE distracter according to half-plane constraint
Choose pure translational movement according to the rotation parameter ω obtained in the present invention and verify the reliability that kinematic parameter FOE-ω estimates.Pure rotation amount choose based on specific FOE, similarly, by analyzing based on the motion model of rotation parameter ω, can filter out rotational component be zero pure flatly move normal direction stream, the geometrical property then by analyzing these pure translational movements rejects the distracter of estimation further.
First, the motion model based on rotation parameter ω is gang take photocentre as summit, and ω is the centrum of axle, and centrum is projected as gang's quafric curve on imaging plane:
F ( x , y ) = ( &omega; 1 &omega; 3 x + &omega; 2 &omega; 3 y + f ) 2 / ( x 2 + y 2 + f 2 ) - - - ( 9 )
= C
Wherein, f is the focal length of camera, ω=[ω 1ω 2ω 3] tbe the rotational motion parameter of camera, C is 0 to (1+ (ω 1/ ω 3) 2+ (ω 2/ ω 3) 2) between constant.This is that gang is by turning axle (ω 1/ ω 3, ω 2/ ω 3) curve determined, any point place on curve, if its normal direction flow vector is perpendicular to this tangential direction, then wherein rotation information is zero, is " pure translational movement ".
The geometrical property analyzing pure translational movement is known, if camera (namely near scene) motion forward, then FOE (FocusofExpansion) should be positioned at the contrary half-plane of translational movement pure with this; Otherwise if camera (namely away from scene) motion backward, then FOC (FocusofContraction) should be positioned at the consistent half-plane of translational movement pure with this.Wherein FOE and FOC represents the different directions of camera translation motion, the half-plane constraint of Here it is pure translational movement.
Choose pure translational movement according to the rotation parameter calculated, then check this FOE(or FOC) whether meet half-plane constraint, if do not met, be considered as distracter and reject, the present invention checks by setting a very little number of threshold values δ in proof procedure.
Step 5: back forecasting
Rejected by three screenings, most of FOE distracter is excluded.If this camera self moving parameter is estimated reliable, then the FOE point in inter-trust domain should be centered around near true value, presents state of aggregation.Therefore, the present invention adopts the method for back forecasting, is traced back on original image by the FOE finally screening " credible " that obtain, if FOE presents state of aggregation, then thinks that this estimates reliable; Otherwise the threshold value of setting is finely tuned when suitably the pure rotation amount of amendment or pure translational movement are chosen, finally make to estimate reliably.
Step 6: K means clustering algorithm finds out the optimum solution of estimation
After guaranteed this estimation reliable results by back forecasting, the present invention adopts K-means clustering algorithm to find out optimum kinematic parameter to FOE-ω as final estimation result of the present invention.
3, beneficial effect:
The present invention relates to a kind of method of the direct estimation camera self moving parameter based on normal direction stream, its advantage is:
1) half-tone information of two dimensional image is directly utilized to calculate input quantity of the present invention---normal direction flow field, effectively prevent extraction and the tracking error of the characteristic quantity introduced in " aperture " problem common in optical flow method and characteristic matching process.
2) to imaging scene without any artificial restraint, such as: imaging scene is level and smooth, continuously, include a large amount of notable features etc.Therefore the present invention is applicable to all monocular system.
3) RANSAC algorithm effectively prevent the interference of " point not in the know ".
4) back forecasting checks the present invention to estimate the reliability of result further.
5) inspection of quadruple closed loop can go out believable kinematic parameter by Effective selection, thus ensure that reliability and feasibility of the present invention.
(4) accompanying drawing explanation
Fig. 1 the method for the invention FB(flow block).
In figure: FOE is the translation motion axle of camera and the intersection point of imaging plane;
RANSAC(RANdomSAmpleConsensus) be random sampling unification algorism;
P represents that RANSAC algorithm can obtain the possibility of useful consequence;
P thresrepresent threshold value;
FOE-ω represents the estimates of parameters of one group of camera autokinesis.
(5) embodiment
In order to understand technical scheme of the present invention better, below the specific embodiment of the present invention is further described:
The present invention realizes under MatlabR2011a language environment.First computing machine reads in the normal direction flow field that sequential frame image calculates characterizing motility, then possible FOE point is found out by " ballot ", and then select corresponding pure rotation amount and calculate the rotation parameter ω of camera, select pure translational movement according to turning axle again and verify that whether FOE is reliable, find out optimal estimation value finally by back forecasting and clustering algorithm.
The present invention is a kind of method of the direct estimation camera self moving parameter based on normal direction stream, and the flow process of the method as shown in Figure 1.The method comprises the following steps:
Step one: read in sequential frame image I 1(x), I 2x (), computing method is to flow field u n
(1) under MatlabR2011a language environment, sequential frame image I is read in 1(x), I 2(x);
(2) pre-service is carried out to sequential frame image, such as: Gaussian smoothing filter, mean filter etc., Gaussian function chooses 5 × 5, the template of σ=1.4, mean filter chooses the template of 3 × 3;
(3) the Sobel operator of 3 × 3 is selected to find out gradient direction n (x) of each pixel;
(4) according to the Computing Principle of normal direction stream in formula (5), input quantity---the normal direction flow field u of present system is directly obtained n(x).Because normal direction stream is the projection of light stream on gradient direction, therefore normal direction stream u n(x) and n (x) conllinear, and the large I of normal direction stream just can born and can be zero.
Step 2: " ballot " obtains possible FOE point, finds out corresponding pure rotation amount and preliminary rejecting FOE distracter
(1) in a manifold larger than imaging plane size, carry out " ballot " and find out possible FOE point.Manifold there is a gray scale zero setting, then to do at each pixel place through this point and perpendicular to the straight line (" ballot line ") of this original image normal direction stream, and the pixel gray scale of this ballot line process is increased a certain amount of τ, τ and be size according to pretreatment image and specifically set.The true dimension of picture processed in the present invention is 574 × 652, τ value is 0.00001.
First judge whether certain pixel votes on line at certain, adopts the relation between two vectors to weigh in this subject study.Such as, judge pixel (x, y) whether on straight line Ax+By+C=0, can be regarded as and judge whether vector (A, B, C) and (x, y, 1) is vertical.And by ε 1be set to the angle threshold value between two vectors, namely
By threshold epsilon 1the angle system of being set to, whether voting on line Ax+By+C=0, generally can get ε by judging point (x, y) more accurately 1=0 ~ 0.001.If point on this line, then think " backer " of corresponding FOE.After poll closing, the brightest pixel can think the FOE point that " backer " is maximum.
(2) the FOE point that selected " backer " is maximum, then finds out the pure rotation amount corresponding to this FOE according to rotation condition condition pure in formula (6).Need in specific operation process to adopt threshold epsilon, namely meet constraint
n(x)(x-FOE)<ε(11)
Wherein, n (x) is the gradient direction at pixel x place, and FOE represents the direction of camera translation motion.If meet pure rotation condition in above formula, then think this some place normal direction stream u nx (), for pure rotary process is to stream, generally gets threshold epsilon <0.001.
(3) known by formula (7), there is a kind of linear relationship between pure rotation amount and the pure point of rotation, and the rotational motion parameter ω of camera is its linear coefficient.As long as find more than three the pure rotation amounts of non-zero, the rotation parameter ω of camera just can be estimated.If the pure rotation amount corresponding to certain FOE point less than three, then thinks that this FOE is distracter and is rejected.If the brightest FOE point is disallowable, then sequentially chooses residual pixel point the brightest heavy point and be FOE and repeat aforesaid operations.
Step 3: estimate rotation parameter ω with RANSAC algorithm, inspection estimation result reliability go forward side by side one step card FOE
(1) because RANSAC algorithm can avoid interference the impact of item on fitting effect effectively, therefore the present invention's linear model of adopting RANSAC algorithm to come in estimation equation (7).First from the pure rotation amount selected, random selecting n=3 pure rotation amount carries out initial fitting, then other pure rotation amounts are checked whether to be applicable to this model with the linear model that initial fitting goes out, if be applicable to, then be labeled as " intra-office point ", and then again estimate linear model with all " intra-office points ".Iteration is until " the intra-office point " of twice mark in front and back only becomes hardly so like this, and the coefficient of the linear model that now matching obtains is the camera rotation parameter ω corresponding to FOE.
(2) first the number of times m providing circulation random selecting sample is needed in RANSAC algorithm.The probability P that can produce useful consequence according to this algorithm can estimate the number of times m of random sampling:
m = log ( 1 - P ) log ( 1 - &rho; n ) - - - ( 12 )
Wherein, the number of the pure rotation amount of ρ=intra-office point number/all correspondences is the number percent of intra-office point.N is the minimum number that appraising model needs random selecting data, estimates three parameters of camera rotation parameter in advance, therefore choose minimum number n=3 in the present invention.The threshold value P of Effective Probability is supposed in the present invention thres=80%, get P=P thresestimate and obtain random sampling number of times m.
(3) carrying out in each random sampling in the process of iteration all will according to formula (8) calculating probability P, if P>P thresthen think this reliable results stop random sampling iteration.
(4) if still do not find reliable estimation model when frequency in sampling is greater than m, then this FOE is considered as distracter and rejects.After RANSAC round-robin algorithm is complete, most of FOE is identified as distracter and disallowable, the rotation parameter ω that minority FOE is corresponding with it define one to one action reference variable to FOE-ω.
(5) rotation parameter ω is directly calculated by the pure rotation amount corresponding to a certain FOE, and therefore it is believable for FOE.So the key of estimation is that whether FOE is reliable in the present invention.In order to determine to estimate result further, the present invention checks action reference variable to the reliability of FOE-ω by by the pure flat half-plane constraint moved again.
Step 4: choose pure translational movement and again reject FOE distracter according to half-plane constraint
(1) gang's quafric curve can be determined by rotation parameter ω:
F ( x , y ) = ( &omega; 1 &omega; 3 x + &omega; 2 &omega; 3 y + f ) 2 / ( x 2 + y 2 + f 2 ) - - - ( 13 )
= C
Wherein f is the focal length of camera, ω=[ω 1ω 2ω 3] tbe the rotational motion parameter of camera, C is 0 to (1+ (ω 1/ ω 3) 2+ (ω 2/ ω 3) 2) between constant.The normal direction stream analyzing pixel on this race's curve is known, is zero, does not namely comprise rotation information, be therefore referred to as " pure translational movement " perpendicular to rotational component in the normal direction stream of this pixel place tangent line.
(2) the choosing of pure translational movement.Known according to the relevant geometrical constraint of above-mentioned quafric curve, meet
u n(x,y)//(F x(x,y),F y(x,y))(14)
Normal direction flow vector be " pure translational movement ".Wherein, F x(x, y), F y(x, y) be F (x, y) at point (x, y) place along the partial derivative on X-and Y-direction, u n(x, y) is the direction flow vector at this some place.
(3) known by pure flat geometrical property of moving normal direction stream, if the translation of camera is " close ", scenery travels forward, then FOE should be positioned at the half-plane region contrary with pure translational movement; Otherwise FOC is positioned at the half-plane region consistent with pure translational movement.The half-plane constraint of Here it is pure translational movement, only needs checking in implementation procedure
u n(x)·(x-FOE)>0(15)
Or
u n(x)·(x-FOC)<0(16)
Wherein u nx () is the normal direction flow vector at pixel x place, FOE(or FOC) represent the direction of camera translation motion.The present invention checks by setting a very little threshold value δ in proof procedure.
(4) the present invention verifies often couple of FOE-ω.First the quafric curve determined according to rotation parameter ω is chosen " pure translational movement ", then checks FOE (or FOC) whether to meet half-plane constraint with these " pure translational movements ".If FOE (or FOC) meets the half-plane constraint of " the pure translational movement " of more than 85%, then think that this parameter estimation is credible to FOE-ω.Otherwise, be considered as distracter and reject.
(5) through the checking of triple closed loop, FOE distracter is close to all rejects.The result of this estimation is close to be determined, but whether this estimation is reliable, and the present invention will be determined by back forecasting.
Step 5: back forecasting
Adopt back forecasting to check the reliability of this estimation in the present invention.Through inspection and the rejecting of triple closed loop, FOE distracter is close to all to be got rid of.Therefore, if this estimation is reliable, then most remaining parameter should near its true value to FOE-ω, therefore FOE point traces back on original image and tests by the present invention.If remaining FOE presents the state of " gathering ", then think that this estimates reliable; Otherwise, make to estimate reliably by suitably adjusting the threshold epsilon of screening pure rotation amount.
Step 6: K means clustering algorithm finds out the optimum solution of estimation
After this estimation of back forecasting is reliable, the present invention adopts K means clustering algorithm to find out the center of most kinematic parameter as optimal estimation value, the estimation of namely final camera self moving parameter.
Validity of the present invention and accuracy are verified by synthesising picture and true picture, achieve good estimated result.Sharpest edges of the present invention are that the Gray Level-Gradient information directly utilizing image is estimated, effectively prevent the aperture problem existed in classic method, and to imaging scene without any artificial constraint, the camera self moving parameter being applicable to all monocular system is estimated.
From the experimental results, method in the present invention can estimate the parameter (translation direction FOE and rotation parameter ω) of camera autokinesis effectively from sequential frame image, reliability is high, and the self moving parameter that can be applicable in all monocular system is estimated, has broad application prospects and is worth.

Claims (1)

1., based on a method for the direct estimation camera self moving parameter of normal direction stream, the method is carried out under following motion model, suppose that scene is motionless, and camera carries out general motion, namely comprises translation motion t=[UVW] twith rotary motion ω=[ω 1ω 2ω 3] t, then normal direction stream size Vn (x) at each pixel place is expressed as vector form:
V n ( x ) = Vn t r a n s ( x ) + Vn r o t ( x ) = W Z ( x ) n ( x ) ( x - F O E ) + n T ( x ) R ( x ) &omega; - - - ( 1 )
Wherein, Vn trans(x) and Vn rotx () is translational component and the rotational component of normal direction stream respectively; Z (x) is pixel x=(x, y) tthe scenery degree of depth at place; N (x) is its gradient direction; FOE is the translation motion axle of camera and the intersection point of imaging plane, represents the direction of translatory motion of camera; The coefficients R (x) of rotary motion is expressed as:
R ( x ) = x y / f - ( x 2 / f + f ) y ( y 2 / f + f ) - ( x y / f ) - x - - - ( 2 )
Wherein, f is the focal length of imaging system, (x, y) tit is the coordinate of arbitrary pixel in picture plane;
Utilize the geometrical property of normal direction flow vector in formula (1), by choosing the normal direction stream of particular orientation, some blending constituents in normal direction flow vector are made to be zero, namely the normal direction stream characterizing certain peculair motion is found out, such as, pure flat move or pure rotary process to stream, then utilize these jus singulars to stream the self moving parameter of camera is estimated, Here it is realize self moving parameter estimate motion model;
It is characterized in that: the step of the method is as follows
Step one: read in sequential frame image I 1(x), I 2x (), computing method is to flow field u n;
Normal direction flow field is input quantity, is directly to be calculated by the gray scale-temporal information analyzing sequential frame image before and after motion, and its computation process does not need there is any artificial constraint condition to imaging scene;
From the angle of practical application, the normal direction stream size of each pixel is all the projection of light stream on this gradient direction; Light stream representative as the motion of each pixel in plane, but due to the existence of aperture problem, is difficult to accurately to calculate when not introducing artificial restraint condition the light stream vector that each pixel is pointed out; And normal direction stream is only the component of light stream on gradient direction, it is directly calculated by sequential frame image, and it comprises a large amount of movable information equally;
First light stream represents the motion of pixel, and its equation of constraint is:
E xu+E yv+E t=0(3)
Wherein, [u, v] is the light stream vector at pixel (x, y) place, [E x, E y] be shade of gray direction at pixel (x, y) place, E tit is the gray scale-time rate of change at point (x, y) place; E x, E yand E tdirectly calculated by the half-tone information of image sequence:
E x = I 2 ( i + 1 , j ) - I 2 ( i , j ) E y = I 2 ( i , j + 1 ) - I 2 ( i , j ) E t = I 2 ( i , j ) - I 1 ( i , j ) - - - ( 4 )
Wherein I 1, I 2be the gray matrix of front and back two two field picture, I (i, j) represents the gray-scale value of the i-th row jth row pixel; Normal direction stream is the projection of light stream on gradient direction, and therefore, normal direction flow vector is defined as:
u n = ( u v T &CenterDot; n ) n | | n | | = - E x E t E x 2 + E y 2 - E y E t E x 2 + E y 2 T - - - ( 5 )
Wherein, vector u nmethod of representatives to flow vector, it is the unit gradient direction at pixel (x, y) place; Known by formula (4) and formula (5), by means of any artificial constraint condition, need directly can not calculate the normal direction flow vector of any pixel from the image sequence before and after motion;
Step 2: " ballot " obtains possible FOE point, finds out corresponding pure rotation amount and preliminary rejecting FOE distracter;
Being started with by the normal direction flow field obtained, adopt the mode of " ballot " to filter out jus singular to flow vector, then by analyzing these jus singulars to the geometric relationship between stream and camera motion parameter, estimating the self moving parameter of camera;
Known by above-mentioned formula (1), the translational component Vn in normal direction stream transx () is considered as the inner product of vector n (x) and (x-FOE); And if at certain pixel place two vector normal, then their inner product is zero; If but want to choose the point meeting this orthogonal property, determine that the particular location of FOE is condition precedent; Therefore, adopt the mode of " ballot " to select possible FOE point, its constraint condition is:
n(x)(x-FOE)=0(6)
Wherein, n (x) is pixel x=(x, y) tthe gradient direction at place, FOE represents the direction of camera translation motion; Meeting translational component in the normal direction stream at the pixel place of formula (6) pure rotation condition is zero, and therefore this point is " the pure point of rotation ", and its normal direction stream is " pure rotation amount "; Known by pure rotation condition, the judgement of each pure point of rotation is based on a specific FOE, if this point is the pure point of rotation, then FOE must be positioned at through this point and on straight line perpendicular to its normal direction stream, and this pixel is also this FOE " backer ", straight line is " ballot line "; Also be like this for other pure point of rotation, therefore FOE is positioned at the point of intersection of all ballot lines;
The ballot adopted is carry out when pure rotation amount and FOE are all unknown, therefore, first suppose that all pixels are the pure point of rotation, so after voting in each pixel place, the FOE that " backer " is maximum has the pure rotation number of maximum quantity, and reliability is relatively high; Therefore, choose possible FOE according to " backer " order from many to few to come further to check; First for selected FOE, the pure rotation amount satisfied condition is chosen according to the pure rotation condition in formula (6); Compare with formula (1), translational component Vn in the normal direction stream at pure point of rotation place trans(x)=0, therefore pure rotation amount meets:
Vn(x)=n T(x)R(x)ω(7)
That is, there is a kind of linear relationship between pure rotation amount and the pure point of rotation, and the rotary motion ω of camera is the coefficient of this linear relationship; Wherein, Vn (x) is the size of this x place normal direction stream, and n (x) is its gradient direction, the coefficient that R (x) is rotary motion, represents same formula (2); ω in formula (7) is the rotational motion parameter of camera; Therefore, if find the pure point of rotation and the normal direction stream thereof of more than three, then the rotation parameter of camera is estimated according to above formula; Otherwise if the pure rotation amount chosen according to constraint formulations (6) is less than three, this FOE will be regarded as distracter and disallowable;
Step 3: estimate rotation parameter ω with RANSAC algorithm, inspection estimation result reliability go forward side by side one step card FOE;
Adopt random sampling to be unanimously that RANSAC algorithm carrys out estimation equation (7) neutral line model, namely simulate the rotation parameter of camera; Because FOE is obtained by the mode of ballot, therefore portion interfering terms may be included in this FOE and pure rotation amount thereof, and RANSAC algorithm can exclusive PCR effectively, therefore adopt the rotation parameter of RANSAC algorithm estimation camera, and whether reliably the possibility P that this algorithm can be obtained useful consequence as this FOE of measurement standard; P is expressed as
P=1-(1-ρ n) m(8)
Wherein, m is the number of times of circulation random selecting sample, and n is the minimum number that appraising model needs random selecting data; If certain pixel is applicable to the model estimated, think that it is " intra-office point "; ρ is that intra-office point accounts for all number percent corresponding to the pure point of rotation of this FOE; P is less than threshold value P thresfOE be considered as distracter and reject;
Now, the kinematic parameter FOE-ω one_to_one corresponding of camera, and most FOE distracter is disallowable; Whether kinematic parameter reliably will be verified by pure translational constraints further to FOE-ω;
Step 4: choose pure translational movement and again reject FOE distracter according to half-plane constraint;
Choose pure translational movement according to the rotation parameter ω obtained and verify the reliability that kinematic parameter FOE-ω estimates; Pure rotation amount choose based on specific FOE, similarly, by analyzing based on the motion model of rotation parameter ω, filter out rotational component be zero pure flatly move normal direction stream, the geometrical property then by analyzing these pure translational movements rejects the distracter of estimation further;
First, the motion model based on rotation parameter ω is gang take photocentre as summit, and ω is the centrum of axle, and centrum is projected as gang's quafric curve on imaging plane:
F ( x , y ) = ( &omega; 1 &omega; 3 x + &omega; 2 &omega; 3 y + f ) 2 / ( x 2 + y 2 + f 2 ) = C - - - ( 9 )
Wherein, f is the focal length of camera, ω=[ω 1ω 2ω 3] tbe the rotational motion parameter of camera, C is 0 to (1+ (ω 1/ ω 3) 2+ (ω 2/ ω 3) 2) between constant; This is that gang is by turning axle (ω 1/ ω 3, ω 2/ ω 3) curve determined, any point place on curve, if its normal direction flow vector is perpendicular to this tangential direction, then wherein rotation information is zero, is " pure translational movement ";
The geometrical property analyzing pure translational movement is known, if camera is forward namely near scene motion, then FOE should be positioned at the contrary half-plane of translational movement pure with this; Otherwise if camera is backward namely away from scene motion, then FOC should be positioned at the consistent half-plane of translational movement pure with this; Wherein FOE and FOC represents the different directions of camera translation motion, the half-plane constraint of Here it is pure translational movement;
Choose pure translational movement according to the rotation parameter calculated, then check this FOE or FOC whether to meet half-plane constraint, if do not met, be considered as distracter and reject; Check by setting a very little threshold value δ in proof procedure;
Wherein, the threshold value δ that setting one is very little in the proof procedure described in step 4 checks, this threshold value δ=0.001 ~ 0.005;
Step 5: back forecasting;
Rejected by three screenings, most of FOE distracter is excluded; If this camera self moving parameter is estimated reliable, then the FOE point in inter-trust domain should be centered around near true value, presents state of aggregation; Therefore, adopt the method for back forecasting, the FOE finally screening " credible " that obtain is traced back on original image, if FOE presents state of aggregation, then think that this estimates reliable; Otherwise the threshold value of setting is finely tuned when suitably the pure rotation amount of amendment or pure translational movement are chosen, finally make to estimate reliably;
Step 6: K means clustering algorithm finds out the optimum solution of estimation;
After guaranteed this estimation reliable results by back forecasting, K-means clustering algorithm is adopted to find out optimum kinematic parameter to FOE-ω as finally estimating result.
CN201310257841.6A 2013-06-26 2013-06-26 A kind of method of the direct estimation camera self moving parameter based on normal direction stream Active CN103310464B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310257841.6A CN103310464B (en) 2013-06-26 2013-06-26 A kind of method of the direct estimation camera self moving parameter based on normal direction stream

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310257841.6A CN103310464B (en) 2013-06-26 2013-06-26 A kind of method of the direct estimation camera self moving parameter based on normal direction stream

Publications (2)

Publication Number Publication Date
CN103310464A CN103310464A (en) 2013-09-18
CN103310464B true CN103310464B (en) 2016-04-06

Family

ID=49135641

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310257841.6A Active CN103310464B (en) 2013-06-26 2013-06-26 A kind of method of the direct estimation camera self moving parameter based on normal direction stream

Country Status (1)

Country Link
CN (1) CN103310464B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593730B (en) * 2013-10-30 2016-08-17 中国运载火箭技术研究院 A kind of design parameter management system recalled
CN103914819B (en) * 2014-03-26 2016-11-23 东华大学 A kind of based on the infrared image joining method improving RANSAC
CN108519102B (en) * 2018-03-26 2021-06-01 东南大学 Binocular vision mileage calculation method based on secondary projection
CN108573500B (en) * 2018-04-24 2020-07-28 西安交通大学 Method for directly estimating vehicle-mounted camera motion parameters
CN109242907A (en) * 2018-09-29 2019-01-18 武汉光庭信息技术股份有限公司 A kind of vehicle positioning method and device based on according to ground high speed camera
CN110211190B (en) * 2019-05-31 2022-11-01 北京百度网讯科技有限公司 Method and device for training camera self-motion parameter estimation model and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622764A (en) * 2012-02-23 2012-08-01 大连民族学院 Target tracking method on basis of movable camera platform
EP2570993A2 (en) * 2011-09-16 2013-03-20 Harman International (China) Holdings Co., Ltd. Egomotion estimation system and method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2570993A2 (en) * 2011-09-16 2013-03-20 Harman International (China) Holdings Co., Ltd. Egomotion estimation system and method
CN102622764A (en) * 2012-02-23 2012-08-01 大连民族学院 Target tracking method on basis of movable camera platform

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Camera Motion Estimation Using Normal Flows;袁丁 等;《2012 4th International conference on graphic and image processing》;20121007;1-5 *
相机自运动参数的鲁棒性估计;汪剑鸣 等;《红外与激光工程》;20101225;第39卷(第6期);1168-1172 *

Also Published As

Publication number Publication date
CN103310464A (en) 2013-09-18

Similar Documents

Publication Publication Date Title
CN103310464B (en) A kind of method of the direct estimation camera self moving parameter based on normal direction stream
CN101216941B (en) Motion estimation method under violent illumination variation based on corner matching and optic flow method
CN103578117B (en) Determine the photographic head method relative to the attitude of environment
Li et al. A 4-point algorithm for relative pose estimation of a calibrated camera with a known relative rotation angle
Liu et al. Robust and efficient relative pose with a multi-camera system for autonomous driving in highly dynamic environments
CN110009674A (en) Monocular image depth of field real-time computing technique based on unsupervised deep learning
CN111046767B (en) 3D target detection method based on monocular image
CN103424105B (en) Method for checking object and device
CN107871327A (en) The monocular camera pose estimation of feature based dotted line and optimization method and system
US11080892B2 (en) Computer-implemented methods and system for localizing an object
CN102901444A (en) Method for detecting component size based on matching pursuit (MP) wavelet filtering and detecting system thereof
Cheng et al. Mobile robot indoor dual Kalman filter localisation based on inertial measurement and stereo vision
CN103456030A (en) Target tracking method based on scattering descriptor
CN112149713B (en) Method and device for detecting insulator image based on insulator image detection model
Miclea et al. New sub-pixel interpolation functions for accurate real-time stereo-matching algorithms
CN105160644A (en) Method for positioning center of crisscross image in CCD image measurement system
Jung et al. Photometric visual-inertial navigation with uncertainty-aware ensembles
Zhao et al. Robust depth-aided RGBD-inertial odometry for indoor localization
Wang et al. Heading reference-assisted pose estimation for ground vehicles
CN113436239A (en) Monocular image three-dimensional target detection method based on depth information estimation
Ali et al. Multi-sensor depth fusion framework for real-time 3D reconstruction
CN106406507A (en) Image processing method and electronic equipment
Jiao et al. Lce-calib: automatic lidar-frame/event camera extrinsic calibration with a globally optimal solution
Tian et al. Efficient ego-motion estimation for multi-camera systems with decoupled rotation and translation
CN113920150A (en) Simplified binocular vision mileage positioning method for planet vehicle under resource limitation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant