CN100483462C

CN100483462C - Establishing method of human face 3D model by fusing multiple-visual angle and multiple-thread 2D information

Info

Publication number: CN100483462C
Application number: CNB02146278XA
Authority: CN
Inventors: 徐光祐; 张辉
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2002-10-18
Filing date: 2002-10-18
Publication date: 2009-04-29
Anticipated expiration: 2022-10-18
Also published as: CN1404016A

Abstract

The present invention belongs to the field of computer-aided human face image re-construction technology. The present invention features that human face as model establishing object is first photographed with two, one upper and one lower, stere ovideo cameras to obtain video sequence from the front to the side of the face with unchanged expression; the precise posture parameters in different times are then obtained via autoamtic posture estimating algorithm; after the initiation of model and posture, 3D information are extracted separately from two types of 2D threads including one stereo reconstruction type of the image and the other matching and correcting type of model projection contour and image contour; and the 3D face model is finally established.

Description

Merge the method for building up of the human face three-dimensional model of various visual angles, multi thread two-dimensional signal

Technical field

The method for building up that merges the human face three-dimensional model of various visual angles, multi thread two-dimensional signal belongs to computer face image reconstruction technique field.

Background technology

Existing method according to image establishment three-dimensional face model lacks the strategy that extracts three-dimensional information and merge from multiple two-dimentional clue.Kang, people such as Sing Bing are at " image method and apparatus for semi-automatically mappinga face on to a wireframe topology " (United States Patent6, a kind of semi-automatic method according to the positive visual founder's face model of single width has been proposed in the patented technology 031,539).The principle of this method is: automatically detect a plurality of unique points on people's face image, and then according to the landmark points (landmark points) of positioning feature point face, obtain the displacement of all net points in people's face wire-frame model then in view of the above, create out and the corresponding faceform of this image.The major defect of this method is the information that does not make full use of each visual angle, and this has limited the levels of precision of model.

At " utilizing desktop video camera human cloning face " (" the 8th computer vision international conference collection of thesis ", 2001, second volume, pp.745-745) in the literary composition, people such as author Zhang Zhengyou, Z.Liu have proposed a kind of method of utilizing the corresponding faceform of establishment of two dimension in the monocular video.The principle of this method is: the manual image of selecting two width of cloth near front view, by people's face shape model of camera self-calibration and simplification, based on two dimension to using the model that the non-linear way of finding the solution obtains specific people.Though this method has been utilized various visual angles information, only be to use several images to go up corresponding two-dimensional points information and do not made full use of other shape constraining, the model that obtains is not accurate enough, especially abundant regional all the more so of texture.

These methods have some common shortcomings: (1) model is meticulous inadequately; (2) can not do the compatible animation of MPEG4 (moving picture experts group organizational standard 4).

Summary of the invention

The object of the present invention is to provide a kind of method for building up that merges the human face three-dimensional model of various visual angles, multi thread two-dimensional signal.

But the present invention relates to a kind of modeling method of animation three-dimensional face, at first take people's face of modeling object progressively forwards the side to from the front video sequence (shooting process people face is answered amimia variation) with demarcating good stereo camera; Be that a full automatic attitude algorithm for estimating is to obtain each accurate attitude parameter constantly in the sequence then; Then after model and attitude initialization, according to two class two-dimensional clues by different processing scheme extraction three-dimensional informations: a class is based between image corresponding point and carries out stereo reconstruction, and another kind of is coupling, the correction of model projection profile and image contour; At last, merge in various visual angles, extract, set up the faceform corresponding to the three-dimensional information of the two-dimentional clue of difference.This method and existing method ratio, more careful, more accurate shape that the faceform who obtains has, and can do the human face animation of MPEG4 compatibility.

The invention is characterized in:

It at first use with the people on the face, corresponding two the identical and outer polar curve direction video cameras substantially vertically of upper/lower positions people's face of taking modeling object under the situation of the amimia variation of people's face progressively changes about 90 video sequences of spending to the side from the front, gather up and down the two-way synchronization video with two identical capture cards again and import in the computing machine; Then, from the universal model Face of people's face _gBeginning is progressively made amendment according to the two-dimensional signal of the modeling object that obtains, and finally obtains specific people's faceform Face _s, the three-dimensional point coordinate among the just people who wherein the revises three-dimensional node tabulation P on the face, and people triangle surface tabulation F on the face remains unchanged promptly only changes mould shapes P and the topological structure F of model is constant; Particularly, it contains successively and has the following steps:

(1). input stereoscopic video sequence I _{L, t}And I _{R, t}:

L, R be corresponding to two video cameras, t=1 ..., N, the different images constantly of expression N frame: the two-dimensional array that I then is made up of pixel, each pixel comprise R, G, three components of B; The moment of t=1 is selected by people's judgement is manual corresponding to the positive attitude of people's face;

(2). attitude is estimated, is promptly found the solution the attitude parameter (being the rigid body translation coefficient) in the number of people rotary course:

Interframe movement with people's face in the above-mentioned image sequence of following rigid body translation formulate:

M _t+1＝R _tM _t+T _t，t＝1，...，N-1，

M _t, M _T+1The people's coordinate of any point in the L camera coordinate system on the face when representing t frame, t+1 frame respectively; M _t, M _T+1Being people's same position on the face at difference three-dimensional coordinate constantly, promptly is the three-dimensional corresponding point of interframe; Motion with reliable three-dimensional point just can be estimated attitude parameter;

{ R _t, T _t, t=1 ..., N-1} is rotation and the translational component of t frame to t+1 frame attitude parameter;

Attitude is estimated to contain successively to have the following steps:

(2.1), the initial moment: carry out the detection and the coupling of two dimensional character earlier, obtain the three-dimensional point in the initial moment again by stereo reconstruction:

At first, feature detection frame t0=t is set, and at I _{L, t}Make two dimensional character on the figure and detect, its step is as follows:

With existing KLT algorithm computation I _{L, t}Unique point p on the figure (x=i, the image change degree G that y=j) locates (i, j):

G (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) {\overset{&RightArrow;}{g}}^{T} (i + ii, j + jj),

Gradient vector for tested point p place image:

\overset{&RightArrow;}{g} (i, j) = {[\begin{matrix} gx & gy \end{matrix}]}^{T} = {[\begin{matrix} \frac{I (i + 1, j) - I (i - 1, j)}{2} & \frac{I (i, j + 1) - I (i, j - 1)}{2} \end{matrix}]}^{T},

(i+1 is the gray scale of this some place image j) to I, and the rest may be inferred by analogy;

With tested point p is that the window W at center is of a size of (2*blockx+1) * (2*blocky+1); And then obtain the minimum singular value sing_min of G with existing matlab kit, selecting the very big some p of sing_min again is detected unique point; Being designated as { p in t detected unique point of the moment _{L, t}(k) }, L, t presentation image sequence number, K (t) is the unique point sum, k is its label, 1≤k≤K (t);

Secondly, to two video camera L of synchronization, two width of cloth images that R shooting Same Scene obtains are that stereographic map is to making the two dimensional character coupling; The projection of same three-dimensional point in different images is two-dimensional points all, and it is corresponding to be called two dimension between them mutually, stereographic map to two-dimentional correspondence be called coupling:

For I _{L, t}Certain 1 p among the figure _{L, t}(k)=(x _{L, t}(k) y _{L, t}(k)) ^T, its match point p _{R, t}(k)=(x _{R, t}(k) y _{R, t}(k)) ^TShould be positioned at one by p _{L, t}(k) on Jue Ding the straight line, in this one-dimensional space, select to make the point of total gray difference minimum in the local window W as p _{L, t}(k) match point p _{R, t}(k), the expression formula of total gray difference:

Diff (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {(I_{L, t} (x_{L, t} (k) + ii, y_{L, t} (k) + jj) - I_{R, t} (i + ii, j + jj))}^{2}

(i j) is straight line On any point; The matching result of carving t that clocks is { p _{R, t}(k) }, total number is that K (t) is individual, every all with p _{L, t}(k) corresponding, on the straight line of following formulate, search for, make Diff (i, j) Zui Xiao position:

{[\begin{matrix} p_{L, t} (k) \\ 1 \end{matrix}]}^{T} F [\begin{matrix} p_{R, t} (k) \\ 1 \end{matrix}] = 0,

Wherein:

F = {[\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}]}^{- T} [\begin{matrix} 0 & - T_{c, z} & T_{c, y} \\ T_{c, z} & 0 & - T_{c, x} \\ - T_{c, y} & T_{c, x} & 0 \end{matrix}] R_{c} {[\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}]}^{- 1},

f _L, f _R, u _0L, v _0L, u _0R, v _0RBe camera intrinsic parameter, known quantity; R _c, T _c=[T _{C, x}T _{C, y}T _{C, z}] ^TBe respectively rotation and translational component, belong to external parameters of cameras, known quantity;

Once more, obtain the synchronization stereographic map to matching result p _{L, t}And p _{R, t}After, obtain the three-dimensional point M that represents with camera coordinate system by stereo reconstruction _{M, t}, m is video camera L or R, M _{M, t}Be the coordinate of t moment three-dimensional point in the m camera coordinate system;

Configuration relation formula according to two video cameras:

M _L=R _cM _R+ T _c, M _L, M _RBe respectively the coordinate of same point in these two camera coordinate systems; And camera perspective projection formula:

s_{m} A_{m}^{- 1} [\begin{matrix} p_{m, t} \\ 1 \end{matrix}] = M_{m, t},

s _m(be s _LOr s _R) be scale factor (obtaining) undetermined by formula after a while,

Obtain the coordinate M of three-dimensional point in video camera L, R _{L, t}And M _{R, t}:

M_{L, t} = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}] + T_{c});

A_{L} = [\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}],

A_{R} = [\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}];

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

And

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}]];

M_{R, t} = R_{c}^{- 1} (M_{L, t} - T_{c});

At the remaining part that attitude is estimated, three-dimensional point all uses the coordinate in the L camera coordinate system to represent, therefore omits subscript L, is about to M _{L, t}Brief note is M _t

(2.2), the follow-up moment: t+1 stereographic map is constantly followed the tracks of carry out two dimensional character with the KLT algorithm, obtained the three-dimensional point in each follow-up moment more equally by stereo reconstruction; Tracking is meant that the two dimension between the different time chart pictures that same video camera takes is corresponding:

When two dimensional character was followed the tracks of, the unique point of establishing detection was in image I _{M, t}(p) on (m is L or R), (x=i is arbitrary unique point y=j) to p, and then it is in image I _{M, t+1}On tracking results be

Wherein

\overset{&RightArrow;}{d} = G^{- 1} \overset{&RightArrow;}{e},

G as above-mentioned, and

\overset{&RightArrow;}{e} (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) (I_{m, t + 1} (i + ii, j + jj) - I_{m, t} (i + ii, j + jj))

Note is followed the tracks of the unique point that obtains and is { p _{M, t+1}(k) }, m represents L or R, and t+1 is constantly:

For video camera L be:

p_{L, t + 1} (k) = p_{L, t} (k) - {\overset{&RightArrow;}{d}}_{L, t} (p_{L, t} (k)), 1 \leq k \leq K (t);

For video camera R be:

p_{R, t + 1} (k) = p_{R, t} (k) - {\overset{&RightArrow;}{d}}_{R, t} (p_{R, t} (k)), 1 \leq k \leq K (t);

Calculate three-dimensional point { M according to arbitrary trace point to coupling _T+1(k) | k=1 ..., K (t) } (as previously mentioned, having omitted subscript L here);

M_{t + 1} (k) = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}] + T_{c});

Wherein

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}]];

(2.3), attitude parameter initialization: according to one group of three-dimensional corresponding point { M between two moment t and the t+1 _t(k) | k=1 ..., K (t) } and { M _T+1(k) | k=1 ..., K (t) } and the expression reliability sequence { bReliable _k, k=1 ..., K (t) is from formula M _T+1(k)=R _tM _t(k)+T _t, t=1 ..., use among the N-1 based on the Robust Estimation algorithm of minimum intermediate value and find the solution R _t, T _t:

If: definition is also brought in constant renewal in boolean's array { bReliable _kBe reliable tolerance sequence, and Yes represents that reliably No represents unreliable, wherein value is that the label of the item of Yes is { k _s, s=1 ..., Kr (t), Kr (t) they are that value is the sum of the item of Yes, from wherein choosing Num_subset subclass, each subclass contains three couples of three-dimensional corresponding point, i.e. { Set _n={ N _{1, n}, N _{2, n}, N _{3, n}, n=1 ..., Num_subset, 1≤N _{1, n}, N _{2, n}, N _{3, n}≤ Kr (t), three-dimensional corresponding point are in each ternary subclass:

{M_{t} (k_{N_{1, n}}), M_{t} (k_{N_{2, n}}), M_{t} (k_{N_{3, n}})}

With

{M_{t + 1} (k_{N_{1, n}}), M_{t + 1} (k_{N_{2, n}}), M_{t + 1} (k_{N_{3, n}})},

Can note by abridging and be { (M _Im,M _Im') | im=0,1,2}, its represents three pairs of not three-dimensional corresponding point of conllinear, R _t, T _tSatisfy M _Im'=R _tM _Im+ T _t| im=0,1,2; From following formula, obtain R and T (omitting subscript t) then:

R = I + \frac{\sin | | r | |}{| | r | |} {[r]}_{\times} + \frac{1 - \cos | | r | |}{{| | r | |}^{2}} {[r]}_{\times}^{2},

Wherein

r = θ * \hat{r},

θ = \arccos \frac{(\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}')}{| | (\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}') | |},

\hat{r} = \frac{({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2})}{| | ({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2}) | |},

Be the central shaft of rotation, claim turning axle, the vector of 3*1;

θ be around

Rotation angle;

{dir}_{im} = \frac{M_{im} - M_{0}}{| | M_{im} - M_{0} | |}, im = 1,2;

{dir}_{im}' = \frac{M_{im}' - M_{0}'}{| | M_{im}' - M_{0}' | |}, im = 1,2;

[r] _xBe according to 3*1 vector r=[r _xr _yr _z] ^TThe antisymmetric matrix of definition:

{[r]}_{\times} = {[\begin{matrix} r_{x} \\ r_{y} \\ r_{z} \end{matrix}]}_{\times} = [\begin{matrix} 0 & - r_{z} & r_{y} \\ r_{z} & 0 & - r_{x} \\ - r_{y} & r_{x} & 0 \end{matrix}],

And

I [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 1 & 0 & 1 \end{matrix}]

It is unit matrix;

Translational component T is estimated by following formula:

T = \frac{1}{3} Σ_{im = 0}^{2} (M_{im}' - {RM}_{im});

To each subclass Set _nEstimate attitude parameter by above-mentioned formula, be designated as R _{T, n}, T _{T, n}Again to all reliable three-dimensional corresponding { M _t(k _s), M _T+1(k _s), s=1 ..., Kr (t) is calculated as follows corresponding point M _t(k _s), M _T+1(k _s) and attitude parameter R _{T, n}, T _{T, n}Matching degree ε _{N, s}, this value is more little, and then both meet more:

ε _n，s＝‖M _t+1(k _s)-R _t，nM _t(k _s)-T _t，n‖ ²；

Again from set { R _{T, n}, T _{T, n}, n=1 ..., choose R among the Num_subset _t, T _t:

{R _t，T _t}＝{R _{n_m}，T _{n_m}}，

n_m = \underset{n}{\arg \min} {\underset{s}{med}}^{n} {ϵ_{n, s} | s = 1, . . ., Kr (t)},

Expression is got set { ε at subscript s _{N, s}| s=1 ..., Kr (t) intermediate value, it constitutes a new array to different n

{{\underset{s}{med}}^{n} | n = 1, . . ., Num_subset};

And

Then be in this intermediate value array, to select to get that n of minimum value;

(2.4), at t=t+1 constantly, be provided with:

Whether need to call again the Boolean denotation amount bReDetect of feature detection and matching module;

Whether need to carry out the Boolean denotation amount bRefine of attitude refinement

(2.5), when t=N, attitude is estimated to finish;

(3). model initialization and attitude initialization:

The model initialization is the initial step of universal model specialization method, and it is from universal model Face _gObtain an initialization model Face _Init, promptly universal model being carried out change of scale, available following formula carries out:

M_{init} = [\begin{matrix} s_{x} & 0 & 0 \\ 0 & s_{y} & 0 \\ 0 & 0 & s_{z} \end{matrix}] M_{g},

M _gAnd M _InitBe respectively Face _gAnd Face _InitOn arbitrary group of corresponding point; s _xBe modeling object and Face _gTwo inner eye corners apart from the ratio of Seg_E length, s _yBe the ratios of two centers to mouth centre distance Seg_EM length; s _z=s _yAgain according to this to Face _gIn all press the following formula conversion;

The attitude initialization is in order to obtain the correspondence between model and image, promptly to calculate the rigid body translation that is tied to the L camera coordinate system from people's face generic coordinate; Promptly find the solution R from following formula _{G, 1}, T _{G, 1}:

M ₁=R _{G, 1}M _Init+ T _{G, 1}, M ₁And M _InitPeople's face and Face when being moment t=1 respectively _InitBetween arbitrary group of corresponding point; The present invention gets the corresponding point of above-mentioned two models in LE (left eye), RE (right eye) and these three positions of Mid_M (mouth center), by above-mentioned

, θ, R, [r] _x, the various R that tries to achieve successively of T _{G, 1}And T _{G, 1}

Press following formula again from R _t, T _t, t=1 ..., N-1 obtains R _{G, t}, T _{G, t}, t=2 ..., N:

R _g，t＝R _t-1R _g，t-1，T _g，t＝R _t-1*T _g，t-1+T _t-1，t＝2，...，N；

(4). utilize three-dimensional information correction faceform:

(4.1), pick up the side unique point by hand:

If: the manual side unique point that marks is

{m_{L, side}^{kk} = {(\begin{matrix} x_{L, side}^{kk} & y_{L, side}^{kk} \end{matrix})}^{T}},

Kk=1 ..., NumSF, NumSF=24 are the number of side unique point, side represents the moment corresponding to lateral plan;

(4.2), shape corrections A: Face _InitBe deformed into Face ₁We determine the reposition of a subclass Subset among people's face three-dimensional node tabulation P earlier, determine the coordinate of all nodes among the P again by a known radial basis function algorithm;

The radial basis function algorithm is as described below: establishing will be with Face _StartBe deformed into Face _New, the node among the known Subset is at Face _NewIn coordinate be { New ₁..., New _s, remember that it is at Face _StartIn corresponding point be { Sta ₁..., Sta _s, S is the number of Subset mid point; For the arbitrary position pt in the model, it is at Face now _NewIn coordinate

Can obtain by following known formula:

M_{pt}^{new} = Σ_{jRBF = 1}^{S} C_{jRBF} φ (| | M_{pt}^{start} - {Sta}_{jRBF} | |);

Be that the pt point is at Face _StartIn coordinate, and { C ₁..., C _SCan obtain by following formula

[\begin{matrix} C_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ C_{S}^{T} \end{matrix}] = {[\begin{matrix} φ (| | {Sta}_{1} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{1} - {Sta}_{S} | |) \\ \cdot \cdot \cdot & \cdot \cdot \cdot & \cdot \cdot \cdot \\ φ (| | {Sta}_{S} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{S} - {Sta}_{S} | |) \end{matrix}]}^{- 1} [\begin{matrix} {New}_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ {New}_{S}^{T} \end{matrix}];

φ is known radial basis function;

During " shape corrections A " radial basis function algorithm on utilize, Face is set _Start=Face _Init, Face _New=Face ₁And the point among the Subset comprises two classes: a class is people's left eye interior angle LE, right eye interior angle RE, the outer lip LM in a left side, four three-dimensional remarkable characteristics of right outer lip RM on the face, is drawn by above-mentioned stereo reconstruction method by two-dimentional remarkable characteristic; Another kind of is above-mentioned 24 side unique points, and their three-dimensional position obtains from the manual two-dimensional points that marks by following step:

If: the three-dimensional coordinate of side unique point in people's face generic coordinate system is

{{Ms}_{g}^{kk} = {(\begin{matrix} 0 & Y_{g}^{kk} & Z_{g}^{kk} \end{matrix})}^{T}}, kk = 1, . . ., NumSF;

Then, have for arbitrary kk

\{\begin{matrix} x_{L, side}^{kk} = \frac{R_{g, side, 1}^{T} {Ms}_{g}^{kk} + T_{g, side, 1}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \\ y_{L, side}^{kk} = \frac{R_{g, side, 2}^{T} {Ms}_{g}^{kk} + T_{g, side, 2}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \end{matrix}

Wherein

R_{g, side} = (\begin{matrix} R_{g, side, 1}^{T} \\ R_{g, side, 2}^{T} \\ R_{g, side, 3}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{g, side, 1} \\ T_{g, side, 2} \\ T_{g, side, 3} \end{matrix})

Be the representation in components of attitude parameter, be known quantity; Can directly obtain with the On Solving System of Linear Equations method

In two unknown numbers

With

(4.3), shape corrections B: press aforesaid radial basis function algorithm Face ₁Be deformed into Face ₂

If Face _Start=Face ₁, Face _New=Face ₂Among the Subset except comprising the point of " shape corrections A " lining having determined, the point that also has some new interpolations, i.e. the three-dimensional point that the manual facial contour that marks recovers out under the positive attitude, the basic skills that obtains these three-dimensional point is current faceform Face ₁Image plane under positive attitude is done perspective projection and is calculated the two-dimensional silhouette point of three-dimensional model projection result, and the coupling that depends on facial contour on these point and the above-mentioned image at last and be closed polygon is calculated its three-dimensional coordinate:

At first, calculate Face ₁In the projected outline of the t=1 moment on the image plane of the manual front view of selecting

{Cont}_{1} = {{pt}_{1}^{iC 1} | iC 1 = 1, . . ., nNum 1},

Be the label of model three-dimensional node tabulation P mid point,

1 \leq {pt}_{1}^{iC 1} \leq nVertex;

And nNum1 is Cont ₁The number on middle summit, they are to the new point that adds of Subset among the shape corrections B; They are to select by the intersection point of judging projection line and model, and promptly on a certain point, projection line and model can not have other intersection points, and its algorithm is as follows: Face ₁Be three-dimensional model, the perspective projection center is

Center = - R_{g, 1}^{- 1} T_{g, 1}

Known quantity was then calculated the projection line that Center orders and the intersection point of model to all summits in the model, judged that in view of the above this summit is whether in projected outline, if M _NowBe Face ₁Middle any point, Plane _JpBe Face ₁In the plane that constitutes of arbitrary dough sheet, calculated Center point and M _NowStraight line, i.e. projection line is with Plane _JpIntersection point be C _NowIf for a certain dough sheet 1≤jp≤nMesh, some C _NowAt line segment CM _NowInside, M then _NowNot in projected outline; Otherwise be projected outline's point;

Calculate Cont again ₁In any point

At Face ₂Three-dimensional coordinate: the facial contour of establishing on the above-mentioned image is

Cont_{img}_{1} = {(\begin{matrix} {CI}_{1}^{iCimg} & yC I_{1}^{iCimg} \end{matrix}) | Cimg = 1, . . ., nNumCI 1},

It is the polygon that is made of the two-dimensional points tabulation, and nNumCI1 is the number of point; Then

pti = {pt}_{1}^{iC 1}

At Face ₂Three-dimensional coordinate

M_{2}^{pti} = M_{1}^{pti} + t_{line} v; v = {(\begin{matrix} v_{x} & v_{y} & v_{z} \end{matrix})}^{T} = v_{pn} \times (v_{n} \times v_{pn});

Wherein,

M_{1}^{pti} = {(\begin{matrix} X_{1}^{pti} & Y_{1}^{pti} & Z_{1}^{pti} \end{matrix})}^{T}

Be that the pti point is at Face ₁In coordinate; Warp

The projecting direction of point is v _PnBe known quantity; v _nFor the pti point at Face _gIn normal direction;

Parametric t _LineCan be according to two-dimentional straight line

{(\begin{matrix} \frac{X_{1}^{pti} + t_{line} v_{x}}{Z_{1}^{pti} + t_{line} v_{z}} & \frac{Y_{1}^{pti} + t_{line} v_{y}}{Z_{1}^{pti} + t_{line} v_{z}} \end{matrix})}^{T}

With closed polygon Cont_img ₁Intersection point obtain:

If Cont_img ₁In arbitrary line segment be seg=(x ₀+ sd _xy ₀+ sd _y) ^T, 0≤s≤1, s is the line segment parameter, other all are known quantities; Calculate by following formula earlier

[\begin{matrix} si \\ t_{line} \end{matrix}] = {[\begin{matrix} d_{x} & - (X_{1}^{pti} - v_{x} * Z_{1}^{pti} / v_{z}) \\ d_{y} & - (Y_{1}^{pti} - v_{y} * Z_{1}^{pti} / v_{z}) \end{matrix}]}^{- 1} [\begin{matrix} x_{0} - v_{x} / v_{z} \\ y_{0} - v_{y} / v_{z} \end{matrix}]

All seg are obtained this two numerical value si and t _LineDo not have for the formula above a lot of seg and to separate, promptly the matrix in the formula is irreversible; The seg that has the si that separates and obtain to satisfy 0≤si≤1 has one or two, gets and point

{(\begin{matrix} \frac{X_{1}^{pti}}{Z_{1}^{pti}} & \frac{Y_{1}^{pti}}{Z_{1}^{pti}} \end{matrix})}^{T}

That seg that distance is near, the t that it obtains _LineBeing institute asks;

(4.4), shape corrections C: press aforesaid radial basis function algorithm Face ₂Be deformed into Face ₃

If Face _Start=Face ₂, Face _New=Face ₃, except comprising the point of " shape corrections A, B " lining having determined, also have the point of some new interpolations among the Subset, promptly from the manual front face feature contour point that marks: the three-dimensional point that the point of eyes, nostril and mouth is recovered out:

If the two-dimensional coordinate of certain feature contour point is

(\begin{matrix} x {CF}_{1}^{iCF} & {yCF}_{1}^{iCF} \end{matrix}),

This point is at Face ₂In coordinate be

M_{2}^{iCF} = {(\begin{matrix} X_{2}^{iCF} & Y_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T},

If this Z coordinate is correct as calculated, promptly it is at Face ₃In the Z coordinate equal

Then it is at Face ₃In coordinate be

M_{3}^{iCF} = {(\begin{matrix} x {CF}_{1}^{iCF} * Z_{2}^{iCF} & y {CF}_{1}^{iCF} * Z_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T},

Be that these feature contour point projections under positive attitude equal the unique point on the actual image;

(4.5), shape corrections D: press aforesaid radial basis function algorithm Face ₃Be deformed into Face ₄

If Face _Start=Face ₃, Face _New=Face ₄, except comprising the point of " shape corrections A, B, C " lining having determined, also have the point of some new interpolations among the Subset, even promptly from manual medial view 1 R that marks _{G, t}Rotation angle recover the three-dimensional point of coming out in the facial contour near the moment t of 25 degree, concrete steps are as follows: establish this view corresponding to moment int1, described identical with shape corrections B, calculate Face earlier ₃Projected outline when int1

{Cont}_{int 1} = {{pt}_{int 1}^{iC 2} | iC 2 = 1, . . ., nNim_int 1},

Be the label of P mid point, nNum_int1 is the summit number in the projected outline; For any point

pt 1 i = {pt}_{int 1}^{iC 2},

It is at Face ₃In three-dimensional coordinate

M_{3}^{pt 1 i} = {[\begin{matrix} X_{3}^{pt 1 i} & Y_{3}^{pt 1 i} & Z_{3}^{pt 1 i} \end{matrix}]}^{T},

And

Center = - R_{g, 1}^{- 1} T_{g, 1}

Be the coordinate of L video camera photocentre in the generic coordinate system as previously mentioned, then this is at Face ₄In coordinate should satisfy

M_{4}^{pt 1 i} = M_{3}^{pt 1 i} + t_{line 2} v 2,

v 2 = {[\begin{matrix} {v 2}_{x} & {v 2}_{y} & {v 2}_{z} \end{matrix}]}^{T} = M_{3}^{pt 1 i} - Center,

Parametric t _Line2Equally according to straight line

{(\begin{matrix} \frac{X_{3}^{pt 1 i} + t_{line 2} {v 2}_{x}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} & \frac{Y_{3}^{pt 1 i} + t_{line 2} {v 2}_{y}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} \end{matrix})}^{T}

With closed polygon Cont_img _Int1Intersection point obtain;

(4.6), shape corrections E: press aforesaid radial basis function algorithm Face ₄Be deformed into Face ₅(be final shape Face _s);

If Face _Start=Face ₄, Face _New=Face ₅, except comprising the point of " shape corrections A, B, C, D " lining having determined, also have the point of some new interpolations among the Subset, even promptly from manual medial view 2 R that mark _{G, t}Rotation angle recover the three-dimensional point of coming out in the facial contour near the moment t of 40 degree, concrete recovering step is identical with shape corrections D;

(5). texture:

Present texture is front and the lateral plan (I that takes by the L video camera _{L, 1}And I _{L, side}) generate, texture is to be the model Face that creates _sGeneration appears to have the cylinder texture maps Texture of the same sense of reality of photograph _s, promptly will be in image conversion to the unified cylindrical coordinate system of gathering and merge, it contains successively and has the following steps:

(5.1), generate cylinder unwrapping figure:

At first to the net result Face of shape corrections E _sIn arbitrary three-dimensional point

M_{s}^{pt} = {(\begin{matrix} X_{s}^{pt} & Y_{s}^{pt} & Z_{s}^{pt} \end{matrix})}^{T}

Do the cylinder mapping by following formula:

\{\begin{matrix} {1 g}_{pt} = \arctan (\frac{Z_{s}^{pt}}{X_{s}^{pt}}) \\ {lt}_{pt} = Y_{s}^{pt} \end{matrix},

The result of mapping is a two dimensional surface, and the point on it is { Cyn _Pt}={ (lg _PtLt _Pt); Here lg _PtBe the longitude station on the cylinder, represent with radian value; Lt _PtIt then is the coordinate position of cylinder axial rotary;

(5.2), generate positive texture maps:

Have a few At known front attitude parameter R _{G, 1}, T _{G, 1}Press following formula down at front view I _{L, 1}Last projection,

{\overset{&RightArrow;}{p}}_{1}^{pt} = {(\begin{matrix} x_{1}^{pt} & y_{1}^{pt} \end{matrix})}^{T}

Be Face _sMiddle three-dimensional point

At I _{L, 1}On two-dimensional projection:

\{\begin{matrix} x_{1}^{pt} = \frac{R_{1 g, 1}^{T} M_{s}^{pt} + T_{1 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \\ y_{1}^{pt} = \frac{R_{2 g, 1}^{T} M_{s}^{pt} + T_{2 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \end{matrix},

Wherein

R_{g, 1} = (\begin{matrix} R_{1 g, 1}^{T} \\ R_{2 g, 1}^{T} \\ R_{3 g, 1}^{T} \end{matrix}),

T_{g, 1} = (\begin{matrix} T_{1 g, 1} \\ T_{2 g, 1} \\ T_{3 g, 1} \end{matrix})

It is the representation in components of rotation matrix and translation vector;

Then, I _{L, 1}In each pixel mapping to the Cyn plane, promptly to Face _sIn arbitrary dough sheet facet=(pt ₁Pt ₂Pt ₃) ^T, m _Proj=Δ (p ₁, p ₂, p ₃) represent at these 3 at I _{L, 1}The triangle that constitutes on the plane, m _Cyn=Δ (p ₁', p ₂', p ₃') be the corresponding triangle on the Cyn plane, promptly

p_{iT 1} = {(\begin{matrix} x_{1}^{{pt}_{iT 1}} & y_{1}^{{pt}_{iT 1}} \end{matrix})}^{T},

p_{iT 1}' = {(\begin{matrix} {1 g}_{{pt}_{iT 1}} & {lt}_{{pt}_{iT 1}} \end{matrix})}^{T},

IT1=1,2,3; Diabolo m again _CynInterior point [lg lt] is calculated as follows [x y]:

[\begin{matrix} x \\ y \\ 1 \end{matrix}] = {[\begin{matrix} a_{1} & a_{2} & a_{3} \\ a_{4} & a_{5} & a_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

a ₁～a ₆Obtain by following formula:

[\begin{matrix} a_{1} \\ a_{2} \\ a_{3} \\ a_{4} \\ a_{5} \\ a_{6} \end{matrix}] = {[\begin{matrix} x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 \\ x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 \\ x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Make positive texture maps again

I_{{Texture}_{1}} (1 g, lt) = I_{L, 1} (x, y);

(5.3), generate side grain figure:

Have a few

In known lateral attitude parameters R _{G, side}, T _{G, side}Press following formula down at lateral plan I _{L, side}Last projection,

{\overset{&RightArrow;}{p}}_{side}^{pt} = {(\begin{matrix} x_{side}^{pt} & y_{side}^{pt} \end{matrix})}^{T}

Be Face _sMiddle three-dimensional point

At I _{L, side}On two-dimensional projection:

\{\begin{matrix} x_{side}^{pt} = \frac{R_{1 g, side}^{T} M_{s}^{pt} + T_{1 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \\ y_{side}^{pt} = \frac{R_{2 g, side}^{T} M_{s}^{pt} + T_{2 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \end{matrix},

R_{g, side} = (\begin{matrix} R_{1 g, side}^{T} \\ R_{2 g, side}^{T} \\ R_{3 g, side}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{1 g, side} \\ T_{2 g, side} \\ T_{3 g, side} \end{matrix});

Again to Face _sIn arbitrary dough sheet facet=(pt ₁Pt ₂Pt ₃) ^T, establish ms _Proj=Δ (ps ₁, ps ₂, ps ₃),

{ps}_{iT 2} = {(\begin{matrix} x_{side}^{{pt}_{iT 2}} & y_{side}^{{pt}_{iT 2}} \end{matrix})}^{T},

IT2=1,2,3; Diabolo m again _CynInterior point [lg lt] is calculated as follows [xs ys]:

[\begin{matrix} xs \\ ys \\ 1 \end{matrix}] = {[\begin{matrix} {as}_{1} & {as}_{2} & {as}_{3} \\ {as}_{4} & {as}_{5} & {as}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

As ₁～as ₆Obtain by following formula:

[\begin{matrix} {as}_{1} \\ {as}_{2} \\ {as}_{3} \\ {as}_{4} \\ {as}_{5} \\ {as}_{6} \end{matrix}] = {[\begin{matrix} x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 \\ x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 \\ x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Make side grain figure again

I_{{Texture}_{side}} (1 g, lt) = I_{L, side} (xs, ys);

(5.4), the mirrored sides texture maps is to obtain the texture maps of opposite side, and this is because Face _sTopological structure be complete symmetry;

If: that side arbitrary tri patch m on the face that directly obtains texture _Cyn=Δ (p ₁, p ₂, p ₃), it is m at opposite side symmetrical dough sheet on the face _Cyn'=Δ (p ₁', p ₂', p ₃'), wherein

p_{iT 3} = {(\begin{matrix} {1 g}_{{pt}_{iT 3}} & {lt}_{{pt}_{iT 3}} \end{matrix})}^{T},

p_{iT 3}' = {(\begin{matrix} {1 g'}_{{pt}_{iT 3}} & {lt'}_{{pt}_{iT 3}} \end{matrix})}^{T},

iT3＝1，2，3；

To m _CynIn any point p=(lg lt) ^TBe calculated as follows [lg ' lt '],

[\begin{matrix} 1 g' \\ lt' \\ 1 \end{matrix}] = {[\begin{matrix} {rs}_{1} & {rs}_{2} & {rs}_{3} \\ {rs}_{4} & {rs}_{5} & {rs}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

Rs ₁～rs ₆Obtain by following formula:

[\begin{matrix} {rs}_{1} \\ {rs}_{2} \\ {rs}_{3} \\ {rs}_{4} \\ {rs}_{5} \\ {rs}_{6} \end{matrix}] = {[\begin{matrix} {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 \\ {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 \\ {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Generate opposite side from a directly veined lateral reflection again:

I_{{Texture}_{side}} (1 g', lt') = I_{{Texture}_{side}} (1 g, lt);

(5.5), the fusion of positive and the texture maps side obtains final texture maps Texture _s

Setting threshold lg _MinAnd lg _Max, Texture _sAt arbitrary position Cyn=(lglt) ^TThe value at place provides by following formula:

2. sequence { bReliable is reliably measured in the renewal in the described attitude estimating step _kMethod, it contains successively and has the following steps:

(1). according to sequence Err _tStatistical study calculate correct corresponding number K_True;

(1.1), establish: the initial value of K_True is Kr (t) * ratio, and ratio is predefined constant; Err _t={ ε _s=‖ M _T+1(k _s)-R _tM _t(k _s)-T _t‖ ²| s=1 ..., Kr (t) in be E by the subsequence of preceding K_True element composition of ordering from small to large;

(1.2), calculate the average of E

\overset{&OverBar;}{ϵ} = \frac{1}{K_True} \underset{ϵ_{s} &Element; E}{Σ} ϵ_{s}

And standard deviation

σ = \sqrt{\frac{1}{K_True - 1} \underset{ϵ_{s} &Element; E}{Σ} {(ϵ_{s} - \overset{&OverBar;}{ϵ})}^{2}};

(1.3), at Err _tThe middle searching satisfied ‖ ε _s-ε ‖〉point of 3* σ, establishing number is InValid_T;

(1.4), if 0≤Kr (t)-K_True-InValid_T≤1, promptly K_True correspondence is correct, stops; Otherwise, change step (1.5);

(1.5), if K-K_True-InValid_T〉1, make K_True add 1, change step (1.2); Otherwise, change step (1.6);

(1.6), if K-K_True-InValid_T＜0 makes K_True subtract 1, change step (1.2);

(2). establish Err _tIn before K_True less ε _sCorresponding label is { s _u, u=1 ..., K_True,

In be Yes corresponding to the position assignment of these labels, other assignment are No;

3. the two method of the Boolean denotation amount bReDetect whether setting up in the described attitude estimating step will call feature detection and matching module again, the Boolean denotation amount bRefine that whether will carry out the attitude refinement is as follows:

(1). if t=N, promptly current image is the last frame in the sequence, bReDetect=No then, bRefine=Yes finishes; Otherwise, change step (2);

(2). if current R _tRotation angle θ _tSurpass preset threshold value θ _ bound, bReDetect=Yes then, bRefine=Yes finishes; Otherwise, change step (3);

(3). (i.e. { the b Reliable if the number Kr (t) of reliable three-dimensional point _k, k=1 ..., value is counting of Yes among the K (t)) less than preset threshold value K_bound, bReDetect=Yes then, bRefine=Yes; Otherwise, change step (4);

(4) .bReDetect=No; If t-t0 is the multiple of the constant Inc_refine that presets, bRefine=Yes then, i.e. refinement is just to do once every Inc_refine; Otherwise bRefine=No;

(5). make t=t+1, enter next processing constantly;

4. in the described attitude estimating step, bRefine is that Yes is illustrated in the current time refinement that gestures, and promptly draws more accurate attitude parameter:

Come all relevant attitude parameter { R of refinement if be chosen in N continuous um=t-tStart+1 correct all the time three-dimensional correspondence of the moment _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1, tStart are the initial moment of refinement computing, now get t second before constantly the moment that bRefine is Yes is set, if tStart＜t0 gets it and is t0; In a refinement computing, calculate { Ra earlier _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart; M _TStart(k _Pp), pp=1 ..., Kr (t) }, make that the total error f_obj of two-dimentional re-projection reaches minimum value in all images:

f_obj = \underset{m}{Σ} Σ_{τ = 0}^{t - tStart} Σ_{pp = 1}^{Kr (t)} [{(a_{pp, m}^{τ} - x_{pp, m}^{τ})}^{2} + {(b_{pp, m}^{τ} - y_{pp, m}^{τ})}^{2}];

Ra wherein _{TStart+ τ}, Ta _{TStart+ τ}Be to be carved into tStart+ τ rigid body translation coefficient constantly during from tStart; { M _TStart(k _Pp), pp=1 ..., Kr (t) is reliable three-dimensional point array in this Num the moment, and Kr (t) is individual altogether, and pp is the label of three-dimensional point; M is L or R, the expression video camera;

Be in m video camera, tStart+ the τ two-dimensional detection in the image or the result of tracking constantly, promptly

[\begin{matrix} a_{pp, m}^{τ} \\ b_{pp, m}^{τ} \\ 1 \end{matrix}] = A_{m}^{- 1} [\begin{matrix} p_{m, tStart + τ} (k_{pp}) \\ 1 \end{matrix}],

A _mBe A _LOr A _R

Be the pp three-dimensional point constantly, project to the result on the m video camera, promptly at tStart+ τ

\{\begin{matrix} x_{pp, m}^{τ} = \frac{R_{1 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{1 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \\ y_{pp, m}^{τ} = \frac{R_{2 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{2 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \end{matrix},

Wherein

R_{c, m} = (\begin{matrix} R_{1 c, m}^{T} \\ R_{2 c, m}^{T} \\ R_{3 c, m}^{T} \end{matrix})

With

T_{c, m} = (\begin{matrix} T_{1 c, m} \\ T_{2 c, m} \\ T_{3 c, m} \end{matrix})

Be the rigid body translation that is tied to the m camera coordinate system from the L camera coordinates, promptly

R_{c, L} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

T_{c, L} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}],

R_{c, R} = R_{c}^{- 1},

T_{c, R} = - R_{c}^{- 1} T_{c, R};

In addition,

{Ra}_{tStart} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

{Ta}_{tStart} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}];

Optimize the exact value that f_obj just can obtain attitude parameter; Earlier find the solution { Ra with the Levenberg-Marquardt algorithm _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart; M _TStart(k _Pp), pp=1 ..., Kr (t) }; Use following formula { Ra again _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart is transformed into { R _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1:

R _tStart＝Ra _tStart+1，T _tStart＝Ta _tStart+1，

\{\begin{matrix} R_{tStart + τ - 1} = {Ra}_{tStart + τ} {Ra}_{tStart + τ - 1}^{- 1} \\ T_{tStart + τ - 1} = {Ta}_{tStart + τ} - R_{tStart + τ - 1} * {Ta}_{tStart + τ - 1} \end{matrix}, τ = 2, . . ., t - tStart .

Evidence: it has reached intended purposes.

Description of drawings

Fig. 1. people's face modeling process flow diagram

Fig. 2. attitude is estimated process flow diagram

Fig. 3. the texture process flow diagram

Fig. 4. the relation of two-dimensional coordinate system on camera coordinate system and the picture plane

Fig. 5. the remarkable characteristic of front face and line segment

Fig. 6. typical people's face side projection and some side unique points

Fig. 7. the front human face outline that stack shows

Fig. 8. the front face feature contour point that stack shows

Fig. 9. medial view 1 facial contour that stack shows (25 degree angle)

Figure 10. medial view 2 facial contours that stack shows (40 degree angle)

Figure 11. the stereo camera configuration of placing up and down

Figure 12. the typical stereographic map of system acquisition is to (two row correspond respectively to top up and down, bottom video camera, each row is images of synchronization, the numeral that marks is the frame number in whole sequence)

Figure 13. the neutrality of universal model and animated state (from left to right being neutral line chart, neutral dough sheet figure, animation line chart and animation dough sheet figure successively)

Figure 14. the experimental result of feature detection and coupling under the different attitudes (first row is a testing result, second row be corresponding stereographic map to matching result; The numeral that marks is the frame number in whole sequence)

Figure 15. the stereographic map of positive attitude on the two-dimentional remarkable characteristic (unique point stack be presented on the raw image) that marks

Figure 16. the stack of side unique point shows

Figure 17. the actual Texture that obtains ₁

Figure 18. the Texture that the 35 step directly obtained _Side

Figure 19. the Texture that finally obtains _Side

Figure 20. cylinder texture maps Texture _s

Figure 21. shone upon Texture _sModeling result Face _s(checking) from different visual angles

Embodiment

We create the faceform under the framework of a universal model specialization, that is: from universal model Face _gBeginning is progressively made amendment according to the two-dimensional signal of the modeling object that obtains, and finally obtains specific human model Face _sIn this course, the three-dimensional point coordinate among the just P of modification, F then remains unchanged, and promptly constant (P and F have represented the shape of people's face to topological structure jointly, i.e. Face={P, F}.Particularly, P={M ^Pti=(X ^PtiY ^PtiZ ^Pti) ^T| pti=1 ..., nVertex} is people's three-dimensional node tabulation on the face, and F={facet _Fj=(pt _{1, fi}Pt _{2, fi}Pt _{3, fj}) ^T| fj=1 ..., nMesh} is people's triangle surface tabulation on the face, wherein each facet _Fj=(pt _{1, fi}Pt _{2, fi}Pt _{3, fi}) ^TExpression is by three points among the P

A triangle of forming.Pt _{Pti, fi}, pti=1 ..., 3, fi=1 ..., nMesh is the sequence number of each point among the P, satisfies 1≤pt _{Pti, fj}≤ nVertex.NVertex and nMesh are respectively the total numbers of three-dimensional point and tri patch.Generally speaking, P representation model shape, F is the topological structure of representation model then).The topological structure of model can be kept along this technology path, thereby human face animation can be realized easily; Simultaneously, this universal model has also been represented our general knowledge to this closely similar object class of people's face, can provide additional constraint for concrete processing procedure.

Below we will describe modeling procedure in detail, and provide a concrete embodiment.

The overall procedure of people's face modeling as shown in Figure 1, the front and back of each step orders and input and output all clearly mark." processing module " this part, the numeral number in the block diagram is in detail this module to be set forth the little section number of part in this section.In addition, " input " refer to before those modelings by the ready data of manual intervention (video data, Face _g, and need the manual two-dimensional points that marks), " intermediate result " then is the data that system obtains in the modeling flow process automatically.

The stereoscopic video sequence of input is designated as I _{L, t}And I _{R, t}, t=1 ..., N represents the different moment, and N then is the totalframes of video sequence, and L, R be corresponding to two video cameras, the two-dimensional array that I then is made up of pixel, each pixel comprises R, G, three components of B.The moment of t=1 is selected by people's judgement is manual corresponding to the positive attitude of people's face.

Below we describe the concrete disposal route of each step in detail.

1.1 attitude is estimated

This step will be found the solution the attitude parameter (being also referred to as the rigid body translation coefficient) in the number of people rotary course.Be designated as { R _t, T _t, t=1 ..., N-1}, wherein each R _t, T _tBe the people face rigid body translation coefficient of t frame, constantly and the t+1 moment arbitrary promptly to three-dimensional corresponding point M for t to the t+1 frame _tAnd M _T+1(so-called corresponding, as to be meant that they are that people's same position on the face is at difference three-dimensional coordinate constantly) has following formula

M _t+1＝R _tM _t+T _t，t＝1，...，N-1

Wherein the coordinate system at three-dimensional point place is the L camera coordinate system.The definition of camera coordinate system is a true origin with the photocentre C of perspective video camera as shown in Figure 4, and the Z axle is along optical axis direction, X, and the x of Y-axis two-dimensional coordinate system respectively and on the picture plane, the y axle is parallel.

The entire process flow process that attitude is estimated as shown in Figure 2, below we do simple declaration to the principle of algorithm earlier, introducing each step in detail.

Our thinking is to utilize the motion of reliable three-dimensional point to come estimated parameter, and three-dimensional point needs (to be called as " two dimensional character " below according to the two-dimentional corresponding point that project on the different images.About " two dimension is corresponding ", be meant that same three-dimensional point all is projected as two-dimensional points in different images, be called two-dimentional corresponding point mutually between them; Wherein stereographic map to two-dimentional correspondence be called " coupling ", the different two-dimentional correspondences between images constantly that same video camera is taken then are called " tracking ".Here " stereographic map to " refers to a pair of image that synchronization L and R video camera are taken) calculate.Particularly, we at first will obtain three-dimensional point, and this is to detect and coupling by two dimensional character in the initial moment, obtains by stereo reconstruction then; For the follow-up moment then is to follow the tracks of respectively in L and two image sequences of R, then obtains by stereo reconstruction equally.We just can estimate attitude after the three-dimensional point in the different moment had been arranged.Consider from the number of unknown quantity, only need the corresponding three-dimensional points in three pairs of two moment just can finish, but the three-dimensional point that requires to reconstruct is reliably, and be difficult in the actual experiment guarantee that all three-dimensional point are all reliable.In order to estimate the reliability of three-dimensional point, we are definition and continual renovation array { b Reliable _k, this is boolean's array, value is Yes (representing that this point is reliable) or No (it is unreliable to represent).This array can be updated along with the carrying out of following the tracks of, and it all is reliable that initialization when just beginning have a few, and some was put and then can be judged as insecurely afterwards, and the method that we make this judgement will describe in detail below.

Directly bigger in the attitude parameter error that adjacent two constantly motion calculation go out according to three-dimensional point, we can constantly carry out the attitude refinement to reduce error according to all the reliable three-dimensional point in one period long period at some.When carry out this refinement computing and controlled by Boolean denotation bRefine, its value Yes represents that current time carries out the attitude refinement, and No is then opposite.

Along with the carrying out of following the tracks of, three-dimensional point can be fewer and feweri reliably.In order to make attitude estimate to proceed and to obtain accurate result, we are necessary to carry out again two dimensional character detection and coupling and stereo reconstruction, find out new reliable three-dimensional point.When carry out this operation by Boolean denotation bReDetect control, its value Yes represents that current time need find out the work of new three-dimensional point, and No is then opposite.

The method to set up of these two Boolean denotations of bRefine and bReDetect also will be described in detail below.

Below we elaborate complete process:

1.1.1 the detection of two dimensional character

We utilize existing KLT algorithm to finish the detection of two dimensional character on gray-scale map.Its principle is: establishing the image that constantly m video camera of t (m is L or R, represents some in two video cameras) obtains is I _{M, t}(p), p=(x, y) any point on the presentation image wherein; The gradient vector of this some place image is

\overset{&RightArrow;}{g} = {&dtri; I}_{m, t} (p) = {(\begin{matrix} \frac{{&PartialD; I}_{m, t}}{&PartialD; x} (p) & \frac{{&PartialD; I}_{m, t}}{&PartialD; y} (p) \end{matrix})}^{T},

Calculate the matrix of 2*2

G = \underset{w}{&Integral;} \overset{&RightArrow;}{g} {\overset{&RightArrow;}{g}}^{T} dp,

Wherein W is to be a rectangular window at center with p.We obtain the minimum singular value sing_min (utilizing existing matlab kit) of G, and select the very big p point of those sing_min as detected unique point.On physical significance, G is actual to be the expression that local gray level on the image is changed the degree of enriching: if local gray scale is constant, at this moment gradient vector g is 0, and G is 0, and all singular values all are 0; If local grey scale change has fixing direction (for example image is gone up more intense single edge), gradient vector g is substantially perpendicular to this direction, and at this moment the singular value of G minimum approaches 0; It is very big only all to compare the minimum singular value that just satisfies G in significant part (for example intersection point on two limits) in the grey scale change along all directions.Therefore, we are actual to be to select those local gray levels to change abundant point as unique point, and such some degree of reliability when following the tracks of and mate is higher.

We need do the discretize processing to top formula during computer realization, and concrete computing formula is as follows:

\overset{&RightArrow;}{g} (i, j) = {[\begin{matrix} gx & gy \end{matrix}]}^{T} = {[\begin{matrix} \frac{I_{m, t} (i + 1, j) - I_{m, t} (i - 1, j)}{2} & \frac{I_{m, t} (i, j + 1) - I_{m, t} (i, j - 1)}{2} \end{matrix}]}^{T} - - - (1.1.1)

G (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) {\overset{&RightArrow;}{g}}^{T} (i + ii, j + jj) - - - (1.1.2)

Here blockx, blocky has determined the net point number that window W comprises.In fact, (i j) is the center to this window, and net point adds up to (2*blockx+1) * (2*blocky+1) with tested point.

In Fig. 2, t detected unique point constantly is designated as { p _{L, t}(k) }, L, t presentation image sequence number.The total number of unique point is K (t), and k satisfies 1≤k≤K (t), the label of expression point.

At this moment we make t0=t, and which these features that expression will be tracked be detected constantly at.

1.1.2 two dimensional character coupling

Matching problem is finished according to known method.It is as follows simply to introduce its principle:

If the configuration relation between two video cameras is represented with following formula

M _L＝R _cM _R+T _c (1.1.3)

L and R represent two camera coordinate systems, M _LAnd M _RBe the coordinate representation of same three-dimensional point in these two coordinate systems; R _cAnd T _c=[T _{C, x}T _{C, y}T _{C, z}] ^TBe respectively rotation and translational component (being collectively referred to as the outer parameter of stereo camera, is known quantity), the former is the matrix of a 3*3, and the latter is a tri-vector.

After (1.1.3) arranged, order

F = {[\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}]}^{- T} [\begin{matrix} 0 & - T_{c, z} & T_{c, y} \\ T_{c, z} & 0 & - T_{c, x} \\ - T_{c, y} & T_{c, x} & 0 \end{matrix}] R_{c} {[\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}]}^{- 1}

(this matrix is called as basis matrix), wherein f _L, f _R, u _0L, v _0L, u _0R, v _0RBeing the intrinsic parameter of video camera, all is known quantity; Its matrix representation is

A_{L} = [\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}],

A_{R} = [\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}] .

Then for L, 1 p of certain among the t figure _{L, t}(k)=(x _{L, t}(k) y _{L, t}(k)) T, it is at R, the match point p among the t figure _{R, t}(k)=(x _{R, t}(k) y _{R, t}(k)) T should satisfy following known formula (1.1.4):

{[\begin{matrix} p_{L, t} (k) \\ 1 \end{matrix}]}^{T} F [\begin{matrix} p_{R, t} (k) \\ 1 \end{matrix}] = 0 - - - - (1.1.4)

(1.1.4) physical significance of formula is p _{R, t}(k) be positioned at by p _{L, t}(k) on Jue Ding the straight line, reduce to this straight line for the search volume of seeking this point from whole image like this, promptly dropped to one dimension from two dimension.In this one-dimensional space, the point of selecting to make total gray difference minimum in the local window Wm promptly will make following (1.1.5) formula minimum as matching result:

Diff＝∫[I _R，t(p _R，t(k))-I _L，t(p _L，t(k))] ²dp (1.1.5)

After the discretize, the formula during actual computation is:

Diff (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {(I_{L, t} (x_{L, t} (k) + ii, y_{L, t} (k) + jj) - I_{R, t} (i + ii, j + jj))}^{2} - - - (1.1.6)

Wherein (i j) is straight line On any point.

T matching result constantly is { p among Fig. 2 _{R, t}(k) }, total number also is K (t), and each is put all and p _{L, t}(k) corresponding.To each p _{L, t}(k) matching process is independently, promptly searches for the position that makes (1.1.6) formula minimum on the straight line of formula (1.1.4) expression.

Obtain the method acquisition three-dimensional point sequence { M of coupling back with stereo reconstruction _t(k) | k=1 ..., K (t) }.The process of stereo reconstruction is independently to each to match point, and principle is as follows:

At first, we have the perspective projection principle of video camera, i.e. Xia Mian known formula,

s_{m} A_{m}^{- 1} [\begin{matrix} p_{m, t} \\ 1 \end{matrix}] = M_{m, t},

Wherein m is L or R, represents in two video cameras any; M _{M, t}Be the coordinate of t moment three-dimensional point in the m camera coordinate system; s _mIt is a scale factor to be asked.We have omitted (k) and have represented any point.

Can solve three-dimensional point, M in conjunction with the configuration relation between two video cameras (1.1.3) again _{L, t}Expression formula be

M_{L, t} = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}] + T_{c})

Wherein

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

And

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}]] .

And M _{R, t}Then can obtain by following formula

M_{R, t} = R_{c}^{- 1} (M_{L, t} - T_{c})

If do not specify, the three-dimensional point that the stereo reconstruction of mentioning in this article goes out all is meant the M in the L camera coordinate system _{L, t}For simplicity, we remove L in statement, are abbreviated as M _tAll unique points are repeated top stereo reconstruction step, and we obtain three-dimensional point sequence { M _t(k) | k=1 ..., K (t) }.

On physical significance, each two-dimensional projection's point is actual to be to have determined three-dimensional straight line, according to a pair of two-dimensional points of same three-dimensional point in two video cameras, by finding the solution the intersection point of two 3 d-lines, just can determine this three-dimensional point like this.

Simultaneously, we also define Boolean variable sequence { bReliable _k, k=1 ..., K (t), it represents whether each three-dimensional point is reliable, and value is Yes or No, and all values in the sequence all is initialized as Yes at present, and it all is reliable that expression is had a few.

1.1.3 the tracking of two dimensional character

Two dimensional character on the gray-scale map is followed the tracks of and is utilized existing KLT algorithm to finish equally.Its principle is: the unique point of establishing detection is at visual I _{M, t}(p) on (m is similarly L or R), wherein p is arbitrary unique point; Now need be at visual I _{M, t+1}This point of last tracking.Based on the coupling of local grain information, order

\overset{&RightArrow;}{e} = \underset{W}{&Integral;} (I_{m, t + 1} (p) - I_{m, t} (p)) \overset{&RightArrow;}{g} dp,

\overset{&RightArrow;}{d} = G^{- 1} \overset{&RightArrow;}{e}

(calculating of G is as previously mentioned), then

Be visual I _{M, t+1}Go up tracking results to p.

It is right to need equally during actual computation

Find the solution and make discretize and handle:

\overset{&RightArrow;}{e} (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) (I_{m, t + 1} (i + ii, j + jj) - I_{m, t} (i + ii, j + jj)) - - - (1.1.7)

The unique point that tracking obtains is expressed as { p in Fig. 2 _{M, t+1}(k) }.

1.1.4 attitude parameter initialization

Identical stereo reconstruction method can obtain t+1 three-dimensional point sequence constantly when using with coupling for the result who follows the tracks of, and we have just obtained one group of three-dimensional corresponding point { M in these two moment (t and t+1) like this _p(k) | k=1 ..., K (t) } and { M _T+1(k) | k=1 ..., K (t) }.Below according to three-dimensional correspondence of this group and the sequence { bReliable that represents its reliability _k, k=1 ..., K (t) (this sequence is called as " reliably measuring sequence ") finds the solution the posture changing parameters R of t frame to the t+1 frame _t, T _t, upgrade reliable tolerance sequence simultaneously.

At first, consider that three pairs of three-dimensional corresponding point just are enough to calculate attitude parameter from the number of unknown quantity.The principle of simply introducing this algorithm known is as follows:

If { (M _Im, M _Im') | im=0,1,2} is three couples of not three-dimensional corresponding point (M of conllinear _ImAnd M _Im' be the coordinate of same three-dimensional point in the L camera coordinate system in two frames of front and back), attitude parameter R _t, T _tSatisfy M _Im' R _tM _Im+ T _t| im=0,1,2.For determining rotational component R wherein _t, we need determine the central shaft that rotates

(being called turning axle, the vector of 3*1) and around this anglec of rotation θ.The definition vector of unit length

{dir}_{im} = \frac{M_{im} - M_{0}}{| | M_{im} - M_{0} | |}, im = 1,2;

With

{dir}_{im}' = \frac{M_{im}' - M_{0}'}{| | M_{im}' - M_{0}' | |}, im = 1,2 .

Then

\hat{r} = \frac{({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2})}{| | ({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2}) | |}

Obtaining turning axle

After, θ can find the solution by following formula:

θ = \arccos \frac{(\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}')}{| | (\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}') | |}

Obtained

After θ, rotation matrix R _tCan provide by following formula:

R_{t} = I + \frac{\sin | | r | |}{| | r | |} {[r]}_{\times} + \frac{1 - \cos | | r | |}{{| | r | |}^{2}} {[r]}_{\times}^{2}

Wherein

r = θ * \hat{r},

[r] _xBe according to 3*1 vector r=[r _xr _yr _z] ^TThe antisymmetric matrix of definition, promptly

{[r]}_{\times} = {[\begin{matrix} r_{x} \\ r_{y} \\ r_{z} \end{matrix}]}_{\times} = [\begin{matrix} 0 & - r_{z} & r_{y} \\ r_{z} & 0 & - r_{x} \\ - r_{y} & r_{x} & 0 \end{matrix}],

And

I = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 1 & 0 & 1 \end{matrix}]

It is unit matrix.

Estimate rotational component R _tAfterwards, translational component T _tCan obtain by following formula:

T_{t} = \frac{1}{3} Σ_{im = 0}^{2} (M_{im}' - R_{t} M_{im})

Though top algorithm can obtain result accurately in theory, numerical stability is very poor, promptly at its input { (M _Im, M _Im') | im=0,1, when 2} had less error, the solving result of algorithm can be very inaccurate.And the input data generally have error, even some are wrong in addition.In order to obtain result accurately in this case, we have adopted the Robust Estimation algorithm based on minimum intermediate value (Least-Median-Square) commonly used.Its ultimate principle is: randomly from current reliable three-dimensional corresponding (i.e. { bReliable _k, k=1 ..., value is the item of Yes among the K (t).If these label is { k _s, s=1 ..., Kr (t), wherein Kr (t) is the total number of reliable point) in choose Num_subset subclass, each subclass Set _n, n=1 ..., Num_subset contains three pairs of three-dimensional corresponding point, estimates attitude parameter R by top algorithm in view of the above _{T, n}, T _{T, n}, then to all reliable three-dimensional corresponding { M _t(k _s), M _T+1(k _s), s=1 ..., Kr (t) calculates ε by following formula (1.1.8) _{N, s}:

ε _n，s＝‖M _t+1(k _s)-R _t，nM _t(k _s)-T _t，n|| ² (1.1.8)

This ε _{N, s}Physical significance be corresponding point M _t(k _s), M _T+1(k _s) and attitude parameter R _{T, n}, T _{T, n}Matching degree, this value is more little, illustrates that both meet more.We come from set { R by formula (1.1.9) _{T, n}, T _{T, n}, n=1 ..., select R among the Num_subset _t, T _t, i.e. { R _t, T _t}={ R _{N_m}, T _{N_m}, wherein

n_m = \underset{n}{\arg \min} {\underset{s}{med}}^{n} {ϵ_{n, s} | s = 1, . . ., Kr (t)} - - - (1.1.9)

Wherein

{{\underset{s}{med}}^{n} | n = 1, . . ., Num_subset};

And

Then be in this intermediate value array, to select to get that n of minimum value.

Find the solution R above _t, T _tAlgorithm can guarantee exist a lot of wrong three-dimensionals at once still obtaining correct result, this be because: the first, according to { M _t(k) | k=1 ..., K (t) } and { M _T+1(k) | k=1 ..., K (t) } in three accurately correspondingly just can calculate attitude parameter; The second, selecting the foundation of attitude parameter by formula (1.1.9) is the intermediate value of whole sequence, is wrong even therefore a lot of three-dimensional correspondences are arranged, and causes a lot of numerical value in this sequence bigger; As long as correct three-dimensional corresponding number surpasses K (t) half, algorithm still can obtain correct attitude estimated result.

Then we need upgrade reliable tolerance sequence { bReliable _k, judge that promptly which three-dimensional correspondence is correct.Concrete grammar is as follows: make Err _t={ ε _s=‖ M _T+1(k _s)-R _tM _t(k _s)-T _t‖ ²| s=1 ..., Kr (t) } represent that current reliable three-dimensional point is to R _t, T _tDegree of support, it is correct establishing K_True (computing method are as described below) in the individual point of this Kr (t) individual, we establish Err _tIn before K_True less ε _sCorresponding label is { s _u, u=1 ..., K_True, then

In corresponding to the position value Yes of these labels, other position is No.

The calculation procedure of K_True is:

(1) initial value that K_True is set is Kr (t) * ratio (ratio is a predefined constant); Note Err _tIn be E by the subsequence of preceding K_True element composition of ordering from small to large;

(2) average of calculating E

\overset{&OverBar;}{ϵ} = \frac{1}{K_True} \underset{ϵ_{s} &Element; E}{Σ} ϵ_{s}

And standard deviation

σ = \sqrt{\frac{1}{K_True - 1} \underset{ϵ_{s} &Element; E}{Σ} {(ϵ_{s} - \overset{&OverBar;}{ϵ})}^{2}};

(3) at Err _tThe middle searching satisfied ‖ ε _s-ε ‖〉point of 3* σ, establishing number is In Valid_T;

(4) if 0≤Kr (t)-K_True-InValid_T≤1 illustrates that current judgement to the sum of errors mistake is compatible, stop, current K_True is net result; Otherwise, change step 5;

(5) if K-K_True-InValid_T〉1, K_True adds 1 (being to add a bit among the E), changes step 2; Otherwise, change step 6;

(6) K_True subtracts 1 (being to reduce a bit among the E), changes step 2.

The principle of this method is the boundary position of sum of errors mistake on statistical significance.

1.1.5 sign bReDetect and bRefine are set

This is the sign amount of two Boolean types, and value is Yes or No.BReDetect represents whether will call feature detection and matching module again, and bRefine then represents whether will carry out the attitude refinement.Their set-up mode is as follows:

(1) if t=N, promptly current image is last width of cloth in the sequence, bReDetect=No then, bRefine=Yes; Finish; Otherwise, change step 2;

(2) if current R _tRotation angle θ _tSurpassed preset threshold value θ _ bound, bReDetect=Yes then, bRefine=Yes; Finish; Otherwise, change step 3;

(3) if the number Kr (t) of reliable three-dimensional point is (i.e. { bReliable _k, k=1 ..., value is the number of the point of Yes among the K (t)) less than preset threshold value K_bound, bReDetect=Yes then, bRefine=Yes; Otherwise, change step 4;

(4) bReDetect=No; If t-t0 is the multiple of the constant Inc_refine that presets, bRefine=Yes; Otherwise bRefine=No.

From the principle, we change need reform when excessive feature detection and coupling in reliably count out deficiency or attitude; Otherwise just do signature tracking.Refinement is then just done once every regular time interval (Inc_refine).

Make t=t+1 now, prepare to enter next processing constantly.

1.1.6 attitude refinement

BRefine is that Yes is illustrated in current time and does the attitude refinement, at this moment system also needs to be provided with the initial moment tStart of refinement computing, actual we get its for t constantly before second the moment that bRefine is Yes is set, be t0 if the tStart that obtains so just makes it less than t0.

1.1.4 the attitude parameter initialization of joint has only utilized the three-dimensional corresponding of two moment (t and t+1).For obtaining more accurate attitude parameter, we select those to come all relevant attitude parameter { R of refinement N continuous um=t-tStart+1 correct all the time three-dimensional correspondence of the moment _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1.Specifically, in a refinement computing, we calculate { Ra earlier _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart; M _TStart(k _Pp), pp=1 ..., Kr (t) }, make that the f_obj in the formula (1.1.10) reaches its minimum value.

f_obj = \underset{m}{Σ} Σ_{τ = 0}^{t - tStart} Σ_{pp = 1}^{Kr (t)} [{(a_{pp, m}^{τ} - x_{pp, m}^{τ})}^{2} + {(b_{pp, m}^{τ} - y_{pp, m}^{τ})}^{2}] - - - (1.1.10)

Ra wherein _{TStart+ τ}, Ta _{TStart+ τ}Be to be carved into tStart+ τ rigid body translation coefficient constantly during from tStart; { M _TStart(k _Pp), pp=1 ..., Kr (t) is reliable three-dimensional point array in this Num the moment, comprises the individual point of Kr (t), pp represents the label of three-dimensional point.M is L or R, represents some video cameras. Be the result of two-dimensional detection or tracking, in m video camera tStart+ τ image constantly, promptly

[\begin{matrix} a_{pp, m}^{τ} \\ b_{pp, m}^{τ} \\ 1 \end{matrix}] = A_{m}^{- 1} [\begin{matrix} p_{m, tStart + τ} (k_{pp}) \\ 1 \end{matrix}] - - - (1.1.11)

And Then be the pp three-dimensional point constantly, project to the result on the m video camera, promptly at tStart+ τ

\{\begin{matrix} x_{pp, m}^{τ} = \frac{R_{1 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{1 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \\ y_{pp, m}^{τ} = \frac{R_{2 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{2 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \end{matrix} - - - (1.1.12)

R_{c, m} = (\begin{matrix} R_{1 c, m}^{T} \\ R_{2 c, m}^{T} \\ R_{3 c, m}^{T} \end{matrix})

With

T_{c, m} = (\begin{matrix} T_{1 c, m} \\ T_{2 c, m} \\ T_{3 c, m} \end{matrix})

Expression is tied to the rigid body translation of m camera coordinate system from the L camera coordinates.Promptly

R_{c, L} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

T_{c, L} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}],

R_{c, R} = R_{c}^{- 1},

T_{c, R} = - R_{c}^{- 1} T_{c, R} .

Have in addition

{Ra}_{tStart} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

{Ta}_{tStart} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}] .

On physical significance, what formula (1.1.10) was represented is the total error of two-dimentional re-projection in all images.Optimize the exact value that this target can obtain attitude parameter.Parameter { Ra wherein _{TStart+ τ}, Ta _TStart+r, τ=1 ..., t-tStart; M _TStart(k _Pp), pp=1 ..., Kr (t) find the solution and can call general Levenberg-Marquardt algorithm (utilizing the matlab kit) and finish.

Obtained { Ra _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., calculate { R by following formula behind the t-tStart _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1, thus reach the refinement purpose.

R _tStart＝Ra _tStart+1，T _tStart＝Ta _tStart+1，

\{\begin{matrix} R_{tStart + τ - 1} = {Ra}_{tStart + τ} {Ra}_{tStart + τ - 1}^{- 1} \\ T_{tStart + τ - 1} = {Ta}_{tStart + τ} - R_{tStart + τ - 1} * {Ta}_{tStart + τ - 1} \end{matrix}, τ = 2, . . ., t - tStart .

The physical significance of top formula is the compound of rigid motion, for example: { R _{TStart+ τ-1}, T _{TStart+ τ-1}Be to be carved into tStart+ τ rigid body translation constantly at tStart+ τ-1 o'clock, and it equals the compound of two rigid body translations, and promptly doing earlier to be carved into tStart rigid body translation constantly from tStart+ τ-1 o'clock (is { Ra _{TStart+ τ-1}, Ta _{TStart+ τ-1}Inverse transformation), be carved into tStart+ τ rigid body translation ({ Ra constantly when remaking from tStart _{TStart+ τ}, Ta _{TStart+ τ}).

1.2 the initialization of model and attitude

Press the thinking of universal model specialization, at first need from Face _gObtain an initial model Face _InitIn addition, obtained { R in the superincumbent attitude estimating step _t, T _t, t=1 ..., N-1}, and in order to obtain the correspondence between model and projection image, we also need to calculate the rigid body translation R from generic coordinate system (the hypomere introduction is an object coordinates system that is fixed on the number of people) to each frame L camera coordinate system _{G, t}, T _{G, t}, t=1 ..., N.These two tasks are finished by model initialization and attitude initialization respectively.

Explain the generic coordinate system earlier.We get the center of gravity Middle that coordinate origin is left eye interior angle LE, right eye interior angle RE, the outer lip LM in a left side, four points of right outer lip RM; Get X-direction for point to the direction of left eye LE from right eye RE, get the direction (direction straight up) that Y direction Cong Xia Chin points to forehead, get Z-direction for point to the direction of nose from hindbrain.Three direction pairwise orthogonals, and constitute right-handed coordinate system.Each unique point above-mentioned is referring to Fig. 5.

The initialized process of model is actual to be that original universal model is applied a change of scale.Promptly finish by following formula (1.2.1):

M_{init} = [\begin{matrix} s_{x} & 0 & 0 \\ 0 & s_{y} & 0 \\ 0 & 0 & s_{z} \end{matrix}] M_{g} - - - (1.2.1)

M wherein _gAnd M _InitBe respectively Face _gAnd Face _InitOn arbitrary group of corresponding point.Three scale factor s _x, s _y, s _zUtilize the distance configuration of people's four principal character points on the face to determine (comprising left inner eye corner, right inner eye corner, the left corners of the mouth and the right corners of the mouth).Particularly: s _xBe modeling object and Face _gThe ratio of middle Seg_E length (distances of two inner eye corners), s _yThen be the ratio of Seg_EM length (distance at mouth center is arrived at two centers), s _zJust be taken as s _yThese line segments also mark in Fig. 5.

Determined behind these three scale-up factors Face _gIn all the point by formula (1.2.1) make change of scale, the result is Face _Init

The initialized task of attitude is to calculate R _{G, 1}, T _{G, 1}, promptly the rigid body translation of L camera coordinate system when the generic coordinate is tied to the 1st frame promptly will be found the solution the parameter in (1.2.2)

M ₁＝R _g，1M _init+T _g，1 (1.2.2)

M wherein ₁And M _InitBe respectively 1 o'clock moment people's face and Face _InitArbitrary group of corresponding point between these two models.

For finding the solution R _{G, 1}, T _{G, 1}, we select the corresponding point of these two models in LE (left eye), RE (right eye) and these three positions of Mid_M (mouth center), obtain the result by the initialized method of attitude in the front 1.1.4 joint.

Obtained R _{G, 1}, T _{G, 1}After, system is compound according to rigid motion, calculates other each attitude parameter R constantly by following formula _{G, t}, T _{G, t}, t=2 ..., N:

R _g，t＝R _t-1R _g，t-1，T _g，t＝R _t-1*T _g，t-1+T _t-1，t＝2，...，N

System can judge the moment side corresponding to lateral plan automatically now, promptly selects side for making R _{G, t}Rotation angle near those t of 90 degree.

1.3 shape corrections A

This step will be Face _InitBe deformed into Face ₁As previously mentioned, we keep the topological structure of Face, and just revise the wherein coordinate of three-dimensional point.Particularly, we determine the reposition of a subclass Subset among the three-dimensional node tabulation P earlier, determine the coordinate of all nodes among the P again by a known radial basis function algorithm.

Simply introduce earlier the radial basis function algorithm basic principle.If will be with Face _StartBe deformed into Face _New, the node among the known Subset is at Face _NewIn coordinate be { New ₁..., New _s, remember that it is at Face _StartIn corresponding point be { Sta ₁..., Sta _S; Here S is the number of Subset mid point.For the arbitrary position pt in the model, it is at Face now _NewIn coordinate

Can obtain by following known formula:

M_{pt}^{new} = Σ_{jRBF = 1}^{S} C_{jRBF} φ (| | M_{pt}^{start} - {Sta}_{jRBF} | |)

Wherein

[\begin{matrix} C_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ C_{S}^{T} \end{matrix}] = {[\begin{matrix} φ (| | {Sta}_{1} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{1} - {Sta}_{S} | |) \\ \cdot \cdot \cdot & \cdot \cdot \cdot & \cdot \cdot \cdot \\ φ (| | {Sta}_{S} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{S} - {Sta}_{S} | |) \end{matrix}]}^{- 1} [\begin{matrix} {New}_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ {New}_{S}^{T} \end{matrix}];

This method is actual to be according to Face _StartMiddle each point determines that to the distance relation of some calibration points (being the point among the Subset) this point is at Face _NewIn the position.φ is known radial basis function (field of definition and codomain all are nonnegative real number), and it is the decreasing function of independent variable r (expression distance), and promptly the calibration point of distance is less to the influence of this point.

This step, we had Face during radial basis function algorithm on utilize _Start=Face _Init, Face _New=Face ₁And the point among the Subset comprises two classes: a class is three-dimensional remarkable characteristic, totally 4, obtain by the stereo reconstruction algorithm of two-dimentional remarkable characteristic (referring to people's left eye interior angle LE, right eye interior angle RE, the outer lip LM in a left side, four projections of point on image plane of right outer lip RM on the face) by the front; Another kind of is that the side unique point (refers to the point that those are easy to recognize, as Xia Chin, mouth, nose, bridge of the nose top, forehead etc. on people's face side projection view.Typical people's face side projection and our some side unique points of definition represent with little circle that in Fig. 6 its example is referring to Figure 16), what reality was used has 24, and their three-dimensional position obtains from the manual two-dimensional points that marks by following step.

If the manual side unique point that marks is

{m_{L, side}^{kk} = {(\begin{matrix} x_{L, side}^{kk} & y_{L, side}^{kk} \end{matrix})}^{T}},

Kk=1 ..., NumSF, wherein NumSF=24 is the number of side unique point, side is the side frame label that the front has obtained.Their three-dimensional coordinates in the generic coordinate system are

{{Ms}_{g}^{kk} = {(\begin{matrix} 0 & Y_{g}^{kk} & Z_{g}^{kk} \end{matrix})}^{T}}, kk = 1, . . ., NumSF;

Then to arbitrary kk, we have

\{\begin{matrix} x_{L, side}^{kk} = \frac{R_{g, side, 1}^{T} {Ms}_{g}^{kk} + T_{g, side, 1}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \\ y_{L, side}^{kk} = \frac{R_{g, side, 2}^{T} {Ms}_{g}^{kk} + T_{g, side, 2}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \end{matrix} - - - (1.3.1)

Wherein

R_{g, side} = (\begin{matrix} R_{g, side, 1}^{T} \\ R_{g, side, 2}^{T} \\ R_{g, side, 3}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{g, side, 1} \\ T_{g, side, 2} \\ T_{g, side, 3} \end{matrix})

It is the representation in components of attitude parameter.Two linear equations are arranged (1.3.1), thereby can directly obtain with the On Solving System of Linear Equations method

In two unknown numbers

With

This method is actual to be the specific position (promptly the X component of coordinate is 0 in the generic coordinate system) that has utilized the side unique point, thereby can directly obtain three-dimensional point from the single width image.

1.4 shape corrections B

This step is utilized the radial basis function algorithm of front, Face equally _Start=Face ₁, Face _New=Face ₂Except comprising the point of having determined lining, front " shape corrections A ", also has the point of some new interpolations among the Subset, promptly according to the manual front human face outline (facial contour on the face attitude of the making a comment or criticism hypograph that marks, polygonal profile with closure represents that an example as shown in Figure 7.This figure has been added to polygonal profile on people's face image, sees clearlyer) recover the three-dimensional point of coming out.The basic skills that obtains these three-dimensional point is that current faceform is done perspective projection to image plane, and calculates the two-dimensional silhouette point of three-dimensional model perspective projection, and the coupling that depends on the facial contour on these point and the image is at last calculated its three-dimensional coordinate.Concrete step is as described below:

At first need to calculate Face ₁Profile in from the t=1 moment (being the manual front view of selecting) to the image plane projection

{Cont}_{1} = {{pt}_{1}^{iC 1} | iC 1 = 1, . . ., nNum 1},

Wherein

Be the label of P (being the three-dimensional node tabulation of model) mid point, obviously have

1 \leq {pt}_{1}^{iC 1} \leq nVertex;

And nNum1 is Cont ₁In the summit number.Cont ₁In point be exactly the point that newly adds in this step shape corrections among the Subset, they are to select by the intersection point of judging projection line and model, promptly on a certain point, projection line and model can not have other intersection point, detailed computing method are as described below:

The note three-dimensional model is Face _Proj(be exactly Face here ₁), the perspective projection center is

Center = - R_{g, 1}^{- 1} T_{g, 1}

Be the coordinate of L video camera photocentre in the generic coordinate system, then we calculated the projection line of this point and the intersection point of model to all summits in the model, judged that in view of the above this summit is whether in projected outline.Particularly, M _NowBe Face _ProjMiddle any point, Plane _JpBe Face _ProjIn the plane that constitutes of arbitrary dough sheet, calculated Center and M _NowStraight line (being projection line) and Plane _JpIntersection point be C _NowIf (as previously mentioned, nMesh is Face to some jp that satisfies 1≤jp≤nMesh _ProjIn the dough sheet sum) C that obtains _NowAt line segment CM _NowInside, M then _NowNot in projected outline; Otherwise just should belong to projected outline.

Then to calculate Cont ₁In any point

At Face ₂Three-dimensional coordinate.If the facial contour on the image is

Cont_{img}_{1} = {(\begin{matrix} x {CI}_{1}^{iCimg} & yC I_{1}^{iCimg} \end{matrix}) | iCimg = 1, . . ., nNumCI 1},

It is the polygon that is made of the two-dimensional points tabulation, and nNumCI1 is the number of point.Then we determine by following method At Face ₂Three-dimensional coordinate:

If

pti = {pt}_{1}^{iC 1}

Point is at Face _gIn normal direction be v _n, at Face ₁In coordinate be

M_{1}^{pti} = {(\begin{matrix} X_{1}^{pti} & Y_{1}^{pti} & Z_{1}^{pti} \end{matrix})}^{T},

And through the projecting direction of this point be

v_{pn} = M_{1}^{pti} + R_{g, 1}^{- 1} T_{g, 1}

(this three is a known quantity), then the pti point is at Face ₂In coordinate

Satisfy

M_{2}^{pti} = M_{1}^{pti} + t_{line} v - - - (1.4.1)

Wherein

v＝(v _x v _y v _z) ^T＝v _pn×(v _n×v _pn) (1.4.2)

Parametric t _LineCan be according to two-dimentional straight line

{(\begin{matrix} \frac{X_{1}^{pti} + t_{line} v_{x}}{Z_{1}^{pti} + t_{line} v_{z}} & \frac{Y_{1}^{pti} + t_{line} v_{y}}{Z_{1}^{pti} + t_{line} v_{z}} \end{matrix})}^{T}

With closed polygon Cont_img ₁Intersection point obtain.Concrete steps are as follows:

If Cont_img ₁In arbitrary line segment can be expressed as seg=(x ₀+ sd _xy ₀+ sd _y) ^T, 0≤s≤1, wherein s is the line segment parameter, other all are known quantities.

We calculate by following formula earlier

[\begin{matrix} si \\ t_{line} \end{matrix}] = {[\begin{matrix} d_{x} & - (X_{1}^{pti} - v_{x} * Z_{1}^{pti} / v_{z}) \\ d_{y} & - (Y_{1}^{pti} - v_{y} * Z_{1}^{pti} / v_{z}) \end{matrix}]}^{- 1} [\begin{matrix} x_{0} - v_{x} / v_{z} \\ y_{0} - v_{y} / v_{z} \end{matrix}]

All seg are obtained this two numerical value si and t _LineDo not have for the formula above a lot of seg and to separate, promptly the matrix in the formula is irreversible.The seg that has the si that separates and obtain to satisfy 0≤si≤1 has one or two, gets and point

{(\begin{matrix} \frac{X_{1}^{pti}}{Z_{1}^{pti}} & \frac{Y_{1}^{pti}}{Z_{1}^{pti}} \end{matrix})}^{T}

That seg that distance is near, the t that it obtains _LineBeing institute asks.

Here v is

The deformation direction of point has utilized the local shape of general people's face to its calculating.In fact, this direction is Face ₁Go up the component of this some place normal direction, can keep local curvature so on the one hand as far as possible perpendicular to projection line; Can reduce deformation distance a little on the other hand as far as possible.

1.5 shape corrections C

This step is utilized the radial basis function algorithm of front, Face equally _Start=Face ₂, Face _New=Face ₃Among the Subset except comprising the point of front " shape corrections A, B " lining having determined, the point that also has some new interpolations, promptly according to the manual front face feature contour point that the marks (point of manual eyes, nostril and the mouth that marks under the face attitude of making a comment or criticism, be some discrete points, as shown in Figure 8, be that stack shows equally) recover the three-dimensional point of coming out.Concrete recovering step is as described below:

If the two-dimensional coordinate of certain feature contour point is

(\begin{matrix} x {CF}_{1}^{iCF} & {yCF}_{1}^{iCF} \end{matrix}),

This point is at Face ₂In coordinate be

M_{2}^{iCF} = {(\begin{matrix} X_{2}^{iCF} & Y_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T},

We suppose that the Z coordinate of this point is correct as calculated, and promptly it is at Face ₃In the Z coordinate equal

Then it is at Face ₃In coordinate be

M_{3}^{iCF} = {(\begin{matrix} x {CF}_{1}^{iCF} * Z_{2}^{iCF} & y {CF}_{1}^{iCF} * Z_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T} - - - (1.5.1)

Be that these feature contour point projections under positive attitude equal the unique point on the actual image.

1.6 shape corrections D

This step is utilized the radial basis function algorithm of front, Face equally _Start=Face ₃, Face _New=Face ₄, except comprising the point of front " shape corrections A, B, C " lining having determined, also have the point of some new interpolations among the Subset, promptly (selection of this view is judged automatically that by system actual is to make R from the manual medial view 1 that marks _{G, t}Rotation angle near the moment t of 25 degree.So-called " medial view " is meant the middle projection attitude between front and side) facial contour (representing with closed polygon that equally typical example the is Fig. 9) three-dimensional point that recovery is come out, concrete recovering step is as described below:

If medial view 1 is similar to the processing of front view (FV) facial contour with front 1.4 joints corresponding to moment int1, also to calculate Face earlier ₃Projected outline when int1

{Cont}_{int 1} = {{pt}_{int 1}^{iC 2} | iC 2 = 1, . . ., nNim_int 1},

Wherein

Be the label of P (being faceform's three-dimensional node tabulation) mid point, and nNum_int1 is the summit number in the projected outline.The computing method of projected outline are (1.4 joint) as previously mentioned.

Cont _Int1In point at Face ₄In coordinate determine with the similar step of 1.4 joints by following: for arbitrary

pt 1 i = {pt}_{int 1}^{iC 2},

Definition

v 2 = {[\begin{matrix} {v 2}_{x} & {v 2}_{y} & {v 2}_{z} \end{matrix}]}^{T} = M_{3}^{pt 1 i} - Center,

Wherein

M_{3}^{pt 1 i} = {[\begin{matrix} X_{3}^{pt 1 i} & Y_{3}^{pt 1 i} & Z_{3}^{pt 1 i} \end{matrix}]}^{T}

Be that this point is at Face ₃In three-dimensional coordinate, and

Center = - R_{g, 1}^{- 1} T_{g, 1}

Be the coordinate of L video camera photocentre in the generic coordinate system as previously mentioned.Then this is at Face ₄In coordinate

Satisfy

M_{4}^{pt 1 i} = M_{3}^{pt 1 i} + t_{line 2} v 2 - - - (1.6.1)

Parametric t wherein _Line2According to two-dimentional straight line

{(\begin{matrix} \frac{X_{3}^{pt 1 i} + t_{line 2} {v 2}_{x}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} & \frac{Y_{3}^{pt 1 i} + t_{line 2} {v 2}_{y}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} \end{matrix})}^{T}

With closed polygon Cont_img _Int1Intersection point obtain, method as 1.4 the joint as described in.

1.7 shape corrections E

This step is utilized the radial basis function algorithm of front, Face equally _Start=Face ₄, Face _New=Face ₅, except comprising the point of front " shape corrections A, B, C, D " lining having determined, also have the point of some new interpolations among the Subset, promptly (selection of this view is judged automatically by system that also actual is to make R from the manual medial view 2 that marks _{G, t}Rotation angle near the moment t of 40 degree) facial contour (typical example is Figure 10) recovers the three-dimensional point of coming out, the processing in concrete recovering step and 1.6 joints is just the same.

1.8 texture

Texture is to be the model Face that creates _sGenerate cylinder texture maps Texture _s, make final modeling result appear to have the same sense of reality of photograph.Present texture is to generate by two images, i.e. the front and the lateral plan (I of L video camera shooting _{L, 1}And I _{L, side}), the ultimate principle of this part is gathering in image conversion to the unified cylindrical coordinate system, and merges, its flow process as shown in Figure 3, concrete steps are as follows:

1.8.1 cylinder unwrapping map generalization

System is at first to Face _sIn arbitrary three-dimensional point

M_{s}^{pt} = {(\begin{matrix} X_{s}^{pt} & Y_{s}^{pt} & Z_{s}^{pt} \end{matrix})}^{T}

Do the cylinder mapping shown in the formula (1.8.1):

\{\begin{matrix} {1 g}_{pt} = \arctan (\frac{Z_{s}^{pt}}{X_{s}^{pt}}) \\ {lt}_{pt} = Y_{s}^{pt} \end{matrix} - - - (1.8.1)

The result of mapping is a two dimensional surface, and the point on it is { Cyn _Pt}={ (lg _PtLt _Pt).Here lg _PtBe the longitude station on the cylinder, represent with radian value; Lt _PtIt then is the coordinate position of cylinder axial rotary.

1.8.2 positive texture maps

To own

At front attitude parameter R _{G, 1}, T _{G, 1}By formula (1.8.2) carries out projection down:

\{\begin{matrix} x_{1}^{pt} = \frac{R_{1 g, 1}^{T} M_{s}^{pt} + T_{1 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \\ y_{1}^{pt} = \frac{R_{2 g, 1}^{T} M_{s}^{pt} + T_{2 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \end{matrix} - - - (1.8.2)

Wherein

R_{g, 1} = (\begin{matrix} R_{1 g, 1}^{T} \\ R_{2 g, 1}^{T} \\ R_{3 g, 1}^{T} \end{matrix}),

T_{g, 1} = (\begin{matrix} T_{1 g, 1} \\ T_{2 g, 1} \\ T_{3 g, 1} \end{matrix})

It is the representation in components of rotation matrix and translation vector.

At this moment

{\overset{&RightArrow;}{p}}_{1}^{pt} = {(\begin{matrix} x_{1}^{pt} & y_{1}^{pt} \end{matrix})}^{T}

Be Face _sMiddle three-dimensional point At front view I _{L, 1}On two-dimensional projection.We obtain an I now _{L, 1}Point on the plane

With put Cyn on the Cyn plane _Pt=(lg _PtLt _Pt) ^TTwo-dimentional corresponding relation, this relation be since they all and three-dimensional point

Corresponding obtaining (respectively by formula 1.8.2 and 1.8.1).We based on this corresponding relation with I _{L, 1}In each pixel mapping to the Cyn plane, promptly to Face _sIn arbitrary dough sheet facet=(pt ₁Pt ₂Pt ₃) ^T, establish m _Proj=Δ (p ₁, p ₂, p ₃) represent at these 3 at I _{L, 1}The triangle that constitutes on the plane, m _Cyn=Δ (p ₁', p ₂', p ₃') be the corresponding triangle on the Cyn plane, promptly

p_{iT 1} = {(\begin{matrix} x_{1}^{{pt}_{iT 1}} & y_{1}^{{pt}_{iT 1}} \end{matrix})}^{T},

p_{iT 1}' = {(\begin{matrix} {1 g}_{{pt}_{iT 1}} & {lt}_{{pt}_{iT 1}} \end{matrix})}^{T},

iT1＝1，2，3。Diabolo m _CynInterior point [lg lt] calculates [xy] by following formula (1.8.4), and order

I_{{Texture}_{1}} (1 g, lt) = I_{L, 1} (x, y) - - - (1.8.3)

Being about to positive texture maps is mapped on the cylinder unwrapping figure.

[\begin{matrix} x \\ y \\ 1 \end{matrix}] = {[\begin{matrix} a_{1} & a_{2} & a_{3} \\ a_{4} & a_{5} & a_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}] - - - (1.8.4)

Six unknown quantity a wherein _IA, iA=1 ..., 6 by formula (1.8.5) obtain.

[\begin{matrix} a_{1} \\ a_{2} \\ a_{3} \\ a_{4} \\ a_{5} \\ a_{6} \end{matrix}] = {[\begin{matrix} x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 \\ x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 \\ x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}] - - - (1.8.5)

The result of positive texture is designated as Texture ₁

1.8.3 side grain figure

Very similar with the processing of front view (FV), will own

In the lateral attitude parameters R _{G, side}, T _{G, side}By formula (1.8.6) carries out projection down:

\{\begin{matrix} x_{side}^{pt} = \frac{R_{1 g, side}^{T} M_{s}^{pt} + T_{1 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \\ y_{side}^{pt} = \frac{R_{2 g, side}^{T} M_{s}^{pt} + T_{2 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \end{matrix} - - - (1.8.6)

Here same important expression

R_{g, side} = (\begin{matrix} R_{1 g, side}^{T} \\ R_{2 g, side}^{T} \\ R_{3 g, side}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{1 g, side} \\ T_{2 g, side} \\ T_{3 g, side} \end{matrix}) .

Then to Face _sIn arbitrary dough sheet facet=(pt ₁Pt ₂Pt ₃) ^T, establish ms _Proj=Δ (ps ₁, ps ₂, ps ₃), wherein

{ps}_{iT 2} = {(\begin{matrix} x_{side}^{{pt}_{iT 2}} & y_{side}^{{pt}_{iT 2}} \end{matrix})}^{T},

iT2＝1，2，3。Diabolo m _CynInterior point [lg lt] calculates [xs ys] by the formula (1.8.8) of back, and order

I_{{Texture}_{side}} (1 g, lt) = I_{L, side} (xs, ys) - - - (1.8.7)

Being about to side grain figure is mapped on the cylinder unwrapping figure.

[\begin{matrix} xs \\ ys \\ 1 \end{matrix}] = {[\begin{matrix} {as}_{1} & {as}_{2} & {as}_{3} \\ {as}_{4} & {as}_{5} & {as}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}] - - - (1.8.8)

Six unknown quantity as wherein _IAS, iAS=1 ..., 6 by formula (1.8.9) obtain.

[\begin{matrix} {as}_{1} \\ {as}_{2} \\ {as}_{3} \\ {as}_{4} \\ {as}_{5} \\ {as}_{6} \end{matrix}] = {[\begin{matrix} x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 \\ x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 \\ x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}] - - - (1.8.9)

The result of side grain mapping is designated as Texture _Side

1.8.4 the reflection of side grain figure

Because people's face is only to a sideway swivel, so can only obtain one-sided side grain, the texture of opposite side can pass through Texture _SideDoing the reflection processing obtains.Specifically, because Face _sTopological structure be complete symmetry, be provided with that side arbitrary tri patch m on the face of direct acquisition texture _Cyn=Δ (p ₁, p ₂, p ₃) and it is at opposite side symmetrical dough sheet m on the face _Cyn'=Δ (p ₁', p ₂', p ₃'), wherein

p_{iT 3} = {(\begin{matrix} {1 g}_{{pt}_{iT 3}} & {lt}_{{pt}_{iT 3}} \end{matrix})}^{T},

p_{iT 3}' = {(\begin{matrix} {1 g'}_{{pt}_{iT 3}} & {lt'}_{{pt}_{iT 3}} \end{matrix})}^{T},

IT3=1,2,3, then to m _CynIn any point p=(lg lt) ^TBy following formula (1.8.9) calculates [lg ' lt '], and order

I_{{Texture}_{side}} (1 g', lt') = I_{{Texture}_{side}} (1 g, lt)

Promptly the lateral reflection from direct acquisition texture generates opposite side.

[\begin{matrix} 1 g' \\ lt' \\ 1 \end{matrix}] = {[\begin{matrix} {rs}_{1} & {rs}_{2} & {rs}_{3} \\ {rs}_{4} & {rs}_{5} & {rs}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}] - - - (1.8.9)

Six unknown quantity rs wherein _IRS, iRS=1 ..., 6 by formula (1.8.10) obtain.

[\begin{matrix} {rs}_{1} \\ {rs}_{2} \\ {rs}_{3} \\ {rs}_{4} \\ {rs}_{5} \\ {rs}_{6} \end{matrix}] = {[\begin{matrix} {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 \\ {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 \\ {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}] - - - (1.8.10)

We have obtained complete Texture now _Side

1.8.5 the fusion of two texture maps

We merge Texture ₁And Texture _SideObtain final texture maps Texture _sParticularly, setting threshold lg _MinAnd lg _Max, Texture _sAt arbitrary position Cyn=(lg lt) ^TThe value at place is determined by following formula (1.8.11)

Modeling method above we have realized in a common table top PC system.This computer CPU is Intel PIII933, is furnished with the Rambus internal memory of 256M, and Microsoft Windows 2000 Professional operating systems have been installed.For carrying out data acquisition, we have used two identical SANYO VCC-5972P type video cameras, and two identical MatroxMeteorII capture cards gather the two-way synchronization video, and picture resolution is 768*576, true colour imagery, 15 frame/seconds of frame rate.(on the one hand, this configuration makes the public view field of two video cameras bigger to stereo camera by configuration up and down shown in Figure 11 during data acquisition; On the other hand, at this moment the direction of outer polar curve substantially vertically, with the people on the face the actual conditions of the most along continuous straight runs of grey scale change adapt), the typical frame in the stereoscopic video sequence of an actual acquisition is as shown in figure 12.The video of actual acquisition has 50 frames (being N=50), and wherein first frame is the manual positive attitude image of selecting.

The universal model Face that we use _gHuman face animation part from the reference software MOMUSYS of MPEG4 is called ISTFACE, by Portuguese Instituto Superior Tecnico exploitation.It comprises the individual node of 1039 (nVertex), form the individual triangle surface of 1704 (nMesh), form a complete number of people, not only comprise people's face surface (for example zone such as organ such as eyes, nose, mouth, ear and cheek, forehead, hindbrain), also comprise eyeball, tooth, these internals of tongue and neck area.It can be driven by MPEG4 human face animation parameter stream, by the distortion realization animation of three dimensional network lattice point.Figure 13 is the neutral state and the animated state (motion of lip) of this model.

Introduce each step in this embodiment below in detail:

The first step: initialization: use camera arrangement shown in Figure 11 to take three-dimensional video-frequency shown in Figure 12; And be provided with:

● time initial value t=t0=1.Wherein the t express time corresponding to the visual sequence number in the video sequence, is represented the image when pre-treatment; The same express time of t0 is represented the moment that current feature of following the tracks of is detected;

● calculate basis matrix

F = A_{L}^{- T} [\begin{matrix} 0 & - T_{c, z} & T_{c, y} \\ T_{c, z} & 0 & - T_{c, x} \\ - T_{c, y} & T_{c, x} & 0 \end{matrix}] R_{c} A_{R}^{- 1}

(wherein the meaning of each known quantity is referring to the explanation in the 1.1.2 joint).Actual numerical value is

R_{c} = [\begin{matrix} 0.999952 & - 0.005045 & + 0.008457 \\ 0.006579 & 0.981257 & - 0.192594 \\ - 0.007327 & 0.192640 & 0.981242 \end{matrix}],

T_{c} = [\begin{matrix} 0.012413 \\ 0.611240 \\ 0.088092 \end{matrix}]

A_{L} = [\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}],

A_{R} = [\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}],

F wherein _L=1330.3, f _R=1330.6, u _0L=384, v _0L=288, u _0R=384, v _0R=288;

● width, height: the width of image and height, in specific implementation, width=768, height=576;

● blockx, blocky: the net point number that has comprised in the local window when having determined feature detection, tracking and coupling.In fact, (i j) is the center to this window, and net point adds up to (2*blockx+1) * (2*blocky+1) with tested point.In specific implementation, blockx=3, blocky=3;

● quality, dis_min: the threshold value during feature point detection, the quality of the former controlling features point, the spacing of latter's controlling features point.In specific implementation, quality=0.01, dis_min=10;

● Num_subset, ratio: the constant that uses during attitude parameter initialization (1.1.4 joint), the former is the number of the subclass selected at random, how the latter upgrades { bReliablek} with deciding.Actual value is Num_subset=100, ratio=0.9;

● θ _ bound, K_bound, Inc_refine: be that we are for calculating the threshold value that bReDetect and bRefine use.Their value is: θ _ bound=20, unit degree of being; K_bound=10; Inc_refine=4;

● lg _MinAnd lg _Max: be used for determining Texture ₁And Texture _SideBetween separation, be a position between the tail of the eye and the ear upper edge.Particularly, lg _MinCorresponding to right side this point on the face, at Face _sMiddle label is 306; Lg _MaxCorresponding to left side this point on the face, at Face _sMiddle label is 120;

● φ: i.e. known radial basis function during shape corrections, we select

φ = {(1 - r)}_{+}^{6} (6 + 36 r + {82 r}^{2} + 72 r^{3} + 30 r^{4} + 5 r^{5}),

Wherein

Just putting it when the representative function value is negative is 0.

Second step: at visual I _{L, t}Last compute gradient vector

{\overset{&RightArrow;}{g}}_{L, t} (i, j) = {[\begin{matrix} gx & gy \end{matrix}]}^{T} = {[\begin{matrix} \frac{I_{L, t} (i + 1, j) - I_{L, t} (i - 1, j)}{2} & \frac{I_{L, t} (i, j) - I_{L, t} (i, j - 1)}{2} \end{matrix}]}^{T}

And

G_{L,t} (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {\overset{&RightArrow;}{g}}_{L, t} (i + ii, j + jj) {\overset{&RightArrow;}{g}}_{L, t}^{T} (i + ii, j + jj)

1≤i≤width wherein, any point on 1≤j≤height presentation image, blockx, blocky are the constants relevant with window size.If used the point that exceeds image border when calculating, directly order

G_{L, t} (i, j) = [\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}] .

Then utilize existing mathematical tool software matlab to G _{L, t}(i j) makes svd, and gets the combination of its minimum singular value and obtain { min_val (i, j) }, 1≤i≤width, and 1≤j≤height, this is the floating number matrix identical with former dimension of picture.

Lift a concrete instance.I at stereoscopic video sequence shown in Figure 12 _{L, 1}On, (i=346, j=297) gray-scale value for every bit on the 7*7 at center (blockx=blocky=3) the local image is with certain point

[\begin{matrix} 48 & 63 & 81 & 97 & 102 & 90 & 72 \\ 57 & 78 & 109 & 135 & 137 & 121 & 96 \\ 57 & 98 & 146 & 175 & 177 & 159 & 130 \\ 61 & 102 & 153 & 180 & 177 & 162 & 133 \\ 59 & 95 & 134 & 150 & 146 & 130 & 100 \\ 54 & 80 & 107 & 121 & 113 & 95 & 72 \\ 48 & 57 & 61 & 69 & 66 & 56 & 50 \end{matrix}]

And { the gx} and { the gy} matrix is of 7*7

{gx} = [\begin{matrix} 9 & 16.5 & 17 & 10.5 & 3.5 & - 15 & - 18.5 \\ 13.5 & 26 & 28.5 & 14 & - 7 & - 20.5 & - 26.5 \\ 28 & 44.5 & 38.5 & 15.5 & - 8 & - 23.5 & - 29.5 \\ 27.5 & 46 & 39 & 12 & - 9 & - 22 & - 31 \\ 25 & 37.5 & 27.5 & 6 & - 10 & - 23 & - 28 \\ 18 & 26.5 & 20.5 & 3 & - 13 & - 20.5 & - 20 \\ 4.5 & 6.5 & 6 & 2.5 & - 6.5 & - 8 & - 3.5 \end{matrix}]

{gy} = [\begin{matrix} 10 & 19.5 & 31.5 & 41.5 & 39 & 32.5 & 23 \\ 4.5 & 17.5 & 32.5 & 39 & 37.5 & 34.5 & 29 \\ 2 & 12 & 22 & 22.5 & 20 & 20.5 & 18.5 \\ 1 & - 1.5 & - 6 & - 12.5 & - 15.5 & - 14.5 & - 15 \\ - 3.5 & - 11 & - 23 & - 29.5 & - 32 & - 33.5 & - 30.5 \\ - 5.5 & - 19 & - 36.5 & - 40.5 & - 40 & - 37 & - 25 \\ - 0.5 & - 13 & - 27.5 & - 33 & - 28.5 & - 20.5 & - 8.5 \end{matrix}]

The matrix of the 2*2 that calculates by aforementioned formula

G = [\begin{matrix} 23311 & 2665.5 \\ 2665.5 & 30700 \end{matrix}],

Two singular value is 31561.2 and 22449.8, so min_val (346,297)=22449.8 (min_val is less in two singular values here, and argument of function is the two-dimensional coordinate that image is gone up point).

The 3rd step: the position of seeking several conditions below satisfying in matrix { min_val (i, j) } is as detected unique point.(1) the min_val value at this some place is bigger; Particularly, the maximal value of establishing whole matrix is max_min_val, and then the value at unique point place should be greater than quality*max_min_val, and wherein quality is a pre-set threshold, actually is taken as 0.01; (2) min_val at this some place is the maximal value in its 3*3 neighborhood; (3) distance of any two unique points should be greater than dis_min: if the distance of two unique points is less than or equal to this value, then a less position of min_val value is not considered to unique point; Dis_min is actual to be taken as 10.We detect I by these three rules _{L, t}On characteristic point sequence, and be designated as { p _{L, t}(k) }, 1≤k≤K (t).The result who detects is referring to Figure 14.The 4th step: to arbitrary p _{L, t}(k), calculate the outer polar curve of its correspondence

And calculate

Diff (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {(I_{L, t} (x_{L, t} (k) + ii, y_{L, t} (k) + jj) - I_{R, t} (i + ii, j + jj))}^{2}

Wherein (i is that (its vector representation is l=[l to straight line l j) ₁l ₂l ₃], promptly the equation of straight line is l ₁I+l ₂J+l ₃=0) any point on promptly satisfies

\frac{| | l_{1} i + l_{2} j + l_{3} | |}{\sqrt{l_{1}^{2} + l_{2}^{2}}} < 1 .

(i_{\min}, j_{\min}) = \underset{(i, j)}{\arg \min} Diff (i, j) .

P then _{L, t}(k) match point is p _{R, t}(k)=(i _Minj _Min) ^TAll detected unique points are carried out identical processing, and we obtain I _{R, t}On the match point sequence, and be designated as { p _{R, t}(k) }, 1≤k≤K (t).

The 5th the step: according to arbitrary to the coupling p _{L, t}(k) and p _{R, t}(k), calculate

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t} (k) \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} (k) \\ 1 \end{matrix}]],

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

And finally obtain three-dimensional point

M_{t} (k) = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t} (k) \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} (k) \\ 1 \end{matrix}] + T_{c}) .

All paired couplings are carried out same treatment, and we obtain t three-dimensional feature point sequence { M constantly _t(k) | k=1 ..., K (t) }.The Boolean variable of initialization simultaneously sequence { bReliable _k, k=1 ..., K (t) is Yes entirely, it all is reliable that expression is had a few.

The 6th step: at arbitrary p _{L, t}(k)=(i j) locates to calculate

{\overset{&RightArrow;}{e}}_{L, t} (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {\overset{&RightArrow;}{g}}_{L, t} (i + ii, j + jj) (I_{L, t + 1} (i + ii, j + jj) - I_{L, t} (i + ii, j + jj)),

And

{\overset{&RightArrow;}{d}}_{L, t} (i, j) = G_{L, t} {(i, j)}^{- 1} {\overset{&RightArrow;}{e}}_{L, t} (i, j) .

Then this some t+1 corresponding point constantly in the L video sequence are

p_{L, t + 1} (k) = p_{L, t} (k) - {\overset{&RightArrow;}{d}}_{L, t} (i, j) .

All unique points are carried out identical processing, and we obtain I _{L, t+1}On tracking results, and be designated as { p _{L, t+1}(k) }, 1≤k≤K (t).

The 7th step: handle the R video sequence with similar method.Particularly, press aforementioned formula earlier, calculate on the t figure at R

{ G _{R, t}(iR, jR) }, then to arbitrary p _{R, t}(k)=(iR jR) calculates

{\overset{&RightArrow;}{e}}_{R, t} (iR, jR) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {\overset{&RightArrow;}{g}}_{R, t} (iR + ii, jR + jj) (I_{R, t + 1} (iR + ii, jR + jj) - I_{R, t} (iR + ii, jR + jj)),

Finally have

{\overset{&RightArrow;}{d}'}_{R, t} (iR, jR) = G_{R, t} {(iR, jR)}^{- 1} {\overset{&RightArrow;}{e}}_{R, t} (iR, jR),

Then this some t+1 corresponding point constantly in the R video sequence are

p_{R, t + 1} (k) = p_{R, t} (k) - {\overset{&RightArrow;}{d}'}_{R, t} (iR, jR) .

All unique points are carried out identical processing, and we obtain I _{R, t+1}On tracking results, and be designated as { p _{R, t+1}(k) }, 1≤k≤K (t).

The 8th step: use with similar method of the 5th step and obtain t+1 three-dimensional point sequence constantly.Particularly, according to arbitrary to the coupling p _{L, t+1}(k) and p _{R, t+1}(k), calculate

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}]],

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

And finally obtain three-dimensional point

M_{t + 1} (k) = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}] + T_{c}) .

All paired tracking results are carried out same treatment, and we obtain t+1 three-dimensional feature point sequence { M constantly _T+1(k) | k=1 ..., K (t) }.

The 9th step: establish sequence { bReliable _k, k=1 ..., value is that the label of the item of Yes is (k among the K (t) _s, s=1 ..., Kr (t), wherein Kr (t) is that value is the total number of Yes.The individual ternary subclass { Set of picked at random Num_subset (actual be taken as 100) _n={ N _{1, n}, N _{2, n}, N _{3, n}, n=1 ..., Num_subset, wherein 1≤N _{1, n}, N _{2, n}, N _{3, n}≤ Kr (t).According to

{M_{t} (k_{N_{1, n}}), M_{t} (k_{N_{2, n}}), M_{t} (k_{N_{3, n}})}

With

{M_{t + 1} (k_{N_{1, n}}), M_{t + 1} (k_{N_{2, n}}), M_{t + 1} (k_{N_{3, n}})}

Corresponding relation, estimate the rigid motion parameters R by the method for 1.1.4 joint _{T, n}, T _{T, n}, then to all reliable three-dimensional corresponding { M _t(k _s), M _T+1(k _s), s=1 ..., Kr (t) calculates ε _{N, s}=‖ M _T+1(k _s)-R _{T, n}M _t(k _s)-T _{T, n}‖ ², n=1 ..., Num_subset; S=1 ..., Kr (t), and obtain { R by following formula _t, T _t}:

{ R _t, T _t}={ R _{N_m}, T _{N_m}, wherein

n_m = \underset{n}{\arg \min} {\underset{s}{med}}^{n} {ϵ_{n, s} | s = 1, . . ., Kr (t)}

The tenth step: the sequence of calculation

{Err}_{t} = {ϵ_{s} = {| | M_{t + 1} (k_{s}) - R_{t} M_{t} (k_{s}) - T_{t} | |}^{2} | s = 1, . . ., Kr (t)},

And renewal { b Reliablek _s, s=1 ..., Kr (t).Particularly, we obtain Err _tIn before K_True less ε _sCorresponding label { s _u, u=1 ..., K_True, then In be changed to Yes corresponding to the position of these labels, other be changed to No.The concrete calculation procedure of K_True as previously mentioned.

The 11 step: four step method by the 1.1.5 joint are provided with the sign amount bReDetect and the bRefine of two Boolean types, and make t=t+1.

The 12 step: if bRefine=No changeed for the 13 step; Otherwise, make tStart be t constantly before second the frame that bRefine is Yes is set, be t0 if the tStart that obtains so just makes it less than t0.Call general Levenberg-Marquardt algorithm (utilizing the matlab kit) then and find the solution formula (1.1.10), obtain { Ra _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart; M _TStart(k _Pp), pp=1 ..., Kr (t) }, calculate { R by following formula again _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1:

R _tStart＝Ra _tStart+1，T _tStart＝Ta _tStart+1，

\{\begin{matrix} R_{tStart + τ - 1} = {Ra}_{tStart + τ} {Ra}_{tStart + τ - 1}^{- 1} \\ T_{tStart + τ - 1} = {Ta}_{tStart + τ} - R_{tStart + τ - 1} * {Ta}_{tStart + τ - 1} \end{matrix}, τ = 2, . . ., t - tStart .

The 13 step:, changeed for second step if bReDetect=Yes then makes t0=t; Otherwise, make t=t+1, if t=N at this moment then changeed for the 14 step, otherwise changeed for the 6th step;

Above each step finished attitude estimation (contents of 1.1 joints).When the input of Figure 12 is handled, feature detection and the coupling the 1st, 15,29,38 frame places (being the different values of t0), actual result as shown in figure 14.Each whole number of features that detect and mate are respectively 89,97,107 and 61 (being each K (t0)), (are sequence { bReliable in the reliable always unique point in all moment (i.e. signature tracking between twice detection) linings that t0 remains unchanged _kIn value be the number that Yes is ordered) have 12,18,36,14 respectively.

The 14 step: at I _{L, 1}And I _{R, 1}Last manual left inner eye corner, right inner eye corner, the left corners of the mouth and these four two-dimentional remarkable characteristics of the right corners of the mouth selected, actual result as shown in figure 15.According to the coupling between two width of cloth figure of these four positions, can obtain four three-dimensional remarkable characteristics by aforesaid stereo reconstruction algorithm, promptly left inner eye corner

Right inner eye corner

The left side corners of the mouth

The right corners of the mouth

Then by formula (1.2.1) is with original universal model Face _gBe deformed into Face _Init

Three scale factor s wherein _x, s _y, s _zComputing method as follows: the order

s_{x} = \frac{dist_3 D (M_{1}^{LE}, M_{1}^{RE})}{dist_3 D (M_{g}^{LE}, M_{g}^{RE})},

Be modeling object and Face _gThe ratio of middle Seg_E length (distances of two inner eye corners); Order

s_{x} = \frac{dist_3 D (M_{1}^{LE} + M_{1}^{RE}, M_{1}^{LM} + M_{1}^{RM})}{dist_3 D (M_{g}^{LE} + M_{g}^{RE}, M_{g}^{LM} + M_{g}^{RM})},

Be the ratio of Seg_EM length (distance at mouth center is arrived at two centers); Make s _z=s _yWherein

Be at Face _gGo up the three-dimensional coordinate of these four principal character points.

The 15 step: then we find the solution the rigid body translation R that the generic coordinate is tied to the L camera coordinate system _{G, 1}, T _{G, 1}We select

{M_{init}^{LE}, M_{init}^{RE}, (M_{init}^{LM} + M_{init}^{RM}) / 2}

With

{M_{1}^{LE}, M_{1}^{RE}, (M_{1}^{LM} + M_{1}^{RM}) / 2}

These three three-dimensional corresponding point are found the solution by the method for aforesaid attitude parameter initialization one joint.

Obtained R _{G, 1}, T _{G, 1}After, R calculates by following formula in system _{G, t}, T _{G, t}, t=2 ..., N:

Then system can judge the frame side corresponding to lateral plan automatically, promptly selects side for making R _{G, t}Rotation angle near the moment t of 90 degree.Our actual side=50 that obtains, the corresponding anglec of rotation is 86.9 degree.

The 16 step: at visual I _{L, side}The side unique point is picked up in last craft, and experimental result is designated as shown in figure 16

{m_{L, side}^{kk} = {(\begin{matrix} x_{L, side}^{kk} & y_{L, side}^{kk} \end{matrix})}^{T}},

Kk=1 ..., NumSF, wherein NumSF is the number of side unique point, actual is 24.

The 17 step: find the solution formula (1.3.1) by preceding method, obtain three-dimensional side unique point

{{Ms}_{g}^{kk} = {(\begin{matrix} 0 & Y_{g}^{kk} & Z_{g}^{kk} \end{matrix})}^{T}}, kk = 1, . . ., NumSF;

The 18 goes on foot: the radial basis function algorithm in saving by aforesaid 1.3 is Face _InitBe deformed into Face ₁Point among the Subset comprises two classes: a class is three-dimensional remarkable characteristic, totally 4, promptly

Another kind of is the side unique point, and promptly 17 steps were obtained

Kk=1 ..., NumSF.

The 19 step: obtain Face by the method for calculating projected outline in aforementioned 1.4 joints ₁In the point that constantly projected to image plane at 1 o'clock

{Cont}_{1} = {{pt}_{1}^{iC 1} | iC 1 = 1, . . ., nNum 1},

Face ₁Be exactly Face there _Proj

The 20 step: the manual I that marks _{L, 1}On facial contour, as shown in Figure 7.This is the polygon of a closure, is designated as

Cont_{img}_{1} = {(\begin{matrix} x {CI}_{1}^{iCimg} & y {CI}_{1}^{iCimg} \end{matrix}) | iCimg = 1, . . ., nNumCI 1},

Be continuous two-dimentional vertex list, nNumCI1 is the number on two-dimentional summit.

The 21 step: (1.4.1) determines Cont according to aforesaid formula ₁Mid point is at Face ₂In three-dimensional coordinate

{M_{2}^{p t_{1}^{iC 1}} | iC 1 = 1, . . ., nNum 1} .

The 22 goes on foot: the radial basis function algorithm in saving by aforementioned 1.3 is Face ₁Be deformed into Face ₂The new point that adds is to obtain in the 21 step among the Subset

{M_{2}^{p t_{1}^{iC 1}} | iC 1 = 1, . . ., nNum 1} .

The 23 step: the manual I that marks _{L, 1}Front face feature contour point on the figure (experimental result is Fig. 8), the result is

{(\begin{matrix} x {CF}_{1}^{iCF} & {yCF}_{1}^{iCF} \end{matrix}), | iCF = 1, . . ., nNumCF},

Wherein nNumCF is the total number of point.

The 24 step: find the solution the 23 three-dimensional coordinate that goes on foot the feature contour point mark by aforesaid formula (1.5.1)

The 25 goes on foot: the radial basis function algorithm in saving by aforementioned 1.3 is Face ₂Be deformed into Face ₃The new point that adds is to obtain in the 24 step among the Subset

The 26 step: system judges the frame int1 corresponding to medial view 1 automatically, promptly selects int1 for making R _{G, t}Rotation angle near the moment t of 25 degree.Then at I _{L, int1}Craft on the figure marks facial contour, as shown in Figure 9.Represent with closed polygon equally, be designated as

Cont_{img}_{int 1} = {(\begin{matrix} x {CI}_{int 1}^{iCimg 1} & y {CI}_{int 1}^{iCimg 1} \end{matrix}) | iCimg 1 = 1, . . ., nNumCIint 1},

NNumCI int1 is a number of vertex.

The 27 step: obtain Face by the method for calculating projected outline in aforementioned 1.4 joints ₃When moment int1, project to the point of image plane

{Cont}_{int 1} = {{pt}_{int 1}^{iC 2} | iC 2 = 1, . . ., nNum_int 1},

Face ₃Be exactly Face there _Proj

The 28 step: according to Cont_img _Int1Determine Cont _Int1Mid point is at Face ₄In three-dimensional coordinate.Particularly, for appointing

pt 1 i = {pt}_{int 1}^{iC 2},

Definition

v 2 = {[\begin{matrix} {v 2}_{x} & {v 2}_{y} & {v 2}_{z} \end{matrix}]}^{T} = M_{3}^{pt 1 i} + R_{g, 1}^{- 1} T_{g, 1},

Wherein

M_{3}^{pt 1 i} = {[\begin{matrix} X_{3}^{pt 1 i} & Y_{3}^{pt 1 i} & Z_{3}^{pt 1 i} \end{matrix}]}^{T}

Be that this point is at Face ₃In three-dimensional coordinate.Then this is at Face ₄In coordinate

Satisfy

M_{4}^{pt 1 i} = M_{3}^{pt 1 i} + t_{line 2} v 2 .

Parametric t wherein _Line2According to two-dimentional straight line

{(\begin{matrix} \frac{X_{3}^{pt 1 i} + t_{line 2} {v 2}_{x}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} & \frac{Y_{3}^{pt 1 i} + t_{line 2} {v 2}_{y}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} \end{matrix})}^{T}

With closed polygon Cont_img _Int1Intersection point obtain, method as 1.4 the joint as described in.To Cont _Int1In all are pressed top step and calculate, obtain

{M_{4}^{p t_{int 1}^{iC 2}} | iC 2 = 1, . . ., nNum_int 1} .

The 29 goes on foot: the radial basis function algorithm in saving by aforementioned 1.3 is Face ₃Be deformed into Face ₄The new point that adds is to obtain in the 28 step among the Subset

{M_{4}^{p t_{int 1}^{iC 2}} | iC 2 = 1, . . ., nNum_int 1} .

The 30 step: system judges the frame int2 corresponding to medial view 2 automatically, promptly selects int2 for making R _{G, t}Rotation angle near the moment t of 40 degree.Then at I _{L, int2}Craft on the figure marks facial contour, as shown in figure 10.Represent with closed polygon equally, be designated as

Cont_{img}_{int 2} = {(\begin{matrix} x {CI}_{int 2}^{iCimg 2} & y {CI}_{int 2}^{iCimg 2} \end{matrix}) | iCimg 2 = 1, . . ., nNumCIint 2},

NNumCIint2 is a number of vertex.

The 31 step: obtain Face by the method for calculating projected outline in aforementioned 1.4 joints ₄When moment int2, project to the point of image plane

{Cont}_{int 2} = {{pt}_{int 2}^{iC 3} | iC 3 = 1, . . ., nNum_int 2},

Face ₄Be exactly Face there _Proj

The 32 step: according to Cont_img _{Int 2}Determine Cont _{Int 2}Mid point is at Face ₅In three-dimensional coordinate.Particularly, for arbitrary

pt 2 i = {pt}_{int 2}^{iC 3},

Definition

v 3 = {[\begin{matrix} {v 3}_{x} & {v 3}_{y} & {v 3}_{z} \end{matrix}]}^{T} = M_{4}^{pt 2 i} + R_{g, 1}^{- 1} T_{g, 1},

Wherein

M_{4}^{pt 2 i} = {[\begin{matrix} X_{4}^{pt 2 i} & Y_{4}^{pt 2 i} & Z_{4}^{pt 2 i} \end{matrix}]}^{T}

Be that this point is at Face ₄In three-dimensional coordinate.Then this is at Face ₅In coordinate

Satisfy

M_{5}^{pt 2 i} = M_{4}^{pt 2 i} + t_{line 3} v 3 .

Parametric t wherein _Line3According to two-dimentional straight line

{(\begin{matrix} \frac{X_{4}^{pt 2 i} + t_{line 3} {v 3}_{x}}{Z_{4}^{pt 2 i} + t_{line 3} {v 3}_{z}} & \frac{Y_{4}^{pt 2 i} + t_{line 3} {v 3}_{y}}{Z_{4}^{pt 2 i} + t_{line 3} {v 3}_{z}} \end{matrix})}^{T}

With closed polygon Cont_img _Int2Intersection point obtain, method as 1.4 the joint as described in.To Cont _Int2In all are pressed top step and calculate, obtain

{M_{5}^{p t_{int 2}^{iC 3}} | iC 3 = 1, . . ., nNum_int 2} .

The 33 goes on foot: the radial basis function algorithm in saving by aforementioned 1.3 is Face ₄Be deformed into Face ₅The new point that adds is to obtain in the 32 step among the Subset

{M_{5}^{p t_{int 2}^{iC 3}} | iC 3 = 1, . . ., nNum_int 2} .

Face ₅Be final shape Face _s

We have carried out shape corrections altogether five times like this, and corresponding to the shape corrections A-E of 1.3 to 1.7 joints, in fact each goes on foot counting out shown in following table 1 among the Subset that uses.

Different models	Initiate Subset point and number thereof	Subset point sum
Different models	Initiate Subset point and number thereof	Subset point sum	Face ₁	4 positive notable features (two inner eye corners, two corners of the mouths)+24 side features	28
Face ₂	Projected outline's point (41) of front view	69	Face ₁		28
Face ₂	Projected outline's point (41) of front view	69	Face ₃	The feature contour point (44) of two, two nostrils and mouth in the front view	113
Face ₄	Projected outline's point (37) during medial view 1	150	Face ₃		113
Face ₄	Projected outline's point (37) during medial view 1	150	Face _s	Projected outline's point (44) during medial view 2	194

Each goes on foot position and number that initiate Subset is ordered in table 1 shape corrections

The 34 step: to Face _sIn arbitrary three-dimensional point

M_{s}^{pt} = {(\begin{matrix} X_{s}^{pt} & Y_{s}^{pt} & Z_{s}^{pt} \end{matrix})}^{T}

By formula

\{\begin{matrix} {1 g}_{pt} = \arctan (\frac{Z_{s}^{pt}}{X_{s}^{pt}}) \\ {lt}_{pt} = Y_{s}^{pt} \end{matrix}

Do the cylinder mapping, the result is { Cyn _Pt}={ (lg _PtLt _Pt).

The 35 step: will own At front attitude parameter R _{G, 1}, T _{G, 1}Down by formula

\{\begin{matrix} x_{1}^{pt} = \frac{R_{1 g, 1}^{T} M_{s}^{pt} + T_{1 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \\ y_{1}^{pt} = \frac{R_{2 g, 1}^{T} M_{s}^{pt} + T_{2 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \end{matrix}

Carry out projection, obtain

{{\overset{&RightArrow;}{p}}_{1}^{pt}} = {{(\begin{matrix} x_{1}^{pt} & y_{1}^{pt} \end{matrix})}^{T}} .

Then by formula (1.8.3) obtains positive texture maps Texture ₁, concrete instance as shown in figure 17.

The 35 step: similar with the 34 step, obtain Face _sSide grain figure Texture _SideParticularly, will own

In the lateral attitude parameters R _{G, side}, T _{G, side}Down by formula

\{\begin{matrix} x_{side}^{pt} = \frac{R_{1 g, side}^{T} M_{s}^{pt} + T_{1 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \\ y_{side}^{pt} = \frac{R_{2 g, side}^{T} M_{s}^{pt} + T_{2 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \end{matrix}

Carry out projection, obtain

{{\overset{&RightArrow;}{p}}_{side}^{pt}} = {{(\begin{matrix} x_{side}^{pt} & y_{side}^{pt} \end{matrix})}^{T}} .

Then by formula (1.8.7) obtains side grain figure Texture _Side, concrete instance as shown in figure 18.

The 36 step: on cylinder figure { (lg, lt) }, carry out the texture reflection.That side face that directly obtains texture in the actual experiment is the left side, thus this step according to the texture information in left side by formula (1.8.8) generate the right side face.At this moment complete Texture _SideAs shown in figure 19.

The 37 step: we by formula

Merge Texture ₁And Texture _SideObtain final texture maps Texture _s, actual result as shown in figure 20.

We are final, shone upon Texture _sModeling result Face _sAs shown in figure 21.

Claims

1. merge the method for building up of the human face three-dimensional model of various visual angles, multi thread two-dimensional signal, the method that contains the corresponding faceform of establishment of two dimension, it is characterized in that: it at first use with the people on the face, corresponding two the identical and outer polar curve direction video cameras vertically of upper/lower positions people's face of taking modeling object under the situation of the amimia variation of people's face progressively changes about 90 video sequences of spending to the side from the front, gather up and down the two-way synchronization video with two identical capture cards again and import in the computing machine; Then, from the universal model Face of people's face _gBeginning is progressively made amendment according to the two-dimensional signal of the modeling object that obtains, and finally obtains specific people's faceform Face _s, the three-dimensional point coordinate among the just people who wherein the revises three-dimensional node tabulation P on the face, and people triangle surface tabulation F on the face remains unchanged promptly only changes mould shapes P and the topological structure F of model is constant; Particularly, it contains successively and has the following steps:

(1). input stereoscopic video sequence I _{L, t}And I _{R, t}:

(2). attitude is estimated, is promptly found the solution the attitude parameter in the number of people rotary course, i.e. the rigid body translation coefficient:

M _t+1＝R _tM _t+T _t，t＝1，..，N-1，

Attitude is estimated to contain successively to have the following steps:

G (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) {\overset{&RightArrow;}{g}}^{T} (i + ii, j + jj),

Gradient vector for tested point p place image:

\overset{&RightArrow;}{g} (i, j) = {[\begin{matrix} gx & gy \end{matrix}]}^{T} = {[\begin{matrix} \frac{I (i + 1, j) - I (i - 1, j)}{2} & \frac{I (i, j + 1) - I (i, j - 1)}{2} \end{matrix}]}^{T},

Diff (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} {(I_{L, t} (x_{L, t} (k) + ii, y_{L, t} (k) + jj) - I_{R, t} (i + ii, j + jj))}^{2}

(i j) is straight line

On any point; The matching result of carving t that clocks is { p _{R, t}(k) }, total number is that K (t) is individual, every all with p _{L, t}(k) corresponding, on the straight line of following formulate, search for, make Diff (i, j) Zui Xiao position:

{[\begin{matrix} p_{L, t} (k) \\ 1 \end{matrix}]}^{T} F [\begin{matrix} p_{R, t} (k) \\ 1 \end{matrix}] = 0,

Wherein:

F = {[\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}]}^{- T} [\begin{matrix} 0 & - T_{c, z} & T_{c, y} \\ T_{c, z} & 0 & - T_{c, x} \\ - T_{c, y} & T_{c, x} & 0 \end{matrix}] R_{c} {[\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}]}^{- 1},

f _L, f _R, u _OL, v _OL, u _OR, v _ORBe camera intrinsic parameter, known quantity; R _c, T _c=[T _{C, x}T _{C, y}T _{C, z}] ^TBe respectively rotation and translational component, belong to external parameters of cameras, known quantity;

Configuration relation formula according to two video cameras:

s_{m} A_{m}^{- 1} [\begin{matrix} p_{m, t} \\ 1 \end{matrix}] = M_{m, t},

s _m, i.e. s _LOr s _R, be that scale factor undetermined is obtained by formula after a while,

M_{L, t} = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}] + T_{c});

A_{L} = [\begin{matrix} f_{L} & 0 & u_{0 L} \\ 0 & f_{L} & v_{0 L} \\ 0 & 0 & 1 \end{matrix}],

A_{R} = [\begin{matrix} f_{R} & 0 & u_{0 R} \\ 0 & f_{R} & v_{0 R} \\ 0 & 0 & 1 \end{matrix}];

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

And

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t} \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t} \\ 1 \end{matrix}]];

M_{R, t} = R_{c}^{- 1} (M_{L, t} - T_{c});

When two dimensional character was followed the tracks of, the unique point of establishing detection was in image I _{M, t}(p), m is on L or the R, and (x=i is arbitrary unique point y=j) to p, and then it is in image I _{M, t+1}On tracking results be Wherein

\overset{&RightArrow;}{d} = G^{- 1} \overset{&RightArrow;}{e},

G as above-mentioned, and

\overset{&RightArrow;}{e} (i, j) = Σ_{ii = - blockx}^{blockx} Σ_{jj = - blocky}^{blocky} \overset{&RightArrow;}{g} (i + ii, j + jj) (I_{m, t + 1} (i + ii, j + jj) - I_{m, t} (i + ii, j + jj))

For video camera L be:

p_{L, t + 1} (k) = p_{L, t} (k) - {\overset{&RightArrow;}{d}}_{L, t} (p_{L, t} (k)), 1 \leq k \leq K (t);

For video camera R be:

p_{R, t + 1} (k) = p_{R, t} (k) - {\overset{&RightArrow;}{d}}_{R, t} (p_{R, t} (k)), 1 \leq k \leq K (t);

Calculate three-dimensional point { M according to arbitrary trace point to coupling _T+1(k) | k=1 .., K (t) }, as previously mentioned, omitted subscript L here;

M_{t + 1} (k) = \frac{1}{2} (s_{L} A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] + s_{R} R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}] + T_{c});

Wherein

[\begin{matrix} s_{L} \\ s_{R} \end{matrix}] = {(B^{T} B)}^{- 1} B^{T} T_{c},

B = [A_{L}^{- 1} [\begin{matrix} p_{L, t + 1} (k) \\ 1 \end{matrix}] - R_{c} A_{R}^{- 1} [\begin{matrix} p_{R, t + 1} (k) \\ 1 \end{matrix}]];

If: definition is also brought in constant renewal in boolean's array { bReliableZ _kBe reliable tolerance sequence, and Yes represents that reliably No represents unreliable, wherein value is that the label of the item of Yes is { k _s, s=1 ..., Kr (t), Kr (t) they are that value is the sum of the item of Yes, from wherein choosing Num_subset subclass, each subclass contains three couples of three-dimensional corresponding point, i.e. { Set _n={ N _{1, n}, N _{2, n}, N _{3, n}, n=1 ..., Num_subset, 1≤N _{1, n}, N _{2, n}, N _{3, n}≤ Kr (t), three-dimensional corresponding point are in each ternary subclass:

{M_{t} (k_{N_{1, n}}), M_{t} (k_{N_{2, n}}), M_{t} (k_{N_{3, n}})}

With

{M_{t + 1} (k_{N_{1, n}}), M_{t + 1} (k_{N_{2, n}}), M_{t + 1} (k_{N_{3, n}})},

Can note by abridging and be { (M _Im, M _Im') | im=0,1,2}, its represents three pairs of not three-dimensional corresponding point of conllinear, R _t, T _tSatisfy M _Im'=R _tM _Im+ T _t| im=0,1,2; From following formula, obtain R and T then, omit subscript t:

R = I + \frac{\sin | | r | |}{| | r | |} {[r]}_{\times} + \frac{1 - \cos | | r | |}{{| | r | |}^{2}} {[r]}_{\times}^{2},

Wherein

r = θ * \hat{r},

θ = \arccos \frac{(\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}')}{| | (\hat{r} \times {dir}_{1}) \cdot (\hat{r} \times {dir}_{1}') | |},

\hat{r} = \frac{({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2})}{| | ({dir}_{1}' - {dir}_{1}) \times ({dir}_{2}' - {dir}_{2}) | |},

Be the central shaft of rotation, claim turning axle, the vector of 3*1;

θ be around

Rotation angle;

{dir}_{im} = \frac{M_{im} - M_{0}}{| | M_{im} - M_{0} | |}, im = 1,2;

{dir}_{im}' = \frac{M_{im}' - M_{0}'}{| | M_{im}' - M_{0} | |}, im = 1,2;

{[r]}_{\times} = {[\begin{matrix} r_{x} \\ r_{y} \\ r_{z} \end{matrix}]}_{\times} = [\begin{matrix} 0 & - r_{z} & r_{y} \\ r_{z} & 0 & - r_{x} \\ - r_{y} & r_{x} & 0 \end{matrix}],

And

I = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}]

It is unit matrix;

Translational component T is estimated by following formula:

T = \frac{1}{3} Σ_{im = 0}^{2} (M_{im}' - {RM}_{im});

ε _n，s＝‖M _t+1(k _s)-R _t，nM _t(k _s)-T _t，n‖ ²；

{R_{t}, T_{t}} = {R_{n_m}, T_{n_m}}, n_m = \underset{n}{\arg \min} {\underset{s}{med}}^{n} {ϵ_{n, s} | s = 1, . . ., Kr (t)},

{{\underset{s}{med}}^{n} | n = 1, . . ., Num_subset};

And Then be in this intermediate value array, to select to get that n of minimum value;

(2.4), at t=t+1 constantly, be provided with:

Whether need to carry out the Boolean denotation amount bRefine of attitude refinement;

(2.5), when t=N, attitude is estimated to finish;

(3). model initialization and attitude initialization:

M_{init} = [\begin{matrix} s_{x} & 0 & 0 \\ 0 & s_{y} & 0 \\ 0 & 0 & s_{z} \end{matrix}] M_{g},

M ₁=R _{G, 1}M _Init+ T _{G, 1}, M ₁And M _InitPeople's face and Face when being moment t=1 respectively _InitBetween arbitrary group of corresponding point; It is that left eye, RE are that right eye and Mid_M are the corresponding point of these three positions, mouth center at LE that the present invention gets above-mentioned two models, by above-mentioned

(4). utilize three-dimensional information correction faceform:

(4.1), pick up the side unique point by hand:

If: the manual side unique point that marks is

{m_{L, side}^{kk} = {(\begin{matrix} x_{L, side}^{kk} & y_{L, side}^{kk} \end{matrix})}^{T}},

Can obtain by following known formula:

M_{pt}^{new} = Σ_{jRBF = 1}^{S} C_{jRBF} φ (| | M_{pt}^{start} - {Sta}_{jRBF} | |);

[\begin{matrix} C_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ C_{S}^{T} \end{matrix}] = {[\begin{matrix} φ (| | {Sta}_{1} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{1} - {Sta}_{S} | |) \\ \cdot \cdot \cdot & \cdot \cdot \cdot & \cdot \cdot \cdot \\ φ (| | {Sta}_{S} - {Sta}_{1} | |) & \cdot \cdot \cdot & φ (| | {Sta}_{S} - {Sta}_{S} | |) \end{matrix}]}^{- 1} [\begin{matrix} {New}_{1}^{T} \\ \cdot \\ \cdot \\ \cdot \\ {New}_{S}^{T} \end{matrix}];

φ is known radial basis function;

During " shape corrections A " radial basis function algorithm on utilize, Face is set _Start=Face _Init, Face _New=Face ₁, and the point among the Subset comprises two classes: a class is people's left eye interior angle LE, right eye interior angle RE, the outer lip LM in a left side, four three-dimensional remarkable characteristics of right outer lip RM on the face, is drawn by above-mentioned stereo reconstruction method by two-dimentional remarkable characteristic; Another kind of is above-mentioned 24 side unique points, and their three-dimensional position obtains from the manual two-dimensional points that marks by following step:

{{Ms}_{g}^{kk} = {(\begin{matrix} 0 & Y_{g}^{kk} & Z_{g}^{kk} \end{matrix})}^{T}}, kk = 1, . . ., NumSF;

Then, have for arbitrary kk

\{\begin{matrix} x_{L, side}^{kk} = \frac{R_{g, side, 1}^{T} {Ms}_{g}^{kk} + T_{g, side, 1}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \\ y_{L, side}^{kk} = \frac{R_{g, side, 2}^{T} {Ms}_{g}^{kk} + T_{g, side, 2}}{R_{g, side, 3}^{T} {Ms}_{g}^{kk} + T_{g, side, 3}} \end{matrix}

Wherein

R_{g, side} = (\begin{matrix} R_{g, side, 1}^{T} \\ R_{g, side, 2}^{T} \\ R_{g, side, 3}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{g, side, 1} \\ T_{g, side, 2} \\ T_{g, side, 3} \end{matrix})

In two unknown numbers

With

{Cont}_{1} = {{pt}_{1}^{iC 1} | iC 1 = 1, . . ., nNum 1},

Be the label of model three-dimensional node tabulation P mid point,

1 \leq {pt}_{1}^{iC 1} \leq nVertex;

And nNum ₁Be Cont ₁The number on middle summit, they are to the new point that adds of Subset among the shape corrections B; They are to select by the intersection point of judging projection line and model, and promptly on a certain point, projection line and model can not have other intersection points, and its algorithm is as follows: Face ₁Be three-dimensional model, the perspective projection center is

Center = - R_{g, 1}^{- 1} T_{g, 1}

Calculate Cont again ₁In any point

Cont_{img}_{1} = {(\begin{matrix} x {CI}_{1}^{iCimg} & y {CI}_{1}^{iCimg} \end{matrix}) | iCimg = 1, . . ., nNumCI 1},

pti = {pt}_{1}^{iC 1}

At Face ₂Three-dimensional coordinate

M_{2}^{pti} = M_{1}^{pti} + t_{line} v; v = {(\begin{matrix} v_{x} & v_{y} & v_{z} \end{matrix})}^{T} = v_{pn} \times (v_{n} \times v_{pn});

Wherein,

M_{1}^{pti} = {(\begin{matrix} X_{1}^{pti} & Y_{1}^{pti} & Z_{1}^{pti} \end{matrix})}^{T}

Be that the pti point is at Face ₁In coordinate; Warp

Parametric t _LineCan be according to two-dimentional straight line

{(\begin{matrix} \frac{X_{1}^{pti} + t_{line} v_{x}}{Z_{1}^{pti} + t_{line} v_{z}} & \frac{Y_{1}^{pti} + t_{line} v_{y}}{Z_{1}^{pti} + t_{line} v_{z}} \end{matrix})}^{T}

With closed polygon Cont_img ₁Intersection point obtain:

[\begin{matrix} si \\ t_{line} \end{matrix}] = {[\begin{matrix} d_{x} & - (X_{1}^{pti} - v_{x} * Z_{1}^{pti} / v_{z}) \\ d_{y} & - (Y_{1}^{pti} - v_{y} * Z_{1}^{pti} / v_{z}) \end{matrix}]}^{- 1} [\begin{matrix} x_{0} - v_{x} / v_{z} \\ y_{0} - v_{y} / v_{z} \end{matrix}]

{(\begin{matrix} \frac{X_{1}^{pti}}{Z_{1}^{pti}} & \frac{Y_{1}^{pti}}{Z_{1}^{pti}} \end{matrix})}^{T}

If Face _Star,=Face ₂, Face _New=Face ₃, except comprising the point of " shape corrections A, B " lining having determined, also have the point of some new interpolations among the Subset, promptly from the manual front face feature contour point that marks: the three-dimensional point that the point of eyes, nostril and mouth is recovered out:

If the two-dimensional coordinate of certain feature contour point is

(\begin{matrix} x {CF}_{1}^{iCF} & {yCF}_{1}^{iCF} \end{matrix}),

This point is at Face ₂In coordinate be

M_{2}^{iCF} = {(\begin{matrix} X_{2}^{iCF} & Y_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T},

Then it is at Face ₃In coordinate be

M_{3}^{iCF} = {(\begin{matrix} x {CF}_{1}^{iCF} * Z_{2}^{iCF} & y {CF}_{1}^{iCF} * Z_{2}^{iCF} & Z_{2}^{iCF} \end{matrix})}^{T},

{Cont}_{int 1} = {{pt}_{int 1}^{iC 2} | iC 2 = 1, . . ., nNim_int 1},

pt 1 i = {pt}_{int 1}^{iC 2},

It is at Face ₃In three-dimensional coordinate

M_{3}^{pt 1 i} = {[\begin{matrix} X_{3}^{pt 1 i} & Y_{3}^{pt 1 i} & Z_{3}^{pt 1 i} \end{matrix}]}^{T},

And

Center = - R_{g, 1}^{- 1} T_{g, 1}

M_{4}^{pt 1 i} = M_{3}^{pt 1 i} + t_{line 2} v 2,

v 2 = {[\begin{matrix} {v 2}_{x} & {v 2}_{y} & {v 2}_{z} \end{matrix}]}^{T} = M_{3}^{pt 1 i} - Center,

Parametric t _Line2Equally according to straight line

{(\begin{matrix} \frac{X_{3}^{pt 1 i} + t_{line 2} {v 2}_{x}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} & \frac{Y_{3}^{pt 1 i} + t_{line 2} {v 2}_{y}}{Z_{3}^{pt 1 i} + t_{line 2} {v 2}_{z}} \end{matrix})}^{T}

With closed polygon Cont_img _Int1Intersection point obtain;

(4.6), shape corrections E: press aforesaid radial basis function algorithm Face ₄Be deformed into Face ₅, promptly final shape Face _s

(5). texture:

Present texture is front and the lateral plan of taking by the L video camera, i.e. I _{L, 1}And I _{L, side}, generating, texture is to be the model Face that creates _sGeneration appears to have the cylinder texture maps Texture of the same sense of reality of photograph _s, promptly will be in image conversion to the unified cylindrical coordinate system of gathering and merge, it contains successively and has the following steps:

(5.1), generate cylinder unwrapping figure:

M_{s}^{pt} = {(\begin{matrix} X_{s}^{pt} & Y_{s}^{pt} & Z_{s}^{pt} \end{matrix})}^{T}

Do the cylinder mapping by following formula:

\{\begin{matrix} {1 g}_{pt} = \arctan (\frac{Z_{s}^{pt}}{X_{s}^{pt}}) \\ {lt}_{pt} = Y_{s}^{pt} \end{matrix},

(5.2), generate positive texture maps:

Have a few

At known front attitude parameter R _{G, 1}, T _{G, 1}Press following formula down at front view I _{L, 1}Last projection,

{\overset{&RightArrow;}{p}}_{1}^{pt} {(\begin{matrix} x_{1}^{pt} & y_{1}^{pt} \end{matrix})}^{T}

Be Face _sMiddle three-dimensional point

At I _{L, 1}On two-dimensional projection:

\{\begin{matrix} x_{1}^{pt} = \frac{R_{1 g, 1}^{T} M_{s}^{pt} + T_{1 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \\ y_{1}^{pt} = \frac{R_{2 g, 1}^{T} M_{s}^{pt} + T_{2 g, 1}}{R_{3 g, 1}^{T} M_{s}^{pt} + T_{3 g, 1}} \end{matrix},

Wherein

R_{g, 1} = (\begin{matrix} R_{1 g, 1}^{T} \\ R_{2 g, 1}^{T} \\ R_{3 g, 1}^{T} \end{matrix}),

T_{g, 1} = (\begin{matrix} T_{1 g, 1} \\ T_{2 g, 1} \\ T_{3 g, 1} \end{matrix})

p_{iT 1} = {(\begin{matrix} x_{1}^{{pt}_{iT 1}} & y_{1}^{{pt}_{iT 1}} \end{matrix})}^{T},

p_{iT 1}' = {(\begin{matrix} {1 g}_{{pt}_{iT 1}} & {lt}_{{pt}_{iT 1}} \end{matrix})}^{T},

[\begin{matrix} x \\ y \\ 1 \end{matrix}] = {[\begin{matrix} a_{1} & a_{2} & a_{3} \\ a_{4} & a_{5} & a_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

a ₁～a ₆Obtain by following formula:

[\begin{matrix} a_{1} \\ a_{2} \\ a_{3} \\ a_{4} \\ a_{5} \\ a_{6} \end{matrix}] = {[\begin{matrix} x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{1}} & y_{1}^{{pt}_{1}} & 1 \\ x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{2}} & y_{1}^{{pt}_{2}} & 1 \\ x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{1}^{{pt}_{3}} & y_{1}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Make positive texture maps again

(5.3), generate side grain figure:

Have a few

{\overset{&RightArrow;}{p}}_{side}^{pt} = {(\begin{matrix} x_{side}^{pt} & y_{side}^{pt} \end{matrix})}^{T}

Be Face _sMiddle three-dimensional point

At I _{L, side}On two-dimensional projection:

\{\begin{matrix} x_{side}^{pt} = \frac{R_{1 g, side}^{T} M_{s}^{pt} + T_{1 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \\ y_{side}^{pt} = \frac{R_{2 g, side}^{T} M_{s}^{pt} + T_{2 g, side}}{R_{3 g, side}^{T} M_{s}^{pt} + T_{3 g, side}} \end{matrix},

R_{g, side} = (\begin{matrix} R_{1 g, side}^{T} \\ R_{2 g, side}^{T} \\ R_{3 g, side}^{T} \end{matrix}),

T_{g, side} = (\begin{matrix} T_{1 g, side} \\ T_{2 g, side} \\ T_{3 g, side} \end{matrix});

{ps}_{iT 2} = {(\begin{matrix} x_{side}^{{pt}_{iT 2}} & y_{side}^{{pt}_{iT 2}} \end{matrix})}^{T},

[\begin{matrix} xs \\ ys \\ 1 \end{matrix}] = {[\begin{matrix} {as}_{1} & {as}_{2} & {as}_{3} \\ {as}_{4} & {as}_{5} & {as}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

As ₁～as ₆Obtain by following formula:

[\begin{matrix} {as}_{1} \\ {as}_{2} \\ {as}_{3} \\ {as}_{4} \\ {as}_{5} \\ {as}_{6} \end{matrix}] = {[\begin{matrix} x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{1}} & y_{side}^{{pt}_{1}} & 1 \\ x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{2}} & y_{side}^{{pt}_{2}} & 1 \\ x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x_{side}^{{pt}_{3}} & y_{side}^{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Make side grain figure again

I_{{Texture}_{side}} (1 g, lt) = I_{L, side} (xs, ys);

p_{iT 3} = {(\begin{matrix} {1 g}_{{pt}_{iT 3}} & {lt}_{{pt}_{iT 3}} \end{matrix})}^{T},

p_{iT 3}' = {(\begin{matrix} {1 g'}_{{pt}_{iT 3}} & {lt'}_{{pt}_{iT 3}} \end{matrix})}^{T},

iT3＝1，2，3；

To m _CynIn any point p=(lg lt) ^TBe calculated as follows [lg ' lt '],

[\begin{matrix} 1 g' \\ lt' \\ 1 \end{matrix}] = {[\begin{matrix} {rs}_{1} & {rs}_{2} & {rs}_{3} \\ {rs}_{4} & {rs}_{5} & {rs}_{6} \\ 0 & 0 & 1 \end{matrix}]}^{- 1} [\begin{matrix} 1 g \\ lt \\ 1 \end{matrix}];

Rs ₁～rs ₆Obtain by following formula:

[\begin{matrix} {rs}_{1} \\ {rs}_{2} \\ {rs}_{3} \\ {rs}_{4} \\ {rs}_{5} \\ {rs}_{6} \end{matrix}] = {[\begin{matrix} {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{1}} & {lt'}_{{pt}_{1}} & 1 \\ {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{2}} & {lt'}_{{pt}_{2}} & 1 \\ {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & {1 g'}_{{pt}_{3}} & {lt'}_{{pt}_{3}} & 1 \end{matrix}]}^{- 1} [\begin{matrix} {1 g}_{{pt}_{1}} \\ {lt}_{{pt}_{1}} \\ {1 g}_{{pt}_{2}} \\ {lt}_{{pt}_{2}} \\ {1 g}_{{pt}_{3}} \\ {lt}_{{pt}_{3}} \end{matrix}];

Generate opposite side from a directly veined lateral reflection again:

I_{{Texture}_{side}} (1 g', lt') = I_{{Texture}_{side}} (1 g, lt);

Setting threshold lg _MinAnd lg _Max, Texture _sAt arbitrary position Cyn=(lg lt) ^TThe value at place provides by following formula:

2. the method for building up that merges the human face three-dimensional model of various visual angles, multi thread two-dimensional signal according to claim 1, it is characterized in that: sequence { bReliable is reliably measured in the renewal in the described attitude estimating step _kMethod, it contains successively and has the following steps:

(1). according to correct corresponding number K_True is calculated in the statistical study of sequence Errt;

(1.2), calculate the average of E

\overset{&OverBar;}{ϵ} = \frac{1}{K_True} \underset{ϵ_{s} &Element; E}{Σ} ϵ_{s}

And standard deviation

σ = \sqrt{\frac{1}{K_True - 1} \underset{ϵ_{s} &Element; E}{Σ} {(ϵ_{s} - \overset{&OverBar;}{ϵ})}^{2}};

(1.6), if K-K_True-InValid_T＜0 makes K_True subtract 1, change step (1.2);

(2). establish Err _tIn before K_True less ε _sCorresponding label is { s _u, u=1 ..., K_True is { b Reliablek _sIn be Yes corresponding to the position assignment of these labels, other assignment are No.

3. the method for building up that merges the human face three-dimensional model of various visual angles, multi thread two-dimensional signal according to claim 1 is characterized in that: the two method of the Boolean denotation amount bReDetect whether setting up in the described attitude estimating step will call feature detection and matching module again, the Boolean denotation amount bRefine that whether will carry out the attitude refinement is as follows:

(3). (i.e. { bReliable if the number Kr (t) of reliable three-dimensional point _k, k=1 ..., value is counting of Yes among the K (t)) less than preset threshold value K_bound, bReDetect=Yes then, bRefine=Yes; Otherwise, change step (4);

(5). make t=t+1, enter next processing constantly.

4. the method for building up that merges the human face three-dimensional model of various visual angles, multi thread two-dimensional signal according to claim 3, it is characterized in that: in the described attitude estimating step, bRefine is that Yes is illustrated in the current time refinement that gestures, and promptly draws more accurate attitude parameter:

f_obj = \underset{m}{Σ} Σ_{τ = 0}^{t - tStart} Σ_{pp = 1}^{Kr (t)} [{(a_{pp, m}^{τ} - x_{pp, m}^{τ})}^{2} + {(b_{pp, m}^{τ} - y_{pp, m}^{τ})}^{2}];

Ra wherein _{TStart+ τ}, Ta _{TStart+ τ}Be to be carved into tStart+ τ rigid body translation coefficient constantly during from tStart; { M _TStart(k _Pp), pp=1 .., Kr (t) is reliable three-dimensional point array in this Num the moment, and Kr (t) is individual altogether, and pp is the label of three-dimensional point; M is L or R, the expression video camera; Be in m video camera, tStart+ the τ two-dimensional detection in the image or the result of tracking constantly, promptly

[\begin{matrix} a_{pp, m}^{τ} \\ b_{pp, m}^{τ} \\ 1 \end{matrix}] = A_{m}^{- 1} [\begin{matrix} p_{m, tStart + τ} (k_{pp}) \\ 1 \end{matrix}],

A _mBe A _LOr A _R

\{\begin{matrix} x_{pp, m}^{τ} = \frac{R_{1 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{1 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \\ y_{pp, m}^{τ} = \frac{R_{2 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{2 c, m}}{R_{3 c, m}^{T} ({Ra}_{tStart + τ} M_{tStart} (k_{pp}) + {Ta}_{tStart + τ}) + T_{3 c, m}} \end{matrix},

Wherein

R_{c, m} = (\begin{matrix} R_{1 c, m}^{T} \\ R_{2 c, m}^{T} \\ R_{3 c, m}^{T} \end{matrix})

With

T_{c, m} = (\begin{matrix} T_{1 c, m} \\ T_{2 c, m} \\ T_{3 c, m} \end{matrix})

R_{c, L} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

T_{c, L} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}],

R_{c, R} = R_{c}^{- 1}, T_{c, R} = - R_{c}^{- 1} T_{c, R};

In addition,

{Ra}_{tStart} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}],

{Ta}_{tStart} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}];

Optimize the exact value that f_obj just can obtain attitude parameter; Earlier find the solution { Ra with the Levenberg-Marquardt algorithm _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart; M _TStart(kpp), pp=1 ..., Kr (t) }; Use following formula { Ra again _{TStart+ τ}, Ta _{TStart+ τ}, τ=1 ..., t-tStart is transformed into { R _{TStart+ τ}, T _{TStart+ τ}, τ=0 ..., t-tStart-1:

R _tStart＝Ra _tStart+1，T _tStart＝Ta _tStart+1，

\{\begin{matrix} R_{tStart + τ - 1} = {Ra}_{tStart + τ} {Ra}_{tStart + τ - 1}^{- 1} \\ T_{tStart + τ - 1} = {Ta}_{tStart + τ} - R_{tStart + τ - 1} * {Ta}_{tStart + τ - 1} \end{matrix}, τ = 2, . . ., t - tStart .