CN105869178B - A complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization - Google Patents
A complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization
- Publication number: CN105869178B
- Application number: CN201610261547.6A
- Authority: CN (China)
- Prior art keywords: pixel, super-pixel, energy function, image, spatial domain
- Legal status: Active (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G — PHYSICS
- G06 — COMPUTING; CALCULATING OR COUNTING
- G06V — IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00 — Arrangements for image or video recognition or understanding
- G06V10/40 — Extraction of image or video features
- G06V10/44 — Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; connectivity analysis, e.g. of connected components
- G06T — IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00 — Indexing scheme for image analysis or image enhancement
- G06T2207/20 — Special algorithmic details
- G06T2207/20048 — Transform domain processing
- G06T2207/20064 — Wavelet transform [DWT]
Abstract
The present invention relates to a complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization, belonging to the technical fields of image processing and computer vision. The method first extracts coarse-scale target features in the wavelet domain and computes motion and spatial-domain edge responses; the wavelet-domain, spatial and temporal edge features are fused into a multiscale joint probability edge response. A joint-edge interior-mapping weighted total-variation energy functional model is then established and solved by alternating-direction convex optimization, and the unary energy term and the binary spatial-domain energy term of a superpixel-scale spatio-temporal Markov random field are defined accordingly. Finally, a single iteration of graph-cut inference yields the posterior-probability segmentation of the image sequence. The invention can effectively segment, from dynamic scenes, complex targets whose appearance color is inconsistent and whose shape changes over time.
Description
Technical field
The present invention relates to a complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization, and belongs to the technical fields of image processing and computer vision.
Background technology
Segmentation of Image Sequences is the technology of the band of position where semantic foreground target in each frame image of extraction, extensive use
In the computer visions such as target identification, tracking, image understanding and vision guided navigation and robot vision field.In reality, by
In illumination and the variation of environment and the movement of camera itself, cause that actual background is dynamic change and this variation is
Random;In addition, the feature of foreground target also has complexity under normal conditions, even if the appearance color of target is in synchronization
May also be inconsistent, the topological structure of target can also change at any time.Therefore, the automatic of complex target is divided under dynamic scene
Cutting technology becomes difficult point and hot issue that top scientific research institution of various countries competitively chases.
Image sequence segmentation methods fall mainly into two classes: supervised and unsupervised. Early image segmentation methods required manually labeled foreground targets in certain frames. These methods are labor-intensive, since a hand-segmented training set is needed to process the image sequence. Supervised image sequence segmentation methods usually require the user to select semantic regions in key frames. Although such methods achieve segmentation results to some extent, they have limitations, mainly that the segmentation result depends heavily on the choice of the annotated key frames. Unsupervised image segmentation methods do not train on manually labeled images; instead, the computer automatically extracts the regions relevant to the semantic foreground target.
Recent unsupervised image sequence segmentation methods have used dynamic background texture models: a pixel is judged to belong to the background by comparing its current value with past values, and the values of background pixels are propagated into a background extraction model. Gaussian mixture models use a Dirichlet process to estimate the background distribution, treating scene changes as input to a continuously updated model learning process. This approach rests on a strong assumption, namely that the background changes smoothly and slowly, which limits such algorithms for image segmentation in dynamic scenes.
Fast target segmentation algorithms attempt to build a dynamic contour model of the target and assume that the background changes smoothly over time. One advantage of this approach is the ability to handle re-observed (déjà vu) regions on image blocks during label refinement; however, initializing only the points on the moving boundary of the target may produce many false seed points, especially in dynamic scenes. A subsequent method combines hierarchical cues by building a spatio-temporal segmentation tree and allows selection across varying levels of granularity. Although effective for handling hierarchical shape cues, it still has a strong limitation: it causes over-segmentation of the scene and cannot by itself solve the foreground segmentation task.
Another class of unsupervised image sequence segmentation methods ranks target hypothesis regions by structured learning. Using shape and motion consistency, class-target hypothesis regions are grouped and ranked, and intra-frame and inter-frame constraints are introduced into a weighted region graph to find maximum-weight superpixels. Alternatively, graph cuts with color cues realize the segmentation, and regions are ranked by a classification based on shape cues from a multivariate model. Although target hypotheses can benefit ranking at the image-block level, in practice such methods suffer from highly redundant segmentations of different image regions, so even a moderate image sequence incurs high storage and computation costs for selecting target hypotheses.
This invention is supported by National Natural Science Foundation of China projects (61461022, 61302173). Its essence is to explore a multiscale coupling mechanism of global and local features and a robust fusion algorithm for multiscale perceptual-error estimation, to resolve the inconsistency between the segmentation posterior and the true distribution, and to provide a theoretical and methodological foundation for accurate segmentation of complex foreground targets on vision computing platforms in dynamic scenes.
Invention content
Based on the above problems, the present invention provides a dynamic-scene unsupervised segmentation method based on multiscale-feature convex optimization. It addresses complex targets whose appearance color is inconsistent and whose topology changes over time, a setting in which other unsupervised segmentation techniques also find it difficult to segment the foreground target effectively when the target sits in a dynamic background caused by external environment change and camera motion.
The technical scheme of the invention is as follows. A complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization first extracts coarse-scale target features in the wavelet domain and computes motion and spatial-domain edge responses; the wavelet-domain, spatial-domain and temporal edge features are fused into a multiscale joint probability edge response. A joint-edge interior-mapping weighted total-variation energy functional model is then established and solved by alternating-direction convex optimization, and the unary energy term and the binary spatial-domain energy term of a superpixel-scale spatio-temporal Markov random field are defined accordingly. Finally, a single iteration of graph-cut inference yields the posterior-probability segmentation of the image sequence.
The specific steps of the complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization are as follows.
Step 1: coarse-scale target contour feature extraction in the wavelet domain. In the image I_t at each instant t of the image sequence, first compute the maximal edge response of the foreground coarse-scale target contour in the wavelet domain at each pixel location (a, b); T is the total number of video frames.
In Step 1, the maximal edge response at each pixel of the wavelet-domain coarse-scale target contour is defined as:
B_s^t(a, b) = max(H_s^t(a, b), V_s^t(a, b), D_s^t(a, b))
where H_s^t, V_s^t and D_s^t denote the wavelet coefficients of the image in the horizontal, vertical and diagonal directions at scale s and position (a, b) after the wavelet transform.
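As a minimal sketch (not the patent's implementation), the max over the horizontal, vertical and diagonal wavelet detail subbands in Step 1 can be illustrated with a hand-rolled one-level Haar transform; taking the max over coefficient magnitudes is an assumption, since the patent does not state whether signed or absolute coefficients are compared.

```python
import numpy as np

def haar_dwt2(img):
    """One-level 2-D Haar transform: returns (A, H, V, D) subbands."""
    a = (img[0::2, :] + img[1::2, :]) / 2.0   # row averages
    d = (img[0::2, :] - img[1::2, :]) / 2.0   # row differences
    A = (a[:, 0::2] + a[:, 1::2]) / 2.0       # approximation
    H = (d[:, 0::2] + d[:, 1::2]) / 2.0       # horizontal detail
    V = (a[:, 0::2] - a[:, 1::2]) / 2.0       # vertical detail
    D = (d[:, 0::2] - d[:, 1::2]) / 2.0       # diagonal detail
    return A, H, V, D

def max_edge_response(img):
    """B_s(a,b) = max of |H|, |V|, |D| at each subband location."""
    _, H, V, D = haar_dwt2(img.astype(float))
    return np.maximum(np.abs(H), np.maximum(np.abs(V), np.abs(D)))

img = np.zeros((8, 8)); img[:, 3:] = 1.0      # vertical step edge
B = max_edge_response(img)                    # peaks where the edge crosses a column pair
```

A deeper decomposition (scale s > 1) would repeat `haar_dwt2` on the approximation subband A.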
Step 2: compute the variational optical flow vector field of the two images I_t and I_{t+1} at adjacent instants; compute the gradient of the flow-field magnitude by a gradient operator, and linearly normalize the flow gradient to obtain the motion edge.
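Given a flow field from any optical-flow estimator, the rest of Step 2 is a short computation; this sketch assumes the flow is stored as an (H, W, 2) array of (dx, dy) vectors.

```python
import numpy as np

def motion_edge(flow):
    """Motion edge: linearly normalized gradient magnitude of the
    optical-flow field magnitude, as described in Step 2."""
    mag = np.sqrt(flow[..., 0] ** 2 + flow[..., 1] ** 2)   # flow magnitude
    gy, gx = np.gradient(mag)                              # gradient operator
    g = np.sqrt(gx ** 2 + gy ** 2)
    rng = g.max() - g.min()
    # linear normalization to [0, 1]
    return (g - g.min()) / rng if rng > 0 else np.zeros_like(g)

flow = np.zeros((8, 8, 2)); flow[2:5, 2:5, 0] = 1.0   # a moving block
me = motion_edge(flow)                                 # high response on the block boundary
```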
Step 3: establish a structured random forest F composed of N trees, computing the split function of each tree by standard information-gain maximization; N is the number of trees in the forest. Feed the image I_t into the structured random forest F for nonlinear class prediction F(I_t); according to the prediction p(I_t | Tree) of each tree, average the nonlinear prediction results over the forest and normalize them to obtain the spatial-domain edge response F_t(a, b) at each image pixel.
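Training a structured random forest edge detector is beyond a short example, but the ensemble-averaging and normalization step at the end of Step 3 is simple; here the per-tree edge-probability maps are hypothetical stand-ins for a trained forest's outputs.

```python
import numpy as np

def forest_edge_response(per_tree_maps):
    """Average N per-tree edge maps and linearly normalize, yielding a
    spatial-domain edge response F_t(a, b) per pixel (the per-tree maps
    here stand in for a trained structured random forest's predictions)."""
    mean = np.mean(per_tree_maps, axis=0)     # ensemble average over the N trees
    rng = mean.max() - mean.min()
    return (mean - mean.min()) / rng if rng > 0 else np.zeros_like(mean)

trees = np.random.default_rng(0).random((5, 8, 8))   # N = 5 hypothetical tree outputs
resp = forest_edge_response(trees)
```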
Step 4: project the coarse-scale maximal wavelet edge responses down to pixel scale and linearly combine them with the fine-scale motion and spatial-domain edge features; the fusion yields the multiscale joint probability edge response, where B_s^t(a, b) is the maximal edge response at each pixel of the wavelet-domain coarse-scale target contour.
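The fusion formula itself is not reproduced in the text, so this sketch assumes a plain weighted sum with equal weights and nearest-neighbor upsampling of the coarse subband; both choices are assumptions, not the patent's coefficients.

```python
import numpy as np

def fuse_edges(wavelet_edge, motion_map, spatial_map, weights=(1/3, 1/3, 1/3)):
    """Linear fusion of three normalized edge maps into a joint probability
    edge response. wavelet_edge is at half resolution (one wavelet level)."""
    wB = np.kron(wavelet_edge, np.ones((2, 2)))          # project subband up to pixel scale
    wB = wB[:motion_map.shape[0], :motion_map.shape[1]]  # crop to frame size
    e = weights[0] * wB + weights[1] * motion_map + weights[2] * spatial_map
    return np.clip(e, 0.0, 1.0)                          # keep a probability-like range

e = fuse_edges(np.full((4, 4), 0.5), np.full((8, 8), 0.2), np.full((8, 8), 0.8))
```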
Step 5: over-segment the image sequence into the superpixel set G, and connect the spatial neighborhoods (a, b) ∈ ε_s and the temporal neighborhoods (a, b, t) ∈ ε_{t,t+1} to build the spatio-temporal Markov random field; here ε_s is the set of edges between all adjacent pixels in the spatial domain, and ε_{t,t+1} is the set of edges between all adjacent pixels of the two consecutive frames at instants t and t+1.
Step 6: determine the unary energy term U_t of the Markov random field from the joint-edge interior-mapping convex optimization result, judging through the appearance model over all image frames the likelihood that the class label l_i of a superpixel i ∈ G is foreground or background.
In Step 6, the unary energy term U_t is determined from the joint-edge interior-mapping convex optimization result as follows.
Step 6.1: from the projection M of the interior-mapping seed points of the joint probability edge at the fine scale, establish the weighted total-variation energy functional model that maps the foreground target in the image sequence. Its first term is the weighted total-variation term, with ∇u the gradient of the alternating variable u; the second term is a piecewise smoothing term, in which the seed-point mapping matrix M and the image gray values are subtracted and then pointwise multiplied with the alternating variable v; the relative importance of the two terms is balanced by the parameter ρ.
Step 6.2: convert the weighted total-variation segmentation problem at the fine pixel scale into the constrained minimization problem u* = argmin_{u,v} H(u, v), with constraints u ∈ [0, 1] and v ∈ [0, 1].
Step 6.3: impose a linear constraint between the two alternating variables u and v through the interior seed-point mapping matrix M, and obtain the minimizing variable u* by successive iterative updates of the alternating-direction method of multipliers.
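The exact functional H(u, v) is not reproduced in this text, so a faithful ADMM scheme cannot be written from it. As a hedged stand-in, the sketch below minimizes a single-variable weighted-TV objective with a seed-anchored data term by projected gradient descent on a smoothed TV term; the data term (mask M pulling u toward the seed labels in I) and the smoothing are assumptions, illustrating only the general shape of the Step 6.1-6.3 optimization.

```python
import numpy as np

def weighted_tv_segment(I, M, w, rho=1.8, iters=200, tau=0.1, eps=1e-6):
    """Sketch: minimize  sum w*|grad u| + (rho/2) * sum M*(u - I)^2
    over u in [0, 1] by projected gradient descent (smoothed TV).
    I: seed labels, M: seed mask, w: per-pixel TV weights."""
    u = I.astype(float).copy()
    for _ in range(iters):
        ux = np.diff(u, axis=1, append=u[:, -1:])        # forward differences
        uy = np.diff(u, axis=0, append=u[-1:, :])
        mag = np.sqrt(ux ** 2 + uy ** 2 + eps)
        px, py = w * ux / mag, w * uy / mag               # dual-like field
        div = (px - np.roll(px, 1, axis=1)) + (py - np.roll(py, 1, axis=0))
        grad = -div + rho * M * (u - I)                   # TV + data gradients
        u = np.clip(u - tau * grad, 0.0, 1.0)             # project onto [0, 1]
    return u

I0 = np.zeros((12, 12)); M0 = np.zeros((12, 12))
I0[4:8, 4:8] = 1.0; M0[4:8, 4:8] = 1.0                    # foreground seeds
M0[0, :] = M0[-1, :] = M0[:, 0] = M0[:, -1] = 1.0         # background seeds (I0 = 0)
u = weighted_tv_segment(I0, M0, np.ones((12, 12)))
```

In the patent, the TV weights w would come from the joint probability edge response (low diffusion across strong edges), and the two-variable (u, v) alternation with the linear constraint through M replaces this single-variable descent.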
Step 6.4: from the convex optimization result u*, train the Gaussian-mixture-model foreground parameters Θ_F and background parameters Θ_B, and obtain the unary energy term of the Markov random field model after convex-optimization smoothing, U_t = Π(l_i | u*), where Π is the superpixel appearance likelihood.
Step 6.5: compute the posterior probability of the RGB color vector of the current superpixel, defining the superpixel appearance likelihood Π as the negative log-likelihood of the color features under a five-component Gaussian mixture model, where Pr is the probability density function, c_i denotes the superpixel index, and the class label of pixel i is l_i = 0 or l_i = 1, its value determined from the convex optimization result u*; the negative log-likelihood Π_t(l_i | u*) is then computed from the color-cue Gaussian mixture model.
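The negative log-likelihood in Step 6.5 can be evaluated directly once mixture parameters are trained; this sketch uses diagonal covariances (a simplification the patent does not specify) and hand-set parameters in place of the trained Θ_F / Θ_B.

```python
import numpy as np

def gmm_neg_log_likelihood(x, weights, means, variances):
    """-log Pr(x) under a Gaussian mixture with diagonal covariances.
    x: feature vector (e.g. a superpixel's mean RGB color)."""
    x = np.asarray(x, dtype=float)
    d = x.shape[-1]
    dens = 0.0
    for w, mu, var in zip(weights, means, variances):
        diff = x - np.asarray(mu, dtype=float)
        var = np.asarray(var, dtype=float)
        quad = np.sum(diff ** 2 / var)                    # Mahalanobis term
        norm = np.sqrt((2 * np.pi) ** d * np.prod(var))   # Gaussian normalizer
        dens += w * np.exp(-0.5 * quad) / norm            # mixture density
    return -np.log(dens)

# One standard normal component in 1-D, evaluated at its mean:
val = gmm_neg_log_likelihood([0.0], [1.0], [[0.0]], [[1.0]])
```

In the patent's setting, one five-component mixture is trained per class (foreground and background), and the unary cost of labeling a superpixel l_i is this NLL under the corresponding mixture.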
Step 7: determine the binary spatial-domain energy function V and the binary temporal energy function W of the Markov random field, which measure respectively how likely the two superpixels at the ends of an edge in the spatial neighborhood (a, b) ∈ ε_s or the temporal neighborhood (a, b, t) ∈ ε_{t,t+1} are to take the class labels l_i ∈ L and l_j ∈ L, where L is the set of all class labels.
In Step 7, the binary spatial-domain energy function V and the binary temporal energy function W are determined as follows.
Step 7.1: the consistency encoded by the binary terms is embodied in the spatial and temporal cues; the binary spatial-domain energy function V is defined with a standard contrast-adjustment function:
where dist denotes the Euclidean distance between the geometric centers of the two superpixels c_i and c_j, Π_t(l_i) abbreviates the negative log-likelihood Π_t(l_i | u*), and σ denotes the spatial weight factor.
Step 7.2: the binary smoothing potential W keeps the target appearance consistent over time across the image sequence; the binary temporal energy function W is encoded with overlap and appearance information, where β denotes the temporal weight factor.
Step 7.3: the percentage φ of identical pixels shared by two superpixels linked by the optical flow is computed concretely, where # denotes the number of pixels.
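The φ formula itself is elided in this text; one plausible reading of "# of shared pixels / # of pixels" is the fraction of a superpixel's pixels that land inside the candidate superpixel in the next frame after warping by the flow, which is what this sketch computes.

```python
import numpy as np

def overlap_percentage(sp_t, sp_next, flow, label_i, label_j):
    """phi: fraction of superpixel label_i's pixels in frame t that fall
    inside superpixel label_j in frame t+1 after warping by the flow.
    sp_t, sp_next: integer superpixel label maps; flow: (H, W, 2) vectors."""
    h, w = sp_t.shape
    ys, xs = np.nonzero(sp_t == label_i)                  # pixels of superpixel i
    if len(xs) == 0:
        return 0.0
    xw = np.clip(np.round(xs + flow[ys, xs, 0]).astype(int), 0, w - 1)
    yw = np.clip(np.round(ys + flow[ys, xs, 1]).astype(int), 0, h - 1)
    shared = np.count_nonzero(sp_next[yw, xw] == label_j)  # '#' of shared pixels
    return shared / len(xs)

sp = np.zeros((4, 4), dtype=int); sp[:2, :] = 1
flow0 = np.zeros((4, 4, 2))
phi = overlap_percentage(sp, sp, flow0, 1, 1)   # identical frames, zero flow
```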
Step 8: approximate posterior-probability classification inference on the superpixel-scale spatio-temporal random field. Over-segment the image sequence into the superpixel set G and connect the spatial neighborhoods (a, b) ∈ ε_s and the temporal neighborhoods (a, b, t) ∈ ε_{t,t+1}. Define the superpixel Markov random field energy function E(L) and solve for the foreground-background class labeling result L.
Step 9: infer the posterior classification likelihood distribution of the energy function E(L) by a single iteration of graph cuts, outputting the label assignment L* that minimizes the energy:
L* = argmin_L E(L)
Output L* to obtain the final segmentation result of the dynamic-scene image sequence.
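A minimal illustration of minimizing E(L) = Σ U + Σ V over binary labels: the sketch below uses iterated conditional modes (ICM) purely as a runnable stand-in for the single graph-cut iteration the patent uses; a real implementation would call a max-flow/min-cut solver on the same unary and pairwise costs.

```python
import numpy as np

def minimize_energy(unary, edges, pairwise, iters=10):
    """Greedy minimization of E(L) = sum_i U(i, l_i) + sum_(i,j) V(i, j, l_i, l_j)
    over binary labels by iterated conditional modes.
    unary: (n, 2) costs; edges: list of (i, j); pairwise(i, j, li, lj) -> cost."""
    n = unary.shape[0]
    L = np.argmin(unary, axis=1)                  # unary-only initialization
    nbrs = [[] for _ in range(n)]
    for i, j in edges:
        nbrs[i].append(j); nbrs[j].append(i)
    for _ in range(iters):
        changed = False
        for i in range(n):                        # re-label each node greedily
            costs = [unary[i, l] + sum(pairwise(i, j, l, L[j]) for j in nbrs[i])
                     for l in (0, 1)]
            best = int(np.argmin(costs))
            if best != L[i]:
                L[i] = best; changed = True
        if not changed:                           # converged
            break
    return L

unary = np.array([[2.0, 0.0], [0.4, 0.6], [2.0, 0.0]])   # 3 superpixels on a chain
edges = [(0, 1), (1, 2)]
potts = lambda i, j, li, lj: 0.5 * (li != lj)             # Potts smoothness
labels = minimize_energy(unary, edges, potts)             # smoothness flips node 1
```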
The beneficial effects of the invention are:
1) By fusing multiscale transform-domain, spatial-domain and temporal cues, the invention obtains a novel joint probability edge extraction method for target contour features that can effectively extract, over multiple scales, the maximal edge response of the foreground target contour in dynamic-background image sequences.
2) The Markov random field energy function established through the joint-edge interior-mapping total-variation convex optimization proposed by the method of the invention can effectively segment, from dynamic scenes, complex targets whose appearance color is inconsistent and whose shape changes over time.
Description of the drawings
Fig. 1 is the flow diagram of the present invention;
Fig. 2 shows the 6-frame complex-target dynamic-scene image sequence data used by the present invention for testing;
Fig. 3 shows the maximal wavelet-coefficient edges detected by the present invention at scale 1;
Fig. 4 shows the motion edges detected by the present invention after normalization;
Fig. 5 shows the spatial-domain edges detected by the structured random forest after normalization;
Fig. 6 shows the multi-feature fusion result of the joint edges proposed by the present invention;
Fig. 7 shows the foreground interior mapping computed by the present invention from the proposed joint edge features;
Fig. 8 shows the segmentation results obtained by the present invention on the panda image sequence data.
Specific implementation modes
Embodiment 1: As shown in Figs. 1-8, a complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization first extracts coarse-scale target features in the wavelet domain and computes motion and spatial-domain edge responses; the wavelet-domain, spatial and temporal edge features are fused into a multiscale joint probability edge response. A joint-edge interior-mapping weighted total-variation energy functional model is then established and solved by alternating-direction convex optimization, and the unary energy term and the binary spatial-domain energy term of the superpixel-scale spatio-temporal Markov random field are defined accordingly; finally, a single iteration of graph-cut inference yields the posterior-probability segmentation of the image sequence. The specific steps are as described above in the invention content.
Embodiment 2: As shown in Figs. 1-8, a complex-target dynamic-scene unsupervised segmentation method based on multiscale joint-feature convex optimization takes as input the image sequence data containing the target information and the dynamic scene in which the target sits. It computes multiscale wavelet edge features, structured-random-forest spatial edge features and motion gradient features, and extracts the joint probability edge of the target contour features from the maximal edge responses of the foreground target contour in the dynamic image sequence. At the fine scale it builds the convex optimization model of foreground-position seed-point mapping with total variation, and by alternating variable updates smooths the joint probability edge interior-mapping seed points of the target contour features extracted at the coarse scale. From the smoothed target-position mapping it establishes the unary data energy function and the binary spatial and temporal energy terms, and completes superpixel-scale spatio-temporal Markov random field posterior-probability inference with a single graph-cut iteration to realize pixel-level segmentation of the dynamic-scene image sequence.
The present invention overcomes the deficiencies of the prior art and provides a complex-target dynamic-scene image sequence unsupervised segmentation method based on multiscale joint-feature convex optimization. The flow diagram of the invention is shown in Fig. 1. The specific implementation steps of the method are as follows.
First, the acquired dynamic image sequence data are input; no specific constraints or requirements are imposed on imaging conditions such as the illumination variation of the scene environment at capture time, the contrast between foreground and background, the motion of the background, or the motion of the camera itself. Fig. 2 shows the 6-frame complex-target dynamic-scene image sequence data used for testing, in which the appearance color of the foreground target is uneven, the foreground shape changes over time, there is obvious camera motion (the foreground and background being sometimes relatively static and sometimes in relative motion), and the background is cluttered.
(1) Coarse-scale target contour feature extraction is completed. In each frame I_t of the image sequence, the maximal edge response of the foreground coarse-scale target contour in the wavelet domain is computed according to the coarse-scale target edge definition proposed by the invention:
B_s^t(a, b) = max(H_s^t(a, b), V_s^t(a, b), D_s^t(a, b))
where H_s^t, V_s^t and D_s^t denote the wavelet coefficients of the image in the horizontal, vertical and diagonal directions at scale s and position (a, b) after the wavelet transform.
The maximal edge response of the foreground coarse-scale target contour in the wavelet domain is thereby extracted. The maximal wavelet-coefficient edges detected by the method at scale 1 are shown in Fig. 3.
(2) The variational optical flow vector field of the two images I_t and I_{t+1} at adjacent instants is computed; the gradient of the flow-field magnitude is computed by a gradient operator, and the flow gradient is linearly normalized to obtain the motion edge. The motion edges detected and normalized by the method are shown in Fig. 4.
(3) A structured random forest F composed of N trees is established, with the split function of each tree computed by standard information-gain maximization; N is the number of trees in the forest. The image I_t is fed into the structured random forest F for nonlinear class prediction F(I_t); according to the prediction p(I_t | Tree) of each tree, the nonlinear predictions of the forest are averaged and normalized to obtain the spatial-domain edge response F_t(a, b) at each pixel. The spatial-domain edges detected by the structured random forest and normalized are shown in Fig. 5.
(4) The coarse-scale maximal wavelet edge responses are projected down to pixel scale and linearly combined with the fine-scale motion and spatial-domain edge features; the fusion yields the multiscale joint probability edge response, where B_s^t(a, b) is the maximal edge response at each pixel of the wavelet-domain coarse-scale target contour. The multi-feature fusion result of the proposed joint edges is shown in Fig. 6.
(5) From the projection M of the interior-mapping seed points of the joint probability edge at the fine scale, the weighted total-variation energy functional model that maps the foreground target in the image sequence is established. The weight parameter ρ balancing the weighted total-variation term and the piecewise smoothing term is set to 1.8.
(6) The weighted total-variation segmentation problem at the fine pixel scale is converted to the constrained minimization problem u* = argmin_{u,v} H(u, v), with constraints u ∈ [0, 1] and v ∈ [0, 1].
(7) A linear constraint is imposed between the two alternating variables u and v through the interior seed-point mapping matrix M; the alternating-direction method of multipliers performs the proximal projections between the alternating variables in turn, and iterative updates yield the minimizing variable u*. The foreground interior mapping computed from the joint edge features is shown in Fig. 7.
(8) From the convex optimization result u*, the Gaussian-mixture-model foreground parameters Θ_F and background parameters Θ_B are trained, and the unary data energy term of the Markov random field model after convex-optimization smoothing, U_t = Π(l_i | u*), is obtained; Π is the superpixel appearance likelihood.
(9) The posterior probability of the RGB color vector of the current superpixel is calculated, and the superpixel shape likelihood is defined as the negative log-likelihood function of the color feature of a five-component Gaussian mixture model, wherein Pr is the probability density function, c_i denotes the superpixel index, and the class label of pixel i is l_i = 0 or l_i = 1, its value determined by the convex optimization result u*; the negative log-likelihood function Π_t(l_i|u*) is then computed from the Gaussian mixture model of the color cue.
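The likelihood formula of step (9) is reproduced only as an image; as an illustration, a sketch of evaluating the negative log-likelihood of a color vector under a Gaussian mixture with given parameters. Diagonal covariances and all numeric values below are assumptions for brevity, not the patent's trained five-component model.

```python
import math

# Hedged sketch of the unary color term of paragraph (9): the negative
# log-likelihood of an RGB vector under a K-component Gaussian mixture.
# Weights/means/variances are placeholders; the patent trains
# five-component foreground/background GMMs from u*.

def gmm_neg_log_likelihood(x, weights, means, variances):
    """-log sum_k w_k N(x; mu_k, diag(var_k)) for one color vector x."""
    total = 0.0
    for w, mu, var in zip(weights, means, variances):
        log_det = sum(math.log(v) for v in var)
        maha = sum((xi - mi) ** 2 / v for xi, mi, v in zip(x, mu, var))
        d = len(x)
        total += w * math.exp(-0.5 * (d * math.log(2 * math.pi) + log_det + maha))
    return -math.log(total)

# Toy two-component mixture: a pixel close to the first mean scores a
# lower negative log-likelihood, i.e. it is more likely under the model.
weights = [0.5, 0.5]
means = [(0.2, 0.2, 0.2), (0.8, 0.8, 0.8)]
variances = [(0.01, 0.01, 0.01)] * 2
near = gmm_neg_log_likelihood((0.21, 0.2, 0.19), weights, means, variances)
far = gmm_neg_log_likelihood((0.5, 0.5, 0.5), weights, means, variances)
```

In the patent, a low negative log-likelihood under Θ_F (and a high one under Θ_B) pushes the superpixel toward the foreground label in the unary term.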
(10) The consistency of the pairwise-term coding is embodied in the spatial and temporal cues; within the Bayesian segmentation framework, the spatial pairwise term V is defined with a standard contrast-sensitive adjustment term, wherein dist denotes the Euclidean distance between the geometric centers of the two superpixels c_i and c_j, and the parameter σ is set to 2.8.
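The exact expression for V appears only as an image. A common contrast-sensitive form consistent with the description (Euclidean distance between superpixel centers, σ = 2.8) would look like the following sketch; the exponential decay is an assumption, not a quote from the patent:

```python
import math

# Hypothetical sketch of the spatial pairwise term of paragraph (10):
# a Potts-style penalty that decays with the Euclidean distance between
# superpixel geometric centers.  The form exp(-dist/sigma) is assumed;
# the patent only states that dist and sigma = 2.8 are involved.

def spatial_pairwise(center_i, center_j, label_i, label_j, sigma=2.8):
    if label_i == label_j:
        return 0.0                      # no penalty for agreeing labels
    dist = math.dist(center_i, center_j)
    return math.exp(-dist / sigma)      # nearby superpixels penalized more

close = spatial_pairwise((0, 0), (1, 0), 0, 1)   # adjacent, labels differ
far = spatial_pairwise((0, 0), (10, 0), 0, 1)    # distant, labels differ
```

The effect is that cutting the label boundary between nearby superpixels is expensive while distant pairs barely interact, which encourages spatially coherent foreground regions.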
(11) The pairwise smoothing potential function W keeps the target motion temporally consistent over the image sequence, using overlap and appearance information to encode the temporal constraint, wherein the parameter β is set to 1.6.
(12) The percentage φ of identical pixels shared by two superpixels linked by optical flow is calculated, wherein # denotes the number of pixels.
(13) Approximate inference of the posterior-probability classification of the spatio-temporal superpixel-scale random field. The image sequence is over-segmented into a superpixel set G, connecting the spatial neighborhood (a, b) ∈ ε_s and the temporal neighborhood (a, b, t) ∈ ε_{t,t+1}. The superpixel Markov random field energy function E(L) is defined to solve for the foreground-background class-label optimization result L.
(14) The classification likelihood distribution of the posterior probability of the energy function E(L) is inferred by one-step iterative graph-cut reasoning, outputting a label L* that minimizes the energy function:
L* = argmin_L E(L)
Outputting L* yields the final segmentation result of the dynamic-scene image sequence. The segmentation results obtained on the panda image-sequence data by the complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features of the present invention are shown in Figure 8. As seen from the figure, the foreground panda semantic object region, whose appearance color is uneven and whose shape changes over time in the dynamic background caused by camera motion and environmental change, is effectively segmented at all six moments.
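The energy of steps (13)-(14) combines unary and pairwise terms and is minimized by graph cuts in the patent; on a toy graph the same argmin can be found by brute force, which this hedged sketch does. All numeric potentials below are made up for illustration.

```python
from itertools import product

# Toy illustration of paragraphs (13)-(14):
#   E(L) = sum_i U_i(l_i) + sum_(i,j) V_ij(l_i, l_j)
# minimized by brute force over all labelings.  The patent uses
# one-step graph-cut inference instead; the potentials are invented.

unary = {0: {0: 0.1, 1: 2.0},   # node 0 strongly prefers label 0
         1: {0: 1.0, 1: 1.1},   # node 1 is ambiguous on its own
         2: {0: 2.0, 1: 0.1}}   # node 2 strongly prefers label 1
edges = [(0, 1), (1, 2)]        # spatial/temporal neighborhood links

def energy(labels, smooth=0.5):
    e = sum(unary[i][labels[i]] for i in unary)
    e += sum(smooth for i, j in edges if labels[i] != labels[j])  # Potts term
    return e

best = min(product([0, 1], repeat=3), key=lambda labels: energy(labels))
```

Note how the ambiguous middle node is pulled to label 0 by its neighbor: paying one boundary penalty at the (1, 2) link is cheaper than paying it at both links, which is exactly the smoothing behavior the pairwise terms V and W contribute in the patent's MRF.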
The specific embodiments of the present invention have been explained in detail above in conjunction with the accompanying drawings, but the present invention is not limited to the above embodiments; various changes can also be made within the knowledge of a person skilled in the art without departing from the concept of the present invention.
Claims (4)
1. A complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features, characterized in that: high-scale target features in the wavelet domain are first extracted and the motion and spatial-domain edge responses are calculated; the wavelet-domain, spatial and temporal edge features are fused to obtain the multiscale combined probability edge response; a combined-edge internal-mapping weighted total-variation energy functional model is then established, the convex optimization of the weighted total-variation energy functional model is computed by the alternating-direction method, and accordingly the unary energy function term and the pairwise spatial-domain energy function of the spatio-temporal superpixel-scale Markov random field are defined; the posterior-probability segmentation result of the image sequence is finally obtained by one-step iterative graph-cut inference;
the specific steps of the complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features are as follows:
Step1, high-scale target-contour feature extraction in the wavelet domain; in the image I_t of the image sequence at each moment t, the maximal edge response of the foreground high-scale target contour in the wavelet domain at each pixel position (a, b) is first determined, T being the total number of video frames;
Step2, calculating the variational optical-flow vector field of the two images I_t and I_{t+1} at adjacent moments; the gradient of the magnitude of the variational optical-flow vector field is computed with a gradient operator, and linear normalization of the optical-flow gradient yields the motion edge;
Step3, establishing a structured random forest F composed of N trees, the split function on each tree being computed by maximizing the standard information gain, N being the number of trees in the random forest; the image I_t is input into the structured random forest F for nonlinear class prediction F(I_t); from the prediction p(I_t|Tree) of each tree, the nonlinear prediction results of the structured random forest are aggregated and normalized to obtain the spatial-domain edge response F_t(a, b) at each image pixel;
Step4, projecting the maximal edge response of the high-scale wavelet coefficients onto the pixel scale and linearly combining it with the fine-scale motion and spatial-domain edge features, their fusion yielding the multiscale combined probability edge response, wherein B_s^t(a, b) is the maximal edge response at each pixel of the high-scale target contour in the wavelet domain;
Step5, over-segmenting the image sequence into a superpixel set G and connecting the spatial neighborhood (a, b) ∈ ε_s and the temporal neighborhood (a, b, t) ∈ ε_{t,t+1} to build the spatio-temporal Markov random field, wherein ε_s is the set of links between all adjacent pixels in the spatial domain and ε_{t,t+1} is the set of links between all adjacent pixels of the two consecutive frames at moments t and t+1 in the time domain;
Step6, determining the unary energy function term U_t of the Markov random field according to the convex optimization result of the combined-edge internal mapping, the appearance model over all image frames being used to judge the likelihood that the class label l_i of a superpixel i ∈ G is foreground or background;
Step7, determining the pairwise spatial-domain energy function V and the pairwise time-domain energy function W of the Markov random field, which respectively measure the likelihood that the two superpixels at the ends of a link in the spatial neighborhood (a, b) ∈ ε_s and in the temporal neighborhood (a, b, t) ∈ ε_{t,t+1} take the class labels l_i ∈ L and l_j ∈ L, L being the set of all class labels;
Step8, defining the superpixel Markov random field energy function E(L) to solve for the foreground-background class-label optimization result L;
Step9, inferring the posterior-probability classification likelihood of the energy function E(L) by one-step iteration according to the formula L* = argmin_L E(L), and outputting L* to obtain the final dynamic-scene image-sequence segmentation result.
2. The complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features according to claim 1, characterized in that: in said step Step1, the maximal edge response at each pixel of the high-scale target contour in the wavelet domain is defined as:
B_s^t(a, b) = max(H_s^t(a, b), V_s^t(a, b), D_s^t(a, b))
wherein H_s^t, V_s^t and D_s^t respectively denote the wavelet coefficients of the image after the wavelet transform in the horizontal, vertical and diagonal directions at scale s and position (a, b).
3. The complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features according to claim 1, characterized in that: in said step Step6, the steps of determining the unary energy function term U_t of the Markov random field according to the convex optimization result of the combined-edge internal mapping are as follows:
Step6.1, according to the projection M of the internal-mapping seed points of the combined probability edge at the fine scale, establishing the weighted total-variation energy functional model of the foreground target mapping in the image sequence, wherein the first term is the weighted total-variation term, with ∇u the gradient of the alternating variable u; the second term is the piecewise-smoothing term, with Mv − I denoting the dot product of the seed-point mapping matrix M with the alternating variable v minus the image gray values; the relative importance of the two terms is balanced by the parameter ρ;
Step6.2, converting the weighted total-variation segmentation problem at the pixel-level fine scale into a constrained minimization problem: U* = argmin_{u,v} H(u, v), subject to the constraints u ∈ [0, 1] and v ∈ [0, 1];
Step6.3, imposing a linear constraint between the two alternating variables u and v through the internal seed-point mapping matrix M, and iteratively updating by the alternating-direction-of-multipliers convex optimization algorithm to obtain the minimizing variable u*;
Step6.4, according to the convex optimization result u*, training the Gaussian-mixture-model foreground parameters Θ_F and background parameters Θ_B to obtain the unary energy function term U_t = Π(l_i|u*) of the Markov random field model after convex-optimization smoothing, Π being the superpixel shape likelihood;
Step6.5, calculating the posterior probability of the RGB color vector of the current superpixel, the superpixel shape likelihood Π being defined as the negative log-likelihood function of the color feature of a five-component Gaussian mixture model, wherein Pr is the probability density function, c_i denotes the superpixel index, the class label of pixel i is l_i = 0 or l_i = 1 with its value determined by the convex optimization result u*, and the negative log-likelihood function Π_t(l_i|u*) is computed from the Gaussian mixture model of the color cue.
4. The complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features according to claim 3, characterized in that: in said step Step7, the steps of determining the pairwise spatial-domain energy function V and the pairwise time-domain energy function W of the Markov random field are as follows:
Step7.1, the consistency of the pairwise-term coding being embodied in the spatial and temporal cues, defining the pairwise spatial-domain energy function V with a standard contrast-sensitive regularization function, wherein dist denotes the Euclidean distance between the geometric centers of the two superpixels c_i and c_j, Π_t(l_i) is shorthand for the negative log-likelihood function Π_t(l_i|u*), and σ denotes the spatial weight factor;
Step7.2, the pairwise smoothing potential function W keeping the target motion temporally consistent over the image sequence, the pairwise time-domain energy function W being encoded with overlap and appearance information, wherein β denotes the temporal weight factor;
Step7.3, the percentage φ of identical pixels shared by two superpixels linked by optical flow being calculated, wherein # denotes the number of pixels.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610261547.6A CN105869178B (en) | 2016-04-26 | 2016-04-26 | A complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105869178A CN105869178A (en) | 2016-08-17 |
CN105869178B true CN105869178B (en) | 2018-10-23 |
Family
ID=56629177
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610261547.6A Active CN105869178B (en) | 2016-04-26 | 2016-04-26 | A complex-target dynamic-scene unsupervised segmentation method based on convex optimization of multiscale combined features |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105869178B (en) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106846362B (en) * | 2016-12-26 | 2020-07-24 | 歌尔科技有限公司 | Target detection tracking method and device |
JP7115846B2 (en) * | 2016-12-30 | 2022-08-09 | ダッソー システムズ | Generate Segmented Image Using Markov Random Field Optimization |
CN108875756B (en) * | 2017-05-08 | 2021-01-05 | 深圳荆虹科技有限公司 | Behavior category acquisition method and device for video |
CN107918759A (en) * | 2017-10-09 | 2018-04-17 | 大圣科技股份有限公司 | Automatic segmentation recognition method, electronic equipment and the storage medium of indoor object |
CN108647649B (en) * | 2018-05-14 | 2021-10-01 | 中国科学技术大学 | Method for detecting abnormal behaviors in video |
CN109063577B (en) * | 2018-07-05 | 2021-08-31 | 浙江大学 | Satellite image segmentation optimal segmentation scale determination method based on information gain rate |
CN109165565A (en) * | 2018-08-01 | 2019-01-08 | 西安交通大学 | A kind of video object discovery and dividing method based on Coupled Dynamic Markov Network |
CN109492530B (en) * | 2018-10-10 | 2022-03-04 | 重庆大学 | Robust visual object tracking method based on depth multi-scale space-time characteristics |
CN110264482B (en) * | 2019-05-10 | 2022-09-09 | 河南科技大学 | Active contour segmentation method based on transformation matrix factorization of noose set |
CN110232663B (en) * | 2019-05-16 | 2021-04-30 | 福建自贸试验区厦门片区Manteia数据科技有限公司 | Method and device for correcting automatic sketching model of organs at risk |
CN110827318A (en) * | 2019-10-18 | 2020-02-21 | 天津大学 | Target tracking method based on fusion of multilayer semantic features and multi-response graph |
CN111127479A (en) * | 2019-12-17 | 2020-05-08 | 昆明理工大学 | Level set image segmentation method based on curve area |
CN111340852B (en) * | 2020-03-10 | 2022-09-27 | 南昌航空大学 | Image sequence optical flow calculation method based on optimized semantic segmentation |
CN111950517A (en) * | 2020-08-26 | 2020-11-17 | 司马大大(北京)智能系统有限公司 | Target detection method, model training method, electronic device and storage medium |
CN112164092B (en) * | 2020-10-13 | 2022-09-27 | 南昌航空大学 | Generalized Markov dense optical flow determination method and system |
CN113420179B (en) * | 2021-06-24 | 2022-03-22 | 杭州电子科技大学 | Semantic reconstruction video description method based on time sequence Gaussian mixture hole convolution |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101540047A (en) * | 2009-04-30 | 2009-09-23 | 西安电子科技大学 | Texture image segmentation method based on independent Gaussian hybrid model |
Non-Patent Citations (3)
Title |
---|
Bag of Textons for Image Segmentation via Soft Clustering and Convex Shift; Zhiding Yu et al.; 2012 IEEE Conference on Computer Vision and Pattern Recognition; 2012-06-16; pp. 781-788 *
Multi-class unsupervised color-texture image segmentation method based on multiscale structure tensors; Yang Yong et al.; Journal of Computer-Aided Design & Computer Graphics; May 2014; Vol. 26, No. 5; pp. 812-825 *
Research on segmentation and classification methods for high-resolution SAR images; Feng Jilan; China Doctoral Dissertations Full-text Database (Information Science and Technology); 2016-03-15; No. 3; pp. 6-7, 11, 42, 51, 53, 57 *
Also Published As
Publication number | Publication date |
---|---|
CN105869178A (en) | 2016-08-17 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: He Zifen; Zhang Yinhui; Wu Xing; Zhang Yunsheng; Wang Sen
Inventor before: Zhang Yinhui; He Zifen; Wu Xing; Zhang Yunsheng; Wang Sen |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |