CN103914822A

CN103914822A - Interactive video foreground object extraction method based on super pixel segmentation

Info

Publication number: CN103914822A
Application number: CN201210592951.3A
Authority: CN
Inventors: 胡事民; 陈韬; 朱哲; 丁蒙; 熊君君
Original assignee: Tsinghua University; Beijing Samsung Telecommunications Technology Research Co Ltd
Current assignee: Tsinghua University; Beijing Samsung Telecom R&D Center; Beijing Samsung Telecommunications Technology Research Co Ltd
Priority date: 2012-12-31
Filing date: 2012-12-31
Publication date: 2014-07-09

Abstract

The invention discloses an interactive video foreground object extraction method based on super pixel segmentation. The method includes an S1 step of sampling each frame of a video sequence, and then pre-segmenting based on a super pixel; an S2 step of calculating based on video object interactive extraction of the super pixel; and an S3 step of sampling the original video resolution from the segmented results by interactive calculation, and refining the edge of the segmented results. The invention can help users to quickly and efficiently extract an object in a video interactively.

Description

Interactive video foreground object extracting method based on super pixel segmentation

Technical field

The present invention relates to video editing technical field, particularly a kind of interactive video foreground object extracting method based on super pixel segmentation.

Background technology

The efficient editor of image and video is research topic important in computer graphics.Compared with image, the data volume of video is larger and structure content is complicated, thereby video editing has more challenge than picture editting.Video editing is important technological means during postproduction of movies making, advertisement are synthesized.Existing famous video editing instrument has professional Adobe Premiere, Ulead MediaStudio and free VirtualDub at present.

Cutting apart of video foreground object is an important and basic problem in video editing, obtained in the last few years research widely.Because the complexity of background in video and the motion of prospect have arbitrariness, therefore do not have a kind of method can automatically be partitioned into the foreground object in all types of videos.Interactively video foreground Object Segmentation, i.e. the given a small amount of interactive information of user, algorithm is automatically partitioned into foreground object by these information, and feeds back timely to user, and user is optimized according to feedback information again.In interactively video foreground Object Segmentation method, complicacy, algorithm response speed and the segmentation effect of user interactions is the criterion of evaluating an interactive approach.

At present existing comparatively famous interactive video dividing method has " Discontinuity-Aware Video Object Cutout " that the people such as " Video Snapcut " and Zhong Fan that the people such as Bai Xue proposed in 2009 proposed in 2012 etc." Video Snapcut " method has adopted based on manifold local classifiers, and first user carries out Interactive Segmentation to key frame, and the method can automatically propagate into subsequent frame by segmentation result in the borderline feature of prospect according to key frame.But the method generally can only have reasonable effect within the shorter time interval, when the prospect of scene more similar with background color, while there is change in topology and foreground object generation rapid movement in the border of foreground object, method based on propagating is difficult to obtain good result, therefore needs more user interactions to revise the result of every frame.

The current domestic Patents in this field has: a kind of exchange method for object video rapid extraction (application number 201110219610.7).

Summary of the invention

(1) technical matters that will solve

The technical problem to be solved in the present invention is: how to extract exactly the foreground object in video sequence by user's interactive information rapidly.

(2) technical scheme

For solving the problems of the technologies described above, the invention provides a kind of interactive video foreground object extracting method based on super pixel segmentation, comprise the following steps:

S1: each frame to video sequence carries out down-sampling, then carries out the pre-segmentation based on super pixel;

S2: the result of pre-segmentation is carried out to the interactive calculating of extracting of object video based on super pixel;

S3: interactive mode is extracted to the segmentation result calculating and be upsampled to former video resolution, and the edge of segmentation result is carried out to refinement.

Further, the described pre-segmentation based on super pixel comprises the following steps: use mean shift algorithm to carry out pre-segmentation to each frame, the pixel that color is close is polymerized to a class, forms a super pixel.

Further, the interactive calculating of extracting of the described object video based on super pixel comprises the steps: to build a three-dimensional graph model according to the stroke of the sign foreground object of user's input, then uses figure segmentation method to calculate foreground object region.

Further, after the stroke length of the sign foreground object that described user inputs exceedes a setting threshold, just the graph model of described three-dimensional is upgraded, and use figure segmentation method to recalculate foreground object region.

Further, adopt stingy drawing method to carry out refinement to the edge of described segmentation result.

(3) beneficial effect

The present invention carries out pre-segmentation to each two field picture, make the pixel that color is close be polymerized to a class, form a super pixel, simultaneously in order to accelerate computing velocity, make user can see in real time that segmentation result, the present invention have adopted two strategies to accelerate: on the one hand by first video sequence being carried out to down-sampling, then computed segmentation result under low resolution, and then be upsampled under high resolving power segmentation result is carried out to edge thinning, thereby greatly reduce calculated amount.

Brief description of the drawings

Fig. 1 is a kind of interactive video foreground object extracting method process flow diagram based on super pixel segmentation of the embodiment of the present invention;

Fig. 2 carries out the result after pre-segmentation based on super pixel in step S1;

Fig. 3 is the prospect result calculating according to the interactive information of user's input in step S2, and wherein left figure is with representing foreground object in wire frame, and right figure represents foreground object with white portion;

Fig. 4 is that step S3 carries out the segmentation result after refinement to segmenting edge, and wherein left figure is with representing foreground object in wire frame, and right figure represents foreground object with white portion.

Embodiment

Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in further detail.Following examples are used for illustrating the present invention, but are not used for limiting the scope of the invention.

As shown in Figure 1, be the processing flow chart of an embodiment of the interactive video foreground object extracting method based on super pixel segmentation, comprising:

Step S1, carries out down-sampling to each frame of video sequence.Down-sampling carries out level and smooth rear interlacing every row ground capture element to each frame, image length and wide 1/2nd of the source images that is after down-sampling by gaussian kernel each time.Carry out (n=1 or 2, determines according to the resolution of video, and the higher down-sampling number of times that carries out of resolution is more) after n down-sampling, each frame is carried out to pre-segmentation with average drifting (MeanShift) algorithm, Fig. 2 is out four two field pictures of pre-segmentation.

Step S2, carries out the interactive calculating of extracting of object video based on super pixel to the image of pre-segmentation.User carries out prospect mark in foreground object region entering stroke, builds the three-dimensional plot model of a video data according to the stroke of the sign foreground object of user's input, and then use figure (graphcut) method of cutting is calculated foreground object region.Method of the present invention can by solve fructufy time offer user, if user is dissatisfied to segmentation result, can be by the correct and wrong foreground area of simple stroke mark.So that improve counting yield, after stroke length exceedes certain threshold value, three-dimensional plot model is upgraded, and use figure segmentation method to recalculate the foreground area of rebuild three-dimensional plot model, segmentation result is returned to user.Fig. 3 is the 1st frame of a video sequence and the interactive mode of the 20th frame is extracted the result of calculating.The figure on the left side uses wire frame representation foreground object, white region representation foreground object for the figure on the right.

Step S3, the segmentation result that interactive computing is gone out is upsampled to former video resolution, and the edge of segmentation result is carried out to refinement.Adopt the method for scratching more accurately figure (matting) to carry out refinement for the edge of segmentation result, make final segmentation result more accurate.Fig. 4 has chosen the segmentation result of two frames identical with Fig. 3.Can find out from the white portion on the right, the segmentation result carrying out after edge thinning is obviously better than the result in Fig. 3, and the details at the edges such as hair has all been cut apart out well.

Above embodiment is only for illustrating the present invention; and be not limitation of the present invention; the those of ordinary skill in relevant technologies field; without departing from the spirit and scope of the present invention; can also make a variety of changes and modification; therefore all technical schemes that are equal to also belong to category of the present invention, and scope of patent protection of the present invention should be defined by the claims.

Claims

1. the interactive video foreground object extracting method based on super pixel segmentation, is characterized in that, comprises the following steps:

2. the interactive video foreground object extracting method based on super pixel segmentation as claimed in claim 1, it is characterized in that, the described pre-segmentation based on super pixel comprises the following steps: use mean shift algorithm to carry out pre-segmentation to each frame, the pixel that color is close is polymerized to a class, forms a super pixel.

3. the interactive video foreground object extracting method based on super pixel segmentation as claimed in claim 1, it is characterized in that, the interactive calculating of extracting of the described object video based on super pixel comprises the steps: to build a three-dimensional graph model according to the stroke of the sign foreground object of user's input, then uses figure segmentation method to calculate foreground object region.

4. the interactive video foreground object extracting method based on super pixel segmentation as claimed in claim 3, it is characterized in that, after the stroke length of the sign foreground object that described user inputs exceedes a setting threshold, just the graph model of described three-dimensional is upgraded, and use figure segmentation method to recalculate foreground object region.

5. the interactive video foreground object extracting method based on super pixel segmentation as claimed in claim 1, is characterized in that, adopts stingy drawing method to carry out refinement to the edge of described segmentation result.