CN104318208A - Video scene detection method based on graph partitioning and instance learning - Google Patents

Video scene detection method based on graph partitioning and instance learning

Info

Publication number
CN104318208A
Authority
CN
China
Prior art keywords
subgraph
shot
video
key frame
sift
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410525867.9A
Other languages
Chinese (zh)
Inventor
檀结庆
白天
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei University of Technology
Original Assignee
Hefei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei University of Technology filed Critical Hefei University of Technology
Priority to CN201410525867.9A priority Critical patent/CN104318208A/en
Publication of CN104318208A publication Critical patent/CN104318208A/en
Pending legal-status Critical Current


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/49 Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/20 Scenes; Scene-specific elements in augmented reality scenes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a video scene detection method based on graph partitioning and instance learning. Compared with the prior art, it overcomes two defects: over-segmentation of scene boundaries and a time complexity too high for practical application. The method comprises the following steps: shot segmentation, in which a video sequence is input and a shot-segmentation method detects all shots in the given sequence; extraction of the shots' visual-similarity features, in which key frames are extracted from each shot and HSV and SIFT features are jointly used to construct the shot's visual-similarity feature; construction and partitioning of a directed temporal graph, in which a finite temporal graph describing the whole video is constructed and partitioned into several subgraphs; and instance-learning-based scene detection, in which some subgraphs are identified as training examples (TEs), the remaining unidentified subgraphs are assigned to the identified ones by instance learning, and all subgraphs finally output are the detected scenes. Over-segmentation is prevented and computational complexity is reduced.

Description

Video scene detection method based on graph partitioning and instance learning
Technical field
The present invention relates to the technical field of video processing, and in particular to a video scene detection method based on graph partitioning and instance learning.
Background art
Structured video-data analysis has in recent years been widely applied to digital-video analysis and processing. A film or television video is usually composed of thousands of shots, but a single shot carries little information, so similar shots need to be grouped into scenes. A video scene consists of multiple consecutive, semantically related shots, and the shots composing a scene express the same content.
Graph-based description methods are currently widely used in video scene detection and fall broadly into two classes: methods based on the scene transition graph (STG) and methods based on the shot similarity graph (SSG). STG methods first cluster the shots and then build a directed graph on top of the clusters; in an STG, each vertex represents a group of shots and each edge represents a transition between two groups. SSG methods obtain scenes by partitioning the graph with the normalized-cut technique; in an SSG, each vertex represents a shot, an edge exists between every pair of vertices, and each edge is weighted by the similarity of the two shots.
Existing graph-based description methods can segment video scenes to some extent, but they have two shortcomings. 1) Over-segmentation: a global threshold must be set, so the choice of threshold is critical; obtaining scene boundaries more accurately requires a larger threshold, which inevitably causes over-segmentation. 2) High computational complexity: both the shot clustering in STG and the normalized cut in SSG require a large amount of running time, and reducing the time complexity of scene detection to an engineering-applicable range remains a challenge.
Developing a video scene detection method that prevents over-segmentation and has low time complexity has therefore become an urgent technical problem.
Summary of the invention
The object of the invention is to overcome the prior art's defects of scene-boundary over-segmentation and impractical time complexity by providing a video scene detection method based on graph partitioning and instance learning.
To achieve this goal, the technical scheme of the present invention is as follows:
A video scene detection method based on graph partitioning and instance learning comprises the following steps:
Shot segmentation: a video sequence is input, and a shot-segmentation method detects all shots in the given sequence;
Extraction of the shots' visual-similarity features: key frames are extracted from each shot, and HSV and SIFT features are jointly used to construct the shot's visual-similarity feature;
Construction and partitioning of the directed temporal graph: a finite temporal graph describing the whole video is constructed and partitioned into several subgraphs;
Instance-learning-based scene detection: some subgraphs are identified as training examples (TEs), the remaining unidentified subgraphs are assigned to the identified ones by instance learning, and all subgraphs finally output are the detected scenes.
Extracting the shots' visual-similarity features comprises the following steps:
Extract the first frame, a middle frame and the last frame of each shot as key frames, obtaining a key-frame set that describes the whole shot;
Extract the SIFT features of the key frames and normalize them, the formulas being:
SimFF_sift(F_a^i, F_b^j) = M / Min(N_a^i, N_b^j);
SimSS_sift(S_i, S_j) = Max(SimFF_sift(F_h^i, F_l^j)), h ∈ KF_i, l ∈ KF_j;
where N_a^i is the number of SIFT features of key frame F_a^i and N_b^j the number of SIFT features of key frame F_b^j; key frame a belongs to shot i and key frame b to shot j; M is the number of features matched in the comparison; KF_i and KF_j are the key-frame sets of shots i and j; SimFF_sift(F_a^i, F_b^j) is the SIFT similarity of key frames a and b, and SimSS_sift(S_i, S_j) is the SIFT similarity of shots i and j; Th_sift is a threshold, and N_SimSS_sift(S_i, S_j) is the normalized SIFT similarity;
Extract the HSV features of the key frames and normalize them, the formulas being:
SimFF_color(F_i, F_j) = Σ_{h∈bins} Min(H_i(h), H_j(h)),
SimSS_color(S_i, S_j) = Max(SimFF_color(F_h, F_l)), h ∈ KF_i, l ∈ KF_j,
where H_i and H_j are the normalized HSV histograms of key frames F_i and F_j, KF_i and KF_j are the key-frame sets of shots i and j, SimFF_color(F_i, F_j) is the HSV similarity of key frames F_i and F_j, and SimSS_color(S_i, S_j) is the HSV similarity of shots i and j;
Jointly construct the shot visual-similarity feature from the HSV and SIFT features:
SimSS_visual(S_i, S_j) = α · N_SimSS_sift(S_i, S_j) + β · SimSS_color(S_i, S_j),
where α and β are non-negative weights and α + β = 1.
The construction and partitioning of the directed temporal graph comprise the following steps:
Generate the directed temporal graph of the video, the construction method being as follows:
311) Let G = (V, E) denote a directed graph, where V = {v_i | i = 1, 2, ..., N} is the vertex set and E = {e_i,j} the edge set; all shots are sorted in visual order ..., v_i, v_i+1, ..., vertex v_i representing the i-th shot and vertex v_j the j-th shot; if v_j − v_i = 1, add a directed edge from v_i to v_j;
312) Define variables i, j and L, with i = 1 and j = 2; L is the sliding-window length;
313) Judge whether the in-degree of vertex v_j is greater than 1; if so, go to step 315), otherwise go to step 314);
314) Judge whether SimSS_visual(S_i, S_j) is greater than the given threshold T; if so, generate a directed edge from vertex v_i to vertex v_j, otherwise go to step 315);
315) Add 1 to j; if j − i > L or j > N, go to step 316), otherwise return to step 313);
316) Add 1 to i and add 1 to j; if i < N, return to step 313), otherwise the construction of the directed temporal graph is complete.
Partition the directed temporal graph into subgraphs, each subgraph representing a video segment.
The instance-learning-based scene detection comprises the following steps:
Examine each subgraph obtained from the partitioning; a subgraph whose density is greater than 0.33 is taken as a training example (TE), the subgraph density being computed from Ne, the number of edges the subgraph contains, and Nv, the number of vertices it contains;
All subgraphs are divided into TE and non-TE parts;
Arrange the subgraphs in temporal order; the non-TE subgraphs, in temporal order, generate a label sequence; then detect the scene boundaries with the instance-learning method and finally output the obtained scenes.
The instance-learning method comprises the following steps:
Each subgraph between two TEs is assigned a label with value 0, 1 or −1, the assignment conditions being as follows:
Compute the similarity between the subgraph and the preceding TE and store the result in SL;
Compute the similarity between the subgraph and the following TE and store the result in SR;
If SL > SR, compute SR/SL, otherwise compute SL/SR, and store the result in S;
If S > 0.85, mark the subgraph's label as 0; otherwise, if SL > SR, mark it as −1, and if SL ≤ SR, mark it as 1; this yields a label sequence composed of 0, 1 and −1;
Compute the fuzzy value Fuz at each split position:
Fuz = Max((N_R − N_L) / N, N_zero / N)
where N is the number of labels between the two TEs, N_L is the sum of all labels in the left half after the split, N_R the sum of all labels in the right half, and N_zero the count of labels whose value is 0;
If more than 2N/3 of the labels in the sequence are 0, there is no scene boundary in the sequence;
If the fuzzy value Fuz computed at a split position is the maximum, that position is a suitable scene boundary; if several split positions have equal fuzzy values, choose the middle split position as the scene boundary.
Beneficial effect
Compared with the prior art, the video scene detection method based on graph partitioning and instance learning of the present invention prevents over-segmentation and reduces computational complexity. It improves the precision and recall of scene detection over the whole video and maintains good detection performance on scenes with drastic illumination changes and on high-motion scenes. SIFT features and HSV histograms are extracted as the visual features for scene detection, a construction and partitioning method for a shot-based directed temporal graph is proposed, and an instance-learning-based scene segmentation yields the final video scenes.
Description of the drawings
Fig. 1 is the flow chart of the method of the present invention;
Fig. 2 shows the directed graph after initialization in the construction-and-partitioning step;
Fig. 3 is a schematic diagram of the construction of the directed temporal graph;
Fig. 4 is a schematic diagram of the partitioning of the directed temporal graph;
Fig. 5 is a schematic diagram of assigning labels to subgraphs in the instance-learning-based scene detection step;
Fig. 6 is a schematic diagram of splitting the label sequence in the instance-learning-based scene detection step.
Detailed description of the embodiments
To give a better understanding of the structural features and effects of the present invention, a preferred embodiment is described in detail below with reference to the accompanying drawings:
As shown in Fig. 1, the video scene detection method based on graph partitioning and instance learning of the present invention comprises the following steps:
First step, shot segmentation: a video sequence is input, and a shot-segmentation method detects all shots in the given sequence. Any prior-art shot-segmentation method may be used, such as the one introduced in http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/ramonlull.paper.pdf.
Second step, extract the shots' visual-similarity features: key frames are extracted from each shot, and HSV and SIFT features are jointly used to construct the shot's visual-similarity feature. The concrete steps are as follows:
(1) Extract the first frame, a middle frame and the last frame of each shot as key frames, obtaining a key-frame set KF that describes the whole shot.
(2) Extract the SIFT features of the key frames, normalize them, count the number of matched feature points, and judge the similarity of the two shots; the formulas are:
SimFF_sift(F_a^i, F_b^j) = M / Min(N_a^i, N_b^j);
SimSS_sift(S_i, S_j) = Max(SimFF_sift(F_h^i, F_l^j)), h ∈ KF_i, l ∈ KF_j;
where N_a^i is the number of SIFT features of key frame F_a^i and N_b^j the number of SIFT features of key frame F_b^j; key frame a belongs to shot i and key frame b to shot j; M is the number of features matched in the comparison; KF_i and KF_j are the key-frame sets of shots i and j; SimFF_sift(F_a^i, F_b^j) is the SIFT similarity of key frames a and b, and SimSS_sift(S_i, S_j) is the SIFT similarity of shots i and j. Th_sift is a threshold, experimentally verified to be usually 0.12, and N_SimSS_sift(S_i, S_j) is the normalized SIFT similarity.
(3) Extract the HSV features of the key frames and normalize them. To compare HSV histogram features, first compute the normalized HSV histograms of the two frames and then judge the similarity of the two shots; the formulas are:
SimFF_color(F_i, F_j) = Σ_{h∈bins} Min(H_i(h), H_j(h)),
SimSS_color(S_i, S_j) = Max(SimFF_color(F_h, F_l)), h ∈ KF_i, l ∈ KF_j,
where H_i and H_j are the normalized HSV histograms of key frames F_i and F_j, KF_i and KF_j are the key-frame sets of shots i and j, SimFF_color(F_i, F_j) is the HSV similarity of key frames F_i and F_j, and SimSS_color(S_i, S_j) is the HSV similarity of shots i and j.
(4) Jointly construct the shot visual-similarity feature from the HSV and SIFT features:
SimSS_visual(S_i, S_j) = α · N_SimSS_sift(S_i, S_j) + β · SimSS_color(S_i, S_j),
where α and β are non-negative weights with α + β = 1.
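The following is a minimal sketch of this joint similarity, assuming OpenCV and key frames given as BGR images. The exact normalization N_SimSS_sift is not spelled out above, so clipping by Th_sift is an assumption, as are the Lowe ratio test used to count the matches M and the default weights α = β = 0.5.

import cv2
import numpy as np

sift = cv2.SIFT_create()
matcher = cv2.BFMatcher(cv2.NORM_L2)

def sim_ff_sift(frame_a, frame_b):
    # SimFF_sift: matched SIFT features M over the smaller feature count.
    _, des_a = sift.detectAndCompute(cv2.cvtColor(frame_a, cv2.COLOR_BGR2GRAY), None)
    _, des_b = sift.detectAndCompute(cv2.cvtColor(frame_b, cv2.COLOR_BGR2GRAY), None)
    if des_a is None or des_b is None:
        return 0.0
    # Count reliable matches with Lowe's ratio test (assumed matching rule).
    knn = matcher.knnMatch(des_a, des_b, k=2)
    m = sum(1 for p in knn if len(p) == 2 and p[0].distance < 0.7 * p[1].distance)
    return m / min(len(des_a), len(des_b))

def sim_ff_color(frame_a, frame_b, bins=(8, 8, 8)):
    # SimFF_color: histogram intersection of normalized HSV histograms.
    hists = []
    for f in (frame_a, frame_b):
        hsv = cv2.cvtColor(f, cv2.COLOR_BGR2HSV)
        h = cv2.calcHist([hsv], [0, 1, 2], None, list(bins), [0, 180, 0, 256, 0, 256])
        hists.append(h.ravel() / h.sum())
    return float(np.minimum(hists[0], hists[1]).sum())

def sim_ss_visual(kf_i, kf_j, alpha=0.5, beta=0.5, th_sift=0.12):
    # SimSS_visual: max over key-frame pairs, then weighted combination.
    s_sift = max(sim_ff_sift(a, b) for a in kf_i for b in kf_j)
    s_color = max(sim_ff_color(a, b) for a in kf_i for b in kf_j)
    n_sift = min(s_sift / th_sift, 1.0)  # assumed normalization by Th_sift
    return alpha * n_sift + beta * s_color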
Third step, construction and partitioning of the directed temporal graph: a finite temporal graph describing the whole video is constructed and then partitioned into several subgraphs. The temporal graph of the video is generated by the construction method below; the generated graph is then partitioned into subgraphs with Dijkstra's algorithm, each subgraph representing a video segment. The concrete steps are as follows:
(1) Generate the directed temporal graph of the video. Let G = (V, E) denote a directed graph, where V = {v_i | i = 1, 2, ..., N} is the vertex set and E = {e_i,j} the edge set; all shots are sorted in visual order ..., v_i, v_i+1, ..., vertex v_i representing the i-th shot and vertex v_j the j-th shot. If v_j − v_i = 1, add a directed edge from v_i to v_j; the initialized directed graph is shown in Fig. 2.
(2) Define variables i, j and L, with i = 1 and j = 2; L is the sliding-window length.
(3) The calculation procedure is shown in Fig. 3: first judge whether the in-degree of vertex v_j is greater than 1; if so, go to step (5), otherwise continue with the next step.
(4) Judge whether SimSS_visual(S_i, S_j) is greater than the given threshold T; experimental analysis shows T can be set to 0.6. If so, generate a directed edge from vertex v_i to vertex v_j; if not, continue with the next step.
(5) Add 1 to j; if j − i > L or j > N, continue with the next step, otherwise return to step (3).
(6) Add 1 to i and add 1 to j; if i < N, return to step (3), otherwise the construction of the directed temporal graph is complete.
(7) As shown in Fig. 4, partition the graph into several subgraphs with Dijkstra's algorithm, each subgraph representing a video segment. Dijkstra's algorithm searches for the shortest path through the temporal digraph; all edges on the shortest path are removed, yielding the subgraphs.
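Below is a sketch of this construction and the Dijkstra-based split, assuming networkx and the sim_ss_visual helper above. The text fixes T = 0.6 but not the edge weights for the shortest-path search, the window length, or the exact loop bounds, so unit weights, L = 8 and the bounds used here are assumptions.

import networkx as nx

def build_temporal_graph(shot_keyframes, L=8, T=0.6):
    # Vertices are shots in visual order; consecutive shots are chained.
    n = len(shot_keyframes)
    g = nx.DiGraph()
    g.add_nodes_from(range(n))
    for i in range(n - 1):
        g.add_edge(i, i + 1)
    # Sliding window: link v_i to v_j when the two shots are similar enough.
    for i in range(n):
        for j in range(i + 2, min(i + L + 1, n)):
            if g.in_degree(j) > 1:
                continue  # v_j already carries a similarity edge
            if sim_ss_visual(shot_keyframes[i], shot_keyframes[j]) > T:
                g.add_edge(i, j)
    return g

def split_into_subgraphs(g):
    # Remove the edges on the shortest first-to-last path (unit weights
    # assumed); the weakly connected components left over are the subgraphs.
    path = nx.dijkstra_path(g, 0, g.number_of_nodes() - 1)
    g2 = g.copy()
    g2.remove_edges_from(zip(path, path[1:]))
    return [g2.subgraph(c).copy() for c in nx.weakly_connected_components(g2)]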
Fourth step, instance-learning-based scene detection: some subgraphs are identified as training examples (TEs), the remaining unidentified subgraphs are assigned to the identified ones by instance learning, and all subgraphs finally output are the detected scenes. The concrete steps are as follows:
(1) Examine each subgraph obtained from the partitioning; a subgraph whose density is greater than 0.33 is taken as a training example (TE). The subgraph density is computed from Ne, the number of edges the subgraph contains, and Nv, the number of vertices it contains.
(2) Experiments demonstrate that a subgraph with density greater than 0.33 can serve as a training example TE. All subgraphs are thus divided into TE and non-TE parts; a non-TE subgraph may be the transition between two scenes, i.e. a scene boundary.
(3) Arrange the subgraphs in temporal order; the non-TE subgraphs, in temporal order, generate a label sequence; the instance-learning method generates the label sequence, detects the scene boundaries, and the resulting scenes are finally output.
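The density formula itself is not reproduced in this text version; the sketch below assumes the usual directed-graph density Ne/(Nv·(Nv − 1)), which is consistent with the Ne and Nv definitions above but is not confirmed by the source.

def subgraph_density(sg):
    # Assumed density: fraction of possible directed edges actually present.
    ne, nv = sg.number_of_edges(), sg.number_of_nodes()
    return 0.0 if nv < 2 else ne / (nv * (nv - 1))

def split_te(subgraphs, thresh=0.33):
    # Subgraphs denser than the 0.33 threshold become training examples.
    te = [sg for sg in subgraphs if subgraph_density(sg) > thresh]
    non_te = [sg for sg in subgraphs if subgraph_density(sg) <= thresh]
    return te, non_te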
The instance-learning method specifically comprises the following steps:
A. As shown in Fig. 5, each subgraph between two TEs is assigned a label with value 0, 1 or −1: 1 means the subgraph is visually similar to the next TE, −1 means it is visually similar to the previous TE, and 0 means no judgment can be made. The assignment conditions are as follows:
a. Compute the similarity between the subgraph and the preceding TE and store the result in SL;
b. Compute the similarity between the subgraph and the following TE and store the result in SR;
c. If SL > SR, compute SR/SL, otherwise compute SL/SR, and store the result in S;
d. If S > 0.85, mark the subgraph's label as 0; otherwise, if SL > SR, mark it as −1, and if SL ≤ SR, mark it as 1. This yields a label sequence composed of 0, 1 and −1.
B. Compute the fuzzy value Fuz at each split position:
Fuz = Max((N_R − N_L) / N, N_zero / N)
where N is the number of labels between the two TEs, as shown in Fig. 6; N_L is the sum of all labels in the left half after the split, N_R the sum of all labels in the right half, and N_zero the count of labels whose value is 0.
C. If more than 2N/3 of the labels in the sequence are 0, there is no scene boundary in the sequence.
D. As shown in Fig. 6, split the label sequence and obtain the scenes from the split result. The split position whose fuzzy value Fuz is the maximum is a suitable scene boundary; if several split positions have equal fuzzy values, choose the middle split position as the scene boundary.
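A sketch of steps A through D follows, assuming a similarity(sg, te) function between a subgraph and a TE (for instance the maximum SimSS_visual over their shots), which the text leaves unspecified.

def assign_label(sg, prev_te, next_te, similarity, ambiguous=0.85):
    # Step A: -1 = like the previous TE, 1 = like the next TE, 0 = unclear.
    sl = similarity(sg, prev_te)
    sr = similarity(sg, next_te)
    if max(sl, sr) == 0:
        return 0
    s = sr / sl if sl > sr else sl / sr  # ratio of the smaller to the larger
    if s > ambiguous:
        return 0
    return -1 if sl > sr else 1

def best_split(labels):
    # Steps B-D: pick the split maximizing Fuz = Max((N_R - N_L)/N, N_zero/N).
    n = len(labels)
    if n < 2:
        return None
    n_zero = labels.count(0)
    if n_zero > 2 * n / 3:
        return None  # step C: no scene boundary in this sequence
    fuz = [max((sum(labels[k:]) - sum(labels[:k])) / n, n_zero / n)
           for k in range(1, n)]
    best = max(fuz)
    ties = [k + 1 for k, v in enumerate(fuz) if v == best]
    return ties[len(ties) // 2]  # step D: middle split position on ties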
This method improves the precision and recall of the whole scene detection and maintains good detection performance on scenes with drastic illumination changes and on high-motion scenes. The present invention comprises the following components: scale-invariant features (SIFT) and HSV color histograms are jointly used to construct the shot similarity feature; key frames are extracted from each shot, and the joint SIFT-and-HSV visual feature extracted from the key frames represents the shot, each shot serving as a node of the directed temporal graph; a sliding-window comparison method constructs a directed temporal graph describing the video, the whole graph is then partitioned, and several subgraphs are obtained, each subgraph being a video segment; finally, the instance-learning method applied to the resulting video segments yields the determined video scenes. The present invention proposes the scene detection method based on instance learning and graph partitioning from the angle of engineering application, and its shot-similarity feature is highly robust to scale changes and lighting changes. The method improves the accuracy of scene detection for different kinds of film and television works and raises the application level of video scene detection technology in the post-production of all types of programmes.
The scene detection method described by the present invention maintains good detection performance on scenes with drastic illumination changes and on high-motion scenes. Because a joint feature based on SIFT features and HSV histogram features is adopted as the visual feature for shot detection, false detections and misses are reduced. Meanwhile, owing to the sliding-window directed temporal-graph technique and the application of the instance-learning method, the running time of the detection algorithm is effectively reduced. To verify the validity of the detection method, we ran extensive experiments to evaluate the test videos quantitatively and qualitatively; Table 1 gives the details of the test videos.
Table 1 Details of the test videos
The quantitative evaluation adopts the internationally common recall, precision and F-measure. The same video sequences are processed by the different detection methods, and a qualitative analysis judges the relative merits of the methods. As shown in Table 2, the detection method of the present invention is compared with the STG method.
Table 2 F-measure comparison results
The results show that the method of the present invention performs well in both recall and precision, and that the video scene detection method based on graph partitioning and instance learning is effective.
The basic principles, principal features and advantages of the present invention have been shown and described above. Those skilled in the art should understand that the present invention is not restricted to the above embodiments; the above embodiments and description merely illustrate the principle of the present invention. Various changes and improvements may be made without departing from the spirit and scope of the present invention, and all of them fall within the claimed scope, which is defined by the appended claims and their equivalents.

Claims (5)

1. A video scene detection method based on graph partitioning and instance learning, characterized by comprising the following steps:
11) shot segmentation: a video sequence is input, and a shot-segmentation method detects all shots in the given video sequence;
12) extraction of the shots' visual-similarity features: key frames are extracted from each shot, and HSV and SIFT features are jointly used to construct the shot's visual-similarity feature;
13) construction and partitioning of the directed temporal graph: a finite temporal graph describing the whole video is constructed and partitioned into several subgraphs;
14) instance-learning-based scene detection: some subgraphs are identified as training examples (TEs), the remaining unidentified subgraphs are assigned to the identified ones by instance learning, and all subgraphs finally output are the detected scenes.
2. The video scene detection method based on graph partitioning and instance learning according to claim 1, characterized in that the extraction of the shots' visual-similarity features comprises the following steps:
21) extract the first frame, a middle frame and the last frame of each shot as key frames, obtaining a key-frame set that describes the whole shot;
22) extract the SIFT features of the key frames and normalize them, the formulas being:
SimFF_sift(F_a^i, F_b^j) = M / Min(N_a^i, N_b^j);
SimSS_sift(S_i, S_j) = Max(SimFF_sift(F_h^i, F_l^j)), h ∈ KF_i, l ∈ KF_j;
where N_a^i is the number of SIFT features of key frame F_a^i and N_b^j the number of SIFT features of key frame F_b^j; key frame a belongs to shot i and key frame b to shot j; M is the number of features matched in the comparison; KF_i and KF_j are the key-frame sets of shots i and j; SimFF_sift(F_a^i, F_b^j) is the SIFT similarity of key frames a and b, and SimSS_sift(S_i, S_j) is the SIFT similarity of shots i and j; Th_sift is a threshold, and N_SimSS_sift(S_i, S_j) is the normalized SIFT similarity;
23) extract the HSV features of the key frames and normalize them, the formulas being:
SimFF_color(F_i, F_j) = Σ_{h∈bins} Min(H_i(h), H_j(h)),
SimSS_color(S_i, S_j) = Max(SimFF_color(F_h, F_l)), h ∈ KF_i, l ∈ KF_j,
where H_i and H_j are the normalized HSV histograms of key frames F_i and F_j, KF_i and KF_j are the key-frame sets of shots i and j, SimFF_color(F_i, F_j) is the HSV similarity of key frames F_i and F_j, and SimSS_color(S_i, S_j) is the HSV similarity of shots i and j;
24) jointly construct the shot visual-similarity feature from the HSV and SIFT features:
SimSS_visual(S_i, S_j) = α · N_SimSS_sift(S_i, S_j) + β · SimSS_color(S_i, S_j),
wherein α and β are non-negative weights and α + β = 1.
3. The video scene detection method based on graph partitioning and instance learning according to claim 1, characterized in that the construction and partitioning of the directed temporal graph comprise the following steps:
31) generate the directed temporal graph of the video, the construction method being as follows:
311) let G = (V, E) denote a directed graph, where V = {v_i | i = 1, 2, ..., N} is the vertex set and E = {e_i,j} the edge set; all shots are sorted in visual order ..., v_i, v_i+1, ..., vertex v_i representing the i-th shot and vertex v_j the j-th shot; if v_j − v_i = 1, add a directed edge from v_i to v_j;
312) define variables i, j and L, with i = 1 and j = 2, L being the sliding-window length;
313) judge whether the in-degree of vertex v_j is greater than 1; if so, go to step 315), otherwise go to step 314);
314) judge whether SimSS_visual(S_i, S_j) is greater than the given threshold T; if so, generate a directed edge from vertex v_i to vertex v_j, otherwise go to step 315);
315) add 1 to j; if j − i > L or j > N, go to step 316), otherwise return to step 313);
316) add 1 to i and add 1 to j; if i < N, return to step 313), otherwise the construction of the directed temporal graph is complete;
32) partition the directed temporal graph into subgraphs, each subgraph representing a video segment.
4. The video scene detection method based on graph partitioning and instance learning according to claim 1, characterized in that the instance-learning-based scene detection comprises the following steps:
41) examine each subgraph obtained from the partitioning; a subgraph whose density is greater than 0.33 is taken as a training example (TE), the subgraph density being computed from Ne, the number of edges the subgraph contains, and Nv, the number of vertices it contains;
42) divide all subgraphs into TE and non-TE parts;
43) arrange the subgraphs in temporal order; the non-TE subgraphs, in temporal order, generate a label sequence; then detect the scene boundaries with the instance-learning method and finally output the obtained scenes.
5. The video scene detection method based on graph partitioning and instance learning according to claim 4, characterized in that the instance-learning method comprises the following steps:
51) each subgraph between two TEs is assigned a label with value 0, 1 or −1, the assignment conditions being as follows:
511) compute the similarity between the subgraph and the preceding TE and store the result in SL;
512) compute the similarity between the subgraph and the following TE and store the result in SR;
513) if SL > SR, compute SR/SL, otherwise compute SL/SR, and store the result in S;
514) if S > 0.85, mark the subgraph's label as 0; otherwise, if SL > SR, mark it as −1, and if SL ≤ SR, mark it as 1; this yields a label sequence composed of 0, 1 and −1;
52) compute the fuzzy value Fuz at each split position:
Fuz = Max((N_R − N_L) / N, N_zero / N)
wherein N is the number of labels between the two TEs, N_L is the sum of all labels in the left half after the split, N_R the sum of all labels in the right half, and N_zero the count of labels whose value is 0;
53) if more than 2N/3 of the labels in the sequence are 0, there is no scene boundary in the sequence;
54) if the fuzzy value Fuz computed at a split position is the maximum, that position is a suitable scene boundary; if several split positions have equal fuzzy values, choose the middle split position as the scene boundary.
CN201410525867.9A 2014-10-08 2014-10-08 Video scene detection method based on graph partitioning and instance learning Pending CN104318208A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410525867.9A CN104318208A (en) 2014-10-08 2014-10-08 Video scene detection method based on graph partitioning and instance learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410525867.9A CN104318208A (en) 2014-10-08 2014-10-08 Video scene detection method based on graph partitioning and instance learning

Publications (1)

Publication Number Publication Date
CN104318208A true CN104318208A (en) 2015-01-28

Family

ID=52373438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410525867.9A Pending CN104318208A (en) 2014-10-08 2014-10-08 Video scene detection method based on graph partitioning and instance learning

Country Status (1)

Country Link
CN (1) CN104318208A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104834919A (en) * 2015-05-20 2015-08-12 东南大学 Contour line based three-dimensional human face iteration preprocessing and feature point extracting method
CN105988369A (en) * 2015-02-13 2016-10-05 上海交通大学 Content-driving-based intelligent household control method
CN107274415A (en) * 2017-06-06 2017-10-20 东北大学 A kind of image partition method connected based on Tarjan algorithms and region
CN108388886A (en) * 2018-03-16 2018-08-10 广东欧珀移动通信有限公司 Method, apparatus, terminal and the computer readable storage medium of image scene identification
CN109214239A (en) * 2017-06-30 2019-01-15 创意引晴(开曼)控股有限公司 The discrimination method for extending information in video, identification system and storage media can be recognized
CN110879952A (en) * 2018-09-06 2020-03-13 阿里巴巴集团控股有限公司 Method and device for processing video frame sequence
CN110913243A (en) * 2018-09-14 2020-03-24 华为技术有限公司 Video auditing method, device and equipment
CN113225461A (en) * 2021-02-04 2021-08-06 江西方兴科技有限公司 System and method for detecting video monitoring scene switching

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1761331A (en) * 2005-09-29 2006-04-19 深圳清华大学研究院 Testing method of switching video scenes
WO2008127319A2 (en) * 2007-01-31 2008-10-23 Thomson Licensing Method and apparatus for automatically categorizing potential shot and scene detection information
CN103679189A (en) * 2012-09-14 2014-03-26 华为技术有限公司 Method and device for recognizing scene

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1761331A (en) * 2005-09-29 2006-04-19 深圳清华大学研究院 Testing method of switching video scenes
WO2008127319A2 (en) * 2007-01-31 2008-10-23 Thomson Licensing Method and apparatus for automatically categorizing potential shot and scene detection information
CN103679189A (en) * 2012-09-14 2014-03-26 华为技术有限公司 Method and device for recognizing scene

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TIAN BAI ET AL: "Indistinct segmentation of scene in video using instance leaning", 《FOURTH INTERNATIONAL CONFERENCE ON DIGITAL HOME》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105988369A (en) * 2015-02-13 2016-10-05 上海交通大学 Content-driving-based intelligent household control method
CN104834919A (en) * 2015-05-20 2015-08-12 东南大学 Contour line based three-dimensional human face iteration preprocessing and feature point extracting method
CN104834919B (en) * 2015-05-20 2018-05-15 东南大学 A kind of pretreatment of three-dimensional face iteration and Feature Points Extraction based on contour line
CN107274415A (en) * 2017-06-06 2017-10-20 东北大学 A kind of image partition method connected based on Tarjan algorithms and region
CN107274415B (en) * 2017-06-06 2019-08-09 东北大学 A kind of image partition method connected based on Tarjan algorithm with region
CN109214239A (en) * 2017-06-30 2019-01-15 创意引晴(开曼)控股有限公司 The discrimination method for extending information in video, identification system and storage media can be recognized
CN108388886A (en) * 2018-03-16 2018-08-10 广东欧珀移动通信有限公司 Method, apparatus, terminal and the computer readable storage medium of image scene identification
CN110879952A (en) * 2018-09-06 2020-03-13 阿里巴巴集团控股有限公司 Method and device for processing video frame sequence
CN110879952B (en) * 2018-09-06 2023-06-16 阿里巴巴集团控股有限公司 Video frame sequence processing method and device
CN110913243A (en) * 2018-09-14 2020-03-24 华为技术有限公司 Video auditing method, device and equipment
CN113225461A (en) * 2021-02-04 2021-08-06 江西方兴科技有限公司 System and method for detecting video monitoring scene switching

Similar Documents

Publication Publication Date Title
CN104318208A (en) Video scene detection method based on graph partitioning and instance learning
Peng et al. TPM: Multiple object tracking with tracklet-plane matching
Adhikari et al. Faster bounding box annotation for object detection in indoor scenes
CN106294344B (en) Video retrieval method and device
Tang et al. Facial landmark detection by semi-supervised deep learning
CN104463250A (en) Sign language recognition translation method based on Davinci technology
CN103984943A (en) Scene text identification method based on Bayesian probability frame
Xia et al. Learning to refactor action and co-occurrence features for temporal action localization
CN109919060A (en) A kind of identity card content identifying system and method based on characteristic matching
CN105389558A (en) Method and apparatus for detecting video
Luo et al. SFA: small faces attention face detector
CN111414845B (en) Multi-form sentence video positioning method based on space-time diagram inference network
CN113963304B (en) Cross-modal video time sequence action positioning method and system based on time sequence-space diagram
CN115273154B (en) Thermal infrared pedestrian detection method and system based on edge reconstruction and storage medium
CN112949408A (en) Real-time identification method and system for target fish passing through fish channel
Ma et al. Location-aware box reasoning for anchor-based single-shot object detection
CN115712740A (en) Method and system for multi-modal implication enhanced image text retrieval
CN115659966A (en) Rumor detection method and system based on dynamic heteromorphic graph and multi-level attention
Sun et al. Boosting robust learning via leveraging reusable samples in noisy web data
Mijić et al. Traffic sign detection using yolov3
CN105469099B (en) Pavement crack detection and identification method based on sparse representation classification
CN115100497A (en) Robot-based method, device, equipment and medium for routing inspection of abnormal objects in channel
CN109726670B (en) Method for extracting target detection sample set from video
Chen et al. Online spatio-temporal action detection in long-distance imaging affected by the atmosphere
Wang et al. A deep learning-based method for vehicle licenseplate recognition in natural scene

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150128

WD01 Invention patent application deemed withdrawn after publication