CN101216886B - A shot clustering method based on spectral segmentation theory - Google Patents

A shot clustering method based on spectral segmentation theory Download PDF

Info

Publication number
CN101216886B
CN101216886B CN2008100560968A CN200810056096A CN101216886B CN 101216886 B CN101216886 B CN 101216886B CN 2008100560968 A CN2008100560968 A CN 2008100560968A CN 200810056096 A CN200810056096 A CN 200810056096A CN 101216886 B CN101216886 B CN 101216886B
Authority
CN
China
Prior art keywords
camera lens
shot
lens
classification
method based
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2008100560968A
Other languages
Chinese (zh)
Other versions
CN101216886A (en
Inventor
薛玲
李超
钟林
李欢
熊璋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University filed Critical Beihang University
Priority to CN2008100560968A priority Critical patent/CN101216886B/en
Publication of CN101216886A publication Critical patent/CN101216886A/en
Application granted granted Critical
Publication of CN101216886B publication Critical patent/CN101216886B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a shot clustering method based on the spectrum segmentation theory, which comprises the following steps: utilizing, the spectrum segmentation theory for shot clustering; extracting feature vectors of each unspecified shot; calculating similarity between each two categories according to the extracted feature vectors; then constituting each shot cluster as a weighted undirected graph; segmenting each shot category into two shot categories by a using spectrum according to the similarity between each two categories; using Bayesian information criteria to judge whether the segmentation is effective or not, the effectively segmented shot sub-categories are iteratively segmented, the ineffectively segmented shot categories are terminals; finally syncretizing the classification results after the segmentation to get the optimal shot classification number and the classification result. The invention solves the difficult problem that the optimized classification number is difficult to estimate in the clustering algorithm, and improves the recall ratio and the pertinency ratio of the clustering result by utilizing the precise classification spectrum segmentation; the proposed overall fusion operation has a function of correcting the classification errors, thereby effectively avoiding the problem of local optimum relation.

Description

A kind of lens clustering method based on spectral segmentation theory
Technical field
The invention belongs to video content analysis and searching field, be specifically related to a kind of method of camera lens being carried out cluster.
Background technology
Video lens is meant semantically continual one section video content, is the basic structure and the semantic primitive of video information retrieval, and representing the unit of video semanteme to carry out cluster to these is the basis that video semanteme is analyzed.Current clustering algorithm can roughly be divided into to be had supervision and not to have two kinds of supervision.Have the supervision cluster by given sample set sorter to be trained, classification is accurate, but needs artificial mark sample set.Do not have the supervision clustering algorithm and have self-learning function, need not training sample, be difficult to determine but be faced with optimization classification number, classification results to initial division than difficult problems such as sensitivities.
In recent years, the research about lens clustering method has a lot.The method of estimation optimization classification number commonly used has following several in the video lens clustering algorithm at present: (1) is based on the judge criterion of the pattern of exploration, this method is based on a kind of appropriate information criterion judgment criteria, traversal the classification number situation that might occur, obtain optimized classification number; (2) based on the method for estimation that merges, at first choose a number of categories much larger than the optimal classification number and carry out cluster, clustering result merges to obtain optimum classification number mutually according to information entropy least disadvantage principle; (3) based on the iteration cluster of k-means, each clustering result iteration is carried out the k-means algorithm, adopt the appropriate information criterion to judge whether to stop, when all iteration stop, obtain optimum classification number.
First method is the simplest, and the result who obtains also is the most objective, but this method computation complexity is higher, and is not having the hunting zone under the situation of priori necessary enough big.Need recomputate the information entropy loss of fusion after the each fusion of second method assorting process, speed of convergence is slow, the computation complexity height, and can't be to the preliminary classification error correction.Mistake that a class is isolated may appear in advantages such as the third method is representative with X-means, has fast convergence rate, and computation complexity is little, but the k-means algorithm only considered to concern between class, and not at the error correction of this mistake.
Summary of the invention
The technical problem to be solved in the present invention: overcome the deficiencies in the prior art, a kind of lens clustering method based on spectral segmentation theory is provided, this method can be difficult to estimate optimized classification number in the clustering algorithm under the low complex degree situation, utilize the spectrum of accurate two classification to cut apart, improved the recall ratio and the precision ratio of cluster result; The overall mixing operation that proposes has the error correction to classification error, has avoided the locally optimal solution problem.
The object of the present invention is achieved like this: a kind of lens clustering method based on spectral segmentation theory may further comprise the steps:
(1) each camera lens to be classified is extracted its proper vector;
(2) according to the proper vector of extracting, calculate the similarity between per two classes;
(3) the camera lens collection is configured to a non-directed graph of having the right,, uses spectrum to cut apart each camera lens class two is divided into two camera lens classes according to the similarity between per two camera lens classes;
(4) judge with bayesian information criterion whether effective this cuts apart; The camera lens subclass iteration cutting operation of effectively cutting apart, the invalid camera lens class of cutting apart is a terminal node;
(5) spectrum is cut apart the result of final output, used bayesian information criterion to judge that two classify and whether are communicated with, merge, finally obtain optimum cluster numbers and cluster result according to connectedness.
The hsv color histogram is adopted in the extraction of the proper vector in the described step (1), and calculates the proper vector of the average color histogram of whole all frames of camera lens as this camera lens.
The calculating formula of similarity of calculating between per two camera lens classes in the described step (2) is:
e ij = e - | | H i - H j | | 2 2 σ 2
E wherein IjRepresent the similarity between two class camera lens i, the j, H i, H jBe respectively camera lens s i, s jColor histogram, σ are constant.
Method in the described step (3) is: the result that spectrum is cut apart A, B} satisfy following formula and obtain minimum value in global scope:
Ncut ( A , B ) = cut ( A , B ) assoc ( A , V ) + cut ( A , B ) assoc ( B , V )
Wherein:
Figure G2008100560968D00023
With camera lens collection S be expressed as a non-directed graph G=who has the right (V, E), v iPoint representative shot s i, e IjSimilarity between expression camera lens i, the j.
Spectrum cutting procedure in the described step (3) is as follows by the performing step that the calculated characteristics vector obtains: at first calculate N * N symmetric matrix E, each element is e Ij, e IjSimilarity between expression camera lens i, the j obtains diagonal matrix D (E), d according to E Ii=∑ je Ij, make up matrix L (E), L (E)=(D (E)) -1/2E (D (E)) -1/2, select maximum proper vector among the L (E), determine the result of cutting apart according to the sign bit of the every dimension of proper vector.
The bayesian information criterion of described step (4) adopts Gauss's spherical model, can the given sample data collection of optimization match; BIC calculates optional model posterior probability, and as the standard as measurement model adaptability, than judging more effective to calculate two kinds of distances between the model profile as the general information criterion of criterion.
Use bayesian information criterion that the classification that produces is merged in the described step (5), promptly any two classes are carried out the BIC model and judge whether it belongs to two classes, if it is better to belong to a class, then defining this two class is communicated with, at last the classification of all connections is merged, get to the end best cluster numbers and clustering result.
The advantage that prior art of the present invention is compared is:
(1) the existing clustering method classifications that adopt apart from the standard as classification more, the empirical parameter that provides is provided effect, be difficult to provide a kind of general, general solution, but the camera lens cluster is a kind of cluster of uncertain classification number, is difficult to estimate accurately classification number and center of all categories before cluster.The present invention is used for the camera lens cluster with spectrogram theory and bayesian information criterion, each camera lens to be classified is extracted its proper vector, according to the proper vector that extracts, calculate the similarity between per two classes, then the camera lens collection is configured to a non-directed graph of having the right, according to the similarity between per two camera lens classes, to use spectrum to cut apart each camera lens class two and be divided into two camera lens classes, judge with bayesian information criterion whether effective this cuts apart again, the camera lens subclass iteration cutting operation of effectively cutting apart, the invalid camera lens class of cutting apart is a terminal node.The present invention utilizes the spectrum of accurate two classification to cut apart, and it is more accurate as the criterion that classification stops to adopt simultaneously based on the bayesian information criterion of posterior probability, has improved the recall ratio and the precision ratio of cluster result.
(2) the present invention compares with traditional lens clustering method, when the present invention finishes at cutting operation, the result of spectrum being cut apart final output, use bayesian information criterion to judge whether two classification are communicated with, merge according to connectedness, finally obtain optimum cluster numbers and cluster result, solved a difficult problem that under the low complex degree situation, is difficult to estimate optimized classification number in the clustering algorithm; The overall mixing operation that proposes has the error correction to classification error, has avoided the locally optimal solution problem.
Description of drawings
Fig. 1 is the schematic flow sheet that the present invention is based on the spectral segmentation theory cluster.
Embodiment
As shown in Figure 1, the present invention specifically may further comprise the steps:
1. camera lens category feature vector extracts
Extract the proper vector of whole camera lenses, the mean value of calculated characteristics vector is as the proper vector of camera lens class.Adopt the hsv color histogram to describe the feature of image among the present invention, promptly calculate the color characteristic of the average color histogram of its whole frames as this camera lens.
Color characteristic is a kind of bottom physical features that best embodies Image Visual Feature, the object or the scene that are comprised in color characteristic and the image have higher correlativity, and compare with other visual signatures, color characteristic is less to the dependence at the size of image itself, direction, visual angle, has higher robustness.Color histogram is the color characteristic that is widely adopted in image indexing system, and it has described the probability distribution of different color in entire image.Though can't describe the object in the image and the locus of object, but still be a kind of describing method efficiently.
HSV (Hue Saturation Value) color space meets the subjective judgement of people to color similarity most, and this paper adopts the hsv color histogram to describe the feature of image.For each camera lens, calculate the color characteristic of the average color histogram of its whole frames as this camera lens.
In the present invention, adopt 12 (H) * 4 (S) * 4 (V) totally 192 grades of hsv color histograms.Definition H i, H jBe respectively camera lens s i, s jColor histogram, then we define the limit e among the figure G IjAs follows:
e ij = e - | | H i - H j | | 2 2 σ 2
Wherein σ is camera lens s i, s jBetween the function of time interval t because it doesn't matter relation between its camera lens of video of testing in this experiment and time, σ gets constant 0.15.
2. use spectrum to cut apart two story board classes
With camera lens collection S be expressed as a non-directed graph G=who has the right (V, E), ν jPoint representative shot s i, e IjSimilarity between expression camera lens i, the j.The result that spectrum is cut apart A, B} satisfies following formula:
Ncut ( A , B ) = cut ( A , B ) assoc ( A , V ) + cut ( A , B ) assoc ( B , V )
In global scope, obtain minimum value.
Wherein:
Figure G2008100560968D00043
The spectrum partitioning algorithm is briefly described as follows: at first calculate N * N symmetric matrix E, each element is e Ij, obtain diagonal matrix D (E), d according to E Ii=∑ je Ij, make up matrix L (E), L (E)=(D (E)) -1/2E (D (E)) -1/2Select maximum proper vector among the L (E), determine the result of cutting apart according to the sign bit of the every dimension of proper vector.
Distance is estimated in comprehensive of the criteria for classification of spectrum partitioning algorithm definition, the class, makes that the result of cutting apart is more accurate.Algorithm will the problem of minimizing be converted to asks the maximum proper vector of matrix, and can take the method for corresponding approximate treatment, has reduced complexity.
3. calculate the value of BIC (k=1):
If video lens to be clustered is a S set, be used for depositing the formation of camera lens class temporarily, note is made Q, is used to write down the formation of last classification results, and note is made RQ.Spectrum of use is cut apart S set is divided into two camera lens class S 1, S 2(suppose optimal classification count k 〉=2).
With S 1, S 2Insert formation Q afterbody, get the first camera lens class of formation Q S i, suppose that the sample in the camera lens class is Gauss's spherical distribution (Spherical Gaussians) around the class center.Then at camera lens class S iIn sample set X={x i: i=1 ..., the Gaussian distribution of the M dimension of R} is as follows:
f ( μ i , V i ; X ) = ( 2 π ) - p / 2 | V i | - 1 / 2 exp [ - 1 2 ( x - μ i ) ′ V i - 1 ( x - μ i ) ]
Calculate the value of BIC (k=1):
BIC ( k = 1 ) = log L ( μ ^ i , V ^ i ; x i ∈ S i ) - M log R
Wherein
Figure G2008100560968D00052
The maximum likelihood of be respectively sample data collection X in M dimension Gaussian distribution average and variance is estimated.M represents the number of parameter in this model.L is the maximum likelihood function of sample data collection X, L ()=∏ f ().R is the number of samples among the camera lens class Si.
4. calculate the value of BIC (k=2):
Spectrum of use is cut apart it is divided into two classes once more, and note is done: S i (1), S i (2)Then corresponding X (x i (1), x i (2)) Gaussian distribution as follows:
x i (1)~(μ i (1),V i (1)),x i (2)~(μ i (2),V i (2))
Calculate the value of BIC (k=2):
BIC ( k = 2 ) = log [ L ( μ ^ i ( 1 ) , V ^ i ( 1 ) · L ( μ ^ i ( 2 ) , V ^ i ( 2 ) ) ] - 2 M log R
This moment, the number of parameters of model was 2M.
5. judge whether classification is effective:
If BIC (k=2)>BIC (k=1), it is better to think that then such is divided into two classes, and classification is effective, with S i (1), S i (2)Insert formation Q.If BIC (k=2)≤BIC (k=1) thinks that promptly such does not need to divide again, it is invalid to classify, with S iInsert formation RQ.
6. cluster result merges adjustment
Continue step 5.Be located at camera lens class among the formation RQ and be S '=(S ' 1, S ' 2..., S ' k), for i arbitrarily, j (i, j=1 ..., k, i ≠ j), calculate S ' i∪ S ' jBIC (k=2) and BIC (k=1), if BIC (k=2)>BIC (k=1) then defines S ' i, S ' jBe connection, otherwise for not being communicated with.All connected sets are merged output optimal classification number and classification results.

Claims (7)

1. lens clustering method based on spectral segmentation theory is characterized in that may further comprise the steps:
(1) each camera lens to be classified is extracted its proper vector;
(2) according to the proper vector of extracting, calculate the similarity between per two classes;
(3) the camera lens collection is configured to a non-directed graph of having the right,, uses spectrum to cut apart each camera lens class two is divided into two camera lens classes according to the similarity between per two camera lens classes;
(4) judge with bayesian information criterion whether effective this cuts apart; The camera lens subclass iteration cutting operation of effectively cutting apart, the invalid camera lens class of cutting apart is a terminal node;
(5) spectrum is cut apart the result of final output, used bayesian information criterion to judge that two classify and whether are communicated with, merge, finally obtain optimum cluster numbers and cluster result according to connectedness.
2. the lens clustering method based on spectral segmentation theory according to claim 1, it is characterized in that: the hsv color histogram is adopted in the extraction of the proper vector in the described step (1), and calculates the proper vector of the average color histogram of whole all frames of camera lens as this camera lens.
3. the lens clustering method based on spectral segmentation theory according to claim 1 is characterized in that: the calculating formula of similarity of calculating between per two camera lens classes in the described step (2) is:
e ij = e - | | H i - H j | | 2 2 σ 2
E wherein IjRepresent the similarity between two class camera lens i, the j, H i, H jBe respectively camera lens s i, s jColor histogram, σ are constant.
4. the lens clustering method based on spectral segmentation theory according to claim 1 is characterized in that: the method in the described step (3) is: the result that spectrum is cut apart A, B} satisfy following formula and obtain minimum value in global scope:
Ncut ( A , B ) = cut ( A , B ) assoc ( A , V ) + cut ( A , B ) assoc ( B , V )
Wherein: cut ( A , B ) = Σ i ∈ A , j ∈ e ij , assoc ( A ) = Σ i ∈ A , j ∈ V e ij , With camera lens collection S be expressed as a non-directed graph G=who has the right (V, E), v iPoint representative shot s i, e IjSimilarity between expression camera lens i, the j.
5. the lens clustering method based on spectral segmentation theory according to claim 1 is characterized in that: the spectrum cutting procedure in the described step (3) is as follows by the performing step that the calculated characteristics vector obtains: at first calculate N * N symmetric matrix E, each element is e Ij, e IjSimilarity between expression camera lens i, the j obtains diagonal matrix D (E), d according to E Ii=∑ je Ij, make up matrix L (E), L (E)=(D (E)) -1/2E (D (E)) -1/2, select maximum proper vector among the L (E), determine the result of cutting apart according to the sign bit of the every dimension of proper vector.
6. the lens clustering method based on spectral segmentation theory according to claim 1 is characterized in that: the bayesian information criterion of described step (4) adopts Gauss's spherical model, can the given sample data collection of optimization match; BIC calculates optional model posterior probability, and as the standard as measurement model adaptability, than judging more effective to calculate two kinds of distances between the model profile as the general information criterion of criterion.
7. the lens clustering method based on spectral segmentation theory according to claim 1, it is characterized in that: use bayesian information criterion that the classification that produces is merged in the described step (5), promptly any two classes are carried out the BIC model and judge whether it belongs to two classes, if it is better to belong to a class, then defining this two class is communicated with, at last the classification of all connections is merged, get to the end best cluster numbers and clustering result.
CN2008100560968A 2008-01-11 2008-01-11 A shot clustering method based on spectral segmentation theory Expired - Fee Related CN101216886B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008100560968A CN101216886B (en) 2008-01-11 2008-01-11 A shot clustering method based on spectral segmentation theory

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008100560968A CN101216886B (en) 2008-01-11 2008-01-11 A shot clustering method based on spectral segmentation theory

Publications (2)

Publication Number Publication Date
CN101216886A CN101216886A (en) 2008-07-09
CN101216886B true CN101216886B (en) 2010-06-09

Family

ID=39623317

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008100560968A Expired - Fee Related CN101216886B (en) 2008-01-11 2008-01-11 A shot clustering method based on spectral segmentation theory

Country Status (1)

Country Link
CN (1) CN101216886B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102457758B (en) * 2010-10-25 2013-11-20 深圳市掌网立体时代视讯技术有限公司 Tool and method for screening and testing sensor components of dual-lens stereo equipment
CN102496146B (en) * 2011-11-28 2014-03-05 南京大学 Image segmentation method based on visual symbiosis
CN102799891A (en) * 2012-05-24 2012-11-28 浙江大学 Spectral clustering method based on landmark point representation
CN104156497B (en) * 2013-05-14 2018-05-22 利弗莫尔软件技术公司 Split the method and system of the non-directed graph for the matrix for representing to be used in CAE using two-way horizontal collection
CN103942415B (en) * 2014-03-31 2017-10-31 中国人民解放军军事医学科学院卫生装备研究所 A kind of Flow cytometry data automatic analysis method
CN111625683B (en) * 2020-05-07 2023-05-23 山东师范大学 Automatic video abstract generation method and system based on graph structure difference analysis
CN113143200A (en) * 2021-05-07 2021-07-23 苏州健雄职业技术学院 Laser scanning fundus camera imaging method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5825909A (en) * 1996-02-29 1998-10-20 Eastman Kodak Company Automated method and system for image segmentation in digital radiographic images
CN1367468A (en) * 2002-03-25 2002-09-04 北京工业大学 Cornea focus image cutting method based on K-mean cluster and information amalgamation
CN1452130A (en) * 2003-05-29 2003-10-29 上海交通大学 Multiple target image hierarchical clustering method
CN1873657A (en) * 2005-06-03 2006-12-06 中国科学院自动化研究所 Segmentation method of natural image in robustness

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5825909A (en) * 1996-02-29 1998-10-20 Eastman Kodak Company Automated method and system for image segmentation in digital radiographic images
CN1367468A (en) * 2002-03-25 2002-09-04 北京工业大学 Cornea focus image cutting method based on K-mean cluster and information amalgamation
CN1452130A (en) * 2003-05-29 2003-10-29 上海交通大学 Multiple target image hierarchical clustering method
CN1873657A (en) * 2005-06-03 2006-12-06 中国科学院自动化研究所 Segmentation method of natural image in robustness

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
方勇,戚飞虎.一种新的视频镜头边界检测及关键帧提取方法.华南理工大学学报32.2004,3218-23. *
田娟,王崇骏,李静,陈兆乾.一种基于谱图分割的简单聚类算法.复旦学报43 5.2004,43(5),810-814.
田娟,王崇骏,李静,陈兆乾.一种基于谱图分割的简单聚类算法.复旦学报43 5.2004,43(5),810-814. *

Also Published As

Publication number Publication date
CN101216886A (en) 2008-07-09

Similar Documents

Publication Publication Date Title
CN113378632B (en) Pseudo-label optimization-based unsupervised domain adaptive pedestrian re-identification method
CN110363122B (en) Cross-domain target detection method based on multi-layer feature alignment
CN109902590B (en) Pedestrian re-identification method for deep multi-view characteristic distance learning
CN109325960B (en) Infrared cloud chart cyclone analysis method and analysis system
CN108921083B (en) Illegal mobile vendor identification method based on deep learning target detection
CN108090429B (en) Vehicle type recognition method for graded front face bayonet
CN101216886B (en) A shot clustering method based on spectral segmentation theory
CN111291678B (en) Face image clustering method and device based on multi-feature fusion
US8611644B2 (en) Method and apparatus for training classifier, method and apparatus for image recognition
US10740927B2 (en) Method and device for vehicle identification
CN102254303B (en) Methods for segmenting and searching remote sensing image
CN102436589B (en) Complex object automatic recognition method based on multi-category primitive self-learning
CN109410184B (en) Live broadcast pornographic image detection method based on dense confrontation network semi-supervised learning
CN112883839B (en) Remote sensing image interpretation method based on adaptive sample set construction and deep learning
CN102044073B (en) Method and system for judging crowd density in image
CN105718866A (en) Visual target detection and identification method
CN103984943A (en) Scene text identification method based on Bayesian probability frame
CN102411711B (en) Finger vein recognition method based on individualized weight
CN101923652A (en) Pornographic picture identification method based on joint detection of skin colors and featured body parts
CN106055653A (en) Video synopsis object retrieval method based on image semantic annotation
CN106022254A (en) Image recognition technology
CN103839033A (en) Face identification method based on fuzzy rule
CN109086794B (en) Driving behavior pattern recognition method based on T-LDA topic model
CN112734741A (en) Image processing method and system for pneumonia CT image
CN110880010A (en) Visual SLAM closed loop detection algorithm based on convolutional neural network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100609

Termination date: 20150111

EXPY Termination of patent right or utility model