CN104809468A - Multi-view classification method based on indefinite kernels - Google Patents

Multi-view classification method based on indefinite kernels Download PDF

Info

Publication number
CN104809468A
CN104809468A CN201510188546.9A CN201510188546A CN104809468A CN 104809468 A CN104809468 A CN 104809468A CN 201510188546 A CN201510188546 A CN 201510188546A CN 104809468 A CN104809468 A CN 104809468A
Authority
CN
China
Prior art keywords
view
sigma
gamma
data
represent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510188546.9A
Other languages
Chinese (zh)
Inventor
薛晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN201510188546.9A priority Critical patent/CN104809468A/en
Publication of CN104809468A publication Critical patent/CN104809468A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a multi-view classification method based on indefinite kernels. The method includes the following steps of firstly, obtaining a multi-view image set used for conducting training; secondly, generating a projection matrix through multi-view data, and projecting different view data in a united low-dimension space; thirdly, training samples in the low-dimension projection space through the definite kernel technology to obtain a classifier; fourthly, standardizing new multi-view data sets, projecting the data sets into the trained low-dimension space, and inputting the projected data sets into the classifier obtained through training so as to obtain the classification result. By means of the method, the mark-number-incomplete multi-view classification problem which needs to be solved is converted into the single-view semi-supervised classification problem in the united low-dimension space, and the mark number integrity on a single view can be achieved; judgment information of data with mark numbers and structure information of data without mark numbers are fully utilized, and the performance of the classifier is improved; in addition, new multi-view data can be directly tested and classified.

Description

A kind of multi views sorting technique based on indefinite core
Technical field
The present invention relates to pattern-recognition and machine learning techniques field, particularly a kind of multi views sorting technique based on indefinite core.
Background technology
Kernel method is one of core technology in machine learning, it is a class important method of the Nonlinear Learning problem existed in solving practical problems, its core concept is, by a kind of Nonlinear Mapping, raw data is embedded into high-dimensional feature space, then utilizes linear learning device in new spatial analysis and process data.Its advantage is mainly reflected in, and without the need to knowing concrete form and the parameter of Nonlinear Mapping in advance, but introducing kernel function, by changing kernel function form and parameter, implicitly realizing the mapping from the low-dimensional input space to high-dimensional feature space; Utilize kernel function, the kernel function that inner product operation complicated in high-dimensional feature space can be converted into the low-dimensional input space calculates, thus solves the problems such as " dimension disaster " that may occur when high-dimensional feature space computing dexterously, greatly reduces calculated amount; According to particular problem, kernel function can be selected neatly, to embed the priori of more problems concerning study.
Although kernel method has above-mentioned many key advantage, be limited to traditional Statistical Learning Theory, existing most of kernel method all requires core positive definite, meets Mercer condition.But in many practical applications, positive definite kernel can not obtain good Generalization Capability sometimes, be even difficult to utilize.On the contrary, indefinite core be able to show the experience classifying quality more excellent than positive definite kernel, becomes a study hotspot in machine learning field just gradually, receives the concern of more and more researcher.Such as, in recognition of face problem, Liu employs indefinite fractional order polynomial kernel in core principle component analysis (KPCA), achieves recognition effect more better than the polynomial kernel KPCA of use positive definite; In video tracking problem, Liwicki etc. make use of indefinite robust gradient core further, to solve the out of true solution problem may brought to keep renewal speed to use the sample set of simplification to represent in positive definite kernel KPCA.Experiment shows, adopts the KPCA of indefinite core to be obviously better than using the gaussian kernel KPCA of positive definite.But existing indefinite kernel method is all confined in traditional single-view problem.Along with the development of data acquisition technology, researcher finds in increasing real-world problem, there is a large amount of multi views sample, for example, people can describe with facial image and sound, and each internet webpage can represent etc. with document and web page interlinkage.Therefore, being extended to further by indefinite kernel method in multi views problem concerning study, to adapt to the needs of practical application better, is necessary.
Multi views study is another hot issue of machine learning.Large quantity research shows, often contains more prior imformation in problem concerning study in multi views data.Multi views the destination of study is just, deeply excavates the potential information between multiple view, by mutually promoting between view, to obtain better learning performance.But, in the study of traditional multi views, often require that the class label of data on each view is complete, but this requirement is usually difficult to meet in realistic problem.Such as, in one section of video record, p.s. all can flash multiframe picture, and it is almost impossible for wanting in every frame picture, mark everyone; Meanwhile, if having multiple people to talk in video recording, the audio frequency marking everyone is wanted also to be difficult to.Therefore, under the data on Audio and Video two views only have a small amount of labelled situation, it is extremely difficult especially that the data that ensure on each view all marked all people.Therefore, on multiple view, label is imperfect is an one of difficult problem in multi views study, and existing multi views learning method all or can not be difficult to head it off effectively.
In existing multi views study, nearly all learning algorithm all designs based on two views.When the number of view is more than two, man-to-man strategy can be adopted to learn, by view combination of two, apply existing algorithm and classify, then choose final classification results in a vote.Therefore, the multi views problem concerning study of two views is the keys solving multi views classification problem.
Summary of the invention
Goal of the invention: for prior art Problems existing, the present invention proposes a kind of multi views sorting technique based on indefinite core, the method is by being mapped to a new unified lower dimensional space and introducing indefinite kernel method by multiple view spaces, effectively prevent " dimension disaster ", improve dirigibility and the validity of subsequent classification.
Technical scheme: before the method concrete steps are described, first provide related definition and expression:
(a) sample: one group of Amazon database picture data set;
(b) category label: the i.e. generic of a sample;
(c) indefinite core: developed and next indefinite kernel function by data inner product in reproducing kernel Kre ǐ n space;
(d) canonical correlation analysis algorithm (CCA): Canonical Correlation Analysis;
(e) principal component analysis (PCA) (PCA): Principal Component Analysis;
(f) semi-supervised canonical correlation analysis algorithm (SemiCCA): Semi-supervised Canonical CorrelationAnalysis;
G () partly matches semi-supervised Generalized correlation analysis (S 2gCA): Semi-paired and Semi-supervisedGeneralized Correlation Analysis;
H territory adaptive method (ARC-t) that () changes based on asymmetric consideration convey: Asymmetric RegularizedCross-domain Transformation Problem with similarity and dissimilarity constraints;
(i) semi-supervised identification indefinite core regularization least square sorter (Semi-IKRLSC): Semi-supervised discriminatively Indefinite Kernel Regularized Least-SquareClassifier;
(j) spatial synchronization: multiple view spaces is mapped to new single view spaces.
The invention provides a kind of multi views sorting technique based on indefinite core, mainly for the multi views sorting technique of two views, the method comprises two stages: training and application, specifically comprise the steps:
1) the multi-view image collection for training is obtained;
2) multi views data genaration projection matrix is utilized, by different views data projection to unified lower dimensional space;
3) utilize indefinite nuclear technology to the sample training in low dimension projective space, obtain sorter;
4) the multi views data set standardizing new, projects to the lower dimensional space of training and obtaining by data set, and the data set after projection is input in the sorter of training and obtaining, and obtains classification results.
Described step 2) adopt SemiCCA algorithm to carry out dimensionality reduction, specifically comprise the steps:
1) according to the data of two views, projection matrix is solved:
If X=is (X (L), X (U)), Y=(Y (L), Y (U)) represent data in two view View-X and View-Y respectively, wherein with represent the sample matched in View-X and View-Y respectively, with represent unpaired sample in View-X and View-Y respectively, the formula solving projection matrix is:
M Q x Q y = λN Q x Q y - - - ( 1 )
Wherein, Q xrepresent the projection matrix corresponding to View-X, Q yrepresent the projection matrix corresponding to View-Y, M = μ 0 R xy ( L ) R yx ( L ) 0 + ( 1 - μ ) R xx 0 0 R yy , N = μ R xx ( L ) 0 0 R yy ( L ) + ( 1 - μ ) I D x 0 0 I D y , represent the covariance matrix of paired sample in View-X, represent the covariance matrix of paired sample in View-Y, represent the covariance matrix of paired sample between View-X and View-Y two views, represent the covariance matrix of paired sample between View-Y and View-X two views, represent the covariance matrix of total data in View-X, represent the covariance matrix of total data in View-Y, the μ regularization parameter that to be span be [0,1], λ is generalized eigenvalue, D xand D yrepresent the dimension of View-X and View-Y respectively, represent D x× D xthe unit matrix of dimension, represent D y× D ythe unit matrix of dimension;
2) according to the projection matrix solved, lower dimensional space is mapped to by unified for multi views data X and Y.
Described step 3) adopt Semi-IKRLSC Algorithm for Training sorter, given sample set S = { x 1 , . . . , x l , x l + 1 , . . . , x n } ⋐ R m , Wherein S L = { ( x i , y i ) } i = 1 l Have label sample, y from c class irepresent category label, be non-label sample, concrete training process is:
1) degree of separation T between compactness and class is obtained in the local class having a label sample lwand T lb:
T lw = Σ k = 1 c Σ i = 1 l k | | f ( x i ( k ) ) - 1 / l k Σ j = 1 l k f ( x j ( k ) ) | | 2 = 1 2 Q T Σ i = 1 l Σ j = 1 l Γ i , j lw ( x i - x j ) ( x i - x j ) T Q = Q T M lw Q - - - ( 5 )
T lb = Σ k = 1 c l k | | 1 / l k Σ j = 1 l k f ( x j ( k ) ) - 1 / l Σ j = 1 l f ( x j ) | | 2 = 1 2 Q T Σ i = 1 l Σ j = 1 l Γ i , j lb ( x i - x j ) ( x i - x j ) T Q = Q T M lb Q - - - ( 6 )
Wherein, f (x)=Q tx is classifier functions to be asked, represent i-th marker samples of a kth classification, l kfor the number of samples that a kth classification marks altogether, Q is discriminant classification vector, M lw = 1 2 Σ i = 1 l Σ j = 1 l Γ j = 1 lw ( x i - x j ) ( x i - x j ) T , M lb = 1 2 Σ i = 1 l Σ j = 1 l Γ j = 1 lb ( x i - x j ) ( x i - x j ) T , in order to weigh the relative position of similar middle sample, δ is variance;
2) the global structure information T of data is calculated p:
T p = Σ i = 1 n | | f ( x i ) - 1 n Σ j = 1 n f ( x j ) | | 2 = 1 2 Q T Σ i = 1 n Σ j = 1 n Γ i , j p ( x i - x j ) ( x i - x j ) T Q = Q T M p Q - - - ( 7 )
Wherein, f is classifier functions, and Q is discriminant classification vector, M p = 1 2 Σ i = 1 n Σ j = 1 n Γ i , j p ( x i - x j ) ( x i - x j ) T ;
3) the global structure information of data is introduced degree of separation between compactness and class in the class of local:
G ( SSL ) ( f ) = ( 1 - γ ) T lw + γ | | Q | | 2 = Q T [ ( 1 - γ ) M lw + γI ] Q = Q T T rlw ( SSL ) Q - - - ( 8 )
J ( SSL ) ( f ) = ( 1 - γ ) T lb + γ T p = Q T [ ( 1 - γ ) M lb + γ M p ] Q = Q T T rlb ( SSL ) Q - - - ( 9 )
Wherein, G (SSL)(f) and J (SSL)(f) be respectively introduce global structure information class in degree of separation between compactness and class, γ is regularization parameter, T lwfor have a label data local class in compactness, Q is discriminant classification vector, and I is unit battle array, for have a label data local class between degree of separation, T pfor the global structure information of data, T rlb ( SSL ) = ( 1 - γ ) M lb + γ M p ;
4) utilize indefinite core sorting technique, train all samples, obtain classifier functions f *:
f * = min f ∈ K ~ { 1 / l Σ i = 1 l ( y i - f ( x i ) ) 2 + [ η G ( SSL ) ( f ) - ( 1 - η ) J ( SSL ) ( f ) ] } = min f ∈ K ~ { 1 / l Σ i = 1 l ( y i - Q T x i ) 2 + Q T [ η T rlw ( SSL ) - ( 1 - η ) T rlb ( SSL ) ] Q } - - - ( 12 )
Wherein, represent reproducing kernel Kre ǐ n space, l is the number having label sample, y ifor sample x icategory label, f is classifier functions, and 0≤η≤1 is regularization parameter, G (SSL)compactness in f class that () is introducing global structure information, J (SSL)degree of separation between f class that () is introducing global structure information, Q is discriminant classification vector, T rlw ( SSL ) = ( 1 - γ ) M lw + γI , T rlb ( SSL ) = ( 1 - γ ) M lb + γ M p .
Beneficial effect: advantage of the present invention is mainly reflected in three aspects:
1, select the unification of the multiple view of SemiCCA algorithm realization, will the imperfect multi views classification problem of label solved be needed to be converted into the single-view semisupervised classification problem in unified lower dimensional space, the label that can realize in single-view be complete.
2, select the Semi-IKRLSC Algorithm for Training based on indefinite core to go out performance preferably sorter, and take full advantage of the discriminant information of label data and the structural information of unlabelled data, to promote classifier performance.
3, can directly new multi views data be tested and be classified.
Accompanying drawing explanation
Fig. 1 is the method flow diagram of the multi views classification that the present invention is based on indefinite core;
Fig. 2 is the spatial synchronization schematic diagram in training stage dimensionality reduction step of the present invention.
Embodiment
Below in conjunction with accompanying drawing and example, the present invention is described in detail, these embodiments should be understood only be not used in for illustration of the present invention and limit the scope of the invention, after having read the present invention, the amendment of those skilled in the art to the various equivalent form of value of the present invention has all fallen within right appended by the application.
The imperfect problem of label during the present invention learns mainly for multi views, and in conjunction with indefinite kernel method, to obtaining better classification performance.Institute's problems faced mainly comprises following three aspects:
1, in multi views study, a large amount of label sample is obtained very difficult.Having on a small quantity in label sample situation, ensureing that the data label of each view is complete extremely difficult especially.But existing multi views learning method is mostly based upon on the complete basis of viewdata label, very difficult process is not at same view spaces and the incomplete data of label.
2, existing sorting technique not only directly can not train the incomplete data of multi views label, nor can utilize data message fully, and classification performance is not high.
3, existing multi views method effectively cannot be tested new multi views label deficiency of data and classifies.
A kind of multi views sorting technique based on indefinite core as shown in Figure 1, comprises two stages: training and application, specifically comprise the steps:
1) the multi-view image collection for training is obtained;
2) multi views data genaration projection matrix is utilized, by different views data projection to unified lower dimensional space;
3) utilize indefinite nuclear technology to the sample training in low dimension projective space, obtain sorter;
4) the multi views data set standardizing new, projects to the lower dimensional space of training and obtaining by data set, and the data set after projection is input in the sorter of training and obtaining, and obtains classification results.
Existing much research demonstrates the remarkable learning performance of CCA algorithm in multi views study, but CCA is based upon on the data set basis of matching completely, and CCA probably caused and joins phenomenon.For avoiding this phenomenon, SemiCCA algorithm is on the basis of CCA, and the penalty term adding PCA form improves, and make use of pairing and non-paired data simultaneously.Its basic thought is, finds one group of projection matrix Q xand Q y, make the correlativity of similar features in sample set maximum, the correlativity between different characteristic is minimum.We utilize SemiCCA algorithm, by the data projection of different views in unified lower dimensional space, make data class label complete, so, both reached the object of dimensionality reduction, also solved the incomplete problem of label, the spatial synchronization thought that Fig. 2 directviewing description is used in dimensionality reduction step.
Given X=(X (L), X (U)), Y=(Y (L), Y (U)) represent data in two view View-X and View-Y respectively, wherein with represent the sample matched in View-X and View-Y respectively, with represent unpaired sample in View-X and View-Y respectively.Therefore, can be expressed as to solving of SemiCCA the problem that solves generalized eigenvalue:
M Q x Q y = λN Q x Q y - - - ( 1 )
Wherein,
M = μ 0 R xy ( L ) R yx ( L ) 0 + ( 1 - μ ) R xx 0 0 R yy - - - ( 2 )
N = μ R xx ( L ) 0 0 R yy ( L ) + ( 1 - μ ) I D x 0 0 I D y - - - ( 3 )
represent the covariance matrix of paired sample in View-X, represent the covariance matrix of paired sample in View-Y, represent the covariance matrix of paired sample between View-X and View-Y two views, represent the covariance matrix of paired sample between View-Y and View-X two views, represent the covariance matrix of total data in View-X, represent the covariance matrix of total data in View-Y, the μ regularization parameter that to be span be [0,1], for carrying out suitable compromise between CCA and PCA.Solve this generalized eigenvalue problem, obtain projection matrix Q xand Q ycorrespond respectively to View-X and View-Y, be then mapped to unified for multi views data X and Y in new lower dimensional space Z, thus reach the object of spatial synchronization, wherein the dimension of View-Z is less than the dimension of View-X and View-Y.Thus, the incomplete multi views classification problem of label has just changed into the single-view semisupervised classification problem in View-Z.
In semi-supervised learning, the traditional classifier based on Tikhonov Regularization Theory only considered the slickness of classification function, and ignores discriminant information and the structural information of data, limits the raising of classification performance.Semi-IKRLSC algorithm is by being incorporated in semisupervised classification by indefinite core, global structure information in conjunction with data defines new regularization term with local discriminant information, take full advantage of sample prior imformation, make similar sample in output region close as much as possible, and inhomogeneity sample as much as possible away from, effectively improve classifier performance.
Given sample set S = { x 1 , . . . , x l , x l + 1 , . . . , x n } ⋐ R m , Wherein S L = { ( x i , y i ) } i = 1 l Have label sample, y from c class irepresent category label, it is non-label sample.The objective function of Semi-IKRLSC algorithm can be write as following form:
min f &Element; K ~ { 1 / l &Sigma; i = 1 l V ( y i , f ( x i ) ) + &lambda; < f , f > K ~ } - - - ( 4 )
Wherein, represent reproducing kernel Kre ǐ n space, V (y i, f (x i)) be loss function, in order to measure the error between class label predicted value and actual value. for indefinite regularization term, in order to global structure information and the local discriminant information of fused data.
In order to construct indefinite regularization term first T is used lwand T lbdegree of separation between compactness and class in the local class indicating label sample, T lwand T lbbe defined as follows:
T lw = &Sigma; k = 1 c &Sigma; i = 1 l k | | f ( x i ( k ) ) - 1 / l k &Sigma; j = 1 l k f ( x j ( k ) ) | | 2 = 1 2 Q T &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lw ( x i - x j ) ( x i - x j ) T Q = Q T M lw Q - - - ( 5 )
T lb = &Sigma; k = 1 c l k | | 1 / l k &Sigma; j = 1 l k f ( x j ( k ) ) - 1 / l &Sigma; j = 1 l f ( x j ) | | 2 = 1 2 Q T &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lb ( x i - x j ) ( x i - x j ) T Q = Q T M lb Q - - - ( 6 )
Wherein, f (x)=Q tx is classifier functions to be asked, and Q is discriminant classification vector, M lw = 1 2 &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; j = 1 lw ( x i - x j ) ( x i - x j ) T , M lb = 1 2 &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; j = 1 lb ( x i - x j ) ( x i - x j ) T , represent i-th marker samples of kth class, l krepresent a kth class always total l kindividual marker samples, δ represents variance.Υ i,jin order to weigh the relative position of similar middle sample: if similar sample is relatively close, then Υ i,jvalue comparatively large, represent that sample has stronger correlativity in output region; On the contrary, if similar sample is relatively away from, then Υ i,jvalue less, represent that the relevance of sample in output region is less; And for inhomogeneous sample, in output region then as far as possible away from.
But when there being label sample less, the tolerance T of degree of separation between compactness and class in the class of local defined above lwand T lbhave inclined.For avoiding this situation, we introduce the global structure information of data:
T p = &Sigma; i = 1 n | | f ( x i ) - 1 n &Sigma; j = 1 n f ( x j ) | | 2 = 1 2 Q T &Sigma; i = 1 n &Sigma; j = 1 n &Gamma; i , j p ( x i - x j ) ( x i - x j ) T Q = Q T M p Q - - - ( 7 )
Wherein, M p = 1 2 &Sigma; i = 1 n &Sigma; j = 1 n &Gamma; i , j p ( x i - x j ) ( x i - x j ) T , &Gamma; i , j p = 1 / n .
Then degree of separation G between compactness and class is redefined in the class in semi-supervised problem (SSL)(f) and J (SSL)(f):
G ( SSL ) ( f ) = ( 1 - &gamma; ) T lw + &gamma; | | Q | | 2 = Q T [ ( 1 - &gamma; ) M lw + &gamma;I ] Q = Q T T rlw ( SSL ) Q - - - ( 8 )
J ( SSL ) ( f ) = ( 1 - &gamma; ) T lb + &gamma; T p = Q T [ ( 1 - &gamma; ) M lb + &gamma; M p ] Q = Q T T rlb ( SSL ) Q - - - ( 9 )
Wherein, γ is regularization parameter, is used for compromising to global structure information and local discriminant information, 0≤γ≤1.T lwfor have a label data local class in compactness, t lbfor have a label data local class between degree of separation, unit matrix I retrains T as regularization term lw, to improve compactness G in class (SSL)the stability of (f).T pfeature the global information of label sample and unlabelled sample, J can be ensured well (SSL)the reliability of (f).
Thus, final objective function can be expressed as:
min f &Element; K ~ { 1 / l &Sigma; i = 1 l ( y i - f ( x i ) ) 2 + [ &eta; G ( SSL ) ( f ) - ( 1 - &eta; ) J ( SSL ) ( f ) ] } - - - ( 10 )
Namely
min f &Element; K ~ { 1 / l &Sigma; i = 1 l ( y i - Q T x i ) 2 + Q T [ &eta; T rlw ( SSL ) - ( 1 - &eta; ) T rlb ( SSL ) ] Q } - - - ( 11 )
Wherein, l is the number having label sample, y ifor sample x icategory label.η is regularization parameter, 0≤η≤1.
To new multi views label deficiency of data, first project, unified for the data of different views to new lower dimensional space, solve the incomplete problem of class label.Then utilize the semi-supervised classifier of having trained and having obtained, the data of lower dimensional space are classified.
When given view is more than two, man-to-man strategy can be adopted to learn, by view combination of two, application the inventive method carries out dimensionality reduction and classification, then chooses final classification results in a vote.
We adopt Amazon database to test, and first all pictures are adjusted to consistent length, and turn to gray-scale map.By scale invariability, Amazon database data has two kinds of feature: SURF and SIFT feature.For SURF feature, have two kinds of coded systems: (1) carries out k-means cluster to part Amazon database data, coding dimension is 800; (2) another kind of coding dimension is 600, and Dslr database constructs.The coding dimension of SIFT feature is 900.These two kinds of features are respectively as View-X and View-Y view.Dslr always has the picture of 31 classifications, and we choose wherein 20 classes and train.S 2gCA and ARC-t algorithm is the algorithms most in use in multi views study, and k nearest neighbor algorithm (KNN) is the basic skills in sorting algorithm, and Laplace regularization least square algorithm (MR) is positive definite kernel method classical in semisupervised classification.The present invention respectively with S 2gCA, ARC-t and SemiCCA+MR, SemiCCA+KNN contrast, and experimental result is as shown in table 1, can find out that the multi views sorting technique performance based on indefinite core in the present invention is apparently higher than other algorithms, and stability are higher.On Webcam and Dslr data set, even if data processing technique is different, but the nicety of grading of institute of the present invention extracting method is all stabilized between 50%-60% on three data sets.On Amazon and Dslr data set, be then stabilized in about 40%.
Contrast and experiment on table 1 Amazon data set.

Claims (3)

1., based on a multi views sorting technique for indefinite core, it is characterized in that, comprise the steps:
1) the multi-view image collection for training is obtained;
2) multi views data genaration projection matrix is utilized, by different views data projection to unified lower dimensional space;
3) utilize indefinite nuclear technology to the sample training in low dimension projective space, obtain sorter;
4) the multi views data set standardizing new, projects to the lower dimensional space of training and obtaining by data set, and the data set after projection is input in the sorter of training and obtaining, and obtains classification results.
2. the multi views sorting technique based on indefinite core according to claim 1, is characterized in that, described step 2) adopt SemiCCA algorithm to carry out dimensionality reduction, specifically comprise the steps:
1) according to the data of two views, projection matrix is solved;
If X=is (X (L), X (U)), Y=(Y (L), Y (U)) represent data in two view View-X and View-Y respectively, wherein with represent the sample matched in View-X and View-Y respectively, with represent unpaired sample in View-X and View-Y respectively, then the formula solving projection matrix is:
M Q x Q y = &lambda;N Q x Q y - - - ( 1 )
Wherein, Q xrepresent the projection matrix corresponding to View-X, Q yrepresent the projection matrix corresponding to View-Y, M = &mu; 0 R xy ( L ) R yx ( L ) 0 + ( 1 - &mu; ) R xx 0 0 R yy , N = &mu; R xx ( L ) 0 0 R yy ( L ) + ( 1 - &mu; ) I D x 0 0 I D y , represent the covariance matrix of paired sample in View-X, represent the covariance matrix of paired sample in View-Y, represent the covariance matrix of paired sample between View-X and View-Y two views, represent the covariance matrix of paired sample between View-Y and View-X two views, represent the covariance matrix of total data in View-X, represent the covariance matrix of total data in View-Y, the μ regularization parameter that to be span be [0,1], λ is generalized eigenvalue, D xand D yrepresent the dimension of View-X and View-Y respectively, represent D x× D xthe unit matrix of dimension, represent D y× D ythe unit matrix of dimension;
2) according to the projection matrix solved, lower dimensional space is mapped to by unified for multi views data X and Y.
3. the multi views sorting technique based on indefinite core according to claim 1, is characterized in that, described step 3) adopt Semi-IKRLSC Algorithm for Training sorter, given sample set S = { x 1 , &CenterDot; &CenterDot; &CenterDot; , x l , x l + 1 , &CenterDot; &CenterDot; &CenterDot; , x n } &Subset; R m , Wherein S L = { ( x i , y i ) } i = 1 l Have label sample, y from c class irepresent category label, be non-label sample, concrete training process is:
1) degree of separation T between compactness and class is obtained in the local class having a label sample lwand T lb:
T lw = &Sigma; k = 1 c &Sigma; i = 1 l k | | f ( x i ( k ) ) - 1 / l k &Sigma; j = 1 l k f ( x j ( k ) ) | | 2 = 1 2 Q T &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lw ( x i - x j ) ( x i - x j ) T Q = Q T M lw Q - - - ( 5 )
T lb = &Sigma; k = 1 c l k | | 1 / l k &Sigma; i = 1 l k f ( x i ( k ) ) - 1 / l &Sigma; j = 1 l f ( x j ) | | 2 = 1 2 Q T &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lb ( x i - x j ) ( x i - x j ) T Q = Q T M lb Q - - - ( 6 )
Wherein, f (x)=Q tx is classifier functions to be asked, represent i-th marker samples of a kth classification, l kfor the number of samples that a kth classification marks altogether, Q is discriminant classification vector, M lw = 1 2 &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lw ( x i - x j ) ( x i - x j ) T , M lb = 1 2 &Sigma; i = 1 l &Sigma; j = 1 l &Gamma; i , j lb ( x i - x j ) ( x i - x j ) T , in order to weigh the relative position of similar middle sample, δ is variance;
2) the global structure information T of data is calculated p:
T p = &Sigma; i = 1 n | | f ( x i ) - 1 n &Sigma; j = 1 n f ( x j ) | | 2 = 1 2 Q T &Sigma; i = 1 n &Sigma; j = 1 n &Gamma; i , j p ( x i - x j ) ( x i - x j ) T Q = Q T M p Q - - - ( 7 )
Wherein, f is classifier functions, and Q is discriminant classification vector, M p = 1 2 &Sigma; i = 1 n &Sigma; j = 1 n &Gamma; i , j p ( x i - x j ) ( x i - x j ) T ;
3) the global structure information of data is introduced degree of separation between compactness and class in the class of local:
G ( SSL ) ( f ) = ( 1 - &gamma; ) T lw + &gamma; | | Q | | 2 = Q T [ ( 1 - &gamma; ) M lw + &gamma;I ] Q = Q T T rlw ( SSL ) Q - - - ( 8 )
J ( SSL ) ( f ) = ( 1 - &gamma; ) T lb + &gamma; T p = Q T [ ( 1 - &gamma; ) M lb + &gamma; M p ] Q = Q T T rlb ( SSL ) Q - - - ( 9 )
Wherein, G (SSL)(f) and J (SSL)(f) be respectively introduce global structure information class in degree of separation between compactness and class, γ is regularization parameter, T lwfor have a label data local class in compactness, Q is discriminant classification vector, and I is unit battle array, t lbfor have a label data local class between degree of separation, T pfor the global structure information of data, T rlb ( SSL ) = ( 1 - &gamma; ) M lb + &gamma; M p ;
4) utilize indefinite core sorting technique, train all samples, obtain classifier functions f *:
f * = min f &Element; K ~ { 1 / l &Sigma; i = 1 l ( y i - f ( x i ) ) 2 + [ &eta; G ( SSL ) ( f ) - ( 1 - &eta; ) J ( SSL ) ( f ) ] } = min f &Element; K ~ { 1 / l &Sigma; i = 1 l ( y i - Q T x i ) 2 + Q T [ &eta; T rlw ( SSL ) - ( 1 - &eta; ) T rlb ( SSL ) ] Q } - - - ( 12 )
Wherein, represent reproducing kernel space, l is the number having label sample, y ifor sample x icategory label, f is classifier functions, and 0≤η≤1 is regularization parameter, G (SSL)compactness in f class that () is introducing global structure information, J (SSL)degree of separation between f class that () is introducing global structure information, Q is discriminant classification vector, T rlw ( SSL ) = ( 1 - &gamma; ) M lw + &gamma;I , T rlb ( SSL ) = ( 1 - &gamma; ) M lb + &gamma; M p .
CN201510188546.9A 2015-04-20 2015-04-20 Multi-view classification method based on indefinite kernels Pending CN104809468A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510188546.9A CN104809468A (en) 2015-04-20 2015-04-20 Multi-view classification method based on indefinite kernels

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510188546.9A CN104809468A (en) 2015-04-20 2015-04-20 Multi-view classification method based on indefinite kernels

Publications (1)

Publication Number Publication Date
CN104809468A true CN104809468A (en) 2015-07-29

Family

ID=53694279

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510188546.9A Pending CN104809468A (en) 2015-04-20 2015-04-20 Multi-view classification method based on indefinite kernels

Country Status (1)

Country Link
CN (1) CN104809468A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107169505A (en) * 2017-04-01 2017-09-15 南京邮电大学 A kind of image classification method based on the incoherent multi views of nuclear statistics
CN107239787A (en) * 2016-03-29 2017-10-10 南京大学 A kind of utilization multi-source data have the Image classification method of privacy protection function
CN107967343A (en) * 2017-12-11 2018-04-27 西南交通大学 A kind of multi views construction method
CN108564051A (en) * 2018-04-23 2018-09-21 广州大学 A kind of core fast algorithm of regularization affine hull model face image set identification
CN109145976A (en) * 2018-08-14 2019-01-04 聚时科技(上海)有限公司 A kind of multiple view cluster machine learning method based on optimal neighbours' core

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012135979A1 (en) * 2011-04-08 2012-10-11 Nokia Corporation Method, apparatus and computer program product for providing multi-view face alignment
CN103955462A (en) * 2014-03-21 2014-07-30 南京邮电大学 Image marking method based on multi-view and semi-supervised learning mechanism
CN104463208A (en) * 2014-12-09 2015-03-25 北京工商大学 Multi-view semi-supervised collaboration classification algorithm with combination of agreement and disagreement label rules

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012135979A1 (en) * 2011-04-08 2012-10-11 Nokia Corporation Method, apparatus and computer program product for providing multi-view face alignment
CN103955462A (en) * 2014-03-21 2014-07-30 南京邮电大学 Image marking method based on multi-view and semi-supervised learning mechanism
CN104463208A (en) * 2014-12-09 2015-03-25 北京工商大学 Multi-view semi-supervised collaboration classification algorithm with combination of agreement and disagreement label rules

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HUIXUE等: "Discriminative Indefinite Kernel Classifier from Pairwise Constraints and Unlabeled Data", 《21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012)》 *
JIANHUI CHEN等: "Training SVM with Indefinite Kernels", 《ICML’08 PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING》 *
周旭东: "基于不同多视图数据场景的典型相关分析研究和应用", 《中国博士学位论文全文数据库 信息科技辑》 *
薛晖等: "基于跨视图约束的多视图分类方法", 《模式识别与人工智能》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107239787A (en) * 2016-03-29 2017-10-10 南京大学 A kind of utilization multi-source data have the Image classification method of privacy protection function
CN107169505A (en) * 2017-04-01 2017-09-15 南京邮电大学 A kind of image classification method based on the incoherent multi views of nuclear statistics
CN107967343A (en) * 2017-12-11 2018-04-27 西南交通大学 A kind of multi views construction method
CN107967343B (en) * 2017-12-11 2021-04-13 西南交通大学 Multi-view construction method
CN108564051A (en) * 2018-04-23 2018-09-21 广州大学 A kind of core fast algorithm of regularization affine hull model face image set identification
CN108564051B (en) * 2018-04-23 2021-10-01 广州大学 Rapid kernel calculation method for regularized affine bag model face image set identification
CN109145976A (en) * 2018-08-14 2019-01-04 聚时科技(上海)有限公司 A kind of multiple view cluster machine learning method based on optimal neighbours' core

Similar Documents

Publication Publication Date Title
Cheng et al. Scene recognition with objectness
Masi et al. Learning pose-aware models for pose-invariant face recognition in the wild
CN109949317B (en) Semi-supervised image example segmentation method based on gradual confrontation learning
Gordo Supervised mid-level features for word image representation
CN102609681B (en) Face recognition method based on dictionary learning models
Li et al. Multilabel SVM active learning for image classification
WO2018023734A1 (en) Significance testing method for 3d image
CN109492099A (en) It is a kind of based on field to the cross-domain texts sensibility classification method of anti-adaptive
Li et al. Local correlation consistency for knowledge distillation
Shi et al. Semisupervised discriminative locally enhanced alignment for hyperspectral image classification
CN104809468A (en) Multi-view classification method based on indefinite kernels
CN111738355B (en) Image classification method and device with attention fused with mutual information and storage medium
Uricchio et al. Fisher encoded convolutional bag-of-windows for efficient image retrieval and social image tagging
CN110555060A (en) Transfer learning method based on paired sample matching
Zhou et al. Image retrieval based on effective feature extraction and diffusion process
CN105205501A (en) Multi-classifier combined weak annotation image object detection method
CN104778476A (en) Image classification method
Tian et al. Aligned dynamic-preserving embedding for zero-shot action recognition
Zhang et al. Automatic discrimination of text and non-text natural images
CN110993102A (en) Campus big data-based student behavior and psychological detection result accurate analysis method and system
CN111126464A (en) Image classification method based on unsupervised domain confrontation field adaptation
CN103942214B (en) Natural image classification method and device on basis of multi-modal matrix filling
Tu et al. Multiattribute sample learning for hyperspectral image classification using hierarchical peak attribute propagation
Cevikalp et al. Video based face recognition by using discriminatively learned convex models
Yu et al. Deep metric learning with dynamic margin hard sampling loss for face verification

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150729

RJ01 Rejection of invention patent application after publication