CN105550645A - Least-squares classification method on product Grassmann manifold - Google Patents

Least-squares classification method on product Grassmann manifold Download PDF

Info

Publication number
CN105550645A
CN105550645A CN201510901535.0A CN201510901535A CN105550645A CN 105550645 A CN105550645 A CN 105550645A CN 201510901535 A CN201510901535 A CN 201510901535A CN 105550645 A CN105550645 A CN 105550645A
Authority
CN
China
Prior art keywords
grassmann
shape
product
flows
sigma
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510901535.0A
Other languages
Chinese (zh)
Other versions
CN105550645B (en
Inventor
尹宝才
王玉萍
王立春
孔德慧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN201510901535.0A priority Critical patent/CN105550645B/en
Publication of CN105550645A publication Critical patent/CN105550645A/en
Application granted granted Critical
Publication of CN105550645B publication Critical patent/CN105550645B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a least-squares classification method on a product Grassmann manifold. The method has a closed-form solution, and the accuracy rate of identification can be improved. The method comprises the following steps: (1) expressing a video in the form of product Grassmann manifold; (2) establishing a least-square model on the Grassmann manifold and solving the least-square model; and (3) performing least-squares classification, and outputting the classification result.

Description

A kind of product Grassmann flows the least squared classified method on shape
Technical field
The invention belongs to the technical field of pattern-recognition, relate to the least squared classified method on a kind of product Grassmann stream shape particularly.
Background technology
In recent years, linear subspaces method has very important application in computer vision, as target identification, and recognition of face, human body tracking etc.Linear subspaces can reduce calculation cost and can better portray the inherent geometry of data itself.It is a kind of linear subspaces with nonlinear organization that Grassmann flows shape, and the point that video data is shown as on product Grassmann stream shape by high-order SVD breakdown is had significant effect in gesture identification.For identification problem, except the character representation that will find, the sorting technique of robust also plays vital effect to recognition correct rate.
Least square method is as simple effective method most in statistical study, and its research in stream shape space also obtains the concern of a lot of scholar.Lui gives the non-linear least square method on Grassmann stream shape by means of kernel function, and it is measured by geodesic line distance.But use WeightedKarcherMean algorithm in the method Solve problems, it is an iterative algorithm, and what obtain is an approximate solution.
Summary of the invention
Technology of the present invention is dealt with problems and is: overcome the deficiencies in the prior art, provides the least squared classified method on a kind of product Grassmann stream shape, and it has to close separates, and can improve the accuracy of identification.
Technical solution of the present invention is: this product Grassmann flows the least squared classified method on shape, and the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
The present invention is equidistantly embedded in Symmetric matrix by the point flowed by Grassmann on shape and carries out error metrics again, separates, can improve the accuracy of identification so have to close.
Accompanying drawing explanation
Fig. 1 shows method frame figure of the present invention.
Embodiment
As shown in Figure 1, this product Grassmann flows the least squared classified method on shape, and the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
The present invention is equidistantly embedded in Symmetric matrix by the point flowed by Grassmann on shape and carries out error metrics again, separates, can improve the accuracy of identification so have to close.
Preferably, in described step (1), videometer is shown as the form of tensor wherein I 1, I 2, I 3represent the height of video, wide, length respectively; Change under each pattern can be obtained by high-order SVD, , wherein core tensor, V (1), V (2), V (3)the factor matrix under each pattern respectively, and each V (k)being tall and thin orthogonal matrix, is the point that Stiefel flows on shape, so span (V (k)) be the point on Grassmann stream shape, (span (V (1)), span (V (2)), span (V (3))) be the point on product Grassmann stream shape.
Preferably, least squared classified in described step (2)
R ( y ) = Δ min y | | XX T - Σ j = 1 N D j D j T y j | | F 2 - - - ( 1 )
The solution optimizing formula (1) is formula (3)
y *=2(K(D)+K(D) T) -1K(X,D)(3)
Corresponding error is | | XX T - Σ j = 1 N D j D j T y j * | | F .
Preferably, the middle sample (X, Y, Z) of described step (3) is defined as about the residual error of kth class
ϵ k = | | XX T - Σ j = 1 N k U j k U j k T u j k * | | F + | | YY T - Σ j = 1 N k V j k V j k T v j k * | | F + | | ZZ T - Σ j = 1 N k W j k W j k T w j k * | | F
Wherein be the solution of formula (1) on each submanifold respectively, final classification results is by k *=argmin kε kdetermine.
Illustrate above method below.
1. the product Grassmann stream shape of video represents
Video can be expressed as tensor form as high dimensional data as wherein I 1, I 2, I 3represent the height of video, wide, length respectively.Change under each pattern can be obtained by high-order SVD, namely wherein core tensor, V (1), V (2), V (3)the factor matrix under each pattern respectively, and each V (k)be tall and thin orthogonal matrix, therefore can regard the point on Stiefel stream shape as, so span (V (k)) be the point on Grassmann stream shape.Therefore (span (V (1)), span (V (2)), span (V (3))) be the point on product Grassmann stream shape.
2.Grassmann flows the embedded least square method on shape and solves
Least square technology is a most simple effective method in statistical study.In theorem in Euclid space, parameter can by minimize residual error R (β)=|| y-A β || 2obtain, wherein for training set, for regressand value.Estimated parameter have Explicit solutions shape as now corresponding error is || y-A (A ta) -1a ty|| 2.
Order for sample size is the training set of N, wherein represent that Grassmann flows shape, fitting parameter, it is input amendment.Utilize projection mapping
The point flowed by Grassmann on shape is embedded into Symmetric matrix, and wherein Sym (d) represents Symmetric matrix.The distance of upper two point X and Y in such Grassmann space can define by the distance of embedded space, namely d ( X , Y ) = 2 2 | | - XX T - YY T | | F , And the geodesic line distance that this Distance geometry Grassmann flows the upper definition of shape is of equal value.Such distance definition is also convenient to follow-up solving.Being similar to the theorem in Euclid space principle of least square, providing embedded least square for solving following optimization problem,
R ( y ) = Δ min y | | XX T - Σ j = 1 N D j D j T y j | | F 2 - - - ( 1 )
Wherein y jit is a jth element of vectorial y.
Introduce below and how to solve above-mentioned optimization problem.Have
| | XX T - Σ j = 1 N D j D j T y j | | F 2 = T r ( X T XX T X ) + Σ j , i N y i y j T r ( D i T D j D j T D i ) - 2 Σ j N y j T r ( D j T XX T D j )
Definition
[ K ( D ) ] i , j = T r ( D i T D j D j T D i ) = | | D i T D j | | F 2 ,
[ K ( X , D ) ] i = T r ( D j T XX T D j ) = | | X T D j | | F 2
Therefore model (1) becomes
min y{y TK(D)y-2y TK(X,D)}(2)
To (2) about y differentiate, and make derivative equal 0, have
(K(D)+K(D) T)y-2K(X,D)=0
So the solution of optimization problem (1) is
y *=2(K(D)+K(D) T) -1K(X,D)(3)
Therefore corresponding error is | | XX T - Σ j = 1 N D j D j T y j * | | F .
3. based on the principle of classification of embedded least square method
The 3 factorial Grassmann that corresponding video data is discussed below flow shape wherein × represent cartesian product.Suppose that training set has M class, note kth class training set is wherein N kfor number of samples.Target infers test sample book which kind of belongs to.
Sample (X, Y, Z) is defined as about the residual error of kth class
ϵ k = | | XX T - Σ j = 1 N k U j k U j k T u j k * | | F + | | YY T - Σ j = 1 N k V j k V j k T v j k * | | F + | | ZZ T - Σ j = 1 N k W j k W j k T w j k * | | F
Wherein the solution of each submanifold upper returning problem (1) respectively.Final classification results is determined by following formula
k *=argmin kε k
Experimental verification is carried out to above-mentioned model, and has achieved obvious effect.In an experiment, select the gesture database of Cambridge University, it comprises 900 videos, totally 9 different gestures, and it is divided into 5 set according to different illumination conditions.Set 5 is general as training set, and 1-4 is as test set in set.In experiment, original series is converted into gray level image and adjusts size to 14 × 32 × 23.Current high-caliber 2 kinds of recognition methodss and the correct recognition rata of the inventive method respectively on four test sets is listed in table 1.The inventive method correct recognition rata, apparently higher than the discrimination of the PM method of Lui, is on close level with the best result kgLLC of people's methods such as Harandi.This description of test the inventive method is simply effective.
Table 1
The above; it is only preferred embodiment of the present invention; not any pro forma restriction is done to the present invention, every above embodiment is done according to technical spirit of the present invention any simple modification, equivalent variations and modification, all still belong to the protection domain of technical solution of the present invention.

Claims (4)

1. product Grassmann flows the least squared classified method on shape, it is characterized in that: the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
2. product Grassmann according to claim 1 flows the least squared classified method on shape, it is characterized in that: in described step (1), videometer is shown as the form of tensor wherein I 1, I 2, I 3represent the height of video, wide, length respectively; Change under each pattern can be obtained by high-order SVD decomposition, wherein core tensor, V (1), V (2), V (3)the factor matrix under each pattern respectively, and each V (k)being tall and thin orthogonal matrix, is the point that Stiefel flows on shape, so span (V (k)) be the point on Grassmann stream shape, (span (V (1)), span (V (2)), span (V (3))) be the point on product Grassmann stream shape.
3. product Grassmann according to claim 2 flows the least squared classified method on shape, it is characterized in that: least squared classified in described step (2)
R ( y ) = Δ min y | | XX T - Σ j = 1 N D j D j T y j | | F 2 - - - ( 1 )
Wherein input amendment, represent that Grassmann flows shape, for sample size is the training set of N, fitting parameter, || || frepresent Frobenius norm.
The solution optimizing formula (1) is formula (3)
y *=2(K(D)+K(D) T) -1K(X,D)(3)
Corresponding error is | | XX T - Σ j = 1 N D j D j T y j * | | F .
Wherein for optimum solution, and
[ K ( D ) ] i , j = T r ( D i T D j D j T D i ) = | | D i T D j | | F 2 ,
[ K ( X , D ) ] i = T r ( D j T XX T D j ) = | | X T D j | | F 2
4. product Grassmann according to claim 3 flows the least squared classified method on shape, it is characterized in that: sample in described step (3) residual error about kth class is defined as
ϵ k = | | XX T - Σ j = 1 N k U j k U j k T u j k * | | F + | | YY T - Σ j = 1 N k V j k V j k T v j k * | | F + | | ZZ T - Σ j = 1 N k W j k W j k T w j k * | | F
Wherein be that 3 factorial Grassmann flow shape on kth class training set, N kfor number of samples be the solution of formula (1) on each submanifold respectively, final classification results is by k *=argmin kε kdetermine.
CN201510901535.0A 2015-12-08 2015-12-08 A kind of least squared classified method in product Grassmann manifold Active CN105550645B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510901535.0A CN105550645B (en) 2015-12-08 2015-12-08 A kind of least squared classified method in product Grassmann manifold

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510901535.0A CN105550645B (en) 2015-12-08 2015-12-08 A kind of least squared classified method in product Grassmann manifold

Publications (2)

Publication Number Publication Date
CN105550645A true CN105550645A (en) 2016-05-04
CN105550645B CN105550645B (en) 2019-02-12

Family

ID=55829828

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510901535.0A Active CN105550645B (en) 2015-12-08 2015-12-08 A kind of least squared classified method in product Grassmann manifold

Country Status (1)

Country Link
CN (1) CN105550645B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108171279A (en) * 2018-01-28 2018-06-15 北京工业大学 A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video
CN110135499A (en) * 2019-05-16 2019-08-16 北京工业大学 Clustering method based on the study of manifold spatially adaptive Neighborhood Graph

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050286663A1 (en) * 2004-06-23 2005-12-29 Intel Corporation Compact feedback for closed loop MIMO systems
CN105005757A (en) * 2015-03-12 2015-10-28 电子科技大学 Method for recognizing license plate characters based on Grassmann manifold

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050286663A1 (en) * 2004-06-23 2005-12-29 Intel Corporation Compact feedback for closed loop MIMO systems
CN105005757A (en) * 2015-03-12 2015-10-28 电子科技大学 Method for recognizing license plate characters based on Grassmann manifold

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
刘园园: "快速低秩矩阵与张量恢复的算法研究", 《中国博士学位论文全文数据库 信息科技辑》 *
刘荣利: "基于统计学习的视觉目标跟踪算法研究", 《中国博士学位论文全文数据库 信息科技辑》 *
李淑芳等: "Grassmann流形上半监督特征映射算法及其视频目标识别", 《重庆邮电大学学报(自然科学版)》 *
王力: "基于流形的线性结构探测及目标识别方法研究", 《中国博士学位论文全文数据库 信息科技辑》 *
黄淼等: "一种基于Grassmann流形的图像集分类算法研究", 《微型电脑应用》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108171279A (en) * 2018-01-28 2018-06-15 北京工业大学 A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video
CN108171279B (en) * 2018-01-28 2021-11-05 北京工业大学 Multi-view video adaptive product Grassmann manifold subspace clustering method
CN110135499A (en) * 2019-05-16 2019-08-16 北京工业大学 Clustering method based on the study of manifold spatially adaptive Neighborhood Graph

Also Published As

Publication number Publication date
CN105550645B (en) 2019-02-12

Similar Documents

Publication Publication Date Title
CN103824050B (en) A kind of face key independent positioning method returned based on cascade
CN106408030B (en) SAR image classification method based on middle layer semantic attribute and convolutional neural networks
Sun et al. Graph regularized and sparse nonnegative matrix factorization with hard constraints for data representation
JP7266674B2 (en) Image classification model training method, image processing method and apparatus
Gu et al. Robust image recognition by L1-norm twin-projection support vector machine
CN104517112A (en) Table recognition method and system
CN105701504B (en) Multi-modal manifold embedding grammar for zero sample learning
CN104834941A (en) Offline handwriting recognition method of sparse autoencoder based on computer input
CN103745233B (en) The hyperspectral image classification method migrated based on spatial information
CN103544486A (en) Human age estimation method based on self-adaptation sign distribution
CN104008375A (en) Integrated human face recognition mehtod based on feature fusion
CN104966052A (en) Attributive characteristic representation-based group behavior identification method
Gao et al. A novel face feature descriptor using adaptively weighted extended LBP pyramid
CN106971201A (en) Multi-tag sorting technique based on integrated study
CN105930859B (en) Radar Signal Sorting Method based on linear manifold cluster
Liao et al. An oil painters recognition method based on cluster multiple kernel learning algorithm
Tan et al. L1-norm latent SVM for compact features in object detection
CN105354532A (en) Hand motion frame data based gesture identification method
CN107330363B (en) Rapid internet billboard detection method
CN106557783B (en) A kind of automatic extracting system and method for caricature dominant role
CN110111365B (en) Training method and device based on deep learning and target tracking method and device
CN105550645A (en) Least-squares classification method on product Grassmann manifold
CN111985532A (en) Scene-level context-aware emotion recognition deep network method
CN102521623B (en) Subspace-based incremental learning face recognition method
CN104573726B (en) Facial image recognition method based on the quartering and each ingredient reconstructed error optimum combination

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant