CN105550645A - Least-squares classification method on product Grassmann manifold - Google Patents
Least-squares classification method on product Grassmann manifold Download PDFInfo
- Publication number
- CN105550645A CN105550645A CN201510901535.0A CN201510901535A CN105550645A CN 105550645 A CN105550645 A CN 105550645A CN 201510901535 A CN201510901535 A CN 201510901535A CN 105550645 A CN105550645 A CN 105550645A
- Authority
- CN
- China
- Prior art keywords
- grassmann
- shape
- product
- flows
- sigma
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a least-squares classification method on a product Grassmann manifold. The method has a closed-form solution, and the accuracy rate of identification can be improved. The method comprises the following steps: (1) expressing a video in the form of product Grassmann manifold; (2) establishing a least-square model on the Grassmann manifold and solving the least-square model; and (3) performing least-squares classification, and outputting the classification result.
Description
Technical field
The invention belongs to the technical field of pattern-recognition, relate to the least squared classified method on a kind of product Grassmann stream shape particularly.
Background technology
In recent years, linear subspaces method has very important application in computer vision, as target identification, and recognition of face, human body tracking etc.Linear subspaces can reduce calculation cost and can better portray the inherent geometry of data itself.It is a kind of linear subspaces with nonlinear organization that Grassmann flows shape, and the point that video data is shown as on product Grassmann stream shape by high-order SVD breakdown is had significant effect in gesture identification.For identification problem, except the character representation that will find, the sorting technique of robust also plays vital effect to recognition correct rate.
Least square method is as simple effective method most in statistical study, and its research in stream shape space also obtains the concern of a lot of scholar.Lui gives the non-linear least square method on Grassmann stream shape by means of kernel function, and it is measured by geodesic line distance.But use WeightedKarcherMean algorithm in the method Solve problems, it is an iterative algorithm, and what obtain is an approximate solution.
Summary of the invention
Technology of the present invention is dealt with problems and is: overcome the deficiencies in the prior art, provides the least squared classified method on a kind of product Grassmann stream shape, and it has to close separates, and can improve the accuracy of identification.
Technical solution of the present invention is: this product Grassmann flows the least squared classified method on shape, and the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
The present invention is equidistantly embedded in Symmetric matrix by the point flowed by Grassmann on shape and carries out error metrics again, separates, can improve the accuracy of identification so have to close.
Accompanying drawing explanation
Fig. 1 shows method frame figure of the present invention.
Embodiment
As shown in Figure 1, this product Grassmann flows the least squared classified method on shape, and the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
The present invention is equidistantly embedded in Symmetric matrix by the point flowed by Grassmann on shape and carries out error metrics again, separates, can improve the accuracy of identification so have to close.
Preferably, in described step (1), videometer is shown as the form of tensor
wherein I
1, I
2, I
3represent the height of video, wide, length respectively; Change under each pattern can be obtained by high-order SVD,
, wherein
core tensor, V
(1), V
(2), V
(3)the factor matrix under each pattern respectively, and each V
(k)being tall and thin orthogonal matrix, is the point that Stiefel flows on shape, so span (V
(k)) be the point on Grassmann stream shape, (span (V
(1)), span (V
(2)), span (V
(3))) be the point on product Grassmann stream shape.
Preferably, least squared classified in described step (2)
The solution optimizing formula (1) is formula (3)
y
*=2(K(D)+K(D)
T)
-1K(X,D)(3)
Corresponding error is
Preferably, the middle sample (X, Y, Z) of described step (3) is defined as about the residual error of kth class
Wherein
be the solution of formula (1) on each submanifold respectively, final classification results is by k
*=argmin
kε
kdetermine.
Illustrate above method below.
1. the product Grassmann stream shape of video represents
Video can be expressed as tensor form as high dimensional data as
wherein I
1, I
2, I
3represent the height of video, wide, length respectively.Change under each pattern can be obtained by high-order SVD, namely
wherein
core tensor, V
(1), V
(2), V
(3)the factor matrix under each pattern respectively, and each V
(k)be tall and thin orthogonal matrix, therefore can regard the point on Stiefel stream shape as, so span (V
(k)) be the point on Grassmann stream shape.Therefore (span (V
(1)), span (V
(2)), span (V
(3))) be the point on product Grassmann stream shape.
2.Grassmann flows the embedded least square method on shape and solves
Least square technology is a most simple effective method in statistical study.In theorem in Euclid space, parameter
can by minimize residual error R (β)=|| y-A β ||
2obtain, wherein
for training set,
for regressand value.Estimated parameter have Explicit solutions shape as
now corresponding error is || y-A (A
ta)
-1a
ty||
2.
Order
for sample size is the training set of N, wherein
represent that Grassmann flows shape,
fitting parameter,
it is input amendment.Utilize projection mapping
The point flowed by Grassmann on shape is embedded into Symmetric matrix, and wherein Sym (d) represents Symmetric matrix.The distance of upper two point X and Y in such Grassmann space can define by the distance of embedded space, namely
And the geodesic line distance that this Distance geometry Grassmann flows the upper definition of shape is of equal value.Such distance definition is also convenient to follow-up solving.Being similar to the theorem in Euclid space principle of least square, providing embedded least square for solving following optimization problem,
Wherein y
jit is a jth element of vectorial y.
Introduce below and how to solve above-mentioned optimization problem.Have
Definition
Therefore model (1) becomes
min
y{y
TK(D)y-2y
TK(X,D)}(2)
To (2) about y differentiate, and make derivative equal 0, have
(K(D)+K(D)
T)y-2K(X,D)=0
So the solution of optimization problem (1) is
y
*=2(K(D)+K(D)
T)
-1K(X,D)(3)
Therefore corresponding error is
3. based on the principle of classification of embedded least square method
The 3 factorial Grassmann that corresponding video data is discussed below flow shape
wherein × represent cartesian product.Suppose that training set has M class, note kth class training set is
wherein N
kfor number of samples.Target infers test sample book
which kind of belongs to.
Sample (X, Y, Z) is defined as about the residual error of kth class
Wherein
the solution of each submanifold upper returning problem (1) respectively.Final classification results is determined by following formula
k
*=argmin
kε
k
Experimental verification is carried out to above-mentioned model, and has achieved obvious effect.In an experiment, select the gesture database of Cambridge University, it comprises 900 videos, totally 9 different gestures, and it is divided into 5 set according to different illumination conditions.Set 5 is general as training set, and 1-4 is as test set in set.In experiment, original series is converted into gray level image and adjusts size to 14 × 32 × 23.Current high-caliber 2 kinds of recognition methodss and the correct recognition rata of the inventive method respectively on four test sets is listed in table 1.The inventive method correct recognition rata, apparently higher than the discrimination of the PM method of Lui, is on close level with the best result kgLLC of people's methods such as Harandi.This description of test the inventive method is simply effective.
Table 1
The above; it is only preferred embodiment of the present invention; not any pro forma restriction is done to the present invention, every above embodiment is done according to technical spirit of the present invention any simple modification, equivalent variations and modification, all still belong to the protection domain of technical solution of the present invention.
Claims (4)
1. product Grassmann flows the least squared classified method on shape, it is characterized in that: the method comprises the following steps:
(1) carry out product Grassmann stream shape to video to represent;
(2) on Grassmann stream shape, set up least square model and solve;
(3) least squared classified is carried out, and output category result.
2. product Grassmann according to claim 1 flows the least squared classified method on shape, it is characterized in that: in described step (1), videometer is shown as the form of tensor
wherein I
1, I
2, I
3represent the height of video, wide, length respectively; Change under each pattern can be obtained by high-order SVD decomposition,
wherein
core tensor, V
(1), V
(2), V
(3)the factor matrix under each pattern respectively, and each V
(k)being tall and thin orthogonal matrix, is the point that Stiefel flows on shape, so span (V
(k)) be the point on Grassmann stream shape, (span (V
(1)), span (V
(2)), span (V
(3))) be the point on product Grassmann stream shape.
3. product Grassmann according to claim 2 flows the least squared classified method on shape, it is characterized in that: least squared classified in described step (2)
Wherein
input amendment,
represent that Grassmann flows shape,
for sample size is the training set of N,
fitting parameter, || ||
frepresent Frobenius norm.
The solution optimizing formula (1) is formula (3)
y
*=2(K(D)+K(D)
T)
-1K(X,D)(3)
Corresponding error is
Wherein
for optimum solution, and
4. product Grassmann according to claim 3 flows the least squared classified method on shape, it is characterized in that: sample in described step (3)
residual error about kth class is defined as
Wherein
be that 3 factorial Grassmann flow shape
on kth class training set, N
kfor number of samples
be the solution of formula (1) on each submanifold respectively, final classification results is by k
*=argmin
kε
kdetermine.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510901535.0A CN105550645B (en) | 2015-12-08 | 2015-12-08 | A kind of least squared classified method in product Grassmann manifold |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510901535.0A CN105550645B (en) | 2015-12-08 | 2015-12-08 | A kind of least squared classified method in product Grassmann manifold |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105550645A true CN105550645A (en) | 2016-05-04 |
CN105550645B CN105550645B (en) | 2019-02-12 |
Family
ID=55829828
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510901535.0A Active CN105550645B (en) | 2015-12-08 | 2015-12-08 | A kind of least squared classified method in product Grassmann manifold |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105550645B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108171279A (en) * | 2018-01-28 | 2018-06-15 | 北京工业大学 | A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video |
CN110135499A (en) * | 2019-05-16 | 2019-08-16 | 北京工业大学 | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050286663A1 (en) * | 2004-06-23 | 2005-12-29 | Intel Corporation | Compact feedback for closed loop MIMO systems |
CN105005757A (en) * | 2015-03-12 | 2015-10-28 | 电子科技大学 | Method for recognizing license plate characters based on Grassmann manifold |
-
2015
- 2015-12-08 CN CN201510901535.0A patent/CN105550645B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050286663A1 (en) * | 2004-06-23 | 2005-12-29 | Intel Corporation | Compact feedback for closed loop MIMO systems |
CN105005757A (en) * | 2015-03-12 | 2015-10-28 | 电子科技大学 | Method for recognizing license plate characters based on Grassmann manifold |
Non-Patent Citations (5)
Title |
---|
刘园园: "快速低秩矩阵与张量恢复的算法研究", 《中国博士学位论文全文数据库 信息科技辑》 * |
刘荣利: "基于统计学习的视觉目标跟踪算法研究", 《中国博士学位论文全文数据库 信息科技辑》 * |
李淑芳等: "Grassmann流形上半监督特征映射算法及其视频目标识别", 《重庆邮电大学学报(自然科学版)》 * |
王力: "基于流形的线性结构探测及目标识别方法研究", 《中国博士学位论文全文数据库 信息科技辑》 * |
黄淼等: "一种基于Grassmann流形的图像集分类算法研究", 《微型电脑应用》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108171279A (en) * | 2018-01-28 | 2018-06-15 | 北京工业大学 | A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video |
CN108171279B (en) * | 2018-01-28 | 2021-11-05 | 北京工业大学 | Multi-view video adaptive product Grassmann manifold subspace clustering method |
CN110135499A (en) * | 2019-05-16 | 2019-08-16 | 北京工业大学 | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph |
Also Published As
Publication number | Publication date |
---|---|
CN105550645B (en) | 2019-02-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103824050B (en) | A kind of face key independent positioning method returned based on cascade | |
CN106408030B (en) | SAR image classification method based on middle layer semantic attribute and convolutional neural networks | |
Sun et al. | Graph regularized and sparse nonnegative matrix factorization with hard constraints for data representation | |
JP7266674B2 (en) | Image classification model training method, image processing method and apparatus | |
Gu et al. | Robust image recognition by L1-norm twin-projection support vector machine | |
CN104517112A (en) | Table recognition method and system | |
CN105701504B (en) | Multi-modal manifold embedding grammar for zero sample learning | |
CN104834941A (en) | Offline handwriting recognition method of sparse autoencoder based on computer input | |
CN103745233B (en) | The hyperspectral image classification method migrated based on spatial information | |
CN103544486A (en) | Human age estimation method based on self-adaptation sign distribution | |
CN104008375A (en) | Integrated human face recognition mehtod based on feature fusion | |
CN104966052A (en) | Attributive characteristic representation-based group behavior identification method | |
Gao et al. | A novel face feature descriptor using adaptively weighted extended LBP pyramid | |
CN106971201A (en) | Multi-tag sorting technique based on integrated study | |
CN105930859B (en) | Radar Signal Sorting Method based on linear manifold cluster | |
Liao et al. | An oil painters recognition method based on cluster multiple kernel learning algorithm | |
Tan et al. | L1-norm latent SVM for compact features in object detection | |
CN105354532A (en) | Hand motion frame data based gesture identification method | |
CN107330363B (en) | Rapid internet billboard detection method | |
CN106557783B (en) | A kind of automatic extracting system and method for caricature dominant role | |
CN110111365B (en) | Training method and device based on deep learning and target tracking method and device | |
CN105550645A (en) | Least-squares classification method on product Grassmann manifold | |
CN111985532A (en) | Scene-level context-aware emotion recognition deep network method | |
CN102521623B (en) | Subspace-based incremental learning face recognition method | |
CN104573726B (en) | Facial image recognition method based on the quartering and each ingredient reconstructed error optimum combination |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |