CN110135499A - Clustering method based on the study of manifold spatially adaptive Neighborhood Graph - Google Patents
Clustering method based on the study of manifold spatially adaptive Neighborhood Graph Download PDFInfo
- Publication number
- CN110135499A CN110135499A CN201910410873.2A CN201910410873A CN110135499A CN 110135499 A CN110135499 A CN 110135499A CN 201910410873 A CN201910410873 A CN 201910410873A CN 110135499 A CN110135499 A CN 110135499A
- Authority
- CN
- China
- Prior art keywords
- data
- clustering
- manifold
- adaptive neighborhood
- adaptive
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Software Systems (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computing Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Medical Informatics (AREA)
- Image Analysis (AREA)
Abstract
A kind of clustering method that the adaptive neighborhood graphics based on manifold space is practised, there is the multidimensional data of complex nonlinear structure for image and video etc., traditional Spectral Clustering constrains the similitude between learning data using Laplce, the problem of be easy to causeing misinterpretation data global structure, it is established in Grassmann manifold based on adaptive neighborhood regularization is introduced from after the Unified frame of expression, thus to improve the clustering performance to video or image set multidimensional data.The present invention relates to manifold learning, the fields such as pattern-recognition and machine learning are significantly better than other classical clustering methods depending on Clustering Effect towards image and the more of video high dimensional data, effectively improve recognition capability.
Description
Technical field
The present invention relates to manifold learning, the fields such as pattern-recognition and machine learning, especially towards image, video multidimensional number
According to cluster task.
Background technique
With the fast development of internet and cloud computing technology, the multidimensional datas such as image, the video of magnanimity are by wide in real time
General propagation and storage, in addition to small part data have the personalized label of user, what is do not marked mostly is unordered
Figure or video sequence, manually mark its classification information inefficiency and need to pay high cost, cause " there are data, it is difficult
Using " predicament.Therefore, how big in face of these scale of constructions, variation multiplicity non-linear multidimensional data, carry out efficient earth's surface to it
It reaches, excavates the semantic informations information such as the classification being hidden in data, be the ultimate challenge that current the field of data mining faces, this
The manifold that the solution of problem depends on high dimensional data indicates and the breakthrough of clustering method.
The present age is based on the Spectral Clustering from expression, it is therefore an objective to which study accurately reflects the affine square of similitude between data
Battle array is generally used the European measurement of initial data when describing similitude, and is usually constrained using Laplce so that affine
The global structure of matrix holding initial data.However, there are two disadvantages: firstly, these methods be mainly designed for Europe it is several in
Vector data in moral space is not particularly suited for the multidimensional data with non-linearity manifold structure, such as video and image set.Its
Secondary, clustering performance is largely dependent upon the quality of the Laplacian Matrix learnt in advance, and wherein global structure may quilt
Misinterpretation is without considering various structures.
Summary of the invention
The high dimensional data problem is clustered to solve conventional method, the invention discloses kind of a popular world adaptive neighborhoods
The clustering method that graphics is practised.It is established in Grassmann manifold based on introducing adaptive neighborhood after the Unified frame of expression certainly just
Then change thus to improve the clustering performance to video or image set multidimensional data.Model in this method direct solution manifold is asked
Topic, in the optimal solution of Euclidean space search symmetrical matrix linear combination, to simplify learning process, significant effect is better than other
Classical clustering method.
Specific technical solution is as follows:
Image/video etc. is not suitable for complex nonlinear knot for the measure of this linear space of Euclidean distance
The problem of high dimensional data of structure, since these multidimensional datas can be embedded in low dimensional manifold, it is more to disclose these for we first
The non-linearity manifold structure of dimension data behind realizes abstract stream for based on Grassmann manifold is introduced from the learning method of expression
Then shape frame introduces adaptive neighborhood regularization on this basis to learn new neighborhood relationships from coefficient matrix.Finally will
The coefficient matrix and adaptive figure learnt executes spectral clustering (Ncut) operation, obtains more views cluster knot of video and image set
Fruit, to improve clustering precision and recognition capability.
It is to the present invention, especially to the explanation based on the adaptive neighborhood graphics learning method from expression below:
Directly learn the structural information (i.e. Laplacian Matrix) come self initial data for conventional method, we will be based on
Introduce Grassmann manifold from the learning method of expression, realize introduce after abstract manifold frame adaptive neighborhood regularization with from
Coefficient matrix learns new neighborhood relationships.
(1) by Euclidean space based on the study formula from expression:
It expands in manifold to establish in Grassmann manifold based on the study from expression:
Wherein one group of Grassmann point χ={ X1,X2,...XN},Xi∈g(p,d)。zijIndicate XiAnd XjBetween it is similar
Property, ziFor XiNew expression
(2) basis:The mapping distance of middle Grassmann manifold embodies mould
Abstract manifold distance in type.That is:
The Grassmann point of mapping has form identical with positive definite matrix, the abstract line in manifold in Sym (d)
Property combinatorial operation indicate are as follows:WhereinIndicate three ranks
Tensor, it includes all mapping symmetrical matrixes.Therefore, we can be rebuild in Grassmann manifold based on the study from expression
Model is as follows:
Reconstructed error ε is also 3 rank tensors.
(3) adaptive neighborhood regularizationIt is a kind of adaptive learning
The good strategy of figure, it reflects the neighborhood relationships between data, and the constant drawing learnt in advance rather than from initial data is general
Lars matrix.The present invention learns to indicate { z using this method1,...,zNNeighborhood relationships, rather than original Grassmann number
According to { X1,X2..., XN, a new adaptive neighborhood form is reconfigured to be write as:
Above-mentioned adaptive neighborhood regularization is substituted into method proposed by the present invention, obtains final objective function:
Here it is GMAN.
(4) point in some manifolds is given, such as the subspace of identical dimensionals some in Euclidean space, then clustering in manifold
Task refer to through the similitude between measurement sample, all given points (subspace) are clustered into some clusters.It is solving
(3) after, we can obtain the subspace representing matrix Z learnt from initial data and the adaptive neighborhood learnt from Z simultaneously
Figure A. matrix Z and A reflects the similitude between data, is used directly for executing K-means or NCut.In experiment respectively
(| Z |+| Z |TThe He of)/2Upper execution NCut is to obtain cluster result.
Beneficial effect
Face-image clustering face cluster is a basic problem of area of pattern recognition, experiments have shown that the present invention is not only
Good Clustering Effect can be shown to sets of video data to facial data set, and also, greatly improve recognition capability.
Detailed description of the invention
Fig. 1, the method for the present invention flow chart;
Fig. 2, Extended YaleB data set;
Fig. 3, CMUPIE data set;
Fig. 4, UCF sports video data collection;
Fig. 5, ballet action video;
Fig. 6, traffic highway sets of video data;
Specific embodiment
Face-image clustering-face cluster is a basic problem of area of pattern recognition.This is used in the present embodiment
Invention clusters on two image sets, is finally completed and clusters task to more views of image data, and in this experiment, we use
Two common face data collection, including Extended YaleB and CMUPIE, to construct set of face images data.Furthermore in order to
Illustrate that the present invention equally can handle video, therefore also select three groups of video datas, using the present invention for solving higher-dimension video
Data regard the problem of cluster more.Furthermore in order to illustrate effectiveness of the invention, comparative experiments also has been carried out with other clustering algorithms.
Process frame of the present invention is as shown in Figure 1:
We use two face image data collection (Fig. 2, Extended YaleB and Fig. 3, CMUPIE), two movement views
Frequency data set (Fig. 4, UC exercise data collection and Fig. 5, entrechat data set) and a traffic scene sets of video data construct
Input sample point (Fig. 6, traffic highway sets of video data).Operating process is as follows:
SVD decomposition is executed after image, video set data are extracted feature first, is then denoted as a series of
Grassmann manifold point form: { X1、X2、…、XN, as input data sample point;
Then it by being handled based on the adaptive neighborhood graphics learning method from expression input data, obtains optimization and asks
Subspace representing matrix Z and adaptive neighborhood figure A after solution, wherein the adaptive neighborhood graphics habit side based on expression certainly
Method model is as follows:
Wherein, the i-th row jth of matrix Z is classified as zij, it indicates data point XiAnd XjBetween similitude
LAIt is the Laplacian Matrix of figure A, adaptive neighborhood figure A equally reflects the similitude between data sample
For ‖ Z ‖l, the nuclear norm of matrix Z, F norm and l are taken in experiment respectively1Norm calculates separately result
Three rank tensor ε are the reconstructed errors during the expression of subspace, | | | |FIndicate F norm, the mark of tr representing matrix
Weighting factor λ1λ2λ3Value range is { 10-4, 10-3, 10-2, 10-1, 100, 101, 102, 103Three rank tensors×3Indicate the product of mould -3;
The subspace representing matrix Z and adaptive figure A finally obtained according to Optimization Solution, carries out spectral clustering to it respectively
Ncut operation, obtains subspace clustering result;
It is noted that the clustering method of comparative approach LRR, SSC, LS3C and LRSR based on vector data, therefore we pass through
The high-dimensional of these long vectors is reduced to low dimensional by PCA, and each video/image set vector is then turned to input.
In order to verify the superiority of GMAN, we are by itself and SCGSM, FGLRR, LapFGLRR, CascadedGLRR, LRR,
Several classical clustering methods of SSC, LS3C and LRSR are compared.Table 1 is that the input data of comparative approach indicates.
Table 1: the input data type of all comparative approach
Needle can be effectively improved in order to verify the adaptive neighborhood graphics learning method proposed by the invention based on from expression
To more view clustering performances of high dimensional data, this experiment carries out son on two face data collection Extended Yale B and CMUPI
Space clustering.Table 2 is the experimental result of the subspace clustering of the two face data collection compared with other clustering methods.Wherein
GMAN-Z and GMAN-A is respectively indicated to sub- space representation matrix Z and adaptive neighborhood figure A execution spectral clustering operation, GMAN-Nu,
GMAN-F, GMAN-1 are respectively indicated in method model to matrix Z using nuclear norm, F norm, the obtained more view clusters of 1 norm
As a result.
In order to assess clustering method from different aspect, we have selected five kinds of canonical measures, i.e. accuracy (ACC), standard
Mutual information (NMI), blue moral index (RI), purity (purity) and F index (FM), the high value of these indexs can reflect more preferably
Clustering performance.
Table 2: the subspace clustering result on face data collection
Can be seen that from the experimental result of table 2 it is proposed that method, especially GMAN-Nu can obtain satisfactory
Cluster result.Most of clustering methods based on Grassmann manifold better than based on vector data method (SSC, LRR and
LS3C), this shows that Grassmann manifold can extract more discriminant informations from image set data;It is important as one
Baseline, FGLRR cluster result ratio SCGSM is more preferable, demonstrates the advantage based on the learning method from expression;In addition, for GMAN-
The clustering performance of Nu, GMAN-F and GMAN-1, adaptive neighborhood figure A are better than coefficient matrix Z, and which reflects adaptive learning neighborhoods
Superiority of the relationship to image set data.
Therefore, we conclude that, it is proposed by the invention towards manifold based on from expression adaptive neighborhood graphics
Learning method can effectively solve more view clustering problems of high dimensional image, promote facial recognition capability.
In order to which the method for further proving us can also show good clustering performance to higher-dimension video data, selection
Two corresponding action video data sets: UCF exercise data collection and entrechat data set and a traffic scene video data
Collection, in these three sets of video data use it is proposed that based on from expression adaptive neighborhood graphics learning method it is carried out
Cluster operation.Table 3 and table 4 show corresponding cluster results.
Table 3: the subspace clustering result on action video data set
Table 4: the subspace clustering result on traffic video data set
Change from can be seen that more than table 3UCF and entrechat sets of video data depending on cluster result in extensive and class
Challenge, it is proposed that GMAN-Nu and GMANF still maintain relatively excellent clustering performance, be more than the side of all comparisons
Method.And GMAN-Nu-Z have it is more preferable than baseline FGLRR effect than GMAN-Nu-A better performance and GMAN-Nu-Z, this
Mean that adaptive neighborhood figure A has positive-effect to coefficient matrix Z in optimization process.Likewise, 4 traffic scene video counts of table
Shown according to the experimental result of collection, it is proposed that all methods all than other comparative approach have higher clustering precision, especially
It is that accuracy improves 95% or more.
To which we from which further follow that conclusion, the cluster proposed by the present invention based on the study of manifold spatially adaptive Neighborhood Graph
Method can efficiently accomplish the cluster task of the high dimensional data image and video for complex nonlinear structure, have good
Application prospect.
Claims (1)
1. the high dimensional data figure based on the clustering method of manifold spatially adaptive Neighborhood Graph study, suitable for complex nonlinear structure
Picture or video, it is characterised in that the following steps are included:
(1) it constructs sample number and is the image data set of N, and be denoted as a series of Grassmann manifold point { X1、X2、…、
XN, as input data sample point;
(2) by being handled based on the adaptive neighborhood graphics learning method from expression input data, after obtaining Optimization Solution
Subspace representing matrix Z and adaptive neighborhood figure A, wherein described based on the adaptive neighborhood graphics learning method mould from expression
Type is as follows:
Wherein, the i-th row jth of matrix Z is classified as zij, it indicates data point XiAnd XjBetween similitude, for ‖ Z ‖lSquare can be taken
The nuclear norm of battle array Z, F norm and l1Norm calculate separately as a result, three rank tensor ε be subspace expression during reconstructed error, |
|·||FIndicate F norm, the mark of tr representing matrix, LAIt is the Laplacian Matrix of figure A, adaptive neighborhood figure A equally reflects data
Similitude between sample, weighting factor λ1、λ2λ3, value range be { 10-4, 10-3, 10-2, 10-1, 100, 101, 102, 103, three
Rank tensor×3Indicate the product of mould -3;
(3) the subspace representing matrix Z and adaptive figure A obtained according to Optimization Solution, carries out spectral clustering Ncut behaviour to it respectively
Make, obtains subspace clustering result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910410873.2A CN110135499A (en) | 2019-05-16 | 2019-05-16 | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910410873.2A CN110135499A (en) | 2019-05-16 | 2019-05-16 | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110135499A true CN110135499A (en) | 2019-08-16 |
Family
ID=67574763
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910410873.2A Pending CN110135499A (en) | 2019-05-16 | 2019-05-16 | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110135499A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111046745A (en) * | 2019-11-20 | 2020-04-21 | 北京工业大学 | Multi-view clustering method based on soft block diagonal |
CN111461257A (en) * | 2020-04-24 | 2020-07-28 | 北京工业大学 | Sharing-difference representation and clustering method for multi-view video in manifold space |
CN112801142A (en) * | 2021-01-08 | 2021-05-14 | 北京工业大学 | Tensor autoregressive moving average model-based video classification method |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279556A (en) * | 2013-06-09 | 2013-09-04 | 南方报业传媒集团 | Iteration text clustering method based on self-adaptation subspace study |
US20150104070A1 (en) * | 2013-10-14 | 2015-04-16 | Digitalglobe, Inc. | Detecting and identifying parking lots in remotely-sensed images |
CN105303196A (en) * | 2015-10-27 | 2016-02-03 | 沈阳大学 | Data clustering analysis method on the basis of Grassmann manifold |
CN105550645A (en) * | 2015-12-08 | 2016-05-04 | 北京工业大学 | Least-squares classification method on product Grassmann manifold |
CN108171279A (en) * | 2018-01-28 | 2018-06-15 | 北京工业大学 | A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video |
-
2019
- 2019-05-16 CN CN201910410873.2A patent/CN110135499A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279556A (en) * | 2013-06-09 | 2013-09-04 | 南方报业传媒集团 | Iteration text clustering method based on self-adaptation subspace study |
US20150104070A1 (en) * | 2013-10-14 | 2015-04-16 | Digitalglobe, Inc. | Detecting and identifying parking lots in remotely-sensed images |
CN105303196A (en) * | 2015-10-27 | 2016-02-03 | 沈阳大学 | Data clustering analysis method on the basis of Grassmann manifold |
CN105550645A (en) * | 2015-12-08 | 2016-05-04 | 北京工业大学 | Least-squares classification method on product Grassmann manifold |
CN108171279A (en) * | 2018-01-28 | 2018-06-15 | 北京工业大学 | A kind of adaptive product Grassmann manifold Subspace clustering methods of multi-angle video |
Non-Patent Citations (1)
Title |
---|
蒲玲: "自适应邻域图的流形学习方法", 《计算机应用与软件》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111046745A (en) * | 2019-11-20 | 2020-04-21 | 北京工业大学 | Multi-view clustering method based on soft block diagonal |
CN111046745B (en) * | 2019-11-20 | 2023-08-04 | 北京工业大学 | Multi-view clustering method based on soft block diagonal angle |
CN111461257A (en) * | 2020-04-24 | 2020-07-28 | 北京工业大学 | Sharing-difference representation and clustering method for multi-view video in manifold space |
CN111461257B (en) * | 2020-04-24 | 2024-05-28 | 北京工业大学 | Sharing-difference representation and clustering method for multi-view video in manifold space |
CN112801142A (en) * | 2021-01-08 | 2021-05-14 | 北京工业大学 | Tensor autoregressive moving average model-based video classification method |
CN112801142B (en) * | 2021-01-08 | 2024-05-28 | 北京工业大学 | Video classification method based on tensor autoregressive moving average model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhu et al. | Low-rank sparse subspace for spectral clustering | |
Wang et al. | Beyond low-rank representations: Orthogonal clustering basis reconstruction with optimized graph structure for multi-view spectral clustering | |
Xie et al. | Hyper-Laplacian regularized multilinear multiview self-representations for clustering and semisupervised learning | |
Kang et al. | Clustering with similarity preserving | |
Shi et al. | Auto-weighted multi-view clustering via spectral embedding | |
CN110135499A (en) | Clustering method based on the study of manifold spatially adaptive Neighborhood Graph | |
CN105760821A (en) | Classification and aggregation sparse representation face identification method based on nuclear space | |
Zhou et al. | Image retrieval based on effective feature extraction and diffusion process | |
Qi et al. | Personalized sketch-based image retrieval by convolutional neural network and deep transfer learning | |
Chen et al. | Adaptive projected matrix factorization method for data clustering | |
CN105046323B (en) | Regularization-based RBF network multi-label classification method | |
Liu et al. | Adaptive low-rank kernel block diagonal representation subspace clustering | |
Xu et al. | Discriminative analysis for symmetric positive definite matrices on lie groups | |
Tan et al. | High-order fuzzy clustering algorithm based on multikernel mean shift | |
CN110263855A (en) | A method of it is projected using cobasis capsule and carries out image classification | |
CN117173702A (en) | Multi-view multi-mark learning method based on depth feature map fusion | |
Xu et al. | A domain gap aware generative adversarial network for multi-domain image translation | |
Wang et al. | Multi-manifold clustering | |
WO2020252746A1 (en) | Method for performing image classification by using common-base capsule projection | |
Martineau et al. | Effective training of convolutional neural networks for insect image recognition | |
CN111414863B (en) | Enhanced integrated remote sensing image classification method | |
Puthenputhussery et al. | Sparse representation based complete kernel marginal fisher analysis framework for computational art painting categorization | |
Yang et al. | Robust landmark graph-based clustering for high-dimensional data | |
Deng et al. | Biological modeling of human visual system for object recognition using GLoP filters and sparse coding on multi-manifolds | |
CN113139556B (en) | Manifold multi-view image clustering method and system based on self-adaptive composition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190816 |
|
WD01 | Invention patent application deemed withdrawn after publication |