CN112766412A

CN112766412A - Multi-view clustering method based on self-adaptive sparse graph learning

Info

Publication number: CN112766412A
Application number: CN202110158287.0A
Authority: CN
Inventors: 肖庆江; 黄奕轩; 杜世强; 石玉清; 单广荣
Original assignee: Northwest Minzu University
Current assignee: Northwest Minzu University
Priority date: 2021-02-05
Filing date: 2021-02-05
Publication date: 2021-05-07
Anticipated expiration: 2041-02-05
Also published as: CN112766412B

Abstract

The invention discloses a multi-view clustering method based on self-adaptive sparse graph learning, which comprises the following steps of: firstly, aiming at a data matrix of each view, obtaining a similar matrix of each view through adaptive neighbor learning; then, automatically weighting the similar matrix of each view, and learning by using sparse constraint to obtain a shared similar matrix with a sparse structure; and finally, optimizing the objective function by using an efficient iterative update algorithm based on a multiplier alternating direction method (ADMM), and performing standard spectral clustering on the shared similar matrix to obtain a final clustering result. The method improves the quality of each view similarity graph, and simultaneously enhances the robustness to noise and abnormal values. The calculation complexity of the method is approximately equal to that of a spectral clustering method based on a single view, so that the model calculation speed is high, and the framework is simple and easy to realize.

Description

Multi-view clustering method based on self-adaptive sparse graph learning

Technical Field

The invention belongs to the technical field of data analysis, and particularly relates to a multi-view clustering method based on self-adaptive sparse graph learning.

Background

Clustering is a data analysis method commonly used in the fields of machine learning, pattern recognition, data mining, artificial intelligence, etc., and aims to divide a data set into a plurality of subsets consisting of similar objects according to the characteristics of data. With the rapid development of internet technology and sensor technology, the description of actually acquired data has evolved from a single view in the past to a ubiquitous multi-view description, which can provide more sufficient information for data analysis tasks, and is semantically richer, more useful, but more complex. Numerous studies have shown that multi-view learning is more efficient, robust and has better generalization capabilities than single-view learning. The multi-view learning approach aims at simulating each view to learn one function and improving generalization performance by jointly optimizing all functions, thereby effectively fusing information from different views.

The purpose of multi-view clustering is to group data points into a certain number of patterns by using compatible and complementary information of multi-view data, and existing algorithms mainly include a graph-based method, a matrix decomposition method, a multi-kernel learning method and a subspace learning method. Among them, the graph-based method is receiving wide attention due to its simplicity and high efficiency, and can be subdivided into a multi-view spectral clustering method, which generally uses a graph constructed by KNN, and a multi-view subspace clustering method, which generally uses a graph constructed by a self-representation model, such as sparse representation and low rank representation.

Most of the multi-view clustering methods are based on graph models to fuse multi-view information, and earlier methods often focused on fusing two-view information and are not suitable for three or more views. Researchers have proposed a sparse graph learning-based multi-view spectral clustering (S-MVSC) method, which aims to learn a shared similarity matrix with a sparse structure from multiple views, but neglects the quality difference between different similarity matrices. Another researcher has proposed a self-weighted multi-view clustering (SwMC) method with multiple graphs to study laplacian rank constrained graphs, but it requires singular value decomposition in each iteration, resulting in a very time-consuming process.

Disclosure of Invention

Aiming at the defects pointed out in the background technology, the invention provides a multi-view clustering method based on self-adaptive sparse graph learning, and aims to solve the problems in the prior art in the background technology.

In order to achieve the purpose, the invention adopts the technical scheme that:

a multi-view clustering method based on adaptive sparse graph learning comprises the following steps:

(1) obtaining the ith data point x of the similarity matrix of each view to the vth view through the adaptive neighbor learning of the data matrix_iAll data points of the v-th view

Will be with probability

As x_iIs close to x_iConnected, therefore, for all data points of the v-th view, the probability is determined by solving the following problem

From S^vMiddle school

There are k nonzero values, k is a neighbor parameter, and the result obtained after solving is as follows:

wherein the content of the first and second substances,

adaptively learning the similarity of each view from the data according to the solution resultA matrix;

(2) shared similarity matrix learning

For multi-view data, first p similar matrices S are constructed⁽¹⁾,S⁽²⁾,...,S^(p)In which S is^(v)∈R^nxn(v is more than or equal to 1 and less than or equal to p), introducing parameters lambda and gamma, and proposing the following models:

s.t.α^(v)≥0,α^T1_p＝1,

wherein α ═ α⁽¹⁾,α⁽²⁾,...,α^(p)]λ > 0, γ > 0; the solution of the above model translates into solving the following relaxation problem:

s.t.α^(v)≥0,α^T1_p＝1,

(3) optimization algorithm

And solving the relaxation problem by using a multiplier alternating direction method ADMM, respectively and alternately updating alpha and S through fixed variables, and learning a consensus similar matrix with a sparse structure as the input of standard spectral clustering to obtain a clustering result.

Preferably, in step (1), the solution is performed

Since only the nearest data point will be at probability 1

And all other data points cannot be neighbors of

Is close to, so is converted into a solutionThe following problems are solved:

since the optimal solution is that all data points of the vth view will be x with the same probability of 1/n_iSo further to solve the following problem:

solve the final result to

Preferably, in step (3), when updating S, α is fixed, and the relaxation problem is shifted to solve the following problem:

for the

The objective function of (1) is:

therefore, it is

Is equivalent toThe following formula:

definition of

Further simplified to

Introducing a soft threshold shrinkage operator:

where μ > 0, the similarity matrix to obtain the updated S is:

preferably, in step (3), when updating α, S is fixed, and the relaxation problem is shifted to solve the following problem:

s.t.α^(v)≥0,α^T1_p＝1；

definition of

Then the equivalence is to solve the following problem:

s.t.α^(v)≥0,α^T1_p＝1；

further conversion was to the following form:

the solution is then solved by the multiplier alternating direction method ADMM.

Compared with the defects and shortcomings of the prior art, the invention has the following beneficial effects:

(1) the invention constructs a similar matrix for each view by using an adaptive neighbor learning method as input, thereby improving the quality of the similar graph constructed for each view.

(2) The multi-view clustering (ASGL) model based on the self-adaptive sparse graph learning automatically weights each view, learns a shared similar matrix with a sparse structure from the views as the input of standard spectral clustering, considers the quality difference among different views, and learns that the shared similar matrix is sparse, so that the noise generated by different views can be effectively eliminated, and the robustness to the noise and abnormal values is improved.

(3) The ASGL model is fast and easy to implement, and the computational complexity of the ASGL is approximately equal to that of single-view atlas clustering under the condition that the time consumption for constructing similar matrixes for all views and iteratively solving the optimal shared similar matrix is not considered.

(4) The ASGL model is optimized through an efficient iterative updating algorithm based on a multiplier alternating direction method (ADMM), and compared with several latest algorithms, the ASGL model has the advantage that the effectiveness of the ASGL method is verified through numerical experiments on six data sets.

Drawings

Fig. 1 is a flow framework diagram of a multi-view clustering method based on adaptive sparse graph learning according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

1. Adaptive neighbor graph learning

Ith data point x for the v view_iStation of the v-th viewHas data points

Will be with probability

As x_iIs close to x_iAnd (4) connecting. Usually, a small distance

A large probability should be assigned

Thus, for all data points of the v-th view, a probability is determined

The natural approach of (a) is to solve the following problem:

equation (1) has a simple solution, only the nearest data point will be x with probability 1_i ^vAnd all other data points cannot be neighbors of xiv, in other words, without considering the distance information in some data, will translate to solving the following problem:

since the optimal solution is that all data points of the vth view will be x with the same probability of 1/n_iIn conjunction with equations (1) and (2), further translates to solving the following problem:

from S^vMiddle school

There are k nonzero values, k is a neighbor parameter, and the final result of the solution is as follows:

wherein the content of the first and second substances,

the similarity matrix for each view can be learned adaptively from the data by equation (4).

2. Shared similarity matrix learning

Based on the fact that the performance of spectral clustering depends on the quality of the similarity matrix to a great extent, the invention researches a method for learning the shared similarity matrix from a plurality of input data views, and simultaneously considers the quality difference between different similarity matrices. For multi-view data, first p similar matrices S are constructed⁽¹⁾,S⁽²⁾,...,S^(p)In which S is^(v)∈R^nxn(v is more than or equal to 1 and less than or equal to p), introducing parameters lambda and gamma, and proposing the following models:

wherein α ═ α⁽¹⁾,α⁽²⁾,...,α^(p)]λ > 0, γ > 0; since the solution of equation (5) involves minimizing l₀Norm, so the solution of the above model translates to solving the following relaxation problem:

3. Optimization algorithm

And (3) using an efficient iterative updating algorithm based on a multiplier alternating direction method (ADMM), respectively and alternately updating alpha and S through fixed variables, and learning a shared similar matrix to obtain a clustering result.

(1) Fix α, update S, switch to solving the following problem:

for the objective function of equation (7), there is:

the formula (8) is equivalent to the following formula:

to simplify the calculation amount, the rewrite equation (9) is as follows:

wherein the content of the first and second substances,

introducing a soft threshold shrinkage operator:

where μ > 0, a similar solution is obtained for formula (10):

(2) fix S, update α, shift to solve the following problem:

definition of

Then the equivalence is to solve the following problem:

the rewrite equation (14) is in the form:

equation (15) can be solved by an efficient iterative algorithm based on the multiplier alternating direction method ADMM.

By analysis, it can be easily found that the solution of α is related to S, and the solution of S is also related to α, so the original equation (6) is solved by alternately and iteratively optimizing S and α.

The whole process of solving equation (6) is shown in algorithm 1:

the flow frame diagram of the multi-view clustering method (ASGL) based on the adaptive sparse graph learning is shown in FIG. 1.

In order to verify the correctness of the clustering result, the clustering experiments are respectively carried out on a 3-source text data set, a COIL20 toy data set, an MSRC image data set, a NUS image data set, an ORL face data set and an Outdoor Scene data set, and the correctness rates (Accuracy) corresponding to the clustering result are 73.79%, 93.36%, 82.90%, 28.67%, 83.13% and 64.47% respectively; normalized mutual information (Normalized mutual information) is: 67.45%, 97.01%, 73.75%, 16.34%, 93.19% and 55.58%; the Adjusted random coefficients (Adjusted rand index) are respectively: 57.63%, 92.73%, 67.95%, 10.12%, 79.34% and 46.20%; the purities (Purity) were respectively: 81.07%, 94.78%, 83.29%, 31.10%, 86.85% and 66.55%. Compared with 5 latest multi-view clustering methods S-MVSC, SwMC, AMGL, MLAN and AWP, the ASGL method provided by the invention obtains the highest clustering result under 6 experimental databases and 4 common clustering evaluation criteria.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A multi-view clustering method based on adaptive sparse graph learning is characterized by comprising the following steps:

(1) obtaining the similar matrix of each view by the self-adaptive neighbor learning of the data matrix

Ith data point x for the v view_iAll data points of the v-th view

Will be with probability

As x_iIs close to x_iAnd (4) connecting. Thus, for all data points of the v-th view, the probability is determined by solving the following problem

From S^vMiddle school

wherein the content of the first and second substances,

learning the similarity matrix of each view from the data in a self-adaptive manner according to the solving result;

(2) shared similarity matrix learning

s.t.α^(v)≥0,α^T1_p＝1,

s.t.α^(v)≥0,α^T1_p＝1,

(3) optimization algorithm

And solving the relaxation problem by using a multiplier alternating direction method (ADMM), respectively and alternately updating alpha and S through fixed variables, and learning a shared similar matrix with a sparse structure as the input of standard spectral clustering to obtain a clustering result.

2. The multi-view clustering method based on adaptive sparse graph learning as claimed in claim 1, wherein in step (1), solving is performed

Since only the nearest data point will be at probability 1

And all other data points cannot be neighbors of

So the following problem is solved:

solve the final result to

3. The multi-view clustering method based on adaptive sparse graph learning as claimed in claim 1, wherein in step (3), when updating S, α is fixed, and the relaxation problem is converted to solve the following problems:

for the

The objective function of (1) is:

therefore, it is

Equivalent to the following equation:

definition of

Further simplified to

Introducing a soft threshold shrinkage operator:

where μ > 0, the similarity matrix to obtain the updated S is:

4. the multi-view clustering method based on adaptive sparse graph learning as claimed in claim 1, wherein in step (3), when updating α, S is fixed, and the relaxation problem is converted to solve the following problems:

s.t.α^(v)≥0,α^T1_p＝1；

definition of

Then the equivalence is to solve the following problem:

s.t.α^(v)≥0,α^T1_p＝1；

further conversion was to the following form: