CN113591879A

CN113591879A - Deep multi-view clustering method, network, device and storage medium based on self-supervision learning

Info

Publication number: CN113591879A
Application number: CN202110831409.8A
Authority: CN
Inventors: 宗林林; 苗发强; 徐博; 梁文新; 张宪超
Original assignee: Dalian University of Technology
Current assignee: Dalian University of Technology
Priority date: 2021-07-22
Filing date: 2021-07-22
Publication date: 2021-11-02
Anticipated expiration: 2041-07-22

Abstract

The invention discloses a depth multi-view clustering method, a network, a device and a storage medium based on self-supervision learning, belongs to the field of depth multi-view clustering, and is applied to multiple application scenes such as abnormal point detection, value combination of multiple products, group division of target users and the like.

Description

Deep multi-view clustering method, network, device and storage medium based on self-supervision learning

Technical Field

The invention belongs to the field of image processing, relates to a deep multi-view clustering method, and particularly relates to a deep multi-view clustering method based on self-supervision learning.

Background

With the rapid development of big data and the internet of things, data collection from different sources and all-around and multi-angle is more convenient, multi-view data is more and more, and deep multi-view clustering also becomes a hotspot of current research. Deep multi-view clustering refers to the task of clustering using representation data collected from multiple ways for the same object. The multi-view clustering utilizes the information collected from a wider layer to mine the complementarity and consistency relationship between views, explore the internal relation between data more deeply and mine the potential value. For example, in the aspect of advertisement push, information can be mined from multiple angles such as attributes, behaviors and attention relations of users, so that a more reasonable user partition is obtained, and advertisements are accurately put. The method also has important significance in the fields of financial anti-fraud, medical data abnormity reminding and the like. Most existing multi-view clustering algorithms explore and represent information recognition clustering in an unsupervised mode, but the development of multi-view clustering is restricted to a certain extent by an unsupervised learning mode. The constraint clustering method breaks through the limitation that the traditional clustering relies on an unsupervised model, but the manual marking constraint increases additional labor cost. The recently proposed self-supervision clustering does not need manual labeling constraint and further reduces the consumption of manual resources.

Disclosure of Invention

In order to solve the problem of consumption of artificial resources caused by artificial marking in deep multi-view clustering, the invention introduces an automatic supervision learning mechanism to enable multi-view data to be suitable for an automatic supervision learning algorithm, thereby improving the clustering processing capability of the multi-view data.

In order to achieve the purpose, the invention provides a deep multi-view clustering method, a network, a device and a storage medium based on self-supervision learning, which guides and optimizes a multi-view clustering task by extracting paired constraint information and introducing a self-supervision model.

A depth multi-view clustering method based on self-supervision learning comprises the following steps:

compressing original multi-view data to a low-dimensional potential space to obtain potential feature representation of a view;

performing feature fusion on potential features of each view to obtain potential feature representation shared by multiple views;

adding orthogonal constraint to the potential feature representation shared by the views to obtain the orthogonal potential feature representation shared by the views, and calculating and constructing a similarity matrix shared by the views through a Gaussian kernel function;

acquiring potential feature representation of each view, constructing a corresponding similarity matrix, extracting must-be-trusted links and must-not-be-trusted links by setting a threshold value to form a pair-wise constraint matrix of each view, and selecting the pair-wise constraint matrix with extremely high reliability to form the view-shared pair-wise constraint matrix by solving an intersection;

carrying out diffusion propagation on the paired constraint matrixes shared by the views on the initial matrix through a propagation network to obtain the paired constraint matrixes shared by the views which are completely propagated, and adjusting the similarity matrix shared by the views through the paired constraint matrixes to ensure that the similarity matrix shared by the views is fused with constraint information to represent the similarity between the instances and simultaneously meet the nonnegativity and symmetry to obtain the adjusted similarity matrix shared by the views;

and inputting the similarity matrix shared by the adjusted views as a similarity matrix of the spectral clustering model for clustering to obtain a cluster division result of the multi-view data.

Further, the method for compressing the original multi-view data into the low-dimensional potential space is as follows:

for multi-view data X ═ X¹,X²,…,X^m}，

Using an encoder E_v(X^v)＝Z^vExtracting view latent features Z of multiple views^v，

Using a decoder

Decoding view latent features Z^vReconstructing view raw data X^v。

The loss function of the autoencoder model is:

where v ∈ {1,2, …, m }, denotes the vth view, and m denotes the number of views.

Further, in feature Fusion, Fusion (. circle.) denotes a Fusion network, Z^*Potential features for multi-view sharing:

Z^*＝Fusion(Z¹,Z²,…,Z^m)

defining fusion losses

Optimizing parameters of the converged network:

the orthogonal underlying features of view sharing are represented as:

the similarity matrix shared by the views is:

for orthogonal latent feature representation of view sharing, d (·,) represents the distance between the two data, σ is the bandwidth parameter of the gaussian kernel.

Furthermore, a threshold value is set to extract the must-credible link and the must-incredible link, paired constraint matrixes of the views are formed, and paired constraint matrixes with extremely high credibility are selected in an intersection solving mode to form the paired constraint matrixes shared by the views, wherein the method comprises the following steps:

calculating potential feature representation Z of each view respectively^vSimilarity matrix W^v；

Setting a confidence threshold δ_mAnd must not be trusted threshold δ_cA set of linked-by pairwise constraints of

And a set of pairwise constraints that are not necessarily linked as

Wherein

Is data

A category label of (1);

definition of

Expressed as the degree of similarity of the ith data and the jth data in the vth view,

stipulate if

Then define

If it is not

Then define

Will be provided with

The corresponding element within is defined as 1,

setting the corresponding element in the view to be-1 and setting the other uncertain elements to be 0 to obtain a pair-wise constraint matrix L corresponding to each view^v；

The bound credibility and bound incredibility threshold values are indirectly obtained by a constraint selectivity delta, n is defined as the number of data samples, and the specification is provided

Similarity matrix W from view potential features^vBefore n is selected₁The larger value as the must-be-trusted instance pair, then the nth value₁The larger must-be-trusted instance corresponds to a value of δ_m(ii) a Provision for

Similarity matrix W from view potential features^vBefore n is selected₂The smaller value as the must not be trusted instance pair, n₂The smaller must-be-untrusted example corresponds to a value of δ_c。

Further, the matrix L is constrained in pairs from the view by intersection^vMiddle screening out paired constraint forming view shared constraint matrix L with extremely high credibility^*Expressed as:

further, the propagation network is composed of a plurality of node layers similar to neurons, the output of the upper layer is the input of the next layer, F (t-1) is designated as the output of the t-1 layer, and the single-layer propagation network works in the following mode:

where U is a parameter of the constraint handling network, V is a parameter of the constraint aggregation network, b is a bias,

is an activation function;

the input of the propagation network is a pair-wise constraint matrix L^*And an initial matrix F⁰The output is a constraint matrix F with full propagation^*；

The loss function of the training propagation network is

Wherein gamma is>0 is a hyper-parameter controlling network training; tr (-) is the trace of the matrix,

is a normalized laplacian matrix; d is a diagonal matrix with diagonal elements having values of W^*Adding corresponding row elements; and I is an identity matrix.

Further, a fully propagated constraint matrix F^*Each item in (1)

Representing confidence values of pairwise constraints, and

through a fully propagated constraint matrix F^*Adjusting similarity matrix W of view potential features^*Expressed as:

wherein

Indicating how similar the ith instance is to the jth instance.

A deep multi-view clustering network based on self-supervised learning, comprising:

and the automatic encoder compresses the original multi-view data to a low-dimensional potential space and acquires the potential characteristic representation of the view.

The view fusion module comprises a fusion network, and is used for performing feature fusion on potential features of all views to acquire potential features shared by the views; and the orthogonal layer is used for adding orthogonal constraint to the view sharing potential feature to obtain the view sharing orthogonal potential feature representation, and responding to the orthogonal potential feature and constructing a view sharing similarity matrix through Gaussian kernel function calculation.

The self-supervision module comprises a pair-constrained module, a pair-constrained matrix and a pair-constrained matrix, wherein the pair-constrained matrix comprises a pair-constrained matrix, the potential feature representation of each view is obtained, a corresponding similarity matrix is constructed, a necessary credible link and a necessary incredible link are extracted by setting a threshold value to form the pair-constrained matrix of each view, and the pair-constrained matrix with extremely high credibility is selected by solving an intersection to form the pair-constrained matrix shared by the views;

and the propagation network is used for performing diffusion propagation on the view-shared paired constraint matrix on the initial matrix to obtain the completely propagated view-shared paired constraint matrix, adjusting the view-shared similarity matrix through the propagation network to ensure that the view-shared similarity matrix integrates constraint information to represent the similarity between the instances and simultaneously meet the nonnegativity and symmetry, and obtaining the adjusted view-shared similarity matrix.

And the spectral clustering model is used for clustering by taking the adjusted similarity matrix shared by the views as the similarity matrix input of the spectral clustering model, and obtaining the cluster division result of the multi-view data.

An apparatus for deep multi-view clustering based on self-supervised learning, comprising a processor and a memory, the processor executing code in the memory to implement any of the methods.

A computer storage medium storing a computer program for execution by hardware to implement any of the methods.

The invention has the beneficial effect of providing an automatic supervision depth multi-view clustering method. The method has the advantages that constraint information is efficiently acquired from multi-view data, labor cost is reduced, potential feature representation is continuously optimized through a self-supervision learning mechanism, and clustering capability is further improved. The method can be applied to various application scenes such as abnormal point detection, value combination of various products, group division of target users and the like.

Drawings

FIG. 1 is a general flow diagram of the process of the present invention.

Fig. 2 is a diagram of a network architecture model of the present invention.

Fig. 3 is a diagram of a propagation network model of the present invention.

FIG. 4 is a graph of the effect of changing the constraint selection rate on the accuracy of the constraint pair selected, where (a) is represented on the ALOI dataset and (b) is represented on the LUse-21 dataset.

FIG. 5 is a graph of the impact of varying constraint selectivity on clustering performance, where (a) is represented on the ALOI dataset and (b) is represented on the LUse-21 dataset.

FIG. 6 is a graph of the impact of varying the hyperparameters β and γ on clustering performance, where (a) is shown on the ALOI dataset and (b) is shown on the LUse-21 dataset.

Detailed Description

The following detailed description of the invention refers to the accompanying drawings that illustrate specific embodiments of the invention.

As shown in fig. 1, the present invention provides a deep multi-view clustering method based on an auto-supervised learning mechanism. By extracting the paired constraint information of the multi-view data instances, the self-monitoring model is combined with the constraint clustering, and the multi-view clustering model is optimized, so that the clustering accuracy of the algorithm is improved.

In one scheme, the self-supervised learning based deep multi-view clustering method SDMvSC comprises the following steps of deep potential feature extraction, similarity matrix extraction, pairwise constraint propagation, similarity matrix adjustment of view sharing and clustering result acquisition, and the steps are summarized as follows:

1. deep latent feature extraction: obtaining potential feature representation Z of each view using an auto-encoder network to compress raw view data into a low-dimensional potential space^vAnd fusing them to obtain a potential feature representation Z shared by the views^*。

2. Extracting a similarity matrix: obtaining potential feature representation Z of view sharing^*Adding orthogonal constrained view-shared orthogonal latent feature representations

Similarity matrix W for view sharing construction through Gaussian kernel function calculation^*。

3. And (3) pair-wise constraint extraction: obtaining potential feature representations Z for views^vAnd (3) constructing a similarity matrix W of potential features of each view through Gaussian kernel function calculation^vThen setting a threshold value to select a similarity matrix W of potential features of each view^vAnd the must-link and must-not-link form a pair-wise constraint matrix L of each view^vSelecting the paired constraint matrix L shared by the paired constraint component views with extremely high credibility by solving the intersection^*。

4. Propagation of pairwise constraints: pair-wise constraint matrix L for sharing views through constraint propagation network^*Paired constraint matrix F for obtaining view sharing of full propagation by diffusion propagation on initial matrix as much as possible^*。

5. Adjusting a similarity matrix shared by the views: pair-wise constraint matrix F shared by fully propagated views^*Adjusting a similarity matrix W for view sharing^*Obtaining an adjusted view-shared similarity matrix

6. Obtaining a clustering result: similarity matrix for sharing adjusted views

And inputting the similarity matrix serving as a spectral clustering model for clustering to obtain a cluster division result of the multi-view data.

In a specific embodiment, the method is specifically described as follows:

1. definition of

1.1. Paired constraints

Paired constraints are divided into two categories, namely, must-join constraints and must-don constraints. A pair-wise constraint representing a set of must-link links as

And a set of pairwise constraints that are not necessarily linked as

Wherein

Is data

The category label of (1).

1.2. Multi-view data

The multiview data is represented as X ═ X¹,X²,…,X^mWhere m denotes the number of views of the multi-view data. Usually using X^vData representing the v-th view and v ∈ {1,2, …, m }.

2. Clustering process

The method of the invention aims to accurately obtain clustering results by using data of a plurality of views, and firstly uses a self-encoder to obtain clustering results from a plurality of viewsExtracting potential feature representation Z of each view^vThereafter, potential feature representation Z of multi-view sharing is extracted through converged network^*(ii) a Constraint matrix L shared by views formed by extracting paired constraint information by utilizing common features among multi-view data^*After propagating pairwise constrained information through constraint propagation network with history memory, a pairwise constraint matrix F shared by propagated views is utilized^*Similarity matrix W for optimized view sharing^*And finally, obtaining a clustering result through spectral clustering.

2.1. Deep latent feature extraction

Since the deep neural network can more efficiently mine the latent features of the data, the present invention learns the low-dimensional latent feature representation of each view using a deep self-coder. First of all, the invention uses an encoder E_v(X^v)＝Z^vExtracting potential feature representation Z of each view^vThen using a decoder

Decoding Z^vTo reconstruct the data X^v. For the depth autoencoder model, the loss function is:

in order to further extract potential features shared by multiple views, in a depth feature fusion layer, the invention mines consistency information and complementarity information between multi-view data by fusing features of all views. Fusion (. smallcircle.) denotes a converged network, Z^*Is a potential feature representation shared for multiple views. Thus, the present invention has Z^*＝Fusion(Z¹,Z²,…,Z^m). The invention defines the fusion loss

To optimize parameters of the converged network.

Extracting potential feature representations Z for multi-view sharing^*Then, the invention passes through the orthogonal layer as Z^*An orthogonal constraint is added. The orthogonal layers are realized by Cholesky decomposition, i.e.

2.2. Similarity matrix extraction

Acquisition of orthogonal latent feature representations for multi-view sharing

Then, the similarity matrix W shared by multiple views is calculated and constructed through the Gaussian kernel function^*。

Where d (·,) denotes the distance between the two data, σ is the bandwidth parameter of the gaussian kernel, which controls the radial range of action. The method sets σ as the median of the euclidean distances between the data points.

2.3. Pairwise constrained extraction

The paired constraints describe weak supervision information of whether two instances belong to the same class, so that the extraction of the paired constraints can improve the accuracy of data feature mining. By using the method for constructing the similarity matrix in the formula (4), the invention respectively calculates the potential feature representation Z of each view^vI.e. the similarity matrix W of the potential features of the view^v. Then, the invention sets the necessary confidence threshold delta_mAnd must not be trusted threshold δ_cTo look for W^vThe pair-wise constraint information of (1). Definition of

stipulate if

Then define

If it is not

Then define

Then will be

The corresponding element within is defined as 1,

setting the corresponding element in the view to be-1 and setting the other uncertain elements to be 0 to obtain a pair-wise constraint matrix L corresponding to each view^v。

In the experiment, the bound credibility threshold and the bound incredibility threshold are indirectly obtained by restricting the selectivity delta. Defining n as the number of data samples, specifying

Because the extracted pair-wise constraint is not completely correct and because the multi-view data has consistency and complementarity information, the invention solves the problem that the pair-wise constraint matrix L is extracted from the views in an intersection manner^vMiddle screening out paired constraint forming view shared constraint matrix L with extremely high credibility^*。

2.4. Pairwise constrained propagation

In general, the greater the number of active pairwise constraints, the better the constraint performance. However, the number of reliable paired constraints obtained in equation (6) tends to be small. In order to spread the extracted pair-wise constraint information as much as possible throughout the constraint matrix, the present invention constructs a propagation network to propagate the pair-wise constraints.

The propagation network is composed of several layers of nodes like neurons. It is specified that the output of the upper layer is the input of the next layer. In order to make propagated constraint matrix and extracted paired constraint matrix L with higher reliability^*Keeping consistency, and continuously using a constraint matrix L with higher credibility in the propagation process^*The input propagates the network. F (t-1) is designated as the output of layer t-1, U and V are parameters of the propagation network, b is the bias,

for the activation function, the single-layer propagation network works in the following way:

is an activation function.

Input of propagation network is a constraint matrix L shared by views^*And an initial matrix F⁰I-S. Constraint matrix F whose output is fully propagated^*. The loss function of the training propagation network is

2.5. View-shared similarity matrix adjustment

Constraint matrix F of the invention for complete propagation^*Each item in (1)

Representing confidence values of pairwise constraints, computing

Ensure F^*With symmetry, using a fully propagated constraint matrix F^*Adjusting a similarity matrix W for view sharing^*Obtaining an optimized multi-view shared similarity matrix

Wherein

Indicating how similar the ith instance is to the jth instance,

constraint information is fused so that similarities between instances can be represented more accurately.

2.6. Clustering result acquisition

Adjusted view-shared similarity matrix

Satisfy the nonnegativity and symmetry, will

3. Experiment of

3.1. Data set

The invention uses two multi-view data sets for experiments, and the information of the data sets is as follows:

ALOI image dataset: the dataset contains three views, 77-dimensional RBG color histogram, 13-dimensional HSV/HSB color histogram and 64-dimensional color similarity.

LUse-21 dataset: the dataset has three views, including 254-dimensional LBP, 512-dimensional GIST, and 256-dimensional CENTRIST.

Table 1 details of each data set

3.2. Evaluation index

The algorithm of the invention is evaluated by using three traditional clustering indexes of accuracy, mutual information and purity:

the accuracy is as follows:

where l is the actual label, c is the prediction is the clustering label, and pi is the set of all permutations of {1,2, …, k }. The optimal arrangement pi is calculated by the Cohn-Monkles algorithm.

Mutual information:

wherein I (l; c) represents mutual information between l and c, and H (-) represents entropy thereof.

Purity:

wherein Ω ═ { w ═ w₁,w₂,…,w_kDenotes cluster division, C ═ C₁,c₂,…,c_jDenotes true category classification.

The value ranges of the three clustering indexes are all between [0,1], and the closer to 1, the better the clustering effect is.

3.3. Comparison algorithm

In this experiment, 4 algorithms were mainly used for comparison with the present invention algorithm.

ECMSC: the method is called exclusive-structured Multi-view Subspace Clustering, and is a traditional Multi-view Clustering algorithm based on a Subspace model. It attempts to deal with complementary information between different view representations by introducing a new location-aware exclusivity term, and then using the correspondence term to further bring these complementary representations to a common clustering indication.

MvSCN: the method is called a Multi-view Spectral Clustering Network, and is a depth Multi-view Spectral Clustering algorithm. It combines the local invariance defined by the depth metric learning network in each view and the consistency of different views into one objective function to obtain the view-shared features.

S2 DMVSC: it is called Self-Supervised Deep Multi-View subframe Cluster in its entirety. The method integrates spectral clustering and similarity learning into a deep learning framework, fully utilizes clustering results to supervise potential representation learning and common potential subspace learning of each view, and automatically calculates a similarity matrix between data objects according to high-level and cluster-driven representations.

The incomplete version of SDMvSC algorithm: by sequentially deleting some parts of the algorithm, the importance of each part can be seen through comparison with the algorithm of the invention. The DMvSC algorithm does not comprise a self-supervision model part, and a clustering result is obtained through potential features extracted from an encoder. The PDMvSC algorithm uses a propagation calculation formula to optimize the pairwise constraint matrix.

3.4. Results of the experiment

The experimental results are shown in table 2 and evaluated using evaluation criteria such as ACC and NMI.

Table 2 experimental results (%)

Comparing an ECMSC algorithm: as can be seen from the table, the sdmvcs algorithm is superior to the ECMSC algorithm, mainly because the depth model can better explore the relationships between data and extract potential feature representations.

Comparing the MvSCN algorithm: the data in the table show that the SDMvSC algorithm is superior to the MVSCN algorithm. After the view features are extracted by the MVSCN algorithm, the fusion features are obtained by adopting a splicing mode, and the SDMvSC algorithm adopts a fusion network mode to fuse the features, so that the fusion view features are better mined by adopting a network mode.

Comparing the S2DMVSC algorithm: the S2DMVSC algorithm drives the self-supervision model by using the clustering label, the clustering label used for supervision is changed in the process of each iteration, and the uncertainty of the label has the possibility of misleading the optimization direction of the model. In contrast, the SDMvSC algorithm extracts the pair constraint of a plurality of views, which is consistent, and the correctness of the supervision information is ensured to a great extent.

And fourthly, comparing the incomplete version of the SDMvSC algorithm. Compared with the DMvSC algorithm, the self-supervision module optimizes the fused view potential representation, and the clustering performance is improved; compared with the PDMvSC algorithm, the paired constraint propagation network can more effectively diffuse paired constraint information compared with the traditional propagation formula.

3.5. Analysis of experiments

3.5.1. Effect of constraint Selectivity δ on Pair-constrained accuracy

The choice of constraint selectivity δ affects the number of pairs of constraints that are selected. In this experiment, the constraint selection rate δ was sequentially changed in the range of [0.001-0.015], and the actual number of selected pairwise constraints and the selected pairwise constraint accuracy were observed. Fig. 3 illustrates the case of varying the constraint selection rate δ versus the actual number of selected pairwise constraints and the accuracy of the selected pairwise constraints. It is found that by extracting the operation of intersecting the pair-wise constraint matrixes of all views, the ratio of the actually obtained pair-wise constraints is far lower than the constraint selectivity delta, so the operation helps to screen out incorrect constraints and improve the accuracy of the selected constraint information.

3.5.2. Effect of constraint Selectivity δ on clustering Performance

The constraint selectivity δ determines the quality of the chosen pairwise constraints, and pairwise constraint information in turn affects the clustering performance. In the experiment, the constraint selectivity delta is changed in turn in the range of [0.001-0.015], and the influence of constraint information on the clustering result is observed. Fig. 4 shows the influence of the constraint selectivity δ on the clustering performance, and as the constraint selectivity δ increases, the constraint information increases continuously, but the clustering result tends to show a trend of increasing first and then gradually decreasing. The reason is that the paired constraint as constraint information can improve the performance of clustering, but as the delta is increased, the proportion of error constraint is increased, and the capability of a propagation network is inhibited. Therefore, the reasonable delta is set, and the performance of multi-view clustering can be improved.

3.6. Hyper-parameter settings analysis

In the network training phase, two hyper-parameters beta and gamma are set to adjust the network training rate. In this experiment, the effect of changes in β and γ on algorithm performance was analyzed. Fig. 5 shows the effect of β and γ on the clustering performance. The variation range of beta and gamma is [0.1,0.3,0.5,0.7,0.9], when beta belongs to [0.7,0.9] and gamma belongs to [0.5,0.7,0.9], the SDMvSC obtains stable and good performance.

Claims

1. A deep multi-view clustering method based on self-supervision learning is characterized by comprising the following steps:

2. The depth multi-view clustering method based on the self-supervised learning as recited in claim 1, wherein: the method of compressing the original multi-view data into a low-dimensional potential space is:

for multi-view data X ═ X¹，X²，…，X^m}，

Using codesDevice E_v(X^v)＝Z^vExtracting view latent features Z of multiple views^v，

Using a decoder

Decoding view latent features Z^vReconstructing view raw data X^v。

The loss function of the autoencoder model is:

3. The self-supervised learning based deep multi-view clustering method according to claim 1 or 2, wherein:

in feature Fusion, Fusion (. circle.) denotes a Fusion network, Z^*Potential features for multi-view sharing:

Z^*＝Fusion(Z¹，Z²，…，Z^m)

defining fusion losses

Optimizing parameters of the converged network:

the orthogonal underlying features of view sharing are represented as:

the similarity matrix shared by the views is:

4. The deep multi-view clustering method based on the self-supervised learning as recited in claim 3, wherein paired constraint matrixes of the views are formed by setting a threshold value to extract a must-trusted link and a must-untrusted link, and paired constraint matrixes with extremely high credibility shared by the views are selected by intersection, and the method comprises the following steps:

And a set of pairwise constraints that are not necessarily linked as

Wherein

Is data

A category label of (1);

definition of

stipulate if

Then define

If it is not

Then define

Will be provided with

The corresponding element within is defined as 1,

From the viewSimilarity matrix W of potential features^vBefore n is selected₁The larger value as the must-be-trusted instance pair, then the nth value₁The larger must-be-trusted instance corresponds to a value of δ_m(ii) a Provision for

5. The depth multi-view clustering method based on the self-supervised learning as recited in claim 4, wherein: from view pair-wise constraint matrix L by intersection^vMiddle screening out paired constraint forming view shared constraint matrix L with extremely high credibility^*Expressed as:

6. the self-supervised learning based deep multi-view clustering method according to claim 4 or 5, wherein:

the propagation network is composed of a plurality of node layers similar to neurons, the output of the upper layer is the input of the next layer, F (t-1) is designated as the output of the t-1 layer, and the single-layer propagation network has the working mode that:

is an activation function;

of propagation networksThe input is a pairwise constraint matrix L^*And an initial matrix F⁰The output is a constraint matrix F with full propagation^*；

The loss function of the training propagation network is

Wherein gamma > 0 is a hyper-parameter controlling network training; tr (-) is the trace of the matrix,

7. The self-supervised learning based deep multi-view clustering method of claim 5, wherein: fully propagated constraint matrix F^*Each item in (1)

Representing confidence values of pairwise constraints, and

wherein

Indicating how similar the ith instance is to the jth instance.

8. A deep multi-view clustering network based on self-supervised learning, comprising:

9. An apparatus for deep multi-view clustering based on self-supervised learning, comprising a processor and a memory, wherein the processor executes code in the memory to implement the method of any one of claims 1 to 7.

10. A computer storage medium, in which a computer program is stored, the computer program being executable by hardware to implement the method of any one of claims 1 to 7.