CN109840545A

CN109840545A - A kind of robustness structure Non-negative Matrix Factorization clustering method based on figure regularization

Info

Publication number: CN109840545A
Application number: CN201811597620.2A
Authority: CN
Inventors: 舒振球; 陆翼; 孙燕武; 范洪辉
Original assignee: Jiangsu University of Technology
Current assignee: Jiangsu University of Technology
Priority date: 2018-12-26
Filing date: 2018-12-26
Publication date: 2019-06-04

Abstract

The present invention provides a kind of robustness structure Non-negative Matrix Factorization clustering method based on figure regularization, comprising: S10 obtains m images to be clustered, and according to image configuration k to be clustered closest figures；S20 obtains corresponding data matrix Y for each closest figure, includes n data point in data matrix Y, is decomposed to obtain eigenmatrix W and coefficient matrix H to data matrix Y using non-negative matrix factorization method；S30 is based on l_2,pThe objective function O of robustness structure Non-negative Matrix Factorization of the norm foundation based on figure regularization；For S40 according to objective function O, the method iteration preset times weighted using iteration are updated eigenmatrix W, coefficient entry and figure regular terms；Using k-means clustering algorithm, eigenmatrix W obtained to each arest neighbors figure is analyzed and is clustered S50 respectively.It uses robust loss function to measure reconstructed error therein, flag data is not used to be differentiated in the robust loss function, after introducing the semi-supervised method of Non-negative Matrix Factorization, can effectively improve efficiency and accurate rate.

Description

A robust structural non-negative matrix factorization clustering method based on graph regularization

技术领域technical field

本发明涉及图像处理技术领域，尤其涉及一种基于图正则化的非负矩阵分解聚类方法。The invention relates to the technical field of image processing, in particular to a non-negative matrix decomposition clustering method based on graph regularization.

背景技术Background technique

近几年，高维数据在许多领域里出现，如多媒体分析、计算机视觉、模式识别等，如何进行降维操作成为了一个研究课题。非负矩阵分解作为一种常用的降维方法，目标是学习基于局部的特征表示，虽然其为聚类问题提供了大量的问题形成技术和算法方法，被广泛用于各种应用中。但是，这种降维方法不能保证能够得到很好的聚类性能，因此，非负矩阵变量被提出。非负矩阵分解的基本思想是学习两个最接近原始矩阵的非负矩阵，即原始数据矩阵只使用加法操作进行重构，不使用学习因子的任何减法组合，进而得到基于部分的数据表示。In recent years, high-dimensional data has appeared in many fields, such as multimedia analysis, computer vision, pattern recognition, etc., how to perform dimensionality reduction operations has become a research topic. As a commonly used dimensionality reduction method, non-negative matrix factorization aims to learn local-based feature representations, although it provides a large number of problem formation techniques and algorithmic methods for clustering problems and is widely used in various applications. However, this dimensionality reduction method cannot guarantee good clustering performance, therefore, non-negative matrix variables are proposed. The basic idea of non-negative matrix factorization is to learn two non-negative matrices that are closest to the original matrix, that is, the original data matrix is reconstructed using only addition operations, without any subtractive combination of learning factors, and then a part-based data representation is obtained.

在半监督学习中，同一类中由标记数据和未标记数据组成的整体往往来自同一子空间，即标记数据和未标记数据的内在表示是一致的，未标记数据的表示也具有块对角结构。在实际应用中，数据总是受到噪声和离群值的污染，极大可能会影响块对角结构的发现，降低其性能。In semi-supervised learning, the whole composed of labeled data and unlabeled data in the same class often comes from the same subspace, that is, the intrinsic representations of labeled data and unlabeled data are consistent, and the representation of unlabeled data also has a block-diagonal structure . In practical applications, the data is always polluted by noise and outliers, which may greatly affect the discovery of block diagonal structure and reduce its performance.

发明内容SUMMARY OF THE INVENTION

针对上述问题，本发明提供了一种基于图正则化的鲁棒性结构非负矩阵分解聚类方法，有效解决现有技术中数据受到噪声和离群值的污染后，可能会影响块对角结构的发现并降低其性能的技术问题。In view of the above problems, the present invention provides a robust structural non-negative matrix factorization clustering method based on graph regularization, which effectively solves the problem that in the prior art, after the data is polluted by noise and outliers, the block diagonal may be affected. The discovery of the structure and the technical problems of reducing its performance.

本发明提供的技术方案如下：The technical scheme provided by the present invention is as follows:

一种基于图正则化的鲁棒性结构非负矩阵分解聚类方法，包括：A robust structural non-negative matrix factorization clustering method based on graph regularization, including:

S10获取m个待聚类图像，并根据待聚类图像构造k个最邻近图；S10 obtains m images to be clustered, and constructs k nearest neighbor graphs according to the images to be clustered;

S20针对每个最邻近图得到相应的数据矩阵Y，所述数据矩阵Y中包括n个数据点，使用非负矩阵分解方法对数据矩阵Y进行分解得到特征矩阵W和系数矩阵H；S20 obtains a corresponding data matrix Y for each nearest neighbor graph, the data matrix Y includes n data points, and uses a non-negative matrix decomposition method to decompose the data matrix Y to obtain a feature matrix W and a coefficient matrix H;

S30基于范数建立基于图正则化的鲁棒性结构非负矩阵分解的目标函数O；The S30 is based on The norm establishes the objective function O for robust structural non-negative matrix factorization based on graph regularization;

其中，表示正则化项，μ表示基础系数，Z表示正则化项中的指标矩阵，表示标记数据点矩阵，表示无标记数据点的零矩阵；表示系数项，β表示稀疏系数；λTr(W^TL_WW)和μTr(H^TL_HH)表示图正则项，λ表示低秩系数，L^W＝E^W-V^W和L^H＝E^H-V^H为拉普拉斯方程，V为构造的最近邻图的权重矩阵，E为对角矩阵；in, represents the regularization term, μ represents the base coefficient, Z represents the index matrix in the regularization term, represents a matrix of labeled data points, a matrix of zeros representing unlabeled data points; represents the coefficient term, β represents the sparse coefficient; λTr(W ^T L _W W) and μTr(H ^T L _H H) represent the graph regularization term, λ represents the low-rank coefficient, L ^W =E ^W -V ^W and L ^H =E ^H -V ^H is the Laplace equation, V is the weight matrix of the constructed nearest neighbor graph, and E is the diagonal matrix;

S40根据目标函数O，使用迭代加权的方法迭代预设次数，对特征矩阵W、系数项及图正则项λTr(W^TL_WW)和μTr(H^TL_HH)进行更新；S40, according to the objective function O, use the iterative weighting method to iterate a preset number of times, and perform the iteration on the feature matrix W, the coefficient term And the graph regular terms λTr(W ^T L _W W) and μTr(H ^T L _H H) are updated;

S50采用k-means聚类算法分别对每个最近邻图所得到的特征矩阵W进行分析并聚类。S50 uses the k-means clustering algorithm to analyze and cluster the feature matrix W obtained by each nearest neighbor graph respectively.

本发明提供的基于图正则化的鲁棒性结构非负矩阵分解聚类方法，作为一种无监督的学习机制，通过特定的指标(预先设定的指标矩阵)进行评估来寻求对具有相似的无标记数据点进行适当的分区，该方法将数据空间的几何信息进行编码，不仅保留固有的几何结构，同时得到其隐藏的简洁表示。对于现有技术中噪声和离群值可能会影响块对角结构发现的问题，采用鲁棒损失函数对其中的重构误差进行测量，在该鲁棒损失函数中没有使用标记数据进行判别，引入非负矩阵分解的半监督方法后，能够有效的提高效率及精确率。The robust structural non-negative matrix factorization clustering method based on graph regularization provided by the present invention, as an unsupervised learning mechanism, evaluates through a specific index (pre-set index matrix) to seek for those with similar Appropriate partitioning of unlabeled data points, the method encodes the geometric information of the data space, not only preserving the inherent geometric structure, but also obtaining its hidden compact representation. For the problem that noise and outliers in the prior art may affect the discovery of block-diagonal structures, a robust loss function is used to measure the reconstruction error. In this robust loss function, labeled data is not used for discrimination, and the introduction of The semi-supervised method of non-negative matrix factorization can effectively improve the efficiency and accuracy.

附图说明Description of drawings

下面将以明确易懂的方式，结合附图说明优选实施方式，对上述特性、技术特征、优点及其实现方式予以进一步说明。The preferred embodiments will be described below in a clear and easy-to-understand manner with reference to the accompanying drawings, and the above-mentioned characteristics, technical features, advantages and implementations thereof will be further described.

图1为本发明中基于图正则化的鲁棒性结构非负矩阵分解聚类方法流程示意图。FIG. 1 is a schematic flowchart of the robust structural non-negative matrix factorization clustering method based on graph regularization in the present invention.

具体实施方式Detailed ways

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对照附图说明本发明的具体实施方式。显而易见地，下面描述中的附图仅仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图，并获得其它的实施方式。In order to more clearly describe the embodiments of the present invention or the technical solutions in the prior art, the specific embodiments of the present invention will be described below with reference to the accompanying drawings. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative efforts, and obtain other implementations.

对于任意的矩阵B，b_i表示矩阵B的第i行向量，bⁱ表示为矩阵B的第i列向量。如果矩阵B是方块矩阵，使用Tr[B]表示矩阵B的轨迹，矩阵B的转置矩阵表示为B^T。当p>0时，定义向量b∈R^m的范数为矩阵B∈R^m×n的Frobenius范数是矩阵B的范数为For any matrix B, b _i represents the ith row vector of matrix B, and b ⁱ represents the ith column vector of matrix B. If matrix B is a square matrix, use Tr[B] to represent the trajectory of matrix B, and the transpose matrix of matrix B is represented as B ^T . When p>0, define the vector ^b∈Rm norm is The Frobenius norm of a matrix B ∈ R ^m×n is matrix B norm is

当0<p<1时，混合(q＝2)范数不是一个有效的矩阵范数，因此其不承认三角不等式。When 0<p<1, mix The (q=2) norm is not a valid matrix norm, so it does not recognize trigonometric inequalities.

考虑一个包含n个数据点的数据集数据矩阵Y＝[y₁,...,y_n]，y_i∈R^d表示第i的特征描述样本，非负矩阵分解的目的在于找到两个非负矩阵和最好的近似原数据矩阵Y有许多散度函数测量重建误差，则基于NMF(Non-negative matrixfactorization，非负矩阵分解)和Frobenius范数建立的目标函数O₁如式(1)：Consider a dataset with n data points Data matrix Y=[y ₁ ,...,y _n ], y _i ∈R ^d represents the i-th feature description sample, the purpose of non-negative matrix decomposition is to find two non-negative matrices and The best approximate original data matrix Y has many divergence functions to measure the reconstruction error, then the objective function O ₁ established based on NMF (Non-negative matrix factorization, non-negative matrix factorization) and Frobenius norm is as formula (1):

在半监督学习中，若n个数据点中包含L(L＜＜n)个标记样本和U个未标记样本，则其中，表示标记数据点矩阵，表示未标记点数据矩阵。假设数据点均是从C类中取样的，中每个被标记的数据点都被一个类标记所标记，则其中，有着第C层类的n_i数据点的特征矩阵，可得，n＝L+U。将每个子空间的维数设为m，且定义r＝mC，表示第C层子空间的n_c数据点。为显示地追求区块对角化，在非负矩阵分解的基础上实施正则化项得到如式(2)的目标函数O₂：In semi-supervised learning, if n data points contain L (L<<n) labeled samples and U unlabeled samples, then in, represents a matrix of labeled data points, Represents an unlabeled point data matrix. Assuming that the data points are all sampled from class C, Each labeled data point in is labeled with a class label, then in, With the feature matrix of n _i data points of the C-th class, we can get, n=L+U. Let the dimension of each subspace be m, and define r=mC, represents the n _c data points of the C-th subspace. To explicitly pursue block diagonalization, a regularization term is implemented based on non-negative matrix factorization The objective function O ₂ of formula (2) is obtained:

其中，loss()表示损失函数，μ表示基础系数。Among them, loss() represents the loss function, and μ represents the base coefficient.

在实际应用中，数据经常受到噪声和离群值的污染，进而影响对角结构的发现并降低其性能，基于此，本发明中基于(p>0)范数建立具备鲁棒性的损失函数，在目标函数O₂的基础上得到如式(3)的目标函数O₃：In practical applications, data are often polluted by noise and outliers, which in turn affect the discovery of diagonal structures and reduce their performance. Based on this, the present invention is based on (p>0) norm establishes a robust loss function, and on the basis of the objective function O ₂ , the objective function O ₃ as shown in formula (3) is obtained:

由本发明的目的在于发现对角结构，因此，采用简单的函数实现目标，对于正则化项定义一个指标矩阵其中，表示无标记数据点的零矩阵，表示标记数据点矩阵，得到如式(4)的正则化项：Since the purpose of the present invention is to find the diagonal structure, therefore, a simple function is used to achieve the goal, for the regularization term define an indicator matrix in, a matrix of zeros representing unlabeled data points, Represents the matrix of labeled data points, and obtains the regularization term as in Eq. (4):

根据该正则化项，得到如式(5)的目标函数：According to the regularization term, the objective function of equation (5) is obtained:

实际上，块状对角线结构可以看作是结构化的整体形式，可以影响所有类的表示，故本发明中进一步引入稀疏约束对非负矩阵分解目标函数系数矩阵因子的约束来控制其系数程度。在基于交替非负最小二乘的非负矩阵框架中，使用范数正则化控制式(5)的稀疏性，引入系数项后，得到如式(6)的目标函数：In fact, the block-shaped diagonal structure can be regarded as a structured overall form, which can affect the representation of all classes. Therefore, the present invention further introduces sparse constraints on the non-negative matrix factorization objective function coefficient matrix factor constraints to control its coefficients degree. In a nonnegative matrix framework based on alternating nonnegative least squares, use Norm regularization controls the sparsity of equation (5). After introducing the coefficient term, the objective function of equation (6) is obtained:

其中，β是稀疏系数。where β is the sparse coefficient.

考虑数据流形的几何结构以及功能歧管，分别创建两个图表来反映歧管的结构分布，故在式(6)的基础上引入图正则项得到式(7)：Considering the geometric structure of the data manifold and the functional manifold, two graphs are created to reflect the structural distribution of the manifold. Therefore, on the basis of Equation (6), a graph regular term is introduced to obtain Equation (7):

其中，λ表示低秩系数，L^W＝E^W-V^W和L^H＝E^H-V^H为拉普拉斯方程，V为构造的最近邻图的权重矩阵，E为对角矩阵。Among them, λ represents a low-rank coefficient, L ^W =E ^W -V ^W and L ^H =E ^H -V ^H are Laplace equations, V is the weight matrix of the constructed nearest neighbor graph, and E is a diagonal matrix.

在式(7)中，矩阵H和W都不是凸或光滑的，故采用迭代乘法算法对其进行优化。由于损失函数对噪声和异常值具有鲁棒性，且(0<p<1)范数比范数更为稀疏，故对(0<p≤1)混合函数进行限制，对于公式(7)的目标函数,令I＝Y-HW，可得如式(8)的目标函数O：In equation (7), the matrices H and W are not convex or smooth, so an iterative multiplication algorithm is used to optimize them. Since the loss function is robust to noise and outliers, and (0<p<1) norm ratio The norm is more sparse, so (0<p≤1) is limited by the mixed function. For the objective function of formula (7), let I=Y-HW, and the objective function O of formula (8) can be obtained:

由于H,W≥0，引入约束的拉格朗日乘数ψ和φ，使得系数矩阵H>0，特征矩阵W>0。令Ψ＝[ψ]，Φ＝[φ]，目标函数O的拉格朗日函数L如式(9)：Since H, W≥0, the constrained Lagrangian multipliers ψ and φ are introduced, so that the coefficient matrix H>0 and the feature matrix W>0. Let Ψ=[ψ], Φ=[φ], the Lagrangian function L of the objective function O is as formula (9):

根据矩阵性质拉格朗日函数L对系数矩阵H的偏导如式(10)：According to the properties of the matrix The partial derivative of the Lagrangian function L with respect to the coefficient matrix H is shown in formula (10):

其中，对角矩阵i_k在对角矩阵D中，其数值趋近于0,可得 Among them, the diagonal matrix i _k is in the diagonal matrix D, and its value is close to 0, we can get

拉格朗日函数L对特征矩阵W的偏导如式(11)：The partial derivative of the Lagrangian function L with respect to the eigenmatrix W is shown in formula (11):

令式(10)和式(11)等于0，并令ψH＝0，φW＝0，得到等式(12)和(13)：Setting equations (10) and (11) equal to 0, and setting ψH = 0 and φW = 0, yields equations (12) and (13):

(HWDW^T)H-(YDW^T)H＝0(12)(HWDW ^T )H-(YDW ^T )H=0(12)

(H^THWD-H^TYD)W+μ(Z⊙W)W＝0(13)(H ^T HWD-H ^T YD)W+μ(Z⊙W)W=0(13)

进而得到式(14)和(15)：Then formulas (14) and (15) are obtained:

以此得到系数矩阵H和特征矩阵W更新规则，如式(16)和(17)：In this way, the coefficient matrix H and feature matrix W update rules are obtained, such as formulas (16) and (17):

基于此，系数项和图正则项的更新规则如式(18)和(19)：Based on this, the update rules for coefficient terms and graph regular terms are as follows (18) and (19):

在目标函数O中，L^H＝E^H-V^H，L^W＝E^W-V^W为拉普拉斯方程，V为权重矩阵，E为对角矩阵，参数μ和λ均大于0，系数项和图正则项的更新规则如式(20)和(21)：In the objective function O, L ^H =E ^H -V ^H , L ^W =E ^W -V ^W is the Laplace equation, V is the weight matrix, E is the diagonal matrix, the parameters μ and λ are both greater than 0, and the coefficient The update rules of terms and graph regular terms are as formulas (20) and (21):

要说明的是，公式(20)和(21)中的h和w实质上就是系数矩阵H和特征矩阵W，为了表示区分，这里用小写表示。It should be noted that h and w in formulas (20) and (21) are essentially the coefficient matrix H and the feature matrix W, which are denoted by lowercase here in order to express distinction.

基于上述基于图正则化的鲁棒性结构非负矩阵的分解，应用于图像聚类时，包括以下步骤：Based on the above-mentioned decomposition of the robust structural non-negative matrix based on graph regularization, when applied to image clustering, it includes the following steps:

S20针对每个最邻近图得到相应的数据矩阵Y，数据矩阵Y中包括n个数据点，使用非负矩阵分解方法对数据矩阵Y进行分解得到特征矩阵W和系数矩阵H；S20 obtains the corresponding data matrix Y for each nearest neighbor graph, includes n data points in the data matrix Y, and uses the non-negative matrix decomposition method to decompose the data matrix Y to obtain the characteristic matrix W and the coefficient matrix H;

应当说明的是，上述实施例均可根据需要自由组合。以上仅是本发明的优选实施方式，应当指出，对于本技术领域的普通相关人员来说，在不脱离本发明原理的前提下，还可以做出若干改进和润饰，这些改进和润饰也应视为本发明的保护范围。It should be noted that the above embodiments can be freely combined as required. The above are only the preferred embodiments of the present invention. It should be pointed out that, for those of ordinary skill in the art, several improvements and modifications can be made without departing from the principles of the present invention, and these improvements and modifications should also be regarded as It is the protection scope of the present invention.

Claims

1. A robustness structure non-negative matrix factorization clustering method based on graph regularization is characterized by comprising the following steps:

s10, acquiring m images to be clustered, and constructing k nearest graphs according to the images to be clustered;

s20, obtaining a corresponding data matrix Y for each nearest graph, wherein the data matrix Y comprises n data points, and decomposing the data matrix Y by using a non-negative matrix decomposition method to obtain a feature matrix W and a coefficient matrix H;

s30 is based on l_2,pNorm establishment graph regularization-based robustAn objective function O of non-negative matrix factorization of the sexual structure;

wherein,represents a regularization term, mu represents a base coefficient, Z represents an index matrix in the regularization term, a matrix of marker data points is represented,a zero matrix representing unlabeled data points;representing coefficient terms, β representing sparse coefficients,. lambda.Tr (W)^TL_WW) and μ Tr (H)^TL_HH) Denotes the graph regularization term, λ denotes the low rank coefficient, L^W＝E^W-V^WAnd L^H＝E^H-V^HThe method comprises the following steps of (1) taking Laplace equation as a basis, wherein V is a weight matrix of a constructed nearest neighbor graph, and E is a diagonal matrix;

s40, according to the objective function O, using the iterative weighting method to iterate the preset times, and carrying out the iteration on the characteristic matrix W and the coefficient itemsAnd graph regularization term λ Tr (W)^TL_WW) and μ Tr (H)^TL_HH) Updating is carried out;

s50, analyzing and clustering the feature matrix W obtained by each nearest neighbor graph by adopting a k-means clustering algorithm.

2. The clustering method according to claim 1, wherein in step S40, the update rule of the feature matrix W is:

wherein,i_kin the diagonal matrix D, the value thereof approaches 0.

3. The clustering method according to claim 1, wherein in step S40, coefficient termsAnd graph regularization term λ Tr (W)^TL_WW) and μ Tr (H)^TL_HH) The update rule of (1) is:

wherein L is^W＝E^W-V^WAnd L^H＝E^H-V^HIn Laplace's equation, V is the weight matrix of the constructed nearest neighbor graph, and E is the diagonal matrix.