CN109947938A

CN109947938A - Multi-label classification method, system, readable storage medium and computer device

Info

Publication number: CN109947938A
Application number: CN201910081373.9A
Authority: CN
Inventors: 何志芬; 江山; 周锦春; 李希勇
Original assignee: Pingxiang University
Current assignee: Pingxiang University
Priority date: 2019-01-28
Filing date: 2019-01-28
Publication date: 2019-06-28

Abstract

The present invention provides a multi-label classification method, system, readable storage medium and computer equipment. The method includes: converting a multi-label classification problem into two types of classification problems, each of which is corresponding to a label in the multi-label data set. ;Construct a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers; use a preset algorithm to solve the semi-supervised multi-label learning model to obtain the model parameters of the semi-supervised multi-label classification and marker correlation matrix; according to the model parameters and marker correlation matrix, predict the marker set to which the unknown marker belongs. The multi-label classification method in the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. , and dimensionality reduction for high-dimensional data is beneficial to obtain a multi-label classifier with better generalization performance.

Description

Multi-label classification method, system, readable storage medium and computer device

技术领域technical field

本发明涉及多标记学习技术领域，特别涉及一种多标记分类方法、系统、可读存储介质及计算机设备。The present invention relates to the technical field of multi-label learning, and in particular, to a multi-label classification method, system, readable storage medium and computer device.

背景技术Background technique

机器学习专门研究计算机怎样模拟或实现人类的学习行为，是计算机学科中一个重要的研究领域。监督学习是机器学习中研究得最多、应用最为广泛的一种学习框架，其假设每个学习对象只隶属于一个标记。Machine learning specializes in how computers simulate or realize human learning behavior, and is an important research field in computer science. Supervised learning is the most studied and widely used learning framework in machine learning, which assumes that each learning object belongs to only one label.

但在现实世界中，一个标记往往不足以准确地描述一些复杂的语义对象。由此可见，同时具有多个标记的复杂学习对象无处不在，而传统的监督学习方法难以很好地处理这些复杂的学习对象。But in the real world, a token is often not enough to accurately describe some complex semantic objects. It can be seen that complex learning objects with multiple labels at the same time are ubiquitous, and traditional supervised learning methods are difficult to deal with these complex learning objects well.

为此，多标记学习应运而生，其任务是学习已知的多标记数据集，来预测未知样本所属的标记集。其中，半监督多标记学习是多标记学习中一类典型，其利用少量的已标记样本以及大量的未标记样本来训练获得多标记分类器。For this reason, multi-label learning comes into being, and its task is to learn the known multi-label data set to predict the label set to which the unknown sample belongs. Among them, semi-supervised multi-label learning is a typical type of multi-label learning, which uses a small number of labeled samples and a large number of unlabeled samples to train to obtain a multi-label classifier.

然而，现有技术当中，目前半监督多标记学习方法虽然考虑了标记间的相关性但无法估计标记相关性且没有考虑多标记特征选择，也有些半监督多标记学习方法考虑了多标记特征选择，但无法通过学习自动挖掘和利用标记相关性，导致训练出的多标记分类器的泛化性能差。However, in the prior art, although the current semi-supervised multi-label learning method considers the correlation between labels, it cannot estimate the label correlation and does not consider multi-label feature selection, and some semi-supervised multi-label learning methods consider multi-label feature selection. , but cannot automatically mine and exploit label correlation through learning, resulting in poor generalization performance of the trained multi-label classifier.

发明内容SUMMARY OF THE INVENTION

基于此，本发明的目的是提供一种多标记分类方法、系统、可读存储介质及计算机设备，以解决现有技术当中半监督多标记学习方法训练出的多标记分类器的泛化性能差的技术问题。Based on this, the purpose of the present invention is to provide a multi-label classification method, system, readable storage medium and computer equipment to solve the poor generalization performance of the multi-label classifier trained by the semi-supervised multi-label learning method in the prior art technical issues.

根据本发明实施例的一种多标记分类方法，包括：A multi-label classification method according to an embodiment of the present invention includes:

将多标记分类问题转化为两类分类问题，每个两类分类器对应多标记数据集中的一个标记；Convert the multi-label classification problem into a two-class classification problem, each of which corresponds to a label in the multi-label data set;

根据预设的特征选择和所述两类分类器之间的相关性，构建半监督多标记学习模型；According to the preset feature selection and the correlation between the two types of classifiers, construct a semi-supervised multi-label learning model;

采用预设算法对所述半监督多标记学习模型进行求解，以得到所述半监督多标记分类的模型参数和标记相关性矩阵；Using a preset algorithm to solve the semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of the semi-supervised multi-label classification;

根据所述半监督多标记分类的模型参数和标记相关性矩阵，预测未知标记所属的标记集。According to the model parameters of the semi-supervised multi-label classification and the label correlation matrix, the label set to which the unknown label belongs is predicted.

另外，根据本发明上述实施例的一种多标记分类方法，还可以具有如下附加的技术特征：In addition, according to a kind of multi-label classification method of the above-mentioned embodiment of the present invention, can also have the following additional technical features:

进一步地，所述将多标记分类问题转化为两类分类问题的步骤包括：Further, the step of converting the multi-label classification problem into a two-class classification problem includes:

将所述多标记分类问题转化为Q个两类分类器，第q个所述两类分类器的判别函数定义为 The multi-label classification problem is transformed into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as

其中，Q表示所述多标记数据集中的标记个数，和分别表示第q个所述两类分类器的判别函数所对应的权重和偏差。where Q represents the number of markers in the multi-label dataset, and respectively represent the weight and bias corresponding to the discriminant function of the qth classifier of the two types of classifiers.

进一步地，所述两类分类器之间的相关性为每个所述两类分类器的学习同时依赖于原始特征向量和其他标记变量。Further, the correlation between the two types of classifiers is that the learning of each of the two types of classifiers depends on the original feature vector and other label variables at the same time.

进一步地，所述两类分类器的判别函数可转化为：Further, the discriminant functions of the two types of classifiers can be transformed into:

其中，权重矩阵偏差矩阵标记相关向量 Among them, the weight matrix Deviation matrix marker correlation vector

进一步地，所述特征选择为对权重矩阵W进行约束的项 wⁱ为W的第i行。Further, the feature selection is an item that constrains the weight matrix W w ⁱ is the ith row of W.

进一步地，所述半监督多标记学习模型的目标函数表达式为：Further, the objective function expression of the semi-supervised multi-label learning model is:

其中，n和Q分别表示样本个数和标记个数，表示已标记样本对应的标记矩阵，标记矩阵由n_l个已标记多标记样本对应的标记矩阵和未标记多标记样本对应的标记矩阵F_u组成，F_u初始状态为全零矩阵，F_i·表示第i个样本对应的标记向量，p_i定义了第i个训练样本的重要性，为标记相关性矩阵，是为了避免过拟合的正则化项， wⁱ为W的第i行，λ和η为平衡参数，为多标记数据矩阵， tr(·)表示矩阵的迹，表示元素全为1的向量。Among them, n and Q represent the number of samples and the number of markers, respectively, Represents the label matrix corresponding to the labeled sample, the label matrix Label matrix corresponding to n _l labeled multilabel samples It is composed of the label matrix Fu corresponding to the unlabeled multi-label sample. The initial state of _Fu is an all-zero matrix, F _i represents the label vector corresponding to the _ith sample, and p _i defines the importance of the ith training sample. is the marker correlation matrix, is the regularization term to avoid overfitting, w ⁱ is the ith row of W, λ and η are balance parameters, is the multi-label data matrix, tr( ) represents the trace of the matrix, Represents a vector whose elements are all ones.

进一步地，所述预设算法为交替迭代求解算法。Further, the preset algorithm is an alternate iterative solution algorithm.

根据本发明实施例的一种多标记分类系统，包括：A multi-label classification system according to an embodiment of the present invention includes:

问题转化模块，用于将多标记分类问题转化为两类分类问题，每个两类分类器对应多标记数据集中的一个标记；The problem conversion module is used to convert the multi-label classification problem into two-class classification problems, and each two-class classifier corresponds to a label in the multi-label data set;

模型构建模块，用于根据预设的特征选择和所述两类分类器之间的相关性，构建半监督多标记学习模型；a model building module for building a semi-supervised multi-label learning model according to preset feature selection and the correlation between the two types of classifiers;

模型求解模块，用于采用预设算法对所述半监督多标记学习模型进行求解，以得到所述半监督多标记分类的模型参数和标记相关性矩阵；a model solving module for solving the semi-supervised multi-label learning model by using a preset algorithm to obtain model parameters and a label correlation matrix of the semi-supervised multi-label classification;

标记预测模块，根据所述半监督多标记分类的模型参数和标记相关性矩阵，预测未知标记所属的标记集。The marker prediction module predicts the marker set to which the unknown marker belongs according to the model parameters of the semi-supervised multi-marker classification and the marker correlation matrix.

本发明另一方法还提出一种计算机可读存储介质，其上存储有计算机程序，该程序被处理器执行时实现如上述的方法Another method of the present invention also provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the above-mentioned method is implemented

本发明另一方法还提出一种计算机设备，包括存储器、处理器以及存储在存储器上并可在处理器上运行的计算机程序，所述处理器执行所述程序时实现如上述的方法。Another method of the present invention also provides a computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor implementing the above-mentioned method when the processor executes the program.

本发明当中的多标记分类方法，采用联合标记相关性和特征选择的半监督多标记学习方法，在多标记分类器设计的同时不仅考虑了利用大量的未标记数据集，也考虑了标记相关性和多标记特征选择，这不仅有效地利用了大量的未标记多标记样本，而且也通过学习自动获得标记之间的相关性，此外对高维数据进行降维，有利于获得泛化性能更好的多标记分类器。The multi-label classification method in the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection. When designing the multi-label classifier, not only the use of a large number of unlabeled data sets, but also the label correlation is considered. and multi-label feature selection, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to obtain better generalization performance multi-label classifier.

附图说明Description of drawings

图1为本发明第一实施例中的多标记分类方法的流程图；1 is a flowchart of a multi-label classification method in a first embodiment of the present invention;

图2为本发明第二实施例中的多标记分类系统的流程图；Fig. 2 is the flow chart of the multi-label classification system in the second embodiment of the present invention;

图3为本发明第三实施例中的计算机设备的结构图。FIG. 3 is a structural diagram of a computer device in a third embodiment of the present invention.

主要元件符号说明：Description of main component symbols:

问题转化模块problem conversion module 1111 模型构建模块Model building blocks 1212 模型求解模块Model Solving Module 1313 标记预测模块Tag prediction module 1414 存储器memory 1010 处理器processor 2020 计算机程序Computer program 3030

以下具体实施方式将结合上述附图进一步说明本发明。The following specific embodiments will further illustrate the present invention in conjunction with the above drawings.

具体实施方式Detailed ways

为了便于理解本发明，下面将参照相关附图对本发明进行更全面的描述。附图中给出了本发明的若干实施例。但是，本发明可以以许多不同的形式来实现，并不限于本文所描述的实施例。相反地，提供这些实施例的目的是使对本发明的公开内容更加透彻全面。In order to facilitate understanding of the present invention, the present invention will be described more fully hereinafter with reference to the related drawings. Several embodiments of the invention are presented in the accompanying drawings. However, the present invention may be embodied in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete.

需要说明的是，当元件被称为“固设于”另一个元件，它可以直接在另一个元件上或者也可以存在居中的元件。当一个元件被认为是“连接”另一个元件，它可以是直接连接到另一个元件或者可能同时存在居中元件。本文所使用的术语“垂直的”、“水平的”、“左”、“右”以及类似的表述只是为了说明的目的。It should be noted that when an element is referred to as being "fixed to" another element, it can be directly on the other element or intervening elements may also be present. When an element is referred to as being "connected" to another element, it can be directly connected to the other element or intervening elements may also be present. The terms "vertical," "horizontal," "left," "right," and similar expressions are used herein for illustrative purposes only.

除非另有定义，本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的，不是旨在于限制本发明。本文所使用的术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.

请参阅图1，所示为本发明第一实施例中的多标记分类方法，包括步骤S01 至步骤S04。Please refer to FIG. 1, which shows the multi-label classification method in the first embodiment of the present invention, including steps S01 to S04.

步骤S01，将多标记分类问题转化为两类分类问题，每个两类分类器对应多标记数据集中的一个标记。In step S01, the multi-label classification problem is converted into a two-class classification problem, and each two-class classifier corresponds to a label in the multi-label data set.

具体地，将多标记分类问题转化为两类分类问题的步骤包括：Specifically, the steps of transforming a multi-label classification problem into a two-class classification problem include:

将所述多标记分类问题转化为Q个两类分类器，第q个所述两类分类器的判别函数定义为其中，Q表示所述多标记数据集中的标记个数，和分别表示第q个所述两类分类器的判别函数所对应的权重和偏差。The multi-label classification problem is transformed into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as where Q represents the number of markers in the multi-label dataset, and respectively represent the weight and bias corresponding to the discriminant function of the qth classifier of the two types of classifiers.

步骤S02，根据预设的特征选择和所述两类分类器之间的相关性，构建半监督多标记学习模型。Step S02, constructing a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers.

其中，所述两类分类器之间的相关性为：每个所述两类分类器的学习同时依赖于原始特征向量和其他标记变量。The correlation between the two types of classifiers is: the learning of each of the two types of classifiers depends on the original feature vector and other label variables at the same time.

可以理解的，由于每个标记所对应的分类器的学习同时依赖于原始特征向量和其他标记变量，这样使得相互独立的两类分类器可以同时学习，而且多标记分类和标记相关性也可以同时学习。基于此，第q个所述两类分类器的所对应的判别函数可转化为：Understandably, since the learning of the classifier corresponding to each tag depends on the original feature vector and other tag variables at the same time, two types of classifiers that are independent of each other can be learned at the same time, and multi-tag classification and tag correlation can also be simultaneously learned. study. Based on this, the corresponding discriminant function of the qth two-class classifier can be transformed into:

此外，为了降低高维数据特征空间的维数，所述特征选择为：对权重矩阵W 进行约束的项wⁱ为W的第i行。In addition, in order to reduce the dimension of the feature space of high-dimensional data, the feature selection is: the term that constrains the weight matrix W w ⁱ is the ith row of W.

具体来说，本步骤S02的目的在于：将半监督多标记分类模型参数、标记相关性以及半监督多标记特征选择统一到同一个模型框架中进行联合学习，以构建半监督多标记学习模型。Specifically, the purpose of this step S02 is to unify the semi-supervised multi-label classification model parameters, label correlation and semi-supervised multi-label feature selection into the same model framework for joint learning to construct a semi-supervised multi-label learning model.

其中，所述半监督多标记学习模型的目标函数表达为：Wherein, the objective function of the semi-supervised multi-label learning model is expressed as:

其中，n和Q分别表示样本个数和标记个数，表示已标记样本对应的标记矩阵，标记矩阵由n_l个已标记多标记样本对应的标记矩阵和未标记多标记样本对应的标记矩阵F_u组成，F_u初始状态为全零矩阵，F_i·表示第i个样本对应的标记向量，p_i定义了第i个训练样本的重要性，为标记相关性矩阵，是为了避免过拟合的正则化项， wⁱ为W的第i行，λ和η为平衡参数。Among them, n and Q represent the number of samples and the number of markers, respectively, Represents the label matrix corresponding to the labeled sample, the label matrix Label matrix corresponding to n _l labeled multilabel samples It is composed of the label matrix Fu corresponding to the unlabeled multi-label sample. The initial state of _Fu is an all-zero matrix, F _i represents the label vector corresponding to the _ith sample, and p _i defines the importance of the ith training sample. is the marker correlation matrix, is the regularization term to avoid overfitting, w ⁱ is the ith row of W, and λ and η are the balance parameters.

基于此，模型可进一步表示为：Based on this, the model can be further expressed as:

其中，为多标记数据矩阵，tr(·)表示矩阵的迹，表示元素全为1的向量。in, is the multi-label data matrix, tr( ) represents the trace of the matrix, Represents a vector whose elements are all ones.

步骤S03，采用预设算法对所述半监督多标记学习模型进行求解，以得到所述半监督多标记分类的模型参数和标记相关性矩阵。Step S03, using a preset algorithm to solve the semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of the semi-supervised multi-label classification.

其中，所述预设算法为交替迭代求解算法。Wherein, the preset algorithm is an alternate iterative solution algorithm.

具体交替迭代求解算法包括步骤(一)至步骤(三):The specific alternate iterative solution algorithm includes steps (1) to (3):

步骤(一)：固定C和F，求W和bStep (1): Fix C and F, find W and b

当给定C和F时，可通过优化求解如下目标函数来得到W和b。When C and F are given, W and b can be obtained by optimally solving the following objective function.

由于||W||_2，1不可导，此目标函数是非光滑凸问题，本发明将其转化为如下等价的约束光滑凸优化问题，并采用基于Nesterov的加速投影梯度方法来求解。Since ||W|| ₂ , 1 is not differentiable, this objective function is a non-smooth convex problem. The present invention transforms it into the following equivalent constrained smooth convex optimization problem, and uses the Nesterov-based accelerated projected gradient method to solve it.

其中，Θ＝{θ∈E^(d+1)×Q|θ＝(W^T，b)^T，||W||_2，1≤r，W∈R^d×Q，b∈R^Q}为凸集，r为l_2，1球的半径。where Θ={θ∈E ^(d+1)×Q |θ=(W ^T , b) ^T , ||W|| _2,1 ≤r, W∈R ^d×Q , b∈R ^Q } is Convex set, r is l ₂ , 1 the radius of the sphere.

令表示第t次迭代的搜索点，其中α_t为调整参数。make represents the search point of the t-th iteration, where α _t is an adjustment parameter.

对于定义for definition

其中，表示函数g(·)关于θ在点上导数，关于θ是强凸的。因此，在第t+1次迭代中Θ_t+1的近似解可根据如下式子计算得到。in, Represents the function g( ) with respect to θ at the point upper derivative, is strongly convex with respect to θ. Therefore, the approximate solution of Θ _t+1 in the t+1th iteration can be calculated according to the following equation.

因此，w_t+1可计算如下式子得到：Therefore, w _t+1 can be calculated as follows:

其中，Ω＝{W∈R^d×Q|||W||_2，1≤r]是封闭凸集，π_Ω(·)为到凸集Ω上的欧式投影。Among them, Ω={W∈R ^d×Q |||W|| _2,1 ≤r] is a closed convex set, and π _Ω (·) is the Euclidean projection onto the convex set Ω.

b_t+1可计算如下式子得到：b _t+1 can be calculated as follows:

即 which is

步骤(二)：固定W,b和F，求CStep (2): Fix W, b and F, and find C

当给定W,b和F时，可通过优化求解如下目标函数来得到C。When W, b and F are given, C can be obtained by optimally solving the following objective function.

C中每个分量可以通过优化如下目标函数获得：Each component in C can be obtained by optimizing the following objective function:

步骤(三)：固定W,b和C，求FStep (3): Fix W, b and C, and find F

优化W,b和C之后，可以计算因此，每次迭代求解过程中，标记矩阵F中未标记样本所对应的标记值可根据如下定义的公式来调整。After optimizing W, b and C, it is possible to calculate Therefore, in each iterative solution process, the label value corresponding to the unlabeled sample in the label matrix F can be adjusted according to the formula defined below.

步骤S04，根据所述半监督多标记分类的模型参数和标记相关性矩阵，预测未知标记所属的标记集。Step S04, according to the model parameters of the semi-supervised multi-label classification and the label correlation matrix, predict the label set to which the unknown label belongs.

举例来说，假设给定一个未知样本x_u，其在第q个标记上的判别函数值为：其预测的标记向量为： h(x_u)＝sign(f₁(x_u)，...，f_Q(x_u))，sign(·)为符号函数。For example, given an unknown sample x _u , its discriminant function value on the qth marker is: The predicted sign vector is: h(x _u )=sign(f ₁ (x _u ), . . . , f _Q (x _u )), where sign(·) is a sign function.

综上，本发明上述实施例当中的多标记分类方法，采用联合标记相关性和特征选择的半监督多标记学习方法，在多标记分类器设计的同时不仅考虑了利用大量的未标记数据集，也考虑了标记相关性和多标记特征选择，这不仅有效地利用了大量的未标记多标记样本，而且也通过学习自动获得标记之间的相关性，此外对高维数据进行降维，有利于获得泛化性能更好的多标记分类器。In summary, the multi-label classification method in the above-mentioned embodiments of the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, and not only considers the use of a large number of unlabeled data sets while designing the multi-label classifier, Label correlation and multi-label feature selection are also considered, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to Obtain a multi-label classifier with better generalization performance.

本发明另一方面还提供一种多标记分类系统，请查阅图2，所示为本发明第三实施例中的多标记分类系统，包括：Another aspect of the present invention also provides a multi-label classification system, please refer to FIG. 2, which shows the multi-label classification system in the third embodiment of the present invention, including:

问题转化模块11，用于将多标记分类问题转化为两类分类问题，每个两类分类器对应多标记数据集中的一个标记；The problem conversion module 11 is used to convert the multi-label classification problem into two-class classification problems, and each two-class classifier corresponds to a label in the multi-label data set;

模型构建模块12，用于根据预设的特征选择和所述两类分类器之间的相关性，构建半监督多标记学习模型；Model building module 12, for constructing a semi-supervised multi-label learning model according to preset feature selection and the correlation between the two types of classifiers;

模型求解模块13，用于采用预设算法对所述半监督多标记学习模型进行求解，以得到所述半监督多标记分类的模型参数和标记相关性矩阵；Model solving module 13, for adopting preset algorithm to solve described semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of described semi-supervised multi-label classification;

标记预测模块14，根据所述半监督多标记分类的模型参数和标记相关性矩阵，预测未知标记所属的标记集。The marker prediction module 14 predicts the marker set to which the unknown marker belongs according to the model parameters of the semi-supervised multi-marker classification and the marker correlation matrix.

进一步地，所述问题转化模块11还可以用于将所述多标记分类问题转化为 Q个两类分类器，第q个所述两类分类器的判别函数定义为 Further, the problem conversion module 11 can also be used to convert the multi-label classification problem into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as:

进一步地，所述特征选择为：对权重矩阵W进行约束的项 wⁱ为W的第i行。Further, the feature selection is: an item that constrains the weight matrix W w ⁱ is the ith row of W.

综上，本发明上述实施例当中的多标记分类系统，采用联合标记相关性和特征选择的半监督多标记学习方法，在多标记分类器设计的同时不仅考虑了利用大量的未标记数据集，也考虑了标记相关性和多标记特征选择，这不仅有效地利用了大量的未标记多标记样本，而且也通过学习自动获得标记之间的相关性，此外对高维数据进行降维，有利于获得泛化性能更好的多标记分类器。To sum up, the multi-label classification system in the above-mentioned embodiments of the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, and not only considers the use of a large number of unlabeled data sets while designing the multi-label classifier, Label correlation and multi-label feature selection are also considered, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to Obtain a multi-label classifier with better generalization performance.

本发明另一方法还提出一种计算机可读存储介质，其上存储有计算机程序，该程序被处理器执行时实现如上述的多标记分类方法Another method of the present invention further provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, implements the above-mentioned multi-label classification method

本发明另一方法还提出一种计算机设备，请参阅图3，所示为本发明第三实施例当中的计算机设备，包括存储器10、处理器20以及存储在存储器20上并可在处理器10上运行的计算机程序30，所述处理器20执行所述程序30时实现如上述的多标记分类方法。Another method of the present invention further proposes a computer device, please refer to FIG. 3 , which shows the computer device in the third embodiment of the present invention, including a memory 10 , a processor 20 , and a computer device stored in the memory 20 and available in the processor 10 . A computer program 30 running on the processor 20 implements the multi-label classification method described above when the processor 20 executes the program 30.

本领域技术人员可以理解，在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。Those skilled in the art will appreciate that logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing logical functions, may be embodied in in any computer-readable medium for use by an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch and execute instructions from an instruction execution system, apparatus, or device), or Used in conjunction with these instruction execution systems, apparatus or devices. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in conjunction with an instruction execution system, apparatus, or apparatus.

计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或多个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM 或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序，然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as it may be, for example, by optically scanning the paper or other medium, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.

应当理解，本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如，如果用硬件来实现，和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或它们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA) 等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

以上所述实施例仅表达了本发明的几种实施方式，其描述较为具体和详细，但并不能因此而理解为对本发明专利范围的限制。应当指出的是，对于本领域的普通技术人员来说，在不脱离本发明构思的前提下，还可以做出若干变形和改进，这些都属于本发明的保护范围。因此，本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several embodiments of the present invention, and the descriptions thereof are specific and detailed, but should not be construed as limiting the scope of the patent of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present invention, some modifications and improvements can be made, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention should be governed by the appended claims.

Claims

1. A multi-label classification method, comprising:

converting the multi-label classification problem into two classification problems, wherein each two classifier corresponds to one label in the multi-label data set;

constructing a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers;

solving the semi-supervised multi-label learning model by adopting a preset algorithm to obtain model parameters and a label correlation matrix of the semi-supervised multi-label classification;

and predicting a label set to which unknown labels belong according to the model parameters and the label correlation matrix of the semi-supervised multi-label classification.

2. The multi-label classification method according to claim 1, wherein said step of converting the multi-label classification problem into a two-class classification problem comprises:

converting the multi-label classification problem into Q two types of classifiers, wherein the discriminant function of the Q two types of classifiers is defined as

Wherein Q represents the number of markers in the multi-marker dataset,andand respectively representing the weight and the deviation corresponding to the discriminant functions of the qth classifiers and the two classifiers.

3. The multi-label classification method according to claim 2, characterized in that the correlation between the two classes of classifiers depends on the original feature vectors and other label variables simultaneously for the learning of each of the two classes of classifiers.

4. The multi-label classification method according to claim 3, characterized in that the discriminant functions of the two classes of classifiers are convertible into:

wherein the weight matrixDeviation ofMatrix arrayMark up correlation vector

5. The multi-label classification method according to claim 4, characterized in that the features are selected as terms constraining a weight matrix WwⁱRow i of W.

6. The multi-label classification method according to claim 1, characterized in that the objective function expression of the semi-supervised multi-label learning model is:

s.t.c_qq＝1，q＝1，2，...，Q

wherein n and Q respectively represent the number of samples and the number of marks,a marking matrix representing the correspondence of marked samples, the marking matrixFrom n to_lMarking matrix corresponding to marked multi-marked samplesMark matrix corresponding to unmarked multi-marked sampleComposition of，The initial state is an all-zero matrix, F_i·Representing the marker vector, p, corresponding to the ith sample_iThe importance of the ith training sample is defined,in order to mark the correlation matrix, the correlation matrix is marked,in order to avoid over-fitting the regularization term,wⁱrow i of W, λ and η are balance parameters,in order to be a multi-label data matrix,tr (-) denotes the trace of the matrix,representing a vector with elements all 1.

7. The multi-label classification method according to claim 1, characterized in that the preset algorithm is an alternating iterative solution algorithm.

8. A multi-label classification system, comprising:

the problem conversion module is used for converting the multi-label classification problem into two types of classification problems, and each two types of classifiers corresponds to one label in the multi-label data set;

the model building module is used for building a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers;

the model solving module is used for solving the semi-supervised multi-label learning model by adopting a preset algorithm so as to obtain model parameters and a label correlation matrix of the semi-supervised multi-label classification;

and the mark prediction module predicts a mark set to which unknown marks belong according to the model parameters of the semi-supervised multi-mark classification and the mark correlation matrix.

9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1 to 7.

10. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method according to any of claims 1-7 when executing the program.