CN109947938A - Multi-label classification method, system, readable storage medium and computer device - Google Patents
Multi-label classification method, system, readable storage medium and computer device Download PDFInfo
- Publication number
- CN109947938A CN109947938A CN201910081373.9A CN201910081373A CN109947938A CN 109947938 A CN109947938 A CN 109947938A CN 201910081373 A CN201910081373 A CN 201910081373A CN 109947938 A CN109947938 A CN 109947938A
- Authority
- CN
- China
- Prior art keywords
- label
- semi
- matrix
- correlation
- classification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 239000011159 matrix material Substances 0.000 claims abstract description 59
- 239000003550 marker Substances 0.000 claims abstract description 23
- 230000006870 function Effects 0.000 claims description 24
- 239000013598 vector Substances 0.000 claims description 18
- 238000004590 computer program Methods 0.000 claims description 7
- 238000006243 chemical reaction Methods 0.000 claims description 5
- 230000014509 gene expression Effects 0.000 claims description 4
- 230000009286 beneficial effect Effects 0.000 abstract description 4
- 230000009467 reduction Effects 0.000 abstract description 4
- 238000003491 array Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000006399 behavior Effects 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明提供一种多标记分类方法、系统、可读存储介质及计算机设备,该方法包括:将多标记分类问题转化为两类分类问题,每个两类分类器对应多标记数据集中的一个标记;根据预设的特征选择和两类分类器之间的相关性,构建半监督多标记学习模型;采用预设算法对半监督多标记学习模型进行求解,以得到半监督多标记分类的模型参数和标记相关性矩阵;根据模型参数和标记相关性矩阵,预测未知标记所属的标记集。本发明当中的多标记分类方法,采用联合标记相关性和特征选择的半监督多标记学习方法,不仅有效地利用了大量的未标记多标记样本,而且也通过学习自动获得标记之间的相关性,此外对高维数据进行降维,有利于获得泛化性能更好的多标记分类器。
The present invention provides a multi-label classification method, system, readable storage medium and computer equipment. The method includes: converting a multi-label classification problem into two types of classification problems, each of which is corresponding to a label in the multi-label data set. ;Construct a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers; use a preset algorithm to solve the semi-supervised multi-label learning model to obtain the model parameters of the semi-supervised multi-label classification and marker correlation matrix; according to the model parameters and marker correlation matrix, predict the marker set to which the unknown marker belongs. The multi-label classification method in the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. , and dimensionality reduction for high-dimensional data is beneficial to obtain a multi-label classifier with better generalization performance.
Description
技术领域technical field
本发明涉及多标记学习技术领域,特别涉及一种多标记分类方法、系统、 可读存储介质及计算机设备。The present invention relates to the technical field of multi-label learning, and in particular, to a multi-label classification method, system, readable storage medium and computer device.
背景技术Background technique
机器学习专门研究计算机怎样模拟或实现人类的学习行为,是计算机学科 中一个重要的研究领域。监督学习是机器学习中研究得最多、应用最为广泛的 一种学习框架,其假设每个学习对象只隶属于一个标记。Machine learning specializes in how computers simulate or realize human learning behavior, and is an important research field in computer science. Supervised learning is the most studied and widely used learning framework in machine learning, which assumes that each learning object belongs to only one label.
但在现实世界中,一个标记往往不足以准确地描述一些复杂的语义对象。 由此可见,同时具有多个标记的复杂学习对象无处不在,而传统的监督学习方 法难以很好地处理这些复杂的学习对象。But in the real world, a token is often not enough to accurately describe some complex semantic objects. It can be seen that complex learning objects with multiple labels at the same time are ubiquitous, and traditional supervised learning methods are difficult to deal with these complex learning objects well.
为此,多标记学习应运而生,其任务是学习已知的多标记数据集,来预测 未知样本所属的标记集。其中,半监督多标记学习是多标记学习中一类典型, 其利用少量的已标记样本以及大量的未标记样本来训练获得多标记分类器。For this reason, multi-label learning comes into being, and its task is to learn the known multi-label data set to predict the label set to which the unknown sample belongs. Among them, semi-supervised multi-label learning is a typical type of multi-label learning, which uses a small number of labeled samples and a large number of unlabeled samples to train to obtain a multi-label classifier.
然而,现有技术当中,目前半监督多标记学习方法虽然考虑了标记间的相 关性但无法估计标记相关性且没有考虑多标记特征选择,也有些半监督多标记 学习方法考虑了多标记特征选择,但无法通过学习自动挖掘和利用标记相关性, 导致训练出的多标记分类器的泛化性能差。However, in the prior art, although the current semi-supervised multi-label learning method considers the correlation between labels, it cannot estimate the label correlation and does not consider multi-label feature selection, and some semi-supervised multi-label learning methods consider multi-label feature selection. , but cannot automatically mine and exploit label correlation through learning, resulting in poor generalization performance of the trained multi-label classifier.
发明内容SUMMARY OF THE INVENTION
基于此,本发明的目的是提供一种多标记分类方法、系统、可读存储介质 及计算机设备,以解决现有技术当中半监督多标记学习方法训练出的多标记分 类器的泛化性能差的技术问题。Based on this, the purpose of the present invention is to provide a multi-label classification method, system, readable storage medium and computer equipment to solve the poor generalization performance of the multi-label classifier trained by the semi-supervised multi-label learning method in the prior art technical issues.
根据本发明实施例的一种多标记分类方法,包括:A multi-label classification method according to an embodiment of the present invention includes:
将多标记分类问题转化为两类分类问题,每个两类分类器对应多标记数据 集中的一个标记;Convert the multi-label classification problem into a two-class classification problem, each of which corresponds to a label in the multi-label data set;
根据预设的特征选择和所述两类分类器之间的相关性,构建半监督多标记 学习模型;According to the preset feature selection and the correlation between the two types of classifiers, construct a semi-supervised multi-label learning model;
采用预设算法对所述半监督多标记学习模型进行求解,以得到所述半监督 多标记分类的模型参数和标记相关性矩阵;Using a preset algorithm to solve the semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of the semi-supervised multi-label classification;
根据所述半监督多标记分类的模型参数和标记相关性矩阵,预测未知标记 所属的标记集。According to the model parameters of the semi-supervised multi-label classification and the label correlation matrix, the label set to which the unknown label belongs is predicted.
另外,根据本发明上述实施例的一种多标记分类方法,还可以具有如下附 加的技术特征:In addition, according to a kind of multi-label classification method of the above-mentioned embodiment of the present invention, can also have the following additional technical features:
进一步地,所述将多标记分类问题转化为两类分类问题的步骤包括:Further, the step of converting the multi-label classification problem into a two-class classification problem includes:
将所述多标记分类问题转化为Q个两类分类器,第q个所述两类分类器的 判别函数定义为 The multi-label classification problem is transformed into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as
其中,Q表示所述多标记数据集中的标记个数,和分别表 示第q个所述两类分类器的判别函数所对应的权重和偏差。where Q represents the number of markers in the multi-label dataset, and respectively represent the weight and bias corresponding to the discriminant function of the qth classifier of the two types of classifiers.
进一步地,所述两类分类器之间的相关性为每个所述两类分类器的学习同 时依赖于原始特征向量和其他标记变量。Further, the correlation between the two types of classifiers is that the learning of each of the two types of classifiers depends on the original feature vector and other label variables at the same time.
进一步地,所述两类分类器的判别函数可转化为:Further, the discriminant functions of the two types of classifiers can be transformed into:
其中,权重矩阵偏差矩阵标记相关向量 Among them, the weight matrix Deviation matrix marker correlation vector
进一步地,所述特征选择为对权重矩阵W进行约束的项 wi为W的第i行。Further, the feature selection is an item that constrains the weight matrix W w i is the ith row of W.
进一步地,所述半监督多标记学习模型的目标函数表达式为:Further, the objective function expression of the semi-supervised multi-label learning model is:
其中,n和Q分别表示样本个数和标记个数, 表示已标记样本对应的标记矩阵,标记矩阵 由nl个已标记多标记样本对应的标记矩阵和未标记多标记 样本对应的标记矩阵Fu组成,Fu初始状态为全零矩阵,Fi·表示第i个样本对 应的标记向量,pi定义了第i个训练样本的重要性,为标记相关性矩阵,是为了避免过拟合的正则化项, wi为W的第i行,λ和η为平衡参数,为多标记数据矩阵, tr(·)表示矩阵的迹,表示元素全为1的向量。Among them, n and Q represent the number of samples and the number of markers, respectively, Represents the label matrix corresponding to the labeled sample, the label matrix Label matrix corresponding to n l labeled multilabel samples It is composed of the label matrix Fu corresponding to the unlabeled multi-label sample. The initial state of Fu is an all-zero matrix, F i represents the label vector corresponding to the ith sample, and p i defines the importance of the ith training sample. is the marker correlation matrix, is the regularization term to avoid overfitting, w i is the ith row of W, λ and η are balance parameters, is the multi-label data matrix, tr( ) represents the trace of the matrix, Represents a vector whose elements are all ones.
进一步地,所述预设算法为交替迭代求解算法。Further, the preset algorithm is an alternate iterative solution algorithm.
根据本发明实施例的一种多标记分类系统,包括:A multi-label classification system according to an embodiment of the present invention includes:
问题转化模块,用于将多标记分类问题转化为两类分类问题,每个两类分 类器对应多标记数据集中的一个标记;The problem conversion module is used to convert the multi-label classification problem into two-class classification problems, and each two-class classifier corresponds to a label in the multi-label data set;
模型构建模块,用于根据预设的特征选择和所述两类分类器之间的相关性, 构建半监督多标记学习模型;a model building module for building a semi-supervised multi-label learning model according to preset feature selection and the correlation between the two types of classifiers;
模型求解模块,用于采用预设算法对所述半监督多标记学习模型进行求解, 以得到所述半监督多标记分类的模型参数和标记相关性矩阵;a model solving module for solving the semi-supervised multi-label learning model by using a preset algorithm to obtain model parameters and a label correlation matrix of the semi-supervised multi-label classification;
标记预测模块,根据所述半监督多标记分类的模型参数和标记相关性矩阵, 预测未知标记所属的标记集。The marker prediction module predicts the marker set to which the unknown marker belongs according to the model parameters of the semi-supervised multi-marker classification and the marker correlation matrix.
本发明另一方法还提出一种计算机可读存储介质,其上存储有计算机程序, 该程序被处理器执行时实现如上述的方法Another method of the present invention also provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the above-mentioned method is implemented
本发明另一方法还提出一种计算机设备,包括存储器、处理器以及存储在 存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现 如上述的方法。Another method of the present invention also provides a computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor implementing the above-mentioned method when the processor executes the program.
本发明当中的多标记分类方法,采用联合标记相关性和特征选择的半监督 多标记学习方法,在多标记分类器设计的同时不仅考虑了利用大量的未标记数 据集,也考虑了标记相关性和多标记特征选择,这不仅有效地利用了大量的未 标记多标记样本,而且也通过学习自动获得标记之间的相关性,此外对高维数 据进行降维,有利于获得泛化性能更好的多标记分类器。The multi-label classification method in the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection. When designing the multi-label classifier, not only the use of a large number of unlabeled data sets, but also the label correlation is considered. and multi-label feature selection, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to obtain better generalization performance multi-label classifier.
附图说明Description of drawings
图1为本发明第一实施例中的多标记分类方法的流程图;1 is a flowchart of a multi-label classification method in a first embodiment of the present invention;
图2为本发明第二实施例中的多标记分类系统的流程图;Fig. 2 is the flow chart of the multi-label classification system in the second embodiment of the present invention;
图3为本发明第三实施例中的计算机设备的结构图。FIG. 3 is a structural diagram of a computer device in a third embodiment of the present invention.
主要元件符号说明:Description of main component symbols:
以下具体实施方式将结合上述附图进一步说明本发明。The following specific embodiments will further illustrate the present invention in conjunction with the above drawings.
具体实施方式Detailed ways
为了便于理解本发明,下面将参照相关附图对本发明进行更全面的描述。 附图中给出了本发明的若干实施例。但是,本发明可以以许多不同的形式来实 现,并不限于本文所描述的实施例。相反地,提供这些实施例的目的是使对本 发明的公开内容更加透彻全面。In order to facilitate understanding of the present invention, the present invention will be described more fully hereinafter with reference to the related drawings. Several embodiments of the invention are presented in the accompanying drawings. However, the present invention may be embodied in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete.
需要说明的是,当元件被称为“固设于”另一个元件,它可以直接在另一个元 件上或者也可以存在居中的元件。当一个元件被认为是“连接”另一个元件,它可 以是直接连接到另一个元件或者可能同时存在居中元件。本文所使用的术语“垂 直的”、“水平的”、“左”、“右”以及类似的表述只是为了说明的目的。It should be noted that when an element is referred to as being "fixed to" another element, it can be directly on the other element or intervening elements may also be present. When an element is referred to as being "connected" to another element, it can be directly connected to the other element or intervening elements may also be present. The terms "vertical," "horizontal," "left," "right," and similar expressions are used herein for illustrative purposes only.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术 领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术 语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的 术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
请参阅图1,所示为本发明第一实施例中的多标记分类方法,包括步骤S01 至步骤S04。Please refer to FIG. 1, which shows the multi-label classification method in the first embodiment of the present invention, including steps S01 to S04.
步骤S01,将多标记分类问题转化为两类分类问题,每个两类分类器对应多 标记数据集中的一个标记。In step S01, the multi-label classification problem is converted into a two-class classification problem, and each two-class classifier corresponds to a label in the multi-label data set.
具体地,将多标记分类问题转化为两类分类问题的步骤包括:Specifically, the steps of transforming a multi-label classification problem into a two-class classification problem include:
将所述多标记分类问题转化为Q个两类分类器,第q个所述两类分类器的 判别函数定义为其中,Q表示所述多标记数据集中的标记个 数,和分别表示第q个所述两类分类器的判别函数所对应的权 重和偏差。The multi-label classification problem is transformed into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as where Q represents the number of markers in the multi-label dataset, and respectively represent the weight and bias corresponding to the discriminant function of the qth classifier of the two types of classifiers.
步骤S02,根据预设的特征选择和所述两类分类器之间的相关性,构建半监 督多标记学习模型。Step S02, constructing a semi-supervised multi-label learning model according to the preset feature selection and the correlation between the two types of classifiers.
其中,所述两类分类器之间的相关性为:每个所述两类分类器的学习同时 依赖于原始特征向量和其他标记变量。The correlation between the two types of classifiers is: the learning of each of the two types of classifiers depends on the original feature vector and other label variables at the same time.
可以理解的,由于每个标记所对应的分类器的学习同时依赖于原始特征向 量和其他标记变量,这样使得相互独立的两类分类器可以同时学习,而且多标 记分类和标记相关性也可以同时学习。基于此,第q个所述两类分类器的所对应 的判别函数可转化为:Understandably, since the learning of the classifier corresponding to each tag depends on the original feature vector and other tag variables at the same time, two types of classifiers that are independent of each other can be learned at the same time, and multi-tag classification and tag correlation can also be simultaneously learned. study. Based on this, the corresponding discriminant function of the qth two-class classifier can be transformed into:
其中,权重矩阵偏差矩阵标记相关向量 Among them, the weight matrix Deviation matrix marker correlation vector
此外,为了降低高维数据特征空间的维数,所述特征选择为:对权重矩阵W 进行约束的项wi为W的第i行。In addition, in order to reduce the dimension of the feature space of high-dimensional data, the feature selection is: the term that constrains the weight matrix W w i is the ith row of W.
具体来说,本步骤S02的目的在于:将半监督多标记分类模型参数、标记 相关性以及半监督多标记特征选择统一到同一个模型框架中进行联合学习,以 构建半监督多标记学习模型。Specifically, the purpose of this step S02 is to unify the semi-supervised multi-label classification model parameters, label correlation and semi-supervised multi-label feature selection into the same model framework for joint learning to construct a semi-supervised multi-label learning model.
其中,所述半监督多标记学习模型的目标函数表达为:Wherein, the objective function of the semi-supervised multi-label learning model is expressed as:
其中,n和Q分别表示样本个数和标记个数, 表示已标记样本对应的标记矩阵,标记矩阵 由nl个已标记多标记样本对应的标记矩阵和未标记多标记 样本对应的标记矩阵Fu组成,Fu初始状态为全零矩阵,Fi·表示第i个样本对 应的标记向量,pi定义了第i个训练样本的重要性,为标记相关性矩阵,是为了避免过拟合的正则化项, wi为W的第i行,λ和η为平衡参数。Among them, n and Q represent the number of samples and the number of markers, respectively, Represents the label matrix corresponding to the labeled sample, the label matrix Label matrix corresponding to n l labeled multilabel samples It is composed of the label matrix Fu corresponding to the unlabeled multi-label sample. The initial state of Fu is an all-zero matrix, F i represents the label vector corresponding to the ith sample, and p i defines the importance of the ith training sample. is the marker correlation matrix, is the regularization term to avoid overfitting, w i is the ith row of W, and λ and η are the balance parameters.
基于此,模型可进一步表示为:Based on this, the model can be further expressed as:
其中,为多标记数据矩阵,tr(·)表示矩阵的迹,表示元素全为1的向 量。in, is the multi-label data matrix, tr( ) represents the trace of the matrix, Represents a vector whose elements are all ones.
步骤S03,采用预设算法对所述半监督多标记学习模型进行求解,以得到所 述半监督多标记分类的模型参数和标记相关性矩阵。Step S03, using a preset algorithm to solve the semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of the semi-supervised multi-label classification.
其中,所述预设算法为交替迭代求解算法。Wherein, the preset algorithm is an alternate iterative solution algorithm.
具体交替迭代求解算法包括步骤(一)至步骤(三):The specific alternate iterative solution algorithm includes steps (1) to (3):
步骤(一):固定C和F,求W和bStep (1): Fix C and F, find W and b
当给定C和F时,可通过优化求解如下目标函数来得到W和b。When C and F are given, W and b can be obtained by optimally solving the following objective function.
由于||W||2,1不可导,此目标函数是非光滑凸问题,本发明将其转化为如下 等价的约束光滑凸优化问题,并采用基于Nesterov的加速投影梯度方法来求解。Since ||W|| 2 , 1 is not differentiable, this objective function is a non-smooth convex problem. The present invention transforms it into the following equivalent constrained smooth convex optimization problem, and uses the Nesterov-based accelerated projected gradient method to solve it.
其中,Θ={θ∈E(d+1)×Q|θ=(WT,b)T,||W||2,1≤r,W∈Rd×Q,b∈RQ}为凸 集,r为l2,1球的半径。where Θ={θ∈E (d+1)×Q |θ=(W T , b) T , ||W|| 2,1 ≤r, W∈R d×Q , b∈R Q } is Convex set, r is l 2 , 1 the radius of the sphere.
令表示第t次迭代的搜索点,其中αt为调整参数。make represents the search point of the t-th iteration, where α t is an adjustment parameter.
对于定义for definition
其中,表示函数g(·)关于θ在点上导数, 关于θ是强凸的。因此,在第t+1次迭代中Θt+1的近似解可根据如 下式子计算得到。in, Represents the function g( ) with respect to θ at the point upper derivative, is strongly convex with respect to θ. Therefore, the approximate solution of Θ t+1 in the t+1th iteration can be calculated according to the following equation.
因此,wt+1可计算如下式子得到:Therefore, w t+1 can be calculated as follows:
其中,Ω={W∈Rd×Q|||W||2,1≤r]是封闭凸集,πΩ(·)为到凸集Ω上的欧式 投影。Among them, Ω={W∈R d×Q |||W|| 2,1 ≤r] is a closed convex set, and π Ω (·) is the Euclidean projection onto the convex set Ω.
bt+1可计算如下式子得到:b t+1 can be calculated as follows:
即 which is
步骤(二):固定W,b和F,求CStep (2): Fix W, b and F, and find C
当给定W,b和F时,可通过优化求解如下目标函数来得到C。When W, b and F are given, C can be obtained by optimally solving the following objective function.
C中每个分量可以通过优化如下目标函数获得:Each component in C can be obtained by optimizing the following objective function:
步骤(三):固定W,b和C,求FStep (3): Fix W, b and C, and find F
优化W,b和C之后,可以计算因此,每次迭代求解过 程中,标记矩阵F中未标记样本所对应的标记值可根据如下定义的公式来调整。After optimizing W, b and C, it is possible to calculate Therefore, in each iterative solution process, the label value corresponding to the unlabeled sample in the label matrix F can be adjusted according to the formula defined below.
步骤S04,根据所述半监督多标记分类的模型参数和标记相关性矩阵,预测 未知标记所属的标记集。Step S04, according to the model parameters of the semi-supervised multi-label classification and the label correlation matrix, predict the label set to which the unknown label belongs.
举例来说,假设给定一个未知样本xu,其在第q个标记上的判别函数值为:其预测的标记向量为: h(xu)=sign(f1(xu),...,fQ(xu)),sign(·)为符号函数。For example, given an unknown sample x u , its discriminant function value on the qth marker is: The predicted sign vector is: h(x u )=sign(f 1 (x u ), . . . , f Q (x u )), where sign(·) is a sign function.
综上,本发明上述实施例当中的多标记分类方法,采用联合标记相关性和 特征选择的半监督多标记学习方法,在多标记分类器设计的同时不仅考虑了利 用大量的未标记数据集,也考虑了标记相关性和多标记特征选择,这不仅有效 地利用了大量的未标记多标记样本,而且也通过学习自动获得标记之间的相关 性,此外对高维数据进行降维,有利于获得泛化性能更好的多标记分类器。In summary, the multi-label classification method in the above-mentioned embodiments of the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, and not only considers the use of a large number of unlabeled data sets while designing the multi-label classifier, Label correlation and multi-label feature selection are also considered, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to Obtain a multi-label classifier with better generalization performance.
本发明另一方面还提供一种多标记分类系统,请查阅图2,所示为本发明第 三实施例中的多标记分类系统,包括:Another aspect of the present invention also provides a multi-label classification system, please refer to FIG. 2, which shows the multi-label classification system in the third embodiment of the present invention, including:
问题转化模块11,用于将多标记分类问题转化为两类分类问题,每个两类 分类器对应多标记数据集中的一个标记;The problem conversion module 11 is used to convert the multi-label classification problem into two-class classification problems, and each two-class classifier corresponds to a label in the multi-label data set;
模型构建模块12,用于根据预设的特征选择和所述两类分类器之间的相关 性,构建半监督多标记学习模型;Model building module 12, for constructing a semi-supervised multi-label learning model according to preset feature selection and the correlation between the two types of classifiers;
模型求解模块13,用于采用预设算法对所述半监督多标记学习模型进行求 解,以得到所述半监督多标记分类的模型参数和标记相关性矩阵;Model solving module 13, for adopting preset algorithm to solve described semi-supervised multi-label learning model, to obtain the model parameters and label correlation matrix of described semi-supervised multi-label classification;
标记预测模块14,根据所述半监督多标记分类的模型参数和标记相关性矩 阵,预测未知标记所属的标记集。The marker prediction module 14 predicts the marker set to which the unknown marker belongs according to the model parameters of the semi-supervised multi-marker classification and the marker correlation matrix.
进一步地,所述问题转化模块11还可以用于将所述多标记分类问题转化为 Q个两类分类器,第q个所述两类分类器的判别函数定义为 Further, the problem conversion module 11 can also be used to convert the multi-label classification problem into Q two-class classifiers, and the discriminant function of the q-th two-class classifier is defined as:
其中,Q表示所述多标记数据集中的标记个数,和分别表 示第q个所述两类分类器的判别函数所对应的权重和偏差。where Q represents the number of markers in the multi-label dataset, and respectively represent the weight and bias corresponding to the discriminant function of the qth classifier of the two types of classifiers.
进一步地,所述两类分类器之间的相关性为每个所述两类分类器的学习同 时依赖于原始特征向量和其他标记变量。Further, the correlation between the two types of classifiers is that the learning of each of the two types of classifiers depends on the original feature vector and other label variables at the same time.
进一步地,所述两类分类器的判别函数可转化为:Further, the discriminant functions of the two types of classifiers can be transformed into:
其中,权重矩阵偏差矩阵标记相关向量 Among them, the weight matrix Deviation matrix marker correlation vector
进一步地,所述特征选择为:对权重矩阵W进行约束的项 wi为W的第i行。Further, the feature selection is: an item that constrains the weight matrix W w i is the ith row of W.
进一步地,所述半监督多标记学习模型的目标函数表达式为:Further, the objective function expression of the semi-supervised multi-label learning model is:
其中,n和Q分别表示样本个数和标记个数,表示已标记样本对应的标记矩阵,标记矩阵由nl个已标记多标记样本对应的标记矩阵和未标记多标记 样本对应的标记矩阵Fu组成,Fu初始状态为全零矩阵,Fi·表示第i个样本对 应的标记向量,pi定义了第i个训练样本的重要性,为标记相关性矩阵,是为了避免过拟合的正则化项, wi为W的第i行,λ和η为平衡参数, 为多标记数据矩阵, tr(·)表示矩阵的迹,表示元素全为1的向量。Among them, n and Q represent the number of samples and the number of markers, respectively, Represents the label matrix corresponding to the labeled sample, the label matrix Label matrix corresponding to n l labeled multilabel samples It is composed of the label matrix Fu corresponding to the unlabeled multi-label sample. The initial state of Fu is an all-zero matrix, F i represents the label vector corresponding to the ith sample, and p i defines the importance of the ith training sample. is the marker correlation matrix, is the regularization term to avoid overfitting, w i is the ith row of W, λ and η are balance parameters, is the multi-label data matrix, tr( ) represents the trace of the matrix, Represents a vector whose elements are all ones.
进一步地,所述预设算法为交替迭代求解算法。Further, the preset algorithm is an alternate iterative solution algorithm.
综上,本发明上述实施例当中的多标记分类系统,采用联合标记相关性和 特征选择的半监督多标记学习方法,在多标记分类器设计的同时不仅考虑了利 用大量的未标记数据集,也考虑了标记相关性和多标记特征选择,这不仅有效 地利用了大量的未标记多标记样本,而且也通过学习自动获得标记之间的相关 性,此外对高维数据进行降维,有利于获得泛化性能更好的多标记分类器。To sum up, the multi-label classification system in the above-mentioned embodiments of the present invention adopts the semi-supervised multi-label learning method of joint label correlation and feature selection, and not only considers the use of a large number of unlabeled data sets while designing the multi-label classifier, Label correlation and multi-label feature selection are also considered, which not only effectively utilizes a large number of unlabeled multi-label samples, but also automatically obtains the correlation between labels through learning. In addition, dimensionality reduction for high-dimensional data is beneficial to Obtain a multi-label classifier with better generalization performance.
本发明另一方法还提出一种计算机可读存储介质,其上存储有计算机程序, 该程序被处理器执行时实现如上述的多标记分类方法Another method of the present invention further provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, implements the above-mentioned multi-label classification method
本发明另一方法还提出一种计算机设备,请参阅图3,所示为本发明第三实 施例当中的计算机设备,包括存储器10、处理器20以及存储在存储器20上并 可在处理器10上运行的计算机程序30,所述处理器20执行所述程序30时实现 如上述的多标记分类方法。Another method of the present invention further proposes a computer device, please refer to FIG. 3 , which shows the computer device in the third embodiment of the present invention, including a memory 10 , a processor 20 , and a computer device stored in the memory 20 and available in the processor 10 . A computer program 30 running on the processor 20 implements the multi-label classification method described above when the processor 20 executes the program 30.
本领域技术人员可以理解,在流程图中表示或在此以其他方式描述的逻辑 和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表, 可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如 基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设 备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而 使用。就本说明书而言,“计算机可读介质”可以是任何可以包含、存储、通信、 传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装 置或设备而使用的装置。Those skilled in the art will appreciate that logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing logical functions, may be embodied in in any computer-readable medium for use by an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch and execute instructions from an instruction execution system, apparatus, or device), or Used in conjunction with these instruction execution systems, apparatus or devices. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in conjunction with an instruction execution system, apparatus, or apparatus.
计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或 多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取 存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM 或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外, 计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因 为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时 以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机 存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as it may be, for example, by optically scanning the paper or other medium, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.
应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。 在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执 行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方 式中一样,可用本领域公知的下列技术中的任一项或它们的组合来实现:具有 用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合 逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA) 等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具 体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结 构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中, 对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具 体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适 的方式结合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细, 但并不能因此而理解为对本发明专利范围的限制。应当指出的是,对于本领域 的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和 改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附 权利要求为准。The above-mentioned embodiments only represent several embodiments of the present invention, and the descriptions thereof are specific and detailed, but should not be construed as limiting the scope of the patent of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present invention, some modifications and improvements can be made, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention should be governed by the appended claims.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910081373.9A CN109947938A (en) | 2019-01-28 | 2019-01-28 | Multi-label classification method, system, readable storage medium and computer device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910081373.9A CN109947938A (en) | 2019-01-28 | 2019-01-28 | Multi-label classification method, system, readable storage medium and computer device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109947938A true CN109947938A (en) | 2019-06-28 |
Family
ID=67006598
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910081373.9A Pending CN109947938A (en) | 2019-01-28 | 2019-01-28 | Multi-label classification method, system, readable storage medium and computer device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109947938A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110781295A (en) * | 2019-09-09 | 2020-02-11 | 河南师范大学 | A method and device for feature selection of multi-labeled data |
CN113051452A (en) * | 2021-04-12 | 2021-06-29 | 清华大学 | Operation and maintenance data feature selection method and device |
CN117893839A (en) * | 2024-03-15 | 2024-04-16 | 华东交通大学 | A multi-label classification method and system based on graph attention mechanism |
CN118427581A (en) * | 2024-07-05 | 2024-08-02 | 华东交通大学 | Weak supervision mark distribution feature selection method and system based on mark correlation |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107256411A (en) * | 2017-05-27 | 2017-10-17 | 南京师范大学 | The multiple labeling data classification method of feature selecting and mark correlation combination learning |
CN107330448A (en) * | 2017-06-09 | 2017-11-07 | 南京师范大学 | A kind of combination learning method based on mark covariance and multiple labeling classification |
-
2019
- 2019-01-28 CN CN201910081373.9A patent/CN109947938A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107256411A (en) * | 2017-05-27 | 2017-10-17 | 南京师范大学 | The multiple labeling data classification method of feature selecting and mark correlation combination learning |
CN107330448A (en) * | 2017-06-09 | 2017-11-07 | 南京师范大学 | A kind of combination learning method based on mark covariance and multiple labeling classification |
Non-Patent Citations (2)
Title |
---|
何志芬: "结合标记相关性的多标记分类算法研究", 《中国博士学位论文全文数据库 I140-35 信息科技辑(月刊)》 * |
蔡亚萍: "结合标记相关性的多标记特征选择及分类算法研究", 《中国优秀硕士学位全文数据库 I140-259 信息科技辑》 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110781295A (en) * | 2019-09-09 | 2020-02-11 | 河南师范大学 | A method and device for feature selection of multi-labeled data |
CN110781295B (en) * | 2019-09-09 | 2023-04-07 | 河南师范大学 | Multi-label data feature selection method and device |
CN113051452A (en) * | 2021-04-12 | 2021-06-29 | 清华大学 | Operation and maintenance data feature selection method and device |
CN113051452B (en) * | 2021-04-12 | 2022-04-26 | 清华大学 | Operation and maintenance data feature selection method and device |
CN117893839A (en) * | 2024-03-15 | 2024-04-16 | 华东交通大学 | A multi-label classification method and system based on graph attention mechanism |
CN117893839B (en) * | 2024-03-15 | 2024-06-07 | 华东交通大学 | Multi-label classification method and system based on graph attention mechanism |
CN118427581A (en) * | 2024-07-05 | 2024-08-02 | 华东交通大学 | Weak supervision mark distribution feature selection method and system based on mark correlation |
CN118427581B (en) * | 2024-07-05 | 2024-10-18 | 华东交通大学 | Weakly supervised label distribution feature selection method and system based on label correlation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhang et al. | Leveraging prior-knowledge for weakly supervised object detection under a collaborative self-paced curriculum learning framework | |
Bao et al. | Boosted near-miss under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets | |
Wang et al. | Parametric local metric learning for nearest neighbor classification | |
Deselaers et al. | Weakly supervised localization and learning with generic knowledge | |
Yang et al. | Improving multi-label learning with missing labels by structured semantic correlations | |
Lian et al. | Max-margin dictionary learning for multiclass image categorization | |
CN109947938A (en) | Multi-label classification method, system, readable storage medium and computer device | |
CN103605984B (en) | Indoor scene sorting technique based on hypergraph study | |
Du et al. | Semi-siamese training for shallow face learning | |
US20140015855A1 (en) | Systems and methods for creating a semantic-driven visual vocabulary | |
CN105787513A (en) | Transfer learning design method and system based on domain adaptation under multi-example multi-label framework | |
US20050097068A1 (en) | Invariant pattern recognition | |
Kumar et al. | Bird species classification using transfer learning with multistage training | |
CN108898181A (en) | Image classification model processing method and device and storage medium | |
CN107330448A (en) | A kind of combination learning method based on mark covariance and multiple labeling classification | |
Wei et al. | Prototypical classifier for robust class-imbalanced learning | |
Chen et al. | Active expansion sampling for learning feasible domains in an unbounded input space | |
Zhou et al. | Classify multi-label images via improved CNN model with adversarial network | |
CN116150411A (en) | A zero-shot cross-modal retrieval method based on adaptive class-dependent discrete hashing | |
CN108108769B (en) | Data classification method, device and storage medium | |
Shahriyar et al. | An approach for multi label image classification using single label convolutional neural network | |
Nie et al. | Multi-label image recognition with attentive transformer-localizer module | |
CN104036021A (en) | Method for semantically annotating images on basis of hybrid generative and discriminative learning models | |
CN113610106B (en) | Feature compatible learning method and device between models, electronic equipment and medium | |
CN118334466A (en) | Image labeling method based on partial multi-mark learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190628 |