CN111814894A - A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk - Google Patents
A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk Download PDFInfo
- Publication number
- CN111814894A CN111814894A CN202010695386.8A CN202010695386A CN111814894A CN 111814894 A CN111814894 A CN 111814894A CN 202010695386 A CN202010695386 A CN 202010695386A CN 111814894 A CN111814894 A CN 111814894A
- Authority
- CN
- China
- Prior art keywords
- view
- view data
- data
- matrix
- transition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000005295 random walk Methods 0.000 title claims abstract description 18
- 239000011159 matrix material Substances 0.000 claims abstract description 69
- 230000007704 transition Effects 0.000 claims abstract description 47
- 238000012360 testing method Methods 0.000 claims abstract description 8
- 238000004364 calculation method Methods 0.000 claims description 24
- 230000006870 function Effects 0.000 claims description 11
- 238000004422 calculation algorithm Methods 0.000 description 8
- 238000011160 research Methods 0.000 description 3
- 238000012549 training Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000007786 learning performance Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/15—Correlation function computation including computation of convolution operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computing Systems (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Algebra (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明涉及一种快速种子随机游走的多视角半监督分类方法,首先,采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;然后,根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态并计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;最后,迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵来生成用于测试的多视角数据的类别标签。本发明仅使用少量的监督信息,就能够对图像、文本、视频等各类型数据进行准确有效的分类,具有一定的实用价值。
The invention relates to a multi-view semi-supervised classification method for fast seed random walk. First, a Gaussian kernel function is used to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data; The learned class labels of the multi-view data establish the initial distribution state of each view of the multi-view data and calculate the arrival probability matrix of the first transition state of each view of the multi-view data; finally, iteratively calculate the multi-view data. The arrival probability matrix of multiple transition states of each view, and the weighted summation of the arrival probability matrices of all transition states of each view to obtain the reward matrix of each view to generate the class label of the multi-view data for testing. The present invention can accurately and effectively classify various types of data such as images, texts, and videos by using only a small amount of supervision information, and has certain practical value.
Description
技术领域technical field
本发明涉及多视角和半监督学习领域,特别是一种快速种子随机游走的多视角半监督分类方法。The invention relates to the field of multi-view and semi-supervised learning, in particular to a multi-view semi-supervised classification method with fast seed random walk.
背景技术Background technique
多视角数据在实际应用中极为普遍,例如多摄像机图像收集和多模信息获取。从异构源收集的数据通常包含大量冗余和不相关的信息,这可能会使学习算法的性能下降。在多视角数据中,每个视角捕获部分信息而不是完整信息。完整的表示形式是潜在的和多余的,因此很难为要学习的任务提取有用的信息。另一方面,对所有多视角特征进行单视角学习或简单串联通常是无效的。因此,学习足够的判别性特征并使用多视角驱动算法从这些数据中挖掘有效的信息至关重要。Multi-view data is extremely common in practical applications, such as multi-camera image collection and multi-modal information acquisition. Data collected from heterogeneous sources often contain a lot of redundant and irrelevant information, which may degrade the performance of learning algorithms. In multi-view data, each view captures partial information rather than complete information. The full representation is latent and redundant, making it difficult to extract useful information for the task to be learned. On the other hand, single-view learning or simple concatenation of all multi-view features is usually ineffective. Therefore, it is crucial to learn enough discriminative features and use multi-view-driven algorithms to mine effective information from these data.
近年来,随机游走理论有了突破性进展。称为PageRank的经典算法在Web搜索中起着至关重要的作用,其中所有Web的重要性都使用Web链接结构进行排名。之后,许多个性化的PageRank方法被提出来解决各种学习任务。Random walk with restart(RWR)是一种代表性方案,可在无向加权图中的任意两个节点之间提供良好的相关性评分等级。根据类别标签信息是否可用,随机游走主要可以分为无监督方法和有监督方法。前者将重点放在数据聚类和解决无监督学习任务上,例如图像分割,后者旨在解决分类任务及其相应的特定应用。这些研究工作表明随机游走在各种实际应用中具有广泛的适用性和高度的兼容性。In recent years, there have been breakthroughs in random walk theory. The classic algorithm called PageRank plays a vital role in web search, where all the importance of the web is ranked using the web link structure. After that, many personalized PageRank methods have been proposed to solve various learning tasks. Random walk with restart (RWR) is a representative scheme that provides a good correlation score between any two nodes in an undirected weighted graph. According to whether the class label information is available, random walks can be mainly divided into unsupervised methods and supervised methods. The former focuses on data clustering and solving unsupervised learning tasks, such as image segmentation, while the latter aims to solve classification tasks and their corresponding specific applications. These research works demonstrate the wide applicability and high compatibility of random walks in various practical applications.
多视角学习旨在从多视角数据中挖掘有用的模式。先前的大量研究表明,多视角学习可以充分利用多视角数据中的相似性和互补性信息,并且比单视角学习具有更好的生成能力和优越的性能。作为最早的多视角学习代表性范例之一,在无监督模式下的协同训练最大化了两个不同视角之间的相互一致性。后来,普通的无监督多视角学习方法受到越来越多的关注,包括无监督多视角特征表示和多视角聚类。相应地,有监督的多视角学习方法也已经吸引了来自不同领域的越来越多的研究兴趣,例如图像分类,手势识别和图像注释。然而,标记足够的多视角训练数据是难以实现且费时的,而仅一小部分标记的样本容易获得,仅有少量数据不足以完全进行监督学习,而对于无监督学习则利用不足。作为两种学习范式的折中方案,多视角半监督分类可以利用有限比例的标记样本来提高性能。为此,学界提出了多视角半监督学习以最大程度利用小比例标记数据点来挖掘有效信息。但目前为止,对于多视角半监督分类所做的工作仍然有限,对于此问题,需要进行更多的研究。Multi-view learning aims to mine useful patterns from multi-view data. Numerous previous studies have shown that multi-view learning can fully exploit the similarity and complementarity information in multi-view data, and has better generative ability and superior performance than single-view learning. As one of the earliest representative paradigms of multi-view learning, co-training in an unsupervised mode maximizes the mutual consistency between two different views. Later, ordinary unsupervised multi-view learning methods received more and more attention, including unsupervised multi-view feature representation and multi-view clustering. Correspondingly, supervised multi-view learning methods have also attracted increasing research interest from different fields, such as image classification, gesture recognition, and image annotation. However, labeling enough multi-view training data is difficult and time-consuming, while only a small number of labeled samples are readily available, only a small amount of data is not sufficient for fully supervised learning, and underutilized for unsupervised learning. As a compromise between the two learning paradigms, multi-view semi-supervised classification can leverage a limited proportion of labeled samples to improve performance. To this end, the academic community proposes multi-view semi-supervised learning to maximize the use of a small proportion of labeled data points to mine effective information. But so far, the work done on multi-view semi-supervised classification is still limited, and more research is needed for this problem.
目前的多视角半监督学习算法主要分为两类。第一种方法是基于子空间的方法,通常将给定的标签矩阵嵌入目标函数中作为一个共同的回归目标,学习一个潜在的低维子空间来投影输入数据,以挖掘视角之间的共性。例如,Nie等人对一个凸问题进行建模以避免局部极小值,并提出了一种新的自适应权值学习策略来学习投影矩阵。Xue等人将标签信息引入深度矩阵分解,并学习相关预测子空间用于不完全多视角数据分类。第二种模式是基于图的模型,它将每个数据点视为一个由多个视角融合而成的联合图的顶点,并通过加权边将标签信息从标记样本传播到未标记样本。作为一种早期的基于图的方法,Karasuyama提出了一种标签传播环境下的多图集成方法,该方法通过权值正则化学习稀疏多视角权重,将多个图线性组合。相比之下,Nie等人从先验图结构中学习权重来融合多视角,而不是通过权重正则化来学习融合权重。目前,已有多种多视角半监督学习方法被开发出来,并在具体应用中表明了它们的有效性。但是,多视角半监督分类在很大程度上尚未得到充分的研究,这些模型仍然存在一些缺陷。高计算复杂度是大多数算法在面对大规模学习问题时需要解决的一个主要限制,因此需要做更多的工作。此外,有限的学习性能要求算法在只有少量的标记数据可用于模型训练时挖掘更有效的模式。所以,更有效、更高效的多视角半监督学习方法仍有待开发。The current multi-view semi-supervised learning algorithms are mainly divided into two categories. The first method is a subspace-based method, which usually embeds a given label matrix in the objective function as a common regression objective, and learns a latent low-dimensional subspace to project the input data to mine commonalities between viewpoints. For example, Nie et al. model a convex problem to avoid local minima and propose a new adaptive weight learning strategy to learn the projection matrix. Xue et al. introduced label information into deep matrix factorization and learned a relevant prediction subspace for incomplete multi-view data classification. The second mode is a graph-based model, which treats each data point as a vertex of a joint graph fused from multiple viewpoints and propagates label information from labeled to unlabeled samples through weighted edges. As an early graph-based method, Karasuyama proposed a multi-graph ensemble method in the context of label propagation, which learns sparse multi-view weights through weight regularization to linearly combine multiple graphs. In contrast, Nie et al. learn the weights from the prior graph structure to fuse multi-views instead of learning the fusion weights through weight regularization. Currently, a variety of multi-view semi-supervised learning methods have been developed and their effectiveness has been demonstrated in specific applications. However, multi-view semi-supervised classification is largely understudied, and these models still suffer from some flaws. High computational complexity is a major limitation that most algorithms need to address when faced with large-scale learning problems, so more work is required. Furthermore, limited learning performance requires algorithms to mine more efficient patterns when only a small amount of labeled data is available for model training. Therefore, more effective and efficient multi-view semi-supervised learning methods remain to be developed.
发明内容SUMMARY OF THE INVENTION
有鉴于此,本发明提出一种快速种子随机游走的多视角半监督分类方法,能够对文本、图像以及视频等各类型的数据集进行准确有效的分类,在仅利用少量监督信息的情况下,在各个数据集上均取得了较高的分类性能,具有一定的实用价值。In view of this, the present invention proposes a multi-view semi-supervised classification method with fast seed random walk, which can accurately and effectively classify various types of data sets such as text, images, and videos, and only uses a small amount of supervision information. , has achieved high classification performance on each dataset, and has certain practical value.
本发明采用以下方案实现:一种快速种子随机游走的多视角半监督分类方法,包括以下步骤:The present invention adopts the following scheme to realize: a multi-view semi-supervised classification method of fast seed random walk, comprising the following steps:
步骤S1:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;Step S1: using a Gaussian kernel function to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data;
步骤S2:根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态;Step S2: establishing the initial distribution state of each view of the multi-view data according to the category label of the multi-view data for semi-supervised learning;
步骤S3:根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;Step S3: Calculate the arrival probability matrix of the first transition state of each view of the multi-view data according to the initial distribution state of each view of the multi-view data established in S2;
步骤S4:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵;Step S4: iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data and perform a weighted summation of the arrival probability matrices of all the transition states of each view to obtain the reward matrix of each view;
步骤S5:根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签。Step S5: Predict the category label of each multi-view data for testing according to the reward matrix of each view of the multi-view data calculated in S4.
进一步地,所述步骤S1具体包括以下步骤:Further, the step S1 specifically includes the following steps:
步骤S11:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵,计算公式为:Step S11 : using the Gaussian kernel function to calculate the similarity matrix of each angle of view of the input multi-angle data, the calculation formula is:
其中[Wt]ij为第t个视角下的第i个和第j个数据点之间的相似性,为第t个视角下的第i个数据点,σ为带宽,控制高斯核函数的局部作用范围;where [W t ] ij is the similarity between the i-th and j-th data points under the t-th view, is the i-th data point under the t-th viewing angle, and σ is the bandwidth, which controls the local scope of the Gaussian kernel function;
步骤S12:计算多视角数据的每个视角的转移概率矩阵,计算公式为:Step S12: Calculate the transition probability matrix of each view of the multi-view data, and the calculation formula is:
其中,为对角矩阵,且Pt为多视角数据的第t个视角的转移概率矩阵。in, is a diagonal matrix, and P t is the transition probability matrix of the t-th view of the multi-view data.
进一步地,步骤S2中所述根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态,计算公式为:Further, according to the category label of the multi-view data used for semi-supervised learning in step S2, the initial distribution state of each view of the multi-view data is established, and the calculation formula is:
[Qt](0)=[Q0,O],[Q t ] (0) = [Q 0 , O],
其中,为全0矩阵,c为多视角数据的类别数目,N为多视角数据的总数目,n为用于半监督学习的多视角数据的数目,[Qy](0)为多视角数据第t个视角的起始分布状态,Q0的计算公式如下:in, is an all-zero matrix, c is the number of categories of multi-view data, N is the total number of multi-view data, n is the number of multi-view data used for semi-supervised learning, [Q y ] (0) is the start of the t-th view of multi-view data Distribution state, the calculation formula of Q 0 is as follows:
其中Ci为第i个类别,若多视角数据的第j个数据点xj属于第i个类别Ci,则[Q0]ij的值为1,反之为0。Among them, C i is the ith category. If the jth data point x j of the multi-view data belongs to the ith category C i , the value of [Q 0 ] ij is 1, otherwise, it is 0.
进一步地,步骤S3中所述根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵,计算公式为:Further, in step S3, the arrival probability matrix of the first transition state of each view of the multi-view data is calculated according to the initial distribution state of each view of the multi-view data established in S2, and the calculation formula is:
其中α为根据先验知识确定的重启概率,[Qt](1)为多视角数据的第t个视角在起始分布状态下的第一次转移状态的到达概率矩阵。in α is the restart probability determined according to prior knowledge, [Q t ] (1) is the arrival probability matrix of the first transition state of the t-th view of the multi-view data in the initial distribution state.
进一步地,所述步骤S4具体包括以下步骤:Further, the step S4 specifically includes the following steps:
步骤S41:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,计算公式如下:Step S41: Iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data, and the calculation formula is as follows:
[Qt](k)=(1-α)[Qt](k-l)Pt+α[Qt](0)for k≥2,[Q t ] (k) = (1-α)[Q t ] (kl) P t +α[Q t ] (0) for k≥2,
其中,[Qt](k)为多视角数据第t个视角的第k次转移状态的到达概率矩阵;Among them, [Q t ] (k) is the arrival probability matrix of the k-th transition state of the t-th view of the multi-view data;
步骤S42:对多视角数据的每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵,计算公式如下:Step S42: Weighted summation is performed on the arrival probability matrices of all transition states of each view of the multi-view data to obtain the reward matrix of each view, and the calculation formula is as follows:
其中,顀为根据先验知识确定的转移步数的数目,[Rt](s)为多视角数据的第t个视角在转移步数顀下的奖励矩阵,γ为根据先验知识确定的衰变因子。Among them, J is the number of transition steps determined according to the prior knowledge, [R t ] (s) is the reward matrix of the t-th view of the multi-view data under the number of transition steps, and γ is determined according to the prior knowledge. decay factor.
进一步地,步骤S5中所述根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签,具体计算公式如下:Further, according to the reward matrix of each view of the multi-view data calculated in step S5, the category label of each multi-view data for testing is predicted according to the reward matrix of S4, and the specific calculation formula is as follows:
其中,V为多视角数据的视角数目,为第j个数据点的预测标签。where V is the number of viewing angles of the multi-view data, is the predicted label for the jth data point.
与现有技术相比,本发明有以下有益效果:Compared with the prior art, the present invention has the following beneficial effects:
本发明采用的算法基于带有重启概率的随机游走,可以有效地发掘数据点之间的相关性并捕捉数据集的全局结构信息,对比其他同类算法,具有较大的性能提升。本发明能够对图像、文本、视频等各类型数据进行准确有效的分类,在实验中,对现实应用中的八个公开数据集进行了分类,其中包含了文本、图像以及视频类型的数据集,在仅利用少量监督信息的情况下,在各个数据集上均取得了较高的分类性能,具有一定的实用价值。The algorithm adopted in the present invention is based on random walk with restart probability, which can effectively explore the correlation between data points and capture the global structure information of the data set, and has a large performance improvement compared with other similar algorithms. The invention can accurately and effectively classify various types of data such as images, texts, and videos. In the experiment, eight public data sets in practical applications are classified, including data sets of text, image and video types. In the case of using only a small amount of supervision information, high classification performance has been achieved on each dataset, which has certain practical value.
附图说明Description of drawings
图1为本发明实施例的流程图。FIG. 1 is a flowchart of an embodiment of the present invention.
图2为本发明的一实施例的整体方法的实现流程图。FIG. 2 is a flow chart of the implementation of the overall method according to an embodiment of the present invention.
具体实施方式Detailed ways
下面结合附图及实施例对本发明做进一步说明。The present invention will be further described below with reference to the accompanying drawings and embodiments.
应该指出,以下详细说明都是例示性的,旨在对本申请提供进一步的说明。除非另有指明,本文使用的所有技术和科学术语具有与本申请所属技术领域的普通技术人员通常理解的相同含义。It should be noted that the following detailed description is exemplary and intended to provide further explanation of the application. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
需要注意的是,这里所使用的术语仅是为了描述具体实施方式,而非意图限制根据本申请的示例性实施方式。如在这里所使用的,除非上下文另外明确指出,否则单数形式也意图包括复数形式,此外,还应当理解的是,当在本说明书中使用术语“包含”和/或“包括”时,其指明存在特征、步骤、操作、器件、组件和/或它们的组合。It should be noted that the terminology used herein is for the purpose of describing specific embodiments only, and is not intended to limit the exemplary embodiments according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural as well, furthermore, it is to be understood that when the terms "comprising" and/or "including" are used in this specification, it indicates that There are features, steps, operations, devices, components and/or combinations thereof.
如图1、2所示,本实施例提供了一种快速种子随机游走的多视角半监督分类方法,包括以下步骤:As shown in Figures 1 and 2, this embodiment provides a multi-view semi-supervised classification method with fast seed random walk, including the following steps:
步骤S1:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;Step S1: using a Gaussian kernel function to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data;
步骤S2:根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态;Step S2: establishing the initial distribution state of each view of the multi-view data according to the category label of the multi-view data for semi-supervised learning;
步骤S3:根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;Step S3: Calculate the arrival probability matrix of the first transition state of each view of the multi-view data according to the initial distribution state of each view of the multi-view data established in S2;
步骤S4:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵;Step S4: iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data and perform a weighted summation of the arrival probability matrices of all the transition states of each view to obtain the reward matrix of each view;
步骤S5:根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签。Step S5: Predict the category label of each multi-view data for testing according to the reward matrix of each view of the multi-view data calculated in S4.
在本实施例中,所述步骤S1具体包括以下步骤:In this embodiment, the step S1 specifically includes the following steps:
步骤S11:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵,计算公式为:Step S11 : using the Gaussian kernel function to calculate the similarity matrix of each angle of view of the input multi-angle data, the calculation formula is:
其中[Wt]ij为第t个视角下的第i个和第j个数据点之间的相似性,为第t个视角下的第i个数据点,σ为带宽,控制高斯核函数的局部作用范围;where [W t ] ij is the similarity between the i-th and j-th data points under the t-th view, is the i-th data point under the t-th viewing angle, and σ is the bandwidth, which controls the local scope of the Gaussian kernel function;
步骤S12:计算多视角数据的每个视角的转移概率矩阵,计算公式为:Step S12: Calculate the transition probability matrix of each view of the multi-view data, and the calculation formula is:
其中,为对角矩阵,且Pt为多视角数据的第t个视角的转移概率矩阵。in, is a diagonal matrix, and P t is the transition probability matrix of the t-th view of the multi-view data.
在本实施例中,步骤S2中所述根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态,计算公式为:In this embodiment, the initial distribution state of each view of the multi-view data is established according to the category label of the multi-view data used for semi-supervised learning in step S2, and the calculation formula is:
[Qt](0)=[Q0,O],[Q t ] (0) = [Q 0 , O],
其中,为全0矩阵,c为多视角数据的类别数目,N为多视角数据的总数目,n为用于半监督学习的多视角数据的数目,[Qt](0)为多视角数据第t个视角的起始分布状态,Q0的计算公式如下:in, is an all-zero matrix, c is the number of categories of multi-view data, N is the total number of multi-view data, n is the number of multi-view data used for semi-supervised learning, [Q t ] (0) is the start of the t-th view of multi-view data Distribution state, the calculation formula of Q 0 is as follows:
其中Ci为第i个类别,若多视角数据的第j个数据点xj属于第i个类别Ci,则[Q0]ij的值为1,反之为0。Among them, C i is the ith category. If the jth data point x j of the multi-view data belongs to the ith category C i , the value of [Q 0 ] ij is 1, otherwise, it is 0.
在本实施例中,步骤S3中所述根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵,计算公式为:In this embodiment, in step S3, the arrival probability matrix of the first transition state of each view of the multi-view data is calculated according to the initial distribution state of each view of the multi-view data established in S2, and the calculation formula is:
其中α为根据先验知识确定的重启概率,[Qt](1)为多视角数据的第t个视角在起始分布状态下的第一次转移状态的到达概率矩阵。in α is the restart probability determined according to prior knowledge, [Q t ] (1) is the arrival probability matrix of the first transition state of the t-th view of the multi-view data in the initial distribution state.
在本实施例中,所述步骤S4具体包括以下步骤:In this embodiment, the step S4 specifically includes the following steps:
步骤S41:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,计算公式如下:Step S41: Iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data, and the calculation formula is as follows:
[Qt](k)=(1-α)[Qt](k-l)Pt+α[Qt](0)for k≥2,[Q t ] (k) = (1-α)[Q t ] (kl) P t +α[Q t ] (0) for k≥2,
其中,[Qt](k)为多视角数据第t个视角的第k次转移状态的到达概率矩阵;Among them, [Q t ] (k) is the arrival probability matrix of the k-th transition state of the t-th view of the multi-view data;
步骤S42:对多视角数据的每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵,计算公式如下:Step S42: Weighted summation is performed on the arrival probability matrices of all transition states of each view of the multi-view data to obtain the reward matrix of each view, and the calculation formula is as follows:
其中,顀为根据先验知识确定的转移步数的数目,[Rt](s)为多视角数据的第t个视角在转移步数顀下的奖励矩阵,γ为根据先验知识确定的衰变因子。Among them, J is the number of transition steps determined according to the prior knowledge, [R t ] (s) is the reward matrix of the t-th view of the multi-view data under the number of transition steps, and γ is determined according to the prior knowledge. decay factor.
在本实施例中,步骤S5中所述根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签,具体计算公式如下:In this embodiment, according to the reward matrix of each view of the multi-view data calculated in step S4, the category label of each multi-view data for testing is predicted according to the step S5, and the specific calculation formula is as follows:
其中,V为多视角数据的视角数目,为第j个数据点的预测标签。where V is the number of viewing angles of the multi-view data, is the predicted label for the jth data point.
本实施例从实际应用出发,首先,采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;然后,根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态并计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;最后,迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵来生成用于测试的多视角数据的类别标签。本实施例基于带有重启概率的随机游走,可以有效地发掘数据点之间的相关性并捕捉数据集的全局结构信息,进而能够对图像、文本、音频等各类型数据进行准确有效的分类,具有一定的应用价值。This embodiment starts from practical application. First, the Gaussian kernel function is used to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data; The initial distribution state of each view of the view data and the arrival probability matrix of the first transition state of each view of the multi-view data are calculated; finally, the arrival of multiple transition states of each view of the multi-view data is iteratively calculated The probability matrix, and the weighted summation of the arrival probability matrix of all transition states of each view to obtain the reward matrix of each view to generate the class label for the multi-view data for testing. Based on the random walk with restart probability, this embodiment can effectively explore the correlation between data points and capture the global structure information of the data set, so as to accurately and effectively classify various types of data such as images, texts, and audios , has certain application value.
以上所述仅为本发明的较佳实施例,凡依本发明申请专利范围所做的均等变化与修饰,皆应属本发明的涵盖范围。The above descriptions are only preferred embodiments of the present invention, and all equivalent changes and modifications made according to the scope of the patent application of the present invention shall fall within the scope of the present invention.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010695386.8A CN111814894B (en) | 2020-07-17 | 2020-07-17 | A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010695386.8A CN111814894B (en) | 2020-07-17 | 2020-07-17 | A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111814894A true CN111814894A (en) | 2020-10-23 |
CN111814894B CN111814894B (en) | 2022-09-09 |
Family
ID=72866453
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010695386.8A Active CN111814894B (en) | 2020-07-17 | 2020-07-17 | A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111814894B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113111721A (en) * | 2021-03-17 | 2021-07-13 | 同济大学 | Human behavior intelligent identification method based on multi-unmanned aerial vehicle visual angle image data driving |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110781788A (en) * | 2019-10-18 | 2020-02-11 | 中国科学技术大学 | A method and system for ground classification of field robots based on a small number of annotations |
US20200125897A1 (en) * | 2018-10-18 | 2020-04-23 | Deepnorth Inc. | Semi-Supervised Person Re-Identification Using Multi-View Clustering |
-
2020
- 2020-07-17 CN CN202010695386.8A patent/CN111814894B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200125897A1 (en) * | 2018-10-18 | 2020-04-23 | Deepnorth Inc. | Semi-Supervised Person Re-Identification Using Multi-View Clustering |
CN110781788A (en) * | 2019-10-18 | 2020-02-11 | 中国科学技术大学 | A method and system for ground classification of field robots based on a small number of annotations |
Non-Patent Citations (2)
Title |
---|
CHARU C.AGGARWAL: "《社会网络数据分析》", 31 December 2016, 武汉大学出版社 * |
JINJIANG TANG 等: "Multi-view non-negative matrix factorization for scene recognition", 《JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113111721A (en) * | 2021-03-17 | 2021-07-13 | 同济大学 | Human behavior intelligent identification method based on multi-unmanned aerial vehicle visual angle image data driving |
Also Published As
Publication number | Publication date |
---|---|
CN111814894B (en) | 2022-09-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111259786B (en) | Pedestrian re-identification method based on synchronous enhancement of appearance and motion information of video | |
CN112613556B (en) | A meta-learning-based approach for sentiment classification of few-shot images | |
CN106250915B (en) | An Automatic Image Annotation Method Fused with Deep Features and Semantic Neighborhoods | |
CN107506712A (en) | Method for distinguishing is known in a kind of human behavior based on 3D depth convolutional networks | |
CN111709518A (en) | A method for enhanced network representation learning based on community awareness and relational attention | |
CN112036322A (en) | Method, system and device for building cross-domain pedestrian re-identification model in multi-task network | |
Wen et al. | Improving face recognition with domain adaptation | |
CN110210335B (en) | A training method, system and device for a person re-identification learning model | |
CN105117429A (en) | Scenario image annotation method based on active learning and multi-label multi-instance learning | |
CN110399518A (en) | A Visual Question Answering Enhancement Method Based on Graph Convolution | |
CN105787513A (en) | Transfer learning design method and system based on domain adaptation under multi-example multi-label framework | |
CN110163117B (en) | Pedestrian re-identification method based on self-excitation discriminant feature learning | |
Li et al. | Two-class 3D-CNN classifiers combination for video copy detection | |
Su et al. | Transfer learning for video recognition with scarce training data for deep convolutional neural network | |
CN105320764A (en) | 3D model retrieval method and 3D model retrieval apparatus based on slow increment features | |
CN111723600B (en) | A feature descriptor for person re-identification based on multi-task learning | |
Han et al. | 3d shape temporal aggregation for video-based clothing-change person re-identification | |
Wu et al. | Inter-attribute awareness for pedestrian attribute recognition | |
CN116434033A (en) | Cross-modal contrast learning method and system for RGB-D image dense prediction task | |
CN110516540B (en) | Group behavior identification method based on multi-stream architecture and long-term and short-term memory network | |
CN116543192A (en) | A small-sample classification method for remote sensing images based on multi-view feature fusion | |
CN111814894B (en) | A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk | |
Sajid et al. | Facial asymmetry-based feature extraction for different applications: a review complemented by new advances | |
CN104200220B (en) | Dynamic texture identification method based on static texture model aggregation | |
CN114332491A (en) | Saliency target detection algorithm based on feature reconstruction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |