CN111814894A - A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk - Google Patents

A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk Download PDF

Info

Publication number
CN111814894A
CN111814894A CN202010695386.8A CN202010695386A CN111814894A CN 111814894 A CN111814894 A CN 111814894A CN 202010695386 A CN202010695386 A CN 202010695386A CN 111814894 A CN111814894 A CN 111814894A
Authority
CN
China
Prior art keywords
view
view data
data
matrix
transition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010695386.8A
Other languages
Chinese (zh)
Other versions
CN111814894B (en
Inventor
王石平
黄晟
王哲文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN202010695386.8A priority Critical patent/CN111814894B/en
Publication of CN111814894A publication Critical patent/CN111814894A/en
Application granted granted Critical
Publication of CN111814894B publication Critical patent/CN111814894B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/15Correlation function computation including computation of convolution operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Algebra (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明涉及一种快速种子随机游走的多视角半监督分类方法,首先,采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;然后,根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态并计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;最后,迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵来生成用于测试的多视角数据的类别标签。本发明仅使用少量的监督信息,就能够对图像、文本、视频等各类型数据进行准确有效的分类,具有一定的实用价值。

Figure 202010695386

The invention relates to a multi-view semi-supervised classification method for fast seed random walk. First, a Gaussian kernel function is used to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data; The learned class labels of the multi-view data establish the initial distribution state of each view of the multi-view data and calculate the arrival probability matrix of the first transition state of each view of the multi-view data; finally, iteratively calculate the multi-view data. The arrival probability matrix of multiple transition states of each view, and the weighted summation of the arrival probability matrices of all transition states of each view to obtain the reward matrix of each view to generate the class label of the multi-view data for testing. The present invention can accurately and effectively classify various types of data such as images, texts, and videos by using only a small amount of supervision information, and has certain practical value.

Figure 202010695386

Description

一种快速种子随机游走的多视角半监督分类方法A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk

技术领域technical field

本发明涉及多视角和半监督学习领域,特别是一种快速种子随机游走的多视角半监督分类方法。The invention relates to the field of multi-view and semi-supervised learning, in particular to a multi-view semi-supervised classification method with fast seed random walk.

背景技术Background technique

多视角数据在实际应用中极为普遍,例如多摄像机图像收集和多模信息获取。从异构源收集的数据通常包含大量冗余和不相关的信息,这可能会使学习算法的性能下降。在多视角数据中,每个视角捕获部分信息而不是完整信息。完整的表示形式是潜在的和多余的,因此很难为要学习的任务提取有用的信息。另一方面,对所有多视角特征进行单视角学习或简单串联通常是无效的。因此,学习足够的判别性特征并使用多视角驱动算法从这些数据中挖掘有效的信息至关重要。Multi-view data is extremely common in practical applications, such as multi-camera image collection and multi-modal information acquisition. Data collected from heterogeneous sources often contain a lot of redundant and irrelevant information, which may degrade the performance of learning algorithms. In multi-view data, each view captures partial information rather than complete information. The full representation is latent and redundant, making it difficult to extract useful information for the task to be learned. On the other hand, single-view learning or simple concatenation of all multi-view features is usually ineffective. Therefore, it is crucial to learn enough discriminative features and use multi-view-driven algorithms to mine effective information from these data.

近年来,随机游走理论有了突破性进展。称为PageRank的经典算法在Web搜索中起着至关重要的作用,其中所有Web的重要性都使用Web链接结构进行排名。之后,许多个性化的PageRank方法被提出来解决各种学习任务。Random walk with restart(RWR)是一种代表性方案,可在无向加权图中的任意两个节点之间提供良好的相关性评分等级。根据类别标签信息是否可用,随机游走主要可以分为无监督方法和有监督方法。前者将重点放在数据聚类和解决无监督学习任务上,例如图像分割,后者旨在解决分类任务及其相应的特定应用。这些研究工作表明随机游走在各种实际应用中具有广泛的适用性和高度的兼容性。In recent years, there have been breakthroughs in random walk theory. The classic algorithm called PageRank plays a vital role in web search, where all the importance of the web is ranked using the web link structure. After that, many personalized PageRank methods have been proposed to solve various learning tasks. Random walk with restart (RWR) is a representative scheme that provides a good correlation score between any two nodes in an undirected weighted graph. According to whether the class label information is available, random walks can be mainly divided into unsupervised methods and supervised methods. The former focuses on data clustering and solving unsupervised learning tasks, such as image segmentation, while the latter aims to solve classification tasks and their corresponding specific applications. These research works demonstrate the wide applicability and high compatibility of random walks in various practical applications.

多视角学习旨在从多视角数据中挖掘有用的模式。先前的大量研究表明,多视角学习可以充分利用多视角数据中的相似性和互补性信息,并且比单视角学习具有更好的生成能力和优越的性能。作为最早的多视角学习代表性范例之一,在无监督模式下的协同训练最大化了两个不同视角之间的相互一致性。后来,普通的无监督多视角学习方法受到越来越多的关注,包括无监督多视角特征表示和多视角聚类。相应地,有监督的多视角学习方法也已经吸引了来自不同领域的越来越多的研究兴趣,例如图像分类,手势识别和图像注释。然而,标记足够的多视角训练数据是难以实现且费时的,而仅一小部分标记的样本容易获得,仅有少量数据不足以完全进行监督学习,而对于无监督学习则利用不足。作为两种学习范式的折中方案,多视角半监督分类可以利用有限比例的标记样本来提高性能。为此,学界提出了多视角半监督学习以最大程度利用小比例标记数据点来挖掘有效信息。但目前为止,对于多视角半监督分类所做的工作仍然有限,对于此问题,需要进行更多的研究。Multi-view learning aims to mine useful patterns from multi-view data. Numerous previous studies have shown that multi-view learning can fully exploit the similarity and complementarity information in multi-view data, and has better generative ability and superior performance than single-view learning. As one of the earliest representative paradigms of multi-view learning, co-training in an unsupervised mode maximizes the mutual consistency between two different views. Later, ordinary unsupervised multi-view learning methods received more and more attention, including unsupervised multi-view feature representation and multi-view clustering. Correspondingly, supervised multi-view learning methods have also attracted increasing research interest from different fields, such as image classification, gesture recognition, and image annotation. However, labeling enough multi-view training data is difficult and time-consuming, while only a small number of labeled samples are readily available, only a small amount of data is not sufficient for fully supervised learning, and underutilized for unsupervised learning. As a compromise between the two learning paradigms, multi-view semi-supervised classification can leverage a limited proportion of labeled samples to improve performance. To this end, the academic community proposes multi-view semi-supervised learning to maximize the use of a small proportion of labeled data points to mine effective information. But so far, the work done on multi-view semi-supervised classification is still limited, and more research is needed for this problem.

目前的多视角半监督学习算法主要分为两类。第一种方法是基于子空间的方法,通常将给定的标签矩阵嵌入目标函数中作为一个共同的回归目标,学习一个潜在的低维子空间来投影输入数据,以挖掘视角之间的共性。例如,Nie等人对一个凸问题进行建模以避免局部极小值,并提出了一种新的自适应权值学习策略来学习投影矩阵。Xue等人将标签信息引入深度矩阵分解,并学习相关预测子空间用于不完全多视角数据分类。第二种模式是基于图的模型,它将每个数据点视为一个由多个视角融合而成的联合图的顶点,并通过加权边将标签信息从标记样本传播到未标记样本。作为一种早期的基于图的方法,Karasuyama提出了一种标签传播环境下的多图集成方法,该方法通过权值正则化学习稀疏多视角权重,将多个图线性组合。相比之下,Nie等人从先验图结构中学习权重来融合多视角,而不是通过权重正则化来学习融合权重。目前,已有多种多视角半监督学习方法被开发出来,并在具体应用中表明了它们的有效性。但是,多视角半监督分类在很大程度上尚未得到充分的研究,这些模型仍然存在一些缺陷。高计算复杂度是大多数算法在面对大规模学习问题时需要解决的一个主要限制,因此需要做更多的工作。此外,有限的学习性能要求算法在只有少量的标记数据可用于模型训练时挖掘更有效的模式。所以,更有效、更高效的多视角半监督学习方法仍有待开发。The current multi-view semi-supervised learning algorithms are mainly divided into two categories. The first method is a subspace-based method, which usually embeds a given label matrix in the objective function as a common regression objective, and learns a latent low-dimensional subspace to project the input data to mine commonalities between viewpoints. For example, Nie et al. model a convex problem to avoid local minima and propose a new adaptive weight learning strategy to learn the projection matrix. Xue et al. introduced label information into deep matrix factorization and learned a relevant prediction subspace for incomplete multi-view data classification. The second mode is a graph-based model, which treats each data point as a vertex of a joint graph fused from multiple viewpoints and propagates label information from labeled to unlabeled samples through weighted edges. As an early graph-based method, Karasuyama proposed a multi-graph ensemble method in the context of label propagation, which learns sparse multi-view weights through weight regularization to linearly combine multiple graphs. In contrast, Nie et al. learn the weights from the prior graph structure to fuse multi-views instead of learning the fusion weights through weight regularization. Currently, a variety of multi-view semi-supervised learning methods have been developed and their effectiveness has been demonstrated in specific applications. However, multi-view semi-supervised classification is largely understudied, and these models still suffer from some flaws. High computational complexity is a major limitation that most algorithms need to address when faced with large-scale learning problems, so more work is required. Furthermore, limited learning performance requires algorithms to mine more efficient patterns when only a small amount of labeled data is available for model training. Therefore, more effective and efficient multi-view semi-supervised learning methods remain to be developed.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明提出一种快速种子随机游走的多视角半监督分类方法,能够对文本、图像以及视频等各类型的数据集进行准确有效的分类,在仅利用少量监督信息的情况下,在各个数据集上均取得了较高的分类性能,具有一定的实用价值。In view of this, the present invention proposes a multi-view semi-supervised classification method with fast seed random walk, which can accurately and effectively classify various types of data sets such as text, images, and videos, and only uses a small amount of supervision information. , has achieved high classification performance on each dataset, and has certain practical value.

本发明采用以下方案实现:一种快速种子随机游走的多视角半监督分类方法,包括以下步骤:The present invention adopts the following scheme to realize: a multi-view semi-supervised classification method of fast seed random walk, comprising the following steps:

步骤S1:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;Step S1: using a Gaussian kernel function to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data;

步骤S2:根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态;Step S2: establishing the initial distribution state of each view of the multi-view data according to the category label of the multi-view data for semi-supervised learning;

步骤S3:根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;Step S3: Calculate the arrival probability matrix of the first transition state of each view of the multi-view data according to the initial distribution state of each view of the multi-view data established in S2;

步骤S4:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵;Step S4: iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data and perform a weighted summation of the arrival probability matrices of all the transition states of each view to obtain the reward matrix of each view;

步骤S5:根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签。Step S5: Predict the category label of each multi-view data for testing according to the reward matrix of each view of the multi-view data calculated in S4.

进一步地,所述步骤S1具体包括以下步骤:Further, the step S1 specifically includes the following steps:

步骤S11:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵,计算公式为:Step S11 : using the Gaussian kernel function to calculate the similarity matrix of each angle of view of the input multi-angle data, the calculation formula is:

Figure BDA0002589846960000041
Figure BDA0002589846960000041

其中[Wt]ij为第t个视角下的第i个和第j个数据点之间的相似性,

Figure BDA0002589846960000042
为第t个视角下的第i个数据点,σ为带宽,控制高斯核函数的局部作用范围;where [W t ] ij is the similarity between the i-th and j-th data points under the t-th view,
Figure BDA0002589846960000042
is the i-th data point under the t-th viewing angle, and σ is the bandwidth, which controls the local scope of the Gaussian kernel function;

步骤S12:计算多视角数据的每个视角的转移概率矩阵,计算公式为:Step S12: Calculate the transition probability matrix of each view of the multi-view data, and the calculation formula is:

Figure BDA0002589846960000043
Figure BDA0002589846960000043

其中,

Figure BDA0002589846960000044
为对角矩阵,且
Figure BDA0002589846960000045
Pt为多视角数据的第t个视角的转移概率矩阵。in,
Figure BDA0002589846960000044
is a diagonal matrix, and
Figure BDA0002589846960000045
P t is the transition probability matrix of the t-th view of the multi-view data.

进一步地,步骤S2中所述根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态,计算公式为:Further, according to the category label of the multi-view data used for semi-supervised learning in step S2, the initial distribution state of each view of the multi-view data is established, and the calculation formula is:

[Qt](0)=[Q0,O],[Q t ] (0) = [Q 0 , O],

其中,

Figure BDA0002589846960000051
为全0矩阵,
Figure BDA0002589846960000052
c为多视角数据的类别数目,N为多视角数据的总数目,n为用于半监督学习的多视角数据的数目,[Qy](0)为多视角数据第t个视角的起始分布状态,Q0的计算公式如下:in,
Figure BDA0002589846960000051
is an all-zero matrix,
Figure BDA0002589846960000052
c is the number of categories of multi-view data, N is the total number of multi-view data, n is the number of multi-view data used for semi-supervised learning, [Q y ] (0) is the start of the t-th view of multi-view data Distribution state, the calculation formula of Q 0 is as follows:

Figure BDA0002589846960000053
Figure BDA0002589846960000053

其中Ci为第i个类别,若多视角数据的第j个数据点xj属于第i个类别Ci,则[Q0]ij的值为1,反之为0。Among them, C i is the ith category. If the jth data point x j of the multi-view data belongs to the ith category C i , the value of [Q 0 ] ij is 1, otherwise, it is 0.

进一步地,步骤S3中所述根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵,计算公式为:Further, in step S3, the arrival probability matrix of the first transition state of each view of the multi-view data is calculated according to the initial distribution state of each view of the multi-view data established in S2, and the calculation formula is:

Figure BDA0002589846960000054
Figure BDA0002589846960000054

其中

Figure BDA0002589846960000055
α为根据先验知识确定的重启概率,[Qt](1)为多视角数据的第t个视角在起始分布状态下的第一次转移状态的到达概率矩阵。in
Figure BDA0002589846960000055
α is the restart probability determined according to prior knowledge, [Q t ] (1) is the arrival probability matrix of the first transition state of the t-th view of the multi-view data in the initial distribution state.

进一步地,所述步骤S4具体包括以下步骤:Further, the step S4 specifically includes the following steps:

步骤S41:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,计算公式如下:Step S41: Iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data, and the calculation formula is as follows:

[Qt](k)=(1-α)[Qt](k-l)Pt+α[Qt](0)for k≥2,[Q t ] (k) = (1-α)[Q t ] (kl) P t +α[Q t ] (0) for k≥2,

其中,[Qt](k)为多视角数据第t个视角的第k次转移状态的到达概率矩阵;Among them, [Q t ] (k) is the arrival probability matrix of the k-th transition state of the t-th view of the multi-view data;

步骤S42:对多视角数据的每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵,计算公式如下:Step S42: Weighted summation is performed on the arrival probability matrices of all transition states of each view of the multi-view data to obtain the reward matrix of each view, and the calculation formula is as follows:

Figure BDA0002589846960000061
Figure BDA0002589846960000061

其中,顀为根据先验知识确定的转移步数的数目,[Rt](s)为多视角数据的第t个视角在转移步数顀下的奖励矩阵,γ为根据先验知识确定的衰变因子。Among them, J is the number of transition steps determined according to the prior knowledge, [R t ] (s) is the reward matrix of the t-th view of the multi-view data under the number of transition steps, and γ is determined according to the prior knowledge. decay factor.

进一步地,步骤S5中所述根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签,具体计算公式如下:Further, according to the reward matrix of each view of the multi-view data calculated in step S5, the category label of each multi-view data for testing is predicted according to the reward matrix of S4, and the specific calculation formula is as follows:

Figure BDA0002589846960000062
Figure BDA0002589846960000062

其中,V为多视角数据的视角数目,

Figure BDA0002589846960000063
为第j个数据点的预测标签。where V is the number of viewing angles of the multi-view data,
Figure BDA0002589846960000063
is the predicted label for the jth data point.

与现有技术相比,本发明有以下有益效果:Compared with the prior art, the present invention has the following beneficial effects:

本发明采用的算法基于带有重启概率的随机游走,可以有效地发掘数据点之间的相关性并捕捉数据集的全局结构信息,对比其他同类算法,具有较大的性能提升。本发明能够对图像、文本、视频等各类型数据进行准确有效的分类,在实验中,对现实应用中的八个公开数据集进行了分类,其中包含了文本、图像以及视频类型的数据集,在仅利用少量监督信息的情况下,在各个数据集上均取得了较高的分类性能,具有一定的实用价值。The algorithm adopted in the present invention is based on random walk with restart probability, which can effectively explore the correlation between data points and capture the global structure information of the data set, and has a large performance improvement compared with other similar algorithms. The invention can accurately and effectively classify various types of data such as images, texts, and videos. In the experiment, eight public data sets in practical applications are classified, including data sets of text, image and video types. In the case of using only a small amount of supervision information, high classification performance has been achieved on each dataset, which has certain practical value.

附图说明Description of drawings

图1为本发明实施例的流程图。FIG. 1 is a flowchart of an embodiment of the present invention.

图2为本发明的一实施例的整体方法的实现流程图。FIG. 2 is a flow chart of the implementation of the overall method according to an embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图及实施例对本发明做进一步说明。The present invention will be further described below with reference to the accompanying drawings and embodiments.

应该指出,以下详细说明都是例示性的,旨在对本申请提供进一步的说明。除非另有指明,本文使用的所有技术和科学术语具有与本申请所属技术领域的普通技术人员通常理解的相同含义。It should be noted that the following detailed description is exemplary and intended to provide further explanation of the application. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

需要注意的是,这里所使用的术语仅是为了描述具体实施方式,而非意图限制根据本申请的示例性实施方式。如在这里所使用的,除非上下文另外明确指出,否则单数形式也意图包括复数形式,此外,还应当理解的是,当在本说明书中使用术语“包含”和/或“包括”时,其指明存在特征、步骤、操作、器件、组件和/或它们的组合。It should be noted that the terminology used herein is for the purpose of describing specific embodiments only, and is not intended to limit the exemplary embodiments according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural as well, furthermore, it is to be understood that when the terms "comprising" and/or "including" are used in this specification, it indicates that There are features, steps, operations, devices, components and/or combinations thereof.

如图1、2所示,本实施例提供了一种快速种子随机游走的多视角半监督分类方法,包括以下步骤:As shown in Figures 1 and 2, this embodiment provides a multi-view semi-supervised classification method with fast seed random walk, including the following steps:

步骤S1:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;Step S1: using a Gaussian kernel function to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data;

步骤S2:根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态;Step S2: establishing the initial distribution state of each view of the multi-view data according to the category label of the multi-view data for semi-supervised learning;

步骤S3:根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;Step S3: Calculate the arrival probability matrix of the first transition state of each view of the multi-view data according to the initial distribution state of each view of the multi-view data established in S2;

步骤S4:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵;Step S4: iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data and perform a weighted summation of the arrival probability matrices of all the transition states of each view to obtain the reward matrix of each view;

步骤S5:根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签。Step S5: Predict the category label of each multi-view data for testing according to the reward matrix of each view of the multi-view data calculated in S4.

在本实施例中,所述步骤S1具体包括以下步骤:In this embodiment, the step S1 specifically includes the following steps:

步骤S11:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵,计算公式为:Step S11 : using the Gaussian kernel function to calculate the similarity matrix of each angle of view of the input multi-angle data, the calculation formula is:

Figure BDA0002589846960000081
Figure BDA0002589846960000081

其中[Wt]ij为第t个视角下的第i个和第j个数据点之间的相似性,

Figure BDA0002589846960000082
为第t个视角下的第i个数据点,σ为带宽,控制高斯核函数的局部作用范围;where [W t ] ij is the similarity between the i-th and j-th data points under the t-th view,
Figure BDA0002589846960000082
is the i-th data point under the t-th viewing angle, and σ is the bandwidth, which controls the local scope of the Gaussian kernel function;

步骤S12:计算多视角数据的每个视角的转移概率矩阵,计算公式为:Step S12: Calculate the transition probability matrix of each view of the multi-view data, and the calculation formula is:

Figure BDA0002589846960000083
Figure BDA0002589846960000083

其中,

Figure BDA0002589846960000084
为对角矩阵,且
Figure BDA0002589846960000085
Pt为多视角数据的第t个视角的转移概率矩阵。in,
Figure BDA0002589846960000084
is a diagonal matrix, and
Figure BDA0002589846960000085
P t is the transition probability matrix of the t-th view of the multi-view data.

在本实施例中,步骤S2中所述根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态,计算公式为:In this embodiment, the initial distribution state of each view of the multi-view data is established according to the category label of the multi-view data used for semi-supervised learning in step S2, and the calculation formula is:

[Qt](0)=[Q0,O],[Q t ] (0) = [Q 0 , O],

其中,

Figure BDA0002589846960000091
为全0矩阵,
Figure BDA0002589846960000092
c为多视角数据的类别数目,N为多视角数据的总数目,n为用于半监督学习的多视角数据的数目,[Qt](0)为多视角数据第t个视角的起始分布状态,Q0的计算公式如下:in,
Figure BDA0002589846960000091
is an all-zero matrix,
Figure BDA0002589846960000092
c is the number of categories of multi-view data, N is the total number of multi-view data, n is the number of multi-view data used for semi-supervised learning, [Q t ] (0) is the start of the t-th view of multi-view data Distribution state, the calculation formula of Q 0 is as follows:

Figure BDA0002589846960000093
Figure BDA0002589846960000093

其中Ci为第i个类别,若多视角数据的第j个数据点xj属于第i个类别Ci,则[Q0]ij的值为1,反之为0。Among them, C i is the ith category. If the jth data point x j of the multi-view data belongs to the ith category C i , the value of [Q 0 ] ij is 1, otherwise, it is 0.

在本实施例中,步骤S3中所述根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵,计算公式为:In this embodiment, in step S3, the arrival probability matrix of the first transition state of each view of the multi-view data is calculated according to the initial distribution state of each view of the multi-view data established in S2, and the calculation formula is:

Figure BDA0002589846960000094
Figure BDA0002589846960000094

其中

Figure BDA0002589846960000095
α为根据先验知识确定的重启概率,[Qt](1)为多视角数据的第t个视角在起始分布状态下的第一次转移状态的到达概率矩阵。in
Figure BDA0002589846960000095
α is the restart probability determined according to prior knowledge, [Q t ] (1) is the arrival probability matrix of the first transition state of the t-th view of the multi-view data in the initial distribution state.

在本实施例中,所述步骤S4具体包括以下步骤:In this embodiment, the step S4 specifically includes the following steps:

步骤S41:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,计算公式如下:Step S41: Iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data, and the calculation formula is as follows:

[Qt](k)=(1-α)[Qt](k-l)Pt+α[Qt](0)for k≥2,[Q t ] (k) = (1-α)[Q t ] (kl) P t +α[Q t ] (0) for k≥2,

其中,[Qt](k)为多视角数据第t个视角的第k次转移状态的到达概率矩阵;Among them, [Q t ] (k) is the arrival probability matrix of the k-th transition state of the t-th view of the multi-view data;

步骤S42:对多视角数据的每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵,计算公式如下:Step S42: Weighted summation is performed on the arrival probability matrices of all transition states of each view of the multi-view data to obtain the reward matrix of each view, and the calculation formula is as follows:

Figure BDA0002589846960000101
Figure BDA0002589846960000101

其中,顀为根据先验知识确定的转移步数的数目,[Rt](s)为多视角数据的第t个视角在转移步数顀下的奖励矩阵,γ为根据先验知识确定的衰变因子。Among them, J is the number of transition steps determined according to the prior knowledge, [R t ] (s) is the reward matrix of the t-th view of the multi-view data under the number of transition steps, and γ is determined according to the prior knowledge. decay factor.

在本实施例中,步骤S5中所述根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签,具体计算公式如下:In this embodiment, according to the reward matrix of each view of the multi-view data calculated in step S4, the category label of each multi-view data for testing is predicted according to the step S5, and the specific calculation formula is as follows:

Figure BDA0002589846960000102
Figure BDA0002589846960000102

其中,V为多视角数据的视角数目,

Figure BDA0002589846960000103
为第j个数据点的预测标签。where V is the number of viewing angles of the multi-view data,
Figure BDA0002589846960000103
is the predicted label for the jth data point.

本实施例从实际应用出发,首先,采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;然后,根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态并计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;最后,迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵来生成用于测试的多视角数据的类别标签。本实施例基于带有重启概率的随机游走,可以有效地发掘数据点之间的相关性并捕捉数据集的全局结构信息,进而能够对图像、文本、音频等各类型数据进行准确有效的分类,具有一定的应用价值。This embodiment starts from practical application. First, the Gaussian kernel function is used to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data; The initial distribution state of each view of the view data and the arrival probability matrix of the first transition state of each view of the multi-view data are calculated; finally, the arrival of multiple transition states of each view of the multi-view data is iteratively calculated The probability matrix, and the weighted summation of the arrival probability matrix of all transition states of each view to obtain the reward matrix of each view to generate the class label for the multi-view data for testing. Based on the random walk with restart probability, this embodiment can effectively explore the correlation between data points and capture the global structure information of the data set, so as to accurately and effectively classify various types of data such as images, texts, and audios , has certain application value.

以上所述仅为本发明的较佳实施例,凡依本发明申请专利范围所做的均等变化与修饰,皆应属本发明的涵盖范围。The above descriptions are only preferred embodiments of the present invention, and all equivalent changes and modifications made according to the scope of the patent application of the present invention shall fall within the scope of the present invention.

Claims (6)

1.一种快速种子随机游走的多视角半监督分类方法,其特征在于:包括以下步骤:1. a multi-view semi-supervised classification method of fast seed random walk, is characterized in that: comprise the following steps: 步骤S1:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵和转移概率矩阵;Step S1: using a Gaussian kernel function to calculate the similarity matrix and transition probability matrix of each view of the input multi-view data; 步骤S2:根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态;Step S2: establishing the initial distribution state of each view of the multi-view data according to the category label of the multi-view data for semi-supervised learning; 步骤S3:根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵;Step S3: Calculate the arrival probability matrix of the first transition state of each view of the multi-view data according to the initial distribution state of each view of the multi-view data established in S2; 步骤S4:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵并对每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵;Step S4: iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data and perform a weighted summation of the arrival probability matrices of all the transition states of each view to obtain the reward matrix of each view; 步骤S5:根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签。Step S5: Predict the category label of each multi-view data for testing according to the reward matrix of each view of the multi-view data calculated in S4. 2.根据权利要求1所述的一种快速种子随机游走的多视角半监督分类方法,其特征在于:所述步骤S1具体包括以下步骤:2. the multi-view semi-supervised classification method of a kind of fast seed random walk according to claim 1, is characterized in that: described step S1 specifically comprises the following steps: 步骤S11:采用高斯核函数计算输入的多视角数据的每个视角的相似性矩阵,计算公式为:Step S11 : using the Gaussian kernel function to calculate the similarity matrix of each angle of view of the input multi-angle data, the calculation formula is:
Figure FDA0002589846950000011
Figure FDA0002589846950000011
其中[Wt]ij为第t个视角下的第i个和第j个数据点之间的相似性,
Figure FDA0002589846950000012
为第t个视角下的第i个数据点,σ为带宽,控制高斯核函数的局部作用范围;
where [W t ] ij is the similarity between the i-th and j-th data points under the t-th view,
Figure FDA0002589846950000012
is the i-th data point under the t-th viewing angle, and σ is the bandwidth, which controls the local scope of the Gaussian kernel function;
步骤S12:计算多视角数据的每个视角的转移概率矩阵,计算公式为:Step S12: Calculate the transition probability matrix of each view of the multi-view data, and the calculation formula is:
Figure FDA0002589846950000021
Figure FDA0002589846950000021
其中,
Figure FDA0002589846950000022
为对角矩阵,且
Figure FDA0002589846950000023
Pt为多视角数据的第t个视角的转移概率矩阵。
in,
Figure FDA0002589846950000022
is a diagonal matrix, and
Figure FDA0002589846950000023
P t is the transition probability matrix of the t-th view of the multi-view data.
3.根据权利要求1所述的一种快速种子随机游走的多视角半监督分类方法,其特征在于:步骤S2中所述根据用于半监督学习的多视角数据的类别标签建立多视角数据的每个视角的起始分布状态,计算公式为:3. The multi-view semi-supervised classification method of a kind of fast seed random walk according to claim 1, it is characterized in that: described in step S2, establish multi-view data according to the class label of multi-view data used for semi-supervised learning The initial distribution state of each viewing angle of , the calculation formula is: [Qt](0)=[Q0,O],[Q t ] (0) = [Q 0 , O], 其中,
Figure FDA0002589846950000024
为全0矩阵,
Figure FDA0002589846950000025
c为多视角数据的类别数目,N为多视角数据的总数目,n为用于半监督学习的多视角数据的数目,[Qt](0)为多视角数据第t个视角的起始分布状态,Q0的计算公式如下:
in,
Figure FDA0002589846950000024
is an all-zero matrix,
Figure FDA0002589846950000025
c is the number of categories of multi-view data, N is the total number of multi-view data, n is the number of multi-view data used for semi-supervised learning, [Q t ] (0) is the start of the t-th view of multi-view data Distribution state, the calculation formula of Q 0 is as follows:
Figure FDA0002589846950000026
Figure FDA0002589846950000026
其中Ci为第i个类别,若多视角数据的第j个数据点xj属于第i个类别Ci,则[Q0]ij的值为1,反之为0。Among them, C i is the ith category. If the jth data point x j of the multi-view data belongs to the ith category C i , the value of [Q 0 ] ij is 1, otherwise, it is 0.
4.根据权利要求1所述的一种快速种子随机游走的多视角半监督分类方法,其特征在于:步骤S3中所述根据S2建立的多视角数据的每个视角的起始分布状态计算多视角数据的每个视角的第一次转移状态的到达概率矩阵,计算公式为:4. the multi-view semi-supervised classification method of a kind of fast seed random walk according to claim 1, is characterized in that: the initial distribution state of each view of the multi-view data established according to S2 in step S3 is calculated The arrival probability matrix of the first transition state of each view of the multi-view data, the calculation formula is:
Figure FDA0002589846950000027
Figure FDA0002589846950000027
Figure FDA0002589846950000031
Figure FDA0002589846950000031
其中
Figure FDA0002589846950000032
α为根据先验知识确定的重启概率,[Qt](1)为多视角数据的第t个视角在起始分布状态下的第一次转移状态的到达概率矩阵。
in
Figure FDA0002589846950000032
α is the restart probability determined according to prior knowledge, [Q t ] (1) is the arrival probability matrix of the first transition state of the t-th view of the multi-view data in the initial distribution state.
5.根据权利要求1所述的一种快速种子随机游走的多视角半监督分类方法,其特征在于:所述步骤S4具体包括以下步骤:5. the multi-view semi-supervised classification method of a kind of fast seed random walk according to claim 1, is characterized in that: described step S4 specifically comprises the following steps: 步骤S41:迭代地计算多视角数据的每个视角的多次转移状态的到达概率矩阵,计算公式如下:Step S41: Iteratively calculate the arrival probability matrix of the multiple transition states of each view of the multi-view data, and the calculation formula is as follows: [Qt](k)=(1-α)[Qt](k-1)Pt+α[Qt](0)for k≥2,[Q t ] (k) = (1-α)[Q t ] (k-1) P t +α[Q t ] (0) for k≥2, 其中,[Qt](k)为多视角数据第t个视角的第k次转移状态的到达概率矩阵;Among them, [Q t ] (k) is the arrival probability matrix of the k-th transition state of the t-th view of the multi-view data; 步骤S42:对多视角数据的每个视角的所有转移状态的到达概率矩阵进行加权求和得到每个视角的奖励矩阵,计算公式如下:Step S42: Weighted summation is performed on the arrival probability matrices of all transition states of each view of the multi-view data to obtain the reward matrix of each view, and the calculation formula is as follows:
Figure FDA0002589846950000033
Figure FDA0002589846950000033
其中,s为根据先验知识确定的转移步数的数目,[Rt](s)为多视角数据的第t个视角在转移步数s下的奖励矩阵,γ为根据先验知识确定的衰变因子。Among them, s is the number of transition steps determined according to prior knowledge, [R t ] (s) is the reward matrix of the t-th view of the multi-view data under the number of transition steps s, and γ is determined according to prior knowledge decay factor.
6.根据权利要求1所述的一种快速种子随机游走的多视角半监督分类方法,其特征在于:步骤S5中所述根据S4计算得到的多视角数据的每个视角的奖励矩阵预测每个用于测试的多视角数据的类别标签,具体计算公式如下:6. the multi-view semi-supervised classification method of a kind of fast seed random walk according to claim 1, is characterized in that: the reward matrix of each view of the multi-view data calculated according to S4 described in step S5 predicts every A category label for multi-view data used for testing. The specific calculation formula is as follows:
Figure FDA0002589846950000041
Figure FDA0002589846950000041
其中,V为多视角数据的视角数目,
Figure FDA0002589846950000042
为第j个数据点的预测标签。
where V is the number of viewing angles of the multi-view data,
Figure FDA0002589846950000042
is the predicted label for the jth data point.
CN202010695386.8A 2020-07-17 2020-07-17 A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk Active CN111814894B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010695386.8A CN111814894B (en) 2020-07-17 2020-07-17 A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010695386.8A CN111814894B (en) 2020-07-17 2020-07-17 A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk

Publications (2)

Publication Number Publication Date
CN111814894A true CN111814894A (en) 2020-10-23
CN111814894B CN111814894B (en) 2022-09-09

Family

ID=72866453

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010695386.8A Active CN111814894B (en) 2020-07-17 2020-07-17 A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk

Country Status (1)

Country Link
CN (1) CN111814894B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111721A (en) * 2021-03-17 2021-07-13 同济大学 Human behavior intelligent identification method based on multi-unmanned aerial vehicle visual angle image data driving

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110781788A (en) * 2019-10-18 2020-02-11 中国科学技术大学 A method and system for ground classification of field robots based on a small number of annotations
US20200125897A1 (en) * 2018-10-18 2020-04-23 Deepnorth Inc. Semi-Supervised Person Re-Identification Using Multi-View Clustering

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200125897A1 (en) * 2018-10-18 2020-04-23 Deepnorth Inc. Semi-Supervised Person Re-Identification Using Multi-View Clustering
CN110781788A (en) * 2019-10-18 2020-02-11 中国科学技术大学 A method and system for ground classification of field robots based on a small number of annotations

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHARU C.AGGARWAL: "《社会网络数据分析》", 31 December 2016, 武汉大学出版社 *
JINJIANG TANG 等: "Multi-view non-negative matrix factorization for scene recognition", 《JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111721A (en) * 2021-03-17 2021-07-13 同济大学 Human behavior intelligent identification method based on multi-unmanned aerial vehicle visual angle image data driving

Also Published As

Publication number Publication date
CN111814894B (en) 2022-09-09

Similar Documents

Publication Publication Date Title
CN111259786B (en) Pedestrian re-identification method based on synchronous enhancement of appearance and motion information of video
CN112613556B (en) A meta-learning-based approach for sentiment classification of few-shot images
CN106250915B (en) An Automatic Image Annotation Method Fused with Deep Features and Semantic Neighborhoods
CN107506712A (en) Method for distinguishing is known in a kind of human behavior based on 3D depth convolutional networks
CN111709518A (en) A method for enhanced network representation learning based on community awareness and relational attention
CN112036322A (en) Method, system and device for building cross-domain pedestrian re-identification model in multi-task network
Wen et al. Improving face recognition with domain adaptation
CN110210335B (en) A training method, system and device for a person re-identification learning model
CN105117429A (en) Scenario image annotation method based on active learning and multi-label multi-instance learning
CN110399518A (en) A Visual Question Answering Enhancement Method Based on Graph Convolution
CN105787513A (en) Transfer learning design method and system based on domain adaptation under multi-example multi-label framework
CN110163117B (en) Pedestrian re-identification method based on self-excitation discriminant feature learning
Li et al. Two-class 3D-CNN classifiers combination for video copy detection
Su et al. Transfer learning for video recognition with scarce training data for deep convolutional neural network
CN105320764A (en) 3D model retrieval method and 3D model retrieval apparatus based on slow increment features
CN111723600B (en) A feature descriptor for person re-identification based on multi-task learning
Han et al. 3d shape temporal aggregation for video-based clothing-change person re-identification
Wu et al. Inter-attribute awareness for pedestrian attribute recognition
CN116434033A (en) Cross-modal contrast learning method and system for RGB-D image dense prediction task
CN110516540B (en) Group behavior identification method based on multi-stream architecture and long-term and short-term memory network
CN116543192A (en) A small-sample classification method for remote sensing images based on multi-view feature fusion
CN111814894B (en) A Multi-View Semi-Supervised Classification Method Based on Fast Seed Random Walk
Sajid et al. Facial asymmetry-based feature extraction for different applications: a review complemented by new advances
CN104200220B (en) Dynamic texture identification method based on static texture model aggregation
CN114332491A (en) Saliency target detection algorithm based on feature reconstruction

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant