WO2022116479A1 - End-to-end multi-instance learning method based on automatic instance selection - Google Patents

End-to-end multi-instance learning method based on automatic instance selection Download PDF

Info

Publication number
WO2022116479A1
WO2022116479A1 PCT/CN2021/094142 CN2021094142W WO2022116479A1 WO 2022116479 A1 WO2022116479 A1 WO 2022116479A1 CN 2021094142 W CN2021094142 W CN 2021094142W WO 2022116479 A1 WO2022116479 A1 WO 2022116479A1
Authority
WO
WIPO (PCT)
Prior art keywords
instance
network
deep
data
examples
Prior art date
Application number
PCT/CN2021/094142
Other languages
French (fr)
Chinese (zh)
Inventor
詹德川
王魏
李新春
Original Assignee
南京智谷人工智能研究院有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京智谷人工智能研究院有限公司 filed Critical 南京智谷人工智能研究院有限公司
Publication of WO2022116479A1 publication Critical patent/WO2022116479A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the invention relates to the technical field of machine learning for processing multi-instance data by using a deep network, in particular to an end-to-end multi-instance learning method based on automatic example selection.
  • Multi-example learning assumes that there are some important examples in the multi-example package that determine the category of the example package, so how to automatically select important examples is a very critical technology.
  • the present invention endows the deep multi-instance network with the ability to "auto-select examples” so that the entire optimization process can be performed end-to-end.
  • the purpose of the present invention is to provide an end-to-end multiplexer based on automatic example selection, which can not only process a scenario where a group of examples corresponds to a single label, but also can effectively realize the automatic selection of examples in a deep network. Learning by Example.
  • An end-to-end multi-instance learning method based on automatic example selection comprising the following specific steps: (1) collecting multi-instance data, and dividing the data into several multi-instance data packets, wherein the multi-instance data packets include several and the multi-instance data package is set as a set of examples composed of several examples, the multi-instance data package has a label, and the examples are set as a multi-dimensional vector; (2), build a deep multi-instance network , the deep multi-instance network includes an example processing layer, an example selection layer and a classification layer; (3), each multi-instance data packet is processed through a deep multi-instance network, and is trained by forward or backward propagation. Including deep multi-instance network training and deep multi-instance network testing.
  • the multi-instance data collection includes the following specific steps:
  • the construction of a deep multi-instance network includes the following specific steps:
  • the training of the deep multi-instance network includes the following specific steps:
  • the deep multi-instance network test includes the following specific steps:
  • the present invention can automatically select important examples through the example selection layer, on the one hand, the optimization process of the entire deep network can be trained end-to-end, and on the other hand, it can assist in mining a multi-example package.
  • An important example to enhance the interpretability of the model the present invention is applicable to a multi-instance data scenario where a group of examples corresponds to a single label, and uses deep learning technology for training and prediction.
  • FIG. 1 is a flow chart of multi-example data collection according to an embodiment of the present invention
  • FIG. 2 is a flowchart of building a multi-example deep network according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a multi-example deep network training according to an embodiment of the present invention.
  • FIG. 4 is a flowchart of multi-example deep network prediction according to an embodiment of the present invention.
  • An end-to-end multi-instance learning method based on automatic example selection comprising the following specific steps: (1) collecting multi-instance data, and dividing the data into several multi-instance data packets, wherein the multi-instance data packets include several and the multi-instance data package is set as a set of examples composed of several examples, the multi-instance data package has a label, and the examples are set as a multi-dimensional vector; (2), build a deep multi-instance network , the deep multi-instance network includes an example processing layer, an example selection layer and a classification layer; (3), each multi-instance data packet is processed through a deep multi-instance network, and is trained by forward or backward propagation. Including deep multi-instance network training and deep multi-instance network testing.
  • the multi-instance data collection includes the following specific steps in order: determining the meaning of the example and the multi-instance package in the fan fault diagnosis task (step 100), and the example refers to the fan fault signal in a certain frequency domain range.
  • Each example is represented as a vector of length D, the frequency domain range can be divided into K frequency bands, and the collected fan fault signals can be organized into a set of K D-dimensional vectors, that is, a multi-example package, denoted as ⁇ V1,V2,...,VK ⁇ (step 101); if the collected fan signal is from a faulty fan, the label is marked as 1, otherwise it is marked as 0 (step 102); all collected data are represented as ( ⁇ V1, V2,...,VK ⁇ , y), y is 0 or 1 (step 103).
  • the deep multi-instance network testing includes the following specific steps in order: express the collected time series signals in the form of (multi-instance package, ) (step 400); pass the example processing layer, the example selection layer, the aggregation operation and the final classification The layer performs prediction (steps 401, 402, 403, 404); and outputs the fault classification result (405).
  • the present invention can automatically select important examples through the example selection layer, on the one hand, the optimization process of the entire deep network can be trained end-to-end, and on the other hand, it can assist in mining a multi-example package.
  • An important example to enhance the interpretability of the model the present invention is applicable to a multi-instance data scenario where a group of examples corresponds to a single label, and uses deep learning technology for training and prediction.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An end-to-end multi-instance learning method based on automatic instance selection, comprising the following specific steps: (I) acquiring multi-instance data, and dividing the data into several multi-instance data packets, the multi-instance data packets comprising several instances, and the multi-instance data packets being set as a group of instance sets consisting of the several instances, the multi-instance data packets having labels, and the instances being set as a multi-dimensional vector; (II) building a deep multi-instance network, the deep multi-instance network comprising an instance processing layer, an instance selection layer and a classification layer; and (III) processing each of the multi-instance data packets by means of the deep multi-instance network, and performing training by means of forward or reverse propagation, the training comprising deep multi-instance network training and deep multi-instance network testing. The described method can automatically select important instances by means of an instance selection layer, so that the optimization process of the entire deep network can be trained in an end-to-end manner.

Description

一种基于自动示例选择的端到端多示例学习方法An end-to-end multi-instance learning method based on automatic example selection 技术领域technical field
本发明涉及使用深度网络处理多示例数据的机器学习技术领域,具体涉及一种基于自动示例选择的端到端多示例学习方法。The invention relates to the technical field of machine learning for processing multi-instance data by using a deep network, in particular to an end-to-end multi-instance learning method based on automatic example selection.
背景技术Background technique
传统机器学习技术经常假设样本和标签是一一对应的,比如:在文档分类任务中,一篇文档对应一个具体类别;在图像识别任务中,每张图片对应一个标签;在风机故障检测任务中,一个风机的一段时间内的采样信号具有同一个标签。然而,在实际任务中,文档包含很多句子,句子中又有很多短语,不同的句子可能描述的事物涉及了多个方面,只有某些核心句子所描述的事物才决定了该文档所属于的类别;每张图像可以包含多个物体,只有主要的物体才会被标注;风机的故障模式只会在某段时域或者频域范围内才出现。Traditional machine learning techniques often assume that there is a one-to-one correspondence between samples and labels. For example, in document classification tasks, a document corresponds to a specific category; in image recognition tasks, each image corresponds to a label; in fan fault detection tasks , the sampled signals of a fan over a period of time have the same label. However, in practical tasks, the document contains many sentences, and there are many phrases in the sentences. Different sentences may describe things that involve multiple aspects. Only the things described by some core sentences determine the category of the document. ; Each image can contain multiple objects, only the main objects will be marked; the failure mode of the fan will only appear in a certain time domain or frequency domain.
将一篇文档、一张图像、一段时序信号看做一组示例的集合,即多示例包,同时将文档中的多个句子、图像中的多个图像块、时序信号多个时域或者频域范围当做示例,使用多示例学习技术可以有效解决标记粒度的问题。多示例学习假设,多示例包中有一些重要示例决定了该示例包的类别,因此如何去自动选择出重要示例则是非常关键的技术。Consider a document, an image, and a time-series signal as a set of examples, that is, a multi-instance package. The domain range is used as an example, and the problem of labeling granularity can be effectively solved by using multi-instance learning techniques. Multi-example learning assumes that there are some important examples in the multi-example package that determine the category of the example package, so how to automatically select important examples is a very critical technology.
如何在深度网络中端到端地选择重要示例是一项难以实现的技术,主要是因为“示例选择”的过程是不可计算导数的,而深度网络的训练则主要是通过梯度传播进行优化的。因此,本发明赋予深度多示例网络“自动选择示例”的能力,使得整个优化过程可以端到端地进行。How to select important examples end-to-end in deep networks is a difficult technique to implement, mainly because the process of "example selection" is non-derivative, and the training of deep networks is mainly optimized by gradient propagation. Therefore, the present invention endows the deep multi-instance network with the ability to "auto-select examples" so that the entire optimization process can be performed end-to-end.
发明内容SUMMARY OF THE INVENTION
为解决现有技术的不足,本发明的目的在于提供一种既可以处理一组示例 对应单个标签的场景,还可以在深度网络中有效实现示例的自动选择的基于自动示例选择的端到端多示例学习方法。In order to solve the deficiencies of the prior art, the purpose of the present invention is to provide an end-to-end multiplexer based on automatic example selection, which can not only process a scenario where a group of examples corresponds to a single label, but also can effectively realize the automatic selection of examples in a deep network. Learning by Example.
为了实现上述目标,本发明采用如下的技术方案:In order to achieve above-mentioned goal, the present invention adopts following technical scheme:
一种基于自动示例选择的端到端多示例学习方法,包括以下具体步骤:(一)、对多示例数据进行采集,并将数据分成若干个多示例数据包,所述多示例数据包包括若干个示例,且多示例数据包设置为由若干个示例组合成的一组示例集合,所述多示例数据包上具有标签,所述示例设置为一个多维向量;(二)、搭建深度多示例网络,所述深度多示例网络包括示例处理层、示例选择层和分类层;(三)、每个多示例数据包通过深度多示例网络进行处理,通过前向或反向传播进行训练,所述训练包括深度多示例网络训练和深度多示例网络测试。An end-to-end multi-instance learning method based on automatic example selection, comprising the following specific steps: (1) collecting multi-instance data, and dividing the data into several multi-instance data packets, wherein the multi-instance data packets include several and the multi-instance data package is set as a set of examples composed of several examples, the multi-instance data package has a label, and the examples are set as a multi-dimensional vector; (2), build a deep multi-instance network , the deep multi-instance network includes an example processing layer, an example selection layer and a classification layer; (3), each multi-instance data packet is processed through a deep multi-instance network, and is trained by forward or backward propagation. Including deep multi-instance network training and deep multi-instance network testing.
优选地,前述步骤(一)中,多示例数据采集包括以下具体步骤:Preferably, in the aforementioned step (1), the multi-instance data collection includes the following specific steps:
100、确定示例和多示例数据包在具体任务中指代的目标;100. Determine the target that the example and multi-example data packets refer to in specific tasks;
101、将任务中具体数据构造为多示例数据包;101. Construct the specific data in the task into multiple example data packets;
102、为多示例数据包赋予标签;102. Assign a label to the multi-example data packet;
103、将数据组织成多组“(多示例数据包,标签)”的形式。103. Organize the data into the form of multiple groups of "(multiple instance data packets, tags)".
再优选地,前述步骤(二)中,深度多示例网络搭建包括以下具体步骤:Further preferably, in the aforementioned step (2), the construction of a deep multi-instance network includes the following specific steps:
200、搭建示例处理层模块;200. Build an example processing layer module;
201、搭建示例选择层模块;201. Build an example selection layer module;
202、搭建多示例数据包分类层模块。202. Build a multi-example data packet classification layer module.
更优选地,前述步骤(三)中,深度多示例网络训练包括以下具体步骤:More preferably, in the aforementioned step (3), the training of the deep multi-instance network includes the following specific steps:
300、准备一组“(多示例数据包,标签)”当做训练数据;300. Prepare a set of "(multi-example data packets, labels)" as training data;
301、通过示例处理层吃力多示例数据包中的每一个示例;301. Process each instance in the multiple instance data packets through the instance processing layer;
302、通过示例选择层从处理后的所有示例里面选择若干示例;302. Select several examples from all the processed examples through the example selection layer;
303、对选择得到的若干示例进行聚合;303. Aggregate several selected examples;
304、通过多示例数据包分类层对聚合得到的结果进行分类;304. Classify the result obtained by the aggregation through the multi-instance data packet classification layer;
305、根据分类损失函数计算损失值;305. Calculate the loss value according to the classification loss function;
306、通过梯度优化方法优化网络中所有参数;306. Optimize all parameters in the network through the gradient optimization method;
307、重复300-306,直到网络收敛。307. Repeat 300-306 until the network converges.
进一步优选地,前述步骤(三)中,深度多示例网络测试包括以下具体步骤:Further preferably, in the aforementioned step (3), the deep multi-instance network test includes the following specific steps:
400、将要测试的数据组织成“(多示例数据包,标签)”;400. Organize the data to be tested into "(multiple example data packets, labels)";
401、通过示例处理层处理多示例数据包中的每一个示例;401. Process each instance in the multi-instance data packet through an instance processing layer;
402、通过示例选择层从处理后的所有示例里面选择若干示例;402. Select several examples from all the processed examples through the example selection layer;
403、对选择得到的若干示例进行聚合;403. Aggregate several selected examples;
404、通过多示例数据包分类层对聚合得到的结果进行分类;404. Classify the result obtained by the aggregation through the multi-instance data packet classification layer;
405、输出预测结果。405. Output the prediction result.
本发明的有益之处在于:本发明可以通过示例选择层自动地选择重要的示例,一方面使得整个深度网络的优化过程可以端到端地进行训练,另一方面可以辅助挖掘一个多示例包中重要的示例,增强模型的可解释性;本发明适用于一组示例对应单一标签的多示例数据场景,并且使用深度学习技术进行训练和预测。The advantages of the present invention are: the present invention can automatically select important examples through the example selection layer, on the one hand, the optimization process of the entire deep network can be trained end-to-end, and on the other hand, it can assist in mining a multi-example package. An important example to enhance the interpretability of the model; the present invention is applicable to a multi-instance data scenario where a group of examples corresponds to a single label, and uses deep learning technology for training and prediction.
附图说明Description of drawings
图1为本发明实施例的多示例数据采集流程图;FIG. 1 is a flow chart of multi-example data collection according to an embodiment of the present invention;
图2为本发明实施例的多示例深度网络搭建流程图;FIG. 2 is a flowchart of building a multi-example deep network according to an embodiment of the present invention;
图3为本发明实施例的多示例深度网络训练流程图;3 is a flowchart of a multi-example deep network training according to an embodiment of the present invention;
图4为本发明实施例的多示例深度网络预测流程图。FIG. 4 is a flowchart of multi-example deep network prediction according to an embodiment of the present invention.
具体实施方式Detailed ways
以下结合附图和具体实施例对本发明作具体的介绍。The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.
一种基于自动示例选择的端到端多示例学习方法,包括以下具体步骤:(一)、对多示例数据进行采集,并将数据分成若干个多示例数据包,所述多示例数据包包括若干个示例,且多示例数据包设置为由若干个示例组合成的一组示例集合,所述多示例数据包上具有标签,所述示例设置为一个多维向量;(二)、搭建深度多示例网络,所述深度多示例网络包括示例处理层、示例选择层和分类层;(三)、每个多示例数据包通过深度多示例网络进行处理,通过前向或反向传播进行训练,所述训练包括深度多示例网络训练和深度多示例网络测试。An end-to-end multi-instance learning method based on automatic example selection, comprising the following specific steps: (1) collecting multi-instance data, and dividing the data into several multi-instance data packets, wherein the multi-instance data packets include several and the multi-instance data package is set as a set of examples composed of several examples, the multi-instance data package has a label, and the examples are set as a multi-dimensional vector; (2), build a deep multi-instance network , the deep multi-instance network includes an example processing layer, an example selection layer and a classification layer; (3), each multi-instance data packet is processed through a deep multi-instance network, and is trained by forward or backward propagation. Including deep multi-instance network training and deep multi-instance network testing.
结合图1,多示例数据采集包括以下具体步骤依次为:确定示例和多示例包在风机故障诊断任务中具体指代的含义(步骤100),示例指的是风机故障信号在某频域段范围内的信号值;每个示例表示成一个长度为D的向量,频域范围可以划分为K个频段,收集的风机故障信号可以组织成K个D维向量的集合,即多示例包,记作{V1,V2,…,VK}(步骤101);如果采集的风机信号来自于有故障风机,标签记为1,否则记为0(步骤102);将所有收集的数据表示为({V1,V2,…,VK},y)的形式,y为0或者1(步骤103)。With reference to Fig. 1, the multi-instance data collection includes the following specific steps in order: determining the meaning of the example and the multi-instance package in the fan fault diagnosis task (step 100), and the example refers to the fan fault signal in a certain frequency domain range. Each example is represented as a vector of length D, the frequency domain range can be divided into K frequency bands, and the collected fan fault signals can be organized into a set of K D-dimensional vectors, that is, a multi-example package, denoted as {V1,V2,...,VK} (step 101); if the collected fan signal is from a faulty fan, the label is marked as 1, otherwise it is marked as 0 (step 102); all collected data are represented as ({V1, V2,...,VK}, y), y is 0 or 1 (step 103).
结合图2,深度多示例网络搭建包括具体步骤依次为:搭建示例处理层模块(步骤201):示例处理层可以建模为一个全连接网络,记作hi=F(Vi,W_ins),Vi可以是任意一个示例,维度为D,W_ins是示例处理层的参数,最后输出的表示是hi,维度为d;搭建示例选择层模块(步骤202):主要包括打分模块,对每个示例进行打分si=S(hi,W_sel),其中si为示例hi对应的分数,W_sei为相关参数;搭建多示例包分类层模块(步骤203):具体包括g=C(h_agg,W_clf),其中h_agg是对选择示例聚合的结果,W_clf为分类参数,g是最终预测的概率分布,表示该示例包有故障的概率。Referring to Figure 2, the construction of a deep multi-instance network includes the following specific steps: building an example processing layer module (step 201): the example processing layer can be modeled as a fully connected network, denoted as hi=F(Vi, W_ins), Vi can be is any example, the dimension is D, W_ins is the parameter of the example processing layer, the final output representation is hi, and the dimension is d; build an example selection layer module (step 202): mainly includes a scoring module, and scores each example si =S(hi,W_sel), where si is the score corresponding to the example hi, and W_sei is the relevant parameter; build a multi-example bag classification layer module (step 203): specifically include g=C(h_agg, W_clf), where h_agg is the pair of selection The result of example aggregation, W_clf is the classification parameter, g is the probability distribution of the final prediction, indicating the probability that the example package is faulty.
结合图3,深度多示例网络训练包括具体步骤依次为:采样训练数据({V1,V2,…,VK},y)(步骤300);示例处理层处理每一个示例数据,hi=F(Vi,W_ins),i=1,2,…,K(步骤301);根据示例选择层选择重要示例(步骤302),首先对示例进行打分,si=S(hi,W_sel),然后经过ui=Softmax(log(si+gi)/lambda),gi~Gumbel(0,1)分布,然后选择Top-jK个最大的ui,其下标依次为j1,j2,…,jK;对选择的示例进行聚合(步骤303),比如取平均h_agg=(Vj1+Vj2+…+VjK)/jK作为聚合的示例包表示;通过分类层进行分类g=C(h_agg,W_clf)(步骤304);计算损失函数(步骤305),比如通过交叉熵损失计算;通过梯度反向传播优化所有参数(步骤306),优化的参数包括W_ins,W_sel,W_clf;最后迭代步骤300-306直到模型收敛(步骤307)。Referring to Fig. 3, the deep multi-instance network training includes the following specific steps: sampling training data ({V1, V2, ..., VK}, y) (step 300); the example processing layer processes each example data, hi=F(Vi ,W_ins),i=1,2,...,K (step 301); select important examples according to the example selection layer (step 302), first score the examples, si=S(hi, W_sel), and then go through ui=Softmax (log(si+gi)/lambda), gi~Gumbel(0,1) distribution, and then select the Top-jK largest ui, whose subscripts are j1, j2,...,jK in turn; aggregate the selected examples (step 303), for example, take the average h_agg=(Vj1+Vj2+...+VjK)/jK as the aggregated example bag representation; classify g=C(h_agg, W_clf) through the classification layer (step 304); calculate the loss function (step 305), such as calculation through cross-entropy loss; optimize all parameters through gradient backpropagation (step 306), the optimized parameters include W_ins, W_sel, W_clf; finally iterate steps 300-306 until the model converges (step 307).
结合图4,深度多示例网络测试包括具体步骤依次为:将收集到的时序信号表示为(多示例包,)的形式(步骤400);通过示例处理层、示例选择层、聚合操作和最终分类层进行预测(步骤401,402,403,404);输出故障分类结果(405)。Referring to Figure 4, the deep multi-instance network testing includes the following specific steps in order: express the collected time series signals in the form of (multi-instance package, ) (step 400); pass the example processing layer, the example selection layer, the aggregation operation and the final classification The layer performs prediction (steps 401, 402, 403, 404); and outputs the fault classification result (405).
本发明的有益之处在于:本发明可以通过示例选择层自动地选择重要的示例,一方面使得整个深度网络的优化过程可以端到端地进行训练,另一方面可以辅助挖掘一个多示例包中重要的示例,增强模型的可解释性;本发明适用于一组示例对应单一标签的多示例数据场景,并且使用深度学习技术进行训练和预测。The advantages of the present invention are: the present invention can automatically select important examples through the example selection layer, on the one hand, the optimization process of the entire deep network can be trained end-to-end, and on the other hand, it can assist in mining a multi-example package. An important example to enhance the interpretability of the model; the present invention is applicable to a multi-instance data scenario where a group of examples corresponds to a single label, and uses deep learning technology for training and prediction.
以上显示和描述了本发明的基本原理、主要特征和优点。本行业的技术人员应该了解,上述实施例不以任何形式限制本发明,凡采用等同替换或等效变换的方式所获得的技术方案,均落在本发明的保护范围内。The foregoing has shown and described the basic principles, main features and advantages of the present invention. Those skilled in the art should understand that the above-mentioned embodiments do not limit the present invention in any form, and all technical solutions obtained by means of equivalent replacement or equivalent transformation fall within the protection scope of the present invention.

Claims (5)

  1. 一种基于自动示例选择的端到端多示例学习方法,其特征在于,包括以下具体步骤:(一)、对多示例数据进行采集,并将数据分成若干个多示例数据包,所述多示例数据包包括若干个示例,且多示例数据包设置为由若干个示例组合成的一组示例集合,所述多示例数据包上具有标签,所述示例设置为一个多维向量;(二)、搭建深度多示例网络,所述深度多示例网络包括示例处理层、示例选择层和分类层;(三)、每个多示例数据包通过深度多示例网络进行处理,通过前向或反向传播进行训练,所述训练包括深度多示例网络训练和深度多示例网络测试。An end-to-end multi-instance learning method based on automatic example selection, which is characterized by comprising the following specific steps: (1), collecting multi-instance data, and dividing the data into several multi-instance data packets, the multi-instance The data package includes several examples, and the multi-instance data package is set as a set of examples composed of several examples, the multi-instance data package has a label, and the examples are set as a multi-dimensional vector; (2), build A deep multi-instance network, the deep multi-instance network includes an example processing layer, an example selection layer, and a classification layer; (3), each multi-instance data packet is processed through a deep multi-instance network, and trained by forward or backpropagation , the training includes deep multi-instance network training and deep multi-instance network testing.
  2. 根据权利要求1所述的一种基于自动示例选择的端到端多示例学习方法,其特征在于,所述步骤(一)中,多示例数据采集包括以下具体步骤:An end-to-end multi-instance learning method based on automatic example selection according to claim 1, wherein in the step (1), multi-instance data collection comprises the following specific steps:
    100、确定示例和多示例数据包在具体任务中指代的目标;100. Determine the target that the example and multi-example data packets refer to in specific tasks;
    101、将任务中具体数据构造为多示例数据包;101. Construct the specific data in the task into multiple example data packets;
    102、为多示例数据包赋予标签;102. Assign a label to the multi-example data packet;
    103、将数据组织成多组“(多示例数据包,标签)”的形式。103. Organize the data into the form of multiple groups of "(multiple instance data packets, tags)".
  3. 根据权利要求1所述的一种基于自动示例选择的端到端多示例学习方法,其特征在于,所述步骤(二)中,深度多示例网络搭建包括以下具体步骤:An end-to-end multi-instance learning method based on automatic example selection according to claim 1, wherein in the step (2), the construction of a deep multi-instance network includes the following specific steps:
    200、搭建示例处理层模块;200. Build an example processing layer module;
    201、搭建示例选择层模块;201. Build an example selection layer module;
    202、搭建多示例数据包分类层模块。202. Build a multi-example data packet classification layer module.
  4. 根据权利要求1所述的一种基于自动示例选择的端到端多示例学习方法,其特征在于,所述步骤(三)中,深度多示例网络训练包括以下具体步骤:An end-to-end multi-instance learning method based on automatic example selection according to claim 1, wherein in the step (3), the deep multi-instance network training comprises the following specific steps:
    300、准备一组“(多示例数据包,标签)”当做训练数据;300. Prepare a set of "(multi-example data packets, labels)" as training data;
    301、通过示例处理层吃力多示例数据包中的每一个示例;301. Process each instance in the multiple instance data packets through the instance processing layer;
    302、通过示例选择层从处理后的所有示例里面选择若干示例;302. Select several examples from all the processed examples through the example selection layer;
    303、对选择得到的若干示例进行聚合;303. Aggregate several selected examples;
    304、通过多示例数据包分类层对聚合得到的结果进行分类;304. Classify the result obtained by the aggregation through the multi-instance data packet classification layer;
    305、根据分类损失函数计算损失值;305. Calculate the loss value according to the classification loss function;
    306、通过梯度优化方法优化网络中所有参数;306. Optimize all parameters in the network through the gradient optimization method;
    307、重复300-306,直到网络收敛。307. Repeat 300-306 until the network converges.
  5. 根据权利要求1所述的一种基于自动示例选择的端到端多示例学习方法,其特征在于,所述步骤(三)中,深度多示例网络测试包括以下具体步骤:An end-to-end multi-instance learning method based on automatic example selection according to claim 1, wherein in the step (3), the deep multi-instance network test comprises the following specific steps:
    400、将要测试的数据组织成“(多示例数据包,标签)”;400. Organize the data to be tested into "(multiple example data packets, labels)";
    401、通过示例处理层处理多示例数据包中的每一个示例;401. Process each instance in the multi-instance data packet through an instance processing layer;
    402、通过示例选择层从处理后的所有示例里面选择若干示例;402. Select several examples from all the processed examples through the example selection layer;
    403、对选择得到的若干示例进行聚合;403. Aggregate several selected examples;
    404、通过多示例数据包分类层对聚合得到的结果进行分类;404. Classify the result obtained by the aggregation through the multi-instance data packet classification layer;
    405、输出预测结果。405. Output the prediction result.
PCT/CN2021/094142 2020-12-01 2021-05-17 End-to-end multi-instance learning method based on automatic instance selection WO2022116479A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011380710.3A CN112183752B (en) 2020-12-01 2020-12-01 End-to-end multi-example learning method based on automatic example selection
CN202011380710.3 2020-12-01

Publications (1)

Publication Number Publication Date
WO2022116479A1 true WO2022116479A1 (en) 2022-06-09

Family

ID=73918354

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/094142 WO2022116479A1 (en) 2020-12-01 2021-05-17 End-to-end multi-instance learning method based on automatic instance selection

Country Status (2)

Country Link
CN (1) CN112183752B (en)
WO (1) WO2022116479A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112183752B (en) * 2020-12-01 2021-02-19 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980826A (en) * 2017-03-16 2017-07-25 天津大学 A kind of action identification method based on neutral net
CN108764192A (en) * 2018-06-04 2018-11-06 华中师范大学 A kind of more example multi-tag learning methods towards safe city video monitoring application
CN109614979A (en) * 2018-10-11 2019-04-12 北京大学 A kind of data augmentation method and image classification method based on selection with generation
US20200160177A1 (en) * 2018-11-16 2020-05-21 Royal Bank Of Canada System and method for a convolutional neural network for multi-label classification with partial annotations
CN111950602A (en) * 2020-07-21 2020-11-17 江苏大学 Image indexing method based on random gradient descent and multi-example multi-label learning
CN112183752A (en) * 2020-12-01 2021-01-05 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105046284A (en) * 2015-08-31 2015-11-11 鲁东大学 Feature selection based multi-example multi-tag learning method and system
CN106127247B (en) * 2016-06-21 2019-07-09 广东工业大学 Image classification method based on the more example support vector machines of multitask
CN110111399B (en) * 2019-04-24 2023-06-30 上海理工大学 Image text generation method based on visual attention

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106980826A (en) * 2017-03-16 2017-07-25 天津大学 A kind of action identification method based on neutral net
CN108764192A (en) * 2018-06-04 2018-11-06 华中师范大学 A kind of more example multi-tag learning methods towards safe city video monitoring application
CN109614979A (en) * 2018-10-11 2019-04-12 北京大学 A kind of data augmentation method and image classification method based on selection with generation
US20200160177A1 (en) * 2018-11-16 2020-05-21 Royal Bank Of Canada System and method for a convolutional neural network for multi-label classification with partial annotations
CN111950602A (en) * 2020-07-21 2020-11-17 江苏大学 Image indexing method based on random gradient descent and multi-example multi-label learning
CN112183752A (en) * 2020-12-01 2021-01-05 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection

Also Published As

Publication number Publication date
CN112183752B (en) 2021-02-19
CN112183752A (en) 2021-01-05

Similar Documents

Publication Publication Date Title
CN107562918A (en) A kind of mathematical problem knowledge point discovery and batch label acquisition method
CN110263166A (en) Public sentiment file classification method based on deep learning
CN110135459B (en) Zero sample classification method based on double-triple depth measurement learning network
CN109034205A (en) Image classification method based on the semi-supervised deep learning of direct-push
CN108921285B (en) Bidirectional gate control cyclic neural network-based classification method for power quality disturbance
CN111368690B (en) Deep learning-based video image ship detection method and system under influence of sea waves
CN103605990B (en) Integrated multi-classifier fusion classification method and integrated multi-classifier fusion classification system based on graph clustering label propagation
CN110188047A (en) A kind of repeated defects report detection method based on binary channels convolutional neural networks
CN111143567B (en) Comment emotion analysis method based on improved neural network
CN110705630A (en) Semi-supervised learning type target detection neural network training method, device and application
CN111476307B (en) Lithium battery surface defect detection method based on depth field adaptation
CN112836067B (en) Intelligent searching method based on knowledge graph
CN112613428B (en) Resnet-3D convolution cattle video target detection method based on balance loss
CN111351860A (en) Wood internal defect detection method based on Faster R-CNN
CN112966708A (en) Chinese crowdsourcing test report clustering method based on semantic similarity
WO2022116479A1 (en) End-to-end multi-instance learning method based on automatic instance selection
CN114359199A (en) Fish counting method, device, equipment and medium based on deep learning
CN117516937A (en) Rolling bearing unknown fault detection method based on multi-mode feature fusion enhancement
CN112489689B (en) Cross-database voice emotion recognition method and device based on multi-scale difference countermeasure
CN113283467A (en) Weak supervision picture classification method based on average loss and category-by-category selection
CN117574262A (en) Underwater sound signal classification method, system and medium for small sample problem
CN111984790A (en) Entity relation extraction method
CN116502165A (en) Finite sample mechanical fault diagnosis method for self-supervision learning
CN115374859B (en) Classification method for unbalanced and multi-class complex industrial data
CN116361454A (en) Automatic course teaching case assessment method based on Bloom classification method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21899522

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21899522

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 13/11/2023)