WO2020143610A1 - Data processing method and apparatus, computer device, and storage medium - Google Patents

Data processing method and apparatus, computer device, and storage medium Download PDF

Info

Publication number
WO2020143610A1
WO2020143610A1 PCT/CN2020/070651 CN2020070651W WO2020143610A1 WO 2020143610 A1 WO2020143610 A1 WO 2020143610A1 CN 2020070651 W CN2020070651 W CN 2020070651W WO 2020143610 A1 WO2020143610 A1 WO 2020143610A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
model
processing
sub
processed
Prior art date
Application number
PCT/CN2020/070651
Other languages
French (fr)
Chinese (zh)
Inventor
何德裕
Original Assignee
鲁班嫡系机器人(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 鲁班嫡系机器人(深圳)有限公司 filed Critical 鲁班嫡系机器人(深圳)有限公司
Publication of WO2020143610A1 publication Critical patent/WO2020143610A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)

Abstract

A data processing method and apparatus, a computer device, and a storage medium. The method comprises: obtaining data to be processed (202); inputting the data into a data processing model (204); obtaining a preprocessing result output by each preprocessing sub-model in the data processing model (206); counting a pre-determination probability corresponding to each pre-processing result (208); and generating a processing result corresponding to the data according to the pre-determination probability corresponding to each preprocessing result (210). According to the pre-determination probability corresponding to each preprocessing result, the consistency of a plurality of preprocessing results can be verified, and the processing result corresponding to the data to be processed is generated according to each pre-determination probability, so that the model data processing accuracy is improved.

Description

数据处理方法、装置、计算机设备和存储介质Data processing method, device, computer equipment and storage medium 技术领域Technical field
本申请涉及计算机技术领域,特别是涉及一种数据处理方法、装置、计算机设备和存储介质。This application relates to the field of computer technology, and in particular, to a data processing method, device, computer equipment, and storage medium.
背景技术Background technique
随着计算机技术的发展,出现了机器学习技术。在机器学习时,首先需要建立模型、给模型提供训练数据进行训练,利用训练后的模型对未知数据进行预测。机器学习是人工智能的核心,已经广泛地应用于识别及分类等领域。With the development of computer technology, machine learning technology has emerged. In machine learning, you first need to build a model, provide training data to the model for training, and use the trained model to predict unknown data. Machine learning is the core of artificial intelligence and has been widely used in recognition and classification.
然而,传统的机器学习技术中,为了提高模型对输入数据处理的准确度,往往是在训练时给模型输入大量的训练数据,以便训练数据可以涵盖各种情况。即便如此,单一的模型在进行数据处理时仍存在出错的可能,且模型的处理结果只能由人工随机查验,准确性较低。However, in traditional machine learning techniques, in order to improve the accuracy of the model in processing input data, a large amount of training data is often input to the model during training so that the training data can cover various situations. Even so, there is still the possibility of errors in the data processing of a single model, and the processing results of the model can only be checked manually by humans with low accuracy.
发明内容Summary of the invention
基于此,有必要针对上述技术问题,提供一种能够提高模型对数据处理准确性的数据处理方法、装置、计算机设备和存储介质。Based on this, it is necessary to provide a data processing method, device, computer device, and storage medium that can improve the accuracy of data processing by the model in view of the above technical problems.
一种数据处理方法,所述方法包括:A data processing method, the method includes:
获取待处理数据;Obtain pending data;
将所述待处理数据输入数据处理模型;Input the data to be processed into a data processing model;
获取所述数据处理模型中各预处理子模型分别输出的预处理结果;Obtaining preprocessing results respectively output by each preprocessing sub-model in the data processing model;
统计各预处理结果分别对应的预判概率;Count the pre-judgement probability corresponding to each pre-processing result;
根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The processing results corresponding to the data to be processed are generated according to the pre-judgment probabilities corresponding to the respective preprocessing results.
一种数据处理装置,所述装置包括:A data processing device, the device includes:
数据获取模块,用于获取待处理数据;Data acquisition module for acquiring data to be processed;
数据输入模块,用于将所述待处理数据输入数据处理模型;A data input module for inputting the data to be processed into a data processing model;
结果获取模块,用于获取所述数据处理模型中各预处理子模型分别输出的预处理结果;A result obtaining module, configured to obtain the preprocessing results respectively output by each preprocessing sub-model in the data processing model;
概率统计模块,用于统计各预处理结果分别对应的预判概率;Probability statistics module, used to count the pre-judgment probability corresponding to each pre-processing result;
结果生成模块,用于根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The result generation module is configured to generate a processing result corresponding to the data to be processed according to the pre-judgement probabilities corresponding to the respective pre-processing results.
一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现以下步骤:A computer device includes a memory, a processor, and a computer program stored on the memory and executable on the processor. The processor implements the computer program to implement the following steps:
获取待处理数据;Obtain pending data;
将所述待处理数据输入数据处理模型;Input the data to be processed into a data processing model;
获取所述数据处理模型中各预处理子模型分别输出的预处理结果;Obtaining preprocessing results respectively output by each preprocessing sub-model in the data processing model;
统计各预处理结果分别对应的预判概率;Count the pre-judgement probability corresponding to each pre-processing result;
根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The processing results corresponding to the data to be processed are generated according to the pre-judgment probabilities corresponding to the respective preprocessing results.
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:A computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the following steps are realized:
获取待处理数据;Obtain pending data;
将所述待处理数据输入数据处理模型;Input the data to be processed into a data processing model;
获取所述数据处理模型中各预处理子模型分别输出的预处理结果;Obtaining preprocessing results respectively output by each preprocessing sub-model in the data processing model;
统计各预处理结果分别对应的预判概率;Count the pre-judgment probability corresponding to each pre-processing result;
根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The processing results corresponding to the data to be processed are generated according to the pre-judgment probabilities corresponding to the respective preprocessing results.
上述数据处理方法、装置、计算机设备和存储介质,获取待处理数据,将 待处理数据输入数据处理模型中的多个预处理子模型,由多个预处理子模型同时对待处理数据进行处理;获取各预处理子模型分别输出的预处理结果,并统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,可以验证多个预处理结果的一致性,并根据各预判概率生成待处理数据对应的处理结果,提高了模型数据处理的准确性。The above data processing method, device, computer equipment and storage medium obtain data to be processed, input the data to be processed into multiple pre-processing sub-models in the data processing model, and the multiple pre-processing sub-models simultaneously process the data to be processed; The pre-processing results output by each pre-processing sub-model separately, and the pre-judgement probability corresponding to each pre-processing result is counted; according to the pre-judgement probability corresponding to each pre-processing result, the consistency of multiple pre-processing results can be verified and Each prediction probability generates a processing result corresponding to the data to be processed, which improves the accuracy of model data processing.
附图说明BRIEF DESCRIPTION
图1为一个实施例中数据处理方法的应用环境图;FIG. 1 is an application environment diagram of a data processing method in an embodiment;
图2为一个实施例中数据处理方法的流程示意图;2 is a schematic flowchart of a data processing method in an embodiment;
图3为一个实施例中训练初始子模型的步骤的流程示意图;3 is a schematic flowchart of the steps of training an initial sub-model in an embodiment;
图4为另一个实施例中训练初始子模型的步骤的流程示意图;4 is a schematic flowchart of steps of training an initial sub-model in another embodiment;
图5为一个实施例中构建数据处理模型的步骤的流程示意图;5 is a schematic flowchart of steps of constructing a data processing model in an embodiment;
图6为一个实施例中结构来源模型的结构示意图;6 is a schematic structural diagram of a structural source model in an embodiment;
图7为一个实施例中生成处理结果的步骤的流程示意图;7 is a schematic flowchart of steps of generating a processing result in an embodiment;
图8为一个实施例中生成处理异常通知的步骤的流程示意图;8 is a schematic flowchart of steps for generating an exception notification in an embodiment;
图9为一个实施例中训练数据的示意图;9 is a schematic diagram of training data in an embodiment;
图10为一个实施例中数据处理的示意图;10 is a schematic diagram of data processing in an embodiment;
图11为一个实施例中数据处理装置的结构框图;11 is a structural block diagram of a data processing device in an embodiment;
图12为一个实施例中计算机设备的内部结构图。FIG. 12 is an internal structure diagram of a computer device in an embodiment.
具体实施方式detailed description
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clear, the following describes the present application in further detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.
本申请提供的数据处理方法,可以应用于如图1所示的应用环境中,应用环境中可以包括终端102和服务器104,终端102通过网络与服务器104进行通信。该方法既可以应用在终端102,也可以应用于服务器104。其中,终端102可以但不限于是各种工业计算机、个人计算机、笔记本电脑、智能手机和平板 电脑。服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The data processing method provided by the present application may be applied to the application environment shown in FIG. 1, and the application environment may include a terminal 102 and a server 104, and the terminal 102 communicates with the server 104 through a network. This method can be applied to both the terminal 102 and the server 104. Among them, the terminal 102 may be, but not limited to, various industrial computers, personal computers, notebook computers, smart phones, and tablet computers. The server 104 may be implemented by an independent server or a server cluster composed of multiple servers.
在一个实施例中,如图2所示,提供了一种数据处理方法,以该方法应用于图1中的终端为例进行说明,包括以下步骤:In one embodiment, as shown in FIG. 2, a data processing method is provided. The method is applied to the terminal in FIG. 1 as an example for illustration, and includes the following steps:
步骤202,获取待处理数据。Step 202: Obtain data to be processed.
其中,待处理数据是在使用模型时,被输入模型进行处理的数据。Among them, the data to be processed is the data input to the model for processing when the model is used.
具体地,终端获取用户触发的数据处理指令,对数据处理指令进行解析,得到数据处理指令中待处理数据的存储地址。终端访问存储地址对应的存储空间,从访问的存储空间中提取存储的待处理数据。Specifically, the terminal obtains the data processing instruction triggered by the user, analyzes the data processing instruction, and obtains the storage address of the data to be processed in the data processing instruction. The terminal accesses the storage space corresponding to the storage address, and extracts the stored data to be processed from the accessed storage space.
在一个实施例中,终端获取录入的数据标识,根据数据标识生成数据获取请求,通过网络将数据获取请求发送至服务器。服务器接收数据获取请求,根据数据获取请求中的数据标识从数据库中提取待处理数据,通过网络将待处理数据发送至终端。In one embodiment, the terminal acquires the entered data identifier, generates a data acquisition request according to the data identifier, and sends the data acquisition request to the server through the network. The server receives the data acquisition request, extracts the data to be processed from the database according to the data identifier in the data acquisition request, and sends the data to be processed to the terminal through the network.
在一个实施例中,终端安装有图像采集装置。终端获取数据处理指令后,启动图像采集装置,终端将图像采集装置采集到的图像数据作为待处理数据。In one embodiment, the terminal is equipped with an image acquisition device. After the terminal acquires the data processing instruction, the image acquisition device is started, and the terminal uses the image data collected by the image acquisition device as data to be processed.
步骤204,将待处理数据输入数据处理模型。Step 204: Input the data to be processed into the data processing model.
其中,数据处理模型是由多个预处理子模型组成的模型,用于对输入的待处理数据进行处理。Among them, the data processing model is a model composed of multiple pre-processing sub-models, which is used to process the input data to be processed.
具体地,终端获取到待处理数据后,触发数据输入指令,根据数据输入指令将获取到的待处理数据输入数据处理模型中。Specifically, after acquiring the data to be processed, the terminal triggers a data input instruction, and inputs the acquired data to be processed into the data processing model according to the data input instruction.
步骤206,获取数据处理模型中各预处理子模型分别输出的预处理结果。Step 206: Obtain the preprocessing results respectively output by each preprocessing sub-model in the data processing model.
其中,预处理结果是数据处理模型中的预处理子模型对待处理数据的处理结果。Among them, the preprocessing result is the processing result of the data to be processed in the preprocessing sub-model in the data processing model.
具体地,数据处理模型由多个预处理子模型组成。终端将待处理数据输入数据处理模型后,数据处理模型中的每个预处理子模型都将对待处理数据进行处理,输出各自的预处理结果。终端获取数据处理模型中,每一个预处理子模型对待处理数据的预处理结果。Specifically, the data processing model is composed of multiple preprocessing sub-models. After the terminal inputs the data to be processed into the data processing model, each pre-processing sub-model in the data processing model will process the data to be processed and output its respective pre-processing results. In the data processing model obtained by the terminal, each pre-processing sub-model preprocesses the data to be processed.
步骤208,统计各预处理结果分别对应的预判概率。In step 208, the pre-judgment probability corresponding to each pre-processing result is counted.
其中,预判概率是各预处理结果在全部预处理子模型输出的预处理结果中出现的概率,可以是各预处理结果出现的次数与预处理结果总数的比值。The pre-judgement probability is the probability that each pre-processing result appears in the pre-processing results output by all pre-processing sub-models, and may be the ratio of the number of times each pre-processing result appears to the total number of pre-processing results.
具体地,终端读取各预处理子模型输出的预处理结果,并统计不同的预处理结果出现的次数。终端分别计算各预处理结果出现的次数与预处理结果总数的比值,将计算得到的比值作为各预处理结果的预判概率。Specifically, the terminal reads the preprocessing results output by each preprocessing sub-model, and counts the number of occurrences of different preprocessing results. The terminal calculates the ratio of the number of occurrences of each pre-processing result to the total number of pre-processing results, and uses the calculated ratio as the prediction probability of each pre-processing result.
步骤210,根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果。Step 210: Generate a processing result corresponding to the data to be processed according to the pre-judgement probability corresponding to each pre-processing result.
其中,处理结果是数据处理模型对输入的待处理数据的输出结果。The processing result is the output result of the data processing model to the input data to be processed.
具体地,终端统计得到各预处理结果分别对应的预判概率后,获取预设概率条件,逐个将预处理结果分别对应的预判概率同预设概率条件进行比较,筛选符合预设概率条件的预判概率所对应的预处理结果。Specifically, the terminal statistically obtains the pre-judgement probabilities corresponding to each pre-processing result, obtains the preset probability conditions, compares the pre-judgement probabilities corresponding to the pre-processing results with the pre-set probability conditions one by one, and selects The pre-processing result corresponding to the prediction probability.
当终端筛选到符合预设概率条件的预判概率所对应的预处理结果时,终端完成对待处理数据的处理,将筛选到的预处理结果作为输入的待处理数据对应的处理结果。When the terminal filters out the preprocessing result corresponding to the pre-judgement probability that meets the preset probability condition, the terminal completes the processing of the data to be processed, and uses the filtered preprocessing result as the processing result corresponding to the input data to be processed.
其中,预设概率条件是预先设置好的用于从各预处理结果中筛选特定预处理结果的条件。预设概率条件可以是预判概率大于等于预设的概率阈值。Wherein, the preset probability condition is a condition set in advance for screening specific pre-processing results from each pre-processing result. The preset probability condition may be that the predicted probability is greater than or equal to a preset probability threshold.
在一个实施例中,终端通过排序算法对各预处理结果的预判概率进行排序,排序完成后,选取最高的预判概率所对应的预处理结果。排序算法可以是冒泡排序、选择排序、归并排序中的至少一种。In one embodiment, the terminal sorts the pre-judgement probabilities of the pre-processing results through a sorting algorithm. After the sorting is completed, the pre-processing result corresponding to the highest pre-judgement probability is selected. The sorting algorithm may be at least one of bubble sorting, selection sorting, and merge sorting.
在一个实施例中,终端根据各预处理结果和各预处理结果所对应的预判概率和处理结果,生成处理结果通知,通过显示屏对处理结果通知进行显示。In one embodiment, the terminal generates a processing result notification according to each preprocessing result and the pre-judgment probability and processing result corresponding to each preprocessing result, and displays the processing result notification through the display screen.
在一个实施例中,数据处理模型可以是任何种类的模型,待处理数据可以是与数据处理模型对应的任何种类的数据。比如,数据处理模型是识别模型或分类模型时,待处理数据可以是图像数据,数据处理模型的处理结果为识别到的产品缺陷、物品种类以及数量等。当数据处理模型是轨迹规划模型时,待处理数据可以是图像数据或物体的位姿数据,处理结果为物体的移动路径轨迹。In one embodiment, the data processing model may be any kind of model, and the data to be processed may be any kind of data corresponding to the data processing model. For example, when the data processing model is a recognition model or a classification model, the data to be processed may be image data, and the processing result of the data processing model is the identified product defect, item type, and quantity. When the data processing model is a trajectory planning model, the data to be processed may be image data or posture data of the object, and the processing result is the moving path trajectory of the object.
本实施例中,通过获取待处理数据,将待处理数据输入数据处理模型中的多个预处理子模型,由多个预处理子模型同时对待处理数据进行处理;获取各 预处理子模型分别输出的预处理结果,并统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,可以验证多个预处理结果的一致性,并根据各预判概率生成待处理数据对应的处理结果,提高了模型数据处理的准确性。In this embodiment, by acquiring the data to be processed, the data to be processed is input into multiple pre-processing sub-models in the data processing model, and the multiple pre-processing sub-models simultaneously process the data to be processed; each pre-processing sub-model is obtained and output separately The pre-processing results of each pre-processing result are counted, and the pre-judging probabilities corresponding to each pre-processing result are counted; according to the pre-judge probabilities corresponding to each pre-processing result, the consistency of multiple pre-processing results can be verified, and the to-be-processed is generated according to each pre-judgement probability The processing results corresponding to the data improve the accuracy of model data processing.
如图3所示,在一个实施例中,步骤202之前还包括训练初始子模型的步骤,该步骤具体包括如下步骤:As shown in FIG. 3, in one embodiment, before step 202, a step of training an initial sub-model is further included. This step specifically includes the following steps:
步骤302,获取多个不同的初始子模型和训练数据。Step 302: Acquire multiple different initial sub-models and training data.
其中,初始子模型是未经参数调整的初始模型。训练数据是用于对初始子模型进行训练的数据样本。Among them, the initial sub-model is the initial model without parameter adjustment. Training data are data samples used to train the initial sub-model.
具体地,在使用模型之前,首先需要训练出用于数据处理的机器学习模型。终端获取用户触发的模型训练指令,对模型训练指令进行解析,得到初始子模型存储地址和训练数据存储地址。终端根据初始子模型存储地址和训练数据存储地址,从存储空间中读取多个初始子模型和训练数据,将读取到的各初始子模型和训练数据加载到内存中。多个初始子模型可以是不同种类的模型,也可以是相同种类但初始模型参数不同的模型,也可以同时包括不同种类的模型和相同种类但初始模型参数不同的模型。Specifically, before using the model, a machine learning model for data processing needs to be trained first. The terminal obtains the model training instruction triggered by the user, analyzes the model training instruction, and obtains the initial sub-model storage address and the training data storage address. The terminal reads multiple initial sub-models and training data from the storage space according to the initial sub-model storage address and the training data storage address, and loads the read initial sub-models and training data into the memory. The multiple initial sub-models may be different types of models, or models of the same type but with different initial model parameters, or different types of models and models of the same type but with different initial model parameters.
步骤304,以训练数据对各初始子模型进行训练,得到多个预处理子模型。Step 304: Train each initial sub-model with training data to obtain multiple pre-processing sub-models.
其中,预处理子模型是初始子模型训练完成后得到的模型。Among them, the pre-processing sub-model is the model obtained after the initial sub-model training is completed.
具体地,终端将训练数据输入各初始子模型。初始子模型依据输入的训练数据进行训练,调整模型参数,直至满足训练停止条件时停止训练,得到多个预处理子模型。训练数据可以包含标签数据,初始子模型根据训练数据输出初始结果,通过初始结果和标签数据确定预测误差,若预测误差大于等于预设的误差阈值,按照最小化预测误差的方向调整模型参数,并迭代执行上述训练过程,直至预测误差小于预设的误差阈值时,停止训练。Specifically, the terminal inputs the training data into each initial sub-model. The initial sub-model is trained according to the input training data, and the model parameters are adjusted until the training stop condition is met, and the training is stopped, and multiple pre-processing sub-models are obtained. The training data may include label data. The initial sub-model outputs initial results based on the training data. The initial results and label data determine the prediction error. If the prediction error is greater than or equal to the preset error threshold, the model parameters are adjusted in the direction that minimizes the prediction error, and Iteratively execute the above training process until the prediction error is less than the preset error threshold, then stop training.
在一个实施例中,训练停止条件可以是初始子模型在训练中的迭代次数。当初始子模型在训练中的迭代次数大于等于预设的迭代次数阈值时,停止训练,得到预处理子模型。In one embodiment, the training stop condition may be the number of iterations of the initial sub-model in training. When the number of iterations of the initial sub-model during training is greater than or equal to the preset threshold of the number of iterations, the training is stopped and the pre-processed sub-model is obtained.
步骤306,根据多个预处理子模型构建数据处理模型。Step 306: Construct a data processing model according to multiple pre-processing sub-models.
具体地,终端对各初始子模型训练完毕后,得到多个预处理子模型。终端以各预处理子模型为基础,组建预处理子模型集群,将得到的预处理子模型集群作为数据处理模型。Specifically, after the terminal trains each initial sub-model, multiple pre-processing sub-models are obtained. Based on each pre-processing sub-model, the terminal forms a pre-processing sub-model cluster, and uses the obtained pre-processing sub-model cluster as a data processing model.
本实施例中,获取多个不同的初始子模型和训练数据,将相同的训练数据输入到多个不同的初始子模型中进行训练,得到多个预处理子模型,提高了模型训练的可靠性,根据多个预处理子模型构建数据处理模型,提高了获取数据处理模型的效率。In this embodiment, multiple different initial sub-models and training data are obtained, and the same training data is input into multiple different initial sub-models for training to obtain multiple pre-processing sub-models, which improves the reliability of model training , Based on multiple pre-processing sub-models to build data processing models, improving the efficiency of obtaining data processing models.
如图4所示,在另一个实施例中,步骤202之前还包括训练初始子模型的步骤,该步骤具体包括如下步骤:As shown in FIG. 4, in another embodiment, before step 202, a step of training an initial sub-model is further included. This step specifically includes the following steps:
步骤402,获取多个相同的初始子模型和训练数据。Step 402: Acquire multiple identical initial submodels and training data.
具体地,终端获取模型训练指令,从模型训练指令中提取初始子模型存储地址、训练数据存储地址和训练参数。终端根据初始子模型存储地址和训练数据存储地址从存储空间中读取初始子模型和训练数据。终端根据训练参数中的预设子模型数量对初始子模型进行复制,得到与预设子模型数量匹配的多个相同的初始子模型。Specifically, the terminal obtains the model training instruction, and extracts the initial sub-model storage address, the training data storage address, and the training parameters from the model training instruction. The terminal reads the initial sub-model and training data from the storage space according to the initial sub-model storage address and the training data storage address. The terminal copies the initial sub-model according to the number of preset sub-models in the training parameters to obtain multiple identical initial sub-models that match the number of preset sub-models.
步骤404,从训练数据中抽取与多个初始子模型一一对应的多个训练样本集。Step 404: Extract multiple training sample sets corresponding to multiple initial sub-models from the training data.
具体地,对于每一个初始子模型,终端按照预设方式从训练数据中抽取部分数据,根据抽取到的部分数据得到训练样本集。终端可以从训练数据中随机抽取部分数据,根据随机抽取到的部分数据构建训练样本集。Specifically, for each initial sub-model, the terminal extracts part of the data from the training data in a preset manner, and obtains the training sample set according to the extracted part of the data. The terminal may randomly extract partial data from the training data, and construct a training sample set according to the randomly extracted partial data.
在一个实施例中,终端对训练数据进行划分,得到与初始子模型数量相匹配的多个训练数据子集,将多个训练数据子集作为与多个初始子模型一一对应的多个训练样本集。例如,有5个初始子模型,训练数据中包含10000张图片,终端按照第1-2000张、第2001-4000张、第4001-第6000张、第6001-8000张、第8001-第10000张的划分方式得到5个训练数据子集,将5个训练数据子集作为与5个初始子模型一一对应的训练样本集。In one embodiment, the terminal divides the training data to obtain multiple training data subsets that match the number of initial sub-models, and uses the multiple training data subsets as multiple trainings corresponding to the multiple initial sub-models. Sample set. For example, there are 5 initial sub-models, the training data contains 10000 pictures, the terminal according to the first 1-2000, 2001-4000, 4001-6000, 6001-8000, 8001-10000 5 partitions of the training data are obtained, and the five training data subsets are used as the training sample set corresponding to the five initial submodels.
步骤406,分别根据每个训练样本集训练对应的初始子模型,得到多个预处理子模型。Step 406: Train the corresponding initial sub-model according to each training sample set to obtain multiple pre-processing sub-models.
具体地,对于每个初始子模型,终端获取与初始子模型对应的训练样本集, 根据训练样本集对初始子模型进行训练。终端对多个初始子模型训练结束后,得到多个预处理子模型。Specifically, for each initial sub-model, the terminal obtains a training sample set corresponding to the initial sub-model, and trains the initial sub-model according to the training sample set. After the terminal trains the multiple initial sub-models, multiple pre-processing sub-models are obtained.
在一个实施例中,终端对初始子模型进行训练时,将待输入初始子模型的训练数据分为多组。终端首先将一组训练数据输入初始子模型,得到初始子模型输出的初始结果,将初始结果与标签数据进行对比并计算预测误差,根据预测误差调整模型参数;再将另一组训练数据输入调整后的初始子模型,重复上述过程,直至预测误差收敛,得到预处理子模型。In one embodiment, when the terminal trains the initial sub-model, the training data to be input into the initial sub-model is divided into multiple groups. The terminal first enters a set of training data into the initial sub-model to obtain the initial result of the initial sub-model output, compares the initial result with the label data and calculates the prediction error, adjusts the model parameters according to the prediction error; then inputs another set of training data to adjust After the initial sub-model, repeat the above process until the prediction error converges to obtain the pre-processing sub-model.
步骤408,根据多个预处理子模型构建数据处理模型。Step 408: Construct a data processing model according to multiple pre-processing sub-models.
具体地,终端对多个相同的初始子模型训练完成后,得到多个预处理子模型。终端根据多个预处理子模型构建子模型集群,将构建得到的子模型集群作为数据处理模型。Specifically, after the terminal trains multiple identical initial sub-models, multiple pre-processing sub-models are obtained. The terminal constructs a sub-model cluster according to multiple pre-processing sub-models, and uses the constructed sub-model cluster as a data processing model.
本实施例中,获取多个相同的初始子模型和训练数据,从训练数据中抽取与多个初始子模型一一对应的多个训练样本集,对于多个相同的初始子模型,采用控制变量的方法,分别输入不同的训练样本集进行训练,得到多个预处理子模型,提高了模型训练的可靠性,根据多个预处理子模型构建数据处理模型,提高了获取数据处理模型的效率。In this embodiment, multiple identical initial submodels and training data are acquired, and multiple training sample sets corresponding to the multiple initial submodels are extracted from the training data. For multiple identical initial submodels, control variables are used. The method of inputting different training sample sets for training respectively obtains multiple pre-processing sub-models, which improves the reliability of model training. Building a data processing model based on multiple pre-processing sub-models improves the efficiency of obtaining data processing models.
如图5所示,在一个实施例中,步骤204具体还包括构建数据处理模型的步骤,该步骤具体包括如下步骤:As shown in FIG. 5, in one embodiment, step 204 specifically includes the step of building a data processing model. The step specifically includes the following steps:
步骤502,从训练好的结构来源模型中抽取多个不同的模型子结构。Step 502: Extract multiple different model substructures from the trained structure source model.
其中,模型子结构是从结构来源模型中随机抽取得到的子结构,可以正常执行模型的功能。结构来源模型的结构包含多层链接,可以对待处理数据进行处理。Among them, the model substructure is a substructure randomly extracted from the source model of the structure, and can normally perform the function of the model. The structure of the structure source model contains multiple layers of links, and the data to be processed can be processed.
具体地,终端预存有训练好的结构来源模型。终端对获取到的数据处理指令进行解析,得到结构来源模型的存储地址,根据结构来源模型的存储地址,从存储空间中读取结构来源模型。终端读取到结构来源模型后,从结构来源模型中进行多次抽取,每次抽取得到一个模型子结构。在进行每次抽取时,终端分别从结构来源模型中随机抽掉某个链接,得到多个不同的模型子结构。Specifically, the terminal pre-stores a trained structure source model. The terminal parses the obtained data processing instruction to obtain the storage address of the structure source model, and reads the structure source model from the storage space according to the storage address of the structure source model. After reading the structure source model, the terminal performs multiple extractions from the structure source model, and each time a model substructure is extracted. During each extraction, the terminal randomly extracts a link from the structure source model to obtain multiple different model substructures.
在一个实施例中,结构来源模型的结构如图6所示。结构来源模型可以是 神经网络模型,包括输入层、隐藏层和输出层。终端可以抽掉任意两层之间的某条链接,得到多个不同的模型子结构。比如,终端可以抽掉图6中x 1到h 1的链接,也可以抽掉h 2到o 2的链接。 In one embodiment, the structure of the structure source model is shown in FIG. 6. The structure source model may be a neural network model, including an input layer, a hidden layer, and an output layer. The terminal can extract a certain link between any two layers to obtain multiple different model substructures. For example, the terminal may remove the link from x 1 to h 1 in FIG. 6 or the link from h 2 to o 2 .
在一个实施例中,终端在获取待处理数据之前,先要通过训练得到结构来源模型。终端根据模型训练指令获取初始子模型和训练数据,根据训练数据对初始子模型进行训练,训练结束后,得到结构来源模型。In one embodiment, before acquiring the data to be processed, the terminal first obtains the structure source model through training. The terminal obtains the initial sub-model and training data according to the model training instructions, and trains the initial sub-model according to the training data. After the training, the structure source model is obtained.
步骤504,以每个模型子结构作为数据处理模型中的预处理子模型,构建数据处理模型。Step 504: Use each model substructure as a preprocessing submodel in the data processing model to construct a data processing model.
具体地,终端通过抽取得到多个不同的模型子结构后,将每个模型子结构作为预处理子模型。终端根据多个预处理子模型构建子模型集群,得到数据处理模型。Specifically, after the terminal obtains a plurality of different model substructures through extraction, each model substructure is used as a preprocessing submodel. The terminal constructs a sub-model cluster according to multiple pre-processing sub-models to obtain a data processing model.
步骤506,将待处理数据分别输入数据处理模型中的各预处理子模型。Step 506: Input the data to be processed into each pre-processing sub-model in the data processing model.
具体地,终端构建数据处理模型完成后,根据预处理子模型的数量对待处理数据进行复制,得到与预处理子模型数量匹配的多份相同的待处理数据。终端分别将每份待处理数据输入数据处理模型中的各预处理子模型。Specifically, after the data processing model is constructed by the terminal, the data to be processed is copied according to the number of pre-processing sub-models to obtain multiple pieces of the same data to be processed matching the number of pre-processing sub-models. The terminal inputs each piece of data to be processed into each preprocessing sub-model in the data processing model.
在一个实施例中,终端中的预存模型分别被标识为第一模型、第二模型和/或第三模型。其中,第一模型和第二模型为数据处理模型,第一模型的各预处理子模型由多个不同的初始子模型训练得到,第二模型的各预处理子模型由多个相同的初始子模型训练得到。第三模型为结构来源模型。终端获取到待处理数据后,获取预存模型的模型标识,当获取到的模型标识为第一模型标识或第二模型标识时,终端将待处理数据输入第一模型或第二模型中的各预处理子模型;当获取到的模型标识为第三模型标识时,终端从第三模型中抽取多个不同的模型子结构,以每个模型子结构作为预处理子模型,构建数据处理模型,将待处理数据分别输入数据处理模型中的各预处理子模型。数据处理模型的各预处理子模型来源可以是第一模型、第二模型和/或第三模型中的一种。In one embodiment, the pre-stored models in the terminal are identified as the first model, the second model, and/or the third model, respectively. Among them, the first model and the second model are data processing models, each pre-processing sub-model of the first model is trained by multiple different initial sub-models, and each pre-processing sub-model of the second model is obtained by multiple identical initial sub-models The model is trained. The third model is the structure source model. After the terminal obtains the data to be processed, it obtains the model identifier of the pre-stored model. When the obtained model identifier is the first model identifier or the second model identifier, the terminal inputs the data to be processed into each of the first model or the second model. Processing sub-models; when the acquired model identifier is the third model identifier, the terminal extracts multiple different model sub-structures from the third model, and uses each model sub-structure as a pre-processing sub-model to construct a data processing model. The data to be processed is input to each pre-processing sub-model in the data processing model. The source of each pre-processing sub-model of the data processing model may be one of the first model, the second model, and/or the third model.
本实施例中,从训练好的结构来源模型中通过抽取得到多个不同的模型子结构,以每个模型子结构作为预处理子模型,构建数据处理模型,将待处理数据分别输入数据处理模型中的各预处理子模型。通过从结构来源模型中随机抽 取模型子结构,将模型子结构作为预处理子模型,保证了选取的预处理子模型的可靠性。In this embodiment, multiple different model substructures are obtained by extracting from the trained structure source model, each model substructure is used as a preprocessing submodel, a data processing model is constructed, and data to be processed are input into the data processing model respectively Each pre-processing sub-model in. By randomly extracting the model substructure from the source model of the structure and using the model substructure as the preprocessing submodel, the reliability of the selected preprocessing submodel is guaranteed.
如图7所示,在一个实施例中,步骤210具体还包括生成处理结果的步骤,该步骤具体包括如下步骤:As shown in FIG. 7, in one embodiment, step 210 specifically includes the step of generating a processing result. The step specifically includes the following steps:
步骤702,从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果。In step 702, from each preprocessing result, a preprocessing result corresponding to a pre-judgement probability that meets a preset probability condition is selected.
具体地,每个预处理子模型可以得到多个候选预处理结果以及与各候选预处理结果分别对应的候选概率。终端读取预存的预设转化概率,对于每一个预处理子模型,终端可以将候选概率大于等于预设转化概率的候选预处理结果,作为预处理子模型的预处理结果。终端统计各预处理子模型的各预处理结果分别对应的预判概率,并获取预设的概率条件,逐个将预处理结果对应的预判概率同预设概率条件相比较,筛选符合预设概率条件的预判概率所对应的预处理结果。Specifically, each pre-processing sub-model can obtain multiple candidate pre-processing results and candidate probabilities corresponding to each candidate pre-processing result. The terminal reads the pre-stored preset conversion probabilities. For each pre-processing sub-model, the terminal may use the candidate pre-processing result with a candidate probability greater than or equal to the preset conversion probability as the pre-processing result of the pre-processing sub-model. The terminal counts the pre-judgement probability corresponding to each pre-processing result of each pre-processing sub-model, and obtains the preset probability condition, compares the pre-judgement probability corresponding to the pre-processing result with the preset probability condition one by one, and selects the preset probability The preprocessing result corresponding to the conditional prediction probability.
步骤704,计算筛选到的预处理结果的不确定度。Step 704: Calculate the uncertainty of the screened preprocessing result.
其中,不确定度是对终端筛选到的预处理结果不确定性的量化评估值。不确定度越低,预处理结果的可信度越高。Among them, the uncertainty is a quantitative evaluation value of the uncertainty of the pre-processing results screened by the terminal. The lower the uncertainty, the higher the reliability of the preprocessing results.
具体地,终端可以基于贝叶斯学派(Bayesians)理论,给筛选到的预处理结果添加不确定度。终端获取预设的不确定度计算方式,根据不确定度计算方式和得到的各预处理结果或候选预处理结果,计算筛选到的预处理结果的不确定度。Specifically, the terminal may add uncertainty to the filtered preprocessing results based on the Bayesians theory. The terminal obtains a preset uncertainty calculation method, and calculates the uncertainty of the screened preprocessing result according to the uncertainty calculation method and the obtained preprocessing results or candidate preprocessing results.
步骤706,根据不确定度和筛选到的预处理结果,生成待处理数据对应的处理结果。Step 706: Generate a processing result corresponding to the data to be processed according to the uncertainty and the filtered preprocessing result.
具体地,终端将筛选到的预处理结果作为输入的待处理数据对应的处理结果,终端在显示处理结果时,同时显示与处理结果对应的不确定度。终端还可以获取筛选到的预处理结果的预判概率,同时显示处理结果、与处理结果对应的预判概率和不确定度。Specifically, the terminal uses the filtered preprocessing result as the processing result corresponding to the input to-be-processed data. When the terminal displays the processing result, it simultaneously displays the uncertainty corresponding to the processing result. The terminal can also obtain the pre-judgement probability of the pre-processed result screened, and simultaneously display the processing result, the pre-judgement probability corresponding to the processing result and the uncertainty.
在一个实施例中,终端可以基于频率学派(Frequentists)理论,不计算不确定度,输出确定的处理结果,输出筛选到的预处理结果及预判概率。In one embodiment, the terminal may be based on frequency theory (Frequentists) theory, do not calculate the uncertainty, output the determined processing result, and output the pre-filtered result and the pre-judged probability.
在一个实施例中,当计算得到的不确定度小于预设的不确定度阈值时,终端显示筛选到的预处理结果及计算得到的不确定度。In one embodiment, when the calculated uncertainty is less than the preset uncertainty threshold, the terminal displays the filtered preprocessing result and the calculated uncertainty.
终端基于贝叶斯学派理论计算不确定度,为筛选到的预处理结果的可信度提供量化评估值。当不确定度大于等于预设的不确定度阈值时,终端展示不确定度异常通知。不确定度异常通知可以提醒用户输入的待处理数据是否曾用于训练初始子模型或结构来源模型,并提醒用户补充训练数据;不确定度异常通知也可以提醒用户,预设的不确定度阈值是否合理,并提醒用户重新设定不确定度阈值。The terminal calculates the uncertainty based on the Bayesian theory, and provides a quantitative evaluation value for the credibility of the preprocessing results. When the uncertainty is greater than or equal to the preset uncertainty threshold, the terminal displays a notification of the uncertainty of abnormality. The uncertainty anomaly notification can remind the user whether the input data to be processed has been used to train the initial sub-model or structure source model, and remind the user to supplement the training data; the uncertainty anomaly notification can also remind the user, the preset uncertainty threshold Whether it is reasonable, and remind the user to reset the uncertainty threshold.
表1:Table 1:
Figure PCTCN2020070651-appb-000001
Figure PCTCN2020070651-appb-000001
举例说明,数据处理模型包括4个预处理子模型,待处理数据为图像数据,各预处理子模型得到两个候选预处理结果,候选预处理结果为识别到的动物种类。各预处理子模型的候选预处理结果可以如表格1所示,其中A、B和C分别表示三类候选预处理结果,表格中的数字为候选预处理结果的候选概率,C类预处理结果是根据预设转化概率,对A类候选预处理结果或B类候选预处理结果转换后得到的。终端将C类候选预处理结果中,候选概率为1的结果作为预处理子模型的预处理结果。比如,预处理子模型一的A类候选预处理结果中,候选预处理结果为猫的候选概率是0.9,候选预处理结果为狗的候选概率是0.1,预设转化概率可以是0.5;0.9大于0.5,故将A类候选预处理结果转换为C类候选预处理结果后,候选预处理结果为猫的候选概率是1,候选预处理结果为狗 的候选概率是0。终端将猫作为预处理子模型一的预处理结果。For example, the data processing model includes four pre-processing sub-models. The data to be processed is image data. Each pre-processing sub-model obtains two candidate pre-processing results. The candidate pre-processing results are the identified animal species. The candidate pre-processing results of each pre-processing sub-model can be shown in Table 1, where A, B and C respectively represent three types of candidate pre-processing results, the numbers in the table are the candidate probabilities of the candidate pre-processing results, and the type C pre-processing results It is obtained by converting the pre-processing result of the A-type candidate or the pre-processing result of the B-type candidate according to the preset conversion probability. The terminal takes the result of the candidate preprocessing result of class C as the preprocessing result of the preprocessing sub-model. For example, in the type A candidate preprocessing result of the preprocessing submodel 1, the candidate probability of the candidate preprocessing result is cat, the candidate probability of the candidate preprocessing result is dog is 0.1, and the preset conversion probability may be 0.5; 0.9 is greater than 0.5, so after converting the A-type candidate pre-processing result to the C-type candidate pre-processing result, the candidate probability for the candidate pre-processing result is cat, and the candidate probability for the dog pre-processing result is 0. The terminal uses the cat as the pre-processing result of the pre-processing sub-model 1.
假设各预处理子模型在两次使用中得到的候选预处理结果及候选概率,分别为表格1中A类候选预处理结果和B类候选预处理结果,则筛选到的预处理结果为猫,且预判概率为75%。假设终端计算得到A类候选预处理结果中,不确定度为0.2;B类候选预处理结果中,不确定度为0.1。对于A类候选预处理结果或B类候选预处理结果,在基于频率学派理论时,终端输出确定的处理结果,输出“筛选到的预处理结果为猫且预判概率为75%”。在基于贝叶斯学派理论时,对于A类候选预处理结果,终端输出“筛选到的预处理结果为猫,预判概率为75%且不确定度为0.2”;对于B类候选预处理结果,终端输出“筛选到的预处理结果为猫,预判概率为75%且不确定度为0.1”。在基于频率学派理论时,A类候选预处理结果和B类候选预处理结果相同;在基于贝叶斯学派理论时,B类候选预处理结果的确定度大于A类候选预处理结果的确定度。Assuming that the candidate pre-processing results and candidate probabilities obtained by each pre-processing sub-model in two uses are the A-type candidate pre-processing results and the B-type candidate pre-processing results in Table 1, the selected pre-processing results are cats, And the prediction probability is 75%. Assume that the terminal calculates the uncertainty in the class A candidate preprocessing result is 0.2; in the class B candidate preprocessing result, the uncertainty is 0.1. For the class A candidate preprocessing result or the class B candidate preprocessing result, when based on the frequency school theory, the terminal outputs the determined processing result, and outputs "the preprocessed result selected is a cat and the prediction probability is 75%". Based on the Bayesian theory, for the A-type candidate pre-processing results, the terminal outputs "screened pre-processing results are cats, the prediction probability is 75% and the uncertainty is 0.2"; for the B-type candidate pre-processing results ,The terminal outputs “The pre-screened result is cat, the prediction probability is 75% and the uncertainty is 0.1”. When based on frequency school theory, the results of class A candidate preprocessing are the same as those of class B candidate; when based on Bayesian theory, the certainty of class B candidate preprocessing results is greater than the certainty of class A candidate preprocessing results .
在一个实施例中,终端基于贝叶斯学派理论计算不确定度时,可以先将A类候选预处理结果或B类候选预处理结果转化为C类候选预处理结果,并根据公式1计算不确定度:In one embodiment, when the terminal calculates the uncertainty based on the Bayesian theory, it can first convert the A-type candidate pre-processing results or the B-type candidate pre-processing results into the C-type candidate pre-processing results, and calculate Certainty:
不确定度=1-出现次数最多的预处理结果的出现次数/预处理结果总数(1)Uncertainty = 1-the number of occurrences of the pre-processing results with the most occurrences / total number of pre-processing results
比如,终端将A类候选预处理结果转化为C类候选预处理结果后,各预处理子模型的预处理结果分别为“猫、猫、猫、狗”,出现次数最多的预处理结果为猫,出现次数为3,预处理结果总数为4。根据公式1,不确定度为:1-3/4=0.25。通过公式1计算不确定度时,最小不确定度为0,最大不确定度为0.5。For example, after the terminal converts the A-type candidate pre-processing results into C-type candidate pre-processing results, the pre-processing results of each pre-processing sub-model are "cat, cat, cat, dog" respectively, and the most frequent pre-processing results are cats , The number of occurrences is 3, and the total number of preprocessing results is 4. According to formula 1, the uncertainty is: 1-3/4=0.25. When calculating the uncertainty through formula 1, the minimum uncertainty is 0 and the maximum uncertainty is 0.5.
在一个实施例中,终端根据公式2计算不确定度:In one embodiment, the terminal calculates the uncertainty according to Equation 2:
不确定度=最终不确定度-各预处理子模型不确定度平均值(2)Uncertainty = final uncertainty-the average value of the uncertainty of each pre-processing sub-model (2)
其中,终端通过公式3计算各预处理子模型不确定度平均值:Among them, the terminal calculates the average value of the uncertainty of each pre-processing sub-model through Equation 3:
各预处理子模型不确定度平均值=各预处理子模型不确定度之和/预处理子模型总数(3)The average value of the uncertainty of each pre-processing sub-model = the sum of the uncertainty of each pre-processing sub-model / the total number of pre-processing sub-models (3)
其中,终端通过公式4计算各预处理子模型不确定度H:Among them, the terminal calculates the uncertainty H of each pre-processing sub-model through Equation 4:
Figure PCTCN2020070651-appb-000002
Figure PCTCN2020070651-appb-000002
其中,p i为预处理子模型中候选预处理结果的候选概率,i为正整数,是预 处理子模型的候选预处理结果的数量。举例说明,一个预处理子模型的候选预处理结果包括猫和狗,其中猫的候选概率为0.5,狗的候选概率为0.5,则该预处理子模型的不确定度h=-[0.5log(0.5)+0.5log(0.5)=1。当猫的候选概率为0、狗的候选概率为1时,不确定度为h=-[0×log0+1×log1=0。不确定度取值范围是[0,1]。终端分别得到各预处理子模型的不确定度后,将各预处理子模型的不确定度相加得到加数和,计算加数和与预处理子模型数量的比值,将得到的比值作为各预处理子模型不确定度的平均值。 Where, p i is the candidate probability of the candidate pre-processing result in the pre-processing sub-model, and i is a positive integer, which is the number of candidate pre-processing results of the pre-processing sub-model. For example, the candidate pre-processing results of a pre-processing sub-model include cats and dogs, where the cat candidate probability is 0.5 and the dog candidate probability is 0.5, then the uncertainty of the pre-processing sub-model h=-[0.5log( 0.5)+0.5log(0.5)=1. When the cat's candidate probability is 0 and the dog's candidate probability is 1, the uncertainty is h=-[0×log0+1×log1=0. The range of uncertainty is [0,1]. After obtaining the uncertainty of each pre-processing sub-model separately, the terminal adds the uncertainty of each pre-processing sub-model to obtain the addend sum, calculates the ratio of the add-on sum to the number of pre-processing sub-models, and uses the obtained ratio as each The average value of the uncertainty of the pre-processing submodel.
终端计算最终不确定度时,根据各预处理子模型,计算各类候选预处理结果的候选概率的简单算术平均数
Figure PCTCN2020070651-appb-000003
i为正整数,是预处理子模型的候选预处理结果的数量,再根据公式5计算最终不确定度:
When calculating the final uncertainty, the terminal calculates the simple arithmetic average of the candidate probabilities of various candidate preprocessing results according to each preprocessing submodel
Figure PCTCN2020070651-appb-000003
i is a positive integer, which is the number of candidate pre-processing results of the pre-processing sub-model, and then the final uncertainty is calculated according to Equation 5:
Figure PCTCN2020070651-appb-000004
Figure PCTCN2020070651-appb-000004
举例说明,表格1的A类候选预处理结果中,各预处理子模型中,候选预处理结果猫的候选概率分别是0.9、0.8、0.7和0.1,简单算术平均数
Figure PCTCN2020070651-appb-000005
候选预处理结果为狗的候选概率分别是0.1、0.2、0.3和0.9,简单算术平均数
Figure PCTCN2020070651-appb-000006
最终不确定度H=-(0.625×log0.625+0.375×log0.375)=0.63。
For example, in the candidate pre-processing results of Class A in Table 1, in each pre-processing sub-model, the candidate probabilities of the candidate pre-processing results are 0.9, 0.8, 0.7, and 0.1, respectively, and the simple arithmetic average
Figure PCTCN2020070651-appb-000005
The candidate preprocessing result is that the dog's candidate probabilities are 0.1, 0.2, 0.3, and 0.9, respectively, and the simple arithmetic average
Figure PCTCN2020070651-appb-000006
The final uncertainty H=-(0.625×log0.625+0.375×log0.375)=0.63.
终端根据第一模型、第二模型和第三模型构建得到的数据处理模型中,通过第三模型得到的数据处理模型,即从训练好的结构来源模型中抽取多个不同的模型子结构的方式,可以更好地适用于基于贝叶斯学派理论得到不确定度。The data processing model obtained by the third model from the data processing model constructed by the terminal according to the first model, the second model, and the third model, that is, a method of extracting multiple different model substructures from the trained structure source model , Can be better applied to the uncertainty based on Bayesian theory.
本实施例中,从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果,再计算筛选到的预处理结果的不确定度,不确定度反映了筛选到的预处理结果的可信程度;在根据预处理结果,生成待处理数据对应的处理结果时,添加进不确定度,提高了模型输出的处理结果的准确性。In this embodiment, from each preprocessing result, the preprocessing result corresponding to the pre-judgement probability that meets the preset probability condition is selected, and then the uncertainty of the pre-processing result screened is calculated, and the uncertainty reflects the screening to The credibility of the pre-processing results of the system; when generating the processing results corresponding to the data to be processed according to the pre-processing results, the uncertainty is added to improve the accuracy of the processing results output by the model.
如图8所示,在一个实施例中,步骤208之后还包括生成处理异常通知的步骤,该步骤具体包括如下步骤:As shown in FIG. 8, in one embodiment, after step 208, a step of generating an exception notification is included. This step specifically includes the following steps:
步骤802,获取待处理数据。Step 802: Obtain data to be processed.
步骤804,将待处理数据输入数据处理模型。Step 804: Input the data to be processed into the data processing model.
步骤806,获取数据处理模型中各预处理子模型分别输出的预处理结果。Step 806: Obtain the preprocessing results respectively output by each preprocessing sub-model in the data processing model.
步骤808,统计各预处理结果分别对应的预判概率。In step 808, the pre-judge probability corresponding to each pre-processing result is counted.
步骤810,当未根据各预判概率生成与待处理数据对应的处理结果时,根据各预处理结果生成处理异常通知。Step 810: When a processing result corresponding to the data to be processed is not generated according to each pre-judgment probability, a processing exception notification is generated according to each pre-processing result.
其中,处理异常通知是终端未筛选到预处理结果时生成的通知信息。Among them, the processing exception notification is notification information generated when the terminal does not filter the preprocessing result.
具体地,当终端未筛选到符合预设概率条件的预判概率所对应的预处理结果时,无法生成与待处理数据对应的处理结果,终端根据各预处理结果和各预处理结果所对应的预判概率,生成处理异常通知。Specifically, when the terminal does not filter the preprocessing result corresponding to the pre-judgement probability that meets the preset probability condition, it cannot generate the processing result corresponding to the data to be processed. The terminal according to each preprocessing result and the corresponding preprocessing result Predict the probabilities and generate notifications for handling exceptions.
步骤812,展示处理异常通知。Step 812: Display the exception notification.
具体地,终端生成处理异常通知后,将处理异常通知发送至终端的显示屏,通过显示屏展示处理异常通知。Specifically, after generating the processing exception notification, the terminal sends the processing exception notification to the display screen of the terminal, and displays the processing exception notification through the display screen.
在一个实施例中,处理异常通知可以提示用户重新输入待处理数据,以便用户输入新的待处理数据后,对新的待处理数据进行处理得到处理结果。处理异常通知还可以提示用户可以重新对模型进行训练。In one embodiment, the processing exception notification may prompt the user to re-enter the data to be processed, so that after the user inputs new data to be processed, the new data to be processed is processed to obtain a processing result. Handling exception notifications can also prompt users to retrain the model.
本实施例中,当未根据各预判概率生成与待处理数据对应的处理结果,即未筛选到预处理结果时,根据各预处理结果生成处理异常通知并展示处理异常通知,以便再次接收输入的待处理数据,提高了数据处理的可靠性。In this embodiment, when the processing result corresponding to the data to be processed is not generated according to each pre-judgment probability, that is, the preprocessing result is not filtered, a processing exception notification is generated according to each preprocessing result and the processing exception notification is displayed so as to receive the input again Data to be processed improves the reliability of data processing.
本申请提供的基于贝叶斯学派理论、对于从多个预处理子模型的预处理结果中筛选出的预处理结果计算不确定度的方法,可以应用在各种机器学习技术中,比如监督学习、半监督学习、强化学习和模仿学习等;基于本申请的各种机器学习技术可以解决各领域中,分类或回归等相关的各种问题,说明如下:The method provided by this application based on Bayesian theory and calculating the uncertainty of the preprocessing results selected from the preprocessing results of multiple preprocessing sub-models can be applied to various machine learning techniques, such as supervised learning , Semi-supervised learning, reinforcement learning and imitation learning, etc.; various machine learning techniques based on this application can solve various problems related to classification or regression in various fields, as described below:
1、监督学习1. Supervised learning
以基于监督学习的缺陷检测为例。缺陷检测可以应用在各种领域,比如加工产品的缺陷检测(比如划痕、气泡和完整度等)、AOI检测(Automated Optical Inspection,自动光学检测)等。Take defect detection based on supervised learning as an example. Defect detection can be applied in various fields, such as defect detection of processed products (such as scratches, bubbles and integrity), AOI inspection (Automated Optical Inspection, automatic optical inspection), etc.
当进行缺陷检测时,预处理结果为产品中的缺陷,当筛选到的预处理结果的不确定度数值较低时,筛选到的预处理结果的准确度较高;当通过筛选到的预处理结果判断产品无缺陷,但不确定度较高时,产品很可能存在缺陷。原因可能在于产品中的缺陷在监督学习中为进行标注,或极少出现在训练数据中; 可以在监督学习的训练数据中加入该类型的缺陷,使模型重新进行学习。原因也可能在于预设的不确定度阈值设置不合理,需要重新设定不确定度阈值。When performing defect detection, the pretreatment result is a defect in the product. When the uncertainty value of the screened pretreatment result is low, the accuracy of the screened pretreatment result is high; when the pretreatment screened As a result, it is judged that the product is not defective, but when the uncertainty is high, the product is likely to be defective. The reason may be that the defects in the product are marked for supervised learning, or rarely appear in the training data; this type of defect can be added to the supervised learning training data to make the model learn again. The reason may also be that the preset uncertainty threshold is set unreasonably, and the uncertainty threshold needs to be reset.
2、半监督学习2. Semi-supervised learning
半监督学习(Semi-Supervised Learning,SSL)是指使用一些未标注的训练数据,和已经标注的训练数据进行模型训练。Semi-Supervised Learning (SSL) refers to the use of some unlabeled training data and the already labeled training data for model training.
在训练时,输入一些有标注的训练数据对初始子模型或初始结构来源模型进行训练。训练结束后,将待处理数据输入各预处理子模型,并筛选预处理结果;筛选到的预处理结果的不确定度较高时,需要将未标注的训练数据通过人工方式进行标注,然后再训练模型。During training, input some labeled training data to train the initial sub-model or the original structure source model. After the training is completed, input the data to be processed into each pre-processing sub-model and filter the pre-processing results; when the uncertainty of the filtered pre-processing results is high, the unlabeled training data needs to be labeled manually, and then Train the model.
以处理回归问题为例,数据处理模型应用在目标物体位置或位姿识别中,用于识别目标物体的位姿。图9为一个实施例中,训练数据的示意图。具体地,参照图9,识别图9(a)中目标物体的位置或位姿的不确定度较低;图9(b)中,因物体之间存在遮挡关系,识别目标物体的位置或位姿的不确定度较高,则可能需要重新对训练数据进行标注。Taking the regression problem as an example, the data processing model is applied to target object position or pose recognition to identify the pose of the target object. FIG. 9 is a schematic diagram of training data in an embodiment. Specifically, referring to FIG. 9, the uncertainty in identifying the position or pose of the target object in FIG. 9(a) is low; in FIG. 9(b), the position or position of the target object is recognized due to the occlusion relationship between the objects The pose uncertainty is high, you may need to re-label the training data.
3、强化学习和模仿学习3. Reinforcement learning and imitation learning
在强化学习与模仿学习中,需要针对当前状态执行一个动作(行为),当前策略会为当前状态提供一个动作选项(强化学习和模仿学习需要智能体具有探索能力,比如机器人,但智能体在进行轨迹采用时不一定只能执行该动作选项),而行为值函数会针对当前状态以及该动作选项提供一个期望回报。当行为值函数利用多个模型的贝叶斯推断时,不仅为该当前状态与该当前动作选项提供一个期望回报,还会提供期望回报的不确定度;如果期望回报的不确定度低,即当前状态执行当前动作选项后,轨迹的回报期望是相对确定的,表明当前状态与当前动作选项的组合在智能体过去的学习过程中已经得到有效的探索;在强化学习或模仿学习需要智能体探索的阶段,不建议选择当前动作选项;当期望回报的不确定度高,即当前状态执行当前动作选项后的轨迹的回报期望是相对未知的,表明当前状态与当前动作选项的组合在智能体过去的学习过程中没有充分地探索,则在强化学习或模仿学习需要智能体探索的阶段,建议选择当前动作选项。In reinforcement learning and imitation learning, an action (behavior) needs to be performed for the current state, and the current strategy will provide an action option for the current state (reinforcement learning and imitation learning require agents with exploration capabilities, such as robots, but the agent is doing When the trajectory is adopted, the action option may not only be executed), and the behavior value function will provide a desired return for the current state and the action option. When the behavior value function uses Bayesian inference of multiple models, it not only provides an expected return for the current state and the current action option, but also provides the uncertainty of the expected return; if the uncertainty of the expected return is low, that is After executing the current action option in the current state, the expected return of the trajectory is relatively certain, indicating that the combination of the current state and the current action option has been effectively explored in the agent's past learning process; the agent needs to be explored in reinforcement learning or imitation learning It is not recommended to select the current action option at the stage; when the uncertainty of the expected return is high, that is, the expected return of the trajectory after the current action option is executed in the current state is relatively unknown, indicating that the combination of the current state and the current action option has passed Is not fully explored during the learning process, it is recommended to choose the current action option in the stage where reinforcement learning or imitation learning requires agent exploration.
图10为一个实施例中数据处理的示意图。具体地,参照图10,数据处理模型可以是图像识别模型,由4个预处理子模型构成。终端获取的待处理数据为图像数据,将图像数据输入到数据处理模型中的4个预处理子模型,4个预处理子模型对图像中的动物进行识别。终端获取到3个预处理子模型的预处理结果为猫,预判概率为75%,1个预处理子模型的预处理结果为狗,预判概率为25%。若预设的概率条件是预判概率大于等于75%,终端可以筛选到符合预设概率条件的预处理结果,即为猫,并将猫作为数据处理模型对图像数据的处理结果。若预设的概率条件是预判概率等于100%,则终端不能筛选到预处理结果。10 is a schematic diagram of data processing in an embodiment. Specifically, referring to FIG. 10, the data processing model may be an image recognition model, which is composed of four pre-processing sub-models. The data to be processed acquired by the terminal is image data, and the image data is input to 4 pre-processing sub-models in the data processing model, and the 4 pre-processing sub-models identify the animals in the image. The terminal obtains the preprocessing results of the three preprocessing sub-models as cats, with a pre-judgment probability of 75%, and the pre-processing results of one pre-processing sub-model as dogs, with a pre-judgment probability of 25%. If the preset probability condition is that the pre-judgment probability is greater than or equal to 75%, the terminal can filter out the preprocessing result that meets the preset probability condition, that is, a cat, and use the cat as a data processing model to process image data. If the preset probability condition is that the pre-judgment probability is equal to 100%, the terminal cannot select the pre-processing result.
应该理解的是,虽然图2-5和7-8的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2-5和7-8中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of FIGS. 2-5 and 7-8 are sequentially displayed according to the arrows, the steps are not necessarily executed in the order indicated by the arrows. Unless clearly stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least some of the steps in FIGS. 2-5 and 7-8 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. The execution order of the sub-steps or stages is not necessarily sequential, but may be executed in turn or alternately with at least a part of other steps or sub-steps or stages of other steps.
在一个实施例中,如图11所示,提供了一种数据处理装置1100,包括:数据处理模块1102、数据输入模块1104、结果获取模块1106、概率统计模块1108和结果生成模块1110其中:In one embodiment, as shown in FIG. 11, a data processing apparatus 1100 is provided, including: a data processing module 1102, a data input module 1104, a result acquisition module 1106, a probability statistics module 1108, and a result generation module 1110 wherein:
数据获取模块1102,用于获取待处理数据。The data obtaining module 1102 is used to obtain data to be processed.
数据输入模块1104,用于将待处理数据输入数据处理模型。The data input module 1104 is used to input the data to be processed into the data processing model.
结果获取模块1106,用于获取数据处理模型中各预处理子模型分别输出的预处理结果。The result obtaining module 1106 is used to obtain the preprocessing results respectively output by each preprocessing sub-model in the data processing model.
概率统计模块1108,用于统计各预处理结果分别对应的预判概率。The probability statistics module 1108 is used to count the pre-judgement probability corresponding to each pre-processing result.
结果生成模块1110,用于根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果。The result generating module 1110 is configured to generate processing results corresponding to the data to be processed according to the pre-judgement probabilities corresponding to the respective pre-processing results.
本实施例中,通过获取待处理数据,将待处理数据输入数据处理模型中的 多个预处理子模型,由多个预处理子模型同时对待处理数据进行处理;获取各预处理子模型分别输出的预处理结果,并统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,可以验证多个预处理结果的一致性,并根据各预判概率生成待处理数据对应的处理结果,提高了模型数据处理的准确性。In this embodiment, by acquiring the data to be processed, the data to be processed is input into multiple pre-processing sub-models in the data processing model, and the multiple pre-processing sub-models simultaneously process the data to be processed; each pre-processing sub-model is obtained and output separately The pre-processing results of each pre-processing result are counted, and the pre-judging probabilities corresponding to each pre-processing result are counted; according to the pre-judge probabilities corresponding to each pre-processing result, the consistency of multiple pre-processing results can be verified, and the to-be-processed is generated according to each pre-judgement probability The processing results corresponding to the data improve the accuracy of model data processing.
在一个实施例中,数据处理装置1100还包括模型训练模块,模型训练模块用于获取多个不同的初始子模型和训练数据;以训练数据对各初始子模型进行训练,得到多个预处理子模型;根据多个预处理子模型构建数据处理模型。In one embodiment, the data processing apparatus 1100 further includes a model training module, the model training module is used to obtain a plurality of different initial sub-models and training data; the training data is used to train each initial sub-model to obtain multiple pre-processors Model; build a data processing model based on multiple preprocessing submodels.
本实施例中,获取多个不同的初始子模型和训练数据,将相同的训练数据输入到多个不同的初始子模型中进行训练,得到多个预处理子模型,提高了模型训练的可靠性,根据多个预处理子模型构建数据处理模型,提高了获取数据处理模型的效率。In this embodiment, multiple different initial sub-models and training data are obtained, and the same training data is input into multiple different initial sub-models for training to obtain multiple pre-processing sub-models, which improves the reliability of model training , Based on multiple pre-processing sub-models to build data processing models, improving the efficiency of obtaining data processing models.
在另一个实施例中,模型训练模块还用于获取多个相同的初始子模型和训练数据;从训练数据中抽取与多个初始子模型一一对应的多个训练样本集;分别根据每个训练样本集训练对应的初始子模型,得到多个预处理子模型;根据多个预处理子模型构建数据处理模型。In another embodiment, the model training module is also used to obtain multiple identical initial sub-models and training data; extract multiple training sample sets corresponding to the multiple initial sub-models from the training data; The training sample set trains the corresponding initial sub-model to obtain multiple pre-processing sub-models; the data processing model is constructed according to the multiple pre-processing sub-models.
本实施例中,获取多个相同的初始子模型和训练数据,从训练数据中抽取与多个初始子模型一一对应的多个训练样本集,对于多个相同的初始子模型,采用控制变量的方法,分别输入不同的训练样本集进行训练,得到多个预处理子模型,提高了模型训练的可靠性,根据多个预处理子模型构建数据处理模型,提高了获取数据处理模型的效率。In this embodiment, multiple identical initial submodels and training data are acquired, and multiple training sample sets corresponding to the multiple initial submodels are extracted from the training data. For multiple identical initial submodels, control variables are used. The method of inputting different training sample sets for training respectively obtains multiple pre-processing sub-models, which improves the reliability of model training. Building a data processing model based on multiple pre-processing sub-models improves the efficiency of obtaining data processing models.
在一个实施例中,数据输入模块1104具体包括:结构抽取模块、模型构建模块和输入模块,其中:In one embodiment, the data input module 1104 specifically includes: a structure extraction module, a model construction module, and an input module, where:
结构抽取模块,用于从训练好的处理模型中抽取多个不同的模型子结构。The structure extraction module is used to extract multiple different model substructures from the trained processing model.
模型构建模块,用于以每个模型子结构作为数据处理模型中的预处理子模型,构建数据处理模型。The model building module is used to construct the data processing model by using each model substructure as a preprocessing submodel in the data processing model.
输入模块,用于将待处理数据分别输入数据处理模型中的各预处理子模型。The input module is used to input the data to be processed into each pre-processing sub-model in the data processing model.
本实施例中,从训练好的结构来源模型中通过抽取得到多个不同的模型子 结构,以每个模型子结构作为预处理子模型,构建数据处理模型,将待处理数据分别输入数据处理模型中的各预处理子模型。通过从结构来源模型中随机抽取模型子结构,将模型子结构作为预处理子模型,保证了选取的预处理子模型的可靠性。In this embodiment, multiple different model substructures are obtained by extracting from the trained structure source model, each model substructure is used as a preprocessing submodel, a data processing model is constructed, and data to be processed are input into the data processing model respectively Each pre-processing sub-model in. By randomly extracting the model substructure from the source model of the structure and using the model substructure as the preprocessing submodel, the reliability of the selected preprocessing submodel is guaranteed.
在一个实施例中,结果生成模块1110用于从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果;计算筛选到的预处理结果的不确定度;根据不确定度和筛选到的预处理结果,生成待处理数据对应的处理结果。In one embodiment, the result generation module 1110 is configured to filter out the pre-processing results corresponding to the pre-judgement probability that meets the preset probability condition from each pre-processing result; calculate the uncertainty of the screened pre-processing results; Uncertainty and the pre-processed results screened to generate processing results corresponding to the data to be processed.
本实施例中,从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果,再计算筛选到的预处理结果的不确定度,不确定度反映了筛选到的预处理结果的可信程度;在根据预处理结果,生成待处理数据对应的处理结果时,添加进不确定度,提高了模型输出的处理结果的准确性。In this embodiment, from each preprocessing result, the preprocessing result corresponding to the pre-judgement probability that meets the preset probability condition is selected, and then the uncertainty of the pre-processing result screened is calculated, and the uncertainty reflects the screening to The credibility of the pre-processing results of the system; when generating the processing results corresponding to the data to be processed according to the pre-processing results, the uncertainty is added to improve the accuracy of the processing results output by the model.
在一个实施例中,数据处理装置1100还包括:通知生成模块和通知展示模块,其中:In one embodiment, the data processing apparatus 1100 further includes: a notification generation module and a notification display module, where:
通知生成模块,用于当未根据各预判概率生成与待处理数据对应的处理结果时,根据各预处理结果生成处理异常通知。The notification generating module is configured to generate a processing exception notification according to each preprocessing result when the processing result corresponding to the data to be processed is not generated according to each pre-judgment probability.
通知展示模块,用于展示处理异常通知。Notification display module, used to display exception notifications.
本实施例中,当未根据各预判概率生成与待处理数据对应的处理结果,即未筛选到预处理结果时,根据各预处理结果生成处理异常通知并展示处理异常通知,以便再次接收输入的待处理数据,提高了数据处理的可靠性。In this embodiment, when the processing result corresponding to the data to be processed is not generated according to each pre-judgment probability, that is, the preprocessing result is not filtered, a processing exception notification is generated according to each preprocessing result and the processing exception notification is displayed so as to receive the input again Data to be processed improves the reliability of data processing.
关于数据处理装置的具体限定可以参见上文中对于数据处理方法的限定,在此不再赘述。上述数据处理装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific limitation of the data processing device, reference may be made to the above limitation on the data processing method, and details are not described herein again. Each module in the above data processing device may be implemented in whole or in part by software, hardware, and a combination thereof. The above modules may be embedded in the hardware form or independent of the processor in the computer device, or may be stored in the memory in the computer device in the form of software so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是终端,其内部结构图可以如图12所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口、显示屏、输入装置和图像采集装置。其中,该计算机设备 的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种数据处理方法。该计算机设备的显示屏可以是液晶显示屏或者电子墨水显示屏,该计算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。图像采集装置用于采集图像数据,采集到的图像数据将作为待处理数据。In one embodiment, a computer device is provided. The computer device may be a terminal, and an internal structure diagram thereof may be as shown in FIG. The computer equipment includes a processor, a memory, a network interface, a display screen, an input device, and an image acquisition device connected through a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and computer programs. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used to communicate with external terminals through a network connection. The computer program is executed by the processor to implement a data processing method. The display screen of the computer device may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer device may be a touch layer covered on the display screen, or may be a button, a trackball, or a touchpad provided on the computer device housing , Can also be an external keyboard, touchpad or mouse. The image collection device is used to collect image data, and the collected image data will be used as data to be processed.
本领域技术人员可以理解,图12中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 12 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. The specific computer equipment may Include more or less components than shown in the figure, or combine certain components, or have a different arrangement of components.
在一个实施例中,提供了一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行计算机程序时实现以下步骤:获取待处理数据;将待处理数据输入数据处理模型;获取数据处理模型中各预处理子模型分别输出的预处理结果;统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果。In one embodiment, a computer device is provided, including a memory, a processor, and a computer program stored on the memory and executable on the processor. When the processor executes the computer program, the following steps are implemented: acquiring data to be processed; The data to be processed is input into the data processing model; the preprocessing results respectively output by the preprocessing sub-models in the data processing model are obtained; the pre-judgement probabilities corresponding to each pre-processing result are counted; The processing result corresponding to the data to be processed.
在一个实施例中,获取待处理数据之前,处理器执行计算机程序时还实现以下步骤:获取多个不同的初始子模型和训练数据;以训练数据对各初始子模型进行训练,得到多个预处理子模型;根据多个预处理子模型构建数据处理模型。In one embodiment, before acquiring the data to be processed, the processor also implements the following steps when executing the computer program: acquiring multiple different initial sub-models and training data; training each initial sub-model with training data to obtain multiple Processing sub-models; construct data processing models based on multiple pre-processing sub-models.
在另一个实施例中,获取待处理数据之前,处理器执行计算机程序时还实现以下步骤:获取多个相同的初始子模型和训练数据;从训练数据中抽取与多个初始子模型一一对应的多个训练样本集;分别根据每个训练样本集训练对应的初始子模型,得到多个预处理子模型;根据多个预处理子模型构建数据处理 模型。In another embodiment, before acquiring the data to be processed, the processor also implements the following steps when executing the computer program: acquiring multiple identical initial sub-models and training data; extracting from the training data one-to-one correspondence with the multiple initial sub-models Multiple training sample sets; train the corresponding initial sub-model according to each training sample set respectively to obtain multiple pre-processing sub-models; construct a data processing model based on multiple pre-processing sub-models.
在一个实施例中,将待处理数据输入数据处理模型包括:从训练好的结构来源模型中抽取多个不同的模型子结构;以每个模型子结构作为数据处理模型中的预处理子模型,构建数据处理模型;将待处理数据分别输入数据处理模型中的各预处理子模型。In one embodiment, inputting the data to be processed into the data processing model includes: extracting a plurality of different model substructures from the trained structure source model; using each model substructure as a preprocessing submodel in the data processing model, Build a data processing model; input the data to be processed into each pre-processing sub-model in the data processing model.
在一个实施例中,根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果包括:从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果;计算筛选到的预处理结果的不确定度;根据不确定度和筛选到的预处理结果,生成待处理数据对应的处理结果。In one embodiment, generating the processing result corresponding to the data to be processed according to the pre-judgement probability corresponding to each pre-processing result includes: filtering out the pre-judgement corresponding to the pre-judgement probability that meets the preset probability condition from each pre-processing result Processing results; Calculate the uncertainty of the screened preprocessing results; generate processing results corresponding to the data to be processed based on the uncertainty and the screened preprocessing results.
在一个实施例中,从各预处理结果中,统计各预处理结果分别对应的预判概率之后,处理器执行计算机程序时还实现以下步骤:当未根据各预判概率生成与待处理数据对应的处理结果时,根据各预处理结果生成处理异常通知;展示处理异常通知。In one embodiment, after pre-judgement probabilities corresponding to each pre-treatment result are counted from each pre-treatment result, the processor also implements the following steps when the computer program is executed: when the pre-judgement probability is not generated according to each pre-judgement probability When the processing result is processed, a processing exception notification is generated according to each preprocessing result; the processing exception notification is displayed.
本实施例中,通过获取待处理数据,将待处理数据输入数据处理模型中的多个预处理子模型,由多个预处理子模型同时对待处理数据进行处理;获取各预处理子模型分别输出的预处理结果,并统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,可以验证多个预处理结果的一致性,并根据各预判概率生成待处理数据对应的处理结果,提高了模型数据处理的准确性。In this embodiment, by acquiring the data to be processed, the data to be processed is input into multiple pre-processing sub-models in the data processing model, and the multiple pre-processing sub-models simultaneously process the data to be processed; each pre-processing sub-model is obtained and output separately The pre-processing results of each pre-processing result are counted, and the pre-judging probabilities corresponding to each pre-processing result are counted; according to the pre-judge probabilities corresponding to each pre-processing result, the consistency of multiple pre-processing results can be verified, and the to-be-processed is generated according to each pre-judgement probability The processing results corresponding to the data improve the accuracy of model data processing.
在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现以下步骤:获取待处理数据;将待处理数据输入数据处理模型;获取数据处理模型中各预处理子模型分别输出的预处理结果;统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果。在一个实施例中,获取待处理数据之前,计算机程序被处理器执行时还实现以下步骤:获取多个不同的初始子模型和训练数据;以训练数据对各初始子模型进行训练,得到多个预处理子模型;根据多个预处理子模型构建数据处理模型。In one embodiment, a computer-readable storage medium is provided on which a computer program is stored. When the computer program is executed by a processor, the following steps are achieved: acquiring data to be processed; inputting the data to be processed into a data processing model; acquiring data The pre-processing results output by each pre-processing sub-model in the processing model; the pre-judgment probability corresponding to each pre-processing result is counted; according to the pre-judge probability corresponding to each pre-processing result, the processing result corresponding to the data to be processed is generated. In one embodiment, before acquiring the data to be processed, the computer program is executed by the processor to implement the following steps: acquiring multiple different initial sub-models and training data; training each initial sub-model with training data to obtain multiple Pre-processing sub-model; build a data processing model based on multiple pre-processing sub-models.
在另一个实施例中,获取待处理数据之前,计算机程序被处理器执行时还实现以下步骤:获取多个相同的初始子模型和训练数据;从训练数据中抽取与多个初始子模型一一对应的多个训练样本集;分别根据每个训练样本集训练对应的初始子模型,得到多个预处理子模型;根据多个预处理子模型构建数据处理模型。In another embodiment, before acquiring the data to be processed, the computer program is executed by the processor to implement the following steps: acquiring multiple identical initial sub-models and training data; extracting from the training data one by one with the multiple initial sub-models Corresponding multiple training sample sets; train the corresponding initial sub-model according to each training sample set respectively to obtain multiple pre-processing sub-models; construct a data processing model according to multiple pre-processing sub-models.
在一个实施例中,将待处理数据输入数据处理模型包括:从训练好的结构来源模型中抽取多个不同的模型子结构;以每个模型子结构作为数据处理模型中的预处理子模型,构建数据处理模型;将待处理数据分别输入数据处理模型中的各预处理子模型。在一个实施例中,根据各预处理结果分别对应的预判概率,生成待处理数据对应的处理结果包括:从各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预处理结果;计算筛选到的预处理结果的不确定度;根据不确定度和筛选到的预处理结果,生成待处理数据对应的处理结果。In one embodiment, inputting the data to be processed into the data processing model includes: extracting multiple different model substructures from the trained structure source model; using each model substructure as a preprocessing submodel in the data processing model, Build a data processing model; input the data to be processed into each pre-processing sub-model in the data processing model. In one embodiment, generating the processing result corresponding to the data to be processed according to the pre-judgement probability corresponding to each pre-processing result includes: filtering out the pre-judgement corresponding to the pre-judgement probability that meets the preset probability condition from each pre-processing result Processing results; Calculate the uncertainty of the screened preprocessing results; generate processing results corresponding to the data to be processed based on the uncertainty and the screened preprocessing results.
在一个实施例中,从各预处理结果中,统计各预处理结果分别对应的预判概率之后,计算机程序被处理器执行时还实现以下步骤:当未根据各预判概率生成与待处理数据对应的处理结果时,根据各预处理结果生成处理异常通知;展示处理异常通知。In one embodiment, after pre-judgement probabilities corresponding to each pre-treatment result are counted from each pre-treatment result, when the computer program is executed by the processor, the following steps are also realized: when the data to be processed is not generated according to each pre-judgement probability For the corresponding processing result, generate a processing exception notification according to each preprocessing result; display the processing exception notification.
本实施例中,通过获取待处理数据,将待处理数据输入数据处理模型中的多个预处理子模型,由多个预处理子模型同时对待处理数据进行处理;获取各预处理子模型分别输出的预处理结果,并统计各预处理结果分别对应的预判概率;根据各预处理结果分别对应的预判概率,可以验证多个预处理结果的一致性,并根据各预判概率生成待处理数据对应的处理结果,提高了模型数据处理的准确性。In this embodiment, by acquiring the data to be processed, the data to be processed is input into multiple pre-processing sub-models in the data processing model, and the multiple pre-processing sub-models simultaneously process the data to be processed; each pre-processing sub-model is obtained and output separately The pre-processing results of each pre-processing result are counted, and the pre-judging probabilities corresponding to each pre-processing result are counted; according to the pre-judge probabilities corresponding to each pre-processing result, the consistency of multiple pre-processing results can be verified, and the to-be-processed is generated according to each pre-judgement probability The processing results corresponding to the data improve the accuracy of model data processing.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM (EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art may understand that all or part of the processes in the method of the above embodiments may be completed by instructing relevant hardware through a computer program, and the computer program may be stored in a non-volatile computer readable storage In the medium, when the computer program is executed, the process of the foregoing method embodiments may be included. Wherein, any reference to the memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be arbitrarily combined. In order to simplify the description, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered as the scope described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation manners of the present application, and their descriptions are more specific and detailed, but they should not be construed as limiting the scope of the invention patent. It should be noted that, for those of ordinary skill in the art, without departing from the concept of the present application, a number of modifications and improvements can also be made, which all fall within the protection scope of the present application. Therefore, the protection scope of the patent of this application shall be subject to the appended claims.

Claims (10)

  1. 一种数据处理方法,所述方法包括:A data processing method, the method includes:
    获取待处理数据;Obtain pending data;
    将所述待处理数据输入数据处理模型;Input the data to be processed into a data processing model;
    获取所述数据处理模型中各预处理子模型分别输出的预处理结果;Obtaining preprocessing results respectively output by each preprocessing sub-model in the data processing model;
    统计各预处理结果分别对应的预判概率;Count the pre-judgment probability corresponding to each pre-processing result;
    根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The processing results corresponding to the data to be processed are generated according to the pre-judgment probabilities corresponding to the respective preprocessing results.
  2. 根据权利要求1所述的方法,其特征在于,所述获取待处理数据之前,还包括:The method according to claim 1, wherein before acquiring the data to be processed, the method further comprises:
    获取多个不同的初始子模型和训练数据;Obtain multiple different initial sub-models and training data;
    以所述训练数据对各初始子模型进行训练,得到多个预处理子模型;Training each initial sub-model with the training data to obtain multiple pre-processing sub-models;
    根据所述多个预处理子模型构建数据处理模型。Construct a data processing model according to the multiple preprocessing sub-models.
  3. 根据权利要求1所述的方法,其特征在于,所述获取待处理数据之前,还包括:The method according to claim 1, wherein before acquiring the data to be processed, the method further comprises:
    获取多个相同的初始子模型和训练数据;Obtain multiple identical initial submodels and training data;
    从所述训练数据中抽取与多个初始子模型一一对应的多个训练样本集;Extract multiple training sample sets corresponding to multiple initial submodels from the training data;
    分别根据每个训练样本集训练对应的初始子模型,得到多个预处理子模型;Train the corresponding initial sub-model according to each training sample set respectively to obtain multiple pre-processing sub-models;
    根据所述多个预处理子模型构建数据处理模型。Construct a data processing model according to the multiple preprocessing sub-models.
  4. 根据权利要求1所述的方法,其特征在于,所述将所述待处理数据输入数据处理模型包括:The method according to claim 1, wherein the inputting the data to be processed into a data processing model comprises:
    从训练好的结构来源模型中抽取多个不同的模型子结构;Extract multiple different model substructures from the trained structure source model;
    以每个模型子结构作为数据处理模型中的预处理子模型,构建所述数据处理模型;Use each model substructure as a preprocessing submodel in the data processing model to construct the data processing model;
    将所述待处理数据分别输入所述数据处理模型中的各预处理子模型。The data to be processed is input into each pre-processing sub-model in the data processing model.
  5. 根据权利要求1所述的方法,其特征在于,所述根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果包括:The method according to claim 1, wherein the generating a processing result corresponding to the data to be processed according to the pre-judgement probabilities corresponding to the respective preprocessing results includes:
    从所述各预处理结果中,筛选出符合预设概率条件的预判概率所对应的预 处理结果;From the pre-processing results, select the pre-processing results corresponding to the pre-judgement probability that meets the preset probability conditions;
    计算筛选到的预处理结果的不确定度;Calculate the uncertainty of the pre-processing results screened;
    根据所述不确定度和所述筛选到的预处理结果,生成所述待处理数据对应的处理结果。Generate a processing result corresponding to the data to be processed according to the uncertainty and the filtered preprocessing result.
  6. 根据权利要求1所述的方法,其特征在于,所述统计各预处理结果分别对应的预判概率之后,还包括:The method according to claim 1, wherein after counting the pre-judgement probability corresponding to each pre-processing result, the method further comprises:
    当未根据各预判概率生成与所述待处理数据对应的处理结果时,根据所述各预处理结果生成处理异常通知;When a processing result corresponding to the data to be processed is not generated according to each pre-judgment probability, a processing exception notification is generated according to each pre-processing result;
    展示所述处理异常通知。Demonstrate the handling exception notification.
  7. 一种数据处理装置,其特征在于,所述装置包括:A data processing device, characterized in that the device includes:
    数据获取模块,用于获取待处理数据;Data acquisition module for acquiring data to be processed;
    数据输入模块,用于将所述待处理数据输入数据处理模型;A data input module for inputting the data to be processed into a data processing model;
    结果获取模块,用于获取所述数据处理模型中各预处理子模型分别输出的预处理结果;A result obtaining module, configured to obtain the preprocessing results respectively output by each preprocessing sub-model in the data processing model;
    概率统计模块,用于统计各预处理结果分别对应的预判概率;Probability statistics module, used to count the pre-judgment probability corresponding to each pre-processing result;
    结果生成模块,用于根据所述各预处理结果分别对应的预判概率,生成所述待处理数据对应的处理结果。The result generation module is configured to generate a processing result corresponding to the data to be processed according to the pre-judgement probabilities corresponding to the respective pre-processing results.
  8. 根据权利要求6所述的装置,其特征在于,所述数据输入模块包括:The device according to claim 6, wherein the data input module comprises:
    结构抽取模块,用于从训练好的处理模型中抽取多个不同的模型子结构;Structure extraction module, used to extract multiple different model substructures from the trained processing model;
    模型构建模块,用于以每个模型子结构作为数据处理模型中的预处理子模型,构建所述数据处理模型;The model building module is used to construct each data processing model by using each model sub-structure as a pre-processing sub-model in the data processing model;
    输入模块,用于将所述待处理数据分别输入所述数据处理模型中的各预处理子模型。The input module is used to input the data to be processed into each pre-processing sub-model in the data processing model.
  9. 一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至6中任一项所述方法的步骤。A computer device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that, when the processor executes the computer program, any one of claims 1 to 6 is realized The steps of the method.
  10. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至6中任一项所述的方法的步骤。A computer-readable storage medium on which a computer program is stored, characterized in that when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 6 are realized.
PCT/CN2020/070651 2019-01-07 2020-01-07 Data processing method and apparatus, computer device, and storage medium WO2020143610A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910012742.9 2019-01-07
CN201910012742.9A CN109886415A (en) 2019-01-07 2019-01-07 Data processing method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2020143610A1 true WO2020143610A1 (en) 2020-07-16

Family

ID=66925695

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/070651 WO2020143610A1 (en) 2019-01-07 2020-01-07 Data processing method and apparatus, computer device, and storage medium

Country Status (2)

Country Link
CN (1) CN109886415A (en)
WO (1) WO2020143610A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109886415A (en) * 2019-01-07 2019-06-14 鲁班嫡系机器人(深圳)有限公司 Data processing method, device, computer equipment and storage medium
CN111079175B (en) * 2019-11-26 2022-05-03 微民保险代理有限公司 Data processing method, data processing device, computer readable storage medium and computer equipment
CN113051069B (en) * 2019-12-28 2023-12-08 华为技术有限公司 Data analysis method and device based on multitasking and terminal equipment
CN113592059A (en) * 2020-04-30 2021-11-02 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for processing data
CN111984414B (en) * 2020-08-21 2022-05-24 苏州浪潮智能科技有限公司 Data processing method, system, equipment and readable storage medium
CN112272362B (en) * 2020-09-11 2022-06-10 安徽中科新辰技术有限公司 Method for realizing message notification sending service

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107463966A (en) * 2017-08-17 2017-12-12 电子科技大学 Radar range profile's target identification method based on dual-depth neutral net
CN107679491A (en) * 2017-09-29 2018-02-09 华中师范大学 A kind of 3D convolutional neural networks sign Language Recognition Methods for merging multi-modal data
CN108665457A (en) * 2018-05-16 2018-10-16 腾讯科技(深圳)有限公司 Image-recognizing method, device, storage medium and computer equipment
US20180314944A1 (en) * 2016-10-14 2018-11-01 Tencent Technology (Shenzhen) Company Limited Data processing method and apparatus
CN109886415A (en) * 2019-01-07 2019-06-14 鲁班嫡系机器人(深圳)有限公司 Data processing method, device, computer equipment and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170262766A1 (en) * 2016-03-08 2017-09-14 Linkedin Corporation Variable grouping for entity analysis
CN107193836B (en) * 2016-03-15 2021-08-10 腾讯科技(深圳)有限公司 Identification method and device
CN107527065B (en) * 2017-07-25 2021-02-05 北京联合大学 Flower variety identification model establishment method based on convolutional neural network
CN107480774A (en) * 2017-08-11 2017-12-15 山东师范大学 Dynamic neural network model training method and device based on integrated study
CN109146076A (en) * 2018-08-13 2019-01-04 东软集团股份有限公司 model generating method and device, data processing method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180314944A1 (en) * 2016-10-14 2018-11-01 Tencent Technology (Shenzhen) Company Limited Data processing method and apparatus
CN107463966A (en) * 2017-08-17 2017-12-12 电子科技大学 Radar range profile's target identification method based on dual-depth neutral net
CN107679491A (en) * 2017-09-29 2018-02-09 华中师范大学 A kind of 3D convolutional neural networks sign Language Recognition Methods for merging multi-modal data
CN108665457A (en) * 2018-05-16 2018-10-16 腾讯科技(深圳)有限公司 Image-recognizing method, device, storage medium and computer equipment
CN109886415A (en) * 2019-01-07 2019-06-14 鲁班嫡系机器人(深圳)有限公司 Data processing method, device, computer equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
LIU, BOYUN ET AL.: "Diesel Engine Condition Assessment Based on Neural Network and D-S Evidence Theory", VEHICLE ENGINE, no. 5, 31 October 2005 (2005-10-31), pages 14 - 17, XP009521908, ISSN: 1001-2222 *

Also Published As

Publication number Publication date
CN109886415A (en) 2019-06-14

Similar Documents

Publication Publication Date Title
WO2020143610A1 (en) Data processing method and apparatus, computer device, and storage medium
Braiek et al. On testing machine learning programs
Wang et al. Software reliability prediction using a deep learning model based on the RNN encoder–decoder
Díaz-Rodríguez et al. EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep learning representations with expert knowledge graphs: The MonuMAI cultural heritage use case
Zhang et al. Training set debugging using trusted items
US10803398B2 (en) Apparatus and method for information processing
Yang et al. An advanced deep generative framework for temporal link prediction in dynamic networks
Urban et al. Deep learning: A primer for psychologists.
US20220019870A1 (en) Verification of classification decisions in convolutional neural networks
Wu et al. Applications of deep learning for smart water networks
US10380459B2 (en) System and method for image classification
CA3225621A1 (en) Ai-augmented auditing platform including techniques for automated document processing
US20230048405A1 (en) Neural network optimization method and apparatus
WO2020034801A1 (en) Medical feature screening method and apparatus, computer device, and storage medium
RU2689818C1 (en) Method of interpreting artificial neural networks
US10311404B1 (en) Software product development defect and issue prediction and diagnosis
Dam et al. DeepSoft: A vision for a deep model of software
Ghandeharioun et al. Characterizing sources of uncertainty to proxy calibration and disambiguate annotator and data bias
Singh et al. Melford: Using neural networks to find spreadsheet errors
Huang et al. Challenges of testing machine learning applications
Mills et al. A machine learning approach for determining the validity of traceability links
Konstantinidis et al. Automating dairy production lines with the yoghurt cups recognition and detection process in the Industry 4.0 era
Schmidt et al. Probabilistic attention based on gaussian processes for deep multiple instance learning
Elkins et al. Developing an app to interpret chest X-rays to support the diagnosis of respiratory pathology with artificial intelligence
CN117372424B (en) Defect detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20738000

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 191121)

122 Ep: pct application non-entry in european phase

Ref document number: 20738000

Country of ref document: EP

Kind code of ref document: A1