WO2023024578A1 - Method and apparatus for configuring decision apparatus, and related device - Google Patents

Method and apparatus for configuring decision apparatus, and related device Download PDF

Info

Publication number
WO2023024578A1
WO2023024578A1 PCT/CN2022/091969 CN2022091969W WO2023024578A1 WO 2023024578 A1 WO2023024578 A1 WO 2023024578A1 CN 2022091969 W CN2022091969 W CN 2022091969W WO 2023024578 A1 WO2023024578 A1 WO 2023024578A1
Authority
WO
WIPO (PCT)
Prior art keywords
inference
samples
type
model
sample
Prior art date
Application number
PCT/CN2022/091969
Other languages
French (fr)
Chinese (zh)
Inventor
李治军
张庭豪
李涛
谢达奇
Original Assignee
华为云计算技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为云计算技术有限公司 filed Critical 华为云计算技术有限公司
Publication of WO2023024578A1 publication Critical patent/WO2023024578A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models

Definitions

  • multiple samples may be obtained first, and the inference results corresponding to the multiple samples may be further obtained, and the inference results corresponding to the multiple samples are respectively analyzed by the first inference model
  • Multiple samples are obtained by reasoning; then, according to the reasoning results corresponding to the multiple samples, the samples of the first type and the samples of the second type among the multiple samples can be determined.
  • the first type of samples that can be more accurately identified by the first inference model and the second type of samples that are difficult to accurately identify by the first inference model can be determined from the multiple samples.
  • the present application provides a computer device, the computer device includes a processor and a memory; the memory is used to store instructions, and when the computer device is running, the processor executes the instructions stored in the memory, so that the The computer device executes the first aspect above or the method for configuring the decision-making apparatus in any possible implementation manner of the first aspect.
  • the memory may be integrated in the processor, or independent of the processor.
  • a computer device may also include a bus. Wherein, the processor is connected to the memory through the bus.
  • the memory may include a readable memory and a random access memory.
  • inference models can be configured in both the first device set 101 and the second device set 102.
  • the reasoning model configured in the first device set 101 will be referred to as the first reasoning model below, and the configuration in The reasoning models in the second device set 102 are called the second reasoning models.
  • the specification of the first inference model is different from that of the second inference model.
  • the size of the first inference model is 400KB (kilobytes)
  • the size of the second reasoning model is 40000KB.
  • the specification of the first reasoning model is smaller than the specification of the second reasoning model as an example for illustration.
  • the first type of samples refers to samples that are difficult to be accurately inferred by the first inference model.
  • such samples may also be referred to as difficult samples corresponding to the first inference model.
  • the sample may be determined as the first type of sample (ie, a difficult sample).
  • the second type of samples refers to samples that can be inferred relatively accurately by the first reasoning model. In practical applications, such samples can also be called simple samples corresponding to the first reasoning model.
  • Figure 5 shows a computer device.
  • the computer device 500 shown in FIG. 5 can be specifically used to implement the functions of the configuration apparatus 105 in the above-mentioned embodiment shown in FIG. 3 .
  • the decision parameters are used to identify the model input samples inferred by the first inference model as being transmitted to the second inference model Sample model input for the two-device collection.
  • the embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores instructions, and when it is run on the computer equipment, the computer equipment executes the configuration device 105 of the above-mentioned embodiment. method.

Abstract

The present application provides a method for configuring a decision apparatus. A first inference result corresponding to a first type of samples and a second inference result corresponding to a second type of samples are first acquired, and the precision of inferring the first type of samples by using a first inference model is lower than the precision of inferring the second type of samples by using the first inference model; decision parameters in a decision apparatus are configured according to the first inference result and the second inference result. In this way, on the basis of the configured decision parameters, the decision apparatus generally can accurately recognize the first type of samples that is difficult to be accurately inferred by using the first inference model having a smaller specification, and transmit the first type of samples to a second device set, so as to perform inference by using the second inference model having a larger specification, so that the precision of inferring a model input sample can be maintained at a high level. In addition, the present application further provides a corresponding apparatus and a related device.

Description

一种配置决策装置的方法、装置及相关设备A method, device and related equipment for configuring a decision-making device
本申请要求于2021年8月25日递交中国国家知识产权局、申请号为202110981922.5,发明名称为“一种配置决策装置的方法、装置及相关设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application submitted to the State Intellectual Property Office of China on August 25, 2021, the application number is 202110981922.5, and the invention title is "a method, device and related equipment for configuring a decision-making device", the entire content of which Incorporated in this application by reference.
技术领域technical field
本申请涉及人工智能技术领域,尤其涉及一种配置决策装置的方法、装置及相关设备。The present application relates to the technical field of artificial intelligence, and in particular to a method, device and related equipment for configuring a decision-making device.
背景技术Background technique
在人工智能(artificial intelligence,AI)领域中,机器学习技术作为AI领域一种重要的方法和手段,旨在通过机器学习算法对训练数据集进行规律分析得到模型,并利用该模型持续对未知的样本数据进行推理。In the field of artificial intelligence (AI), machine learning technology is an important method and means in the field of AI. Sample data for inference.
目前,可以根据部署推理模型的环境的资源量设置两级的推理机制。比如,在边云协同的推理场景中,可以在边缘侧以及云端分别设置不同规格的推理模型,并且,由于边缘侧的计算资源通常少于云端的计算资源,因此,部署于在边缘侧的推理模型可以是通过对云端的推理模型进行压缩所得到的规格较小的模型。相应的,针对相同的模型输入样本,云端的推理模型对于该模型输入样本的推理效果(如推理精度、效率等),通常优于边缘侧的推理模型对于该模型输入样本的推理效果。同时,在边缘侧部署决策装置,该决策装置可以在边缘侧的推理模型难以对模型输入样本进行有效性推理时(如推理结果的置信度较低等),确定将该模型输入样本发送至云端,以便利用云端的规格更大的推理模型对该模型输入样本进行推理,以此提高最终得到的推理结果的精度。At present, a two-level inference mechanism can be set according to the amount of resources in the environment where the inference model is deployed. For example, in an edge-cloud collaborative inference scenario, inference models of different specifications can be set on the edge side and on the cloud, and since the computing resources on the edge side are usually less than those on the cloud, the reasoning models deployed on the edge side The model may be a model with a smaller size obtained by compressing the inference model in the cloud. Correspondingly, for the same model input sample, the inference effect (such as inference accuracy, efficiency, etc.) of the inference model on the cloud for the input sample of the model is usually better than that of the inference model on the edge side for the input sample of the model. At the same time, a decision-making device is deployed on the edge side, and the decision-making device can determine to send the model input sample to the cloud when the reasoning model on the edge side is difficult to infer the validity of the model input sample (such as the confidence of the reasoning result is low, etc.). , in order to use the inference model with a larger specification in the cloud to infer the input samples of the model, so as to improve the accuracy of the final inference result.
但是,实际应用时,推理系统的精度可能难以保持在较高的水平,比如,在部分时间段内,推理系统针对模型输入样本所确定的推理结果的准确度较低等。因此,目前亟需一种推理方案,以使得推理系统推理模型输入样本的精度保持在较高水平。However, in practical applications, it may be difficult to maintain the accuracy of the inference system at a high level. For example, in a certain period of time, the accuracy of the inference results determined by the inference system for the model input samples is low. Therefore, there is an urgent need for an inference scheme so that the accuracy of the inference system inference model input samples can be kept at a high level.
发明内容Contents of the invention
本申请提供了一种配置决策装置的方法,用于使得推理系统针对模型输入样本的推理准确度保持在较高的水平。此外,本申请还提供了一种配置决策装置的装置、计算机设备、计算机可读存储介质以及计算机程序产品。The present application provides a method for configuring a decision-making device, which is used to keep the reasoning accuracy of the reasoning system at a high level for model input samples. In addition, the present application also provides a device for configuring a decision-making device, a computer device, a computer-readable storage medium, and a computer program product.
第一方面,本申请提供了一种配置决策装置的方法,该方法应用于包括第一设备集合、第二设备集合以及决策装置的推理系统,其中,第一设备集合以及第二设备集合均包括至少一个计算设备,并且,第一设备集合中的第一推理模型的规格小于第二设备集合中的第二推理模型的规格。在执行该方法时,先获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,并且第一推理模型推理第一类样本的精度低于第一推理模型推理第二类样本的精度,实际应用时,该第一类型样本可以称之为难例样本,第二类样本可以称之为简单例样本;并根据第一推理结果以及第二推理结果,配置决策装置中的决策参数,该决策参数用于将第一推理模型推理的模型输入样本识别为传输给第二设备集合的模型输入样本。In a first aspect, the present application provides a method for configuring a decision-making device, the method is applied to an inference system including a first device set, a second device set, and a decision-making device, wherein the first device set and the second device set both include At least one computing device, and the size of the first inference model in the first set of devices is smaller than the size of the second inference model in the second set of devices. When executing the method, the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample are obtained first, and the accuracy of the first inference model inference of the first type of sample is lower than that of the first inference model inference of the first inference result. The accuracy of the second type of samples, in actual application, the first type of samples can be called difficult samples, and the second type of samples can be called simple samples; and according to the first reasoning result and the second reasoning result, configure the decision-making device A decision parameter for identifying a model input sample inferred by the first inference model as a model input sample transmitted to the second set of devices.
由于决策装置的决策参数是通过第一推理模型能够较为准确推理的第一类样本的推理 结果以及第一推理模型难以准确推理的第二类样本的推理结果进行配置,因此,决策装置基于该决策参数通常能够准确识别出第一推理模型难以准确推理的第二类样本。这样,推理系统可以利用规格更大的第二推理模型对该类样本进行推理,以此可以使得推理系统推理模型输入样本的精度能够保持在较高水平。Since the decision parameters of the decision-making device are configured through the inference results of the first type of samples that the first inference model can infer accurately and the inference results of the second type of samples that the first inference model is difficult to infer accurately, the decision-making device is based on the decision The parameters are usually able to accurately identify the second class of samples that the first inference model has difficulty inferring accurately. In this way, the inference system can use the second inference model with a larger specification to infer this type of sample, so that the accuracy of the inference model input samples of the inference system can be maintained at a relatively high level.
在一种可能的实施方式中,在获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果时,具体可以是指示第一设备集合利用第一推理模型分别对第一类样本以及第二类样本进行推理,得到第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果。如此,可以通过第一推理模型分别对两类样本进行推理,得到两类样本对应的推理结果,以便后续基于该推理结果对决策装置中的决策参数进行配置。In a possible implementation manner, when obtaining the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, it may specifically instruct the first device set to use the first inference model to respectively The samples of one type and the samples of the second type are inferred to obtain a first inference result corresponding to the sample of the first type and a second inference result corresponding to the sample of the second type. In this way, the first inference model can be used to infer the two types of samples respectively to obtain the inference results corresponding to the two types of samples, so that the decision parameters in the decision-making device can be configured subsequently based on the inference results.
在一种可能的实施方式中,获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果时,具体可以是指示第一设备集合利用所述第一推理模型对第一类样本进行推理,得到第一类样本对应的第一推理结果;同时,还指示第二设备集合利用第二推理模型对第二类样本进行推理,得到第二类样本对应的第二推理结果。如此,可以通过第一推理模型以及第二推理模型,得到两类样本分别对应的推理结果,以便后续基于该推理结果对决策装置中的决策参数进行配置。In a possible implementation manner, when obtaining the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, it may specifically instruct the first device set to use the first inference model to Perform inference on one type of sample to obtain the first inference result corresponding to the first type of sample; at the same time, instruct the second device set to use the second inference model to perform inference on the second type of sample to obtain the second inference result corresponding to the second type of sample . In this way, the inference results corresponding to the two types of samples can be obtained through the first inference model and the second inference model, so as to subsequently configure the decision parameters in the decision-making device based on the inference results.
在一种可能的实施方式中,在获取推理结果之前,还可以先获取多个样本,并进一步获取该多个样本对应的推理结果,该多个样本对应的推理结果通过第一推理模型分别对多个样本进行推理得到;然后,根据该多个样本对应的推理结果,可以确定多个样本中的第一类样本以及第二类样本。如此,可以根据第一推理模型对多个样本的推理结果,从多个样本中确定出第一推理模型能够更准确识别的第一类样本以及第一推理模型难以准确识别的第二类样本。In a possible implementation manner, before obtaining the inference results, multiple samples may be obtained first, and the inference results corresponding to the multiple samples may be further obtained, and the inference results corresponding to the multiple samples are respectively analyzed by the first inference model Multiple samples are obtained by reasoning; then, according to the reasoning results corresponding to the multiple samples, the samples of the first type and the samples of the second type among the multiple samples can be determined. In this way, according to the inference results of the first inference model on the multiple samples, the first type of samples that can be more accurately identified by the first inference model and the second type of samples that are difficult to accurately identify by the first inference model can be determined from the multiple samples.
在一种可能的实施方式中,根据多个样本对应的推理结果,确定多个样本中的第一类样本以及第二类样本时,具体可以是先呈现标注界面,该标注界面包括多个样本对应的推理结果,从而可以根据标注人员针对该多个样本对应的推理结果的标注操作,确定多个样本中的第一类样本以及第二类样本。如此,可以通过标注人员的人工标注结果,从多个样本中确定出第一类样本以及第二类样本,以此提高确定第一类型样本以及第二类样本的准确度。In a possible implementation manner, when determining the samples of the first type and the samples of the second type among the multiple samples according to the inference results corresponding to the multiple samples, the labeling interface may be presented first, and the labeling interface includes multiple samples The corresponding inference results, so that the first type of samples and the second type of samples among the multiple samples can be determined according to the labeling operation of the labeling personnel on the inference results corresponding to the multiple samples. In this way, the first type of samples and the second type of samples can be determined from multiple samples through the manual annotation results of the annotators, so as to improve the accuracy of determining the first type of samples and the second type of samples.
在一种可能的实施方式中,在获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果之前,还可以为第一设备集合配置第一推理模型,以及为第二设备集合配置第二推理模型,以便后续能够利用配置的第一推理模型和/或第二推理模型确定决策参数,其中,第一推理模型是通过对第二推理模型进行模型压缩得到。例如,可以先通过强化学习算法对第二推理模型进行结构化搜索,以确定第一推理模型的网络结构;再通过对第二推理模型进行知识蒸馏的方式,确定第一推理模型的网络参数,以此得到第一推理模型。In a possible implementation manner, before obtaining the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, the first inference model may be configured for the first device set, and the first inference model may be configured for the second type of sample. The second device set configures the second reasoning model, so that the configured first reasoning model and/or the second reasoning model can be used to determine decision parameters later, wherein the first reasoning model is obtained by performing model compression on the second reasoning model. For example, a structured search of the second reasoning model can be performed first through a reinforcement learning algorithm to determine the network structure of the first reasoning model; and then the network parameters of the first reasoning model can be determined by performing knowledge distillation on the second reasoning model, In this way, the first reasoning model is obtained.
在一种可能的实施方式中,推理系统可以采用端边协同的方式进行部署,即第一设备集合部署于本地网络,第二设备集合部署于边缘网络;或,推理系统可以采用边云协同的方式进行部署,即第一设备集合部署于边缘网络,第二设备集合部署于云端。In a possible implementation, the inference system can be deployed in a device-edge collaborative manner, that is, the first device set is deployed on the local network, and the second device set is deployed on the edge network; or, the inference system can be deployed in an edge-cloud collaborative manner. Deployment in a different way, that is, the first set of devices is deployed on the edge network, and the second set of devices is deployed on the cloud.
第二方面,本申请提供一种配置决策装置的方法,该方法应用于包括第一设备集合、 第二设备集合以及决策装置的推理系统,该第一设备集合以及第二设备集合均包括至少一个计算设备,并且,第一设备集合中的第一推理模型的规格小于第二设备集合中的第二推理模型的规格。在执行该方法时,可以先获取第一类样本以及第二类样本,第一推理模型推理第一类样本的精度低于第一推理模型推理第二类样本的精度;然后,根据第一类样本以及第二类样本,配置决策装置中的决策参数,该决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给第二设备集合的模型输入样本。In a second aspect, the present application provides a method for configuring a decision-making device. The method is applied to an inference system including a first device set, a second device set, and a decision-making device. The first device set and the second device set each include at least one Computing devices, and the specification of the first inference model in the first set of devices is smaller than the specification of the second inference model in the second set of devices. When executing the method, the first type of samples and the second type of samples can be obtained first, the accuracy of the first inference model inferring the first type of samples is lower than the accuracy of the first inference model inference of the second type of samples; then, according to the first type The sample and the second type of sample configure the decision parameter in the decision device, and the decision parameter is used to identify the model input sample inferred by the first inference model as the model input sample transmitted to the second device set.
由于决策装置的决策参数是通过第一推理模型能够较为准确推理的第一类样本以及第一推理模型难以准确推理的第二类样本进行配置,因此,决策装置基于该决策参数通常能够准确识别出第一推理模型难以准确推理的第一类样本。这样,推理系统可以利用规格更大的第二推理模型对该类样本进行推理,以此可以使得推理系统推理模型输入样本的精度能够保持在较高水平。Since the decision parameters of the decision-making device are configured through the first type of samples that the first reasoning model can reason about accurately and the second type of samples that the first reasoning model is difficult to reason about accurately, the decision-making device can usually accurately identify The first type of samples that the first inference model is difficult to infer accurately. In this way, the inference system can use the second inference model with a larger specification to infer this type of sample, so that the accuracy of the inference model input samples of the inference system can be maintained at a relatively high level.
第三方面,本申请提供一种配置装置,所述配置装置包括用于实现第一方面中的配置决策装置的方法的各个模块。In a third aspect, the present application provides a configuration device, and the configuration device includes various modules for implementing the method of the configuration decision device in the first aspect.
第四方面,本申请提供一种配置装置,所述配置装置应用于推理系统,所述推理系统包括第一设备集合、第二设备集合以及决策装置,所述第一设备集合以及所述第二设备集合均包括至少一个计算设备,所述第一设备集合中的第一推理模型的规格小于所述第二设备集合中的第二推理模型的规格,配置装置包括:样本获取模块,用于获取第一类样本以及第二类样本,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;配置模块,用于根据所述第一类样本以及所述第二类样本,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。In a fourth aspect, the present application provides a configuration device. The configuration device is applied to an inference system. The reasoning system includes a first device set, a second device set, and a decision-making device. The first device set and the second device set The device sets each include at least one computing device, the specification of the first reasoning model in the first device set is smaller than the specification of the second reasoning model in the second device set, and the configuration device includes: a sample acquisition module, configured to acquire The first type of sample and the second type of sample, the accuracy of the first inference model inferring the first type of sample is lower than the accuracy of the first inference model inference of the second type of sample; the configuration module is used to according to the The first type of samples and the second type of samples are configured, and the decision parameters in the decision-making device are configured, and the decision parameters are used to identify the model input samples inferred by the first inference model as being transmitted to the second device A collection of model input samples.
第五方面,本申请提供一种计算机设备,所述计算机设备包括处理器和存储器;该存储器用于存储指令,当该计算机设备运行时,该处理器执行该存储器存储的该指令,以使该计算机设备执行上述第一方面或第一方面任一种可能实现方式中的配置决策装置的方法。需要说明的是,该存储器可以集成于处理器中,也可以是独立于处理器之外。计算机设备还可以包括总线。其中,处理器通过总线连接存储器。其中,存储器可以包括可读存储器以及随机存取存储器。In a fifth aspect, the present application provides a computer device, the computer device includes a processor and a memory; the memory is used to store instructions, and when the computer device is running, the processor executes the instructions stored in the memory, so that the The computer device executes the first aspect above or the method for configuring the decision-making apparatus in any possible implementation manner of the first aspect. It should be noted that the memory may be integrated in the processor, or independent of the processor. A computer device may also include a bus. Wherein, the processor is connected to the memory through the bus. Wherein, the memory may include a readable memory and a random access memory.
第六方面,本申请提供一种计算机设备,所述计算机设备包括处理器和存储器;该存储器用于存储指令,当该计算机设备运行时,该处理器执行该存储器存储的该指令,以使该计算机设备执行上述第二方面中的配置决策装置的方法。需要说明的是,该存储器可以集成于处理器中,也可以是独立于处理器之外。计算机设备还可以包括总线。其中,处理器通过总线连接存储器。其中,存储器可以包括可读存储器以及随机存取存储器。In a sixth aspect, the present application provides a computer device, the computer device includes a processor and a memory; the memory is used to store instructions, and when the computer device is running, the processor executes the instructions stored in the memory, so that the The computer equipment executes the method of configuring the decision-making device in the second aspect above. It should be noted that the memory may be integrated in the processor, or independent of the processor. A computer device may also include a bus. Wherein, the processor is connected to the memory through the bus. Wherein, the memory may include a readable memory and a random access memory.
第七方面,本申请提供一种计算机可读存储介质,所述计算机可读存储介质中存储有指令,当其在计算机设备上运行时,使得计算机设备执行上述第一方面或第一方面的任一种实现方式所述的方法。In a seventh aspect, the present application provides a computer-readable storage medium, where instructions are stored in the computer-readable storage medium, and when the computer-readable storage medium is run on a computer device, the computer device executes the above-mentioned first aspect or any of the first aspects. A method described in an implementation.
第八方面,本申请提供一种计算机可读存储介质,所述计算机可读存储介质中存储有指令,当其在计算机设备上运行时,使得计算机设备执行上述第二方面所述的方法。In an eighth aspect, the present application provides a computer-readable storage medium, where instructions are stored in the computer-readable storage medium, and when the computer-readable storage medium is run on a computer device, the computer device executes the method described in the second aspect above.
第九方面,本申请提供了一种包含指令的计算机程序产品,当其在计算机设备上运行 时,使得计算机设备执行上述第一方面或第一方面的任一种实现方式所述的方法。In a ninth aspect, the present application provides a computer program product containing instructions, which, when run on a computer device, causes the computer device to execute the method described in the first aspect or any implementation manner of the first aspect.
第十方面,本申请提供了一种包含指令的计算机程序产品,当其在计算机设备上运行时,使得计算机设备执行上述第二方面所述的方法。In a tenth aspect, the present application provides a computer program product containing instructions, which, when run on a computer device, causes the computer device to execute the method described in the second aspect above.
本申请在上述各方面提供的实现方式的基础上,还可以进行进一步组合以提供更多实现方式。On the basis of the implementation manners provided in the foregoing aspects, the present application may further be combined to provide more implementation manners.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings that need to be used in the description of the embodiments. Obviously, the drawings in the following description are only some implementations recorded in the application. For example, those skilled in the art can also obtain other drawings based on these drawings.
图1为本申请实施例提供的一种推理系统的架构示意图;FIG. 1 is a schematic diagram of the architecture of an inference system provided by an embodiment of the present application;
图2为本申请实施例提供的另一种推理系统的架构示意图;FIG. 2 is a schematic structural diagram of another reasoning system provided by an embodiment of the present application;
图3为本申请实施例提供的一种配置决策装置的方法流程示意图;FIG. 3 is a schematic flowchart of a method for configuring a decision-making device provided in an embodiment of the present application;
图4为本申请实施例提供的一示例性标注界面示意图;FIG. 4 is a schematic diagram of an exemplary labeling interface provided by the embodiment of the present application;
图5为本申请实施例提供的一种计算机设备500的结构示意图;FIG. 5 is a schematic structural diagram of a computer device 500 provided in an embodiment of the present application;
图6为本申请实施例提供的一种计算机设备600的结构示意图。FIG. 6 is a schematic structural diagram of a computer device 600 provided by an embodiment of the present application.
具体实施方式Detailed ways
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解,这样使用的术语在适当情况下可以互换,这仅仅是描述本申请的实施例中对相同属性的对象在描述时所采用的区分方式。The terms "first", "second" and the like in the specification and claims of the present application and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It should be understood that the terms used in this way can be interchanged under appropriate circumstances, and this is merely a description of the manner in which objects with the same attribute are described in the embodiments of the present application.
参见图1,为一种推理系统的架构示意图。如图1所示,该推理系统100包括第一设备集合101、第二设备集合102以及决策装置103。其中,第一设备集合101以及第二设备集合102均包括至少一个计算设备,图1中以第一设备集合101以及第二设备集合102分别包括多个服务器为例。实际应用时,构成第一设备集合101以及第二设备集合102的计算设备也可以是其它具有计算能力的设备,并不局限于图1所示的服务器。第一设备集合101以及第二设备集合102可以部署于不同的环境中。示例性地,如图1所示,第一设备集合101可以部署于边缘网络,用于在边缘侧执行相应的计算过程,如下述基于第一推理模型的推理过程等;第二设备集合102可以部署于云端,用于在云端执行相应的计算过程,如下述基于第二推理模型的推理过程等。而在其它示例中,第一设备集合101可以部署于用户侧的本地网络,如本地的终端或者服务器等;第二设备集合102可以部署于边缘网络。本实施例中,对于第一设备集合101以及第二设备集合102的具体部署方式并不进行限定。Referring to FIG. 1 , it is a schematic diagram of an architecture of an inference system. As shown in FIG. 1 , the reasoning system 100 includes a first device set 101 , a second device set 102 and a decision-making device 103 . Wherein, both the first device set 101 and the second device set 102 include at least one computing device. In FIG. 1 , it is taken that the first device set 101 and the second device set 102 respectively include multiple servers as an example. In practical applications, the computing devices constituting the first device set 101 and the second device set 102 may also be other devices with computing capabilities, and are not limited to the servers shown in FIG. 1 . The first device set 101 and the second device set 102 may be deployed in different environments. Exemplarily, as shown in FIG. 1 , the first set of devices 101 can be deployed on the edge network to execute corresponding calculation processes on the edge side, such as the following inference process based on the first inference model; the second set of devices 102 can be It is deployed on the cloud, and is used to execute a corresponding calculation process on the cloud, such as the following reasoning process based on the second reasoning model. In other examples, the first set of devices 101 may be deployed on a local network on the user side, such as a local terminal or server; the second set of devices 102 may be deployed on an edge network. In this embodiment, specific deployment manners of the first device set 101 and the second device set 102 are not limited.
决策装置103可以与第一设备集合101部署于相同的环境中,比如,决策装置103可以与第一设备集合101部署于如图所示的边缘侧网络,或者也可以是与第一设备集合101部署于本地网络等。其中,决策装置103可以通过软件或者硬件实现。当通过软件实现时,决策装置103可以是应用于计算设备上的应用程序,该计算设备与第一设备集合101部署于相同的环境。当通过硬件实现时,决策装置103可以是与第一设备集合101位于相同环境的计算设备,如服务器;或者,决策装置103可以是利用专用集成电路(application-specific integrated circuit,ASIC)实现、或可编程逻辑器件(programmable logic device,PLD)实现的设备等。 其中,上述PLD可以是复杂程序逻辑器件(complex programmable logical device,CPLD)、现场可编程门阵列(field-programmable gate array,FPGA)、通用阵列逻辑(generic array logic,GAL)或其任意组合实现。The decision-making device 103 can be deployed in the same environment as the first device set 101. For example, the decision-making device 103 can be deployed with the first device set 101 on the edge side network as shown in the figure, or can also be deployed with the first device set 101 Deployed on a local network, etc. Wherein, the decision-making device 103 may be implemented by software or hardware. When implemented by software, the decision-making device 103 may be an application program applied to a computing device, and the computing device is deployed in the same environment as the first device set 101 . When implemented by hardware, the decision-making device 103 may be a computing device located in the same environment as the first device set 101, such as a server; or, the decision-making device 103 may be realized by using an application-specific integrated circuit (ASIC), or Devices implemented by programmable logic devices (programmable logic device, PLD), etc. Wherein, the above-mentioned PLD can be implemented by complex programmable logic device (complex programmable logical device, CPLD), field-programmable gate array (field-programmable gate array, FPGA), general array logic (generic array logic, GAL) or any combination thereof.
其中,第一设备集合101以及第二设备集合102中均可以配置有推理模型,为便于描述,以下将配置在第一设备集合101中的推理模型称之为第一推理模型,将配置在第二设备集合102中的推理模型称之为第二推理模型。实际应用场景中,基于第一设备集合101与第二设备集合102的部署环境中的物理资源差异,第一推理模型的规格与第二推理模型的规格不同,例如,当第一设备集合部署于边缘网络而第二设备集合部署于云端时,第一推理模型的规格为400KB(千字节),而第二推理模型的规格是40000KB。本申请中以第一推理模型的规格小于第二推理模型的规格为例进行示例性说明。Wherein, inference models can be configured in both the first device set 101 and the second device set 102. For the convenience of description, the reasoning model configured in the first device set 101 will be referred to as the first reasoning model below, and the configuration in The reasoning models in the second device set 102 are called the second reasoning models. In an actual application scenario, based on the difference in physical resources in the deployment environments of the first device set 101 and the second device set 102, the specification of the first inference model is different from that of the second inference model. For example, when the first set of devices is deployed in When the edge network and the second set of devices are deployed on the cloud, the size of the first inference model is 400KB (kilobytes), and the size of the second reasoning model is 40000KB. In this application, the specification of the first reasoning model is smaller than the specification of the second reasoning model as an example for illustration.
推理系统100在推理模型输入样本时,如图1所示,第一设备集合101可以接收用户侧的终端设备104发送的模型输入样本,该模型输入样本例如可以是终端设备104拍摄(或者通过其它设备拍摄)得到的图像等。然后,第一设备集合101可以利用预先配置的第一推理模型对获取的模型输入样本进行推理,并得到推理结果。然后,决策装置103可以基于预先配置的决策参数以及第一推理模型输出的推理结果,判定该模型输入样本是否为第一类样本,即判定第一推理模型对该模型输入样本进行的推理是否准确。当判定该模型输入样本为第一类样本时,表征决策装置103确定第一推理模型对于该模型输入样本的推理不准确,则决策装置103可以指示第一设备集合101将该模型输入样本发送至第二设备集合102,以便第二设备集合102利用规格更大的第二推理模型对该模型输入样本进行准确推理并反馈。而当判定该模型输入样本为第二类样本时,表征决策装置103确定第一推理模型能够准确推理该模型输入样本,则决策装置103可以指示第一设备集合101直接向终端设备104反馈该模型输入样本对应的推理结果。如此,可以使得推理系统100针对该模型输入样本的推理准确性达到较高水平。When the inference system 100 infers model input samples, as shown in FIG. 1 , the first set of devices 101 can receive the model input samples sent by the terminal device 104 on the user side, and the model input samples can be taken by the terminal device 104 (or through other equipment shooting) obtained images, etc. Then, the first set of devices 101 may use the preconfigured first inference model to perform inference on the acquired model input samples, and obtain an inference result. Then, the decision-making device 103 can determine whether the model input sample is a first-type sample based on the pre-configured decision parameters and the inference result output by the first inference model, that is, determine whether the inference performed by the first inference model on the model input sample is accurate. . When it is determined that the model input sample is the first type of sample, the characterization decision-making unit 103 determines that the reasoning of the model input sample by the first reasoning model is inaccurate, and the decision-making unit 103 may instruct the first device set 101 to send the model input sample to The second set of devices 102, so that the second set of devices 102 uses a second reasoning model with a larger specification to perform accurate reasoning on the model input samples and give feedback. And when it is determined that the model input sample is a sample of the second type, the characterization decision-making means 103 determines that the first inference model can accurately infer the model input sample, then the decision-making means 103 can instruct the first device set 101 to directly feed back the model to the terminal device 104 The inference result corresponding to the input sample. In this way, the inference accuracy of the inference system 100 for the model input samples can reach a higher level.
但是,实际应用场景中,决策装置103中的决策参数,通常是由技术人员根据经验进行人工设定,这使得决策装置103在基于人工设定的决策参数判定模型输入样本是否为第一类样本时,判定准确性较低。这样,实际上属于第一类样本的模型输入样本,因为决策装置103的错误判定而被误识别为第二类样本,从而导致该模型输入样本未被传输至第二设备集合102中进行推理,同时第一推理模型针对第一类样本的推理准确性较低。如此,降低了推理系统100对于模型输入样本的推理准确性。而若将所有的模型输入样本均传输给第二设备集合102,并利用第二设备集合102中的第二推理模型进行推理,则会占用第一设备集合101与第二设备集合102之间的大量传输带宽。However, in actual application scenarios, the decision parameters in the decision-making device 103 are usually manually set by technicians based on experience, which makes the decision-making device 103 determine whether the model input samples are the first-type samples based on the manually-set decision parameters. , the judgment accuracy is low. In this way, the model input samples that actually belong to the first type of samples are misidentified as the second type of samples due to the erroneous determination of the decision-making device 103, so that the model input samples are not transmitted to the second device set 102 for reasoning. At the same time, the inference accuracy of the first inference model for the first type of samples is relatively low. In this way, the inference accuracy of the inference system 100 for model input samples is reduced. However, if all the model input samples are transmitted to the second device set 102, and the second inference model in the second device set 102 is used for inference, the space between the first device set 101 and the second device set 102 will be occupied. Mass transfer bandwidth.
基于此,本申请实施例提供了一种配置决策装置的方法,以提高决策装置103判定第一类样本的准确性,从而提高推理系统100对于模型输入样本的推理准确性。该配置决策装置的方法可以应用于图2所示的推理系统200中。在图1所示的推理系统100的基础上,图2所示的推理系统200中新增有配置装置105,该配置装置105可以用于对决策装置103中的决策参数进行配置。具体实现时,配置装置105先获取第一推理模型难以准确识别的第一类样本对应的第一推理结果以及第一推理模型能够准确识别的第二类样本对应的第二推理结果,即第一推理模型推理第一类样本的精度低于第一推理模型推理第二类样本的精度。然后,配 置装置105根据该第一推理结果以及第二推理结果,配置决策装置103中的决策参数。由于决策装置103的决策参数是通过第一推理模型分别对应的第一类样本对应的推理结果以及第二类样本对应的推理结果进行配置,因此,决策装置103基于该决策参数通常能够准确识别出第一推理模型难以准确推理的第一类样本。这样,推理系统100可以利用规格更大的第二推理模型对该第一类样本进行推理,以此可以提高推理系统100推理模型输入样本的准确性。Based on this, the embodiment of the present application provides a method for configuring a decision-making device to improve the accuracy of the decision-making device 103 in determining the first type of samples, thereby improving the reasoning accuracy of the reasoning system 100 for model input samples. The method for configuring a decision-making device can be applied to the reasoning system 200 shown in FIG. 2 . On the basis of the reasoning system 100 shown in FIG. 1 , a configuration device 105 is added to the reasoning system 200 shown in FIG. 2 , and the configuration device 105 can be used to configure decision parameters in the decision device 103 . During specific implementation, the configuration device 105 first obtains the first inference result corresponding to the first type of sample that the first inference model is difficult to accurately identify, and the second inference result corresponding to the second type of sample that the first inference model can accurately identify, that is, the first The accuracy of the inference model for inferring the first type of samples is lower than the accuracy of the first inference model for inferring the second type of samples. Then, the configuration means 105 configures the decision parameters in the decision means 103 according to the first reasoning result and the second reasoning result. Since the decision parameters of the decision-making device 103 are configured through the inference results corresponding to the first-type samples and the inference results corresponding to the second-type samples respectively corresponding to the first inference model, the decision-making device 103 can usually accurately identify The first type of samples that the first inference model is difficult to infer accurately. In this way, the inference system 100 can use the second inference model with a larger specification to perform inference on the first type of samples, thereby improving the accuracy of the inference system 100 inference model input samples.
其中,配置装置105可以与第一设备集合101部署于相同的环境中,或者可以与第二设备集合102部署于相同的环境中。并且,配置装置105可以通过软件或者硬件的方式实现。当通过软件实现时,配置装置105可以是应用于推理系统200中的计算设备上的应用程序,如可以是应用于第一设备集合101中任意计算设备上的程序,或者应用于第二设备集合102中任意计算设备上的程序,或者是应用于推理系统200中单独部署的计算设备上的程序等。而当通过硬件实现时,配置装置105可以是推理系统200中单独部署的计算设备,如具有配置功能的服务器等。Wherein, the configuration apparatus 105 may be deployed in the same environment as the first device set 101 , or may be deployed in the same environment as the second device set 102 . Moreover, the configuring device 105 may be realized by means of software or hardware. When implemented by software, the configuration means 105 may be an application program applied to a computing device in the reasoning system 200, such as a program applied to any computing device in the first device set 101, or applied to the second device set A program on any computing device in 102, or a program applied to a computing device deployed separately in the reasoning system 200, etc. When implemented by hardware, the configuration device 105 may be a computing device deployed separately in the inference system 200, such as a server with a configuration function.
需要说明的是,图2所示的推理系统仅作为一种示例性说明,并不用于限定推理系统的具体实现。例如,在其它可能的实施方式中,推理系统200可以包括更多的功能模块以支持推理系统具有更多其它的功能;或者,当推理系统200中的第一设备集合101部署于用户侧的本地网络时,第一设备集合101中的设备具体可以是终端设备104等。It should be noted that the reasoning system shown in FIG. 2 is only used as an exemplary description, and is not used to limit the specific implementation of the reasoning system. For example, in other possible implementations, the inference system 200 may include more functional modules to support the inference system to have more other functions; or, when the first device set 101 in the inference system 200 is deployed on the local In network, the devices in the first device set 101 may specifically be terminal devices 104 and the like.
为便于理解,下面结合附图,对本申请提供的配置决策装置的实施例进行描述。For ease of understanding, embodiments of the configuration decision-making device provided by the present application are described below with reference to the accompanying drawings.
参见图3,图3为本申请实施例提供的一种配置决策装置的方法流程示意图。其中,图3所示的推理方法可以应用于图2所示的推理系统200,或者应用于其它可适用的推理系统中。为便于说明,本实施例中以应用于图2所示的推理系统200,并且由推理系统200中的配置装置105进行执行为例进行示例性说明。Referring to FIG. 3 , FIG. 3 is a schematic flowchart of a method for configuring a decision-making device according to an embodiment of the present application. Wherein, the reasoning method shown in FIG. 3 can be applied to the reasoning system 200 shown in FIG. 2 , or to other applicable reasoning systems. For ease of description, in this embodiment, it is applied to the inference system 200 shown in FIG. 2 and executed by the configuration device 105 in the inference system 200 as an example for illustration.
基于图2所示的推理系统200,图3所示的配置决策装置的方法具体可以包括:Based on the reasoning system 200 shown in FIG. 2, the method for configuring the decision-making device shown in FIG. 3 may specifically include:
S301:配置装置105分别在第一设备集合101以及第二设备集合102中配置第一推理模型以及第二推理模型,其中,第一推理模型的规格小于第二推理模型的规格。S301: The configuration module 105 configures a first inference model and a second inference model in the first device set 101 and the second device set 102 respectively, wherein a specification of the first inference model is smaller than a specification of the second inference model.
本实施例中,推理系统200可以采用两级的推理机制对模型输入样本进行推理。具体实现时,推理系统200可以是优先利用部署在边缘网络的第一设备集合中的第一推理模型对模型输入样本进行推理。若第一推理模型的推理结果较为准确,则推理系统100可以将第一推理模型输出的推理结果作为反馈给终端设备104的推理结果。而若第一推理模型的推理结果不准确,则第一设备集合101可以将模型输入样本传输至部署于云端的第二设备集合102,以便利用云端的规格更大的第二模型推理对该模型输入样本进行精确推理,从而推理系统200向终端设备104反馈的推理结果即为该第二推理模型输出的推理结果。其中,是否将模型输入样本传输至第二设备集合102,可以由决策装置103进行判定。In this embodiment, the inference system 200 may use a two-stage inference mechanism to infer model input samples. During specific implementation, the inference system 200 may preferentially use the first inference model deployed in the first device set of the edge network to perform inference on model input samples. If the reasoning result of the first reasoning model is relatively accurate, the reasoning system 100 may use the reasoning result output by the first reasoning model as the reasoning result fed back to the terminal device 104 . And if the inference result of the first inference model is inaccurate, the first set of devices 101 can transmit the model input samples to the second set of devices 102 deployed in the cloud, so that the second model with a larger specification in the cloud can be used to reason about the model. The input samples are used for precise reasoning, so the reasoning result fed back by the reasoning system 200 to the terminal device 104 is the reasoning result output by the second reasoning model. Wherein, whether to transmit the model input samples to the second device set 102 may be determined by the decision-making unit 103 .
在一种可能的实施方式中,第二设备集合102中的第二推理模型,可以预先在技术人员的干预下,完成模型构建以及训练过程,以使得第二推理模型达到较高的推理精度。并且,在生成该第二推理模型后,配置装置105中的配置模块1052可以将其配置于第二设备集合102中。而在生成第一推理模型时,配置模块1052可以通过模型压缩的方式,基于第二推理 模型生成规格较小的第一推理模型。例如,配置模块1052可以基于强化学习(reinforcement learning,RL)算法对第二推理模型进行结构搜索,以确定出第一推理模型的网络结构;然后,配置模块1052可以通过知识蒸馏的方式,对已确定网络结构的第一推理模型进行训练,以此确定出第一推理模型中的网络参数。此时,第一推理模型可以称之为学生模型,而第二推理模型可以称之为教师模型。由于基于教师模型生成学生模型的具体实现过程,在相关技术中已有应用,在此不做赘述。或者,配置模块1052也可以通过与生成第二推理模型类似的方式,构建并训练得到第一推理模型。然后,配置模块1052可以将生成的第一推理模型部署于第一设备集合101中。In a possible implementation manner, the second inference model in the second device set 102 may complete the model building and training process in advance with the intervention of technicians, so that the second inference model can achieve higher inference accuracy. Moreover, after the second reasoning model is generated, the configuration module 1052 in the configuring device 105 can configure it in the second device set 102 . When generating the first inference model, the configuration module 1052 can generate the first inference model with a smaller size based on the second inference model by means of model compression. For example, the configuration module 1052 can perform a structural search on the second reasoning model based on a reinforcement learning (RL) algorithm to determine the network structure of the first reasoning model; then, the configuration module 1052 can perform knowledge distillation on the existing The first reasoning model that determines the network structure is trained to determine network parameters in the first reasoning model. At this time, the first reasoning model can be called a student model, and the second reasoning model can be called a teacher model. Since the specific implementation process of generating the student model based on the teacher model has already been applied in related technologies, details will not be described here. Alternatively, the configuration module 1052 may also construct and train the first inference model in a manner similar to generating the second inference model. Then, the configuration module 1052 can deploy the generated first reasoning model in the first device set 101 .
S302:配置装置105获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,其中,第一推理模型推理第一类样本的精度低于第一推理模型推理第二类样本的精度。S302: The configuration device 105 obtains the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, wherein the accuracy of the first inference model inferring the first type of sample is lower than that of the first inference model inferring the second The precision of class samples.
其中,第一类样本,是指第一推理模型难以准确推理的样本,实际应用时,这类样本也可以是称之为第一推理模型对应的难例样本。比如,第一推理模型针对样本进行推理所得到的推理结果的置信度小于第一预设值时,可以将该样本确定为第一类样本(也即难例样本)等。相应的,第二类样本,是指第一推理模型能够较为准确推理的样本,实际应用时,这类样本也可以是称之为第一推理模型对应的简单样本。比如,可以将推理结果的置信度大于第二预设值的样本确定为第二类样本(也即简单例样本)等,该第二预设值大于前述第一预设值。本实施例中,配置装置105可以通过获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,对决策装置103进行配置,以便配置后的决策装置103能够识别出第一推理模型难以准确推理的第一类样本以及第一推理模型能够准确识别的第二类样本。Wherein, the first type of samples refers to samples that are difficult to be accurately inferred by the first inference model. In practical applications, such samples may also be referred to as difficult samples corresponding to the first inference model. For example, when the confidence of the inference result obtained by the first inference model inference on the sample is less than the first preset value, the sample may be determined as the first type of sample (ie, a difficult sample). Correspondingly, the second type of samples refers to samples that can be inferred relatively accurately by the first reasoning model. In practical applications, such samples can also be called simple samples corresponding to the first reasoning model. For example, a sample whose confidence degree of the inference result is greater than a second preset value may be determined as the second type of sample (that is, a simple example sample), etc., and the second preset value is greater than the aforementioned first preset value. In this embodiment, the configuration device 105 can configure the decision-making device 103 by acquiring the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, so that the configured decision-making device 103 can identify The first type of samples that the first reasoning model is difficult to accurately reason about and the second type of samples that the first reasoning model can accurately identify.
在一种可能的实施方式中,如图2所示,配置装置105中可以包括推理结果获取模块1051以及配置模块1052。其中,推理结果获取模块1051可以获取多个样本,该多个样本例如可以是由技术人员提供。然后,推理结果获取模块1051可以将该多个样本发送给第一设备集合101。同时,配置模块1052可以指示第一设备集合101利用第一推理模型分别对该多个样本中的各个样本进行推理,得到每个样本对应的推理结果。这样,推理结果获取模块1051可以根据多个样本对应的推理结果,进一步确定该多个样本中相对于第一推理模型的第一类样本以及第二类样本,例如可以将多个样本中推理结果的置信度小于第一预设值的样本确定为第一类样本,将该多个样本中推理结果的置信度大于第二预设值的样本确定为第二类样本,该第二预设值不小于第一预设值。如此,推理结果获取模块1051可以从多个样本对应的推理结果中,确定第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果。In a possible implementation manner, as shown in FIG. 2 , the configuration device 105 may include an inference result acquisition module 1051 and a configuration module 1052 . Wherein, the inference result obtaining module 1051 may obtain multiple samples, and the multiple samples may be provided by technicians, for example. Then, the inference result acquisition module 1051 may send the multiple samples to the first device set 101 . At the same time, the configuration module 1052 may instruct the first set of devices 101 to use the first inference model to perform inference on each sample in the plurality of samples, and obtain an inference result corresponding to each sample. In this way, the inference result acquisition module 1051 can further determine the first type of samples and the second type of samples in the multiple samples relative to the first inference model according to the inference results corresponding to the multiple samples, for example, the inference results in the multiple samples can be The samples whose confidence degree is less than the first preset value are determined as the first type of samples, and the samples whose inference result confidence degree is greater than the second preset value among the plurality of samples are determined as the second type of samples, and the second preset value not less than the first preset value. In this way, the inference result acquisition module 1051 can determine the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample from the inference results corresponding to the plurality of samples.
作为一种确定第一类样本的实现示例,推理结果获取模块1051具体可以根据人工标记结果确定第一类样本。具体的,推理结果获取模块1051可以向标注人员呈现标注界面,例如可以呈现如图4所示的标注界面,该标注界面中可以包括第一推理模型针对多个样本对应的推理结果。这样,标注人员可以在标注界面上为各个样本对应的推理结果进行推理正确以及推理错误的标注,从而推理结果获取模块1051可以根据标注人员的标注操作,从多个样本中确定出第一类样本以及第二类样本,例如,推理结果获取模块1051可以将标注“推 理正确”的推理结果所对应的样本确定为第二类样本,将标注“推理错误”的推理结果所对应的样本确定为第一类样本等。As an implementation example of determining the samples of the first type, the inference result obtaining module 1051 may specifically determine the samples of the first type according to the manually marked results. Specifically, the inference result acquisition module 1051 may present a labeling interface to the labeler, for example, the labeling interface shown in FIG. 4 may be presented, and the labeling interface may include inference results corresponding to multiple samples of the first inference model. In this way, the annotator can mark the reasoning results corresponding to each sample on the labeling interface for correct reasoning and wrong reasoning, so that the inference result acquisition module 1051 can determine the first type of samples from multiple samples according to the labeling operation of the labeler As well as the second type of samples, for example, the inference result acquisition module 1051 may determine the samples corresponding to the inference results labeled "correct inference" as the second type of samples, and determine the samples corresponding to the inference results labeled "inference error" as the second type of samples. A class of samples, etc.
值得注意的是,第一推理模型能够准确识别的第二类样本以及难以准确识别的第一类样本的数量差异可能较大,比如,实际应用场景中,第一类样本的数量可能远小于第二类样本的数量。因此,在一些可能的实施方式中,推理结果获取模块1051可以基于已确定的第一类样本或者第二类样本,通过委员会投票(query-by-committee,QBC)的方式获得数量相当的第一类样本以及第二类样本。或者,推理结果获取模块1051也可以是通过有放回抽样的方式,从已确定的第一类样本或者第二类样本中获得数量相当的第一类样本以及第二类样本等。本实施例中,对于推理结果获取模块1051获取足够数量的第一类样本以及第二类样本的具体实现方式并不进行限定。It is worth noting that the number of samples of the second type that can be accurately identified by the first inference model and the number of samples of the first type that are difficult to accurately identify may vary greatly. For example, in actual application scenarios, the number of samples of the first type may be much smaller than that of the first type. The number of samples of the second class. Therefore, in some possible implementations, the inference result acquisition module 1051 can obtain a corresponding number of first-class samples by means of query-by-committee (QBC) based on the determined first-type samples or second-type samples. class samples and the second class samples. Alternatively, the inference result acquisition module 1051 may also obtain a comparable number of first-type samples and second-type samples from the determined first-type samples or second-type samples by means of sampling with replacement. In this embodiment, there is no limitation on the specific implementation manner of obtaining a sufficient number of samples of the first type and samples of the second type by the inference result obtaining module 1051 .
S303:配置装置105根据第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,配置决策装置103中的决策参数,该决策参数用于将第一推理模型推理的模型输入样本识别为传输给第二设备集合102的模型输入样本。S303: The configuration device 105 configures the decision parameters in the decision device 103 according to the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, and the decision parameter is used to infer the first reasoning model to the model The input samples are identified as model input samples transmitted to the second set of devices 102 .
作为一种示例,决策装置103中的决策参数,例如可以是置信度阈值,从而决策装置103可以通过比较该置信度阈值与推理结果的置信度,确定该推理结果对应的模型输入样本是否为第一推理模型难以准确推理的第一类样本。具体的,当第一推理模型输出的推理结果的置信度小于该置信度阈值时,决策装置103可以确定该推理结果对应的模型输入样本为第一类样本,并可以进一步指示第一设备集合101将该模型输入样本传输至第二设备集合102。而当第一推理模型输出的推理结果的置信度大于该置信度阈值时,决策装置103可以确定该推理结果对应的模型输入样本为第一推理模型能够准确识别的第二类样本,从而决策装置103可以指示第一设备集合101将第一推理模型针对该模型输入样本的推理结果反馈给终端设备104。As an example, the decision parameter in the decision-making device 103 may be, for example, a confidence threshold, so that the decision-making device 103 can determine whether the model input sample corresponding to the reasoning result is the first by comparing the confidence threshold with the confidence of the reasoning result. A first type of sample that is difficult for the inference model to infer accurately. Specifically, when the confidence of the inference result output by the first inference model is less than the confidence threshold, the decision-making device 103 may determine that the model input sample corresponding to the inference result is a first-type sample, and may further instruct the first device set 101 to The model input samples are transmitted to the second set of devices 102 . And when the confidence of the inference result output by the first inference model is greater than the confidence threshold, the decision-making device 103 may determine that the model input sample corresponding to the inference result is the second type of sample that the first inference model can accurately identify, so that the decision-making device 103 may instruct the first set of devices 101 to feed back the inference result of the first inference model for the model input sample to the terminal device 104 .
而在另一种示例中,决策装置103中的决策参数,例如可以是神经网络模型中的网络参数。具体实现时,决策装置103中可以包括神经网络模型,该神经网络模型的输入为模型输入样本或者第一推理模型针对该模型输入样本所输出的推理结果,神经网络模型的输出为该模型输入样本为第一类样本或者为第二类样本的判定结果,从而配置装置105所确定的决策参数,即为该神经网络模型中的网络参数。In another example, the decision parameters in the decision device 103 may be, for example, network parameters in a neural network model. During specific implementation, the decision-making device 103 may include a neural network model, the input of the neural network model is the model input sample or the inference result output by the first reasoning model for the model input sample, and the output of the neural network model is the model input sample The decision parameters determined by the configuration device 105 are the network parameters in the neural network model for the determination result of the first type of sample or the second type of sample.
相应的,配置装置105在确定决策装置103中的决策参数时,具体可以是利用获取的第一类样本以及第二类样本进行确定。Correspondingly, when the configuring device 105 determines the decision-making parameters in the decision-making device 103 , it may specifically use the obtained samples of the first type and samples of the second type to determine.
在一种可能的实施方式中,配置模块1052可以指示第一设备集合101利用第一推理模型对第一类样本以及第二类样本进行推理,得到该第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,从而配置模块1052可以根据该第一推理结果以及第二推理结果,确定决策装置103中的决策参数。In a possible implementation manner, the configuration module 1052 may instruct the first set of devices 101 to use the first inference model to perform inference on the first type of samples and the second type of samples, and obtain the first inference results corresponding to the first type of samples and The second type of sample corresponds to the second inference result, so the configuration module 1052 can determine the decision parameters in the decision-making device 103 according to the first inference result and the second inference result.
例如,当决策参数具体为置信度阈值时,第一类样本对应的第一推理结果中可以包括第一置信度,该第一置信度例如可以是多个第一类样本分别对应的推理结果的置信度的平均值等,第二类样本对应的第二推理结果中可以包括第二置信度,该第二置信度例如可以是多个第二类样本分别对应的推理结果的置信度的平均值等,从而配置模块1052可以根据该第一置信度与第二置信度的取值,确定决策参数的置信度阈值的取值,并基于所确定的 取值为决策装置103配置决策参数。For example, when the decision parameter is specifically a confidence threshold, the first inference result corresponding to the first type of sample may include a first confidence level, and the first confidence level may be, for example, the inference result corresponding to a plurality of first type samples respectively. The average value of the confidence degree, etc., the second inference result corresponding to the second type of sample may include the second confidence degree, for example, the second confidence degree may be the average value of the confidence degree of the inference result corresponding to the plurality of second type samples respectively etc., so that the configuration module 1052 can determine the value of the confidence threshold of the decision parameter according to the values of the first confidence degree and the second confidence degree, and configure the decision parameter for the decision-making device 103 based on the determined value.
又例如,当决策参数具体为神经网络模型中的网络参数时,配置模块1052可以将部分第一类样本对应的第一推理结果以及部分第二类样本对应的第二推理结果,作为决策装置103中的神经网络模型的输入,将样本属于第一类样本还是第二类样本的标注结果作为该神经网络模型的输出,以此对该神经网络模型中的网络参数进行训练。然后,配置模块1052可以利用剩余部分的第一类样本对应的第一推理结果以及剩余部分的第二类样本对应的第二推理结果作为测试集,对训练得到的神经网络模型进行测试,以测试该神经网络模型对于第一类样本与第二类样本的分类准确性,示例性地,该神经网络模型例如可以是二分类模型。当神经网络模型通过测试时,该神经网络模型中的网络参数即为配置模块1052最终所确定的决策参数。For another example, when the decision parameters are specifically network parameters in the neural network model, the configuration module 1052 may use the first inference results corresponding to some samples of the first type and the second inference results corresponding to some samples of the second type as the decision-making device 103 The input of the neural network model in , the labeling result of whether the sample belongs to the first type of sample or the second type of sample is used as the output of the neural network model, so as to train the network parameters in the neural network model. Then, the configuration module 1052 can use the first inference results corresponding to the remaining samples of the first type and the second inference results corresponding to the remaining samples of the second type as a test set to test the trained neural network model to test The classification accuracy of the neural network model for the first type of samples and the second type of samples, for example, the neural network model may be a binary classification model. When the neural network model passes the test, the network parameters in the neural network model are the decision parameters finally determined by the configuration module 1052 .
上述实施方式中,第一推理结果以及第二推理结果均为第一推理模型输出的推理结果,而在另一种可能的实施方式中,第一推理结果以及第二推理结果是由不同推理模型输出的推理结果。具体的,配置模块1052可以指示第一设备集合101利用第一推理模型对第二类样本进行推理,得到第二类样本对应的第二推理结果。同时,配置模块1052还可以指示第二设备集合102利用第二推理模型对第一类样本进行推理,得到第一类样本对应的第一推理结果,从而配置模块1052可以根据该第一推理结果以及第二推理结果,确定决策装置103中的决策参数。其中,配置模块1052根据第一推理结果以及第二推理结果确定决策参数的具体过程,可以参见前述实施方式中根据第一推理结果以及第二推理结果确定决策参数的具体过程描述,在此不做赘述。In the above embodiment, the first reasoning result and the second reasoning result are the reasoning results output by the first reasoning model, and in another possible embodiment, the first reasoning result and the second reasoning result are output by different reasoning models output inference results. Specifically, the configuration module 1052 may instruct the first set of devices 101 to use the first inference model to perform inference on samples of the second type to obtain a second inference result corresponding to the samples of the second type. At the same time, the configuration module 1052 can also instruct the second set of devices 102 to use the second inference model to perform inference on the first type of sample to obtain the first inference result corresponding to the first type of sample, so that the configuration module 1052 can use the first inference result and The second reasoning result is to determine the decision parameters in the decision device 103 . Wherein, for the specific process of the configuration module 1052 determining the decision parameters according to the first reasoning result and the second reasoning result, please refer to the description of the specific process of determining the decision parameters according to the first reasoning result and the second reasoning result in the foregoing embodiment, which will not be described here. repeat.
值得注意的是,上述确定决策参数的具体实现方式仅作为一些示例性说明,实际应用时,决策参数也可以是其它类型的参数,从而配置模块1052可以通过其它可能的实现方式确定决策参数,本实施例对此并不进行限定。It is worth noting that the specific implementation methods for determining the decision-making parameters above are only used as some exemplary illustrations. In practical applications, the decision-making parameters can also be other types of parameters, so that the configuration module 1052 can determine the decision-making parameters through other possible implementation methods. The embodiment does not limit this.
这样,在确定出决策参数后,决策装置103可以根据该决策参数分析第一推理模型后续所推理的各个模型输入样本是否属于第一类样本,以便在确定其属于第一类样本时,指示第一设备集合101将该第一类样本发送至第二设备集合102中进行推理,以此提高针对第一类样本的推理精度,从而可以使得推理系统200针对模型输入样本的推理精度保持在较高水平。同时,对于第二类样本,利用第一设备集合101中的第一推理模型即可完成准确推理,从而可以无需将其传输至第二设备集合102,以此可以减少第一设备集合101与第二设备集合102之间的传输带宽的资源消耗。In this way, after the decision parameter is determined, the decision-making device 103 can analyze whether each model input sample subsequently inferred by the first inference model belongs to the first type of sample according to the decision parameter, so that when it is determined that it belongs to the first type of sample, indicate the first A set of devices 101 sends the first-type samples to the second set of devices 102 for inference, so as to improve the inference accuracy for the first-type samples, so that the inference accuracy of the inference system 200 for model input samples can be maintained at a high level level. At the same time, for the second type of samples, accurate inference can be completed by using the first inference model in the first device set 101, so that it is not necessary to transmit them to the second device set 102, thereby reducing the number of connections between the first device set 101 and the second set of devices. Resource consumption of the transmission bandwidth between the two device sets 102 .
需要说明的是,本实施例中,是以配置装置105根据两类样本对应的推理结果确定决策参数为例进行示例性说明,在其它可能的实施例中,配置装置105也可以是直接根据两类样本确定决策参数。比如,在一种可能的实施方式中,当决策参数具体为神经网络模型中的网络参数时,配置装置105中的配置模块1052可以将部分第一类样本以及部分第二类样本,作为决策装置103中的神经网络模型的输入,将样本属于第一类样本还是第二类样本的标注结果作为该神经网络模型的输出,以此对该神经网络模型中的网络参数进行训练。然后,配置模块1052可以利用剩余部分的第一类样本以及剩余部分的第二类样本作为测试集,对训练得到的神经网络模型进行测试,以测试该神经网络模型识别第一类与第二类的准确性。例如,神经网络模型可以根据测试集中的样本的特征,与第一类样本以及第二类样本中的 样本特征(如图像特征等)进行差异性分析,以确定测试集中的样本属于第一类样本或者第二类样本。如此,当神经网络模型通过测试时,该神经网络模型中的网络参数即为配置模块1052最终所确定的决策参数。It should be noted that in this embodiment, the configuration device 105 determines the decision parameters according to the inference results corresponding to the two types of samples as an example for illustration. In other possible embodiments, the configuration device 105 may also directly Class samples determine the decision parameters. For example, in a possible implementation, when the decision-making parameters are specifically network parameters in the neural network model, the configuration module 1052 in the configuration device 105 can use some samples of the first type and some samples of the second type as the decision-making device The input of the neural network model in 103 is to use the labeling result of whether the sample belongs to the first type of sample or the second type of sample as the output of the neural network model, so as to train the network parameters in the neural network model. Then, the configuration module 1052 can use the remaining part of the first-type samples and the remaining part of the second-type samples as a test set to test the trained neural network model, so as to test that the neural network model recognizes the first type and the second type accuracy. For example, the neural network model can perform difference analysis with the sample features (such as image features, etc.) in the first type of samples and the second type of samples according to the characteristics of the samples in the test set to determine that the samples in the test set belong to the first type of samples Or the second type of samples. In this way, when the neural network model passes the test, the network parameters in the neural network model are the decision parameters finally determined by the configuration module 1052 .
本实施例中,通过在MNIST、SVHN、CIFAR-10等公开数据集上进行测,可以确定在决策装置103利用决策参数对模型输入样本进行第一类样本或者第二类样本的判断后,可以使得推理系统200的推理准确度达到较高的水平(近似于全部利用第二推理模型进行推理的精度),如下表1所示:In this embodiment, by testing on public data sets such as MNIST, SVHN, and CIFAR-10, it can be determined that after the decision-making device 103 uses the decision parameters to judge the first type of sample or the second type of sample on the model input sample, it can be Make the reasoning accuracy of the reasoning system 200 reach a higher level (approximate to the accuracy of reasoning using the second reasoning model), as shown in Table 1 below:
表1Table 1
Figure PCTCN2022091969-appb-000001
Figure PCTCN2022091969-appb-000001
具体的,本实施例在确定出决策参数后,还可以包括以下对模型输入样本进行推理的步骤:Specifically, after the decision parameters are determined in this embodiment, the following steps of reasoning the model input samples may also be included:
S304:第一设备集合101接收模型输入样本。S304: The first device set 101 receives a model input sample.
例如,用户侧的终端设备104可以向第一设备集合101发送模型输入样本,该模型输入样本例如可以是拍摄图像,如在安全帽场景中针对施工工地的拍摄图像等,或者可以是其它用于作为第一推理模型的输入的样本。For example, the terminal device 104 on the user side may send a model input sample to the first set of devices 101. The model input sample may be, for example, a captured image, such as a captured image of a construction site in a helmet scene, or other A sample as input to the first inference model.
S305:第一设备集合101利用预先配置的规格较小的第一推理模型对模型输入样本进行推理,得到推理结果。S305: The first set of devices 101 uses the pre-configured first inference model with a smaller specification to perform inference on the model input samples to obtain an inference result.
S306:决策装置103根据已确定的决策参数以及第一推理模型针对该模型输入样本的推理结果,判定该模型输入样本是否为第一类样本。S306: The decision-making device 103 determines whether the model input sample is a first-type sample according to the determined decision parameters and the inference result of the first inference model for the model input sample.
举例来说,假设决策参数具体为置信度阈值时,若第一推理模型针对该模型输入样本的推理结果的置信度大于该置信度阈值,表征第一推理模型输出的推理结果为正确的可信程度较高,也即可以视为该推理结果的准确度较高。此时,决策装置103可以确定该模型输入样本为第二类样本,并可以进一步指示第一设备集合101可以将第一推理模型输出的推理结果反馈给终端设备104。For example, assuming that the decision-making parameter is specifically a confidence threshold, if the confidence of the first inference model’s inference result for the input sample of the model is greater than the confidence threshold, it means that the inference result output by the first inference model is correct and credible A higher degree means that it can be regarded as a higher accuracy of the inference result. At this time, the decision-making device 103 may determine that the model input sample is a second-type sample, and may further instruct the first set of devices 101 to feed back the inference result output by the first inference model to the terminal device 104 .
反之,若第一推理模型针对该模型输入样本的推理结果的置信度小于该置信度阈值,则可以视为第一推理模型输出的推理结果不准确。此时,决策装置103可以判定该模型输入样本为第一类样本,并可以进一步指示第一设备集合101将该模型输入样本发送至第二设备集合102,以便利用第二设备集合102上的规格更大的第二推理模型对该模型输入样本进行更加准确的推理。Conversely, if the confidence of the inference result of the first inference model for the model input sample is less than the confidence threshold, it may be considered that the inference result output by the first inference model is inaccurate. At this time, the decision-making unit 103 can determine that the model input sample is a first-type sample, and can further instruct the first device set 101 to send the model input sample to the second device set 102, so as to utilize the specification on the second device set 102 The larger second inference model makes more accurate inferences on the model input samples.
需要说明的是,当决策装置103中的决策参数根据第一类样本以及第二类样本进行确定时,在其它可能的实施例中,决策装置103根据已确定的决策参数以及第一推理模型当前所推理的模型输入样本,判定该模型输入样本是否为第一类样本。It should be noted that when the decision-making parameters in the decision-making device 103 are determined according to the samples of the first type and the samples of the second type, in other possible embodiments, the decision-making device 103 determines the current The inferred model input sample is used to determine whether the model input sample is a first-type sample.
S307:决策装置103在确定模型输入样本为第一类样本的情况下,指示第一设备集合101 将模型输入样本上传至第二设备集合102。S307: The decision-making unit 103 instructs the first set of devices 101 to upload the model input sample to the second set of devices 102 when determining that the model input sample is a first-type sample.
S308:第一设备集合101将模型输入样本发送给第二设备集合102。S308: The first set of devices 101 sends the model input samples to the second set of devices 102 .
S309:第二设备集合102利用预先配置的规格较大的第二推理模型对接收到的模型输入样本进行推理,得到推理结果。S309: The second set of devices 102 performs inference on the received model input samples by using a pre-configured second inference model with a larger specification to obtain an inference result.
S310:第二设备集合102将推理结果发送给终端设备104。S310: The second set of devices 102 sends the inference result to the terminal device 104.
值得注意的是,本实施例是以第一设备集合101部署于边缘网络、第二设备集合102部署于云端为例进行示例性说明,在其它实现方式中,第一设备集合101也可以部署于本地网络,而第二设备集合102部署于边缘网络,此时,推理系统200对于模型输入样本的推理过程以及更新置信度阈值与模型的过程,与上述过程类似,具体可参见前述实施例的相关之处描述,在此不做赘述。It should be noted that this embodiment uses the first set of devices 101 deployed on the edge network and the second set of devices 102 deployed on the cloud as an example for illustration. In other implementations, the first set of devices 101 may also be deployed on The local network, while the second set of devices 102 is deployed on the edge network. At this time, the inference process of the inference system 200 on the input samples of the model and the process of updating the confidence threshold and the model are similar to the above-mentioned process. For details, please refer to the related described here, and will not be repeated here.
上述各实施例中,配置决策装置的过程中所涉及到的配置装置105可以以单独的硬件设备实现,而在其它可能的实现方式中,其也可以是配置于计算机设备上的软件,并且,通过在计算机设备上运行该软件,可以使得计算机设备分别实现上述配置装置105所具有的功能。下面,基于硬件设备实现的角度,对配置决策装置的过程中所涉及的配置装置105分别进行详细介绍。In the above-mentioned embodiments, the configuration device 105 involved in the process of configuring the decision-making device may be implemented as a separate hardware device, and in other possible implementation manners, it may also be software configured on a computer device, and, By running the software on the computer equipment, the computer equipment can respectively realize the functions of the configuration device 105 described above. In the following, based on the perspective of hardware device implementation, the configuring device 105 involved in the process of configuring the decision-making device will be introduced in detail respectively.
图5示出了一种计算机设备。图5所示的计算机设备500具体可以用于实现上述图3所示实施例中配置装置105的功能。Figure 5 shows a computer device. The computer device 500 shown in FIG. 5 can be specifically used to implement the functions of the configuration apparatus 105 in the above-mentioned embodiment shown in FIG. 3 .
计算机设备500包括总线501、处理器502、通信接口503和存储器504。处理器502、存储器504和通信接口503之间通过总线501通信。总线501可以是外设部件互连标准(peripheral component interconnect,PCI)总线或扩展工业标准结构(extended industry standard architecture,EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示,图5中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。通信接口503用于与外部通信,例如指示第一设备集合利用第一推理模型进行推理等。The computer device 500 includes a bus 501 , a processor 502 , a communication interface 503 and a memory 504 . The processor 502 , the memory 504 and the communication interface 503 communicate through the bus 501 . The bus 501 may be a peripheral component interconnect standard (peripheral component interconnect, PCI) bus or an extended industry standard architecture (extended industry standard architecture, EISA) bus, etc. The bus can be divided into address bus, data bus, control bus and so on. For ease of representation, only one thick line is used in FIG. 5 , but it does not mean that there is only one bus or one type of bus. The communication interface 503 is used for communicating with the outside, for example, instructing the first set of devices to use the first reasoning model to perform reasoning and the like.
其中,处理器502可以为中央处理器(central processing unit,CPU)。存储器504可以包括易失性存储器(volatile memory),例如随机存取存储器(random access memory,RAM)。存储器504还可以包括非易失性存储器(non-volatile memory),例如只读存储器(read-only memory,ROM),快闪存储器,HDD或SSD。Wherein, the processor 502 may be a central processing unit (central processing unit, CPU). The memory 504 may include a volatile memory (volatile memory), such as a random access memory (random access memory, RAM). The memory 504 may also include a non-volatile memory (non-volatile memory), such as a read-only memory (read-only memory, ROM), flash memory, HDD or SSD.
存储器504中存储有可执行代码,处理器502执行该可执行代码以执行前述配置装置105所执行的方法。Executable codes are stored in the memory 504 , and the processor 502 executes the executable codes to execute the method executed by the aforementioned configuration device 105 .
具体地,在实现图3所示实施例的情况下,且图3所示实施例中所描述的配置装置105为通过软件实现的情况下,执行图3中的配置装置105的功能所需的软件或程序代码存储在存储器504中,配置装置105与其它设备的交互通过通信接口503实现,处理器用于执行存储器504中的指令,实现配置装置105所执行的方法。Specifically, in the case of implementing the embodiment shown in FIG. 3, and the configuration device 105 described in the embodiment shown in FIG. Software or program codes are stored in the memory 504 , the interaction between the configuration device 105 and other devices is realized through the communication interface 503 , and the processor is used to execute the instructions in the memory 504 to realize the method executed by the configuration device 105 .
图6示出了另一种计算设备,图6所示的计算机设备600包括总线601、处理器602、通信接口603和存储器604。处理器602、存储器604和通信接口603之间通过总线601通信。总线601可以是PCI总线或EISA总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示,图6中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。通信接口 603用于与外部通信,例如指示第一设备集合利用第一推理模型进行推理等。FIG. 6 shows another computing device. The computer device 600 shown in FIG. 6 includes a bus 601 , a processor 602 , a communication interface 603 and a memory 604 . The processor 602 , the memory 604 and the communication interface 603 communicate through the bus 601 . The bus 601 can be a PCI bus or an EISA bus, etc. The bus can be divided into address bus, data bus, control bus and so on. For ease of representation, only one thick line is used in FIG. 6 , but it does not mean that there is only one bus or one type of bus. The communication interface 603 is used for communicating with the outside, for example, instructing the first set of devices to use the first reasoning model to perform reasoning and the like.
其中,处理器602可以为CPU。存储器604可以包括易失性存储器(volatile memory),例如RAM。存储器604还可以包括非易失性存储器(non-volatile memory),例如ROM,快闪存储器,HDD或SSD。Wherein, the processor 602 may be a CPU. Memory 604 may include volatile memory, such as RAM. The memory 604 may also include non-volatile memory (non-volatile memory), such as ROM, flash memory, HDD or SSD.
存储器604中存储有可执行代码,处理器602执行该可执行代码以执行如下步骤:Executable codes are stored in the memory 604, and the processor 602 executes the executable codes to perform the following steps:
获取第一类样本以及第二类样本,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;Acquiring samples of the first type and samples of the second type, the accuracy of inferring the samples of the first type by the first inference model is lower than the accuracy of inferring the samples of the second type by the first inference model;
根据所述第一类样本以及所述第二类样本,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。According to the first type of samples and the second type of samples, configure the decision parameters in the decision-making device, the decision parameters are used to identify the model input samples inferred by the first inference model as being transmitted to the second inference model Sample model input for the two-device collection.
此外,本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机设备上运行时,使得计算机设备执行上述实施例配置装置105所执行的方法。In addition, the embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores instructions, and when it is run on the computer equipment, the computer equipment executes the configuration device 105 of the above-mentioned embodiment. method.
此外,本申请实施例还提供了一种计算机程序产品,所述计算机程序产品被计算机执行时,所述计算机执行前述配置决策装置的方法的任一方法。该计算机程序产品可以为一个软件安装包,在需要使用前述配置决策装置的方法的任一方法的情况下,可以下载该计算机程序产品并在计算机上执行该计算机程序产品。In addition, an embodiment of the present application further provides a computer program product, and when the computer program product is executed by a computer, the computer executes any one of the aforementioned methods for configuring a decision-making device. The computer program product may be a software installation package, which may be downloaded and executed on a computer if any of the above-mentioned methods for configuring the decision-making device needs to be used.
另外需说明的是,以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。另外,本申请提供的装置实施例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。In addition, it should be noted that the device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, and the components shown as units may or may not be A physical unit can be located in one place, or it can be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. In addition, in the drawings of the device embodiments provided in the present application, the connection relationship between the modules indicates that they have communication connections, which can be specifically implemented as one or more communication buses or signal lines.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件的方式来实现,当然也可以通过专用硬件包括专用集成电路、专用CPU、专用存储器、专用元器件等来实现。一般情况下,凡由计算机程序完成的功能都可以很容易地用相应的硬件来实现,而且,用来实现同一功能的具体硬件结构也可以是多种多样的,例如模拟电路、数字电路或专用电路等。但是,对本申请而言更多情况下软件程序实现是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在可读取的存储介质中,如计算机的软盘、U盘、移动硬盘、ROM、RAM、磁碟或者光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,训练设备,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the present application can be implemented by means of software plus necessary general-purpose hardware, and of course it can also be realized by special hardware including application-specific integrated circuits, dedicated CPUs, dedicated memories, Special components, etc. to achieve. In general, all functions completed by computer programs can be easily realized by corresponding hardware, and the specific hardware structure used to realize the same function can also be varied, such as analog circuits, digital circuits or special-purpose circuit etc. However, for this application, software program implementation is a better implementation mode in most cases. Based on this understanding, the essence of the technical solution of this application or the part that contributes to the prior art can be embodied in the form of a software product, and the computer software product is stored in a readable storage medium, such as a floppy disk of a computer , U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk, etc., including several instructions to make a computer device (which can be a personal computer, training device, or network device, etc.) execute the instructions described in various embodiments of the present application method.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product.
所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储 在计算机可读存储介质中,或者从一个计算机可读存储介质向另一计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、训练设备或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、训练设备或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存储的任何可用介质或者是包含一个或多个可用介质集成的训练设备、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(Solid State Disk,SSD))等。The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application will be generated in whole or in part. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transferred from a website, computer, training device, or data The center transmits to another website site, computer, training device or data center via wired (eg, coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be stored by a computer, or a data storage device such as a training device or a data center integrated with one or more available media. The available medium may be a magnetic medium (such as a floppy disk, a hard disk, or a magnetic tape), an optical medium (such as a DVD), or a semiconductor medium (such as a solid state disk (Solid State Disk, SSD)), etc.

Claims (20)

  1. 一种配置决策装置的方法,其特征在于,所述方法应用于推理系统,所述推理系统包括第一设备集合、第二设备集合以及决策装置,所述第一设备集合以及所述第二设备集合均包括至少一个计算设备,所述第一设备集合中的第一推理模型的规格小于所述第二设备集合中的第二推理模型的规格,所述方法包括:A method for configuring a decision-making device, characterized in that the method is applied to a reasoning system, the reasoning system includes a first device set, a second device set, and a decision-making device, the first device set and the second device The sets each include at least one computing device, the size of the first inference model in the first set of devices is smaller than the size of the second inference model in the second set of devices, the method comprising:
    获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;Acquiring a first inference result corresponding to the first type of sample and a second inference result corresponding to the second type of sample, the accuracy of the first inference model inferring the first type of sample is lower than that of the first inference model inferring the first inference result The precision of the second class sample;
    根据所述第一推理结果以及所述第二推理结果,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。According to the first inference result and the second inference result, configure the decision parameters in the decision-making device, the decision parameters are used to identify the model input samples inferred by the first inference model as being transmitted to the second inference model Sample model input for the two-device collection.
  2. 根据权利要求1所述的方法,其特征在于,所述获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,包括:The method according to claim 1, wherein said acquiring the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample comprises:
    指示所述第一设备集合利用所述第一推理模型分别对所述第一类样本以及所述第二类样本进行推理,得到所述第一类样本对应的第一推理结果以及所述第二类样本对应的第二推理结果。Instructing the first set of devices to use the first inference model to perform inference on the first type of samples and the second type of samples respectively, to obtain a first inference result corresponding to the first type of samples and the second The second inference result corresponding to the class sample.
  3. 根据权利要求1所述的方法,其特征在于,所述获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,包括:The method according to claim 1, wherein said acquiring the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample comprises:
    指示所述第一设备集合利用所述第一推理模型对所述第一类样本进行推理,得到所述第一类样本对应的第一推理结果;Instructing the first set of devices to use the first inference model to perform inference on the first type of samples, and obtain a first inference result corresponding to the first type of samples;
    指示所述第二设备集合利用所述第二推理模型对所述第二类样本进行推理,得到所述第二类样本对应的第二推理结果。Instructing the second set of devices to use the second inference model to perform inference on the second type of samples to obtain a second inference result corresponding to the second type of samples.
  4. 根据权利要求1至3任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1 to 3, wherein the method further comprises:
    获取多个样本;Get multiple samples;
    获取所述多个样本对应的推理结果,所述多个样本对应的推理结果通过所述第一推理模型分别对所述多个样本进行推理得到;Acquiring inference results corresponding to the multiple samples, where the inference results corresponding to the multiple samples are respectively obtained by inferring the multiple samples through the first inference model;
    根据所述多个样本对应的推理结果,确定所述多个样本中的第一类样本以及第二类样本。According to the inference results corresponding to the multiple samples, the first type of samples and the second type of samples among the multiple samples are determined.
  5. 根据权利要求4所述的方法,其特征在于,所述根据所述多个样本对应的推理结果,确定所述多个样本中的第一类样本以及第二类样本,包括:The method according to claim 4, wherein said determining the first type of samples and the second type of samples among the plurality of samples according to the inference results corresponding to the plurality of samples includes:
    呈现标注界面,所述标注界面包括所述多个样本对应的推理结果;Presenting an annotation interface, the annotation interface including inference results corresponding to the plurality of samples;
    根据针对所述多个样本对应的推理结果的标注操作,确定所述多个样本中的第一类样本以及第二类样本。According to the labeling operation on the inference results corresponding to the multiple samples, the first type of samples and the second type of samples among the multiple samples are determined.
  6. 根据权利要求1至5任一项所述的方法,其特征在于,在获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果之前,所述方法还包括:The method according to any one of claims 1 to 5, wherein, before acquiring the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, the method further includes:
    为所述第一设备集合配置所述第一推理模型,以及为所述第二设备集合配置所述第二推理模型,所述第一推理模型是通过对所述第二推理模型进行模型压缩得到。configuring the first inference model for the first set of devices, and configuring the second inference model for the second set of devices, the first inference model being obtained by performing model compression on the second inference model .
  7. 根据权利要求1至6任一项所述的方法,其特征在于,所述第一设备集合部署于本地网络,所述第二设备集合部署于边缘网络;The method according to any one of claims 1 to 6, wherein the first set of devices is deployed on a local network, and the second set of devices is deployed on an edge network;
    或,所述第一设备集合部署于边缘网络,所述第二设备集合部署于云端。Or, the first set of devices is deployed on the edge network, and the second set of devices is deployed on the cloud.
  8. 一种配置决策装置的方法,其特征在于,所述方法应用于推理系统,所述推理系统包括第一设备集合、第二设备集合以及决策装置,所述第一设备集合以及所述第二设备集合均包括至少一个计算设备,所述第一设备集合中的第一推理模型的规格小于所述第二设备集合中的第二推理模型的规格,所述方法包括:A method for configuring a decision-making device, characterized in that the method is applied to a reasoning system, the reasoning system includes a first device set, a second device set, and a decision-making device, the first device set and the second device The sets each include at least one computing device, the size of the first inference model in the first set of devices is smaller than the size of the second inference model in the second set of devices, the method comprising:
    获取第一类样本以及第二类样本,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;Acquiring samples of the first type and samples of the second type, the accuracy of inferring the samples of the first type by the first inference model is lower than the accuracy of inferring the samples of the second type by the first inference model;
    根据所述第一类样本以及所述第二类样本,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。According to the first type of samples and the second type of samples, configure the decision parameters in the decision-making device, the decision parameters are used to identify the model input samples inferred by the first inference model as being transmitted to the second inference model Sample model input for the two-device collection.
  9. 一种配置装置,其特征在于,所述配置装置应用于推理系统,所述推理系统包括第一设备集合、第二设备集合以及决策装置,所述第一设备集合以及所述第二设备集合均包括至少一个计算设备,所述第一设备集合中的第一推理模型的规格小于所述第二设备集合中的第二推理模型的规格,所述配置装置包括:A configuration device, characterized in that the configuration device is applied to a reasoning system, and the reasoning system includes a first device set, a second device set, and a decision-making device, and the first device set and the second device set are both Including at least one computing device, the specification of the first inference model in the first set of devices is smaller than the specification of the second inference model in the second set of devices, the configuration means includes:
    推理结果获取模块,用于获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;An inference result acquisition module, configured to acquire a first inference result corresponding to a first type of sample and a second inference result corresponding to a second type of sample, the accuracy of the first inference model inferring the first type of sample is lower than that of the first type an inference model infers the accuracy of the second type of samples;
    配置模块,用于根据所述第一推理结果以及所述第二推理结果,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。A configuration module, configured to configure a decision parameter in the decision device according to the first reasoning result and the second reasoning result, the decision parameter is used to identify a model input sample reasoned by the first reasoning model as Model input samples transmitted to the second set of devices.
  10. 根据权利要求9所述的配置装置,其特征在于,所述推理结果获取模块,具体用于:The configuration device according to claim 9, wherein the inference result acquisition module is specifically used for:
    指示所述第一设备集合利用所述第一推理模型分别对所述第一类样本以及所述第二类样本进行推理,得到所述第一类样本对应的第一推理结果以及所述第二类样本对应的第二推理结果。Instructing the first set of devices to use the first inference model to perform inference on the first type of samples and the second type of samples respectively, to obtain a first inference result corresponding to the first type of samples and the second The second inference result corresponding to the class sample.
  11. 根据权利要求9所述的配置装置,其特征在于,所述推理结果获取模块,具体用于:The configuration device according to claim 9, wherein the inference result acquisition module is specifically used for:
    指示所述第一设备集合利用所述第一推理模型对所述第一类样本进行推理,得到所述第一类样本对应的第一推理结果;Instructing the first set of devices to use the first inference model to perform inference on the first type of samples, and obtain a first inference result corresponding to the first type of samples;
    指示所述第二设备集合利用所述第二推理模型对所述第二类样本进行推理,得到所述第二类样本对应的第二推理结果。Instructing the second set of devices to use the second inference model to perform inference on the second type of samples to obtain a second inference result corresponding to the second type of samples.
  12. 根据权利要求9至11任一项所述的配置装置,其特征在于,所述推理结果获取模块,还用于:The configuration device according to any one of claims 9 to 11, wherein the inference result acquisition module is also used for:
    获取多个样本;Get multiple samples;
    获取所述多个样本对应的推理结果,所述多个样本对应的推理结果通过所述第一推理模型分别对所述多个样本进行推理得到;Acquiring inference results corresponding to the multiple samples, where the inference results corresponding to the multiple samples are respectively obtained by inferring the multiple samples through the first inference model;
    根据所述多个样本对应的推理结果,确定所述多个样本中的第一类样本以及第二类样本。According to the inference results corresponding to the multiple samples, the first type of samples and the second type of samples among the multiple samples are determined.
  13. 根据权利要求12所述的配置装置,其特征在于,所述推理结果获取模块,具体用于:The configuration device according to claim 12, wherein the inference result acquisition module is specifically used for:
    呈现标注界面,所述标注界面包括所述多个样本对应的推理结果;Presenting an annotation interface, the annotation interface including inference results corresponding to the plurality of samples;
    根据针对所述多个样本对应的推理结果的标注操作,确定所述多个样本中的第一类样本以及第二类样本。According to the labeling operation on the inference results corresponding to the multiple samples, the first type of samples and the second type of samples among the multiple samples are determined.
  14. 根据权利要求9至13任一项所述的配置装置,其特征在于,在获取第一类样本对应的第一推理结果以及第二类样本对应的第二推理结果之前,所述配置装置还用于:The configuration device according to any one of claims 9 to 13, characterized in that, before obtaining the first inference result corresponding to the first type of sample and the second inference result corresponding to the second type of sample, the configuration device also uses At:
    为所述第一设备集合配置所述第一推理模型,以及为所述第二设备集合配置所述第二推理模型,所述第一推理模型是通过对所述第二推理模型进行模型压缩得到。configuring the first inference model for the first set of devices, and configuring the second inference model for the second set of devices, the first inference model being obtained by performing model compression on the second inference model .
  15. 根据权利要求9至14任一项所述的配置装置,其特征在于,所述第一设备集合部署于本地网络,所述第二设备集合部署于边缘网络;The configuration device according to any one of claims 9 to 14, wherein the first set of devices is deployed on a local network, and the second set of devices is deployed on an edge network;
    或,所述第一设备集合部署于边缘网络,所述第二设备集合部署于云端。Or, the first set of devices is deployed on the edge network, and the second set of devices is deployed on the cloud.
  16. 一种配置装置,其特征在于,所述配置装置应用于推理系统,所述推理系统包括第一设备集合、第二设备集合以及决策装置,所述第一设备集合以及所述第二设备集合均包括至少一个计算设备,所述第一设备集合中的第一推理模型的规格小于所述第二设备集合中的第二推理模型的规格,所述配置装置包括:A configuration device, characterized in that the configuration device is applied to a reasoning system, and the reasoning system includes a first device set, a second device set, and a decision-making device, and the first device set and the second device set are both Including at least one computing device, the specification of the first inference model in the first set of devices is smaller than the specification of the second inference model in the second set of devices, the configuration means includes:
    样本获取模块,用于获取第一类样本以及第二类样本,所述第一推理模型推理所述第一类样本的精度低于所述第一推理模型推理所述第二类样本的精度;A sample acquisition module, configured to acquire a first type of sample and a second type of sample, the accuracy of the first inference model inferring the first type of sample is lower than the accuracy of the first inference model inference of the second type of sample;
    配置模块,用于根据所述第一类样本以及所述第二类样本,配置所述决策装置中的决策参数,所述决策参数用于将所述第一推理模型推理的模型输入样本识别为传输给所述第二设备集合的模型输入样本。A configuration module, configured to configure decision parameters in the decision-making device according to the first type of samples and the second type of samples, and the decision parameters are used to identify model input samples inferred by the first inference model as Model input samples transmitted to the second set of devices.
  17. 一种计算机设备,其特征在于,所述计算机设备包括处理器和存储器;A computer device, characterized in that the computer device includes a processor and a memory;
    所述处理器用于执行所述存储器中存储的指令,以使得所述计算机设备执行权利要求1至7中任一项所述的方法。The processor is configured to execute instructions stored in the memory, so that the computer device performs the method of any one of claims 1-7.
  18. 一种计算机设备,其特征在于,所述计算机设备包括处理器和存储器;A computer device, characterized in that the computer device includes a processor and a memory;
    所述处理器用于执行所述存储器中存储的指令,以使得所述计算机设备执行权利要求8所述的方法。The processor is configured to execute instructions stored in the memory to cause the computer device to perform the method of claim 8 .
  19. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有指令,当其在计算设备上运行时,使得所述计算设备执行如权利要求1至7中任一项所述的方法。A computer-readable storage medium, characterized in that instructions are stored in the computer-readable storage medium, and when the computer-readable storage medium is run on a computing device, the computing device executes the described method.
  20. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有指令,当其在计算设备上运行时,使得所述计算设备执行如权利要求8所述的方法。A computer-readable storage medium, wherein instructions are stored in the computer-readable storage medium, and when the computer-readable storage medium is run on a computing device, the computing device executes the method as claimed in claim 8 .
PCT/CN2022/091969 2021-08-25 2022-05-10 Method and apparatus for configuring decision apparatus, and related device WO2023024578A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110981922.5A CN115730205A (en) 2021-08-25 2021-08-25 Method and device for configuring decision device and related equipment
CN202110981922.5 2021-08-25

Publications (1)

Publication Number Publication Date
WO2023024578A1 true WO2023024578A1 (en) 2023-03-02

Family

ID=85289657

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/091969 WO2023024578A1 (en) 2021-08-25 2022-05-10 Method and apparatus for configuring decision apparatus, and related device

Country Status (2)

Country Link
CN (1) CN115730205A (en)
WO (1) WO2023024578A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116819964A (en) * 2023-06-20 2023-09-29 小米汽车科技有限公司 Model optimization method, model optimization device, electronic device, vehicle and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232528A (en) * 2020-12-15 2021-01-15 之江实验室 Method and device for training federated learning model and federated learning system
US20210173831A1 (en) * 2015-10-28 2021-06-10 Qomplx, Inc. System and method for analysis of graph databases using intelligent reasoning systems
CN112990211A (en) * 2021-01-29 2021-06-18 华为技术有限公司 Neural network training method, image processing method and device
CN113067873A (en) * 2021-03-19 2021-07-02 北京邮电大学 Edge cloud collaborative optimization method based on deep reinforcement learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210173831A1 (en) * 2015-10-28 2021-06-10 Qomplx, Inc. System and method for analysis of graph databases using intelligent reasoning systems
CN112232528A (en) * 2020-12-15 2021-01-15 之江实验室 Method and device for training federated learning model and federated learning system
CN112990211A (en) * 2021-01-29 2021-06-18 华为技术有限公司 Neural network training method, image processing method and device
CN113067873A (en) * 2021-03-19 2021-07-02 北京邮电大学 Edge cloud collaborative optimization method based on deep reinforcement learning

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116819964A (en) * 2023-06-20 2023-09-29 小米汽车科技有限公司 Model optimization method, model optimization device, electronic device, vehicle and medium
CN116819964B (en) * 2023-06-20 2024-02-06 小米汽车科技有限公司 Model optimization method, model optimization device, electronic device, vehicle and medium

Also Published As

Publication number Publication date
CN115730205A (en) 2023-03-03

Similar Documents

Publication Publication Date Title
US11593703B2 (en) System and method for large-scale multi-label learning using incomplete label assignments
US20190279088A1 (en) Training method, apparatus, chip, and system for neural network model
CN106228175B (en) Horizontal decision tree learning from very high rate data streams
CN109308490B (en) Method and apparatus for generating information
WO2019169700A1 (en) Data classification method and device, equipment, and computer readable storage medium
CN111460155B (en) Knowledge graph-based information credibility assessment method and device
WO2020253127A1 (en) Facial feature extraction model training method and apparatus, facial feature extraction method and apparatus, device, and storage medium
US8769677B2 (en) System and method for spammer host detection from network flow data profiles
CN111178410A (en) Illegal picture identification method and device
WO2022179138A1 (en) Image processing method and apparatus, and computer device and storage medium
JP7480811B2 (en) Method of sample analysis, electronic device, computer readable storage medium, and computer program product
JP2020042330A (en) Information processing apparatus, data classification method and program
WO2022001918A1 (en) Method and apparatus for building predictive model, computing device, and storage medium
WO2023024578A1 (en) Method and apparatus for configuring decision apparatus, and related device
WO2021104444A1 (en) Data flow classification method, apparatus and system
US10885593B2 (en) Hybrid classification system
CN113901991A (en) 3D point cloud data semi-automatic labeling method and device based on pseudo label
US20220415019A1 (en) Method and apparatus for image classification
US10867255B2 (en) Efficient annotation of large sample group
WO2018166499A1 (en) Text classification method and device, and storage medium
WO2023241385A1 (en) Model transferring method and apparatus, and electronic device
US10969964B2 (en) Method, device, and computer program product for parallel data transmission using machine learning to determine number of concurrencies and data slice size
US20210073686A1 (en) Self-structured machine learning classifiers
WO2022227169A1 (en) Image classification method and apparatus, and electronic device and storage medium
CN110826616B (en) Information processing method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE