WO2023134181A1 - Resource allocation method, apparatus and system based on federated learning - Google Patents

Resource allocation method, apparatus and system based on federated learning Download PDF

Info

Publication number
WO2023134181A1
WO2023134181A1 PCT/CN2022/117168 CN2022117168W WO2023134181A1 WO 2023134181 A1 WO2023134181 A1 WO 2023134181A1 CN 2022117168 W CN2022117168 W CN 2022117168W WO 2023134181 A1 WO2023134181 A1 WO 2023134181A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
information
training
contributors
resource
Prior art date
Application number
PCT/CN2022/117168
Other languages
French (fr)
Chinese (zh)
Inventor
刘嘉
李增祥
Original Assignee
新智我来网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202210050167.3A external-priority patent/CN116502811A/en
Priority claimed from CN202210050177.7A external-priority patent/CN116502812A/en
Application filed by 新智我来网络科技有限公司 filed Critical 新智我来网络科技有限公司
Publication of WO2023134181A1 publication Critical patent/WO2023134181A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling

Abstract

The present disclosure provides a resource allocation method, apparatus and system based on federated learning. The method comprises: reading preset resource allocation configuration information; obtaining model demand information provided by a plurality of model demanders, the resource allocation configuration information comprising attribute configuration information, contribution degree configuration information and monitoring configuration information, and determining a target demander according to the attribute configuration information and the model demand information; determining a plurality of target resource contributors matched with the model demand information, and obtaining model resources of each target resource contributor; and determining an allocation value corresponding to each target resource contributor according to the attribute configuration information, the contribution degree configuration information, the monitoring configuration information and the model resources. According to the present disclosure, each target resource contributor can obtain the allocation value matched with the model resources provided by the target resource contributor, the enthusiasm of the resource contributor to provide model resources is stimulated, and long-term good sustainable development of federated learning is facilitated.

Description

一种基于联合学习的资源分配方法、装置及系统A resource allocation method, device and system based on joint learning 技术领域technical field
本公开涉及机器学习技术领域,尤其涉及一种基于联合学习的资源分配方法、装置及系统。The present disclosure relates to the technical field of machine learning, and in particular to a resource allocation method, device and system based on joint learning.
背景技术Background technique
在隐私和数据保护问题日益受到重视的大环境下,联合学习已经成为人工智能领域中的一个非常热门的研究方向。联合学习是指在确保数据安全及用户隐私的前提下,综合利用多种AI(Artificial Intelligence,人工智能)技术,联合多方合作共同挖掘数据价值,催生基于联合建模的新的智能业态和模式。In the context of increasing attention to privacy and data protection issues, federated learning has become a very popular research direction in the field of artificial intelligence. Federated learning refers to the comprehensive utilization of various AI (Artificial Intelligence, artificial intelligence) technologies on the premise of ensuring data security and user privacy, and joint multi-party cooperation to jointly mine data value and generate new intelligent business models and models based on joint modeling.
对于联合学习而言,参与方持续地参与到联合学习进程(例如,通过共享加密的模型参数)是其长期成功的关键所在。然而,在现有技术中,联合学习的评估奖励机制大多是采用平均分配的方式,难免会存在分配不合理、不公平等问题,这就容易因激励分配不公平等原因而流失掉部分拥有充分且有效数据的参与方,这无疑将会不利于联合学习的长期良好可持续发展。For federated learning, the continuous participation of participants in the federated learning process (for example, by sharing encrypted model parameters) is the key to its long-term success. However, in the existing technology, most of the evaluation and reward mechanisms of joint learning adopt the method of equal distribution, which inevitably has problems such as unreasonable and unfair distribution. Participants without valid data will undoubtedly be detrimental to the long-term sustainable development of joint learning.
发明内容Contents of the invention
有鉴于此,本公开实施例提供了一种基于联合学习的资源分配方法、装置及系统,以解决现有技术中联合学习的评估奖励机制大多是采用平均分配的方式,难免会存在分配不合理、不公平等问题,容易导致因激励分配不公平等原因而流失掉部分拥有充分且有效数据的参与方,不利于联合学习的长期良好可持续发展的问题。In view of this, the embodiments of the present disclosure provide a resource allocation method, device, and system based on joint learning to solve the problem that most of the evaluation and reward mechanisms of joint learning in the prior art adopt the method of average distribution, which inevitably leads to unreasonable allocation. , unfairness and other issues may easily lead to the loss of some participants with sufficient and valid data due to unfair incentive distribution and other reasons, which is not conducive to the long-term sustainable development of joint learning.
本公开实施例的第一方面,提供了一种基于联合学习的资源分配方法,包括:The first aspect of the embodiments of the present disclosure provides a resource allocation method based on joint learning, including:
读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;Read preset resource allocation configuration information, resource allocation configuration information includes attribute configuration information, contribution configuration information and monitoring configuration information;
获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;Obtain model demand information provided by multiple model demand parties, and determine the target demand party according to attribute configuration information and model demand information;
确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;Determine multiple target resource contributors that match the model requirement information, and obtain the model resources of each target resource contributor. The model resources include model parameters and effective training data volume;
根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方。According to attribute configuration information, contribution degree configuration information, monitoring configuration information and model resources, determine the distribution value corresponding to each target resource contributor, and feed back the distribution value to each target resource contributor.
本公开实施例的第二方面,提供了一种基于联合学习的资源分配装置,包括:The second aspect of the embodiments of the present disclosure provides a resource allocation device based on joint learning, including:
读取模块,被配置为读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;The reading module is configured to read preset resource allocation configuration information, and the resource allocation configuration information includes attribute configuration information, contribution degree configuration information and monitoring configuration information;
需求方确定模块,被配置为获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;The demander determination module is configured to obtain model demand information provided by multiple model demand parties, and determine the target demand party according to attribute configuration information and model demand information;
资源获取模块,被配置为确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;The resource acquisition module is configured to determine multiple target resource contributors that match the model requirement information, and acquire model resources of each target resource contributor, where the model resources include model parameters and effective training data volume;
分配模块,被配置为根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方。The allocation module is configured to determine the allocation value corresponding to each target resource contributor according to attribute configuration information, contribution configuration information, monitoring configuration information and model resources, and feed back the allocation value to each target resource contributor.
本公开实施例的第三方面,提供了一种基于联合学习的资源分配系统,包括协调中心,分别与协调中心通信连接的贡献方传输模块和需求方传输模块;The third aspect of the embodiments of the present disclosure provides a resource allocation system based on joint learning, including a coordination center, a contributor transmission module and a demand transmission module respectively connected to the coordination center in communication;
需求方传输模块,被配置为按照预设的时间步长,向协调中心发送模型需求信息,模型需求信息包括需求模型;The demand-side transmission module is configured to send model demand information to the coordination center according to a preset time step, where the model demand information includes a demand model;
贡献方传输模块,被配置为当接收到模型训练信息,且确定参与模型训练时,向协调中心发送应邀申请;The contributor transmission module is configured to send an invitation application to the coordination center when the model training information is received and it is determined to participate in the model training;
协调中心,被配置为根据模型需求信息生成模型训练信息,并广播模型训练信息,模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;The coordination center is configured to generate model training information according to the model requirement information, and broadcast the model training information. The model training information includes the preset basic model, training sample type, sample size required for each round and participation strategy;
根据应邀申请锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个目标贡献方使用其训练资源对基本模型进行训练,直至满足预设的结束条件,得到全局模型,计算各个目标贡献方应得的贡献分配资源,将贡献分配资源反馈给相应的各个目标贡献方。According to the invited application, the training resources of at least two target contributors are locked, and the preset training program is started, so that each target contributor uses its training resources to train the basic model until the preset end conditions are met, and the global model is obtained. The contribution allocation resources that the target contributors should deserve, and the contribution allocation resources are fed back to the corresponding target contributors.
本公开实施例的第四方面,提供了一种基于联合学习的资源分配系统的资源分配方法,包括:According to the fourth aspect of the embodiments of the present disclosure, a resource allocation method of a joint learning-based resource allocation system is provided, including:
接收需求方发送的模型需求信息;其中,需求方为多个参与方之一;Receive model demand information sent by the demander; where the demander is one of the multiple participants;
根据模型需求信息生成模型训练信息,并广播模型训练信息,模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;Generate model training information according to the model requirement information, and broadcast the model training information. The model training information includes the preset basic model, training sample type, sample size required for each round and participation strategy;
确定多个参与方中的贡献方;Identify contributors among multiple parties;
响应于贡献方发送的基于模型训练信息确定参与模型训练的消息;Responding to the message sent by the contributor to determine participation in model training based on the model training information;
根据消息锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个目标贡献方使用其训练资源对基本模型进行训练,直至满足预设的结束条件,聚合各个目标贡献方提供的模型参数,得到全局模型,计算各个目标贡献方应得的贡献分配资源,将贡献分配资源反馈给相应的各个目标贡献方。Lock the training resources of at least two target contributors according to the message, start the preset training program, make each target contributor use its training resources to train the basic model, until the preset end condition is met, and aggregate the training resources provided by each target contributor Model parameters, get the global model, calculate the contribution allocation resources that each target contributor should get, and feed back the contribution allocation resources to the corresponding target contributors.
本公开实施例的第五方面,提供了一种电子设备,包括存储器、处理器以及存储在存储器中并且可在处理器上运行的计算机程序,该处理器执行计算机程序时实现上述方法的步骤。A fifth aspect of the embodiments of the present disclosure provides an electronic device, including a memory, a processor, and a computer program stored in the memory and operable on the processor. The processor implements the steps of the above method when executing the computer program.
本公开实施例的第六方面,提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,该计算机程序被处理器执行时实现上述方法的步骤。A sixth aspect of the embodiments of the present disclosure provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the foregoing method are implemented.
本公开实施例与现有技术相比,有益效果至少包括:本公开实施例通过读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方,充分考虑到各个目标资源贡献方的对模型需求方所需要的模型的效用的贡献度等方面因素,使得各个目标资源贡献方能够得到与其所提供的模型资源相匹配的分配值,不仅可激发资源贡献方提供模型资源的积极性,还可以进一步约束资源贡献方提供真实、有效的模型参数,以利于联合学习的长期良好可持续发展。Compared with the prior art, the beneficial effects of the embodiments of the present disclosure at least include: the embodiments of the present disclosure read preset resource allocation configuration information, and the resource allocation configuration information includes attribute configuration information, contribution degree configuration information, and monitoring configuration information; Based on the model demand information provided by multiple model demand parties, the target demand party is determined according to the attribute configuration information and model demand information; multiple target resource contributors that match the model demand information are determined, and the model resources of each target resource contributor are obtained. Model resources include model parameters and effective training data volume; according to attribute configuration information, contribution configuration information, monitoring configuration information and model resources, determine the allocation value corresponding to each target resource contributor, and feed back the allocation value to each target resource contribution fully consider the contribution of each target resource contributor to the utility of the model required by the model demander, so that each target resource contributor can obtain an allocation value that matches the model resources it provides, not only can Stimulating the enthusiasm of resource contributors to provide model resources can further constrain resource contributors to provide real and effective model parameters, so as to facilitate the long-term sustainable development of joint learning.
附图说明Description of drawings
为了更清楚地说明本公开实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本公开的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present disclosure, the following will briefly introduce the drawings that need to be used in the embodiments or the description of the prior art. Obviously, the drawings in the following description are only of the present disclosure For some embodiments, those skilled in the art can also obtain other drawings based on these drawings without creative efforts.
图1是本公开实施例的一种联合学习的架构示意图;FIG. 1 is a schematic diagram of a joint learning architecture according to an embodiment of the present disclosure;
图2是本公开实施例提供的一种基于联合学习的资源分配方法的流程示意图;FIG. 2 is a schematic flowchart of a resource allocation method based on joint learning provided by an embodiment of the present disclosure;
图3是本公开实施例提供的一种基于联合学习的资源分配装置的示意图;Fig. 3 is a schematic diagram of a resource allocation device based on joint learning provided by an embodiment of the present disclosure;
图4是本公开实施例的一种基于联合学习的资源分配系统的结构示意图;FIG. 4 is a schematic structural diagram of a resource allocation system based on joint learning according to an embodiment of the present disclosure;
图5是本公开实施例提供的另一种基于联合学习的资源分配方法的时序图;FIG. 5 is a sequence diagram of another resource allocation method based on joint learning provided by an embodiment of the present disclosure;
图6是本公开实施例提供的另一种基于联合学习的资源分配方法的流程示意图。Fig. 6 is a schematic flowchart of another resource allocation method based on joint learning provided by an embodiment of the present disclosure.
图7是本公开实施例提供的一种电子设备的结构示意图。Fig. 7 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细 节,以便透彻理解本公开实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本公开。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本公开的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as specific system structures and techniques are presented for a thorough understanding of the embodiments of the present disclosure. It will be apparent, however, to one skilled in the art that the present disclosure may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present disclosure with unnecessary detail.
联合学习是指在确保数据安全及用户隐私的前提下,综合利用多种AI(Artificial Intelligence,人工智能)技术,联合多方合作共同挖掘数据价值,催生基于联合建模的新的智能业态和模式。联合学习至少具有以下特点:Federated learning refers to the comprehensive utilization of various AI (Artificial Intelligence, artificial intelligence) technologies on the premise of ensuring data security and user privacy, and joint multi-party cooperation to jointly mine data value and generate new intelligent business models and models based on joint modeling. Federated learning has at least the following characteristics:
(1)参与节点控制自有数据的弱中心化联合训练模式,确保共创智能过程中的数据隐私安全。(1) Participating nodes control the weakly centralized joint training mode of their own data to ensure data privacy and security in the process of co-creating intelligence.
(2)在不同应用场景下,利用筛选和/或组合AI算法、隐私保护计算,建立多种模型聚合优化策略,以获取高层次、高质量的模型。(2) In different application scenarios, use screening and/or combining AI algorithms and privacy-preserving calculations to establish multiple model aggregation optimization strategies to obtain high-level, high-quality models.
(3)在确保数据安全及用户隐私的前提下,基于多种模型聚合优化策略,获取提升联合学习引擎的效能方法,其中效能方法可以是通过解决包括计算架构并行、大规模跨域网络下的信息交互、智能感知、异常处理机制等,提升联合学习引擎的整体效能。(3) On the premise of ensuring data security and user privacy, based on a variety of model aggregation optimization strategies, obtain a performance method to improve the joint learning engine, where the performance method can be solved by solving problems including parallel computing architecture and large-scale cross-domain network Information interaction, intelligent perception, exception handling mechanism, etc., improve the overall performance of the joint learning engine.
(4)获取各场景下多方用户的需求,通过互信机制,确定合理评估各联合参与方的真实贡献度,进行分配激励。(4) Obtain the needs of multi-party users in each scenario, determine and reasonably evaluate the true contribution of each joint participant through the mutual trust mechanism, and distribute incentives.
基于上述方式,可以建立基于联合学习的AI技术生态,充分发挥行业数据价值,推动垂直领域的场景落地。Based on the above methods, it is possible to establish an AI technology ecology based on joint learning, give full play to the value of industry data, and promote the implementation of scenarios in vertical fields.
在本公开实施例中,联合学习的架构可以包括服务器、多个贡献方以及多个需求方,其具体结构以及作用可以根据具体需要进行调整。下面将结合附图详细说明根据本公开实施例的联合学习的资源分配方法和装置。In the embodiment of the present disclosure, the architecture of the federated learning may include a server, multiple contributors, and multiple demanders, and its specific structure and functions may be adjusted according to specific needs. The resource allocation method and device for joint learning according to the embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings.
图1是本公开实施例的联合学习的架构的一种结构示意图。如图1所示,联合学习的架构可以包括服务器101、多个贡献方102以及多个需求方103,其中服务器101为资源分配中心,贡献方102为目标资源贡献方,需求方103为模型需求方。FIG. 1 is a schematic structural diagram of a federated learning architecture according to an embodiment of the disclosure. As shown in Figure 1, the architecture of joint learning can include a server 101, multiple contributors 102, and multiple demanders 103, where the server 101 is the resource allocation center, the contributor 102 is the target resource contributor, and the demander 103 is the model demand square.
在联合学习过程中,可以预先通过人工配置该资源分配中心101在后续的仿真拍卖场景中可能会使用到的资源分配配置信息,该资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;在启动仿真拍卖程序时,资源分配中心101可读取预设的资源分配配置信息,然后,获取多个模型需求方103提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;接着,进一步确定与模型需求信息匹配的多个目标资源贡献方102,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;再根据上述属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方,从而完成整个模型资源拍卖流程。In the joint learning process, the resource allocation configuration information that may be used by the resource allocation center 101 in the subsequent simulated auction scene can be manually configured in advance. The resource allocation configuration information includes attribute configuration information, contribution configuration information and monitoring configuration. information; when starting the simulated auction program, the resource allocation center 101 can read the preset resource allocation configuration information, and then obtain the model demand information provided by multiple model demanders 103, and determine the target according to the attribute configuration information and model demand information The demander; then, further determine a plurality of target resource contributors 102 that match the model demand information, obtain the model resources of each target resource contributor, and the model resources include model parameters and effective training data volume; then configure information according to the above attributes, Contribution configuration information, monitoring configuration information and model resources, determine the allocation value corresponding to each target resource contributor, and feed back the allocation value to each target resource contributor, thus completing the entire model resource auction process.
本公开实施例充分考虑到各个目标资源贡献方的对模型需求方所需要的模型的效用的贡献度等方面因素,使得各个目标资源贡献方能够得到与其所提供的模型资源相匹配的分配值,不仅可激发资源贡献方提供模型资源的积极性,还可以进一步约束资源贡献方提供真实、有效的模型参数,以利于联合学习的长期良好可持续发展。The embodiments of the present disclosure fully consider factors such as the contribution of each target resource contributor to the utility of the model required by the model demander, so that each target resource contributor can obtain an allocation value that matches the model resources it provides, It can not only stimulate the enthusiasm of resource contributors to provide model resources, but also further constrain resource contributors to provide real and effective model parameters, so as to facilitate the long-term sustainable development of joint learning.
需要说明的是,目标资源方和模型需求方的数量和类型等可以根据实际情况具体设置,在本公开中不做具体限制。It should be noted that the number and types of target resource parties and model demand parties can be specifically set according to actual conditions, and are not specifically limited in this disclosure.
图2是本公开实施例提供的一种基于联合学习的资源分配方法的流程示意图。图2的基于联合学习的资源分配方法可以由图1的资源分配中心101执行。如图2所示,该基于联合学习的资源分配方法包括:Fig. 2 is a schematic flowchart of a resource allocation method based on joint learning provided by an embodiment of the present disclosure. The resource allocation method based on joint learning in FIG. 2 may be executed by the resource allocation center 101 in FIG. 1 . As shown in Figure 2, the resource allocation method based on joint learning includes:
步骤S201,读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息。Step S201, read preset resource allocation configuration information, the resource allocation configuration information includes attribute configuration information, contribution degree configuration information and monitoring configuration information.
作为一示例,资源分配配置信息可由人工预先配置具体的参数值得到,也可以由人工预先配置的参数范围值内随机产生。As an example, the resource allocation configuration information may be obtained by manually preconfiguring specific parameter values, or may be randomly generated within a manually preconfigured parameter range.
具体的,属性配置信息,可包括拍卖方式,拍卖方式包括但不限于首价拍卖、VCG价格拍卖等。其中,首价拍卖,其原理是出价最高者胜出。VCG价格拍卖,其原理则是计算 竞价者赢得拍卖品后,给整个竞价收入带来的收益损失,理论上这种损失就是竞价获胜者应该支付的费用。Specifically, the attribute configuration information may include auction methods, including but not limited to first-price auctions, VCG price auctions, and the like. Among them, the principle of first-price auction is that the highest bidder wins. The principle of VCG price auction is to calculate the profit loss brought to the entire bidding revenue after the bidder wins the auction item. In theory, this loss is the fee that the bid winner should pay.
贡献度配置信息,包括但不限于贡献度衡量方法,比如,按边际贡献分配(指各节点的效益是它加入团队时所产生的效用)、基于Shapely值分配(旨在排除节点以不同顺序加入集合体中所带来的影响,从而更公平地预估它们对集合体做出的贡献)。Contribution configuration information, including but not limited to contribution measurement methods, such as distribution according to marginal contribution (meaning that the benefit of each node is the utility generated when it joins the team), distribution based on Shapely value (designed to exclude nodes joining in different order impact on the aggregate, thereby more fairly estimating their contribution to the aggregate).
监控配置信息,包括但不限于训练数据偏差(是指资源贡献方所提供的训练数据与模型需求方所需要的模型样本的偏差程度)、资源偏差(一般指资源贡献方所拥有的计算资源与训练模型时所需要的计算资源之间的偏差)、在线稳定度偏差(一般是指资源贡献方在参与联合训练过程中出现掉线的概率与预设的在线率的偏差)等。Monitoring configuration information, including but not limited to training data deviation (refers to the degree of deviation between the training data provided by the resource contributor and the model sample required by the model demander), resource deviation (generally refers to the difference between the computing resources owned by the resource contributor and The deviation between the computing resources required for training the model), the deviation of online stability (generally refers to the deviation between the probability that the resource contributor will be offline during the joint training process and the preset online rate), etc.
作为一示例,可以通过与运行预设的计算机程序在平台内存中产生一个用于配置资源分配中心101这一对象的数据结构,该数据结构可以是“[属性配置信息,贡献度配置信息,监控配置信息]”这种一维数组的形式。As an example, a data structure for configuring the object of the resource allocation center 101 can be generated in the platform memory by running a preset computer program, and the data structure can be "[attribute configuration information, contribution configuration information, monitoring configuration information]" in the form of a one-dimensional array.
步骤S202,获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方。Step S202, acquiring model demand information provided by multiple model demand parties, and determining a target demand party according to attribute configuration information and model demand information.
作为一示例,可以预先构建资源分配中心101与多个目标资源贡献方102和多个模型需求方103之间的通信通道。资源分配中心101可在启动仿真程序后,对该通信通道进行监听,并实现与多个目标资源贡献方102、多个模型需求方103的通信及信息交换。As an example, communication channels between the resource allocation center 101 and multiple target resource contributors 102 and multiple model demanders 103 may be constructed in advance. The resource allocation center 101 can monitor the communication channel after starting the simulation program, and realize communication and information exchange with multiple target resource contributors 102 and multiple model demanders 103 .
在一些实施例中,上述模型需求信息包括需求模型。根据属性配置信息和模型需求信息,确定目标需求方,具体包括:根据需求模型对多个模型需求方进行分类,得到与每个需求模型对应的模型需求方集合;根据属性配置信息,从模型需求方集合中筛选出目标需求方。In some embodiments, the above-mentioned model requirement information includes a requirement model. According to the attribute configuration information and model demand information, determine the target demand party, specifically including: classify multiple model demand parties according to the demand model, and obtain the set of model demand parties corresponding to each demand model; according to the attribute configuration information, from the model demand Select the target demand party from the party set.
具体的,在运行预设的模型拍卖仿真程序时,资源分配中心101可启动对通信通道的监听,并通过该通信通道接收多个模型需求方103提供的模型需求信息,该模型需求信息包括每个模型需求方的需求模型。这里的需求模型可以根据模型需求方的实际业务需求来确定,比如,模型需求方的业务需求是提高考勤系统的人脸识别准确性,那么其需求的模型可为人脸识别模型。Specifically, when running the preset model auction simulation program, the resource allocation center 101 can start monitoring the communication channel, and receive model demand information provided by multiple model demanders 103 through the communication channel. A demand model on the demand side of a model. The demand model here can be determined according to the actual business needs of the model demander. For example, if the business demand of the model demander is to improve the face recognition accuracy of the attendance system, then the required model can be a face recognition model.
接着,再根据所获取到的每个模型需求方的需求模型,对这些模型需求方进行分类,以获得与每个需求模型对应的模型需求方集合,该模型需求方集合包括至少一个模型需求方。Then, according to the obtained demand models of each model demand party, these model demand parties are classified to obtain a model demand party set corresponding to each demand model, and the model demand party set includes at least one model demand party .
示例性的,假设当前有5个模型需求方,分别为模型需求方A、B、C、D、E,其中,模型需求方A、B的需求模型为人脸识别模型,模型需求方C、D、E的需求模型为负荷预测模型。那么,可根据需求模型将上述的5个模型需求方分成两类,一类为与人脸识别模型对应的模型需求方集合01(包括模型需求方A、B),另一类为与负荷预测模型对应的模型需求方集合02(包括模型需求方C、D、E)。For example, assume that there are currently five model demanders, namely model demanders A, B, C, D, and E. Among them, the demand models of model demanders A and B are face recognition models, and model demanders C and D , The demand model of E is a load forecasting model. Then, according to the demand model, the above five model demanders can be divided into two categories, one is the model demander set 01 corresponding to the face recognition model (including model demanders A and B), and the other is the load forecasting model The set of model demanders 02 corresponding to the model (including model demanders C, D, and E).
在一些实施例中,根据属性配置信息,从模型需求方集合中筛选出目标需求方,具体包括:获取模型需求方集合中的每个模型需求方的预算资源;将预算资源最多的模型需求方确定为目标需求方。In some embodiments, according to the attribute configuration information, the target demander is screened out from the set of model demanders, which specifically includes: obtaining the budget resources of each model demander in the set of model demanders; Determined as the target demand side.
假设给定的属性配置信息为首价拍卖,即价高者获胜。那么,可进一步获取每个模型需求方的预算资源,这里的预算资源可以是指预算费用,即购买其所需模型的预算。然后,比较每个模型需求方集合中的两两模型需求方之间的预算资源大小,并将其中预算资源最多(即出价最高)的模型需求方确定为目标需求方。Assume that the given attribute configuration information is a first-price auction, that is, the one with the highest price wins. Then, the budget resources of each model demander can be further obtained, where the budget resources can refer to budget expenses, that is, the budget for purchasing the required models. Then, compare the size of budget resources between any pair of model demanders in each set of model demanders, and determine the model demander with the most budget resources (that is, the highest bid) as the target demander.
示例性的,若模型需求方集合01中的模型需求方A的预算资源为X元,模型需求方B的预算资源为Y元,且X>Y,则将模型需求方A确定为目标需求方。Exemplarily, if the budget resource of model demander A in model demander set 01 is X yuan, and the budget resource of model demander B is Y yuan, and X>Y, then determine model demander A as the target demander .
步骤S203,确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量。Step S203, determine multiple target resource contributors matching the model requirement information, and obtain the model resources of each target resource contributor, where the model resources include model parameters and effective training data volume.
在一些实施例中,资源分配中心101可通过预设的通信通道广播模型招标信息,以使各个资源贡献方接收模型招标信息,模型招标信息包括需求模型、所需训练样本、所需样本数量和激励系数;接收多个资源贡献方基于模型招标信息反馈的待拍卖模型的模型信息。In some embodiments, the resource allocation center 101 can broadcast model bidding information through a preset communication channel, so that each resource contributor can receive the model bidding information. The model bidding information includes the demand model, the required training samples, the required number of samples, and Incentive coefficient; receive the model information of the model to be auctioned based on the model bidding information fed back by multiple resource contributors.
其中,所需训练样本类型,可根据需求模型确定,假设需求模型为人脸识别模型,那么其所需训练样本可为包含人脸的图片/图像/视频。所需样本数量,通常是指至少满足一轮次联合训练需求的样本量。比如,一轮训练需要100条样本,那么所需样本数量则为100条。Wherein, the required training sample type can be determined according to the requirement model, assuming that the requirement model is a face recognition model, then the required training samples can be pictures/images/videos containing faces. The required number of samples usually refers to the sample size that meets at least one round of joint training requirements. For example, if a round of training requires 100 samples, then the number of samples required is 100.
激励系数,与训练样本和需求模型的关联度有关,通常关联度越高,对应的激励系数越大。作为一示例,假设所需训练样本为包含人脸的图片/图像/视频,那么包含人脸的图片/图像/视频的训练样本,可视为关联度较高的样本,其所对应的激励系数可设置为0.8,而不包含人脸的图片/图像/视频的训练样本,可视为关联度较低(或者不关联)的样本,其所对应的激励系数可设置为0.2。The incentive coefficient is related to the degree of correlation between the training sample and the demand model. Usually, the higher the degree of correlation, the greater the corresponding incentive coefficient. As an example, assuming that the required training samples are pictures/images/videos containing faces, then the training samples containing pictures/images/videos of faces can be regarded as samples with high correlation, and the corresponding excitation coefficient It can be set to 0.8, and the training samples of pictures/images/videos that do not contain human faces can be regarded as samples with low correlation (or no correlation), and the corresponding excitation coefficient can be set to 0.2.
作为一示例,当资源贡献方(拥有训练数据的参与方)通过上述通信通道接收到资源分配中心101广播的模型招标信息为需求模型为人脸识别模型、所需训练样本(包含人脸的图片/图像/视频)、所需样本数量(一轮次100条样本)和激励系数(含人脸的图片/图像/视频的样本的激励系数为0.8,不含人脸的图片/图像/视频的样本的激励系数为0.2)时,可以根据其自身拥有的训练数据以及上述模型招标信息,确定其是否参与本次投标竞拍活动。As an example, when the resource contributor (participant with training data) receives the model bidding information broadcast by the resource allocation center 101 through the above communication channel, the required model is a face recognition model, and the required training samples (pictures/ image/video), the number of required samples (100 samples per round) and the excitation coefficient (the excitation coefficient of the sample of the picture/image/video containing the face is 0.8, and the sample of the picture/image/video without the face When the incentive coefficient of is 0.2), it can be determined whether to participate in this bidding activity according to its own training data and the above-mentioned model bidding information.
在一些实施例中,可先获取多个资源贡献方上报的待拍卖模型的模型信息,待拍卖模型的模型信息包括待拍卖模型的模型类型;再计算待拍卖模型的模型类型与需求模型的模型类型之间的相似度,最后,根据相似度确定与需求模型的模型类型相匹配的多个目标资源贡献方。In some embodiments, the model information of the model to be auctioned reported by multiple resource contributors can be obtained first, and the model information of the model to be auctioned includes the model type of the model to be auctioned; then the model type of the model to be auctioned and the model of the demand model can be calculated The similarity between types, and finally, multiple target resource contributors matching the model type of the requirement model are determined according to the similarity.
作为一示例,根据上述模型招标信息,结合自身拥有的训练数据,确定参与本次投标竞拍活动的资源贡献方,可以使用其拥有的训练数据训练资源分配中心101下发的基本模型,得到训练模型,并将该训练模型的模型信息(即待拍卖模型的模型信息)反馈给资源分配中心101。As an example, according to the above model bidding information, combined with its own training data, determine the resource contributors participating in this bidding activity, and can use the training data they have to train the basic model issued by the resource allocation center 101 to obtain the training model , and feed back the model information of the training model (that is, the model information of the model to be auctioned) to the resource allocation center 101 .
作为一示例,资源分配中心101在获取到各个资源贡献方上报的待拍卖模型的模型信息(包括待拍卖模型的模型类型)后,可进一步计算每个资源贡献方上报的待拍卖模型的模型类型与需求模型的模型类型之间的相似度(比如,余弦相似度),再按照相似度从大到小对各个资源贡献方进行排序,得到排序列表。最后,根据实际需要,按排序列表的顺序依次选取至少两个资源贡献方作为目标资源贡献方。As an example, after obtaining the model information (including the model type of the model to be auctioned) reported by each resource contributor, the resource allocation center 101 can further calculate the model type of the model to be auctioned reported by each resource contributor The similarity (for example, cosine similarity) with the model type of the requirement model, and then sort each resource contributor according to the similarity from large to small to obtain a sorted list. Finally, according to actual needs, at least two resource contributors are sequentially selected as target resource contributors in the order of the sorted list.
步骤S204,根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方。Step S204, according to attribute configuration information, contribution degree configuration information, monitoring configuration information and model resources, determine the distribution value corresponding to each target resource contributor, and feed back the distribution value to each target resource contributor.
在一些实施例中,根据属性配置信息和预设的第一权重,计算每个目标资源贡献方的拍卖值;In some embodiments, the auction value of each target resource contributor is calculated according to the attribute configuration information and the preset first weight;
根据贡献度配置信息、模型参数、有效训练数据量及预设的第二权重,计算每个目标资源贡献方的贡献值;Calculate the contribution value of each target resource contributor according to the contribution configuration information, model parameters, effective training data volume and the preset second weight;
根据监控配置信息和预设的第三权重,计算每个目标资源贡献方的惩罚值;Calculate the penalty value of each target resource contributor according to the monitoring configuration information and the preset third weight;
根据拍卖值、贡献值和惩罚值,确定每个目标资源贡献方对应的分配值。According to the auction value, contribution value and penalty value, determine the allocation value corresponding to each target resource contributor.
其中,第一权重、第二权重和第三权重可以根据实际情况灵活设置,具体在此不做限制。例如,第一权重、第二权重和第三权重可以分别设置为0.5、0.3、0.2。Wherein, the first weight, the second weight, and the third weight can be flexibly set according to actual conditions, and there is no specific limitation here. For example, the first weight, the second weight, and the third weight may be set to 0.5, 0.3, and 0.2, respectively.
作为一示例,根据属性配置信息和预设的第一权重,计算每个目标资源贡献方的拍卖值。具体的,可以根据拍卖方式所对应的定价及其对应的预设第一权重值来计算出拍卖值。比如,拍卖方式为首价拍卖,首价拍卖的定价可为目标需求方提供的预算资源扣除资源分配中心的服务成本之后的剩余部分。As an example, the auction value of each target resource contributor is calculated according to the attribute configuration information and the preset first weight. Specifically, the auction value may be calculated according to the pricing corresponding to the auction mode and the corresponding preset first weight value. For example, the auction method is a first-price auction, and the pricing of the first-price auction may be the remaining part after deducting the service cost of the resource distribution center from the budget resources provided by the target demander.
有效训练数据量,是指与目标需求方所需要的模型的关联度达到预设关联度阈值的训练数据的数量。比如,目标需求方所需要的模型为人脸识别模型,那么与人脸识别模型关联的训练数据是包含人脸的图像/图片/视频。预设的关联度阈值可以根据实际情况设置,如可以设置为50%、100%等。示例性,假设包含人脸的图像/图片/视频的训练数据与人脸识别模型的关联度为100%,不包含人脸的图像/图片/视频的训练数据与人脸识别模型的关联度为0%,预设的关联度阈值可以设置为100%。也就是说,该示例中的有效训练数据量,是指与目标 需求方所需要的人脸识别模型的关联度达到100%(即包含人脸的图像/图片/视频的训练数据)的条数。The amount of effective training data refers to the amount of training data whose degree of correlation with the model required by the target demander reaches the preset correlation degree threshold. For example, if the model required by the target demand side is a face recognition model, then the training data associated with the face recognition model is an image/picture/video containing a face. The preset correlation degree threshold can be set according to actual conditions, for example, it can be set to 50%, 100%, and so on. Exemplarily, it is assumed that the training data of images/pictures/videos containing faces has a 100% correlation with the face recognition model, and the training data of images/pictures/videos not containing faces has a correlation of 100% with the face recognition model. 0%, the preset correlation threshold can be set to 100%. That is to say, the amount of effective training data in this example refers to the number of pieces that are 100% related to the face recognition model required by the target demander (that is, the training data of images/pictures/videos containing faces) .
一般地,当训练模型为深度神经网络模型,其模型参数一般包括权重和偏置。Generally, when the training model is a deep neural network model, its model parameters generally include weights and biases.
作为一示例,根据贡献度配置信息、模型参数、有效训练数据量及预设的第二权重,计算每个目标资源贡献方的贡献值。具体的,可以将资源贡献方提供的模型参数更新资源分配中心处存储的目标需求方所需的模型的迭代模型,然后,用预存的测试数据对更新后的迭代模型进行推演测试,以确定该资源贡献方所提供的模型参数对该迭代模型的性能改善的程度。通常改善程度越高,则证明该资源贡献方的贡献度越高,贡献值也就越高。同理,当资源贡献方所提供的有效训练数据量越多,则对模型性能的改善的贡献则越大,贡献值也就越高。As an example, the contribution value of each target resource contributor is calculated according to the contribution degree configuration information, the model parameters, the effective training data amount and the preset second weight. Specifically, the model parameters provided by the resource contributor can be updated to the iterative model of the model required by the target demand party stored in the resource distribution center, and then the updated iterative model is deduced and tested with the pre-stored test data to determine the The degree to which the model parameters provided by the resource contributor improve the performance of the iterative model. Usually, the higher the degree of improvement, the higher the contribution of the resource contributor and the higher the contribution value. Similarly, when the amount of effective training data provided by resource contributors is greater, the contribution to the improvement of model performance will be greater, and the contribution value will be higher.
在实际应用中,可以根据预先配置的贡献度衡量方法(如按边际贡献分配),以及各资源贡献方所提供的模型参数、有效训练数据量和预设的第二权重,计算出各资源贡献方的贡献值。In practical applications, the contribution of each resource can be calculated according to the pre-configured contribution measurement method (such as distribution by marginal contribution), as well as the model parameters provided by each resource contributor, the amount of effective training data, and the preset second weight. party's contribution.
在一些实施例中,根据监控配置信息和预设的第三权重,计算每个目标资源贡献方的惩罚值,具体包括:In some embodiments, the penalty value of each target resource contributor is calculated according to the monitoring configuration information and the preset third weight, specifically including:
确定每个目标资源贡献方的第一惩罚项、第二惩罚项和第三惩罚项;Determine the first penalty item, the second penalty item, and the third penalty item for each target resource contributor;
根据第一惩罚项、第二惩罚项和第三惩罚项,计算每个目标资源贡献方的惩罚值。According to the first penalty item, the second penalty item and the third penalty item, the penalty value of each target resource contributor is calculated.
惩罚值,主要是为了惩罚资源贡献方提供不真实或者无效的模型信息而设置的。The penalty value is mainly set to punish resource contributors for providing false or invalid model information.
其中,第一惩罚项可以是训练数据偏差,第二惩罚项可以是资源偏差,第三惩罚项可以是在线稳定度偏差。Wherein, the first penalty item may be training data deviation, the second penalty item may be resource deviation, and the third penalty item may be online stability deviation.
对于上述每种偏差项可以根据其偏差对模型性能的影响程度来确定其对应的惩罚权重。比如,第一惩罚项对模型性能的影响程度>第二惩罚项>第三惩罚项,那么第一惩罚项、第二惩罚项和第三惩罚项的惩罚权重可分别设置为0.7、0.2、0.1。需要说明的是,惩罚权重可以根据实际情况灵活设置,在此不做具体限制。For each of the above deviation items, the corresponding penalty weight can be determined according to the influence degree of the deviation on the model performance. For example, the degree of influence of the first penalty item on model performance > the second penalty item > the third penalty item, then the penalty weights of the first penalty item, the second penalty item, and the third penalty item can be set to 0.7, 0.2, and 0.1 respectively . It should be noted that the penalty weight can be flexibly set according to the actual situation, and no specific limitation is set here.
作为一示例,可根据公式:惩罚值=第一惩罚项*第一惩罚权重+第二惩罚项*第二惩罚权重+第三惩罚项*第三惩罚权重,计算每个资源贡献方的惩罚值。As an example, the penalty value of each resource contributor can be calculated according to the formula: penalty value = first penalty item * first penalty weight + second penalty item * second penalty weight + third penalty item * third penalty weight .
之后,再根据公式:分配值=拍卖值+贡献值-惩罚值,计算出每个目标资源贡献方对应的分配值。After that, according to the formula: allocation value = auction value + contribution value - penalty value, the allocation value corresponding to each target resource contributor is calculated.
本公开实施例提供的技术方案,通过读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方,充分考虑到各个目标资源贡献方的对模型需求方所需要的模型的效用的贡献度等方面因素,使得各个目标资源贡献方能够得到与其所提供的模型资源相匹配的分配值,不仅可激发资源贡献方提供模型资源的积极性,还可以进一步约束资源贡献方提供真实、有效的模型参数,以利于联合学习的长期良好可持续发展。In the technical solution provided by the embodiments of the present disclosure, by reading the preset resource allocation configuration information, the resource allocation configuration information includes attribute configuration information, contribution degree configuration information and monitoring configuration information; obtaining model demand information provided by multiple model demanders, Determine the target demander according to the attribute configuration information and model demand information; determine multiple target resource contributors that match the model demand information, and obtain the model resources of each target resource contributor. The model resources include model parameters and effective training data volume; According to the attribute configuration information, contribution degree configuration information, monitoring configuration information and model resources, determine the corresponding distribution value of each target resource contributor, and feed back the distribution value to each target resource contributor, fully considering each target resource contributor Factors such as the degree of contribution to the utility of the model required by the model demander enable each target resource contributor to obtain an allocation value that matches the model resources it provides, which not only stimulates the enthusiasm of the resource contributor to provide model resources, but also Resource contributors can be further constrained to provide real and effective model parameters, so as to facilitate the long-term sustainable development of joint learning.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
下述为本公开装置实施例,可以用于执行本公开方法实施例。对于本公开装置实施例中未披露的细节,请参照本公开方法实施例。The following are device embodiments of the present disclosure, which can be used to implement the method embodiments of the present disclosure. For details not disclosed in the disclosed device embodiments, please refer to the disclosed method embodiments.
基于与图2所示的基于联合学习的资源分配方法相同的发明构思,本公开实施例还提供一种基于联合学习的资源分配装置。如图3所示,该基于联合学习的资源分配装置包括:Based on the same inventive concept as the resource allocation method based on joint learning shown in FIG. 2 , an embodiment of the present disclosure further provides a resource allocation device based on joint learning. As shown in Figure 3, the resource allocation device based on joint learning includes:
读取模块301,被配置为读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;The reading module 301 is configured to read preset resource allocation configuration information, where the resource allocation configuration information includes attribute configuration information, contribution degree configuration information, and monitoring configuration information;
需求方确定模块302,被配置为获取多个模型需求方提供的模型需求信息,根据属性配 置信息和模型需求信息,确定目标需求方;The demander determination module 302 is configured to obtain the model demand information provided by multiple model demand parties, and determine the target demand party according to the attribute configuration information and the model demand information;
资源获取模块303,被配置为确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;The resource acquisition module 303 is configured to determine multiple target resource contributors that match the model requirement information, and acquire the model resources of each target resource contributor, where the model resources include model parameters and effective training data volume;
分配模块304,被配置为根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方。The allocation module 304 is configured to determine the allocation value corresponding to each target resource contributor according to attribute configuration information, contribution configuration information, monitoring configuration information, and model resources, and feed back the allocation value to each target resource contributor.
本公开实施例提供的技术方案,通过读取模块301读取预设的资源分配配置信息,资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;需求方确定模块302获取多个模型需求方提供的模型需求信息,根据属性配置信息和模型需求信息,确定目标需求方;资源获取模块303确定与模型需求信息匹配的多个目标资源贡献方,获取每个目标资源贡献方的模型资源,模型资源包括模型参数及有效训练数据量;分配模块304根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,并将分配值反馈至各个目标资源贡献方,充分考虑到各个目标资源贡献方的对模型需求方所需要的模型的效用的贡献度等方面因素,使得各个目标资源贡献方能够得到与其所提供的模型资源相匹配的分配值,不仅可激发资源贡献方提供模型资源的积极性,还可以进一步约束资源贡献方提供真实、有效的模型参数,以利于联合学习的长期良好可持续发展。In the technical solution provided by the embodiments of the present disclosure, the preset resource allocation configuration information is read through the reading module 301. The resource allocation configuration information includes attribute configuration information, contribution configuration information, and monitoring configuration information; the demand side determination module 302 obtains multiple The model demand information provided by the model demand party determines the target demand party according to the attribute configuration information and model demand information; the resource acquisition module 303 determines multiple target resource contributors that match the model demand information, and obtains the model of each target resource contributor Resources, model resources include model parameters and effective training data volume; allocation module 304 determines the allocation value corresponding to each target resource contributor according to attribute configuration information, contribution configuration information, monitoring configuration information and model resources, and feeds back the allocation value To each target resource contributor, fully consider the contribution of each target resource contributor to the utility of the model required by the model demander, so that each target resource contributor can get the model resources that match the model resources it provides. The assigned value can not only stimulate the enthusiasm of resource contributors to provide model resources, but also further constrain resource contributors to provide real and effective model parameters, so as to facilitate the long-term sustainable development of joint learning.
在一些实施例中,上述模型需求信息包括需求模型。上述步骤,根据属性配置信息和模型需求信息,确定目标需求方,包括:In some embodiments, the above-mentioned model requirement information includes a requirement model. In the above steps, the target demander is determined according to the attribute configuration information and model demand information, including:
根据需求模型对多个模型需求方进行分类,得到与每个需求模型对应的模型需求方集合;Classify multiple model demanders according to the demand model, and obtain a set of model demanders corresponding to each demand model;
根据属性配置信息,从模型需求方集合中筛选出目标需求方。According to the attribute configuration information, the target demander is filtered out from the set of model demanders.
在一些实施例中,上述步骤,据属性配置信息,从模型需求方集合中筛选出目标需求方,包括:In some embodiments, the above step, according to the attribute configuration information, screens out the target demander from the set of model demanders, including:
获取模型需求方集合中的每个模型需求方的预算资源;Obtain the budget resources of each model demander in the model demander collection;
将预算资源最多的模型需求方确定为目标需求方。Determine the model demander with the most budget resources as the target demander.
在一些实施例中,上述步骤,确定与模型需求信息匹配的多个目标资源贡献方,包括:In some embodiments, the above step of determining multiple target resource contributors that match the model requirement information includes:
获取多个资源贡献方上报的待拍卖模型的模型信息,待拍卖模型的模型信息包括待拍卖模型的模型类型;Obtain the model information of the model to be auctioned reported by multiple resource contributors, the model information of the model to be auctioned includes the model type of the model to be auctioned;
计算待拍卖模型的模型类型与需求模型的模型类型之间的相似度,根据相似度确定与需求模型的模型类型相匹配的多个目标资源贡献方。Calculate the similarity between the model type of the model to be auctioned and the model type of the demand model, and determine multiple target resource contributors matching the model type of the demand model according to the similarity.
在一些实施例中,上述步骤,获取多个资源贡献方上报的待拍卖模型的模型信息之前,还包括:In some embodiments, the above steps, before obtaining the model information of the models to be auctioned reported by multiple resource contributors, further include:
通过预设的通信通道广播模型招标信息,以使各个资源贡献方接收模型招标信息,模型招标信息包括需求模型、所需训练样本、所需样本数量和激励系数;The model bidding information is broadcast through the preset communication channel so that each resource contributor can receive the model bidding information. The model bidding information includes the demand model, the required training samples, the required number of samples and the incentive coefficient;
接收多个资源贡献方基于模型招标信息反馈的待拍卖模型的模型信息。Receive the model information of the model to be auctioned based on the model bidding information fed back by multiple resource contributors.
在一些实施例中,上述步骤,根据属性配置信息、贡献度配置信息、监控配置信息及模型资源,确定每个目标资源贡献方对应的分配值,包括:In some embodiments, the above step, according to attribute configuration information, contribution degree configuration information, monitoring configuration information and model resources, determines the allocation value corresponding to each target resource contributor, including:
根据属性配置信息和预设的第一权重,计算每个目标资源贡献方的拍卖值;Calculate the auction value of each target resource contributor according to the attribute configuration information and the preset first weight;
根据贡献度配置信息、模型参数、有效训练数据量及预设的第二权重,计算每个目标资源贡献方的贡献值;Calculate the contribution value of each target resource contributor according to the contribution configuration information, model parameters, effective training data volume and the preset second weight;
根据监控配置信息和预设的第三权重,计算每个目标资源贡献方的惩罚值;Calculate the penalty value of each target resource contributor according to the monitoring configuration information and the preset third weight;
根据拍卖值、贡献值和惩罚值,确定每个目标资源贡献方对应的分配值。According to the auction value, contribution value and penalty value, determine the allocation value corresponding to each target resource contributor.
在一些实施例中,上述步骤,根据监控配置信息和预设的第三权重,计算每个目标资源贡献方的惩罚值,包括:In some embodiments, the above step, according to the monitoring configuration information and the preset third weight, calculates the penalty value of each target resource contributor, including:
确定每个目标资源贡献方的第一惩罚项、第二惩罚项和第三惩罚项;Determine the first penalty item, the second penalty item, and the third penalty item for each target resource contributor;
根据第一惩罚项、第二惩罚项和第三惩罚项,计算每个目标资源贡献方的惩罚值。According to the first penalty item, the second penalty item and the third penalty item, the penalty value of each target resource contributor is calculated.
图4是本公开实施例的一种基于联合学习的资源分配系统的结构示意图。如图4所示,该系统中服务器为协调中心401,贡献方和需求方分别与协调中心401通信连接。其中,贡 献方包括贡献方传输模块402,需求方包括需求方传输模块403。Fig. 4 is a schematic structural diagram of a resource allocation system based on joint learning according to an embodiment of the present disclosure. As shown in FIG. 4 , the server in this system is the coordination center 401 , and the contributor and the demander communicate with the coordination center 401 respectively. Wherein, the contributor includes the contributor's transmission module 402, and the demander includes the demander's transmission module 403.
其中,协调中心401、贡献方和需求方可由该系统预设的计算机程序产生,具体的,可以产生一个协调中心401,多个贡献方和至少一个需求方。协调中心401可分别与多个贡献方和至少一个需求方通过预设的通信通道进行通信,并实现信息交换。Wherein, the coordination center 401, the contributor and the demander can be generated by a computer program preset by the system, specifically, one coordination center 401, multiple contributors and at least one demander can be generated. The coordination center 401 can respectively communicate with multiple contributors and at least one demander through a preset communication channel, and realize information exchange.
上述的贡献方和需求方均属于联合学习的参与方。当某参与方基于其业务需求等原因而想要通过联合学习的方式获取其所需的应用模型时,可由系统预设的计算机程序将该参与方配置为需求方,而其他参与方则配置为贡献方。可以理解的,当该需求方参与其他联合学习任务时,可由系统预设的计算机程序将该需求方重新配置为贡献方。也就是说,这里的需求方和贡献方的角色并不是固定的,而是可以根据实际情况灵活配置的。The contributors and demanders mentioned above belong to the participants of joint learning. When a participant wants to obtain the application model he needs through federated learning based on its business needs and other reasons, the computer program preset by the system can configure the participant as the demander, while other participants can be configured as contributor. It can be understood that when the demander participates in other joint learning tasks, the computer program preset by the system can reconfigure the demander as a contributor. In other words, the roles of demanders and contributors here are not fixed, but can be flexibly configured according to the actual situation.
在实际应用中,可以通过预设的智能体产生规则来随机产生多个贡献方和至少一个需求方。其中,预设的智能体产生规则,可以是按照报价范围来生成多个报价较高的贡献方和多个报价较低的贡献方,以及至少一个需求方。贡献方,是指拥有训练资源(比如,训练数据、计算资源等)的智能体。需求方,可以是拥有训练资源(比如,拥有的训练资源不足或者与其所需的模型不符合)的智能体,也可以是没有任何训练资源的智能体。在一些实施例中,需求方可以是从多个贡献方中指定的任意一个或多个。In practical applications, multiple contributors and at least one demander can be randomly generated through preset agent generation rules. Wherein, the preset agent generation rule may be to generate a plurality of contributors with higher quotations, a plurality of contributors with lower quotations, and at least one demander according to the quotation range. Contributors refer to agents that have training resources (such as training data, computing resources, etc.). The demand side can be an agent with training resources (for example, insufficient training resources or inconsistent with its required model), or an agent without any training resources. In some embodiments, the requester may be any one or more specified from multiple contributors.
此外,预设的智能体产生规则,还可以是按照训练数据的数量范围或者质量等级来产生多个贡献方和至少一个需求方。In addition, the preset agent generation rule may also be to generate multiple contributors and at least one demander according to the quantity range or quality level of the training data.
需求方传输模块403,被配置为按照预设的时间步长,向协调中心发送模型需求信息,模型需求信息包括需求模型。The demander transmission module 403 is configured to send model demand information to the coordination center according to a preset time step, where the model demand information includes a demand model.
其中,预设的时间步长,是指仿真训练的启动间隔时间。该间隔时间可以是每间隔相等的时长,也可以是每间隔一个随机的时长。比如,每间隔1分钟自动向协调中心401发送一次模型需求信息;或者,在间隔1分钟、5分钟、7分钟…时自动向协调中心401发送一次模型需求信息。预设的时间步长,可以根据实际需求灵活设置,在此不做限制。Wherein, the preset time step refers to the starting interval time of the simulation training. The interval time may be an equal duration for each interval, or may be a random duration for each interval. For example, the model requirement information is automatically sent to the coordination center 401 at intervals of 1 minute; or, the model requirement information is automatically sent to the coordination center 401 at intervals of 1 minute, 5 minutes, 7 minutes.... The preset time step can be flexibly set according to actual needs, and there is no limit here.
需求模型,可以根据需求方的具体业务需求来确定,比如,需求方的业务需求是提高考勤系统的人脸识别准确性,那么其需求的模型可为人脸识别模型。The demand model can be determined according to the specific business needs of the demand side. For example, if the business demand of the demand side is to improve the face recognition accuracy of the attendance system, then the demand model can be a face recognition model.
协调中心401,被配置为根据模型需求信息生成模型训练信息,并广播模型训练信息,模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略。The coordination center 401 is configured to generate model training information according to the model requirement information, and broadcast the model training information. The model training information includes the preset basic model, training sample type, sample size required for each round, and participation strategy.
在一些实施例中,可根据模型需求信息和预设的配置信息,生成模型训练信息;再将模型训练信息发送至处于空闲状态的空闲贡献方和/或资源富余贡献方。In some embodiments, model training information can be generated according to model requirement information and preset configuration information; and then the model training information can be sent to idle contributors and/or resource surplus contributors in an idle state.
其中,预设的配置信息,可包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略。Wherein, the preset configuration information may include a preset basic model, training sample type, sample size required for each round, and participation strategy.
预设的基本模型,可以是一个随机初始化的模型,也可以是需求方提供的模型。The preset basic model can be a randomly initialized model, or a model provided by the demand side.
训练样本类型,可以根据需求模型的类型来具体确定,比如,需求模型为人脸识别模型,训练样本类型,则可以是包含人脸的图片/照片/视频。The type of training sample can be specifically determined according to the type of the required model. For example, the required model is a face recognition model, and the type of training sample can be a picture/photo/video containing a human face.
每轮次所需样本量,是指参与一轮次的联合训练所需要的样本数量,也称为批次样本。该样本量可根据实际需求来确定。The sample size required for each round refers to the number of samples required to participate in a round of joint training, also known as batch samples. The sample size can be determined according to actual needs.
关于参与策略,针对贡献方的参与策略,可以是必须是仿真系统的注册会员,也可以是拥有XX训练资源(通常是与需求方所需的模型关联的训练数据),还可以是拥有XX计算资源,抑或者是拥有XX通信资源等等。具体的,可以根据实际情况来灵活设置,在此不做限定。Regarding the participation strategy, the participation strategy for the contributor can be that it must be a registered member of the simulation system, it can also be the possession of XX training resources (usually the training data associated with the model required by the demander), or it can be the possession of XX computing resources, or have XX communication resources and so on. Specifically, it can be flexibly set according to actual conditions, and is not limited here.
而针对需求方的参与策略,可以是拍卖方式,比如,首价拍卖(价高者胜)、VCG价格拍卖(其原理是计算竞价者赢得拍卖品(如模型)后,给整个竞价收入带来的收益损失,理论上这种损失就是竞价获胜者应该支付的费用)等。The participation strategy for the demand side can be an auction method, for example, the first price auction (the one with the highest price wins), the VCG price auction (the principle is to calculate the contribution to the entire bidding revenue after the bidder wins the auction item (such as a model) loss of income, theoretically this loss is the fee that the winner of the auction should pay), etc.
作为一示例,假设模型需求信息中的需求模型为用气负荷预设模型,预设的配置信息为基本模型M1、用气负荷测量数据、每轮次所需样本量100条、系统的注册会员。那么,根据上述配置信息和模型需求信息,可生成如下的模型训练信息“招募训练用气负荷预设负荷 模型的贡献方,应邀条件包括:拥有至少100条用气负荷测量数据,且系该系统的注册会员;附件为基本模型M1”。As an example, assume that the demand model in the model demand information is a preset model of gas load, and the preset configuration information is the basic model M1, gas load measurement data, 100 samples required for each round, registered members of the system . Then, according to the above configuration information and model demand information, the following model training information can be generated: "Recruitment of contributors to the training gas load preset load model. The invitation conditions include: having at least 100 pieces of gas load measurement data, and the system Registered member of ; the accessory is the basic model M1".
在一些实施例中,将模型训练信息发送至资源富余贡献方,具体包括:In some embodiments, sending model training information to resource surplus contributors specifically includes:
收集所有贡献方的资源状态信息,资源状态信息包括贡献方的计算资源信息和通信资源信息;根据资源状态信息判断贡献方是否属于资源富余贡献方;若是,则将模型训练信息发送至资源富余贡献方。Collect the resource status information of all contributors. The resource status information includes the contributor’s computing resource information and communication resource information; judge whether the contributor belongs to the resource surplus contributor according to the resource status information; if so, send the model training information to the resource surplus contribution square.
作为一示例,协调中心401可预先收集系统中的所有贡献方的资源状态信息(包括计算资源信息和通信资源信息),其中,计算资源信息,包括CPU资源、内存资源、硬盘资源和网络资源的相关信息。通信资源信息,通常包括数据传输延迟、传输带宽等方面的相关信息。As an example, the coordination center 401 can pre-collect the resource status information (including computing resource information and communication resource information) of all contributors in the system, wherein the computing resource information includes CPU resources, memory resources, hard disk resources, and network resources. Related Information. Communication resource information, usually including relevant information on data transmission delay, transmission bandwidth, etc.
作为一示例,资源富余贡献方,通常是指当前具有多余的计算资源和/或通信资源,且能够稳定支撑至少一轮次的联合训练的需要的贡献方。As an example, a resource surplus contributor usually refers to a contributor that currently has redundant computing resources and/or communication resources and can stably support at least one round of joint training.
本公开实施例提供的技术方案,通过先收集各贡献方的资源状态信息,再根据资源状态信息从多个贡献方中挑选出属于资源富余的贡献方,并将模型训练信息给这些资源富余贡献方,有利于提高联合训练的模型质量和模型的收敛效率。The technical solution provided by the embodiments of the present disclosure first collects the resource status information of each contributor, and then selects the contributors who belong to resource surplus from multiple contributors according to the resource status information, and gives the model training information to these resource surplus contributors. It is beneficial to improve the quality of the joint training model and the convergence efficiency of the model.
在另一些实施例中,将模型训练信息发送至处于空闲状态的空闲贡献方,具体包括:In some other embodiments, sending the model training information to the idle contributor in the idle state specifically includes:
获取所有贡献方的训练任务执行状态信息;根据训练任务执行状态信息,确定当前处于空闲状态的空闲贡献方,或者,当前执行的训练任务即将完成,可在预设的时间节点参与下一训练任务的空闲贡献方。Obtain the training task execution status information of all contributors; according to the training task execution status information, determine the idle contributors who are currently idle, or, the currently executing training task is about to be completed, and can participate in the next training task at the preset time node idle contributors.
其中,训练任务执行状态信息,包括当前是否有联合学习任务,当前参与的联合学习任务的执行进度(比如,未开始训练、训练中、训练结束)等方面的相关信息。Among them, the training task execution status information includes whether there is currently a joint learning task, and the execution progress of the currently participating joint learning task (for example, training has not started, training is in progress, training is over) and other related information.
作为一示例,假设获取到的贡献方A的训练任务执行状态信息为当前没有参与任何联合学习任务;贡献方B的训练任务执行状态信息为当前有2个联合学习任务,其中一个联合学习任务已经训练结束,另一个联合学习任务正在训练;贡献方C的训练任务执行状态信息为当前有1个联合学习任务,该任务在XX时间节点前可结束训练。那么,根据上述获取到的训练任务执行状态信息,可以确定当前处于空闲状态的空闲贡献方为贡献方A,当前执行的训练任务即将完成,可在预设的时间节点参与下一训练任务的空闲贡献方为C。然后,可以将模型训练信息发送至贡献方A和C。As an example, assume that the acquired training task execution status information of contributor A is not currently participating in any joint learning task; the training task execution status information of contributor B is that there are currently two joint learning tasks, and one of the joint learning tasks has been completed. The training is over, and another joint learning task is being trained; the execution status information of the training task of contributor C is that there is currently one joint learning task, and the task can end the training before the XX time node. Then, according to the training task execution status information obtained above, it can be determined that the idle contributor who is currently idle is Contributor A, the currently executing training task is about to be completed, and can participate in the next training task at the preset time node. The contributor is C. Then, the model training information can be sent to contributors A and C.
在又一些实施例中,广播模型训练信息,具体包括:In still other embodiments, the broadcast model training information specifically includes:
确定每个贡献方所拥有的训练数据与需求模型的关联程度;将关联程度满足预设的关联阈值的贡献方确定为拥有与需求模型相匹配的训练数据的贡献方;向拥有与需求模型相匹配的训练数据的贡献方下发模型训练信息。Determine the degree of association between the training data owned by each contributor and the demand model; determine the contributor who has the degree of association that meets the preset association threshold as the contributor that has training data that matches the demand model; The contributors of the matching training data deliver model training information.
作为一示例,首先,可以先向系统内的所有贡献方发送训练数据相关信息上报指令,所有贡献方在接收到该指令后,将其拥有的训练数据的相关信息上报给协调中心401,协调中心401即可通过此渠道收集到所有贡献方所拥有的训练数据的相关信息(比如,拥有XX训练数据)。As an example, firstly, an instruction to report training data related information may be sent to all contributors in the system. After receiving the instruction, all contributors will report information about their training data to the coordination center 401. The coordination center 401, the relevant information of the training data owned by all contributors can be collected through this channel (for example, XX training data).
作为另一示例,贡献方在系统注册时,就上报其拥有什么类型的训练数据,系统可将贡献方及其所拥有的训练数据进行关联存储,比如,建立贡献方与训练数据的对应关系表。在进行仿真训练过程中,协调中心401可从系统调取出该贡献方与训练数据的对应关系表,从而获取到所有贡献方所拥有的训练数据。As another example, when a contributor registers with the system, he reports what type of training data he owns, and the system can associate and store the contributor and the training data he owns, for example, to establish a correspondence table between the contributor and the training data . During the simulation training process, the coordination center 401 can retrieve the correspondence table between the contributor and the training data from the system, so as to obtain the training data owned by all contributors.
在一实施例中,确定每个贡献方所拥有的训练数据与需求模型的关联程度。具体的,可以根据每个贡献方所拥有的训练数据与训练需求模型所需要用到的数据之间的相似度来确定二者是否关联,以及关联程度如何。In one embodiment, it is determined how relevant the training data held by each contributor is to the demand model. Specifically, according to the similarity between the training data owned by each contributor and the data required for training the demand model, it can be determined whether the two are related and to what extent.
示例性的,假设需求模型为人脸识别模型(其所需的训练数据为包含人脸的图片/照片/视频),某贡献方A所拥有的训练数据为用气负荷预测数据,与人脸识别模型所需的训练数据完全不相关,即可确定贡献方A的训练数据与需求模型的关联度为0。若某贡献方B所拥 有的训练数据为包含人脸的图片/照片/视频,与人脸识别模型所需的训练数据完全吻合,即可确定贡献方B的训练数据与需求模型的关联度为100%。Exemplarily, it is assumed that the demand model is a face recognition model (the required training data is pictures/photos/videos containing faces), and the training data owned by a contributor A is gas load prediction data, which is related to face recognition The training data required by the model is completely irrelevant, so it can be determined that the correlation between the training data of contributor A and the demand model is 0. If the training data owned by a contributor B is a picture/photo/video containing a face, which is completely consistent with the training data required by the face recognition model, it can be determined that the correlation between the training data of the contributor B and the demand model is 100%.
本公开实施例提供的技术方案,通过先确定系统内的贡献方所拥有的训练数据是否与需求模型的关联程度,并从中挑选出拥有与需求模型相匹配的训练数据的贡献方,再将模型训练信息下发给这些贡献方,可以提高贡献方的响应率,并且有利于提高后续联合训练的模型效果和效率。The technical solution provided by the embodiments of the present disclosure first determines whether the training data owned by the contributors in the system is related to the demand model, and then selects the contributors who have training data that match the demand model, and then the model Sending training information to these contributors can improve the response rate of the contributors and help improve the model effect and efficiency of subsequent joint training.
贡献方传输模块402,被配置为当接收到模型训练信息,且确定参与模型训练时,向协调中心发送应邀申请。The contributor transmission module 402 is configured to send an invitation application to the coordination center when the model training information is received and it is determined to participate in the model training.
在一些实施例中,当接收到模型训练信息时,查询并判断预设的资源库中是否存储有与训练样本类型相同且数量满足每轮次所需样本量的训练样本;若有,则判断按照参与策略参与训练是否可达到其预设的期望回报资源;若可达到其预设的期望资源回报,则向协调中心发送应邀申请。In some embodiments, when the model training information is received, query and determine whether there are training samples of the same type as the training samples in the preset resource library and whose quantity satisfies the sample size required for each round; if so, determine Whether participating in the training according to the participation strategy can achieve the preset expected return resources; if the preset expected resource return can be achieved, an invitation application will be sent to the coordination center.
作为一示例,假设模型训练信息为“招募训练用气负荷预设负荷模型的贡献方,应邀条件包括:拥有至少100条用气负荷测量数据,且系该系统的注册会员,训练回报为X元;附件为基本模型M1”,那么各贡献方在接收到协调中心广播的上述模型训练信息时,可以先查询并判断其自身的资源库(如数据库)中是否存储有与广告中所要求的训练样本类型(用气负荷测量)相同且数量满足每轮次所需样本量(至少100条)的训练样本(用气负荷测量数据)。若查询到其资源库中存储有200条用气负荷测量数据,则进一步判断其是否为该系统的注册用户,若是,则再进一步判断其应邀参与到本次的联合训练中能否得到其预设的期望回报资源(如期望收益)。As an example, assume that the model training information is "Recruiting contributors to the preset load model of the training gas load. The invitation conditions include: having at least 100 pieces of gas load measurement data, and being a registered member of the system, and the training return is X yuan ; the attachment is the basic model M1", then when each contributor receives the above-mentioned model training information broadcast by the coordination center, he can first query and judge whether the training required in the advertisement is stored in his own resource library (such as a database). Training samples (data measured by gas load) with the same sample type (measured by gas load) and the number of samples meeting the required sample size (at least 100) for each round. If it is found that there are 200 pieces of gas load measurement data stored in its resource bank, it will be further judged whether it is a registered user of the system, and if so, it will be further judged whether it is invited to participate in this joint training. Set expected return resources (such as expected benefits).
假设贡献方A的预设期望回报资源(如期望收益)为不低于Y元,而X>Y,那么A参与到上述的联合训练中,可以获得X元的训练回报,可满足其预设的期望回报资源,此时,贡献方A可向协调中心401发送应邀申请。Assuming that contributor A’s preset expected return resource (such as expected income) is not less than Y yuan, and X>Y, then A participates in the above-mentioned joint training, and can obtain X yuan training return, which can meet its preset is expected to return resources, at this time, contributor A can send an invited application to the coordination center 401 .
协调中心401,还被配置为根据应邀申请锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个目标贡献方使用其训练资源对基本模型进行训练,直至满足预设的结束条件,得到全局模型,计算各个目标贡献方应得的贡献分配资源,将贡献分配资源反馈给相应的各个目标贡献方。The coordination center 401 is also configured to lock the training resources of at least two target contributors according to the invited application, and start a preset training program, so that each target contributor uses its training resources to train the basic model until the preset end Conditions, get the global model, calculate the contribution allocation resources that each target contributor should get, and feed back the contribution allocation resources to the corresponding target contributors.
在一些实施例中,协调中心401可通过接收多个贡献方发送的应邀申请,应邀申请包括训练资源和期望回报资源;再根据训练资源和期望回报资源,确定至少两个目标贡献方,并向目标贡献方下发中标通知,以锁定目标贡献方的训练资源。In some embodiments, the coordination center 401 can receive the invitation application sent by multiple contributors, the invitation application includes training resources and expected return resources; then determine at least two target contributors according to the training resources and expected return resources, and send to The target contributor issues a bid winning notice to lock the training resources of the target contributor.
其中,训练资源,包括训练数据(包括数据类型和数据数量)、计算资源、通信资源等。Among them, training resources include training data (including data type and data quantity), computing resources, communication resources, and the like.
预设的训练程序,通常是指程序员预先根据联合学习的训练过程设计的应用程序文件。The preset training program usually refers to the application program file designed by the programmer in advance according to the training process of federated learning.
作为一示例,根据训练资源和期望回报资源,确定至少两个目标贡献方。具体的,可通过判断贡献方是否拥有与需求方需求的模型相关的训练数据,且其期望回报资源在预设的训练回报范围内来筛选出至少两个目标贡献方。As an example, at least two target contributors are determined according to the training resources and the expected reward resources. Specifically, at least two target contributors can be screened out by judging whether the contributor has training data related to the model required by the demander, and its expected return resources are within a preset training return range.
假设需求方需求的模型为用气负荷预测模型,对应的所需要的训练数据为用气负荷测量数据,每轮次所需的样本量为100条,预设的训练回报范围为X~Y元。若某贡献方A拥有用气负荷测量数据200条,期望回报资源为X元,则可将贡献方A确定为目标贡献方之一。类似地,根据上述筛选方式,可以从多个贡献方中选定至少两个目标贡献方,并向这些目标贡献方下发中标通知(例如,“恭喜XX中标”的文字/语音等通知信息),并锁定所选定的目标贡献方的训练资源。Assume that the model demanded by the demand side is the gas load forecasting model, and the corresponding required training data is gas load measurement data. The sample size required for each round is 100, and the preset training return range is X to Y yuan . If a contributor A has 200 pieces of gas load measurement data and the expected return resource is X yuan, then contributor A can be determined as one of the target contributors. Similarly, according to the above screening method, at least two target contributors can be selected from multiple contributors, and bid winning notifications can be issued to these target contributors (for example, text/voice notification information such as "Congratulations to XX for winning the bid") , and lock the training resources of the selected target contributor.
接着,启动预设的训练程序,在系统内模拟各个目标贡献方在“本地”使用其训练数据对基本模型M1进行第一轮联合训练,得到各个贡献方上传的第一轮模型参数,并将这些第一轮模型参数进行聚合,得到第一聚合参数,再根据该第一聚合参数对基本模型M1进行更新,得到更新模型M2;然后,再将更新模型M2下发至各个目标贡献方,使得各个目标贡献方再使用其训练数据对更新模型M2进行第二轮训练,得到第二轮模型参数,并将这些第 二轮模型参数进行聚合,得到第二聚合参数,再根据第二聚合参数对更新模型M2进行更新,得到更新模型M3,再将更新模型M3下发至各个目标贡献方,使得各个目标贡献方再使用其训练数据对更新模型M3进行第三轮训练,即不断重复上述迭代训练过程,直至达到预设的结束条件(如预设的仿真训练轮数,预设的模型精度等),得到全局模型。Then, start the preset training program, simulate each target contributor in the system to use its training data "locally" to conduct the first round of joint training on the basic model M1, obtain the first round of model parameters uploaded by each contributor, and These first-round model parameters are aggregated to obtain the first aggregation parameters, and then the basic model M1 is updated according to the first aggregation parameters to obtain the updated model M2; then, the updated model M2 is sent to each target contributor, so that Each target contributor uses its training data to conduct a second round of training on the updated model M2 to obtain the second round of model parameters, and aggregate these second round model parameters to obtain the second aggregation parameters, and then according to the second aggregation parameters to The updated model M2 is updated to obtain the updated model M3, and then the updated model M3 is sent to each target contributor, so that each target contributor can use its training data to perform the third round of training on the updated model M3, that is, repeat the above iterative training process until the preset end condition is reached (such as the preset number of simulation training rounds, the preset model accuracy, etc.), and the global model is obtained.
在一些实施例中,在得到全局模型之后,计算各个目标贡献方应得的贡献分配资源,具体的,可根据预设的贡献度衡量策略、拍卖策略和惩罚策略,计算各个目标贡献方应得的贡献分配资源。In some embodiments, after obtaining the global model, calculate the contribution allocation resources that each target contributor should receive. Specifically, the calculation of each target contributor’s share of resources can be done according to the preset contribution measurement strategy, auction strategy, and punishment strategy. contribution to allocate resources.
其中,预设的贡献度衡量策略,包括按同等贡献分配资源(即平均分配),按边际贡献分配资源(按各节点(贡献方)加入团队(联合训练同盟)时所产生的效用),基于Shapely值分配(在排除节点以不同顺序加入集合体中所带来的影响,从而更公平地预估它们对集合体做出的贡献)。Among them, the preset contribution measurement strategy includes allocating resources according to equal contribution (that is, average distribution), allocating resources according to marginal contribution (according to the utility generated when each node (contributor) joins the team (joint training alliance)), based on Shapely value assignment (by excluding the effect of nodes joining the ensemble in a different order to more fairly estimate their contribution to the ensemble).
拍卖策略,通常是指拍卖方式,包括首价拍卖、VCG价格拍卖等。其中,首价拍卖,其原理是出价最高者胜出。VCG价格拍卖,其原理则是计算竞价者赢得拍卖品后,给整个竞价收入带来的收益损失,理论上这种损失就是竞价获胜者应该支付的费用。Auction strategies usually refer to auction methods, including first-price auctions, VCG price auctions, etc. Among them, the principle of first-price auction is that the highest bidder wins. The principle of VCG price auction is to calculate the profit loss brought to the entire bidding revenue after the bidder wins the auction item. In theory, this loss is the fee that the bid winner should pay.
惩罚策略,主要是指针对个体成员的某些恶意行为而设置的惩罚措施。示例性的,恶意行为(惩罚项目)包括:上报的训练数据与实际参与训练的训练数据存在较大偏差;上报的计算资源与实际参与训练的计算资源存在较大偏差;在训练过程中,在线率低于预设的阈值等。惩罚措施,可以是针对不同的恶意行为制定相应的扣分值或者扣分系数。Punishment strategy mainly refers to the punishment measures set for certain malicious behaviors of individual members. Exemplary, the malicious behavior (punishment item) includes: there is a large deviation between the reported training data and the training data actually participating in the training; there is a large deviation between the reported computing resources and the computing resources actually participating in the training; during the training process, online rate below a preset threshold, etc. Punishment measures can be to formulate corresponding deduction values or deduction coefficients for different malicious behaviors.
作为一示例,可以根据拍卖策略和需求方所提供的预算,计算出拍卖定价,该拍卖定价为扣除了协调中心的服务预算。比如,拍卖策略为首价拍卖,拍卖胜出的需求方提供的预算为K元,协调中心的服务预算为S元,那么拍卖定价为(K-S)元。再根据惩罚策略中每个惩罚项目所对应需要扣除的惩罚值,计算出每个目标贡献方的惩罚值。例如,惩罚项目有训练数据偏差和计算资源偏差,其中训练数据偏差的扣除基数为X,偏差权重分别为严重偏差100%,中度偏差50%,轻度偏差10%;计算资源偏差一律扣除Y。假设贡献方A的训练数据偏差为中度偏差,且有计算资源偏差,那么贡献方A的惩罚值为X*50%+Y。再根据贡献度衡量策略和预设的贡献基础值,确定每个目标贡献方的贡献值。最后,将拍卖值(取正数)、惩罚值(去负数)和贡献值(取正数)加起来,即可计算出每个目标贡献方应得的贡献分配资源(如收益)。As an example, the auction price may be calculated according to the auction strategy and the budget provided by the demander, and the auction price is the service budget of the coordination center deducted. For example, if the auction strategy is a first-price auction, the budget provided by the winning buyer is K yuan, and the service budget of the coordination center is S yuan, then the auction price is (K-S) yuan. Then calculate the penalty value of each target contributor according to the penalty value that needs to be deducted corresponding to each penalty item in the penalty strategy. For example, the penalty items include training data deviation and computing resource deviation. The deduction base for training data deviation is X, and the deviation weights are 100% for severe deviation, 50% for moderate deviation, and 10% for mild deviation; Y is deducted for all deviations in computing resources. . Assuming that Contributor A's training data deviation is moderate and there is computing resource deviation, then Contributor A's penalty value is X*50%+Y. Then, according to the contribution measurement strategy and the preset contribution base value, the contribution value of each target contributor is determined. Finally, add up the auction value (positive number), penalty value (negative number) and contribution value (positive number) to calculate the contribution allocation resources (such as income) that each target contributor should receive.
在一些实施例中,计算各个目标贡献方应得的贡献分配资源之后,还包括:In some embodiments, after calculating the contribution allocation resources due to each target contributor, it also includes:
使用预设的测试数据对全局模型进行推演预测,得到推演预测结果;Use the preset test data to perform deduction prediction on the global model, and obtain the deduction prediction result;
根据推演预测结果和测试数据的标签值的比较结果,确定全局模型的模型性能。Determine the model performance of the global model based on a comparison of the inference predictions and the label values of the test data.
其中,预设的测试数据,可以是带标签值的数据。Wherein, the preset test data may be data with tag values.
作为一示例,假设得到的全局模型为人脸识别模型,测试数据可以是带标签值的图片(可为人脸图片或其他图片(如动物图片等))。将这些测试数据输入到该人脸识别模型中,输出人脸识别结果(例如,是人脸,或者非人脸的二分类结果)。根据推演预测结果和测试数据的真实标签值进行比较,得出比较结果,再根据该比较结果确定全局模型的模型性能(比如,准确率、召回率等)。As an example, assuming that the obtained global model is a face recognition model, the test data may be pictures with labels (it may be a picture of a face or other pictures (such as pictures of animals, etc.)). These test data are input into the face recognition model, and a face recognition result (for example, a human face or a binary classification result of a non-human face) is output. According to the comparison between the deduction prediction result and the real label value of the test data, the comparison result is obtained, and then the model performance (such as accuracy rate, recall rate, etc.) of the global model is determined according to the comparison result.
在本公开实施例中,在每次迭代训练完成后,计算出各个目标参与方各自应得的贡献分配资源,可根据贡献分配资源从高至低对各个目标贡献方进行排序,同时,可对训练所得到的全局模型进行模型效用的测试,得出贡献方参与训练的情况与全局模型效用的对应关系。根据该对应关系,可以进一步分析在同一仿真训练机制下的最优资源分配方案。In the embodiment of the present disclosure, after each iteration of training is completed, the respective contribution allocation resources of each target participant are calculated, and each target contributor can be sorted from high to low according to the contribution allocation resources. At the same time, the The global model obtained by training is tested for model utility, and the corresponding relationship between the contribution party's participation in training and the utility of the global model is obtained. According to the corresponding relationship, the optimal resource allocation scheme under the same simulation training mechanism can be further analyzed.
此外,还可设定不同的仿真训练机制,参照上述仿真训练流程,对每个机制下的各个贡献方的资源分配值进行排名、统计,以及测试模型效用,以进一步分析在不同的仿真训练机制下,哪种机制可以让贡献方的行为更加符合激励目标,以及哪种机制可以让需求方能够获得效用更佳的模型。In addition, different simulation training mechanisms can also be set. Referring to the above simulation training process, the resource allocation values of each contributor under each mechanism can be ranked, counted, and the effectiveness of the test model can be further analyzed in different simulation training mechanisms. Next, which mechanism can make the behavior of the contributor more in line with the incentive goal, and which mechanism can enable the demander to obtain a model with better utility.
通过本公开实施例提供的系统能够设置各种不同的仿真训练机制,并通过各种仿真训练 机制可以模拟各种场景的联合训练,从而得出最优的资源分配方案,有利于激励贡献方积极参与到联合学习中,建立健壮的联合模型,同时可降低联合训练的综合成本。The system provided by the embodiments of the present disclosure can set various simulation training mechanisms, and through various simulation training mechanisms can simulate joint training in various scenarios, so as to obtain the optimal resource allocation plan, which is conducive to motivating contributors to actively Participate in joint learning, build a robust joint model, and reduce the overall cost of joint training.
应理解,上述实施例中各模块排列先后顺序并不意味着执行步骤顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the order in which the modules are arranged in the above embodiments does not mean the order in which the steps are executed, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
图5是本公开实施例提供的另一种基于联合学习的资源分配方法的时序图。如图5所示,该方法包括如下步骤:Fig. 5 is a sequence diagram of another resource allocation method based on joint learning provided by an embodiment of the present disclosure. As shown in Figure 5, the method includes the following steps:
需求方传输模块,按照预设的时间步长,向协调中心发送模型需求信息,模型需求信息包括需求模型;The demand-side transmission module sends model demand information to the coordination center according to the preset time step, and the model demand information includes the demand model;
贡献方传输模块,当接收到模型训练信息,且确定参与模型训练时,向协调中心发送应邀申请;Contributor transmission module, when receiving model training information and confirming to participate in model training, sends an invitation application to the coordination center;
协调中心,根据模型需求信息生成模型训练信息,并广播模型训练信息,模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;The coordination center generates model training information according to the model requirement information, and broadcasts the model training information. The model training information includes the preset basic model, training sample type, sample size required for each round and participation strategy;
根据应邀申请锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个目标贡献方使用其训练资源对基本模型进行训练,直至满足预设的结束条件,聚合各个目标贡献方提供的模型参数,得到全局模型,计算各个目标贡献方应得的贡献分配资源,将贡献分配资源反馈给相应的各个目标贡献方。According to the invited application, the training resources of at least two target contributors are locked, and the preset training program is started, so that each target contributor uses its training resources to train the basic model until the preset end conditions are met, and the aggregated contributions provided by each target contributor The model parameters of , get the global model, calculate the contribution allocation resources that each target contributor should get, and feed back the contribution allocation resources to the corresponding target contributors.
本公开实施例提供的技术方案,通过需求方按照预设的时间步长,向协调中心发送模型需求信息,模型需求信息包括需求模型;协调中心根据模型需求信息生成模型训练信息,并广播模型训练信息,模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;贡献方当接收到模型训练信息,且确定参与模型训练时,向协调中心发送应邀申请;协调中心根据应邀申请锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个目标贡献方使用其训练资源对基本模型进行训练,直至满足预设的结束条件,聚合各个目标贡献方提供的模型参数,得到全局模型,计算各个目标贡献方应得的贡献分配资源,将贡献分配资源反馈给相应的各个目标贡献方,能够相对公平、合理地分配联合学习成果,可更好地激励参与方参与训练和共享他们的模型。In the technical solution provided by the embodiments of the present disclosure, the demander sends model demand information to the coordination center according to the preset time step, and the model demand information includes the demand model; the coordination center generates model training information according to the model demand information, and broadcasts the model training Information, model training information includes the preset basic model, training sample type, sample size required for each round, and participation strategy; when the contributor receives the model training information and determines to participate in the model training, it sends an invitation application to the coordination center; According to the invited application, the coordination center locks the training resources of at least two target contributors, starts the preset training program, and enables each target contributor to use its training resources to train the basic model until the preset end conditions are met, and aggregates the contributions of each target The model parameters provided by the party can be used to obtain the global model, calculate the contribution allocation resources that each target contributor should deserve, and feed back the contribution allocation resources to the corresponding target contributors, so that the joint learning results can be allocated relatively fairly and reasonably, and better Incentivize parties to participate in training and share their models.
图6是本公开实施例提供的另一种基于联合学习的资源分配方法的流程示意图,该方法可以由图4中的协调中心401执行。如图6所示,该方法包括如下步骤:FIG. 6 is a schematic flowchart of another resource allocation method based on joint learning provided by an embodiment of the present disclosure, and the method may be executed by the coordination center 401 in FIG. 4 . As shown in Figure 6, the method includes the following steps:
步骤S601,接收需求方发送的模型需求信息;其中,所述需求方为多个参与方之一;Step S601, receiving model demand information sent by a demander; wherein, the demander is one of multiple participants;
步骤S602,根据所述模型需求信息生成模型训练信息,并广播所述模型训练信息,所述模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;Step S602, generating model training information according to the model requirement information, and broadcasting the model training information, the model training information including the preset basic model, training sample type, sample size required for each round and participation strategy;
步骤S603,确定所述多个参与方中的贡献方;Step S603, determining a contributing party among the multiple participating parties;
步骤S604,响应于贡献方发送的基于所述模型训练信息确定参与模型训练的消息;Step S604, responding to the message sent by the contributor to determine participation in model training based on the model training information;
步骤S605,根据所述消息锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个所述目标贡献方使用其训练资源对所述基本模型进行训练,直至满足预设的结束条件,聚合各个所述目标贡献方提供的模型参数,得到全局模型,计算各个所述目标贡献方应得的贡献分配资源,将所述贡献分配资源反馈给相应的各个目标贡献方。Step S605, lock the training resources of at least two target contributors according to the message, and start a preset training program, so that each of the target contributors uses its training resources to train the basic model until the preset end is satisfied. Conditions, aggregate the model parameters provided by each of the target contributors to obtain a global model, calculate the contribution allocation resources that each of the target contributors should receive, and feed back the contribution allocation resources to the corresponding target contributors.
其中,上述的贡献方发送的基于所述模型训练信息确定参与模型训练的消息,是指前文所述的应邀申请。Wherein, the above-mentioned message sent by the contributor to determine participation in model training based on the model training information refers to the above-mentioned invitation application.
作为一示例,确定多个参与方中的贡献方,具体可以是首先根据需求方发送的模型需求信息生成的模型训练信息,确定符合该模型训练信息中的相关要求的参与方,然后可进一步通过系统预设的计算机程序将这些参与方配置为贡献方。As an example, to determine the contributor among the multiple participants, specifically, it may firstly determine the participant that meets the relevant requirements in the model training information based on the model training information generated by the model requirement information sent by the demander, and then further pass A computer program programmed into the system configures these parties as contributors.
作为另一示例,还可以是通过系统预设的计算机程序根据预设的配置信息,初始化生成至少一个需求方和多个贡献方。As another example, at least one demander and multiple contributors may be initialized and generated through a computer program preset by the system according to preset configuration information.
作为又一示例,还可以是将处于同一联合学习社区中参与同一联合学习任务且没有向协调中心提交模型需求信息的参与方确定为贡献方,向协调中心提交模型需求信息的参与方则确定为需求方。As yet another example, participants who participate in the same joint learning task in the same joint learning community and have not submitted model requirement information to the coordination center may be determined as contributors, and participants who submit model requirement information to the coordination center may be determined as demand side.
在一些实施例中,上述步骤,根据模型需求信息生成模型训练信息,并广播模型训练信息,包括:In some embodiments, the above steps, generating model training information according to model requirement information, and broadcasting model training information include:
根据模型需求信息和预设的配置信息,生成模型训练信息;Generate model training information according to model requirement information and preset configuration information;
将模型训练信息发送至处于空闲状态的空闲贡献方和/或资源富余贡献方。Send model training information to idle contributors and/or resource-abundant contributors in an idle state.
在一些实施例中,上述步骤,将模型训练信息发送至资源富余贡献方,包括:In some embodiments, the above steps of sending model training information to resource surplus contributors include:
收集所有贡献方的资源状态信息,资源状态信息包括贡献方的计算资源信息和通信资源信息;Collect the resource status information of all contributors, including the computing resource information and communication resource information of the contributor;
根据资源状态信息判断贡献方是否属于资源富余贡献方;Judging whether the contributor is a resource surplus contributor according to the resource status information;
若是,则将模型训练信息发送至资源富余贡献方。If yes, send the model training information to the resource surplus contributor.
在一些实施例中,上述步骤,将模型训练信息发送至处于空闲状态的空闲贡献方,包括:In some embodiments, the above steps of sending model training information to idle contributors in an idle state include:
获取所有贡献方的训练任务执行状态信息;Get the training task execution status information of all contributors;
根据训练任务执行状态信息,确定当前处于空闲状态的空闲贡献方,或者,当前执行的训练任务即将完成,可在预设的时间节点参与下一训练任务的空闲贡献方。According to the execution status information of the training task, determine the idle contributor who is currently idle, or the idle contributor who can participate in the next training task at the preset time node when the currently executing training task is about to be completed.
在一些实施例中,上述步骤,广播模型训练信息,包括:In some embodiments, the above steps, broadcasting model training information, include:
确定每个贡献方所拥有的训练数据与需求模型的关联程度;Determine how relevant the training data each contributor has to the required model;
将关联程度满足预设的关联阈值的贡献方确定为拥有与需求模型相匹配的训练数据的贡献方;Determine the contributor whose degree of association meets the preset association threshold as the contributor who has training data that matches the demand model;
向拥有与需求模型相匹配的训练数据的贡献方下发模型训练信息。Send model training information to contributors who have training data that matches the required model.
在一些实施例中,当贡献方接收到模型训练信息时,查询并判断预设的资源库中是否存储有与训练样本类型相同且数量满足每轮次所需样本量的训练样本;若有,则判断按照参与策略参与训练是否可达到其预设的期望回报资源;若可达到其预设的期望资源回报,则向协调中心发送确定参与模型训练的消息。In some embodiments, when the contributor receives the model training information, it queries and determines whether there are training samples of the same type as the training samples and the number of which meets the required sample size for each round in the preset resource library; if so, Then judge whether participating in the training according to the participation strategy can achieve the preset expected return resources; if the preset expected resource return can be achieved, send a message to the coordination center to confirm participation in model training.
在一些实施例中,上述步骤,根据消息锁定至少两个目标贡献方的训练资源,包括:In some embodiments, the above step of locking the training resources of at least two target contributors according to the message includes:
接收多个贡献方发送的应邀申请,应邀申请包括训练资源和期望回报资源;Receive invited applications from multiple contributors, including training resources and expected return resources;
根据训练资源和期望回报资源,确定至少两个目标贡献方,并向目标贡献方下发中标通知,以锁定目标贡献方的训练资源。Determine at least two target contributors based on the training resources and expected return resources, and issue a bid winning notice to the target contributors to lock the training resources of the target contributors.
在一些实施例中,上述步骤,计算各个目标贡献方应得的贡献分配资源,包括:In some embodiments, the above steps of calculating the contribution allocation resources that each target contributor should receive include:
根据预设的贡献度衡量策略、拍卖策略和惩罚策略,计算各个目标贡献方应得的贡献分配资源。According to the preset contribution measurement strategy, auction strategy and penalty strategy, calculate the contribution allocation resources that each target contributor should deserve.
在一些实施例中,上述步骤,计算各个目标贡献方应得的贡献分配资源之后,还包括:In some embodiments, the above steps, after calculating the contribution allocation resources that each target contributor should receive, further include:
使用预设的测试数据对全局模型进行推演预测,得到推演预测结果;Use the preset test data to perform deduction prediction on the global model, and obtain the deduction prediction result;
根据推演预测结果和测试数据的标签值的比较结果,确定全局模型的模型性能。Determine the model performance of the global model based on a comparison of the inference predictions and the label values of the test data.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the sequence numbers of the steps in the above embodiments do not mean the order of execution, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
图7是本公开实施例提供的电子设备700的示意图。如图7所示,该实施例的电子设备700包括:处理器701、存储器702以及存储在该存储器702中并且可在处理器701上运行的计算机程序703。处理器701执行计算机程序703时实现上述各个方法实施例中的步骤。或者,处理器701执行计算机程序703时实现上述各装置实施例中各模块/单元的功能。FIG. 7 is a schematic diagram of an electronic device 700 provided by an embodiment of the present disclosure. As shown in FIG. 7 , an electronic device 700 in this embodiment includes: a processor 701 , a memory 702 , and a computer program 703 stored in the memory 702 and operable on the processor 701 . When the processor 701 executes the computer program 703, the steps in the foregoing method embodiments are implemented. Alternatively, when the processor 701 executes the computer program 703, the functions of the modules/units in the foregoing device embodiments are realized.
示例性地,计算机程序703可以被分割成一个或多个模块/单元,一个或多个模块/单元被存储在存储器702中,并由处理器701执行,以完成本公开。一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述计算机程序703在电子设备700中的执行过程。Exemplarily, the computer program 703 can be divided into one or more modules/units, and one or more modules/units are stored in the memory 702 and executed by the processor 701 to complete the present disclosure. One or more modules/units may be a series of computer program instruction segments capable of accomplishing specific functions, and the instruction segments are used to describe the execution process of the computer program 703 in the electronic device 700 .
电子设备700可以是桌上型计算机、笔记本、掌上电脑及云端服务器等电子设备。电子设备700可以包括但不仅限于处理器701和存储器702。本领域技术人员可以理解,图7仅 仅是电子设备700的示例,并不构成对电子设备700的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如,电子设备还可以包括输入输出设备、网络接入设备、总线等。The electronic device 700 may be an electronic device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The electronic device 700 may include but not limited to a processor 701 and a memory 702 . Those skilled in the art can understand that FIG. 7 is only an example of the electronic device 700, and does not constitute a limitation to the electronic device 700. It may include more or less components than shown in the figure, or combine certain components, or different components. , for example, an electronic device may also include an input and output device, a network access device, a bus, and the like.
处理器701可以是中央处理单元(Central Processing Unit,CPU),也可以是其它通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其它可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。 Processor 701 can be a central processing unit (Central Processing Unit, CPU), and can also be other general processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, and the like.
存储器702可以是电子设备700的内部存储单元,例如,电子设备700的硬盘或内存。存储器702也可以是电子设备700的外部存储设备,例如,电子设备700上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器702还可以既包括电子设备700的内部存储单元也包括外部存储设备。存储器702用于存储计算机程序以及电子设备所需的其它程序和数据。存储器702还可以用于暂时地存储已经输出或者将要输出的数据。The storage 702 may be an internal storage unit of the electronic device 700 , for example, a hard disk or a memory of the electronic device 700 . The memory 702 can also be an external storage device of the electronic device 700, for example, a plug-in hard disk equipped on the electronic device 700, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash memory card ( Flash Card), etc. Further, the memory 702 may also include both an internal storage unit of the electronic device 700 and an external storage device. The memory 702 is used to store computer programs and other programs and data required by the electronic device. The memory 702 can also be used to temporarily store data that has been output or will be output.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of description, only the division of the above-mentioned functional units and modules is used for illustration. In practical applications, the above-mentioned functions can be assigned to different functional units, Completion of modules means that the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit, and the above-mentioned integrated units may adopt hardware It can also be implemented in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing each other, and are not used to limit the protection scope of the present application. For the specific working processes of the units and modules in the above system, reference may be made to the corresponding processes in the aforementioned method embodiments, and details will not be repeated here.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the above-mentioned embodiments, the descriptions of each embodiment have their own emphases, and for parts that are not detailed or recorded in a certain embodiment, refer to the relevant descriptions of other embodiments.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本公开的范围。Those skilled in the art can appreciate that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementation should not be considered beyond the scope of the present disclosure.
在本公开所提供的实施例中,应该理解到,所揭露的装置/电子设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/电子设备实施例仅仅是示意性的,例如,模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided in the present disclosure, it should be understood that the disclosed device/electronic equipment and method may be implemented in other ways. For example, the device/electronic device embodiments described above are only illustrative. For example, the division of modules or units is only a logical function division. In actual implementation, there may be other division methods. Multiple units or components can be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。A unit described as a separate component may or may not be physically separated, and a component displayed as a unit may or may not be a physical unit, that is, it may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本公开各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present disclosure may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。基于这样的理解,本公开实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,计算机程序可以存储在计算机可读存储介质中,该计算机程序在被处理器执行时,可以实现上述各个方法实施例的步骤。计算机程序可以包括计算机程序代码,计算机程序代码可以为源代码形式、对象代码形 式、可执行文件或某些中间形式等。计算机可读介质可以包括:能够携带计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、电载波信号、电信信号以及软件分发介质等。需要说明的是,计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如,在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。If an integrated module/unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the present disclosure realizes all or part of the processes in the methods of the above embodiments, and can also be completed by instructing related hardware through computer programs. The computer programs can be stored in computer-readable storage media, and the computer programs can be processed. When executed by the controller, the steps in the above-mentioned method embodiments can be realized. A computer program may include computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (Read-Only Memory, ROM), random access Memory (Random Access Memory, RAM), electrical carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in computer readable media may be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction. For example, in some jurisdictions, computer readable media may not Including electrical carrier signals and telecommunication signals.
以上实施例仅用以说明本公开的技术方案,而非对其限制;尽管参照前述实施例对本公开进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本公开各实施例技术方案的精神和范围,均应包含在本公开的保护范围之内。The above embodiments are only used to illustrate the technical solutions of the present disclosure, rather than to limit them; although the present disclosure has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be described in the foregoing embodiments Modifications to the technical solutions recorded, or equivalent replacements for some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present disclosure, and should be included in this disclosure. within the scope of protection.

Claims (15)

  1. 一种基于联合学习的资源分配方法,其特征在于,包括:A resource allocation method based on joint learning, characterized in that it includes:
    读取预设的资源分配配置信息,所述资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;Reading preset resource allocation configuration information, the resource allocation configuration information includes attribute configuration information, contribution degree configuration information and monitoring configuration information;
    获取多个模型需求方提供的模型需求信息,根据所述属性配置信息和所述模型需求信息,确定目标需求方;Obtaining model demand information provided by multiple model demand parties, and determining a target demand party according to the attribute configuration information and the model demand information;
    确定与所述模型需求信息匹配的多个目标资源贡献方,获取每个所述目标资源贡献方的模型资源,所述模型资源包括模型参数及有效训练数据量;Determining multiple target resource contributors that match the model requirement information, and acquiring model resources of each target resource contributor, where the model resources include model parameters and effective training data volumes;
    根据所述属性配置信息、贡献度配置信息、监控配置信息及所述模型资源,确定每个所述目标资源贡献方对应的分配值,并将所述分配值反馈至各个所述目标资源贡献方。According to the attribute configuration information, contribution degree configuration information, monitoring configuration information and the model resources, determine the allocation value corresponding to each of the target resource contributors, and feed back the allocation value to each of the target resource contributors .
  2. 根据权利要求1所述的方法,其特征在于,所述模型需求信息包括需求模型;The method according to claim 1, wherein the model requirement information includes a requirement model;
    所述根据所述属性配置信息和所述模型需求信息,确定目标需求方,包括:The determining the target demander according to the attribute configuration information and the model requirement information includes:
    根据所述需求模型对所述多个模型需求方进行分类,得到与每个所述需求模型对应的模型需求方集合;classify the plurality of model demanders according to the demand model, and obtain a set of model demanders corresponding to each of the demand models;
    根据所述属性配置信息,从所述模型需求方集合中筛选出目标需求方。According to the attribute configuration information, a target demander is selected from the set of model demanders.
  3. 根据权利要求2所述的方法,其特征在于,所述根据所述属性配置信息,从所述模型需求方集合中筛选出目标需求方,包括:获取所述模型需求方集合中的每个模型需求方的预算资源;将所述预算资源最多的模型需求方确定为目标需求方;The method according to claim 2, wherein, according to the attribute configuration information, selecting the target demander from the set of model demanders comprises: acquiring each model in the set of model demanders Budget resources of the demand side; determine the model demand side with the most budget resources as the target demand side;
    所述确定与所述模型需求信息匹配的多个目标资源贡献方,包括:获取多个资源贡献方上报的待拍卖模型的模型信息,所述待拍卖模型的模型信息包括待拍卖模型的模型类型;计算所述待拍卖模型的模型类型与所述需求模型的模型类型之间的相似度,根据所述相似度确定与所述需求模型的模型类型相匹配的多个目标资源贡献方。The determining multiple target resource contributors that match the model demand information includes: obtaining model information of models to be auctioned reported by multiple resource contributors, the model information of models to be auctioned includes model types of models to be auctioned ; Calculate the similarity between the model type of the model to be auctioned and the model type of the demand model, and determine a plurality of target resource contributors matching the model type of the demand model according to the similarity.
  4. 根据权利要求3所述的方法,其特征在于,所述获取所述多个资源贡献方上报的待拍卖模型的模型信息之前,还包括:The method according to claim 3, wherein before acquiring the model information of the models to be auctioned reported by the multiple resource contributors, further comprising:
    通过预设的通信通道广播模型招标信息,以使各个资源贡献方接收所述模型招标信息,所述模型招标信息包括需求模型、所需训练样本、所需样本数量和激励系数;Broadcast model bidding information through a preset communication channel, so that each resource contributor receives the model bidding information, and the model bidding information includes a demand model, required training samples, required number of samples, and incentive coefficients;
    接收多个资源贡献方基于所述模型招标信息反馈的待拍卖模型的模型信息。The model information of the model to be auctioned based on the model bidding information fed back by multiple resource contributors is received.
  5. 根据权利要求1所述的方法,其特征在于,所述根据所述属性配置信息、贡献度配置信息、监控配置信息及所述模型资源,确定每个所述目标资源贡献方对应的分配值,包括:The method according to claim 1, characterized in that, according to the attribute configuration information, contribution degree configuration information, monitoring configuration information and the model resources, the allocation value corresponding to each of the target resource contributors is determined, include:
    根据所述属性配置信息和预设的第一权重,计算每个所述目标资源贡献方的拍卖值;calculating the auction value of each of the target resource contributors according to the attribute configuration information and the preset first weight;
    根据所述贡献度配置信息、所述模型参数、有效训练数据量及预设的第二权重,计算每个所述目标资源贡献方的贡献值;calculating the contribution value of each target resource contributor according to the contribution configuration information, the model parameters, the amount of effective training data, and the preset second weight;
    根据所述监控配置信息和预设的第三权重,计算每个所述目标资源贡献方的惩罚值;calculating a penalty value for each target resource contributor according to the monitoring configuration information and a preset third weight;
    根据所述拍卖值、贡献值和惩罚值,确定每个所述目标资源贡献方对应的分配值。According to the auction value, the contribution value and the penalty value, an allocation value corresponding to each of the target resource contributors is determined.
  6. 根据权利要求5所述的方法,其特征在于,所述根据所述监控配置信息和预设的第三权重,计算每个所述目标资源贡献方的惩罚值,包括:The method according to claim 5, wherein the calculation of the penalty value of each of the target resource contributors according to the monitoring configuration information and the preset third weight includes:
    确定每个所述目标资源贡献方的第一惩罚项、第二惩罚项和第三惩罚项;determining a first penalty term, a second penalty term, and a third penalty term for each of said target resource contributors;
    根据所述第一惩罚项、第二惩罚项和第三惩罚项,计算每个所述目标资源贡献方的惩罚值。Calculate a penalty value for each target resource contributor according to the first penalty item, the second penalty item, and the third penalty item.
  7. 一种基于联合学习的资源分配装置,其特征在于,包括:A resource allocation device based on joint learning, characterized in that it includes:
    读取模块,被配置为读取预设的资源分配配置信息,所述资源分配配置信息包括属性配置信息、贡献度配置信息及监控配置信息;The reading module is configured to read preset resource allocation configuration information, and the resource allocation configuration information includes attribute configuration information, contribution degree configuration information and monitoring configuration information;
    需求方确定模块,被配置为获取多个模型需求方提供的模型需求信息,根据所述属性配置信息和所述模型需求信息,确定目标需求方;The demander determination module is configured to obtain model demand information provided by multiple model demand parties, and determine a target demand party according to the attribute configuration information and the model demand information;
    资源获取模块,被配置为确定与所述模型需求信息匹配的多个目标资源贡献方,获取每个所述目标资源贡献方的模型资源,所述模型资源包括模型参数及有效训练数据量;The resource acquisition module is configured to determine multiple target resource contributors that match the model requirement information, and acquire model resources of each target resource contributor, where the model resources include model parameters and effective training data volumes;
    分配模块,被配置为根据所述属性配置信息、贡献度配置信息、监控配置信息及所述模型资源,确定每个所述目标资源贡献方对应的分配值,并将所述分配值反馈至各个所述目标资源贡献方。The allocation module is configured to determine the allocation value corresponding to each of the target resource contributors according to the attribute configuration information, contribution configuration information, monitoring configuration information, and the model resources, and feed back the allocation value to each The target resource contributor.
  8. 一种基于联合学习的资源分配系统,其特征在于,包括协调中心,分别与所述协调中心通信连接的贡献方传输模块和需求方传输模块;A resource allocation system based on joint learning, characterized in that it includes a coordination center, a contributor transmission module and a demand transmission module respectively connected to the coordination center in communication;
    所述需求方传输模块,被配置为按照预设的时间步长,向所述协调中心发送模型需求信息,所述模型需求信息包括需求模型;The demander transmission module is configured to send model demand information to the coordination center according to a preset time step, and the model demand information includes a demand model;
    所述贡献方传输模块,被配置为当接收到模型训练信息,且确定参与模型训练时,向所述协调中心发送应邀申请;The contributor transmission module is configured to send an invitation application to the coordination center when receiving model training information and determining to participate in model training;
    所述协调中心,被配置为根据所述模型需求信息生成模型训练信息,并广播所述模型训练信息,所述模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;The coordination center is configured to generate model training information according to the model requirement information, and broadcast the model training information, the model training information includes a preset basic model, training sample type, sample size required for each round and engagement strategies;
    根据所述应邀申请锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个所述目标贡献方使用其训练资源对所述基本模型进行训练,直至满足预设的结束条件,得到全局模型,计算各个所述目标贡献方应得的贡献分配资源,将所述贡献分配资源反馈给相应的各个目标贡献方。lock the training resources of at least two target contributors according to the invited application, and start a preset training program, so that each of the target contributors uses its training resources to train the basic model until the preset end condition is met, Obtain the global model, calculate the contribution allocation resources that each of the target contributors should have, and feed back the contribution allocation resources to the corresponding target contributors.
  9. 根据权利要求8所述的系统,其特征在于,所述根据所述模型需求信息生成模型训练信息,并广播所述模型训练信息,包括:The system according to claim 8, wherein said generating model training information according to said model requirement information, and broadcasting said model training information comprises:
    根据所述模型需求信息和预设的配置信息,生成模型训练信息;generating model training information according to the model requirement information and preset configuration information;
    将所述模型训练信息发送至处于空闲状态的空闲贡献方和/或资源富余贡献方。Sending the model training information to idle contributors and/or resource surplus contributors in an idle state.
  10. 根据权利要求9所述的系统,其特征在于,所述将所述模型训练信息发送至资源富余贡献方,包括:收集所有贡献方的资源状态信息,所述资源状态信息包括所述贡献方的计算资源信息和通信资源信息;根据所述资源状态信息判断所述贡献方是否属于资源富余贡献方;若是,则将所述模型训练信息发送至所述资源富余贡献方;The system according to claim 9, wherein the sending the model training information to resource surplus contributors comprises: collecting resource status information of all contributors, the resource status information including the contributors Computing resource information and communication resource information; judging whether the contributor belongs to a resource surplus contributor according to the resource state information; if so, sending the model training information to the resource surplus contributor;
    或者,所述将所述模型训练信息发送至处于空闲状态的空闲贡献方,包括:获取所有贡献方的训练任务执行状态信息;根据所述训练任务执行状态信息,确定当前处于空闲状态的空闲贡献方,或者,当前执行的训练任务即将完成,可在预设的时间节点参与下一训练任务的空闲贡献方。Alternatively, the sending the model training information to the idle contributors in the idle state includes: obtaining the training task execution status information of all contributors; determining the idle contributors currently in the idle state according to the training task execution status information Party, or, the currently executing training task is about to be completed, and can participate in the idle contributor of the next training task at the preset time node.
  11. 根据权利要求8所述的系统,其特征在于,所述广播所述模型训练信息,包括:确定每个所述贡献方所拥有的训练数据与所述需求模型的关联程度;将所述关联程度满足预设的关联阈值的贡献方确定为拥有与所述需求模型相匹配的训练数据的贡献方;The system according to claim 8, wherein the broadcasting of the model training information includes: determining the degree of association between the training data owned by each of the contributors and the demand model; Contributors that meet a preset association threshold are determined as contributors that have training data that matches the requirement model;
    向拥有与所述需求模型相匹配的训练数据的贡献方下发所述模型训练信息;Sending the model training information to contributors who have training data matching the demand model;
    或者,所述当接收到所述模型训练信息,且确定参与模型训练时,向所述协调中心发送应邀申请,包括:当接收到所述模型训练信息时,查询并判断预设的资源库中是否存储有与所述训练样本类型相同且数量满足所述每轮次所需样本量的训练样本;若有,则判断按照所述参与策略参与训练是否可达到其预设的期望回报资源;若可达到其预设的期望资源回报,则向所述协调中心发送应邀申请。Alternatively, when receiving the model training information and determining to participate in the model training, sending an invitation application to the coordination center includes: when receiving the model training information, querying and judging the preset resource library Whether there are training samples of the same type as the training samples and the number of samples required for each round are stored; if so, it is judged whether participation in training according to the participation strategy can achieve its preset expected return resources; if If the preset expected resource return can be achieved, an invited application is sent to the coordination center.
    或者,所述根据所述应邀申请锁定至少两个目标贡献方的训练资源,包括:接收多个贡献方发送的应邀申请,所述应邀申请包括训练资源和期望回报资源;根据所述训练资源和期望回报资源,确定至少两个目标贡献方,并向所述目标贡献方下发中标通知,以锁定所述目标贡献方的训练资源。Alternatively, the locking the training resources of at least two target contributors according to the invitation application includes: receiving invitation applications sent by multiple contributors, the invitation application including training resources and expected return resources; according to the training resources and It is expected to return resources, determine at least two target contributors, and issue a bid winning notification to the target contributors, so as to lock the training resources of the target contributors.
  12. 根据权利要求8所述的系统,其特征在于,所述计算各个所述目标贡献方应得的贡献分配资源,包括:根据预设的贡献度衡量策略、拍卖策略和惩罚策略,计算各个所述目标贡献方应得的贡献分配资源;The system according to claim 8, wherein the calculation of the contribution allocation resources that each of the target contributors should receive includes: calculating each of the target contributors according to the preset contribution measurement strategy, auction strategy and penalty strategy. Contribution allocation resources due to target contributors;
    或者,所述计算各个所述目标贡献方应得的贡献分配资源之后,还包括:使用预设的测试数据对所述全局模型进行推演预测,得到推演预测结果;根据所述推演预测结果和所述测试数据的标签值的比较结果,确定所述全局模型的模型性能。Alternatively, after the calculation of the resource allocation due to the contribution of each of the target contributors, it further includes: using preset test data to perform deduction prediction on the global model to obtain a deduction prediction result; according to the deduction prediction result and the obtained The model performance of the global model is determined by comparing the label values of the test data.
  13. 一种基于联合学习的资源分配系统的资源分配方法,其特征在于,包括:A resource allocation method based on a joint learning resource allocation system, characterized in that it includes:
    接收需求方发送的模型需求信息;其中,所述需求方为多个参与方之一;Receiving model demand information sent by the demander; wherein, the demander is one of the multiple participants;
    根据所述模型需求信息生成模型训练信息,并广播所述模型训练信息,所述模型训练信息包括预设的基本模型、训练样本类型、每轮次所需样本量及参与策略;Generate model training information according to the model requirement information, and broadcast the model training information, the model training information includes a preset basic model, training sample type, sample size required for each round, and participation strategy;
    确定所述多个参与方中的贡献方;determining a contributing party of the plurality of parties;
    响应于贡献方发送的基于所述模型训练信息确定参与模型训练的消息;Responding to a message sent by the contributor to determine participation in model training based on the model training information;
    根据所述消息锁定至少两个目标贡献方的训练资源,启动预设的训练程序,使各个所述目标贡献方使用其训练资源对所述基本模型进行训练,直至满足预设的结束条件,聚合各个所述目标贡献方提供的模型参数,得到全局模型,计算各个所述目标贡献方应得的贡献分配资源,将所述贡献分配资源反馈给相应的各个目标贡献方。Lock the training resources of at least two target contributors according to the message, start a preset training program, and make each of the target contributors use their training resources to train the basic model until the preset end condition is met, aggregate The model parameters provided by each of the target contributors are used to obtain a global model, and the contribution allocation resources due to each of the target contributors are calculated, and the contribution allocation resources are fed back to the corresponding target contributors.
  14. 一种电子设备,包括存储器、处理器以及存储在所述存储器中并且可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1所述方法的步骤。An electronic device, comprising a memory, a processor, and a computer program stored in the memory and operable on the processor, characterized in that, when the processor executes the computer program, the computer program according to claim 1 is implemented. steps of the method described above.
  15. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1所述方法的步骤。A computer-readable storage medium storing a computer program, wherein the computer program implements the steps of the method according to claim 1 when the computer program is executed by a processor.
PCT/CN2022/117168 2022-01-17 2022-09-06 Resource allocation method, apparatus and system based on federated learning WO2023134181A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN202210050167.3A CN116502811A (en) 2022-01-17 2022-01-17 Resource allocation system and method based on joint learning
CN202210050167.3 2022-01-17
CN202210050177.7 2022-01-17
CN202210050177.7A CN116502812A (en) 2022-01-17 2022-01-17 Resource allocation method and device for joint learning

Publications (1)

Publication Number Publication Date
WO2023134181A1 true WO2023134181A1 (en) 2023-07-20

Family

ID=87280091

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/117168 WO2023134181A1 (en) 2022-01-17 2022-09-06 Resource allocation method, apparatus and system based on federated learning

Country Status (1)

Country Link
WO (1) WO2023134181A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050171786A1 (en) * 2004-01-29 2005-08-04 International Business Machines Corporation Method for fair sharing limited resources between multiple customers
CN112650583A (en) * 2020-12-23 2021-04-13 新智数字科技有限公司 Resource allocation method, device, readable medium and electronic equipment
US20210133663A1 (en) * 2017-03-07 2021-05-06 B.G. Negev Technologies And Applications Ltd., At Ben-Gurion University Market equilibrium mechanism for task allocation
CN112784994A (en) * 2020-12-31 2021-05-11 浙江大学 Block chain-based federated learning data participant contribution value calculation and excitation method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050171786A1 (en) * 2004-01-29 2005-08-04 International Business Machines Corporation Method for fair sharing limited resources between multiple customers
US20210133663A1 (en) * 2017-03-07 2021-05-06 B.G. Negev Technologies And Applications Ltd., At Ben-Gurion University Market equilibrium mechanism for task allocation
CN112650583A (en) * 2020-12-23 2021-04-13 新智数字科技有限公司 Resource allocation method, device, readable medium and electronic equipment
CN112784994A (en) * 2020-12-31 2021-05-11 浙江大学 Block chain-based federated learning data participant contribution value calculation and excitation method

Similar Documents

Publication Publication Date Title
Gao et al. A survey of incentive mechanisms for participatory sensing
US20100023386A1 (en) Social networking platform for intellectual property assets
CN110928951A (en) Data sharing and arbitration method based on block chain
Zhang et al. Privacy-preserving crowdsensing: Privacy valuation, network effect, and profit maximization
Zeng et al. Incentive mechanisms in federated learning and a game-theoretical approach
Xu et al. An incentive mechanism for crowdsourcing markets with social welfare maximization in cloud‐edge computing
CN115292413A (en) Crowd sensing excitation method based on block chain and federal learning
Dong et al. Federated learning service market: A game theoretic analysis
CN109919616B (en) Agricultural product transaction control method, device and maintenance system based on block chain
CN112907340B (en) Excitation method and system based on two-way auction model in crowd sensing
Wu et al. Fedab: Truthful federated learning with auction-based combinatorial multi-armed bandit
CN114202018A (en) Modular joint learning method and system
WO2023134181A1 (en) Resource allocation method, apparatus and system based on federated learning
CN116451806A (en) Federal learning incentive distribution method and device based on block chain
Allahbakhsh et al. Harnessing implicit teamwork knowledge to improve quality in crowdsourcing processes
CN110599337A (en) Alliance chain safety incentive method based on crowd sensing technology
Liew et al. Economics of semantic communication in metaverse: An auction approach
CN114219338A (en) Resource allocation method and device based on joint learning
Coleff et al. Information provision in procurement auctions
Phoomvuthisarn A survey study on reputation-based trust mechanisms in service-oriented computing
CN113888016A (en) Personalized customization system based on block chain
Xu et al. CAB: a combinatorial-auction-and-bargaining-based federated learning incentive mechanism
CN113436011A (en) Intelligent contract processing method, device, equipment and computer readable storage medium
Bai et al. A fair incentive mechanism in federated learning
CN116502811A (en) Resource allocation system and method based on joint learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22919818

Country of ref document: EP

Kind code of ref document: A1