CN114860416A

CN114860416A - Distributed multi-agent detection task assignment method and device in confrontation scene

Info

Publication number: CN114860416A
Application number: CN202210633921.6A
Authority: CN
Inventors: 刘华平; 李阳; 张新钰
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2022-06-06
Filing date: 2022-06-06
Publication date: 2022-08-05
Anticipated expiration: 2042-06-06
Also published as: CN114860416B

Abstract

The application discloses a distributed multi-agent detection task allocation method and device in a confrontation scene, wherein the method comprises the following steps: establishing a multi-agent detection efficiency function according to detection task distribution weight, target detector configuration and detection capacity carried by the multi-agent system, target disguise strategy and multi-agent detection task distribution strategy; establishing a distributed multi-agent detection task distribution model according to preset multi-agent detection capability constraint, target disguising capability constraint and detection task constraint; and alternately solving a task allocation strategy and a corresponding target disguise strategy of the multi-agent system according to the gradient information to generate a distributed multi-agent detection task allocation result. Therefore, the technical problem that the execution result of the detection task is easily influenced by the target disguise strategy due to the fact that the potential disguise countermeasure process of the target cannot be fully considered through a distributed detection task allocation method in the related art is solved, and therefore the acquisition amount of target characteristic information is reduced.

Description

Distributed multi-agent detection task assignment method and device in confrontation scene

技术领域technical field

本申请涉及多智能体系统智能决策技术领域，特别涉及一种对抗场景中的分布式多智能体探测任务分配方法及装置。The present application relates to the technical field of intelligent decision-making in multi-agent systems, and in particular, to a method and device for allocating distributed multi-agent detection tasks in confrontation scenarios.

背景技术Background technique

任务分配是多智能体在协同目标探测中的关键问题，好的任务分配可以充分发挥不同智能体的能力优势，尤其是在对抗场景下，目标可以伪装自身的一类特征信息、造成部分探测手段失效，因此，根据智能体携带的探测器的类型与能力，正确匹配智能体集合与探测任务集合，是获取更多目标特征信息的重要基础。Task allocation is a key issue for multi-agents in collaborative target detection. Good task allocation can give full play to the capabilities and advantages of different agents. Especially in confrontation scenarios, the target can disguise its own feature information and cause some detection methods. Therefore, according to the type and capability of the detector carried by the agent, correctly matching the agent set and the detection task set is an important basis for obtaining more target feature information.

任务分配方法按照多智能体系统的组织形式可分为集中式、分布式两类。集中式任务分配方法由一个中心节点收集所有智能体信息，计算并向所有智能体发送任务分配结果，中心节点可以是多智能体系统中的某个智能体或中心基站。而在分布式任务分配方法中，每个智能体可以根据自身状态信息及通信结果，自主决策并动态调整任务分配结果，相较于集中式任务分配方法，具有更高的灵活性和更强的鲁棒性，受到智能体规模的影响更小。Task allocation methods can be divided into two types: centralized and distributed according to the organization of the multi-agent system. In the centralized task assignment method, a central node collects the information of all agents, calculates and sends the task assignment results to all agents. The central node can be an agent or a central base station in a multi-agent system. In the distributed task assignment method, each agent can make autonomous decisions and dynamically adjust the task assignment results according to its own state information and communication results. Compared with the centralized task assignment method, it has higher flexibility and stronger performance. Robustness, less affected by agent size.

然而，相关技术中分布式探测任务分配方法，尚未充分考虑目标潜在的伪装对抗过程，探测任务的执行结果易受目标伪装策略的影响，有待改进。However, the distributed detection task assignment method in the related art has not fully considered the potential camouflage confrontation process of the target, and the execution result of the detection task is easily affected by the target camouflage strategy, which needs to be improved.

发明内容SUMMARY OF THE INVENTION

本申请提供一种对抗场景中的分布式多智能体探测任务分配方法及装置，以解决相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题。The present application provides a distributed multi-agent detection task assignment method and device in a confrontation scenario, so as to solve the problem that the distributed detection task assignment method in the related art fails to fully consider the potential camouflage confrontation process of the target, resulting in the detection task The execution result is easily affected by the target camouflage strategy, thus reducing the technical problem of the acquisition of target feature information.

本申请第一方面实施例提供一种对抗场景中的分布式多智能体探测任务分配方法，包括以下步骤：根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略建立多智能体探测效能函数；根据预设的多智能体探测能力约束、目标的伪装能力约束、探测任务约束建立分布式多智能体探测任务分配模型；基于所述分布式多智能体探测任务分配模型，根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，生成分布式多智能体探测任务分配结果。The embodiment of the first aspect of the present application provides a distributed multi-agent detection task assignment method in a confrontation scenario, including the following steps: assigning weights according to detection tasks, target detector configurations and detection capabilities carried by a multi-agent system, and target detection capabilities. The camouflage strategy and the multi-agent detection task assignment strategy establish the multi-agent detection efficiency function; according to the preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints, the distributed multi-agent detection task assignment model is established; The distributed multi-agent detection task assignment model alternately solves the task assignment strategy of the multi-agent system and the corresponding target camouflage strategy according to the gradient information, and generates a distributed multi-agent detection task assignment result.

可选地，在本申请的一个实施例中，在求解所述多智能体系统的任务分配策略及相应的目标伪装策略之前，还包括：设置优化参数及终止条件；建立探测效能的增广拉格朗日函数，并增加探测效能函数的非收敛惩罚项。Optionally, in an embodiment of the present application, before solving the task allocation strategy and the corresponding target camouflage strategy of the multi-agent system, the method further includes: setting optimization parameters and termination conditions; establishing an augmented pull-out of detection efficiency Grange function, and increase the non-convergence penalty term of the detection efficiency function.

可选地，在本申请的一个实施例中，所述多智能体探测效能函数为：Optionally, in an embodiment of the present application, the multi-agent detection efficiency function is:

其中，X_i表示智能体i对多智能体系统全局任务分配结果的估计，B表示智能体与目标之间的分配权重参数，C表示智能体系统携带的目标探测器类型，D表示目标伪装策略，E_i表示第i列全为1其余为0的M行N列矩阵，Λ为不同探测器的目标探测能力参数。Among them, X _i represents the estimation of the global task assignment result of the multi-agent system by the agent i, B represents the distribution weight parameter between the agent and the target, C represents the target detector type carried by the agent system, and D represents the target camouflage strategy , E _i represents a matrix with M rows and N columns in which the i-th column is all 1 and the rest are 0, and Λ is the target detection capability parameter of different detectors.

可选地，在本申请的一个实施例中，所述生成分布式多智能体探测任务分配结果，包括：利用所述多智能体任务分配模型求解探测效能函数在所有伪装策略下的最大值。Optionally, in an embodiment of the present application, the generating a distributed multi-agent detection task assignment result includes: using the multi-agent task assignment model to find the maximum value of the detection efficiency function under all camouflage strategies.

可选地，在本申请的一个实施例中，所述根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，包括：求解当前探测任务分配策略下的最优目标伪装策略；并行求解所述当前最优目标伪装策略下的各个智能体最优任务分配策略，以并行求解所述当前探测任务分配策略、所述最优目标伪装策略下的对偶参数；基于所述对偶参数，求解原问题残差及对偶问题残差，直至优化结束，使得所述多智能体系统的探测任务分配解为所有智能体局部探测任务分配解的平均值。Optionally, in an embodiment of the present application, the alternatively solving the task allocation strategy and the corresponding target camouflage strategy of the multi-agent system according to the gradient information includes: solving the optimal target camouflage strategy under the current detection task allocation strategy. ; Solve the optimal task allocation strategy of each agent under the current optimal target camouflage strategy in parallel to solve the dual parameters under the current detection task allocation strategy and the optimal target camouflage strategy in parallel; Based on the dual parameters , solve the residual of the original problem and the residual of the dual problem until the optimization ends, so that the detection task assignment solution of the multi-agent system is the average value of the local detection task assignment solutions of all agents.

本申请第二方面实施例提供一种对抗场景中的分布式多智能体探测任务分配装置，包括：第一函数建立模块，用于根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略建立多智能体探测效能函数；模型建立模块，用于根据预设的多智能体探测能力约束、目标的伪装能力约束、探测任务约束建立分布式多智能体探测任务分配模型；分配模块，用于基于所述分布式多智能体探测任务分配模型，根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，生成分布式多智能体探测任务分配结果。The embodiment of the second aspect of the present application provides a distributed multi-agent detection task assignment device in a confrontation scenario, including: a first function establishment module for assigning weights according to detection tasks, and the configuration of target detectors carried by the multi-agent system and detection ability, target camouflage strategy, and multi-agent detection task assignment strategy to establish multi-agent detection efficiency function; model building module is used for the preset multi-agent detection ability constraints, target camouflage ability constraints, detection task constraints A distributed multi-agent detection task assignment model is established; the assignment module is used to alternately solve the task assignment strategy and the corresponding target camouflage strategy of the multi-agent system based on the distributed multi-agent detection task assignment model according to the gradient information, and generate Distributed multi-agent detection task assignment results.

可选地，在本申请的一个实施例中，还包括：设置模块，用于设置优化参数及终止条件；第二函数建立模块，用于建立探测效能的增广拉格朗日函数，并增加探测效能函数的非收敛惩罚项。Optionally, in an embodiment of the present application, it further includes: a setting module for setting optimization parameters and termination conditions; a second function establishing module for establishing an augmented Lagrangian function of detection efficiency, and adding A non-convergence penalty term for the detection efficiency function.

可选地，在本申请的一个实施例中，所述分配模块包括：第一求解单元，用于利用所述多智能体任务分配模型求解探测效能函数在所有伪装策略下的最大值。Optionally, in an embodiment of the present application, the allocation module includes: a first solving unit, configured to use the multi-agent task allocation model to solve the maximum value of the detection efficiency function under all camouflage strategies.

可选地，在本申请的一个实施例中，所述分配模块还包括：第二求解单元，用于求解当前探测任务分配策略下的最优目标伪装策略；第三求解单元，用于并行求解所述当前最优目标伪装策略下的各个智能体最优任务分配策略，以并行求解所述当前探测任务分配策略、所述最优目标伪装策略下的对偶参数；分配单元，用于基于所述对偶参数，求解原问题残差及对偶问题残差，直至优化结束，使得所述多智能体系统的探测任务分配解为所有智能体局部探测任务分配解的平均值。Optionally, in an embodiment of the present application, the allocation module further includes: a second solving unit for solving the optimal target camouflage strategy under the current detection task allocation strategy; a third solving unit for solving in parallel the optimal task allocation strategy of each agent under the current optimal target camouflage strategy to solve the current detection task allocation strategy and the dual parameters under the optimal target camouflage strategy in parallel; The dual parameter is used to solve the residual of the original problem and the residual of the dual problem until the optimization ends, so that the detection task assignment solution of the multi-agent system is the average value of the local detection task assignment solutions of all agents.

本申请第三方面实施例提供一种电子设备，包括：存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序，所述处理器执行所述程序，以实现如上述实施例所述的对抗场景中的分布式多智能体探测任务分配方法。An embodiment of a third aspect of the present application provides an electronic device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the program to achieve The distributed multi-agent detection task assignment method in the confrontation scene as described in the above embodiment.

本申请第四方面实施例提供一种计算机可读存储介质，所述计算机可读存储介质存储计算机指令，所述计算机指令用于使所述计算机执行如上述实施例所述的对抗场景中的分布式多智能体探测任务分配方法。Embodiments of the fourth aspect of the present application provide a computer-readable storage medium, where the computer-readable storage medium stores computer instructions, where the computer instructions are used to cause the computer to execute the distribution in the confrontation scenario described in the foregoing embodiments A multi-agent detection task assignment method.

本申请实施例可以根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略，利用多智能体探测效能函数和分布式多智能体探测任务分配模型，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。由此，解决了相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题。The embodiment of the present application can use the multi-agent detection efficiency function and distributed multi-agent according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the camouflage strategy of the target, and the multi-agent detection task allocation strategy. The task allocation model of body detection is used to achieve a reasonable allocation of multi-agents with different detection capabilities, thereby minimizing the impact of target camouflage on the detection performance of multi-agents, and then maximizing the collection of target feature information in the confrontation environment. As a result, the distributed detection task assignment method in the related art is solved, because the potential camouflage confrontation process of the target is not fully considered, the execution result of the detection task is easily affected by the target camouflage strategy, thereby reducing the acquisition of target feature information. amount of technical issues.

本申请附加的方面和优点将在下面的描述中部分给出，部分将从下面的描述中变得明显，或通过本申请的实践了解到。Additional aspects and advantages of the present application will be set forth, in part, in the following description, and in part will be apparent from the following description, or learned by practice of the present application.

附图说明Description of drawings

本申请上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解，其中：The above and/or additional aspects and advantages of the present application will become apparent and readily understood from the following description of embodiments taken in conjunction with the accompanying drawings, wherein:

图1为根据本申请实施例提供的一种对抗场景中的分布式多智能体探测任务分配方法的流程图；FIG. 1 is a flowchart of a distributed multi-agent detection task assignment method in a confrontation scenario provided according to an embodiment of the present application;

图2为根据本申请一个实施例的对抗场景中的分布式多智能体探测任务分配方法的流程图；2 is a flowchart of a distributed multi-agent detection task assignment method in a confrontation scenario according to an embodiment of the present application;

图3为根据本申请实施例提供的一种对抗场景中的分布式多智能体探测任务分配装置的结构示意图；3 is a schematic structural diagram of a distributed multi-agent detection task assignment device in a confrontation scenario provided according to an embodiment of the present application;

图4为根据本申请实施例提供的电子设备的结构示意图。FIG. 4 is a schematic structural diagram of an electronic device provided according to an embodiment of the present application.

具体实施方式Detailed ways

下面详细描述本申请的实施例，所述实施例的示例在附图中示出，其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的，旨在用于解释本申请，而不能理解为对本申请的限制。The following describes in detail the embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to be used to explain the present application, but should not be construed as a limitation to the present application.

下面参考附图描述本申请实施例的对抗场景中的分布式多智能体探测任务分配方法及装置。针对上述背景技术中心提到的相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题，本申请提供了一种对抗场景中的分布式多智能体探测任务分配方法，在该方法中，可以根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略，利用多智能体探测效能函数和分布式多智能体探测任务分配模型，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。由此，解决了相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题。The method and apparatus for allocating distributed multi-agent detection tasks in a confrontation scenario according to embodiments of the present application will be described below with reference to the accompanying drawings. In view of the distributed detection task allocation method in the related art mentioned in the above background technology center, because the potential camouflage confrontation process of the target is not fully considered, the execution result of the detection task is easily affected by the target camouflage strategy, thereby reducing the target characteristics. The technical problem of the acquisition amount of information, the present application provides a distributed multi-agent detection task assignment method in confrontation scenarios, in which the weight can be assigned according to the detection task, and the target detector configuration carried by the multi-agent system can be configured. and detection ability, target camouflage strategy, multi-agent detection task allocation strategy, and use multi-agent detection efficiency function and distributed multi-agent detection task allocation model to achieve a reasonable allocation of multi-agents with different detection capabilities, so as to maximize the Minimize the impact of target camouflage on the multi-agent detection performance, and then maximize the collection of target feature information in the confrontation environment. As a result, the distributed detection task assignment method in the related art is solved, because the potential camouflage confrontation process of the target is not fully considered, the execution result of the detection task is easily affected by the target camouflage strategy, thereby reducing the acquisition of target feature information. amount of technical issues.

具体而言，图1为本申请实施例所提供的一种对抗场景中的分布式多智能体探测任务分配方法的流程示意图。Specifically, FIG. 1 is a schematic flowchart of a distributed multi-agent detection task assignment method in a confrontation scenario provided by an embodiment of the present application.

如图1所示，该对抗场景中的分布式多智能体探测任务分配方法包括以下步骤：As shown in Figure 1, the distributed multi-agent detection task assignment method in the confrontation scene includes the following steps:

在步骤S101中，根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略建立多智能体探测效能函数。In step S101, a multi-agent detection efficiency function is established according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the target camouflage strategy, and the multi-agent detection task allocation strategy.

在实际执行过程中，本申请实施例可以根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略，建立多智能体探测效能函数，便于后续结合分布式多智能体探测任务分配模型，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。In the actual execution process, the embodiment of the present application can establish the multi-agent detection efficiency according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the target camouflage strategy, and the multi-agent detection task allocation strategy function, it is convenient to combine the distributed multi-agent detection task allocation model to achieve a reasonable allocation of multi-agents with different detection capabilities, so as to minimize the impact of target camouflage on the detection performance of multi-agents, and thus maximize the countermeasures in the environment. Collection of target feature information.

其中，多智能体系统为分布式拓扑结构，仅可获知通信邻域智能体的探测任务分配结果。Among them, the multi-agent system is a distributed topology, and only the detection task assignment results of the communication neighborhood agents can be obtained.

可选地，在本申请的一个实施例中，多智能体探测效能函数为：Optionally, in an embodiment of the present application, the multi-agent detection efficiency function is:

具体地，本申请实施例建立的多智能体探测任务分配的探测效能函数可以为：Specifically, the detection efficiency function of the multi-agent detection task assignment established in the embodiment of the present application may be:

其中，X_i∈[0,1]^M×N表示智能体i对多智能体系统全局任务分配结果的估计，[X_i]_mn表示目标m分配至智能体n的概率；

表示智能体与目标之间的分配权重参数；C∈{0,1}^S×N表示智能体系统携带的目标探测器类型，[C]_sn＝1表示智能体n带有传感器s；D∈[0,1]^S×M表示目标伪装策略，[D]_sm表示目标m伪装自身可被传感器s探测的特征信息。Among them, X _i ∈ [0,1] ^M×N represents the estimation of the global task assignment result of the multi-agent system by the agent i, and [X _i ] _mn represents the probability that the target m is assigned to the agent n;

Represents the distribution weight parameter between the agent and the target; C∈{0,1} ^S×N indicates the type of target detector carried by the agent system, [C] _sn = 1 indicates that the agent n has a sensor s; D∈ [0,1] ^S×M represents the target camouflage strategy, and [D] _sm represents the feature information that the target m camouflages itself can be detected by the sensor s.

在步骤S102中，根据预设的多智能体探测能力约束、目标的伪装能力约束、探测任务约束建立分布式多智能体探测任务分配模型。In step S102, a distributed multi-agent detection task assignment model is established according to preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints.

进一步地，本申请实施例可以结合预设的多智能体探测能力约束、目标的伪装能力约束、探测任务约束建立分布式多智能体探测任务分配模型。Further, the embodiment of the present application can establish a distributed multi-agent detection task assignment model in combination with preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints.

其中，多智能体探测能力约束指同一时刻，每个智能体至多可执行一个探测任务；目标的伪装能力约束指同一时刻，目标至多伪装自身一类特征信息；探测任务约束指所有目标均至少被分配至一个智能体；多智能体一致性约束指各个智能体的全局任务分配解相同。Among them, the multi-agent detection capability constraint means that each agent can perform at most one detection task at the same time; the target camouflage capability constraint means that the target can camouflage at most one type of feature information at the same time; the detection task constraint means that all targets are at least Assigned to one agent; the multi-agent consistency constraint means that the global task assignment solution of each agent is the same.

具体而言，本申请实施例可以建立多智能体探测能力约束条件0≤X_i≤1，

并要求每个智能体至多可执行一个探测任务；Specifically, the embodiment of the present application can establish a multi-agent detection capability constraint condition 0≤X _i ≤1,

And requires each agent to perform at most one detection task;

建立目标的伪装能力约束条件0≤D≤1,

并要求目标至多伪装自身一类特征信息；Establish the target's camouflage ability constraint 0≤D≤1,

And require the target to disguise at most one type of characteristic information of itself;

建立探测任务约束条件

并要求所有目标均至少被分配至一个智能体。Establishing Probing Mission Constraints

and requires all targets to be assigned to at least one agent.

进一步地，本申请实施例可以估计各个智能体对非通信邻域的智能体的探测任务分配结果，联合自身及通信邻域的探测任务分配结果，形成对多智能体系统的探测任务的全局估计，同时增加多智能体系统的一致性约束条件

并要求所有智能体对多智能体系统的探测任务分配的全局估计需保持一致。Further, the embodiment of the present application can estimate the detection task assignment results of each agent to the agent in the non-communication neighborhood, and combine the detection task assignment results of itself and the communication neighborhood to form a global estimation of the detection task of the multi-agent system. , while increasing the consistency constraints of the multi-agent system

And it is required that the global estimation of the detection task assignment of all agents to the multi-agent system should be consistent.

在步骤S103中，基于分布式多智能体探测任务分配模型，根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，生成分布式多智能体探测任务分配结果。In step S103, based on the distributed multi-agent detection task assignment model, the task assignment strategy and the corresponding target camouflage strategy of the multi-agent system are alternately solved according to the gradient information, and the distributed multi-agent detection task assignment result is generated.

在实际执行过程中，本申请实施例可以基于分布式多智能体探测任务分配模型，依据最优解分配多智能体探测任务，并根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，从而生成分布式多智能体探测任务分配结果，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。In the actual execution process, the embodiments of the present application may, based on the distributed multi-agent detection task allocation model, allocate the multi-agent detection task according to the optimal solution, and alternately solve the task allocation strategy and the corresponding multi-agent system according to the gradient information. Target camouflage strategy, so as to generate distributed multi-agent detection task assignment results, to achieve a reasonable allocation of multi-agents with different detection capabilities, so as to minimize the impact of target camouflage on multi-agent detection performance, and maximize the confrontation environment. collection of target feature information.

可选地，在本申请的一个实施例中，在求解多智能体系统的任务分配策略及相应的目标伪装策略之前，还包括：设置优化参数及终止条件；建立探测效能的增广拉格朗日函数，并增加探测效能函数的非收敛惩罚项。Optionally, in an embodiment of the present application, before solving the task assignment strategy of the multi-agent system and the corresponding target camouflage strategy, the method further includes: setting optimization parameters and termination conditions; establishing an augmented Lagrang of detection efficiency day function, and increase the non-convergence penalty term of the detection efficiency function.

作为一种可能实现的方式，本申请实施例可以设置优化参数及终止条件，建立探测效能的增广拉格朗日函数，从而增加探测效能函数的非收敛惩罚项，并根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，直至满足终止条件。As a possible way of implementation, the embodiment of the present application can set optimization parameters and termination conditions, establish an augmented Lagrangian function of detection efficiency, thereby increasing the non-convergence penalty term of the detection efficiency function, and alternately solve multiple problems according to gradient information. The task allocation strategy of the agent system and the corresponding target camouflage strategy until the termination conditions are met.

具体地，本申请实施例可以增加非收敛惩罚项

该惩罚项对非{0,1}的分配结果进行惩罚，从而保证分布式任务分配结果收敛至整数集合{0,1}。Specifically, in this embodiment of the present application, a non-convergence penalty term may be added

The penalty term penalizes the assignment results other than {0,1}, so as to ensure that the distributed task assignment results converge to the integer set {0,1}.

进一步地，多智能体探测效能函数的增广拉格朗日函数可以为：Further, the augmented Lagrangian function of the multi-agent detection efficiency function can be:

其中，g(X_i)为目标分配策略X_i的指示函数，h(D)为伪装策略的指示函数：Among them, g(X _i ) is the indicator function of the target allocation strategy Xi _, and h(D) is the indicator function of the camouflage strategy:

可选地，在本申请的一个实施例中，生成分布式多智能体探测任务分配结果，包括：利用多智能体任务分配模型求解探测效能函数在所有伪装策略下的最大值。Optionally, in an embodiment of the present application, generating a distributed multi-agent detection task assignment result includes: using a multi-agent task assignment model to find the maximum value of the detection efficiency function under all camouflage strategies.

进一步地，利用多智能体任务分配模型求解探测效能函数在所有伪装策略下的最大值可以为：Further, using the multi-agent task assignment model to solve the maximum value of the detection efficiency function under all camouflage strategies can be:

0≤D≤1，0≤D≤1,

可选地，在本申请的一个实施例中，根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，包括：求解当前探测任务分配策略下的最优目标伪装策略；并行求解当前最优目标伪装策略下的各个智能体最优任务分配策略，以并行求解当前探测任务分配策略、最优目标伪装策略下的对偶参数；基于对偶参数，求解原问题残差及对偶问题残差，直至优化结束，使得多智能体系统的探测任务分配解为所有智能体局部探测任务分配解的平均值。Optionally, in an embodiment of the present application, alternately solving the task allocation strategy and the corresponding target camouflage strategy of the multi-agent system according to the gradient information, including: solving the optimal target camouflage strategy under the current detection task allocation strategy; parallel Solve the optimal task allocation strategy of each agent under the current optimal target camouflage strategy, and solve the dual parameters of the current detection task allocation strategy and the optimal target camouflage strategy in parallel; based on the dual parameters, solve the residual of the original problem and the residual of the dual problem difference until the end of the optimization, so that the detection task assignment solution of the multi-agent system is the average value of the local detection task assignment solution of all agents.

具体而言，本申请实施例可以初始化优化步长ρ，初始化多智能体探测任务参数B,C,Λ，并设置多智能体通信拓扑

其中V＝{1,…,N}为多智能体集合，

为多智能体通信边，初始化探测任务分配策略{X_i}⁰，初始化目标伪装策略D⁰，设置优化终止条件∈_pri、∈_dual。Specifically, the embodiment of the present application can initialize the optimization step size ρ, initialize the multi-agent detection task parameters B, C, Λ, and set the multi-agent communication topology

where V={1,...,N} is a multi-agent set,

For the multi-agent communication edge, initialize the detection task assignment strategy {X _i } ⁰ , initialize the target camouflage strategy D ⁰ , and set the optimization termination conditions ∈ _pri and ∈ _dual .

进一步地，本申请实施例可以求解当前探测任务分配策略下的最优目标伪装策略：Further, the embodiment of the present application can solve the optimal target camouflage strategy under the current detection task allocation strategy:

进一步地，本申请实施例可以并行求解当前最优目标伪装策略下的各个智能体最优任务分配策略：Further, the embodiment of the present application can solve the optimal task allocation strategy of each agent under the current optimal target camouflage strategy in parallel:

其中，

in,

进一步地，本申请实施例可以并行求解当前探测任务分配策略、最优目标伪装策略下的对偶参数：Further, the embodiment of the present application can solve the dual parameters under the current detection task allocation strategy and the optimal target camouflage strategy in parallel:

进一步地，本申请实施例可以求解原问题残差

及对偶问题残差

若同时满足

及

则终止优化，否则重复上述过程直至优化时间消耗完毕。Further, the embodiment of the present application can solve the residual of the original problem

and dual problem residuals

If both meet

and

Then terminate the optimization, otherwise repeat the above process until the optimization time is exhausted.

进一步地，本申请实施例可以令多智能体系统的探测任务分配解为所有智能体局部探测任务分配解的平均值：Further, in the embodiment of the present application, the detection task assignment solution of the multi-agent system can be the average value of the local detection task assignment solutions of all agents:

综上所述，本申请实施例与相关技术相比，解决了多目标探测任务中目标的伪装能力对探测效能影响的问题，具有较强的鲁棒性，适用于具有连通通信拓扑的多智能体分布式探测任务中，具备较高的收敛速率，可以在较短的时间内搜寻到目标伪装下的合理探测任务分配结果。To sum up, compared with the related art, the embodiment of the present application solves the problem of the influence of the camouflage ability of the target on the detection efficiency in the multi-target detection task, has strong robustness, and is suitable for multi-intelligence with connected communication topology. In the volume distributed detection task, it has a high convergence rate, and can search for a reasonable detection task assignment result under the target camouflage in a short time.

具体地，结合图2所示，以一个具体实施例对本申请实施例的对抗场景中的分布式多智能体探测任务分配方法的工作原理进行详细阐述。Specifically, with reference to FIG. 2 , the working principle of the distributed multi-agent detection task assignment method in the confrontation scenario according to the embodiment of the present application will be described in detail with a specific embodiment.

如图2所示，本申请实施例可以包括以下步骤：As shown in FIG. 2, this embodiment of the present application may include the following steps:

步骤S201：初始化参数，设置终止条件。本申请实施例可以设置多智能体通信拓扑

初始化探测任务分配权重B、多智能体系统携带的目标探测器配置参数C、不同探测器的目标探测能力参数Λ、目标伪装策略D⁰、多智能体探测任务分配策略

Step S201: Initialize parameters and set termination conditions. This embodiment of the present application can set a multi-agent communication topology

Initialized detection task assignment weight B, target detector configuration parameter C carried by the multi-agent system, target detection capability parameter Λ of different detectors, target camouflage strategy D ⁰ , multi-agent detection task assignment strategy

多智能体通信拓扑

表示多智能体系统的通信拓扑结构，其中，V＝{1,…,N}为多智能体集合，

为多智能体通信边集合。Multi-agent communication topology

represents the communication topology of the multi-agent system, where V={1,...,N} is the multi-agent set,

Sets of edges for multi-agent communication.

本申请实施例可以依据当前多智能体系统的通信范围，如果智能体i与智能体j之间的距离小于通信距离阈值，则认为智能体i可与智能体j进行连续稳定通信，本发明中认为通信拓扑

为连通图。In the embodiment of the present application, according to the communication range of the current multi-agent system, if the distance between the agent i and the agent j is less than the communication distance threshold, it is considered that the agent i can communicate continuously and stably with the agent j. In the present invention think communication topology

is a connected graph.

需要注意的是，通信距离阈值可以由本领域技术人员根据实际情况进行设置，在此不做具体限制。It should be noted that the communication distance threshold can be set by those skilled in the art according to actual conditions, and no specific limitation is made here.

探测任务分配权重B表示智能体执行探测任务的权重，其中[B]_mn为标量值，取[B]_mn≥0且

[B]_mn表示智能体n执行探测任务m的权重大小，本申请实施例通过智能体执行探测任务的权重，可表示多智能体与目标的距离等因素产生的探测任务的执行成本，权重值越小表示执行成本越高。The detection task assignment weight B represents the weight of the agent to perform the detection task, where [B] _mn is a scalar value, and [B] _mn ≥ 0 and

[B] _mn represents the weight of the detection task m performed by the agent n. In this embodiment of the application, the weight of the detection task performed by the agent can represent the execution cost of the detection task caused by factors such as the distance between the multi-agent and the target, and the weight value Smaller means higher execution cost.

目标探测器配置参数C表示不同智能体携带的探测器类型，例如红外探测器、激光雷达、摄像头等。若多智能体系统共有S类探测器，取[C]_sn∈{0,1}，[C]_sn＝1表示智能体n具有s型探测器，否则为0。The target detector configuration parameter C represents the types of detectors carried by different agents, such as infrared detectors, lidars, cameras, etc. If the multi-agent system has S-type detectors, take [C] _sn ∈ {0,1}, [C] _sn = 1 means that the agent n has s-type detectors, otherwise it is 0.

不同探测器的目标探测能力参数Λ表示探测器对目标不同特征信息的获取能力，取Λ＝diag(λ₁,…，λ_S)，其中λ_s表示s型探测器的目标探测能力，通过目标探测能力参数Λ，可表示如低能见度、夜间等各类复杂气象条件对探测器能力的影响，参数值越大表示探测能力越强。The target detection ability parameter Λ of different detectors represents the ability of the detector to acquire different characteristic information of the target, and takes Λ=diag(λ ₁ ,...,λ _S ), where λ _s represents the target detection ability of the s-type detector. The detection capability parameter Λ can represent the influence of various complex meteorological conditions, such as low visibility and night time, on the capability of the detector. The larger the parameter value, the stronger the detection capability.

目标伪装策略D表示目标伪装的与目标探测器类型相对应的自身特征信息类型，其中[D]_ms∈[0,1]，表示目标m伪装的特征信息类型为s，在该伪装策略下，目标探测器s无法探测到目标m。在目标伪装策略D的初始化中，本申请实施例可以以区间[0,1]上的均匀分布采样S×M维D⁰。The target camouflage strategy D represents the type of self-feature information corresponding to the target detector type of target camouflage, where [D] _ms ∈ [0,1], indicating that the feature information type of target m camouflage is s. Under this camouflage strategy, Target detector s cannot detect target m. In the initialization of the target camouflage strategy D, the embodiment of the present application may sample the S×M dimension D ⁰ with a uniform distribution in the interval [0,1].

多智能体探测任务分配策略{X_i}表示智能体i的探测任务分配结果，其中[X_i]_mn∈[0,1]表示智能体i认为的探测任m分配至智能体n的概率。本申请实施例可以取多智能体探测任务分配策略{X_i}的初始化，以区间[0,1]上的均匀分布采样N个M×N维

The multi-agent detection task assignment strategy {X _i } represents the detection task assignment result of the agent i, where [X _i ] _mn ∈[0,1] represents the probability that the detection task m is assigned to the agent n by the agent i. This embodiment of the present application may take the initialization of the multi-agent detection task assignment strategy {X _i }, and sample N M×N dimensions with a uniform distribution in the interval [0,1]

步骤S202：优化目标伪装策略。本申请实施例可以输入当前探测任务分配策略

更新目标伪装策略D^k+1。目标伪装策略的目标是最小化探测效能函数，同时满足目标伪装能力约束条件0≤D≤1,

结合增广拉格朗日函数，可以将当前探测任务分配策略

代入目标伪装优化函数，得到k+1时刻的目标伪装策略。Step S202: Optimizing the target camouflage strategy. In this embodiment of the present application, the current detection task allocation strategy can be input

Update the target camouflage policy D ^k+1 . The goal of the target camouflage strategy is to minimize the detection efficiency function while satisfying the target camouflage capability constraint 0≤D≤1,

Combined with the augmented Lagrangian function, the current detection task assignment strategy can be

Substitute into the target camouflage optimization function to obtain the target camouflage strategy at time k+1.

目标伪装策略优化方法，以交替方向乘子法为基础，迭代求解k+1时刻的目标伪装策略。具体而言，本申请实施例可以首先初始化以k时刻的最优目标伪装策略D^k为初始值，即D^k+1,0＝D^k；其次交替优化伪装策略的目标函数和约束函数的局部最优值，其中伪装策略的目标函数的局部最优值初值为

并通过以下公式进行更新：The optimization method of target camouflage strategy is based on the alternating direction multiplier method, and iteratively solves the target camouflage strategy at time k+1. Specifically, the embodiment of the present application may firstly initialize the optimal target camouflage strategy Dk at time ^k as the initial value, that is, Dk ^+1,0 = ^Dk ; secondly, alternately optimize the objective function of the camouflage strategy and the local part of the constraint function The optimal value, where the initial value of the local optimal value of the objective function of the camouflage strategy is

and updated by the following formula:

其中，

U_d＝0_M×N，约束函数的局部最优值初值为

并通过以下公式进行更新：in,

U _d =0 _M×N , the initial value of the local optimal value of the constraint function is

and updated by the following formula:

Prox为近端算子，定义可以如下：Prox is a near-end operator, which can be defined as follows:

再次，本申请实施例可以更新t+1步

Again, this embodiment of the present application can update step t+1

最后，当收敛误差

以及

足够小时(通常取10^-3)，令k+1时刻的目标伪装策略

Finally, when the convergence error

as well as

Small enough (usually 10 ^-3 ) to make the target camouflage strategy at time k+1

步骤S203：优化智能体探测任务分配策略。本申请实施例可以输入k+1步的目标伪装策略D^k+1，并行更新k+1多智能体探测任务分配策略

多智能体探测任务分配策略的目标是最大化探测效能函数，同时满足多智能体能力约束条件0≤X_i≤1,

满足探测任务约束条件|X^m|_∞＝1，结合增广拉格朗日函数，将k+1步的目标伪装策略D^k+1代入探测任务分配策略优化函数，得到k+1时刻的探测任务分配策略。Step S203: Optimizing the agent detection task assignment strategy. In this embodiment of the present application, the target camouflage strategy D ^k+1 of k+1 steps can be input, and the k+1 multi-agent detection task assignment strategy can be updated in parallel

The goal of the multi-agent detection task assignment strategy is to maximize the detection efficiency function while satisfying the multi-agent capability constraint 0≤X _i ≤1,

Satisfy the detection task constraint |X ^m | _∞ =1, combined with the augmented Lagrangian function, substitute the target camouflage strategy D ^k+1 of k+1 steps into the detection task assignment strategy optimization function, and obtain the detection at time k+1 task assignment strategy.

探测任务分配策略优化方法，本申请实施例可以以交替方向乘子法为基础，迭代求解k+1时刻的探测任务分配策略。具体而言，对于智能体i，本申请实施例可以首先初始化以k时刻的最优探测任务分配策略

为初始值，即

其次交替优化探测任务分配策略的目标函数和约束函数的局部最优值，其中探测任务分配的目标函数的局部最优值初值为

并通过以下公式进行更新：For the detection task allocation strategy optimization method, the embodiment of the present application may be based on the alternate direction multiplier method to iteratively solve the detection task allocation strategy at time k+1. Specifically, for agent i, this embodiment of the present application may first initialize the optimal detection task assignment strategy at time k

is the initial value, that is

Secondly, the objective function of the detection task assignment strategy and the local optimal value of the constraint function are alternately optimized, wherein the initial value of the local optimal value of the objective function of the detection task assignment is

and updated by the following formula:

其中，

in,

探测任务分配策略的约束函数的局部最优值初值为

并可以通过以下公式进行更新：The initial value of the local optimal value of the constraint function of the detection task assignment strategy is

and can be updated by the following formula:

再次，更新t+1步

Again, update t+1 step

最后，当收敛误差

以及

足够小时(通常取10^-3)，令k+1时刻的探测任务分配策略

Finally, when the convergence error

as well as

is small enough (usually 10 ^-3 ), so that the detection task allocation strategy at time k+1

本申请实施例可以根据以下公式求解解残差

及对偶问题残差

In this embodiment of the present application, the solution residual can be solved according to the following formula

and dual problem residuals

若同时满足

及

则终止优化，否则跳至上述步骤直至优化时间消耗完毕。If both meet

and

Then terminate the optimization, otherwise skip to the above steps until the optimization time is exhausted.

本申请实施例可以输出最优探测任务分配策略

以及对应的最差目标伪装策略D^*＝^k。This embodiment of the present application can output an optimal detection task allocation strategy

and the corresponding worst target camouflage strategy D ^* = ^k .

步骤S204：判断是否达到优化最大步数。若达到最大步数，则进入步骤S206，反之则进入步骤S205。Step S204: Determine whether the maximum number of optimization steps is reached. If the maximum number of steps is reached, go to step S206, otherwise go to step S205.

步骤S205：判断是否满足一致性误差和最优性误差阈值条件。若满足，则进入步骤S206，反之则进入步骤S203。Step S205: Determine whether the threshold conditions of the consistency error and the optimality error are satisfied. If satisfied, go to step S206, otherwise go to step S203.

步骤S206：返回多智能体探测任务分配最优解。Step S206: Return to the optimal solution of multi-agent detection task assignment.

根据本申请实施例提出的对抗场景中的分布式多智能体探测任务分配方法，可以根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略，利用多智能体探测效能函数和分布式多智能体探测任务分配模型，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。由此，解决了相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题。According to the distributed multi-agent detection task assignment method in the confrontation scenario proposed by the embodiment of the present application, the weight of the detection task can be assigned, the target detector configuration and detection capability carried by the multi-agent system, the camouflage strategy of the target, the multi-agent The detection task allocation strategy uses the multi-agent detection efficiency function and the distributed multi-agent detection task allocation model to achieve a reasonable allocation of multi-agents with different detection capabilities, thereby minimizing the impact of target camouflage on multi-agent detection performance. , and then maximize the collection of target feature information in the adversarial environment. As a result, the distributed detection task assignment method in the related art is solved, because the potential camouflage confrontation process of the target is not fully considered, the execution result of the detection task is easily affected by the target camouflage strategy, thereby reducing the acquisition of target feature information. amount of technical issues.

其次参照附图描述根据本申请实施例提出的对抗场景中的分布式多智能体探测任务分配装置。Next, the distributed multi-agent detection task assignment device in the confrontation scenario proposed according to the embodiment of the present application will be described with reference to the accompanying drawings.

图3是本申请实施例的对抗场景中的分布式多智能体探测任务分配装置的方框示意图。FIG. 3 is a schematic block diagram of a distributed multi-agent detection task assignment device in a confrontation scenario according to an embodiment of the present application.

如图3所示，该对抗场景中的分布式多智能体探测任务分配装置10包括：第一函数建立模块100、模型建立模块200和分配模块300。As shown in FIG. 3 , the distributed multi-agent detection task assignment device 10 in the confrontation scenario includes: a first function establishment module 100 , a model establishment module 200 and an assignment module 300 .

具体地，第一函数建立模块100，用于根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略建立多智能体探测效能函数。Specifically, the first function establishment module 100 is used to establish the multi-agent detection efficiency according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the target camouflage strategy, and the multi-agent detection task allocation strategy function.

模型建立模块200，用于根据预设的多智能体探测能力约束、目标的伪装能力约束、探测任务约束建立分布式多智能体探测任务分配模型。The model establishment module 200 is configured to establish a distributed multi-agent detection task assignment model according to preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints.

分配模块300，用于基于分布式多智能体探测任务分配模型，根据梯度信息交替求解多智能体系统的任务分配策略及相应的目标伪装策略，生成分布式多智能体探测任务分配结果。The allocation module 300 is configured to alternately solve the task allocation strategy and the corresponding target camouflage strategy of the multi-agent system according to the gradient information based on the distributed multi-agent detection task allocation model, and generate a distributed multi-agent detection task allocation result.

可选地，在本申请的一个实施例中，对抗场景中的分布式多智能体探测任务分配装置10还包括：设置模块和第二函数建立模块。Optionally, in an embodiment of the present application, the distributed multi-agent detection task assignment device 10 in the confrontation scene further includes: a setting module and a second function establishment module.

其中，设置模块，用于设置优化参数及终止条件。Among them, the setting module is used to set optimization parameters and termination conditions.

第二函数建立模块，用于建立探测效能的增广拉格朗日函数，并增加探测效能函数的非收敛惩罚项。The second function establishment module is used for establishing the augmented Lagrangian function of the detection efficiency, and adding a non-convergence penalty term of the detection efficiency function.

可选地，在本申请的一个实施例中，分配模块300包括：第一求解单元。Optionally, in an embodiment of the present application, the allocation module 300 includes: a first solving unit.

其中，第一求解单元，用于利用多智能体任务分配模型求解探测效能函数在所有伪装策略下的最大值。Wherein, the first solving unit is used to solve the maximum value of the detection efficiency function under all camouflage strategies by using the multi-agent task assignment model.

可选地，在本申请的一个实施例中，分配模块300还包括：第二求解单元、第三求解单元和分配单元。Optionally, in an embodiment of the present application, the allocation module 300 further includes: a second solving unit, a third solving unit, and an allocating unit.

其中，第二求解单元，用于求解当前探测任务分配策略下的最优目标伪装策略。Among them, the second solving unit is used to solve the optimal target camouflage strategy under the current detection task allocation strategy.

第三求解单元，用于并行求解当前最优目标伪装策略下的各个智能体最优任务分配策略，以并行求解当前探测任务分配策略、最优目标伪装策略下的对偶参数。The third solving unit is used to solve the optimal task allocation strategy of each agent under the current optimal target camouflage strategy in parallel, so as to solve the dual parameters of the current detection task allocation strategy and the optimal target camouflage strategy in parallel.

分配单元，用于基于对偶参数，求解原问题残差及对偶问题残差，直至优化结束，使得多智能体系统的探测任务分配解为所有智能体局部探测任务分配解的平均值。The allocation unit is used to solve the residual of the original problem and the residual of the dual problem based on the dual parameters until the end of the optimization, so that the detection task allocation solution of the multi-agent system is the average value of the local detection task allocation solutions of all agents.

需要说明的是，前述对对抗场景中的分布式多智能体探测任务分配方法实施例的解释说明也适用于该实施例的对抗场景中的分布式多智能体探测任务分配装置，此处不再赘述。It should be noted that the foregoing explanation of the embodiment of the distributed multi-agent detection task assignment method in the confrontation scene is also applicable to the distributed multi-agent detection task assignment device in the confrontation scene of this embodiment, which is not repeated here. Repeat.

根据本申请实施例提出的对抗场景中的分布式多智能体探测任务分配装置，可以根据探测任务分配权重、多智能体系统携带的目标探测器配置及探测能力、目标的伪装策略、多智能体探测任务分配策略，利用多智能体探测效能函数和分布式多智能体探测任务分配模型，实现对不同探测能力的多智能体的合理分配，从而最大限度降低目标伪装对多智能体探测效能的影响，进而最大化对抗环境下的目标特征信息的收集。由此，解决了相关技术中通过分布式探测任务分配方法，由于未能充分考虑目标潜在的伪装对抗过程，导致探测任务的执行结果易受目标伪装策略的影响，从而降低了目标特征信息的获取量的技术问题。According to the distributed multi-agent detection task assignment device in the confrontation scenario proposed by the embodiment of the present application, the weight of detection tasks can be assigned, the target detector configuration and detection capability carried by the multi-agent system, the camouflage strategy of the target, and the multi-agent system. The detection task allocation strategy uses the multi-agent detection efficiency function and the distributed multi-agent detection task allocation model to achieve a reasonable allocation of multi-agents with different detection capabilities, thereby minimizing the impact of target camouflage on multi-agent detection performance. , and then maximize the collection of target feature information in the adversarial environment. As a result, the distributed detection task assignment method in the related art is solved, because the potential camouflage confrontation process of the target is not fully considered, the execution result of the detection task is easily affected by the target camouflage strategy, thereby reducing the acquisition of target feature information. amount of technical issues.

图4为本申请实施例提供的电子设备的结构示意图。该电子设备可以包括：FIG. 4 is a schematic structural diagram of an electronic device provided by an embodiment of the present application. The electronic device may include:

存储器401、处理器402及存储在存储器401上并可在处理器402上运行的计算机程序。Memory 401 , processor 402 , and computer programs stored on memory 401 and executable on processor 402 .

处理器402执行程序时实现上述实施例中提供的对抗场景中的分布式多智能体探测任务分配方法。When the processor 402 executes the program, the distributed multi-agent detection task assignment method in the confrontation scenario provided in the above embodiment is implemented.

进一步地，电子设备还包括：Further, the electronic device also includes:

通信接口403，用于存储器401和处理器402之间的通信。The communication interface 403 is used for communication between the memory 401 and the processor 402 .

存储器401，用于存放可在处理器402上运行的计算机程序。The memory 401 is used to store computer programs that can be executed on the processor 402 .

存储器401可能包含高速RAM存储器，也可能还包括非易失性存储器(non-volatile memory)，例如至少一个磁盘存储器。The memory 401 may include high-speed RAM memory, and may also include non-volatile memory, such as at least one disk memory.

如果存储器401、处理器402和通信接口403独立实现，则通信接口403、存储器401和处理器402可以通过总线相互连接并完成相互间的通信。总线可以是工业标准体系结构(Industry Standard Architecture，简称为ISA)总线、外部设备互连(PeripheralComponent，简称为PCI)总线或扩展工业标准体系结构(Extended Industry StandardArchitecture，简称为EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示，图4中仅用一条粗线表示，但并不表示仅有一根总线或一种类型的总线。If the memory 401, the processor 402 and the communication interface 403 are independently implemented, the communication interface 403, the memory 401 and the processor 402 can be connected to each other through a bus and complete communication with each other. The bus may be an Industry Standard Architecture (referred to as ISA) bus, a Peripheral Component (referred to as PCI) bus, or an Extended Industry Standard Architecture (referred to as EISA) bus or the like. The bus can be divided into address bus, data bus, control bus and so on. For ease of presentation, only one thick line is used in FIG. 4, but it does not mean that there is only one bus or one type of bus.

可选地，在具体实现上，如果存储器401、处理器402及通信接口403，集成在一块芯片上实现，则存储器401、处理器402及通信接口403可以通过内部接口完成相互间的通信。Optionally, in terms of specific implementation, if the memory 401, the processor 402 and the communication interface 403 are integrated on one chip, the memory 401, the processor 402 and the communication interface 403 can communicate with each other through an internal interface.

处理器402可能是一个中央处理器(Central Processing Unit，简称为CPU)，或者是特定集成电路(Application Specific Integrated Circuit，简称为ASIC)，或者是被配置成实施本申请实施例的一个或多个集成电路。The processor 402 may be a central processing unit (Central Processing Unit, CPU for short), or a specific integrated circuit (Application Specific Integrated Circuit, ASIC for short), or is configured to implement one or more of the embodiments of the present application integrated circuit.

本实施例还提供一种计算机可读存储介质，其上存储有计算机程序，该程序被处理器执行时实现如上的对抗场景中的分布式多智能体探测任务分配方法。This embodiment also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the above distributed multi-agent detection task assignment method in a confrontation scenario.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任一个或N个实施例或示例中以合适的方式结合。此外，在不相互矛盾的情况下，本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present application. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or N of the embodiments or examples. Furthermore, those skilled in the art may combine and combine the different embodiments or examples described in this specification, as well as the features of the different embodiments or examples, without conflicting each other.

此外，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中，“N个”的含义是至少两个，例如两个，三个等，除非另有明确具体的限定。In addition, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as indicating or implying relative importance or implying the number of indicated technical features. Thus, a feature delimited with "first", "second" may expressly or implicitly include at least one of that feature. In the description of the present application, "N" means at least two, such as two, three, etc., unless otherwise expressly and specifically defined.

流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为，表示包括一个或更N个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分，并且本申请的优选实施方式的范围包括另外的实现，其中可以不按所示出或讨论的顺序，包括根据所涉及的功能按基本同时的方式或按相反的顺序，来执行功能，这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in the flowchart or otherwise described herein may be understood to represent a module, segment or portion of code comprising one or N more executable instructions for implementing custom logical functions or steps of the process , and the scope of the preferred embodiments of the present application includes alternative implementations in which the functions may be performed out of the order shown or discussed, including performing the functions substantially concurrently or in the reverse order depending upon the functions involved, which should It is understood by those skilled in the art to which the embodiments of the present application belong.

在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或N个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序，然后将其存储在计算机存储器中。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in conjunction with an instruction execution system, apparatus, or apparatus. More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections (electronic devices) with one or N wires, portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.

应当理解，本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，N个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。如，如果用硬件来实现和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或他们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA)等。It should be understood that various parts of this application may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, the N steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware as in another embodiment, it can be implemented by any one of the following techniques known in the art, or a combination thereof: discrete with logic gates for implementing logic functions on data signals Logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成，所述的程序可以存储于一种计算机可读存储介质中，该程序在执行时，包括方法实施例的步骤之一或其组合。Those skilled in the art can understand that all or part of the steps carried by the methods of the above embodiments can be completed by instructing the relevant hardware through a program, and the program can be stored in a computer-readable storage medium, and the program can be stored in a computer-readable storage medium. When executed, one or a combination of the steps of the method embodiment is included.

此外，在本申请各个实施例中的各功能单元可以集成在一个处理模块中，也可以是各个单元单独物理存在，也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现，也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时，也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist physically alone, or two or more units may be integrated into one module. The above-mentioned integrated modules can be implemented in the form of hardware, and can also be implemented in the form of software function modules. If the integrated modules are implemented in the form of software functional modules and sold or used as independent products, they may also be stored in a computer-readable storage medium.

上述提到的存储介质可以是只读存储器，磁盘或光盘等。尽管上面已经示出和描述了本申请的实施例，可以理解的是，上述实施例是示例性的，不能理解为对本申请的限制，本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。The above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, and the like. Although the embodiments of the present application have been shown and described above, it should be understood that the above embodiments are exemplary and should not be construed as limitations to the present application. Embodiments are subject to variations, modifications, substitutions and variations.

Claims

1. a distributed multi-agent detection task assignment method in a confrontation scene, is characterized in that, comprises the following steps:

The multi-agent detection efficiency function is established according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the target camouflage strategy, and the multi-agent detection task allocation strategy;

Establish a distributed multi-agent detection task assignment model according to the preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints;

Based on the distributed multi-agent detection task assignment model, the task assignment strategy of the multi-agent system and the corresponding target camouflage strategy are alternately solved according to the gradient information, and the distributed multi-agent detection task assignment result is generated.

2. The method according to claim 1, characterized in that, before solving the task allocation strategy of the multi-agent system and the corresponding target camouflage strategy, further comprising:

Set optimization parameters and termination conditions;

The augmented Lagrangian function of the detection efficiency is established, and the non-convergence penalty term of the detection efficiency function is added.

3. The method according to claim 1, wherein the multi-agent detection efficiency function is:

Among them, X _i represents the estimation of the global task assignment result of the multi-agent system by the agent i, B represents the distribution weight parameter between the agent and the target, C represents the target detector type carried by the agent system, and D represents the target camouflage strategy , E _i represents a matrix with M rows and N columns in which the i-th column is all 1 and the rest are 0, and Λ is the target detection capability parameter of different detectors.

4. The method according to claim 1, wherein the generating a distributed multi-agent detection task assignment result comprises:

The multi-agent task assignment model is used to solve the maximum value of the detection efficiency function under all camouflage strategies.

5. The method according to any one of claims 1-4, wherein the alternately solving the task allocation strategy and the corresponding target camouflage strategy of the multi-agent system according to the gradient information, comprising:

Solve the optimal target camouflage strategy under the current detection task assignment strategy;

Solve the optimal task assignment strategy of each agent under the current optimal target camouflage strategy in parallel, so as to solve the current detection task assignment strategy and dual parameters under the optimal target camouflage strategy in parallel;

Based on the dual parameters, the original problem residual and the dual problem residual are solved until the optimization ends, so that the detection task assignment solution of the multi-agent system is the average value of the local detection task assignment solutions of all agents.

6. A distributed multi-agent detection task assignment device in a confrontation scene, characterized in that, comprising:

The first function establishment module is used to establish the multi-agent detection efficiency function according to the detection task allocation weight, the target detector configuration and detection capability carried by the multi-agent system, the target camouflage strategy, and the multi-agent detection task allocation strategy;

The model establishment module is used to establish a distributed multi-agent detection task assignment model according to preset multi-agent detection capability constraints, target camouflage capability constraints, and detection task constraints;

The assignment module is configured to alternately solve the task assignment strategy and the corresponding target camouflage strategy of the multi-agent system according to the gradient information based on the distributed multi-agent detection task assignment model, and generate a distributed multi-agent detection task assignment result.

7. The apparatus of claim 6, further comprising:

Setting module, used to set optimization parameters and termination conditions;

The second function establishment module is used for establishing the augmented Lagrangian function of the detection efficiency, and adding a non-convergence penalty term of the detection efficiency function.

8. The apparatus according to claim 6, wherein the multi-agent detection efficiency function is:

9. The apparatus according to claim 6, wherein the distribution module comprises:

The first solving unit is configured to use the multi-agent task assignment model to solve the maximum value of the detection efficiency function under all camouflage strategies.

10. The device according to any one of claims 6-9, wherein the distribution module further comprises:

The second solving unit is used to solve the optimal target camouflage strategy under the current detection task allocation strategy;

The third solving unit is used to solve the optimal task allocation strategy of each agent under the current optimal target camouflage strategy in parallel, so as to solve the current detection task allocation strategy and the dual parameters of the optimal target camouflage strategy in parallel ;

an allocation unit, configured to solve the residual of the original problem and the residual of the dual problem based on the dual parameters, until the optimization ends, so that the detection task allocation solution of the multi-agent system is the average value of the local detection task allocation solutions of all agents .

11. An electronic device, comprising: a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor executing the program to implement the program as claimed in the claim Distributed multi-agent detection task assignment method in confrontation scene according to any one of requirements 1-5.

12. A computer-readable storage medium on which a computer program is stored, characterized in that the program is executed by a processor to implement the distributed system in the confrontation scenario according to any one of claims 1-5. A multi-agent detection task assignment method.