CN115220473A

CN115220473A - Multi-unmanned aerial vehicle swarm cooperative task dynamic allocation method

Info

Publication number: CN115220473A
Application number: CN202210822637.3A
Authority: CN
Inventors: 郝文龙; 李五洲; 张文伟; 汤鑫; 王晓卫; 熊伟; 李世忠; 林世聪
Original assignee: Chinese People's Liberation Army Aviation College
Current assignee: Chinese People's Liberation Army Aviation College
Priority date: 2022-07-12
Filing date: 2022-07-12
Publication date: 2022-10-21

Abstract

The invention discloses a multi-unmanned aerial vehicle swarm cooperative task dynamic allocation method, which comprises the steps of S1, establishing a cooperative task allocation model; s2, optimizing a task allocation strategy of the cooperative task allocation model in the S1 based on a selection optimization method; and making a circle with a certain radius by taking each task as a center, adjusting the selection of each task to the Agent in the circle to realize strategy optimization, ensuring that each iteration enables the allocation strategy of the Multi Agent system to move towards a more optimal direction, and obtaining the optimal allocation strategy by the Multi Agent system after limited iterations. The selective optimization method takes the selectable unmanned aerial vehicle in a certain range as an optimization object, reduces the possible solution scale and improves the optimization speed. The optimal selection algorithm ensures that the allocation strategy is continuously close to the optimal allocation strategy along with the iterative process. The requirement of rapid and dynamic task allocation is met, meanwhile, suboptimal strategies are rapidly obtained under specific conditions, and cooperative task allocation is achieved.

Description

Multi-unmanned aerial vehicle swarm cooperative task dynamic allocation method

Technical Field

The invention relates to the technical field of unmanned aerial vehicles, in particular to a dynamic allocation method for cooperative tasks of a multi-unmanned aerial vehicle swarm.

Background

The problems of high requirement and multiple factors needing to be considered exist in the multi-swarm cooperative task allocation process. In the actual battlefield environment, the task entering system is independent and random, and the time, the position, the type and the like of the task are difficult to predict, which brings a series of problems to the task allocation:

1. real-time performance: in the dynamic task allocation problem, the change of factors such as environment, members, tasks and the like requires an effective task allocation method to realize quick decision. Otherwise, the assignment algorithm takes too long resulting in a delay of the fighter plane. However, battlefield situations change instantaneously in the actual task allocation process, the appearance of new tasks is unpredictable, and the real-time response to the situations puts high requirements on a task allocation algorithm.

2. And (3) scale limitation: the application of drone swarm in future wars will be large-scale. However, the increase of the number of the unmanned aerial vehicles and the number of the targets enables the scale of the task allocation algorithm to increase at an exponential function speed, limits the scale of dynamic task allocation, and brings huge challenges to the solution of a satisfactory allocation strategy.

3. The coordination requirement is as follows: the large-scale unmanned aerial vehicle group aims at realizing multi-machine cooperation. The advantage that unmanned aerial vehicle carries out the task can be reflected in the cooperation of multimachine, gives play to unmanned aerial vehicle's performance better, improves the success rate of carrying out the task, reduce cost. For the collaboration, no suitable evaluation index is provided at present, and the collaboration is realized only from the aspects of grouping, executing task time sequence, carrying weapon grouping and the like.

4. Mixing property: the task distribution system should be able to manage different types of members, which may be different in terms of software structure, hardware composition, etc., and the tasks to be completed may also be different, requiring the task distribution system to be an open, extensible system.

The factors to be considered in the multi-unmanned aerial vehicle dynamic task allocation model are as follows:

1. threat: the threats existing in the process of executing the task by the unmanned aerial vehicle include known threats and unknown threats. Known threats are ground anti-air fire, radar and air threats that have been determined prior to performing a mission. Unknown threats are emerging threats and unforeseeable threats during the task execution of the unmanned aerial vehicle. The unmanned aerial vehicle task system needs to take corresponding measures for different threats, including threat assessment, threat reporting to other unmanned aerial vehicles and ground command centers, threat avoidance or attack implementation and the like.

2. Disorder: mainly refer to terrain obstacles, confirm before carrying out the task, need avoid the barrier when the task is distributed, bring new problem for unmanned aerial vehicle flight safety when low latitude is suddenly prevented: after entering an enemy defense area, a flight height that is too high will increase the probability of being threatened, and a flight height that is too low will increase the safety risk.

3. Own strength: when dynamic task allocation is carried out, own helicopters and the like need to be considered, and cooperation among all fighting forces is achieved.

In addition, in the problem of cooperative task allocation of the swarm, a strategy set which maximizes the total profit of the system is defined as an optimal strategy. The optimal strategy means that all unmanned aerial vehicles execute tasks in an optimal distribution mode, and cooperation of all unmanned aerial vehicles is achieved. The optimal strategy can be solved through monotonicity of a state space in a special problem by adopting a linear and nonlinear programming method and the like, but a plurality of constraint conditions are mutually coupled and have complex relations in multi-unmanned aerial vehicle cooperative task allocation, a plurality of variables are not found with monotonicity, and the optimal strategy is difficult to obtain by adopting the method.

Due to the change of battlefield situation in task allocation, no fixed allocation strategy can be circulated, and the decision at each moment has influence on the total benefit of the system. The optimal strategy is generally obtained by an intelligent optimization algorithm, and the optimization of the system structure also helps to obtain the optimal strategy.

The system has an optimal strategy. Because the total system time is a finite value, the state space is discrete and finite, the number of the unmanned aerial vehicles and the tasks is also finite, and the system selectable strategy set space is finite. Therefore, a certain strategy set in the system has performance not inferior to the performance of other possible solutions, and the solution is the optimal solution.

Although the optimal strategy certainly exists, the possible solution scale is huge, and the optimal strategy is extremely difficult to find. The method for obtaining the optimal solution by comparing all possible solutions in the possible solution space by adopting the traversal method cannot meet the requirement of rapid and dynamic task allocation, meanwhile, the global optimal strategy is not necessarily obtained under a specific condition, and the key for realizing the cooperative task allocation is to obtain the suboptimal strategy rapidly.

The invention is provided for overcoming the defects in the prior art.

Disclosure of Invention

In view of the above problems, the present invention provides a method for dynamically allocating cooperative tasks of a multi-drone swarm.

In order to realize the purpose of the invention, the technical scheme provided by the invention is as follows: a multi-unmanned aerial vehicle swarm cooperative task dynamic allocation method comprises the following steps:

s1, establishing a cooperative task allocation model;

based on a Multi-Agent system, regarding each unmanned aerial vehicle as one Agent, and distributing and executing tasks by endowing the agents with autonomous capacity;

the collaborative task allocation model is described as being composed of five recombinations:

{ Time, task, agent, policy _ Set, objective _ Function } (equation 1)

Wherein, time is Time, task is Task, agent is unmanned plane, policy _ Set is strategy Set, and obj partial _ Function is evaluation Function;

s2, optimizing a task allocation strategy of the collaborative task allocation model in the S1 based on a selection optimization method;

and making a circle with a certain radius by taking each task as a center, adjusting the selection of each task to the Agent in the circle to realize strategy optimization, ensuring that each iteration enables the distribution strategy of the Multi Agent system to move towards a more optimal direction by a selection mechanism, and obtaining the optimal distribution strategy by the Multi Agent system through limited iterations.

Wherein,

the task strategy optimization of the selection optimization method in the step S2 specifically comprises the following steps:

setting M agents and N tasks in a Multi Agent system;

step S21: by task T _k At the position of the circle center, r _k Making a circle for the radius; the radius of the circle is selected to satisfy: within each circle there is at least n _min Each Agent and the radius of each circle is larger than r _min 。

Step S22: generating an initial allocation strategy;

randomly selecting one Agent for each task in the Multi Agent system, and representing the decision variable at the moment as D _t (0) And calculating a profit value:

wherein

And satisfies:

when M is less than N, the (N-M) tasks are not distributed to execute;

step S23: randomly selecting a task

Adjust the task to

And satisfies the following conditions: a is a _k ≠a′ _k The allocation policy at this time is represented as D _t (1)；

Step S24: checking the rationality of the strategy: judgment of D _t (1) Whether or not there is: i, j belongs to N,

if present, will

Is adjusted to

Let D be _t (1) Satisfies the following conditions:

the allocation policy at this time is denoted as D _t (2) The strategic profit values are:

step S25: : for D _t (2) For the rest of tasks other than k, i

In turn select

Agent in the circle and represent the newly adjusted strategy as D _t (3) I is equal to N, such that

Satisfies the following conditions: v. of ₃ ≥v ₂ ；

Step S26: and updating the Multi Agent system optimal strategy. Defining an optimal policy as v _max . If v is _max ＜v ₃ Then v is _max ＝v ₃ ；

Step S27: if v is _max And meeting the requirement of the Multi Agent system, ending the optimizing process, or else, step S23.

The beneficial effects of the invention include:

the selective optimization method takes the selectable unmanned aerial vehicles in a certain range as the optimization objects, reduces the possible solution scale and improves the optimization speed. The optimal selection algorithm ensures that the allocation strategy is continuously close to the optimal allocation strategy along with the iterative process. The requirement of rapid and dynamic task allocation is met, meanwhile, a suboptimal strategy is rapidly obtained under a specific condition, and cooperative task allocation is realized.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a schematic diagram illustrating the calculation of the meeting time and position of the unmanned aerial vehicle and the task;

FIG. 2 is a schematic diagram illustrating the effect of the number of drones and the number of tasks on the possible solution size;

FIG. 3 is a schematic diagram illustrating the effect of the number of drones and the number of targets on the possible solution size;

FIG. 4 is a schematic diagram of the effect of task type on the possible solution size;

FIG. 5 is a flowchart of a policy optimization step of the selection optimization method.

Detailed Description

The technical solution in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings.

The invention discloses a multi-unmanned aerial vehicle swarm cooperative task allocation method, which comprises the following steps:

s1, establishing a collaborative task allocation model;

the Multi-unmanned aerial vehicle collaborative task allocation model is based on a Multi-Agent system, each unmanned aerial vehicle is regarded as one Agent, and tasks are allocated and executed by endowing the agents with autonomous capacity. Two types of agents are stored in the system: decision agents and member agents. The decision Agent has task allocation decision capability, acquires system state information and makes task allocation decision for the member Agent; and the member agents cooperatively execute the distribution task and feed back the state of the distribution task.

Each Agent in the collaborative task allocation model has different initial states, the Multi-Agent system allocates tasks according to task attributes and Agent states, and state information exchange and decision instruction issuing are completed through a data chain.

The collaborative task allocation model is described as follows:

based on a Multi Agent Markov decision process theory, with reference to a modeling method of the Markov theory, the model is established to be composed of the following five components:

{ Time, task, agent, policy _ Set, objective _ Function } (equation 1)

The parts are described as follows:

1. time (Time)

The Multi Agent system time is finite and has finite discrete time points t ₀ ,t ₁ ,t ₂ ,...,t _e Is represented by the formula (I) in which t ₀ 、t _e Respectively start and end times.

2. Task (Task)

In the collaborative task allocation model, the unmanned aerial vehicle executes a certain task or attacks on a certain target and the like are collectively called as tasks. The task type has a significant impact on the size of the task allocation. The more task types, the task execution order to be considered when distributing tasks will increase the possible solution size sharply. A

The increase in task types will dramatically increase the possible solution size, severely impacting dynamic task allocation efficiency. To simplify problem modeling, a single task type is employed, and tasks assigned to each Agent are treated as the same type of task. When each target enters the Multi Agent system, all tasks are decoupled into different and independent tasks through the task evaluation system, and different initial states are set for each task. By decoupling the tasks and controlling the time and the sequence of the tasks entering the task queue, the problems of processing the types and the execution sequence of the multiple tasks can be avoided, the possible solution scale is effectively reduced, and the efficiency is improved.

the newly-appeared task number of the Multi Agent system at the time t is n (t), obeys the distribution theta (n (t)), and n (t) belongs to theta and meets the following conditions:

where P (-) represents the probability and D (-) is the correlation function. All tasks in the Multi Agent system at the time t are expressed as:

[T ₁ ,T ₂ ,...,T _N ](formula 3)

T _n At time t the state is:

wherein,

respectively represent T time T _n The position and the speed of the sensor are two-dimensional vectors,

ζ _n performing a task T for an Agent _n Including threat costs, weapon consumptions, etc., at T _n When entering the Multi Agent system. Zeta when different agents execute the same task _n The same is true. To simplify the modeling process, it is assumed that each task enters the Multi Agent system with randomly determined speed and direction and is invariant in the course of executing the task. w is a _n Performing a task T for an Agent _n Is given a prize value of, and w _n ＞ζ _n Is greater than 0. The magnitude of the reward value reflects the importance of each task. A large value of reward indicates that the task is important, and the more revenue the Multi Agent system receives for performing the task.

The task queue Φ refers to the list of tasks that need to be executed currently:

Φ＝{T ₁ ,T ₂ ,…,T _k waiter, k =1,2, 3. (formula 5)

3. Unmanned plane (Agent)

Regarding each unmanned aerial vehicle with the capability of executing tasks and making autonomous decisions as one Agent, the Agent is expressed as:

a _m m =1,2,., M (formula 6)

The agents are set as follows: the Agent is a rigid body, the ground coordinates are inertia coordinates, the ground is regarded as a plane, and the gravity acceleration does not change along with the height. The state of each Agent changes with time in the process of executing the task, and the state information comprises:

wherein,

is a _m At the time of the state at the moment t,

respectively, the position and the velocity information,

is a _m Attack capability at time t.

Is a _m The task state is assigned at time t.

Indicating that the Agent is not currently assigned a task,

shows that Agent has currently assigned T _n 。

Setting:

(1) When the whole Agent executes the tasks, the speed is the same and is a fixed value, and when the tasks are not distributed, the Agent performs hovering flight at the current position at the fixed speed and is regarded as the position is unchanged;

(2) All agents consume the same in unit time;

(3) Each Agent can only execute one task at a time;

(4) Different agents have the same income value and different consumption values when completing the same task;

(5) The Agent is regarded as a mass point and does not consider the turning radius, the Agent communication capacity is limited, and the communication range is limited;

(6) The problem of collision avoidance between agents is not considered;

(7) The task allocation adopts a single-step planning mechanism;

(8) The route cost of an Agent to perform a task is proportional to the Agent's flight distance.

4. Policy Set (Policy _ Set)

the Agent and the task number in the Multi Agent system at the moment t are respectively marked as m and n, and the optional task allocation strategy can be expressed as follows:

D(t)＝{T ₁ ',T ₂ ',…,T _m '} ^T (formula 8)

Where D (T) is a matrix of m × 1, T _i '∈(T ₁ ,T ₂ ,…,T _n ). Considering the number of tasks and drones, the scale of the possible solution is:

wherein:

(equation 9) satisfies the following constraint:

(1) Each Agent can only execute one task at any time and can not execute any task;

(2) Each task can be executed by only one Agent at any time, and can not be executed.

As can be seen from (equation 9), the possible solution size of the Multi Agent system increases sharply as m and n increase.

And a single-step planning mechanism is adopted, namely, only one task is allocated to each Agent in each task allocation process, and the task of each Agent is dynamically adjusted according to the change of the battlefield situation in the task allocation process. The set of policies in the whole task allocation process of the Multi-Agent system is defined as a policy set, and is expressed as:

Ω＝{D(t ₀ ),D(t ₁ ),D(t ₂ ),…,D(t _e ) } (formula 11)

The strategy set reflects the task execution condition of each Agent in the whole task execution process, and is a basis for evaluating the cooperation among the agents.

5. Evaluation Function (Objective _ Function)

The evaluation function refers to an objective function in task allocation, and factors such as the flight distance of the unmanned aerial vehicle, the income value of an attack target, the weapon consumption value and the threat cost need to be considered. For example, when the unmanned aerial vehicle is required to complete a task in the shortest time, an objective function is set to aim at the shortest flight distance; when the unmanned aerial vehicle is required to execute the minimum consumption in the task, the consumption value is the minimum. The above factors are coupled with each other, and trade-off between the factors is required, and generally, the task allocation evaluation function is a set of the factors, and the weights of the factors are distinguished by weights.

The method comprises the steps of taking the income value obtained by all agents executing tasks in the whole task allocation process of a Multi Agent system as an objective function, enabling the task allocation to be the maximum objective of the objective function value, and introducing the flight distance and the task consumption value into the objective function through a time discount factor and a task consumption discount factor.

the profit value of the Multi Agent system at the moment t in decision D (t) and state S (t) is defined as:

wherein beta is _t Is a time discount factor, beta is more than or equal to 0 _t ≤1。β _t The smaller the revenue value obtained by the Multi Agent system decreases faster over time. Beta is a _t Without considering the influence of the passage of time on the profit value, =1 _t The profit value is not considered when = 0. Delta of _m And t is the time consumed in the task allocation decision making process and the time for the Agent to move to the target position after the decision is made. The decision time is generally short, and may not be considered, the flight time is proportional to the distance, and Δ t may be represented as: delta of _m t＝d _t (m,n)/V _m In which d is _t (m, n) is time a _m Fly to and T _n Flight distance at the time of encounter, V _m Is a _m The rate. d is a radical of _t And (m, n) is determined by Agent and task position, speed magnitude and direction together. Delta. For the preparation of a coating ^t For performing task consumption, including processing (N-N) _L ) Agent attack capability consumption zeta at individual task _n And the communication cost and the decision cost are determined during task allocation, the communication cost is independent of the execution process, and the decision cost is related to the possible solution scale and the adopted allocation algorithm. Beta is a beta _δ Discount primer for task consumption, beta _δ ≥0，β _δ =0 represents no consideration of task consumption. The communication cost only considers the communication cost when the Agent state is acquired for task distribution, and the cost is not counted by the communication between the agents when the task is executed.

ξ is a penalty function when n is present in a Multi Agent System _L When an individual task is not assigned to it,

represents the sum of the reward values of the unassigned tasks, eta is a penalty function factor, and eta is greater than or equal to 0.η =0 indicates that the influence of the penalty function on the profit value is not considered, and the larger η is, the heavier the penalty is for unallocated tasks, ensuring that the Multi Agent system completes more tasks as much as possible.

Under the cooperative task allocation model, both the Agent and the task are in motion states, the flight distance when the Agent meets the task is not the linear distance between the Agent and the task, and d _t The calculation of (m, n) is schematically shown in FIG. 1:

setting the initial time Agent position as (x) _a ,y _a ) Absolute value of velocity v _a In a direction of θ(ii) a The task position is (x) _T ,y _T ) And is made of

The Agent meets the task at the moment t, and meeting points meet the equation:

and (4) eliminating the variable theta to obtain an equation:

the condition for the solution of equation (equation 16) is:

for the solution to be meaningful, it must also satisfy: t is more than or equal to 0.

To obtain:

t＝max(t ₁ ,t ₂ ) When t is more than or equal to 0, t is effective solution, and when t is less than 0, the equation has no solution.

The heading and required flight time when the Agent is performing the task from (equation 18) available, and the path length is:

in summary, the Multi Agent system yields at time t are:

the total income Γ of the Multi Agent system in the whole task allocation process is as follows:

the complexity of the task allocation strategy is set forth as the size of the tasks and the number of drones increases. The rule of the possible solution of the cooperative task allocation of the multiple drones is determined by the formula 9. The Agent number in the Multi Agent system is set to be m, the task number is set to be n, and the influence of the increase of the Agent number and the task number on the possible solution scale is shown in fig. 2:

it was found by calculation that when m = n =20, the possible solution size Num =1.73 × 10 ²¹ . Finding an optimal strategy in such a large solution space is extremely difficult. To reduce the possible solution size, grouping of agents and tasks in a Multi-Agent system is an efficient method. For example, when the number of tasks is 20, the tasks are allocated after 20 agents are divided into 4 groups, and then the possible solution size becomes: num =4 × (3.13 × 10) ¹⁹ )＝1.25×10 ²⁰ In this case, the possible solution size is only 7.2% of the size before the grouping, and the larger the number of groupings, the more advantageous the reduction of the possible solution size. Therefore, grouping agents using a distributed model is an effective way to reduce the complexity of the problem.

In order to simplify the model, a single task type model is adopted, and task allocation is only performed once for each task, so that the possible solution scale is reduced. To discuss the effect of task type on possible solution size, assume that when the number of Agents is m and the number of tasks is N, N _m The smallest possible solution size for a particular task type is:

divide into N for each task _m The number of tasks to be distributed is nN _m There are m allocation patterns for each task, and the possible solution size of the Multi-Agent system is determined by equation (9) and is larger than the value calculated by equation (22). The influence of Agents and task numbers on the possible solution size is analyzed by equation (22), as shown in FIG. 3, whereN _m ＝1。

Task type N _m The effect on the possible solution size is shown in fig. 4:

as can be seen in FIG. 3, N _m The increase in (c) will also cause the possible solution space to grow at the rate of an exponential function, as will the number of tasks and agents.

the possible solution size is related to the number of the alternative agents of each task, and the reduction of the number of the alternative agents of each task can reduce the possible solution size. Through analyzing the optimal task allocation strategy, each task under the optimal task allocation strategy is basically executed by the agents within a certain range around the task. On one hand, the shorter route between the task and the Agent is beneficial to reducing the cost, and on the other hand, the task execution delay and the threat cost brought by the shorter route are lower.

When the optimizing method is selected to carry out task allocation strategy optimizing, a circle with a certain radius is made by taking each task as a center, and the selection of each task on the Agent in the circle is adjusted to realize strategy optimizing. The selection mechanism ensures that each iteration moves the Multi Agent system allocation policy to a more optimal direction. After a limited number of iterations, the Multi Agent system will obtain an optimal allocation strategy.

As shown in fig. 5, the strategy optimization steps of the selection optimization method of the present invention are as follows:

m agents and N tasks exist in the Multi Agent system.

Step S21: by task T _k At the position of the circle center, r _k Making a circle for the radius, wherein the radius of the circle is selected to meet the following conditions:

1. within each circle there is at least n _min Each Agent;

2. the radius of each circle is larger than r _min 。

Step S22: generating an initial allocation strategy: randomly selecting one Agent for each task in the Multi Agent system, and representing the decision variable at the moment as D _t (0) And calculating a profit value:

wherein

k belongs to N and satisfies:

when M < N, there will be (N-M) tasks that are not allocated for execution.

Step S23: randomly selecting a task

Adjust the task to

And satisfies the following conditions: a is a _k ≠a′ _k The allocation policy at this time is represented as D _t (1)。

Step S24: checking the rationality of the strategy: judgment of D _t (1) Whether or not there is: i, j ∈ N,

k ≠ i, if present, will

Is adjusted to

Let D be _t (1) Satisfies the following conditions:

denote the allocation policy at this time as D _t (2) The strategy profit value is:

step S25: : for D _t (2) For other tasks than k, i

In turn select

Satisfies the following conditions: v. of ₃ ≥v ₂ 。

Step S26: and updating the Multi Agent system optimal strategy. Defining an optimal policy as v _max . If v is _max ＜v ₃ Then v is _max ＝v ₃ 。

Step S27: if v is _max And (5) meeting the requirement of the Multi Agent system, finishing the optimizing process, and otherwise, step S23.

The optimization method is selected, and the selectable agents in a certain range are used as optimization objects, so that the possible solution scale is reduced, and the optimization speed is improved. The most preferred selection algorithm ensures that the allocation strategy is continuously close to the optimal allocation strategy along with the iterative process.

By adopting a selection optimization method and limited iteration, the Multi Agent system can obtain an optimal distribution strategy. At each moment, the task number and the Agent number in the Multi Agent system are finite values, and all the selectable allocation strategy numbers are finite values. When r is _k When the circle centered on each task is large enough to cover all the agents in the Multi Agent system, the rotation selection algorithm requires that the optimal strategy is selected in each selection process, which ensures that the Multi Agent system will obtain the optimal distribution strategy after a limited number of iterations.

The selection of the optimization algorithm can reduce the possible solution size. Suppose the Agent number in the Multi Agent system is m, a _i I =1,2, \8230;, m, number of tasks T _n And T is _n N is less than or equal to N. The number of the agents distributed around each task is x, and x is more than or equal to 1 and less than or equal to m. The possible solution scale for the optimization method is chosen as:

in the selection optimization method, the selection of the radius has an important influence on the obtained optimal strategy. When the radius of selection is infinite, the selectable agents of each task are all the agents in the Multi Agent system, so that the possible solution scale of the selection optimization method is the same as that of strategy optimization based on a genetic algorithm, and the selection optimization method can obtain the global optimal solution of the Multi Agent system. When the selection radius is reduced so that the possible solution scale of the Multi-Agent system is reduced, the optimal agents of certain tasks in the Multi-Agent system are likely to be out of the selection circle, and the Multi-Agent system cannot obtain the optimal distribution strategy. Therefore, in the implementation process, the selection of the selection radius needs to balance the possible solution size and the strategic performance.

The described embodiments are only some embodiments of the present application and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

Claims

1. A multi-unmanned aerial vehicle swarm cooperative task dynamic allocation method is characterized by comprising the following steps:

s1, establishing a collaborative task allocation model;

{ Time, task, agent, policy _ Set, objective _ Function } (equation 1)

2. The method of claim 1, wherein the method for dynamically allocating the cooperative tasks of the multi-drone swarm comprises the following steps:

setting M agents and N tasks in a Multi Agent system;

step S21: by task T _k At the position of the circle r _k Making a circle for the radius;

step S22: generating an initial allocation strategy;

wherein

And satisfies:

when M is less than N, the (N-M) tasks are not distributed to execute;

step S23: randomly selecting a task

Adjust the task to

And satisfies the following conditions: a is a _k ≠a′ _k The allocation policy at this time is denoted as D _t (1)；

k ≠ i, if present, will

Is adjusted to

Let D _t (1) Satisfies the following conditions:

step S25: : for D _t (2) For other tasks than k, i

In turn select

Agent within the circle and represent the newly adjusted strategy as D _t (3) I is equal to N, such that

Satisfies the following conditions: v. of ₃ ≥v ₂ ；

3. The method of claim 2, wherein the method for dynamically allocating the cooperative tasks of the multi-drone swarm comprises the following steps:

in step S21, the radius of the circle is selected to satisfy: within each circle there is at least n _min Each Agent and the radius of each circle is larger than r _min 。