CN115809547A - Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm - Google Patents

Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm Download PDF

Info

Publication number
CN115809547A
CN115809547A CN202211459220.1A CN202211459220A CN115809547A CN 115809547 A CN115809547 A CN 115809547A CN 202211459220 A CN202211459220 A CN 202211459220A CN 115809547 A CN115809547 A CN 115809547A
Authority
CN
China
Prior art keywords
agent
particle
target
value
task
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211459220.1A
Other languages
Chinese (zh)
Inventor
高阳
彭张弛
钱晨
吴潇瑞
黄卓
陈庆伟
吴益飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Science and Technology
Original Assignee
Nanjing University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Science and Technology filed Critical Nanjing University of Science and Technology
Priority to CN202211459220.1A priority Critical patent/CN115809547A/en
Publication of CN115809547A publication Critical patent/CN115809547A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, which establishes a multi-agent cooperative task allocation model and realizes simultaneous optimization of multiple indexes such as hitting profit, resource consumption, damage probability and the like. Aiming at the characteristics of the problem of multi-agent cooperative task allocation, the multi-target particle swarm optimization and the non-dominated sorting algorithm are combined, the non-dominated sorting algorithm and a cross mutation mechanism are integrated, a nonlinear method for improving the value of the inertial weight is designed, a maximum distance method for obtaining the optimal solution of the pareto solution obtained by the algorithm is provided, and the global search optimization capability and the engineering application value of the algorithm are greatly improved. Compared with the traditional optimization method, the method has better convergence and accuracy when solving the multi-objective optimization problem in the dynamic environment.

Description

Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm
Technical Field
The invention belongs to the field of multi-agent cooperative control, and particularly relates to a non-dominated sorting-based multi-agent cooperative task allocation method based on an improved particle swarm algorithm.
Background
With the development of unmanned technology, the Agent can gradually replace human beings to execute various complex tasks in boring, severe and dangerous environments, such as rescue detection, large-range search, air defense suppression and striking, electronic attack, information reconnaissance and monitoring and the like. In the face of such complex task, it is impossible to independently execute and complete by means of a single agent, and often multiple agents are required to cooperate and cooperate with each other to complete together. Therefore, the rationality and effectiveness of the task allocation scheme is of great importance throughout the job.
In order to effectively perform task allocation on multiple intelligent agents, scholars at home and abroad carry out some researches. The existing research is mainly to solve the problems based on algorithms such as intelligent optimization algorithm, intelligent search and the like. For example, when a genetic algorithm is used for solving a problem, the defects of irregular codes, premature convergence of the algorithm and the like are easily caused, and the most common particle swarm algorithm solves the problem of irregular codes in the genetic algorithm, but the parameter adjustment is difficult, and the global and local searching capability of particles cannot be well balanced. Moreover, for a specific multi-objective optimization problem, a common intelligent optimization algorithm generally converts multiple objectives into a single-objective optimization problem in a weighting manner, and due to the subjectivity of weight selection, a problem that multiple optimization objectives cannot be well balanced easily occurs.
Disclosure of Invention
The invention aims to solve the problems in the prior art and provides a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
step 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting to obtain a pareto solution set;
and 5, obtaining a pareto optimal solution by a maximum distance method based on the pareto solution set obtained in the step 4.
Preferably, the step 1 of establishing a target profit model when the multi-agent performs the task according to the operating environment information specifically includes:
the equipment model adopted when the ith agent executes the task on the target j is recorded as
Figure BDA0003954780090000021
The equipment has the suitability degree of
Figure BDA0003954780090000022
The hit rate to the target j is
Figure BDA0003954780090000023
The damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.
Figure BDA0003954780090000024
Value of V corresponding to target j j And then the target value and income of the formation of the intelligent agent is as follows:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
Figure BDA0003954780090000025
where M is the number of agents, N is the number of targets,
Figure BDA0003954780090000026
V j is the value of target j, V max Represents the target maximum value; x ij The allocation scheme for agents, represented by a task allocation decision matrix, is defined as follows:
Figure BDA0003954780090000027
preferably, the step 2 of establishing a loss cost function of task allocation of the agent according to the operating environment information specifically includes:
(1) Shortest flight distance index f 2
Is provided with
Figure BDA0003954780090000028
For the length of flight of the ith agent in selecting path p, since all agents may be confronted with multiple target tasks, note that target k is the first target point to which the agent flies,
Figure BDA0003954780090000029
represents the other target points after k, where T max Representing the maximum target number of executables in one task of the intelligent agent; note D ik Is the flight path of the ith agent from the initial position to the k target point,
Figure BDA00039547800900000210
is the flight path of the ith agent from the k target point to the r target point; the agent formation flight distance is then expressed as:
Figure BDA00039547800900000211
in the formula, L max Is the maximum flight distance of a unit agent in executing a task, M is the number of participating agents, N is the number of targets, L max M is a normalization factor;
Figure BDA00039547800900000212
in the formula, D ikmax The maximum flight range of the ith intelligent agent from the initial position to the k target point is shown;
Figure BDA00039547800900000213
the maximum flight range of the ith intelligent agent from the k target point to the r target point is shown; t is max -1 refers to the maximum number of times the agent can perform a task;
therefore, the shortest flight distance index of the agent is:
Figure BDA0003954780090000031
(2) Minimum index f of self-loss cost 3
The loss cost minimum index is formalized as shown in the following formula:
Figure BDA0003954780090000032
in the formula, the model of the equipment used for executing the task on the target j is recorded as
Figure BDA0003954780090000033
Figure BDA0003954780090000034
Is composed of
Figure BDA0003954780090000035
The unit cost of the model equipment is,
Figure BDA0003954780090000036
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
Figure BDA0003954780090000037
preferably, the step 3 of building an overall model for multi-agent cooperative task allocation by adding various constraint conditions, which are faced by agent formation when the agent formation executes a task, based on the models obtained in the steps 1 and 2 specifically includes:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can only be executed by the agent once at most, namely:
Figure BDA0003954780090000038
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
Figure BDA0003954780090000039
wherein Z is imax The task load which can be borne by the ith agent;
for a task, all the number of tasks must be executed, namely:
Figure BDA0003954780090000041
wherein N is type The number of the executed task types is represented;
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing tasks by multiple agents, certain constraints exist on the radius of the executed tasks, namely:
Figure BDA0003954780090000042
wherein R is i I =1,2, M represents the working radius of the i-th agent.
Preferably, the step 4 is to solve the model obtained in the step 3, and the specific steps are as follows:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimum adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j Is the global optimum of the particle, w is the inertial weight; guiding the value of the inertia weight w by the difference value of the particle position and the current optimal position, and adjusting the size of the inertia weight w and the difference value of the ith particle j and the global optimal particle in a non-linear way according to the difference value of the value
Figure BDA0003954780090000043
Is composed of
Figure BDA0003954780090000044
Figure BDA0003954780090000045
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;
Figure BDA0003954780090000046
indicating the globally optimal particle position at time j,
Figure BDA0003954780090000047
represents the position of the ith particle at time j;
Figure BDA0003954780090000048
is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
step 4.4, comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best
And 4.5, introducing cross and mutation operations: judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position; the method comprises the following specific steps:
(1) determining a threshold X of X min Cross ratio p c And the rate of variation p m
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein a cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle;
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue the search.
Preferably, the pareto optimal solution is obtained by a maximum distance method, which specifically includes:
for each non-dominant solution in the pareto solution set, calculating the maximum value of each objective function, and determining the vector of the worst index function as follows:
Figure BDA0003954780090000051
wherein the content of the first and second substances,
Figure BDA0003954780090000052
a value representing an nth objective function in an ith non-dominated solution;
the distance formula for each non-dominated solution and worst case indicator function is established as follows:
Figure BDA0003954780090000053
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i The distance value obtained by the i-th non-dominant solution is shown. The solution with the largest distance D value will be selected as the final solution, worst _ F denotes the Worst index function,
Figure BDA0003954780090000054
represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
The invention also discloses a multi-agent cooperative task allocation system based on the non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by utilizing the improved particle swarm optimization algorithm based on the non-dominated sorting.
The invention also discloses computer equipment which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the steps of the method are realized when the processor executes the computer program.
The invention also discloses a computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of the invention.
Has the advantages that:
the invention is improved on the basis of a multi-objective particle swarm algorithm, the value of the inertial weight is guided according to the difference degree between the particles and the current optimal particles, a non-dominated sorting algorithm and a cross variation mechanism are integrated, and a maximum distance method is designed to select the optimal solution from a pareto solution set.
Drawings
FIG. 1 is a flow chart of a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments.
In one embodiment, in combination with fig. 1, a multi-agent cooperative task allocation method based on non-dominated sorting improved particle swarm optimization is provided, which comprises the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
and 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting.
And 5, obtaining a pareto optimal solution by a maximum distance method based on the pareto solution set obtained in the step 4.
Further, in one embodiment, the step 1 of establishing a target profit model in the task execution process of the multi-agent according to the operating environment information specifically includes:
the value benefit of the multi-agent attacking a single sub-target is the product of the damage degree of the sub-target and the value of the sub-target, and when the multi-agent executes tasks in a cluster unit, the damage degree of the sub-target refers to the damage degree of the multi-agent cluster on the sub-target.
The equipment model adopted when the ith agent executes the task on the target j is recorded as
Figure BDA0003954780090000071
The equipment has the suitability degree of
Figure BDA0003954780090000072
The hit rate to the target j is
Figure BDA0003954780090000073
The damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.
Figure BDA0003954780090000074
The value corresponding to target j is V j Then the target value gain of the intelligent agent formation is:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
Figure BDA0003954780090000075
where M is the number of agents, N is the number of targets,
Figure BDA0003954780090000076
V j for each sub-target's value, V max Represents the target maximum value; x ij The allocation scheme for agents can be represented by a task allocation decision matrix, defined as follows:
Figure BDA0003954780090000077
further, in one embodiment, the step 2 of establishing a loss cost model during the task execution process of the multi-agent according to the operating environment information specifically includes:
(1) Shortest flight distance index f 2
Is provided with
Figure BDA0003954780090000078
For the length of flight of the ith agent formation when selecting path p, the k target node is the first target point on this path,
Figure BDA0003954780090000079
represents the other nodes after the k target node, where T max Representing the maximum number of targets that the agent can execute in one task. Note D ik Is the flight path of the ith agent from the initial position to the k target nodes.
Figure BDA00039547800900000710
Is the flight path of the ith agent from the k target node to the r target node. The agent formation flight distance can be expressed as:
Figure BDA00039547800900000711
in the formula, L max Is the maximum flight distance of a unit agent in executing a task, M is the number of participating agents, L max M is a normalization factor.
Figure BDA00039547800900000712
In the formula, D ikmax The maximum flight path of the ith agent from the initial position to the k target nodes is shown.
Figure BDA00039547800900000713
The maximum flight path of the ith agent from the k target node to the r target node is shown. T is max -1 refers to the maximum number of times the agent can perform the task.
Therefore, the shortest flight distance index of the agent is:
Figure BDA0003954780090000081
(2) Minimum index f of self-loss cost 3
The loss amount cost minimum index is formalized as shown in the following formula:
Figure BDA0003954780090000082
in the formula, the model of the equipment used for executing the task on the target j is recorded as
Figure BDA0003954780090000083
Figure BDA0003954780090000084
Is composed of
Figure BDA0003954780090000085
The unit cost of the model equipment is,
Figure BDA0003954780090000086
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
Figure BDA0003954780090000087
further, in one embodiment, based on the models obtained in step 1 and step 2, adding constraints in the task execution process of the multi-agent, and establishing an overall model for multi-agent cooperative task allocation specifically includes:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can be executed by the agent only once at most, namely:
Figure BDA0003954780090000088
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
Figure BDA0003954780090000091
wherein Z is imax The task load that the ith agent can bear.
For a task, all the number of tasks must be executed, namely:
Figure BDA0003954780090000092
wherein N is type The number of types of tasks performed is represented.
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing the task by the multiple intelligent agents, factors such as fuel which can be carried by the intelligent agents are considered, the radius of the executed task has certain constraint, and the multi-intelligent agent is also considered in the modeling process. The flight course of the executed task should meet the self operation radius of the intelligent agent, namely:
Figure BDA0003954780090000093
wherein R is i I =1,2, M represents the working radius of the i-th agent.
Further, in one embodiment, the solving of the model obtained in step 3 by using the improved particle swarm optimization algorithm based on the non-dominated sorting in step 4 specifically includes:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimum adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j For the global optimum of the particle, w is the inertial weight. Because the value of the common inertia weight w is generally linearly decreased or increased and guidance of particles on the value of w is lacked in the iterative process, the value of the inertia weight w is guided by the difference value of the position of the particle and the current optimal position in the algorithm, the size of the inertia weight w is nonlinearly adjusted according to the difference value of the value, and the difference value of the ith particle j and the global optimal particle is adjusted
Figure BDA0003954780090000094
Is composed of
Figure BDA0003954780090000095
Figure BDA0003954780090000101
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;
Figure BDA0003954780090000102
indicating the globally optimal particle position at time j,
Figure BDA0003954780090000103
represents the position of the ith particle at time j;
Figure BDA0003954780090000104
is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
and (4) performing a step (4.4),comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best
And 4.5, introducing cross and mutation operations: and judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position. The method comprises the following specific steps:
(1) determining a threshold X of X min Cross rate p c And the rate of variation p m
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein the cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle.
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue the search.
Further, in one embodiment, the maximum value of each objective function is calculated according to each non-dominant solution in the resulting pareto solution set, and the vector of the worst indicator function is determined as follows:
Figure BDA0003954780090000105
wherein the content of the first and second substances,
Figure BDA0003954780090000106
representing the value of the nth objective function in the ith non-dominated solution.
The distance formula for each non-dominated solution and worst case index function is established as follows:
Figure BDA0003954780090000107
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i The distance value obtained by the i-th non-dominant solution is shown. Will select D i The solution with the largest value of D in the set, worst _ F representing the Worst indicator function,
Figure BDA0003954780090000108
represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
The invention also discloses a multi-agent cooperative task allocation system based on the non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by utilizing the improved particle swarm optimization algorithm based on the non-dominated sorting.
The invention also discloses computer equipment which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the steps of the method are realized when the processor executes the computer program.
The invention also discloses a computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of the invention.
The invention solves the task allocation problem of the multi-agent based on the improved particle swarm optimization of non-dominated sorting. A cost function for measuring the quality of a task allocation scheme is established, an improved particle swarm algorithm is designed based on the model, a value taking method of inertial weight is designed, a non-dominated sorting and cross variation mechanism is introduced, a solution method of an optimal solution in a pareto solution set is provided, and compared with the traditional particle swarm algorithm, the convergence precision and the convergence speed are improved, and a better convergence result can be obtained.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are given by way of illustration of the principles of the present invention, but that various changes and modifications may be made without departing from the spirit and scope of the invention, and such changes and modifications are within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (9)

1. A multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, the method comprising the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
step 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting to obtain a pareto solution set;
and 5, obtaining a pareto optimal solution through a maximum distance method based on the pareto solution set obtained in the step 4.
2. The method for distributing the multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 1, wherein the step 1 of establishing a target profit model when the multi-agent executes the tasks according to the operating environment information specifically comprises the following steps:
the equipment model adopted when the ith agent executes the task on the target j is recorded as
Figure FDA0003954780080000011
The equipment has the suitability degree of
Figure FDA0003954780080000012
The hit rate to the target j is
Figure FDA0003954780080000013
The damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.
Figure FDA0003954780080000014
The value corresponding to target j is V j And then the target value and income of the formation of the intelligent agent is as follows:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
Figure FDA0003954780080000015
where M is the number of agents, N is the number of targets,
Figure FDA0003954780080000016
V j is the value of target j, V max Represents the target maximum value; x ij The allocation scheme for agents, represented by a task allocation decision matrix, is defined as follows:
Figure FDA0003954780080000017
3. the method for distributing multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 2, wherein the step 2 of establishing the loss cost function of the agent task distribution according to the operating environment information specifically comprises:
(1) Shortest flight distance index f 2
Is provided with
Figure FDA0003954780080000018
For the length of flight of the ith agent in selecting path p, since all agents may be confronted with multiple target tasks, note that target k is the first target point to which the agent flies,
Figure FDA0003954780080000019
represents the other target points after k, where T max Representing the maximum target number of executables in one task of the intelligent agent; note D ik Is the flight path of the ith agent from the initial position to the k target point,
Figure FDA0003954780080000021
is the flight path of the ith agent from the k target point to the r target point; the agent formation flight distance is then expressed as:
Figure FDA0003954780080000022
in the formula, L max Unit agent in executing taskM is the number of participating agents, N is the number of targets, L max M is a normalization factor;
Figure FDA0003954780080000023
in the formula, D ikmax The maximum flight range of the ith intelligent agent from the initial position to the k target point is shown;
Figure FDA0003954780080000024
the maximum flight range of the ith intelligent agent from the k target point to the r target point is shown; t is max -1 refers to the maximum number of tasks that an agent can perform;
therefore, the shortest flight distance index of the agent is:
Figure FDA0003954780080000025
(2) Minimum index f of self-loss cost 3
The loss cost minimum index is formalized as shown in the following formula:
Figure FDA0003954780080000026
in the formula, the model of the equipment used for executing the task on the target j is recorded as
Figure FDA0003954780080000027
Figure FDA0003954780080000028
Is composed of
Figure FDA0003954780080000029
The unit cost of the model equipment is,
Figure FDA00039547800800000210
Figure FDA00039547800800000211
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
Figure FDA00039547800800000212
4. the multi-agent cooperative task allocation method based on the non-dominated sorting improved particle swarm algorithm as claimed in claim 3, wherein the step 3 is to add each constraint condition faced by agent formation in executing the task based on the models obtained in the steps 1 and 2, and establish an overall model of multi-agent cooperative task allocation, which specifically comprises:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can only be executed by the agent once at most, namely:
Figure FDA0003954780080000031
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
Figure FDA0003954780080000032
wherein Z is imax The task load which can be borne by the ith agent;
for a task, all the number of tasks must be executed, namely:
Figure FDA0003954780080000033
wherein N is type The number of the executed task types is represented;
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing tasks by multiple agents, certain constraints exist on the radius of the executed tasks, namely:
Figure FDA0003954780080000034
wherein R is i I =1,2, M represents the working radius of the i-th agent.
5. The method for distributing multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 4, wherein the step 4 is to solve the model obtained in the step 3, and the specific steps are as follows:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimal adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j Is the global optimum of the particle, w is the inertial weight; guiding the value of the inertia weight w by the difference value of the particle position and the current optimal position, and adjusting the size of the inertia weight w and the difference value of the ith particle j and the global optimal particle in a non-linear way according to the difference value of the value
Figure FDA0003954780080000041
Is composed of
Figure FDA0003954780080000042
Figure FDA0003954780080000043
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;
Figure FDA0003954780080000044
indicating the globally optimal particle position at time j,
Figure FDA0003954780080000045
indicating the ith particle j timeThe position of (a);
Figure FDA0003954780080000046
is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
step 4.4, comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best
And 4.5, introducing cross and mutation operations: judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position; the method comprises the following specific steps:
(1) determining a threshold X of X min Cross rate p c And the rate of variation p m
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein the cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle;
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue searching.
6. The method according to claim 5, wherein the pareto optimal solution is obtained by a maximum distance method, and the method includes:
for each non-dominant solution in the pareto solution set, calculating the maximum value of each objective function, and determining the vector of the worst index function as follows:
Figure FDA0003954780080000051
wherein the content of the first and second substances,
Figure FDA0003954780080000052
a value representing an nth objective function in an ith non-dominated solution;
the distance formula for each non-dominated solution and worst case index function is established as follows:
Figure FDA0003954780080000053
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i Represents a distance value obtained by the i-th non-dominant solution; the solution with the largest distance D value will be selected as the final solution, worst _ F denotes the Worst index function,
Figure FDA0003954780080000054
represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
7. The system for multi-agent cooperative task allocation method based on the non-dominated sorting improved particle swarm algorithm of any one of claims 1 to 6, which is characterized in that the system comprises:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by using the improved particle swarm optimization algorithm based on the non-dominated sorting.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the method of any of claims 1 to 6 are implemented when the computer program is executed by the processor.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 6.
CN202211459220.1A 2022-11-17 2022-11-17 Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm Pending CN115809547A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211459220.1A CN115809547A (en) 2022-11-17 2022-11-17 Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211459220.1A CN115809547A (en) 2022-11-17 2022-11-17 Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm

Publications (1)

Publication Number Publication Date
CN115809547A true CN115809547A (en) 2023-03-17

Family

ID=85483658

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211459220.1A Pending CN115809547A (en) 2022-11-17 2022-11-17 Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm

Country Status (1)

Country Link
CN (1) CN115809547A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113887919A (en) * 2021-09-28 2022-01-04 南京理工大学 Hybrid-discrete particle swarm algorithm-based multi-unmanned aerial vehicle cooperative task allocation method and system
CN117009057A (en) * 2023-08-04 2023-11-07 中国科学院软件研究所 Concurrent transaction scheduling method based on dynamic value

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113887919A (en) * 2021-09-28 2022-01-04 南京理工大学 Hybrid-discrete particle swarm algorithm-based multi-unmanned aerial vehicle cooperative task allocation method and system
CN117009057A (en) * 2023-08-04 2023-11-07 中国科学院软件研究所 Concurrent transaction scheduling method based on dynamic value
CN117009057B (en) * 2023-08-04 2024-03-22 中国科学院软件研究所 Concurrent transaction scheduling method based on dynamic value

Similar Documents

Publication Publication Date Title
CN115809547A (en) Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm
Cheng et al. An innovative hybrid multi-objective particle swarm optimization with or without constraints handling
CN110490422B (en) Target combat effectiveness situation assessment method based on game cloud model
CN113220033B (en) Multi-heterogeneous unmanned aerial vehicle task allocation method based on improved multi-universe optimization algorithm
CN114510072B (en) Multi-unmanned aerial vehicle path planning method, terminal and medium based on evolution migration optimization
Behjat et al. Adaptive genomic evolution of neural network topologies (agent) for state-to-action mapping in autonomous agents
CN104539601A (en) Reliability analysis method and system for dynamic network attack process
CN115933693A (en) Robot path planning method based on adaptive chaotic particle swarm algorithm
Cao et al. Autonomous maneuver decision of UCAV air combat based on double deep Q network algorithm and stochastic game theory
Liang et al. Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN–LSTM fusion network
Zhang et al. Design of the fruit fly optimization algorithm based path planner for UAV in 3D environments
CN113722980A (en) Ocean wave height prediction method, system, computer equipment, storage medium and terminal
CN111797966B (en) Multi-machine collaborative global target distribution method based on improved flock algorithm
CN116933948A (en) Prediction method and system based on improved seagull algorithm and back propagation neural network
Guo et al. Data mining and application of ship impact spectrum acceleration based on PNN neural network
CN116523158A (en) Multi-unmanned aerial vehicle track planning method, device, equipment and storage medium
Gaowei et al. Using multi-layer coding genetic algorithm to solve time-critical task assignment of heterogeneous UAV teaming
Ma et al. CMSRAS: A novel chaotic multi-specular reflection optimization algorithm considering shared nodes
CN113419524B (en) Robot path learning and obstacle avoidance system and method combining deep Q learning
CN114840016A (en) Rule heuristic-based multi-ant colony search submarine target cooperative path optimization method
CN110658833B (en) Multi-AUV real-time rescue task allocation algorithm in underwater environment
Hu et al. Research on AUV global path planning based on multi-objective ant colony strategy
Wei et al. Long-term autonomous mission planning of AUV in large-scale complex marine environment
Cui An Improved Chaotic Self-Adapting Monkey Algorithm for Multi-UAV Task Assignment
Wu et al. Harris hawks optimization for solving flexible jobshop scheduling problem considering energy consumption

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination