CN115809547A - Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm - Google Patents
Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm Download PDFInfo
- Publication number
- CN115809547A CN115809547A CN202211459220.1A CN202211459220A CN115809547A CN 115809547 A CN115809547 A CN 115809547A CN 202211459220 A CN202211459220 A CN 202211459220A CN 115809547 A CN115809547 A CN 115809547A
- Authority
- CN
- China
- Prior art keywords
- agent
- particle
- target
- value
- task
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000002245 particle Substances 0.000 title claims abstract description 122
- 238000000034 method Methods 0.000 title claims abstract description 62
- 238000005457 optimization Methods 0.000 claims abstract description 19
- 230000035772 mutation Effects 0.000 claims abstract description 7
- 239000003795 chemical substances by application Substances 0.000 claims description 159
- 230000006870 function Effects 0.000 claims description 33
- 230000008569 process Effects 0.000 claims description 22
- 230000003044 adaptive effect Effects 0.000 claims description 15
- 238000010276 construction Methods 0.000 claims description 15
- 230000015572 biosynthetic process Effects 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 9
- 238000011156 evaluation Methods 0.000 claims description 3
- 230000009191 jumping Effects 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims description 3
- 230000007246 mechanism Effects 0.000 abstract description 3
- 230000008901 benefit Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 2
- 230000001788 irregular Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 238000012804 iterative process Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, which establishes a multi-agent cooperative task allocation model and realizes simultaneous optimization of multiple indexes such as hitting profit, resource consumption, damage probability and the like. Aiming at the characteristics of the problem of multi-agent cooperative task allocation, the multi-target particle swarm optimization and the non-dominated sorting algorithm are combined, the non-dominated sorting algorithm and a cross mutation mechanism are integrated, a nonlinear method for improving the value of the inertial weight is designed, a maximum distance method for obtaining the optimal solution of the pareto solution obtained by the algorithm is provided, and the global search optimization capability and the engineering application value of the algorithm are greatly improved. Compared with the traditional optimization method, the method has better convergence and accuracy when solving the multi-objective optimization problem in the dynamic environment.
Description
Technical Field
The invention belongs to the field of multi-agent cooperative control, and particularly relates to a non-dominated sorting-based multi-agent cooperative task allocation method based on an improved particle swarm algorithm.
Background
With the development of unmanned technology, the Agent can gradually replace human beings to execute various complex tasks in boring, severe and dangerous environments, such as rescue detection, large-range search, air defense suppression and striking, electronic attack, information reconnaissance and monitoring and the like. In the face of such complex task, it is impossible to independently execute and complete by means of a single agent, and often multiple agents are required to cooperate and cooperate with each other to complete together. Therefore, the rationality and effectiveness of the task allocation scheme is of great importance throughout the job.
In order to effectively perform task allocation on multiple intelligent agents, scholars at home and abroad carry out some researches. The existing research is mainly to solve the problems based on algorithms such as intelligent optimization algorithm, intelligent search and the like. For example, when a genetic algorithm is used for solving a problem, the defects of irregular codes, premature convergence of the algorithm and the like are easily caused, and the most common particle swarm algorithm solves the problem of irregular codes in the genetic algorithm, but the parameter adjustment is difficult, and the global and local searching capability of particles cannot be well balanced. Moreover, for a specific multi-objective optimization problem, a common intelligent optimization algorithm generally converts multiple objectives into a single-objective optimization problem in a weighting manner, and due to the subjectivity of weight selection, a problem that multiple optimization objectives cannot be well balanced easily occurs.
Disclosure of Invention
The invention aims to solve the problems in the prior art and provides a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
step 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting to obtain a pareto solution set;
and 5, obtaining a pareto optimal solution by a maximum distance method based on the pareto solution set obtained in the step 4.
Preferably, the step 1 of establishing a target profit model when the multi-agent performs the task according to the operating environment information specifically includes:
the equipment model adopted when the ith agent executes the task on the target j is recorded asThe equipment has the suitability degree ofThe hit rate to the target j isThe damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.Value of V corresponding to target j j And then the target value and income of the formation of the intelligent agent is as follows:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
where M is the number of agents, N is the number of targets,V j is the value of target j, V max Represents the target maximum value; x ij The allocation scheme for agents, represented by a task allocation decision matrix, is defined as follows:
preferably, the step 2 of establishing a loss cost function of task allocation of the agent according to the operating environment information specifically includes:
(1) Shortest flight distance index f 2
Is provided withFor the length of flight of the ith agent in selecting path p, since all agents may be confronted with multiple target tasks, note that target k is the first target point to which the agent flies,represents the other target points after k, where T max Representing the maximum target number of executables in one task of the intelligent agent; note D ik Is the flight path of the ith agent from the initial position to the k target point,is the flight path of the ith agent from the k target point to the r target point; the agent formation flight distance is then expressed as:
in the formula, L max Is the maximum flight distance of a unit agent in executing a task, M is the number of participating agents, N is the number of targets, L max M is a normalization factor;
in the formula, D ikmax The maximum flight range of the ith intelligent agent from the initial position to the k target point is shown;the maximum flight range of the ith intelligent agent from the k target point to the r target point is shown; t is max -1 refers to the maximum number of times the agent can perform a task;
therefore, the shortest flight distance index of the agent is:
(2) Minimum index f of self-loss cost 3
The loss cost minimum index is formalized as shown in the following formula:
in the formula, the model of the equipment used for executing the task on the target j is recorded as Is composed ofThe unit cost of the model equipment is,
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
preferably, the step 3 of building an overall model for multi-agent cooperative task allocation by adding various constraint conditions, which are faced by agent formation when the agent formation executes a task, based on the models obtained in the steps 1 and 2 specifically includes:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can only be executed by the agent once at most, namely:
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
wherein Z is imax The task load which can be borne by the ith agent;
for a task, all the number of tasks must be executed, namely:
wherein N is type The number of the executed task types is represented;
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing tasks by multiple agents, certain constraints exist on the radius of the executed tasks, namely:
wherein R is i I =1,2, M represents the working radius of the i-th agent.
Preferably, the step 4 is to solve the model obtained in the step 3, and the specific steps are as follows:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimum adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j Is the global optimum of the particle, w is the inertial weight; guiding the value of the inertia weight w by the difference value of the particle position and the current optimal position, and adjusting the size of the inertia weight w and the difference value of the ith particle j and the global optimal particle in a non-linear way according to the difference value of the valueIs composed of
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;indicating the globally optimal particle position at time j,represents the position of the ith particle at time j;is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
step 4.4, comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best ;
And 4.5, introducing cross and mutation operations: judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position; the method comprises the following specific steps:
(1) determining a threshold X of X min Cross ratio p c And the rate of variation p m ;
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein a cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle;
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue the search.
Preferably, the pareto optimal solution is obtained by a maximum distance method, which specifically includes:
for each non-dominant solution in the pareto solution set, calculating the maximum value of each objective function, and determining the vector of the worst index function as follows:
the distance formula for each non-dominated solution and worst case indicator function is established as follows:
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i The distance value obtained by the i-th non-dominant solution is shown. The solution with the largest distance D value will be selected as the final solution, worst _ F denotes the Worst index function,represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
The invention also discloses a multi-agent cooperative task allocation system based on the non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by utilizing the improved particle swarm optimization algorithm based on the non-dominated sorting.
The invention also discloses computer equipment which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the steps of the method are realized when the processor executes the computer program.
The invention also discloses a computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of the invention.
Has the advantages that:
the invention is improved on the basis of a multi-objective particle swarm algorithm, the value of the inertial weight is guided according to the difference degree between the particles and the current optimal particles, a non-dominated sorting algorithm and a cross variation mechanism are integrated, and a maximum distance method is designed to select the optimal solution from a pareto solution set.
Drawings
FIG. 1 is a flow chart of a multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments.
In one embodiment, in combination with fig. 1, a multi-agent cooperative task allocation method based on non-dominated sorting improved particle swarm optimization is provided, which comprises the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
and 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting.
And 5, obtaining a pareto optimal solution by a maximum distance method based on the pareto solution set obtained in the step 4.
Further, in one embodiment, the step 1 of establishing a target profit model in the task execution process of the multi-agent according to the operating environment information specifically includes:
the value benefit of the multi-agent attacking a single sub-target is the product of the damage degree of the sub-target and the value of the sub-target, and when the multi-agent executes tasks in a cluster unit, the damage degree of the sub-target refers to the damage degree of the multi-agent cluster on the sub-target.
The equipment model adopted when the ith agent executes the task on the target j is recorded asThe equipment has the suitability degree ofThe hit rate to the target j isThe damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.The value corresponding to target j is V j Then the target value gain of the intelligent agent formation is:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
where M is the number of agents, N is the number of targets,V j for each sub-target's value, V max Represents the target maximum value; x ij The allocation scheme for agents can be represented by a task allocation decision matrix, defined as follows:
further, in one embodiment, the step 2 of establishing a loss cost model during the task execution process of the multi-agent according to the operating environment information specifically includes:
(1) Shortest flight distance index f 2
Is provided withFor the length of flight of the ith agent formation when selecting path p, the k target node is the first target point on this path,represents the other nodes after the k target node, where T max Representing the maximum number of targets that the agent can execute in one task. Note D ik Is the flight path of the ith agent from the initial position to the k target nodes.Is the flight path of the ith agent from the k target node to the r target node. The agent formation flight distance can be expressed as:
in the formula, L max Is the maximum flight distance of a unit agent in executing a task, M is the number of participating agents, L max M is a normalization factor.
In the formula, D ikmax The maximum flight path of the ith agent from the initial position to the k target nodes is shown.The maximum flight path of the ith agent from the k target node to the r target node is shown. T is max -1 refers to the maximum number of times the agent can perform the task.
Therefore, the shortest flight distance index of the agent is:
(2) Minimum index f of self-loss cost 3
The loss amount cost minimum index is formalized as shown in the following formula:
in the formula, the model of the equipment used for executing the task on the target j is recorded as Is composed ofThe unit cost of the model equipment is,
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
further, in one embodiment, based on the models obtained in step 1 and step 2, adding constraints in the task execution process of the multi-agent, and establishing an overall model for multi-agent cooperative task allocation specifically includes:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can be executed by the agent only once at most, namely:
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
wherein Z is imax The task load that the ith agent can bear.
For a task, all the number of tasks must be executed, namely:
wherein N is type The number of types of tasks performed is represented.
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing the task by the multiple intelligent agents, factors such as fuel which can be carried by the intelligent agents are considered, the radius of the executed task has certain constraint, and the multi-intelligent agent is also considered in the modeling process. The flight course of the executed task should meet the self operation radius of the intelligent agent, namely:
wherein R is i I =1,2, M represents the working radius of the i-th agent.
Further, in one embodiment, the solving of the model obtained in step 3 by using the improved particle swarm optimization algorithm based on the non-dominated sorting in step 4 specifically includes:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimum adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j For the global optimum of the particle, w is the inertial weight. Because the value of the common inertia weight w is generally linearly decreased or increased and guidance of particles on the value of w is lacked in the iterative process, the value of the inertia weight w is guided by the difference value of the position of the particle and the current optimal position in the algorithm, the size of the inertia weight w is nonlinearly adjusted according to the difference value of the value, and the difference value of the ith particle j and the global optimal particle is adjustedIs composed of
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;indicating the globally optimal particle position at time j,represents the position of the ith particle at time j;is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
and (4) performing a step (4.4),comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best ;
And 4.5, introducing cross and mutation operations: and judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position. The method comprises the following specific steps:
(1) determining a threshold X of X min Cross rate p c And the rate of variation p m ;
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein the cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle.
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue the search.
Further, in one embodiment, the maximum value of each objective function is calculated according to each non-dominant solution in the resulting pareto solution set, and the vector of the worst indicator function is determined as follows:
The distance formula for each non-dominated solution and worst case index function is established as follows:
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i The distance value obtained by the i-th non-dominant solution is shown. Will select D i The solution with the largest value of D in the set, worst _ F representing the Worst indicator function,represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
The invention also discloses a multi-agent cooperative task allocation system based on the non-dominated sorting improved particle swarm algorithm, which comprises the following steps:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by utilizing the improved particle swarm optimization algorithm based on the non-dominated sorting.
The invention also discloses computer equipment which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the steps of the method are realized when the processor executes the computer program.
The invention also discloses a computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of the invention.
The invention solves the task allocation problem of the multi-agent based on the improved particle swarm optimization of non-dominated sorting. A cost function for measuring the quality of a task allocation scheme is established, an improved particle swarm algorithm is designed based on the model, a value taking method of inertial weight is designed, a non-dominated sorting and cross variation mechanism is introduced, a solution method of an optimal solution in a pareto solution set is provided, and compared with the traditional particle swarm algorithm, the convergence precision and the convergence speed are improved, and a better convergence result can be obtained.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are given by way of illustration of the principles of the present invention, but that various changes and modifications may be made without departing from the spirit and scope of the invention, and such changes and modifications are within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.
Claims (9)
1. A multi-agent cooperative task allocation method based on a non-dominated sorting improved particle swarm algorithm, the method comprising the following steps:
step 1, establishing a target profit model for multi-agent task allocation by combining operation environment information;
step 2, establishing a loss cost model for multi-agent task allocation by combining the operation environment information;
step 3, establishing a multi-agent cooperative task allocation model based on the multi-target function obtained in the step 1 and the step 2 and by combining constraint conditions in the task execution process of the agents;
step 4, solving the model obtained in the step 3 by using an improved particle swarm algorithm based on non-dominated sorting to obtain a pareto solution set;
and 5, obtaining a pareto optimal solution through a maximum distance method based on the pareto solution set obtained in the step 4.
2. The method for distributing the multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 1, wherein the step 1 of establishing a target profit model when the multi-agent executes the tasks according to the operating environment information specifically comprises the following steps:
the equipment model adopted when the ith agent executes the task on the target j is recorded asThe equipment has the suitability degree ofThe hit rate to the target j isThe damage P to the target j when the ith agent performs a task on the target j ij Determined by both, i.e.The value corresponding to target j is V j And then the target value and income of the formation of the intelligent agent is as follows:
P ij ×V j
thus, the overall revenue model for the overall multi-agent system task allocation is shown by the following equation:
where M is the number of agents, N is the number of targets,V j is the value of target j, V max Represents the target maximum value; x ij The allocation scheme for agents, represented by a task allocation decision matrix, is defined as follows:
3. the method for distributing multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 2, wherein the step 2 of establishing the loss cost function of the agent task distribution according to the operating environment information specifically comprises:
(1) Shortest flight distance index f 2
Is provided withFor the length of flight of the ith agent in selecting path p, since all agents may be confronted with multiple target tasks, note that target k is the first target point to which the agent flies,represents the other target points after k, where T max Representing the maximum target number of executables in one task of the intelligent agent; note D ik Is the flight path of the ith agent from the initial position to the k target point,is the flight path of the ith agent from the k target point to the r target point; the agent formation flight distance is then expressed as:
in the formula, L max Unit agent in executing taskM is the number of participating agents, N is the number of targets, L max M is a normalization factor;
in the formula, D ikmax The maximum flight range of the ith intelligent agent from the initial position to the k target point is shown;the maximum flight range of the ith intelligent agent from the k target point to the r target point is shown; t is max -1 refers to the maximum number of tasks that an agent can perform;
therefore, the shortest flight distance index of the agent is:
(2) Minimum index f of self-loss cost 3
The loss cost minimum index is formalized as shown in the following formula:
in the formula, the model of the equipment used for executing the task on the target j is recorded as Is composed ofThe unit cost of the model equipment is,
(3) Sub-target coverage maximum index f 4
The formalization of the index with the maximum sub-target coverage degree is shown as the following formula:
4. the multi-agent cooperative task allocation method based on the non-dominated sorting improved particle swarm algorithm as claimed in claim 3, wherein the step 3 is to add each constraint condition faced by agent formation in executing the task based on the models obtained in the steps 1 and 2, and establish an overall model of multi-agent cooperative task allocation, which specifically comprises:
and synthesizing the four indexes, performing multi-objective optimization, and obtaining an overall evaluation function, namely an overall model for multi-agent cooperative task allocation, as follows:
min f=[f 1 ;f 2 ;f 3 ;f 4 ]
the constraint conditions include:
(1) Multi-agent cooperative constraint c 1 : in order to ensure the cooperativity of the multiple agents in the task execution process and prevent the cooperation problems of invalid tasks, repeated tasks, mismatching of the number of agents and the target number and the like, certain constraint conditions need to be added to the established model, and the following two main points are provided:
for the agent, any one target point can only be executed by the agent once at most, namely:
for the target point, the number of tasks when the agent executes the tasks cannot exceed the task load that the agent can bear, namely:
wherein Z is imax The task load which can be borne by the ith agent;
for a task, all the number of tasks must be executed, namely:
wherein N is type The number of the executed task types is represented;
(2) Multi-agent operating radius constraint c 2 : in the process of cooperatively executing tasks by multiple agents, certain constraints exist on the radius of the executed tasks, namely:
wherein R is i I =1,2, M represents the working radius of the i-th agent.
5. The method for distributing multi-agent cooperative tasks based on the non-dominated sorting improved particle swarm algorithm according to claim 4, wherein the step 4 is to solve the model obtained in the step 3, and the specific steps are as follows:
step 4.1, initializing a particle swarm according to the constraint condition of multi-agent task allocation input in the step 3, randomly setting the speed and the position of each particle, setting t =0, and randomly generating an initial solution;
step 4.2, solving the fitness of each particle according to the overall model distributed by the multi-agent cooperative task, and storing the position and the fitness value of the particle in the individual extreme value p of the particle best In (1), all p are best The individual position of the optimal adaptive value and the adaptive value are stored in the global extreme value g best Performing the following steps;
step 4.3, update particle position and velocity
x i,j (t+1)=x i,j (t)+v i,j (t+1)
v i,j (t+1)=ω·v i,j (t)+c 1 r 1 [p i,j -x i,j (t)]+c 2 r 2 [p g,j -x i,j (t)]
Wherein x is position information of the particle, v represents velocity information of the particle, c 1 And c 2 Is a learning factor of a particle, r 1 And r 2 Is a random number between (0,1), p i,j For local optima of particles, p g,j Is the global optimum of the particle, w is the inertial weight; guiding the value of the inertia weight w by the difference value of the particle position and the current optimal position, and adjusting the size of the inertia weight w and the difference value of the ith particle j and the global optimal particle in a non-linear way according to the difference value of the valueIs composed of
Wherein D is the solution space dimension; x is the number of max 、x min Upper and lower bounds, respectively, of the particle position component;indicating the globally optimal particle position at time j,indicating the ith particle j timeThe position of (a);is the inertial weight of particle i at time j; w is a start 、w end Respectively an initial value and a final value of w;
step 4.4, comparing the adaptive value of each particle with the best position of the particle, and if the difference value of the adaptive value of each particle and the best position of the particle is within a preset range, taking the adaptive value of the current particle as the best position of the particle; and comparing all current p best And g best Update g best ;
And 4.5, introducing cross and mutation operations: judging whether cross variation exists according to the difference value X between the particle position component and the global optimal position, so that the particles can quickly jump out of the local optimal position; the method comprises the following specific steps:
(1) determining a threshold X of X min Cross rate p c And the rate of variation p m ;
(2) Judging the difference value X of the particles i i If the value is smaller than the threshold value, executing downwards, otherwise jumping out and not executing;
(3) choosing [0,1 ] for each dimension of particle i]The j-th dimension of the random number r is r ij If r is ij <p m Then, performing mutation operation:
x ij =x min +(x max -x min )r
(4) then judging the random number r corresponding to the j dimension of the particle ij Whether or not less than the crossover rate p c If so, performing cross operation on the jth dimension, wherein the cross object is a global optimal solution, and assigning the global optimal solution of the jth dimension to the jth dimension of the particle;
4.6, sorting the target functions by adopting a non-dominated sorting method, and selecting a next generation population according to a sorting result;
step 4.7, stopping searching and outputting a result when the algorithm reaches a preset stop condition; otherwise go to step 4.3 to continue searching.
6. The method according to claim 5, wherein the pareto optimal solution is obtained by a maximum distance method, and the method includes:
for each non-dominant solution in the pareto solution set, calculating the maximum value of each objective function, and determining the vector of the worst index function as follows:
the distance formula for each non-dominated solution and worst case index function is established as follows:
the set of distances D = { D can be obtained 1 ,D 2 ,...,D i In which D is i Represents a distance value obtained by the i-th non-dominant solution; the solution with the largest distance D value will be selected as the final solution, worst _ F denotes the Worst index function,represents the value of the nth objective function in the ith non-dominated solution, with n representing the order of the objective function.
7. The system for multi-agent cooperative task allocation method based on the non-dominated sorting improved particle swarm algorithm of any one of claims 1 to 6, which is characterized in that the system comprises:
the first construction module is used for establishing a task allocation target profit model in the task execution process of the intelligent agent according to the operation environment information;
the second construction module is used for establishing a loss cost model in the process of executing the task by the intelligent agent according to the operation environment information;
the third construction module is used for adding constraint conditions in the task execution process of the intelligent agent based on the models obtained by the first construction module and the second construction module, and establishing an integral model for multi-intelligent-agent cooperative task distribution;
and the solving module is used for solving the model obtained by the third building module by using the improved particle swarm optimization algorithm based on the non-dominated sorting.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the method of any of claims 1 to 6 are implemented when the computer program is executed by the processor.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211459220.1A CN115809547A (en) | 2022-11-17 | 2022-11-17 | Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211459220.1A CN115809547A (en) | 2022-11-17 | 2022-11-17 | Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115809547A true CN115809547A (en) | 2023-03-17 |
Family
ID=85483658
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211459220.1A Pending CN115809547A (en) | 2022-11-17 | 2022-11-17 | Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115809547A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113887919A (en) * | 2021-09-28 | 2022-01-04 | 南京理工大学 | Hybrid-discrete particle swarm algorithm-based multi-unmanned aerial vehicle cooperative task allocation method and system |
CN117009057A (en) * | 2023-08-04 | 2023-11-07 | 中国科学院软件研究所 | Concurrent transaction scheduling method based on dynamic value |
-
2022
- 2022-11-17 CN CN202211459220.1A patent/CN115809547A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113887919A (en) * | 2021-09-28 | 2022-01-04 | 南京理工大学 | Hybrid-discrete particle swarm algorithm-based multi-unmanned aerial vehicle cooperative task allocation method and system |
CN117009057A (en) * | 2023-08-04 | 2023-11-07 | 中国科学院软件研究所 | Concurrent transaction scheduling method based on dynamic value |
CN117009057B (en) * | 2023-08-04 | 2024-03-22 | 中国科学院软件研究所 | Concurrent transaction scheduling method based on dynamic value |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115809547A (en) | Multi-agent cooperative task allocation method based on non-dominated sorting and improved particle swarm algorithm | |
Cheng et al. | An innovative hybrid multi-objective particle swarm optimization with or without constraints handling | |
CN110490422B (en) | Target combat effectiveness situation assessment method based on game cloud model | |
CN113220033B (en) | Multi-heterogeneous unmanned aerial vehicle task allocation method based on improved multi-universe optimization algorithm | |
CN112422699B (en) | Unmanned aerial vehicle cluster action scheme generation method based on dynamic adjustment | |
CN111797966B (en) | Multi-machine collaborative global target distribution method based on improved flock algorithm | |
CN116048071A (en) | Mobile robot path planning method based on particle swarm and differential evolution algorithm | |
CN115933693A (en) | Robot path planning method based on adaptive chaotic particle swarm algorithm | |
CN114510072A (en) | Multi-unmanned aerial vehicle path planning method, terminal and medium based on evolution migration optimization | |
CN116933948A (en) | Prediction method and system based on improved seagull algorithm and back propagation neural network | |
CN113887919A (en) | Hybrid-discrete particle swarm algorithm-based multi-unmanned aerial vehicle cooperative task allocation method and system | |
Cao et al. | Autonomous maneuver decision of UCAV air combat based on double deep Q network algorithm and stochastic game theory | |
CN114840016A (en) | Rule heuristic-based multi-ant colony search submarine target cooperative path optimization method | |
Liang et al. | Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN–LSTM fusion network | |
CN113419524B (en) | Robot path learning and obstacle avoidance system and method combining deep Q learning | |
Ma et al. | CMSRAS: A novel chaotic multi-specular reflection optimization algorithm considering shared nodes | |
Zhang et al. | Design of the fruit fly optimization algorithm based path planner for UAV in 3D environments | |
Guo et al. | Data mining and application of ship impact spectrum acceleration based on PNN neural network | |
CN117519816A (en) | Internet of vehicles computing task unloading method based on sparrow search optimization strategy | |
CN116523158A (en) | Multi-unmanned aerial vehicle track planning method, device, equipment and storage medium | |
Gaowei et al. | Using multi-layer coding genetic algorithm to solve time-critical task assignment of heterogeneous UAV teaming | |
Hu et al. | Research on AUV global path planning based on multi-objective ant colony strategy | |
CN116360437A (en) | Intelligent robot path planning method, device, equipment and storage medium | |
CN110658833B (en) | Multi-AUV real-time rescue task allocation algorithm in underwater environment | |
Wei et al. | Long-term autonomous mission planning of AUV in large-scale complex marine environment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |