CN114043476B

CN114043476B - Swarm robot control method based on particle swarm algorithm under rejection environment

Info

Publication number: CN114043476B
Application number: CN202111301771.0A
Authority: CN
Inventors: 张军旗; 刘欢; 王成; 臧笛; 刘春梅; 康琦
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2021-11-04
Filing date: 2021-11-04
Publication date: 2023-03-31
Anticipated expiration: 2041-11-04
Also published as: CN114043476A

Abstract

The invention relates to a swarm robot control method based on a particle swarm algorithm in a rejection environment, which comprises the following steps: step 1, establishing an attack and defense confrontation scene in a rejection environment, and initializing parameters of a particle swarm algorithm; step 2, the attacking robot detects surrounding environment information through a sensor, acquires situation information of friend and enemy robots, and calculates the position of the enemy territory in real time by using an inertial navigation technology; step 3, constructing a fitness function by the attack robot; step 4, optimizing a fitness function by utilizing a particle swarm algorithm to obtain the optimal occupation of the attacking robot; step 5, moving and attacking the attacking robot; step 6, if any attacking robot enters enemy territory, the task is completed; otherwise, judging whether the maximum operation time is reached, if so, failing the task; otherwise, turning to step 2 to iterate the next time slice. Compared with the prior art, the method avoids global positioning, the control does not need pre-training, and the problem of dimension disaster is solved.

Description

Swarm robot control method based on particle swarm algorithm under rejection environment

Technical Field

The invention relates to the field of cooperative and game confrontation control among multi-agent clusters, in particular to a swarm robot control method based on a particle swarm algorithm in a rejection environment.

Background

With the rapid development and mature application of intelligent unmanned technology, cooperative combat among unmanned devices has become possible, and cooperative attack and defense confrontation among unmanned clusters gradually becomes an important mode of future war. As an application carrier of the multi-agent technology, the unmanned cluster judges the surrounding situation through sensing the environment, and takes the actions of fire gathering attack, wounded recoiling, obstacle avoidance, group collision avoidance, dispersion, concentration, cooperation, assistance and the like according to a certain attack and defense strategy to realize attack and defense confrontation.

The cooperative attack and defense confrontation of the group of robots can be described as an optimal decision problem under complex multi-constraint conditions, and the most classical is the problem of earth guard. In this problem, the confrontation environment is composed of two multiple intelligent groups of an intruder and a defender. Where the intruder is intended to try to get as close to and into a territory as possible and the defender is intended to intercept the intruder as far from the territory as possible. The advantages and disadvantages of the situation under the confrontation environment depend on the relationship among an invader, a defender and the territory, and because the state space dimension of the multi-agent attack-defense confrontation task is high, the strategy solving space is exponentially increased along with the increase of the scale of the entity object, the situation is complex and fast to change, attack-defense strategies are various, the solving difficulty is large, and an efficient decision algorithm is needed.

The most popular group confrontation method at present is a multi-agent deep reinforcement learning method. However, such algorithms require extensive pre-training, are limited by dimensional disaster problems, and rely on accurate global positioning and communication, failing to achieve effective collaboration and countermeasures in a denial environment.

Disclosure of Invention

The invention aims to overcome the defects in the prior art and provide a swarm robot control method based on a particle swarm algorithm in a rejection environment, and the control method makes up the bottleneck problem that global positioning and communication are limited by dimension disasters since most of multi-agent algorithms currently exist.

The purpose of the invention can be realized by the following technical scheme:

the invention provides a swarm robot control method based on a particle swarm algorithm in a rejection environment, which comprises the following steps:

step 1, establishing an attack and defense confrontation scene in a refused environment, and initializing parameters of a particle swarm algorithm;

step 2, the attacking robot detects surrounding environment information through a sensor, acquires situation information of the friend robot and the enemy robot, and calculates the position of the enemy territory in real time by utilizing an inertial navigation technology

Step 3, the attacking robot constructs a fitness function containing enemy robot state information, friend robot state information and enemy territory information by using the detected surrounding environment information;

step 4, optimizing the fitness function by utilizing a particle swarm algorithm to obtain the optimal occupation of the attacking robot and guide the attacking robot to move and attack in the next time slice;

step 5, the attacking robot carries out moving and attacking operations;

step 6, if an attacking robot enters enemy territory, the task is completed; otherwise, judging whether the maximum running time is reached, if so, failing the task; otherwise, the iteration of the next time slice is carried out in the step 2.

Preferably, the attack and defense confrontation scene in the rejection environment is established in step 1, and parameters of the particle swarm algorithm are initialized, specifically: initializing the positions of N attacking robots, the positions of M defending robots and a GPS positioning coordinate U of enemy territory; initializing initial number of particles in particle swarm optimization

Acceleration factor c ₁ And c ₂ Inertial weight w and dimension of problem D;

all the robots have the same attribute and have P-point blood volume, when the robots are attacked, the blood volume is reduced by P, and when the blood volume of the robots is less than or equal to 0, the robots are killed.

Preferably, the step 2 specifically comprises: each attacking robot constructs a coordinate system according to the position and the moving direction of the attacking robot; the method comprises the steps of detecting surrounding environment information through a sensor, obtaining coordinates of a friend robot and an enemy robot, and calculating the position of enemy territory in real time by utilizing an inertial navigation technology

Preferably, in step 3, the attack robot constructs a fitness function of the attack robot by using the detected ambient environment information, specifically: attacking robot A _i Constructing a fitness function F according to the state information of the enemy robot, the state information of the friend robot and the information of the enemy territory _i The expression is as follows:

F _i ＝f ₁ +f ₂ +f ₃

wherein f is ₁ Is a confrontation fitness f constructed according to the state information of the enemy robot ₂ Is a cooperative fitness f constructed according to the state information of the friend robot ₃ Is the adaptability of enemy to soil.

Preferably, said fitness opposition function f ₁ The expression of (a) is:

where Ψ is the attacking robot A _i Neighborhood region R ₁ The indices of all enemy robots within,

is a sub-fitness function generated according to the state information of the kth enemy robot;

Is the coordinates of the kth enemy robot; x = (x) ¹ ，x ² ) Is an independent variable representing the coordinates of a certain position in the rectangular coordinate system of the attack robot; sigma ₁ And w ₁ Respectively defining the width and the amplitude of the Gaussian-like model; s (i, k) is the measurement of the ith attack robot A _i Whether the situation of (2) is better than that of the enemy robot B _k Determines the attacking robot A _i Is to enemy robot B _k Whether to move or to back is expressed as:

the situation information of the robot is obtained by calculation according to the number and the total blood volume of friend robots, wherein '-1' and '1' respectively represent an attacking robot A _i In unfavorable and favorable situations; n is a radical of _i And

respectively represent attack robots A _i Scope of attackR ₀ The number and total blood volume values of all friend robots within; m _k And &>

Respectively enemy robot B _k The number of all enemy robots and the total blood volume value within the attack range R0;

when S (i, k) = -1, f _Bk Is a valley-shaped function which represents the attacking robot A _i Robot B away from enemy _k The farther away, the attacking robot A _i The higher the fitness of (a);

when S (i, k) =1, f _Bk Is a peak-shaped function and represents the attacking robot A _i With enemy robot B _k The closer the approach, the higher the fitness of the attacking robot Ai.

Preferably, the attack robot A _i Of the cooperative fitness function f ₂ The expression of (a) is:

where Φ is the attacking robot A _i Neighborhood region R ₁ Indices of all friends in the content; f. of _Ak Is a sub-fitness function generated according to the state information of the kth attack robot, wherein

Is the coordinates of the kth attacker; x = (x) ¹ ，x ² ) Is an independent variable representing the coordinate of a certain position in the rectangular coordinate system of the attack robot; sigma ₂ And w ₂ Respectively defining the width and the amplitude of the Gaussian-like model;

if attacking robot A _i In the time slices of tau, the attacking robot a is in a disadvantage facing the surrounding enemy robot all the time _i The cooperative fitness function f needs to be released from the constraint of the fellow and quit the group ₂ Set to 0, attack robot A _i Act independently to search for better attack locations;

when A is _i Friend with it A _k Is less than a preset threshold value Δ =10| Ψ |, f _Ak Set to 0 to avoid collision of two attacking robots.

Preferably, the attack robot A _i Fitness f of enemy territory ₃ The expression is as follows:

wherein U = (U) ¹ ，U ² ) Is the coordinate of the center position of enemy territory, x = (x) ¹ ，x ² ) Is an independent variable representing the coordinate of a certain position in the rectangular coordinate system of the attack robot; sigma ₃ And w ₃ The width and amplitude of the gaussian-like model are defined separately.

Preferably, the step 4 specifically includes: in each time slice, executing a particle swarm algorithm to optimize the fitness function to obtain an attacking robot A _i Best position p in its current situation _g (ii) a The search space of the particle swarm algorithm is an attacking robot A _i Is the center of the coordinate of (c), R ₁ Is within a circular area of radius.

Preferably, the speed and location update expression of the fitness function is:

wherein, c ₁ ，c ₂ Is a constant acceleration factor, w is an inertial weight,

represents the velocity of the ith particle in the D-dimension, D ∈ [1]，

Indicates that the ith particle is in the d-dimension position>

And &>

Is a random number vector; d is the dimension of the environment, for a two-dimensional confrontation environment D =2, a three-dimensional confrontation environment D =3.

Preferably, the step 4 specifically includes:

each attacking robot is oriented to the calculated optimal position p _g Direction movement if the robot is in the optimal position p _g Is less than the maximum distance of the attack robot moving in a time slice

The attacking robot moves to the optimal position p _g (ii) a Otherwise, go to the optimal position p _g Direction shift>

A distance; in the moving process of the robot, if other robots exist in the warning area of the position of the next time slice, the moving direction of the robot is rotated anticlockwise by a degrees, and after the rotating times exceed the preset times, if the robot still cannot find a proper collision-free path, the robot stays at the current position until the next time slice;

if an enemy of the attacking robot enters the attacking range in the moving process, the nearest enemy is selected to attack; if the situation of the attacking robot is better than that of the enemy robot, the attacking robot can move towards the enemy robot and attack, namely the closer the attacking robot is to the enemy robot, the higher the fitness of the attacking robot is; conversely, if the attacking robot is inferior to the enemy robot in posture, the closer it is to the enemy robot, the lower its fitness;

the attacking robot and the friend of the attacking robot are in a cooperative relationship and form a group to attack the robot group of the enemy, and the closer the attacking robot is to the friend of the attacking robot, the higher the fitness of the attacking robot is.

Compared with the prior art, the invention has the following advantages:

1) The group robot senses the motion of the group robot from the surrounding environment by using the sensor carried by the group robot, and does not depend on a global navigation system; each robot constructs a respective coordinate system, and cooperation and confrontation are realized by acquiring relative coordinates of surrounding intelligent agents without depending on a global positioning system;

2) The fitness function of the robot integrates information of friend, enemy and enemy territory, and the control of the robot in the cooperative and antagonistic environments is realized;

3) Compared with a reinforcement learning method, the distributed control of the swarm robots based on the particle swarm algorithm has the advantages of no need of pre-training and strong expandability.

Drawings

FIG. 1 is a flow chart of a swarm robot control method based on a particle swarm algorithm in a rejection environment.

FIG. 2 is an exemplary diagram of a rectangular coordinate system of an attack robot;

FIG. 3 shows an attack robot A ₁ An example of a fitness model of (a);

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, shall fall within the scope of protection of the present invention.

The embodiment provides a swarm robot attacking method based on a particle swarm optimization in a denial environment, as shown in fig. 1, including the following steps:

step 1, establishing an attack and defense confrontation scene in a refused environment, setting the positions of N attack group robots, the positions of M defense group robots and the position coordinates of enemy territory, setting all robots to have the same attribute and 10 pointsThe blood volume is reduced by 1 when the robot is attacked, and the robot is destroyed when the blood volume of the robot is less than or equal to 0; initial number of particles for initializing particle swarm optimization

Acceleration factor c ₁ And c ₂ Inertial weight w and dimension D of the problem; all robots have the same attribute and have 10-point blood volume, when the robots are attacked, the blood volume is reduced by 1, and when the blood volume of the robots is less than or equal to 0, the robots are destroyed.

Step 2, the attacking robot detects the surrounding environment through a sensor, acquires situation information of the friend robot and the enemy robot, and calculates the position of the enemy territory in real time by utilizing the inertial navigation technology

as shown in FIG. 2, each attacking robot A _i Constructing a rectangular coordinate system by taking the position of the mobile terminal as the origin of coordinates and the moving direction as the positive direction of the horizontal axis; wherein Q _j 、P _k And U is the coordinates of the jth defender, kth attacker and territory respectively, attacking robot A _i Obtaining Q by sensing environment using its sensors _j And P _k ；

The coordinate U of the enemy territory is known when each attacking robot is at the initial position, and the approximate coordinate of the enemy territory is calculated in real time by applying the inertial navigation technology in the moving process of the robots

For each attack robot, the fitness function comprises three parts of information: state information of enemy robots, state information of friend military robots and enemy territory information;

the attack robot A _i Fitness function F _i The calculation formula of (2) is as follows:

F _i ＝f ₁ +f ₂ +f ₃

wherein, f ₁ Is a confrontation fitness f constructed according to the state information of the enemy robot ₂ Is a cooperative fitness f constructed according to the state information of the friend robot ₃ Is the adaptability of enemy to soil.

1) Constructing an confrontation fitness function f of the attack robot ₁

If the situation of the attacking robot is better than that of the enemy robot, the attacking robot moves towards the enemy robot and attacks are initiated. This situation is mapped into the fitness function model, i.e. the closer the attacking robot is to the enemy robot, the higher its fitness. Conversely, if the attacking robot is less dominant than the enemy robot, the closer it is to the enemy robot, the lower its fitness. The confrontation fitness function model is constructed by utilizing Gaussian-like distribution. Confrontation fitness function f of attack robot ₁ Comprises the following steps:

where Ψ is the attacking robot A _i Neighborhood region R ₁ Index f of all enemy robots therein _Bk Is a sub-fitness function generated from the state information of the kth enemy robot.

Is the coordinates of the kth enemy robot; x = (x) ₁ ，x ₂ ) Is an independent variable representing the coordinate of a certain position in a rectangular coordinate system; sigma ₁ And w ₁ The width and amplitude of the gaussian-like model are defined separately.

S (i, k) is the measurement of the ith attack robot A _i Whether the situation of (2) is better than that of the enemy robot B _k The index of (1). The situation of the robot is calculated according to the number of friends and the total blood volume, wherein "-1" and "1" represent A respectively _i In unfavorable and favorable situations; n is a radical of _i And

is A _i Attack range R ₀ Attack robot (including attacker A) _i ) Number of and total blood volume values; m _k And &>

Are respectively B _k Attack range R ₀ Number of enemy robots and total blood volume value. When S (i, k) = -1, f _Bk Is a valley-shaped function. This means that A _i From B _k The farther away, A _i The higher the fitness. When S (i, k) =1, f _Bk Is a peak-shaped function. This means that A _i And B _k The closer to A _i The higher the fitness. S (i, k) determines A _i Is directed to B _k Whether to move or to back.

2) Construction of attack robot A _i Of the cooperative fitness function f ₂

The attacking robot and its friend are in a cooperative relationship and form a group to attack the enemy robot population. This situation is mapped into the fitness function model, i.e. the closer the attacking robot is to its friend, the higher its fitness. Attacking robot A _i Is a cooperative fitness function f ₂ Comprises the following steps:

wherein Φ is A _i Neighborhood region R ₁ Index of all friends in the content. f. of _Ak Is a sub-fitness function generated according to the state information of the kth attack robot, wherein

Is the coordinates of the kth attacker. Notably, according to f ₂ The attacking population may dynamically form subgroups. If A is _i Within a period of tau time, a is always at a disadvantage facing surrounding enemy robots, then a _i It is necessary to escape from the constraints of its partners and exit the population, at which point f will be ₂ Set to 0,ai to act independently to search for a better attack position. When A is _i Friend with it A _k Is less than a threshold value Δ =10| Ψ |, f _Ak Set to 0 to avoid collision of two attacking robots.

3) Constructing a fitness function f according to the position information of enemy territory ₃

The attacking robot aims to enter enemy territory, and the closer the attacking robot is to the enemy territory, the higher the fitness is. Therefore, the fitness function f ₃ Comprises the following steps:

wherein U = (U) ¹ ，U ² ) Is the coordinate of the center position of the enemy territory.

FIG. 3 shows an attacking robot A ₁ An example of a fitness model of (1), where two diamonds represent A ₁ And A ₂ Three triangles are enemy robot B ₁ 、B ₂ And B ₃ The star is the enemy territory T. Wherein A is ₁ And A ₂ Are in cooperative relationship with each other, A ₁ And B ₁ 、B ₂ And B ₃ The relationship between is a resistance relationship. The fitness model is applied to a particle swarm algorithm to optimize and obtain a group robot attack strategy in a rejection environment so as to guide the movement and attack of an attacking robot.

in each time slice, a particle swarm optimization PSO optimization attack robot A is executed _i Best position p in its current situation _g . Since the attacker's distance of movement is limited within a time slice and the environment is dynamically changing, it only needs to be in its neighborhood R ₁ Finding the position with the best fitness.

Therefore, the search space of the particle swarm algorithm PSO is constrained to attack robot a _i Is the center of the coordinate of (c), R ₁ Is within a circular area of radius.

The speed and location update formula for optimizing the fitness function is as follows:

represents the velocity of the ith particle in the D-dimension, D ∈ [1]，

Indicates the position of the i-th particle in dimension d>

And &>

Is a random number vector; d is the dimension of the environment, for a two-dimensional confrontation environment D =2, a three-dimensional confrontation environment D =3;

step 5, the attack robot carries out moving and attack operation;

each robot is directed to the calculated optimal position p _g Direction movement if the robot is in the optimal position p _g Is less than the maximum distance the robot moves within a time slice

The robot moves to the optimal position p _g (ii) a Otherwise to p _g Direction shift->

Distance.

In the moving process, if the robot has enemy army entering the attack range in the moving process, the nearest enemy army attack to the robot is selected. During the moving process of the robot, if other robots exist in the warning area of the position where the next time slice is located, the moving direction of the robot is rotated by 15 degrees anticlockwise, and after a series of rotations (23 times), if the robot still cannot find a proper collision-free path, the robot stays at the current position until the next time slice.

In order to verify the performance of the method in the group robot territory invasion problem more intuitively, the method carries out the antagonism experiment with the following three algorithms applied to the enemy robot group:

(1) Based on a rule algorithm, the enemy robot always moves to the middle point position of the attack robot closest to the territory and the enemy robot to intercept the attack robot.

(2) DPSO attack task assignment algorithm, "Cooperative Multi-task assignment for multiple UAVs," Electronics Optics & controls, vol.24, no.1, pp.46-50, 2017.

(3) SDPSO attack task allocation algorithm, "UAV cooperative multiple-task assistance based on discrete particle timing algorithm," Computer Simulation, vol.35, no.2, pp.22-28, 2018.

The challenge results of the algorithm are shown in table 1.

TABLE 1

It is apparent that the overall odds of the attacking robot group using the PSO-AS method is 100% when the numbers of the attacking robot group and the enemy robot group are the same. When the number of attacking robot groups is only 75% of the enemy robot groups, the proposed method still has a success rate of winning of more than 50%.

While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A swarm robot control method based on a particle swarm algorithm under a rejection environment is characterized by comprising the following steps of:

step 1, establishing an attack and defense confrontation scene in a rejection environment, and initializing parameters of a particle swarm algorithm;

step 4, optimizing the fitness function by utilizing a particle swarm algorithm to obtain the optimal occupation of the attack robot and guide the attack robot to move and attack in the next time slice;

step 5, the attacking robot carries out moving and attacking operations;

step 6, if an attacking robot enters enemy territory, the task is completed; otherwise, judging whether the maximum operation time is reached, if so, failing the task; otherwise, the iteration of the next time slice is carried out in the step 2.

2. The method for controlling group robots based on particle swarm optimization in the denial environment according to claim 1, wherein the attack and defense confrontation scene in the denial environment is established in step 1, and parameters of the particle swarm optimization are initialized, specifically: initializing the positions of N attacking robots, the positions of M defending robots and a GPS positioning coordinate U of enemy territory; initializing initial number of particles in particle swarm optimization

all robots have the same attribute and have P-point blood volume, when the robots are attacked, the blood volume is reduced by P, and when the blood volume of the robots is less than or equal to 0, the robots are destroyed.

3. The swarm robot control method based on the particle swarm algorithm in the denial environment according to claim 1, wherein the step 2 specifically comprises: each attacking robot constructs a coordinate system according to the position and the moving direction of the attacking robot; the method comprises the steps of detecting surrounding environment information through a sensor, obtaining coordinates of a friend robot and an enemy robot, and calculating the position of enemy territory in real time by utilizing an inertial navigation technology

4. The method of claim 1A swarm robot control method based on a particle swarm algorithm in a denial environment is characterized in that in step 3, an attack robot utilizes detected surrounding environment information to construct a fitness function of the attack robot, and the method specifically comprises the following steps: attacking robot A _i Constructing a fitness function F according to the enemy robot state information, the friend robot state information and the enemy territory information _i The expression is as follows:

F _i ＝f ₁ +f ₂ +f ₃

5. The swarm robot control method based on particle swarm optimization in the rejection environment of claim 4, wherein the countervailing fitness function f is ₁ The expression of (a) is:

Is the coordinates of the kth enemy robot; x = (x) ¹ ,x ² ) Is an independent variable representing the coordinates of a certain position in the rectangular coordinate system of the attack robot; sigma ₁ And w ₁ Respectively defining the width and the amplitude of the Gaussian-like model; s (i, k) is the measurement of the ith attack robot A _i Whether the situation of (2) is better than that of the enemy robot B _k Determines the attack machinePerson A _i Is to enemy robot B _k Whether to move or to back, the expression is: />

respectively represent attack robots A _i Attack range R ₀ The number and total blood volume values of all friend robots within; m _k And &>

Respectively enemy robot B _k Attack range R ₀ Number of all enemy robots within and total blood volume value;

when S (i, k) = -1, f _Bk Is a valley-shaped function and represents the attacking robot A _i Robot B away from enemy _k The farther away, the attacking robot A _i The higher the fitness;

when S (i, k) =1, f _Bk Is a peak-shaped function and represents the attacking robot A _i With enemy robot B _k The closer the attacking robot A is _i The higher the fitness.

6. The swarm robot control method based on particle swarm optimization in the denial environment of claim 4, wherein the attacking robot A _i Is a cooperative fitness function f ₂ The expression of (a) is:

Is the coordinates of the kth attacker; x = (x) ¹ ,x ² ) Is an independent variable representing the coordinate of a certain position in the rectangular coordinate system of the attack robot; sigma ₂ And w ₂ Respectively defining the width and the amplitude of the Gaussian-like model;

if attacking robot A _i In the time slice of tau, the robot facing the surrounding enemy is always in a disadvantage, and then the attacking robot A _i The cooperative fitness function f needs to be released from the constraint of the fellow and quit the group ₂ Set to 0, attack robot A _i Act independently to search for better attack locations;

7. The swarm robot control method based on particle swarm optimization in the denial environment as claimed in claim 4, wherein the attacking robot A _i Land-taking fitness f of enemy ₃ The expression is as follows:

wherein U = (U) ¹ ,U ² ) Is the coordinate of the center position of enemy territory, x = (x) ¹ ,x ² ) Is an independent variable representing the coordinate of a certain position in the rectangular coordinate system of the attack robot; sigma ₃ And w ₃ The width and amplitude of the gaussian-like model are defined separately.

8. A denial according to claim 1The swarm robot control method based on the particle swarm algorithm under the environment is characterized in that the step 4 specifically comprises the following steps: in each time slice, executing a particle swarm algorithm to optimize the fitness function to obtain an attacking robot A _i Best position p in its current situation _g (ii) a The search space of the particle swarm algorithm is an attacking robot A _i Is the coordinate of (A) as the center, R ₁ Is within a circular area of radius.

9. The swarm robot control method based on the particle swarm optimization algorithm in the denial environment according to claim 8, wherein the speed and location update expressions of the fitness function are:

represents the velocity of the ith particle in the D-dimension, D ∈ [1]，

Indicates that the ith particle is in the d-dimension position>

And &>

10. The swarm robot control method based on the particle swarm algorithm in the denial environment according to claim 8, wherein the step 4 specifically comprises:

each attacking robot moves to the calculated optimal position p _g Direction movement if the robot is in the optimal position p _g Is less than the maximum distance of the attack robot moving in a time slice

The attacking robot moves to the optimal position p _g (ii) a Otherwise, go to the optimal position p _g Direction shift->