CN114326747A

CN114326747A - Multi-target enclosure control method and device for group robots and computer equipment

Info

Publication number: CN114326747A
Application number: CN202210013119.7A
Authority: CN
Inventors: 潘庆涛; 汤俊; 老松杨; 詹建军; 李�浩; 秦婉亭; 万宇; 陈曦; 王浩森; 赵子鹏
Original assignee: National University of Defense Technology
Current assignee: National University of Defense Technology
Priority date: 2022-01-06
Filing date: 2022-01-06
Publication date: 2022-04-12
Anticipated expiration: 2042-01-06
Also published as: CN114326747B

Abstract

The application relates to a multi-target enclosure control method and device for swarm robots and computer equipment. The method comprises the following steps: inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, outputting an enclosure mode and action information of the current robot, and constructing an updating formula for optimizing individual positions and speeds in a population based on an artificial electric field algorithm; according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights; and inputting the optimized influence factors and the optimized weight into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots. By adopting the method, the group trapping efficiency can be improved.

Description

Multi-target enclosure control method and device for group robots and computer equipment

Technical Field

The application relates to the technical field of robot enclosure navigation, in particular to a swarm robot multi-target enclosure control method, a swarm robot multi-target enclosure control device and computer equipment.

Background

A multi-robot system (MRS), also known as a swarm robot, is composed of a large number of small, simple robots (with a definite language: any one has a certain autonomy, limited communication capabilities and computational resources). Therefore, it is necessary to systematize robots in MRS individually, and through close mutual cooperation and cooperation, advantages such as completion of complex tasks, spatial-temporal distribution work, functions, and perceptual distribution that a single robot does not have can be brought. Furthermore, due to the extremely attractive properties of a single robot, such as low cost, high adaptability and high reliability, MRS has been successfully applied in a wide range of fields, including marine exploration, medical services, search and rescue, positioning and navigation, etc. Meanwhile, with the rapid development of MRS theory and application, the cooperative capture of targets by using group robots is becoming a new research direction.

At present, target containment, especially for randomly moving targets, is a very challenging task. This not only requires close cooperation of robots in MRS to track closely targets, but also their perception and adaptation to the surrounding dynamic environment in real time. Therefore, how and better to create a complete and reliable enclosure around the target after it is detected becomes a key and difficult point of this task, which has led to increased attention in the research on cooperative control strategies for group agents. Currently, methods for forming a multi-robot shape structure and pattern can be broadly classified into five categories according to control methods: 1) a virtual structure method; 2) leader-follower method; 3) an artificial potential field method; 4) a behavior-based approach; 5) intelligent algorithm excited by natural life.

Despite the above considerable research on the methods of multi-robot shape construction and pattern formation, self-organizing MRS still has a number of unsolved problems: for example, a strict and complex global coordinate system is needed for acquiring environment information by relying on a large amount of communication between intelligent bodies and behavior control, the intelligent bodies are difficult to cooperate to consider surrounding intelligent bodies or obstacles in the environment, dynamic targets are easy to escape from an enclosure, and the like.

Disclosure of Invention

In view of the above, it is necessary to provide a multi-target enclosure control method, apparatus and computer device for swarm robots, aiming at the above technical problems.

A swarm robot multi-target enclosure control method comprises the following steps:

inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting an enclosure capture mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;

establishing an updating formula for optimizing the position and the speed of individuals in the population based on an artificial electric field algorithm;

according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights;

and inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.

In one embodiment, the method further comprises the following steps: inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the trapping mode of the current robot as follows:

P＝ω_tP^t+ω_nPⁿ+ω_oP^o

wherein P represents a trapping mode, P^tComponent of the hunting pattern, P, representing a determination of target position informationⁿRepresenting the component of the trapping pattern, P, determined by the neighbor location information^oAn entrapment mode component representing the determination of the obstacle position information; omega_t、ω_n and ω_oAll represent influencing factors.

In one embodiment, the mobility policy includes: the robot approach or departure strategy comprises a robot approach or departure strategy, a robot low-density motion tendency strategy, a robot high-density motion tendency strategy and a robot reverse direction motion strategy; further comprising: inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the action information of the current robot as follows:

wherein ,S_iIndicating the position vector of the ith robot, t_iUnit vector, n, representing the proximity or distance of the robot to the target strategy_iUnit vector representing low-density motion tendency strategy of robot, c_iUnit vector, o, representing a high-density motion trend strategy for a robot_iUnit vector, r, representing a strategy for counter-directional movement of the robot₁-r₄Both represent weights.

In one embodiment, the method further comprises the following steps: based on the artificial field algorithm, an updating formula for optimizing the position and the speed of the individual in the population is constructed as follows:

wherein ,

representing the velocity of robot i at time t +1,

representing the velocity of the robot i at time t,

represents the acceleration to which the ith robot is subjected;

indicating the position of robot i at time t +1,

indicating the position of robot i at time t.

In one embodiment, the method further comprises the following steps: dividing the enclosure area according to the perception radius of the robot to obtain a plurality of sub-areas, and setting a weight value for each sub-area;

assigning values to the robot, the barrier and the blank area in the surrounding area of the target of the surrounding area;

obtaining the enclosure capturing strength according to the weight of each subarea, the assignment of the robot, the barrier and the blank area;

determining the enclosure occupancy rate according to the angle of the robot for limiting the target;

and constructing an evaluation function for evaluating the surrounding capture effect of the group robots according to the surrounding capture strength and the surrounding capture occupancy rate.

In one embodiment, the method further comprises the following steps: and according to the weight of each subregion, the robot, the barrier and the blank region assignment, obtaining the enclosure capturing strength as follows:

wherein λ represents a weight for each sub-region, T represents a robot, an obstacle, and a blank region assignment,

representing the maximum value of the trapping intensity.

In one embodiment, the method further comprises the following steps: according to the angle of the robot for limiting the target, determining the enclosure capture occupancy as follows:

wherein ,θ_iRepresenting the angle at which the robot acts to limit the target.

In one embodiment, the merit function includes: a first merit function and a second merit function; further comprising: carrying out average weighting on the enclosure intensity and the enclosure occupancy within a preset time to obtain a first evaluation function;

and carrying out maximum value weighting on the enclosure intensity and the enclosure occupancy within preset time to obtain a second evaluation function.

A swarm robot multi-target enclosure control device, the device comprising:

the capture output module is used for inputting the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting the capture mode and the action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;

the updating module is used for constructing an updating formula for optimizing the position and the speed of the individual in the population based on an artificial electric field algorithm;

the optimization module is used for carrying out optimization calculation on the influence factors and the weights according to the updating formula to obtain optimized influence factors and optimized weights;

and the optimization output module is used for inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target enclosure of the group robots.

A computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the following steps when executing the computer program:

On the basis of a self-adaptive cooperative gene regulation and control network, when a trapping mode is generated in Layer1 of a model, the multi-target trapping control method, the multi-target trapping control device and the computer equipment can fuse and consider various situation information of a target, a neighbor and an obstacle, and optimize the weight of the information of the target, the neighbor and the obstacle according to a task environment so as to generate a surrounding ring more efficiently and accurately. Secondly, in the Layer2 of the model, when the robot movement is guided, the weight relation of four concentration vectors is further measured, so that the robot can form an enclosure more quickly and firmly in an enclosure task.

Drawings

FIG. 1 is a schematic flow chart of a multi-target enclosure control method for group robots in one embodiment;

FIG. 2 is a schematic diagram of a framework for an adaptive cooperative gene regulation network in one embodiment;

FIG. 3 is a schematic structural diagram of the division of the trapping region in another embodiment;

FIG. 4 is a block diagram of a multi-target enclosure control device of the group robot in one embodiment;

FIG. 5 is a diagram illustrating an internal structure of a computer device according to an embodiment.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

In one embodiment, as shown in fig. 1, a swarm robot multi-target enclosure control method is provided, which includes the following steps:

and 102, inputting the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the capture mode and the action information of the current robot.

The capture mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights.

And 104, constructing an updating formula for optimizing the position and the speed of the individual in the population based on an artificial electric field algorithm.

And 106, performing optimization calculation on the influence factors and the weights according to the updating formula to obtain optimized influence factors and optimized weights.

And 108, inputting the optimization influence factors and the optimization weights into the adaptive cooperative gene regulation and control network so that the adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.

On the basis of a self-adaptive cooperative gene regulation and control network, when a trapping mode is generated in Layer1 of a model, the multi-target trapping control method for the group robots can fuse and consider various situation information of targets, neighbors and obstacles, and optimize the weight of the information of the targets, the neighbors and the obstacles according to a task environment so as to generate a surrounding ring more efficiently and accurately. Secondly, in the Layer2 of the model, when the robot movement is guided, the weight relation of four concentration vectors is further measured, so that the robot can form an enclosure more quickly and firmly in an enclosure task.

In one embodiment, the existing gene regulatory network is improved, and a gene regulatory network Co-GRN with a high synergistic mechanism is provided, and the model of the gene regulatory network is shown in FIG. 2. It contains a total of two key modules. The Layer1 can fuse multi-state potential information of a target, a neighbor and an obstacle, and collaboratively generate a capture mode; layer2 can direct the robot to act more efficiently, adapt to the patterns generated by Layer1 and complete the enclosure task. In the network, robot represents the robot, P represents the protein concentration, and gene represents the gene fragment. Each robot corresponds to a cell containing multiple proteins. This means that each cell has multiple gene segments to control its growth, and the external environment including targets, obstacles, neighbors can cause changes in intracellular protein concentration, ultimately controlling its growth and development. That is, each robot has many gene segments that regulate changes in its internal protein concentration, ultimately controlling the robot's pattern generation and movement.

Co-GRN is designed primarily based on the cell growth mechanism of the organism, which cells can communicate via the medium of protein concentration. Similarly, the robot can acquire information of the surrounding environment through diffusion and perception of protein concentration to guide the robot to move.

When a robot diffuses positional information through protein concentration, its positional protein G_iTraditionally, exponential diffusion, but the additional diffusion source protein F added_iIs non-exponential diffusion. G_i and F_iAre defined as Eq. (1) and Eq. (2), respectively:

wherein ,P_G and P_FAre each protein G_i and F_iConcentration of (a) (. sigma.)_iIs the distance from the diffusion source. The concentration ratio of the two proteins is represented by k₁ and k₂And (6) determining. Alpha and z are both positive constants which determine the slope and variance of the sigmoid function, respectively. After fusion of the protein, the concentration spread changes almost linearly, see Eq (3):

in the proposed Co-GRN model, the swarm robots can acquire the relative position relation with the target, the neighbor and the barrier boundary through the position sensor within a certain communication range. For more efficient robot movement, more accurate containment patterns need to be generated. Thus, the designed Layer1 will fuse multi-state potential information of targets, neighbors, obstacles to generate trapping patterns.

As shown in fig. 2, the target triangle will diffuse the protein concentration information of its position into Layer 1. Obtaining target concentration information in the global environment through the following Eq. (4) and Eq. (5)

wherein ,P_jDenotes the j (th)^thConcentration information of individual targets, then P₁ ^tIs the sum of the concentration information of all targets. P₁ ^tWill activate the robots R respectively_iGene1, Gene2, resulting in protein concentration values

And

and (4) changing. Followed by

And

in turn, Gene3 will be activated, resulting in a protein concentration value P^tA change in (c). And P is^tThe trapping mode determined by the target concentration information is expressed by the following specific formula:

in fig. 2, a neighbor robot means a neighbor robot within a robot sensing range. These neighbors will also diffuse the protein concentration information of their location into Layer 1. The concentration information and P of all neighbors can be obtained through Eq. (9) and Eq. (10)₁ ⁿ[18]：

wherein ,n_nRepresenting the number of all neighbor robots within a robot's perception range. Similarly, after regulation and control of Gene4 and Gene5, a trapping pattern P determined by neighbor concentration information can be finally obtainedⁿ. In different task scenarios, the parameter values in the equation need to be adjusted so as to keep a proper distance between the robot and the neighbor, and the specific formula is as follows:

in the present invention, a special robot is used to simulate an obstacle, and these robots are referred to as virtual robots. Thus, protein concentration information P of the disorder₁ ^oAlso through the regulation of Gene4 and Gene5, and finally generating the information of the concentration of the obstacleThe specific formula of the determined trapping mode is as follows:

in order to control the robot more accurately and take the influence of a target, a neighbor and an obstacle on an enclosure mode into consideration in a fusion manner, a final enclosure mode P is transferred into Layer2 for mode construction after being regulated and controlled by Gene6, the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in a sensing area and the neighbor position information of other robots are input into a pre-constructed adaptive coordinated Gene regulation and control network, and the enclosure mode of the current robot is output as follows:

P＝ω_tP^t+ω_nPⁿ+ω_oP^o

In the structure of Co-GRN, Layer1 is mainly responsible for the generation of an adaptive mode, and Layer2 accurately guides the movement of the robot according to the generation mode. In the network, the sum of the respective modes of a target, a neighbor and a barrier, namely a final capture mode P, is converted into the sum of concentration vectors through gene regulation, and the vectors guide a robot group to adapt to a generation mode more quickly and complete a capture task better.

When guiding specific actions of the robot, the most critical regulatory Gene is Gene7, and the specific formula is as follows:

wherein, i is 1,2, N represents the number of all the trapping robots, and the concentration S_iRepresents a robot R_iThe position vector of (2). Four parameters r₁，r₂，r₃，r₄Respectively, the weight of the corresponding concentration vector.

In Eq. (18), unit concentration vector t_iDepending on the target, which directs the robot to approach or move away from the target, the calculation formula is as follows:

wherein ,T_nIs an object of trapping, G_iAnd

respectively, the position concentration information of the robot and the target. If the distance of the robot from the task target exceeds the distance limit of the current target

The robot is guided close to the target. However, it can be dangerous if too close to the target, resulting in population damage.

In Eq. (18), robot R_iThe number of all neighbor robots in the sensing range is N_iConcentration vector n_iIs the concentration sum of all neighboring robot positions, which guides the robot to tend to the area of low concentration, and the calculation formula is as follows:

in Eq. (18), concentration vector c_iUnder the regulation of Gene8, it directs the robot to move to the position with highest concentration in the global trapping mode if G_maxThe maximum value of the concentration at the position is expressed by the following calculation formula [18 ]]：

In Eq. (18), in robot R_iThe number of the surrounding neighbor robots is equal to that of neighbors with the distance less than the safety distance

Then the concentration vector o_iThe robot is guided to move in the opposite direction, and the calculation formula is as follows [18 ]]：

Finally, Gene9 is also required for the final trapping pattern G_iThe control is carried out to prevent the robot from impacting an obstacle in the moving process, and a specific formula is as follows [18 ]]：

In one embodiment, based on the artificial electric field algorithm, an update formula for optimizing the position and the speed of individuals in the population is constructed as follows:

wherein ,

representing the velocity of robot i at time t +1,

representing the velocity of the robot i at time t,

represents the acceleration to which the ith robot is subjected;

indicating the position of robot i at time t +1,

indicating the position of robot i at time t.

In particular, Co-GRN with a synergistic mechanism, we can note ω_t，ω_n，ω_o，r₁，r₂，r₃，r₄These seven parameters, which greatly affect the final result in Layer1 and Layer2, directly determine the quality of the trapping performance. Therefore, in order to make Co-GRN with cooperative trapping mechanism more adaptive to task environment, we need to optimize these core parameters. And fusing an artificial electric field algorithm with Co-GRN.

The application of the artificial electric field algorithm is described in detail by taking a system with particles as an example. The position of each particle in the search space in d-dimension is first defined according to the following formula:

wherein ,

representing the coordinates of the ith particle in the d-dimension. At the number of iterations t, the electric field force exerted on particle i by particle j is:

wherein ,Q_i(t) and Q_j(t) are the charge amounts of particle i and particle j, respectively, at the number of iterations t. k (t) is a coulomb constant, with the number of iterations dynamically changing, ε is a very small positive number, R_ij(t) is the Euclidean distance between two particles. Thus, the total force exerted on the ith particle is defined by the following equation:

according to Newton's law of motion, we can obtain that the acceleration to which the ith particle is subjected is:

furthermore, the velocity magnitude of the ith particle at the next moment is equal to the sum of the current acceleration and the current velocity component, and the specific velocity and position updating formula is as follows:

through repeated iteration of the above process, the optimal solution of the optimization problem, namely the values of seven core parameters in Co-GRN, can be finally obtained.

In one embodiment, the method comprises the steps that an enclosure area is divided into a plurality of sub-areas according to the perception radius of the robot, and a weight is set for each sub-area; assigning values to the robot, the barrier and the blank area in the surrounding area of the target of the surrounding area; obtaining the enclosure capturing strength according to the weight of each subarea, the assignment of the robot, the barrier and the blank area; determining the enclosure occupancy rate according to the angle of the robot for limiting the target; and constructing an evaluation function for evaluating the surrounding capture effect of the group robots according to the surrounding capture strength and the surrounding capture occupancy rate.

Specifically, as shown in fig. 3, the capture area is divided into three sub-areas, which are: the first region1, the second region2 and the third region3 are the first region when the distance between the first region and the target is too close or too far, the capture weight of the first region is recorded as lambda ₁1 is ═ 1; most suitably, the second region is weighted by λ ₂2; if the distance is too far away from the target, the area which can not generate the trapping effect is a third area, and the weight is recorded as lambda₃0. Meanwhile, a capture value T is respectively given to the robot, the barrier and the blank in the surrounding environment of the target₁＝1，T₂＝1.5，T₃＝0。

In one embodiment, the capture strength is obtained according to the weight of each sub-region and the assignment of the robot, the obstacle and the blank region:

representing the maximum value of the trapping intensity.

In one embodiment, according to the angle of the robot for limiting the target, the enclosure capture occupancy is determined as follows:

In one embodiment, the merit function includes: the first evaluation function and the second evaluation function are obtained by carrying out average weighting on the enclosure capturing strength and the enclosure capturing occupancy within preset time; and carrying out maximum value weighting on the trapping strength and the trapping occupancy within the preset time to obtain a second evaluation function.

Specifically, in order to better perform quantitative evaluation on the trapping result, two new evaluation indexes of the fitness function are defined through weighting on the basis of the indexes introduced above. One is an average value index Ave, namely, firstly, aiming at each target, calculating the average value of the capture intensity and the average value of the capture occupancy within a certain time, then averaging the capture intensity and the capture occupancy of the target, and weighting and summing the two average values by 50% respectively to obtain a final average value index; and the other is a maximum value index Max, namely within a certain time, the trapping strength and the trapping occupancy of the target are respectively summed according to the proportion of 50 percent, then the weighted sum value just calculated by the target at each time is summed, then the sum values at different times are compared, and the maximum value is taken as the maximum value index.

It should be understood that, although the steps in the flowchart of fig. 1 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in fig. 1 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternately with other steps or at least a portion of the sub-steps or stages of other steps.

In one embodiment, as shown in fig. 4, there is provided a swarm robot multi-target enclosure control device, including: an enclosure output module 402, an update module 404, an optimization module 406, and an optimization output module 408, wherein:

an enclosure output module 402, configured to input position information of the current robot, target position information, obstacle information sent by other robots in the sensing area, and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and output an enclosure mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;

an updating module 404, configured to construct an updating formula for optimizing the position and speed of the individual in the population based on an artificial electric field algorithm;

the optimization module 406 is configured to perform optimization calculation on the impact factors and the weights according to the update formula to obtain optimized impact factors and optimized weights;

and an optimization output module 408, configured to input the optimization influence factors and the optimization weights to the adaptive cooperative gene regulation and control network, so that the adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.

For specific limitations of the multi-target capturing control device for the group robot, reference may be made to the above limitations of the multi-target capturing control method for the group robot, and details are not repeated here. All or part of the modules in the multi-target capture control device for the group robots can be realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.

In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 5. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to realize the multi-target capture control method for the swarm robots. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.

Those skilled in the art will appreciate that the architecture shown in fig. 5 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.

In an embodiment, a computer device is provided, comprising a memory storing a computer program and a processor implementing the steps of the method in the above embodiments when the processor executes the computer program.

In an embodiment, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the method in the above-mentioned embodiments.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims

1. A multi-target enclosure control method for swarm robots is characterized by comprising the following steps:

2. The method according to claim 1, wherein the inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting the capture mode of the current robot comprises:

inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the trapping mode of the current robot as follows:

P＝ω_tP^t+ω_nPⁿ+ω_oP^o

3. The method of claim 1, wherein the mobility policy comprises: the robot approach or departure strategy comprises a robot approach or departure strategy, a robot low-density motion tendency strategy, a robot high-density motion tendency strategy and a robot reverse direction motion strategy;

the method for inputting position information, target position information, obstacle position information of the current robot, obstacle information sent by other robots in the sensing area and neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting action information of the current robot comprises the following steps:

inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the action information of the current robot as follows:

4. The method of claim 1, wherein constructing an updated formula that optimizes the location and velocity of individuals in the population based on an artificial electric field algorithm comprises:

based on an artificial electric field algorithm, an updating formula for optimizing the position and the speed of individuals in the population is constructed as follows:

wherein ,

representing the velocity of robot i at time t +1,

representing the velocity of the individual i at time t,

representing the acceleration to which the ith individual is subjected;

representing the position of the individual i at time t +1,

representing the position of the individual i at time t.

5. The method according to any one of claims 1 to 4, further comprising:

dividing the enclosure area according to the distance between the robot and the target to obtain a plurality of sub-areas, and setting a weight value for each sub-area;

6. The method of claim 5, wherein obtaining the containment strength according to the weight of each subregion and the robot, obstacle, and blank area assignments comprises:

and according to the weight of each subregion, the robot, the barrier and the blank region assignment, obtaining the enclosure capturing strength as follows:

representing the maximum value of the trapping intensity.

7. The method of claim 5, wherein determining the containment occupancy based on the angle at which the robot acts to limit the target comprises:

according to the angle of the robot for limiting the target, determining the enclosure capture occupancy as follows:

8. The method according to any one of claims 5-7, wherein the merit function comprises: a first merit function and a second merit function;

the method for constructing the evaluation function for evaluating the trapping effect of the group robots according to the trapping strength and the trapping occupancy comprises the following steps:

carrying out average weighting on the enclosure intensity and the enclosure occupancy within a preset time to obtain a first evaluation function;

9. A multi-target enclosure control device for swarm robots is characterized by comprising:

10. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor implements the steps of the method of any one of claims 1 to 8 when executing the computer program.