CN114326747A - Multi-target enclosure control method and device for group robots and computer equipment - Google Patents

Multi-target enclosure control method and device for group robots and computer equipment Download PDF

Info

Publication number
CN114326747A
CN114326747A CN202210013119.7A CN202210013119A CN114326747A CN 114326747 A CN114326747 A CN 114326747A CN 202210013119 A CN202210013119 A CN 202210013119A CN 114326747 A CN114326747 A CN 114326747A
Authority
CN
China
Prior art keywords
robot
position information
target
robots
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210013119.7A
Other languages
Chinese (zh)
Other versions
CN114326747B (en
Inventor
潘庆涛
汤俊
老松杨
詹建军
李�浩
秦婉亭
万宇
陈曦
王浩森
赵子鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202210013119.7A priority Critical patent/CN114326747B/en
Publication of CN114326747A publication Critical patent/CN114326747A/en
Application granted granted Critical
Publication of CN114326747B publication Critical patent/CN114326747B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)
  • Feedback Control In General (AREA)

Abstract

The application relates to a multi-target enclosure control method and device for swarm robots and computer equipment. The method comprises the following steps: inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, outputting an enclosure mode and action information of the current robot, and constructing an updating formula for optimizing individual positions and speeds in a population based on an artificial electric field algorithm; according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights; and inputting the optimized influence factors and the optimized weight into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots. By adopting the method, the group trapping efficiency can be improved.

Description

Multi-target enclosure control method and device for group robots and computer equipment
Technical Field
The application relates to the technical field of robot enclosure navigation, in particular to a swarm robot multi-target enclosure control method, a swarm robot multi-target enclosure control device and computer equipment.
Background
A multi-robot system (MRS), also known as a swarm robot, is composed of a large number of small, simple robots (with a definite language: any one has a certain autonomy, limited communication capabilities and computational resources). Therefore, it is necessary to systematize robots in MRS individually, and through close mutual cooperation and cooperation, advantages such as completion of complex tasks, spatial-temporal distribution work, functions, and perceptual distribution that a single robot does not have can be brought. Furthermore, due to the extremely attractive properties of a single robot, such as low cost, high adaptability and high reliability, MRS has been successfully applied in a wide range of fields, including marine exploration, medical services, search and rescue, positioning and navigation, etc. Meanwhile, with the rapid development of MRS theory and application, the cooperative capture of targets by using group robots is becoming a new research direction.
At present, target containment, especially for randomly moving targets, is a very challenging task. This not only requires close cooperation of robots in MRS to track closely targets, but also their perception and adaptation to the surrounding dynamic environment in real time. Therefore, how and better to create a complete and reliable enclosure around the target after it is detected becomes a key and difficult point of this task, which has led to increased attention in the research on cooperative control strategies for group agents. Currently, methods for forming a multi-robot shape structure and pattern can be broadly classified into five categories according to control methods: 1) a virtual structure method; 2) leader-follower method; 3) an artificial potential field method; 4) a behavior-based approach; 5) intelligent algorithm excited by natural life.
Despite the above considerable research on the methods of multi-robot shape construction and pattern formation, self-organizing MRS still has a number of unsolved problems: for example, a strict and complex global coordinate system is needed for acquiring environment information by relying on a large amount of communication between intelligent bodies and behavior control, the intelligent bodies are difficult to cooperate to consider surrounding intelligent bodies or obstacles in the environment, dynamic targets are easy to escape from an enclosure, and the like.
Disclosure of Invention
In view of the above, it is necessary to provide a multi-target enclosure control method, apparatus and computer device for swarm robots, aiming at the above technical problems.
A swarm robot multi-target enclosure control method comprises the following steps:
inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting an enclosure capture mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
establishing an updating formula for optimizing the position and the speed of individuals in the population based on an artificial electric field algorithm;
according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights;
and inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.
In one embodiment, the method further comprises the following steps: inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the trapping mode of the current robot as follows:
P=ωtPtnPnoPo
wherein P represents a trapping mode, PtComponent of the hunting pattern, P, representing a determination of target position informationnRepresenting the component of the trapping pattern, P, determined by the neighbor location informationoAn entrapment mode component representing the determination of the obstacle position information; omegat、ωn and ωoAll represent influencing factors.
In one embodiment, the mobility policy includes: the robot approach or departure strategy comprises a robot approach or departure strategy, a robot low-density motion tendency strategy, a robot high-density motion tendency strategy and a robot reverse direction motion strategy; further comprising: inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the action information of the current robot as follows:
Figure BDA0003458525730000031
wherein ,SiIndicating the position vector of the ith robot, tiUnit vector, n, representing the proximity or distance of the robot to the target strategyiUnit vector representing low-density motion tendency strategy of robot, ciUnit vector, o, representing a high-density motion trend strategy for a robotiUnit vector, r, representing a strategy for counter-directional movement of the robot1-r4Both represent weights.
In one embodiment, the method further comprises the following steps: based on the artificial field algorithm, an updating formula for optimizing the position and the speed of the individual in the population is constructed as follows:
Figure BDA0003458525730000032
Figure BDA0003458525730000033
wherein ,
Figure BDA0003458525730000034
representing the velocity of robot i at time t +1,
Figure BDA0003458525730000035
representing the velocity of the robot i at time t,
Figure BDA0003458525730000036
represents the acceleration to which the ith robot is subjected;
Figure BDA0003458525730000037
indicating the position of robot i at time t +1,
Figure BDA0003458525730000038
indicating the position of robot i at time t.
In one embodiment, the method further comprises the following steps: dividing the enclosure area according to the perception radius of the robot to obtain a plurality of sub-areas, and setting a weight value for each sub-area;
assigning values to the robot, the barrier and the blank area in the surrounding area of the target of the surrounding area;
obtaining the enclosure capturing strength according to the weight of each subarea, the assignment of the robot, the barrier and the blank area;
determining the enclosure occupancy rate according to the angle of the robot for limiting the target;
and constructing an evaluation function for evaluating the surrounding capture effect of the group robots according to the surrounding capture strength and the surrounding capture occupancy rate.
In one embodiment, the method further comprises the following steps: and according to the weight of each subregion, the robot, the barrier and the blank region assignment, obtaining the enclosure capturing strength as follows:
Figure BDA0003458525730000039
wherein λ represents a weight for each sub-region, T represents a robot, an obstacle, and a blank region assignment,
Figure BDA00034585257300000310
representing the maximum value of the trapping intensity.
In one embodiment, the method further comprises the following steps: according to the angle of the robot for limiting the target, determining the enclosure capture occupancy as follows:
Figure BDA0003458525730000041
wherein ,θiRepresenting the angle at which the robot acts to limit the target.
In one embodiment, the merit function includes: a first merit function and a second merit function; further comprising: carrying out average weighting on the enclosure intensity and the enclosure occupancy within a preset time to obtain a first evaluation function;
and carrying out maximum value weighting on the enclosure intensity and the enclosure occupancy within preset time to obtain a second evaluation function.
A swarm robot multi-target enclosure control device, the device comprising:
the capture output module is used for inputting the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting the capture mode and the action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
the updating module is used for constructing an updating formula for optimizing the position and the speed of the individual in the population based on an artificial electric field algorithm;
the optimization module is used for carrying out optimization calculation on the influence factors and the weights according to the updating formula to obtain optimized influence factors and optimized weights;
and the optimization output module is used for inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target enclosure of the group robots.
A computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the following steps when executing the computer program:
inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting an enclosure capture mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
establishing an updating formula for optimizing the position and the speed of individuals in the population based on an artificial electric field algorithm;
according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights;
and inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.
On the basis of a self-adaptive cooperative gene regulation and control network, when a trapping mode is generated in Layer1 of a model, the multi-target trapping control method, the multi-target trapping control device and the computer equipment can fuse and consider various situation information of a target, a neighbor and an obstacle, and optimize the weight of the information of the target, the neighbor and the obstacle according to a task environment so as to generate a surrounding ring more efficiently and accurately. Secondly, in the Layer2 of the model, when the robot movement is guided, the weight relation of four concentration vectors is further measured, so that the robot can form an enclosure more quickly and firmly in an enclosure task.
Drawings
FIG. 1 is a schematic flow chart of a multi-target enclosure control method for group robots in one embodiment;
FIG. 2 is a schematic diagram of a framework for an adaptive cooperative gene regulation network in one embodiment;
FIG. 3 is a schematic structural diagram of the division of the trapping region in another embodiment;
FIG. 4 is a block diagram of a multi-target enclosure control device of the group robot in one embodiment;
FIG. 5 is a diagram illustrating an internal structure of a computer device according to an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
In one embodiment, as shown in fig. 1, a swarm robot multi-target enclosure control method is provided, which includes the following steps:
and 102, inputting the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the capture mode and the action information of the current robot.
The capture mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights.
And 104, constructing an updating formula for optimizing the position and the speed of the individual in the population based on an artificial electric field algorithm.
And 106, performing optimization calculation on the influence factors and the weights according to the updating formula to obtain optimized influence factors and optimized weights.
And 108, inputting the optimization influence factors and the optimization weights into the adaptive cooperative gene regulation and control network so that the adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.
On the basis of a self-adaptive cooperative gene regulation and control network, when a trapping mode is generated in Layer1 of a model, the multi-target trapping control method for the group robots can fuse and consider various situation information of targets, neighbors and obstacles, and optimize the weight of the information of the targets, the neighbors and the obstacles according to a task environment so as to generate a surrounding ring more efficiently and accurately. Secondly, in the Layer2 of the model, when the robot movement is guided, the weight relation of four concentration vectors is further measured, so that the robot can form an enclosure more quickly and firmly in an enclosure task.
In one embodiment, the existing gene regulatory network is improved, and a gene regulatory network Co-GRN with a high synergistic mechanism is provided, and the model of the gene regulatory network is shown in FIG. 2. It contains a total of two key modules. The Layer1 can fuse multi-state potential information of a target, a neighbor and an obstacle, and collaboratively generate a capture mode; layer2 can direct the robot to act more efficiently, adapt to the patterns generated by Layer1 and complete the enclosure task. In the network, robot represents the robot, P represents the protein concentration, and gene represents the gene fragment. Each robot corresponds to a cell containing multiple proteins. This means that each cell has multiple gene segments to control its growth, and the external environment including targets, obstacles, neighbors can cause changes in intracellular protein concentration, ultimately controlling its growth and development. That is, each robot has many gene segments that regulate changes in its internal protein concentration, ultimately controlling the robot's pattern generation and movement.
Co-GRN is designed primarily based on the cell growth mechanism of the organism, which cells can communicate via the medium of protein concentration. Similarly, the robot can acquire information of the surrounding environment through diffusion and perception of protein concentration to guide the robot to move.
When a robot diffuses positional information through protein concentration, its positional protein GiTraditionally, exponential diffusion, but the additional diffusion source protein F addediIs non-exponential diffusion. Gi and FiAre defined as Eq. (1) and Eq. (2), respectively:
Figure BDA0003458525730000071
Figure BDA0003458525730000072
wherein ,PG and PFAre each protein Gi and FiConcentration of (a) (. sigma.)iIs the distance from the diffusion source. The concentration ratio of the two proteins is represented by k1 and k2And (6) determining. Alpha and z are both positive constants which determine the slope and variance of the sigmoid function, respectively. After fusion of the protein, the concentration spread changes almost linearly, see Eq (3):
Figure BDA0003458525730000073
in the proposed Co-GRN model, the swarm robots can acquire the relative position relation with the target, the neighbor and the barrier boundary through the position sensor within a certain communication range. For more efficient robot movement, more accurate containment patterns need to be generated. Thus, the designed Layer1 will fuse multi-state potential information of targets, neighbors, obstacles to generate trapping patterns.
As shown in fig. 2, the target triangle will diffuse the protein concentration information of its position into Layer 1. Obtaining target concentration information in the global environment through the following Eq. (4) and Eq. (5)
Figure BDA0003458525730000074
Figure BDA0003458525730000075
wherein ,PjDenotes the j (th)thConcentration information of individual targets, then P1 tIs the sum of the concentration information of all targets. P1 tWill activate the robots R respectivelyiGene1, Gene2, resulting in protein concentration values
Figure BDA0003458525730000076
And
Figure BDA0003458525730000077
and (4) changing. Followed by
Figure BDA0003458525730000078
And
Figure BDA0003458525730000079
in turn, Gene3 will be activated, resulting in a protein concentration value PtA change in (c). And P istThe trapping mode determined by the target concentration information is expressed by the following specific formula:
Figure BDA00034585257300000710
Figure BDA00034585257300000711
Figure BDA0003458525730000081
in fig. 2, a neighbor robot means a neighbor robot within a robot sensing range. These neighbors will also diffuse the protein concentration information of their location into Layer 1. The concentration information and P of all neighbors can be obtained through Eq. (9) and Eq. (10)1 n[18]:
Figure BDA0003458525730000082
Figure BDA0003458525730000083
wherein ,nnRepresenting the number of all neighbor robots within a robot's perception range. Similarly, after regulation and control of Gene4 and Gene5, a trapping pattern P determined by neighbor concentration information can be finally obtainedn. In different task scenarios, the parameter values in the equation need to be adjusted so as to keep a proper distance between the robot and the neighbor, and the specific formula is as follows:
Figure BDA0003458525730000084
Figure BDA0003458525730000085
Figure BDA0003458525730000086
in the present invention, a special robot is used to simulate an obstacle, and these robots are referred to as virtual robots. Thus, protein concentration information P of the disorder1 oAlso through the regulation of Gene4 and Gene5, and finally generating the information of the concentration of the obstacleThe specific formula of the determined trapping mode is as follows:
Figure BDA0003458525730000087
Figure BDA0003458525730000088
Figure BDA0003458525730000089
in order to control the robot more accurately and take the influence of a target, a neighbor and an obstacle on an enclosure mode into consideration in a fusion manner, a final enclosure mode P is transferred into Layer2 for mode construction after being regulated and controlled by Gene6, the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in a sensing area and the neighbor position information of other robots are input into a pre-constructed adaptive coordinated Gene regulation and control network, and the enclosure mode of the current robot is output as follows:
P=ωtPtnPnoPo
wherein P represents a trapping mode, PtComponent of the hunting pattern, P, representing a determination of target position informationnRepresenting the component of the trapping pattern, P, determined by the neighbor location informationoAn entrapment mode component representing the determination of the obstacle position information; omegat、ωn and ωoAll represent influencing factors.
In the structure of Co-GRN, Layer1 is mainly responsible for the generation of an adaptive mode, and Layer2 accurately guides the movement of the robot according to the generation mode. In the network, the sum of the respective modes of a target, a neighbor and a barrier, namely a final capture mode P, is converted into the sum of concentration vectors through gene regulation, and the vectors guide a robot group to adapt to a generation mode more quickly and complete a capture task better.
When guiding specific actions of the robot, the most critical regulatory Gene is Gene7, and the specific formula is as follows:
Figure BDA0003458525730000091
wherein, i is 1,2, N represents the number of all the trapping robots, and the concentration SiRepresents a robot RiThe position vector of (2). Four parameters r1,r2,r3,r4Respectively, the weight of the corresponding concentration vector.
In Eq. (18), unit concentration vector tiDepending on the target, which directs the robot to approach or move away from the target, the calculation formula is as follows:
Figure BDA0003458525730000092
wherein ,TnIs an object of trapping, GiAnd
Figure BDA0003458525730000093
respectively, the position concentration information of the robot and the target. If the distance of the robot from the task target exceeds the distance limit of the current target
Figure BDA0003458525730000094
The robot is guided close to the target. However, it can be dangerous if too close to the target, resulting in population damage.
In Eq. (18), robot RiThe number of all neighbor robots in the sensing range is NiConcentration vector niIs the concentration sum of all neighboring robot positions, which guides the robot to tend to the area of low concentration, and the calculation formula is as follows:
Figure BDA0003458525730000101
in Eq. (18), concentration vector ciUnder the regulation of Gene8, it directs the robot to move to the position with highest concentration in the global trapping mode if GmaxThe maximum value of the concentration at the position is expressed by the following calculation formula [18 ]]:
Figure BDA0003458525730000102
In Eq. (18), in robot RiThe number of the surrounding neighbor robots is equal to that of neighbors with the distance less than the safety distance
Figure BDA0003458525730000103
Then the concentration vector oiThe robot is guided to move in the opposite direction, and the calculation formula is as follows [18 ]]:
Figure BDA0003458525730000104
Finally, Gene9 is also required for the final trapping pattern GiThe control is carried out to prevent the robot from impacting an obstacle in the moving process, and a specific formula is as follows [18 ]]:
Figure BDA0003458525730000105
In one embodiment, based on the artificial electric field algorithm, an update formula for optimizing the position and the speed of individuals in the population is constructed as follows:
Figure BDA0003458525730000106
Figure BDA0003458525730000107
wherein ,
Figure BDA0003458525730000108
representing the velocity of robot i at time t +1,
Figure BDA0003458525730000109
representing the velocity of the robot i at time t,
Figure BDA00034585257300001010
represents the acceleration to which the ith robot is subjected;
Figure BDA00034585257300001011
indicating the position of robot i at time t +1,
Figure BDA00034585257300001012
indicating the position of robot i at time t.
In particular, Co-GRN with a synergistic mechanism, we can note ωt,ωn,ωo,r1,r2,r3,r4These seven parameters, which greatly affect the final result in Layer1 and Layer2, directly determine the quality of the trapping performance. Therefore, in order to make Co-GRN with cooperative trapping mechanism more adaptive to task environment, we need to optimize these core parameters. And fusing an artificial electric field algorithm with Co-GRN.
The application of the artificial electric field algorithm is described in detail by taking a system with particles as an example. The position of each particle in the search space in d-dimension is first defined according to the following formula:
Figure BDA0003458525730000111
wherein ,
Figure BDA0003458525730000112
representing the coordinates of the ith particle in the d-dimension. At the number of iterations t, the electric field force exerted on particle i by particle j is:
Figure BDA0003458525730000113
wherein ,Qi(t) and Qj(t) are the charge amounts of particle i and particle j, respectively, at the number of iterations t. k (t) is a coulomb constant, with the number of iterations dynamically changing, ε is a very small positive number, Rij(t) is the Euclidean distance between two particles. Thus, the total force exerted on the ith particle is defined by the following equation:
Figure BDA0003458525730000114
according to Newton's law of motion, we can obtain that the acceleration to which the ith particle is subjected is:
Figure BDA0003458525730000115
furthermore, the velocity magnitude of the ith particle at the next moment is equal to the sum of the current acceleration and the current velocity component, and the specific velocity and position updating formula is as follows:
Figure BDA0003458525730000116
Figure BDA0003458525730000117
through repeated iteration of the above process, the optimal solution of the optimization problem, namely the values of seven core parameters in Co-GRN, can be finally obtained.
In one embodiment, the method comprises the steps that an enclosure area is divided into a plurality of sub-areas according to the perception radius of the robot, and a weight is set for each sub-area; assigning values to the robot, the barrier and the blank area in the surrounding area of the target of the surrounding area; obtaining the enclosure capturing strength according to the weight of each subarea, the assignment of the robot, the barrier and the blank area; determining the enclosure occupancy rate according to the angle of the robot for limiting the target; and constructing an evaluation function for evaluating the surrounding capture effect of the group robots according to the surrounding capture strength and the surrounding capture occupancy rate.
Specifically, as shown in fig. 3, the capture area is divided into three sub-areas, which are: the first region1, the second region2 and the third region3 are the first region when the distance between the first region and the target is too close or too far, the capture weight of the first region is recorded as lambda 11 is ═ 1; most suitably, the second region is weighted by λ 22; if the distance is too far away from the target, the area which can not generate the trapping effect is a third area, and the weight is recorded as lambda30. Meanwhile, a capture value T is respectively given to the robot, the barrier and the blank in the surrounding environment of the target1=1,T2=1.5,T3=0。
In one embodiment, the capture strength is obtained according to the weight of each sub-region and the assignment of the robot, the obstacle and the blank region:
Figure BDA0003458525730000121
wherein λ represents a weight for each sub-region, T represents a robot, an obstacle, and a blank region assignment,
Figure BDA0003458525730000122
representing the maximum value of the trapping intensity.
In one embodiment, according to the angle of the robot for limiting the target, the enclosure capture occupancy is determined as follows:
Figure BDA0003458525730000123
wherein ,θiRepresenting the angle at which the robot acts to limit the target.
In one embodiment, the merit function includes: the first evaluation function and the second evaluation function are obtained by carrying out average weighting on the enclosure capturing strength and the enclosure capturing occupancy within preset time; and carrying out maximum value weighting on the trapping strength and the trapping occupancy within the preset time to obtain a second evaluation function.
Specifically, in order to better perform quantitative evaluation on the trapping result, two new evaluation indexes of the fitness function are defined through weighting on the basis of the indexes introduced above. One is an average value index Ave, namely, firstly, aiming at each target, calculating the average value of the capture intensity and the average value of the capture occupancy within a certain time, then averaging the capture intensity and the capture occupancy of the target, and weighting and summing the two average values by 50% respectively to obtain a final average value index; and the other is a maximum value index Max, namely within a certain time, the trapping strength and the trapping occupancy of the target are respectively summed according to the proportion of 50 percent, then the weighted sum value just calculated by the target at each time is summed, then the sum values at different times are compared, and the maximum value is taken as the maximum value index.
It should be understood that, although the steps in the flowchart of fig. 1 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in fig. 1 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternately with other steps or at least a portion of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 4, there is provided a swarm robot multi-target enclosure control device, including: an enclosure output module 402, an update module 404, an optimization module 406, and an optimization output module 408, wherein:
an enclosure output module 402, configured to input position information of the current robot, target position information, obstacle information sent by other robots in the sensing area, and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and output an enclosure mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
an updating module 404, configured to construct an updating formula for optimizing the position and speed of the individual in the population based on an artificial electric field algorithm;
the optimization module 406 is configured to perform optimization calculation on the impact factors and the weights according to the update formula to obtain optimized impact factors and optimized weights;
and an optimization output module 408, configured to input the optimization influence factors and the optimization weights to the adaptive cooperative gene regulation and control network, so that the adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.
For specific limitations of the multi-target capturing control device for the group robot, reference may be made to the above limitations of the multi-target capturing control method for the group robot, and details are not repeated here. All or part of the modules in the multi-target capture control device for the group robots can be realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 5. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to realize the multi-target capture control method for the swarm robots. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 5 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In an embodiment, a computer device is provided, comprising a memory storing a computer program and a processor implementing the steps of the method in the above embodiments when the processor executes the computer program.
In an embodiment, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the method in the above-mentioned embodiments.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. A multi-target enclosure control method for swarm robots is characterized by comprising the following steps:
inputting position information, target position information and obstacle position information of the current robot, obstacle information sent by other robots in a sensing area and neighbor position information of the other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting an enclosure capture mode and action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
establishing an updating formula for optimizing the position and the speed of individuals in the population based on an artificial electric field algorithm;
according to the updating formula, carrying out optimization calculation on the influence factors and the weights to obtain optimized influence factors and optimized weights;
and inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target capture of the group robots.
2. The method according to claim 1, wherein the inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting the capture mode of the current robot comprises:
inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the trapping mode of the current robot as follows:
P=ωtPtnPnoPo
wherein P represents a trapping mode, PtComponent of the hunting pattern, P, representing a determination of target position informationnRepresenting the component of the trapping pattern, P, determined by the neighbor location informationoAn entrapment mode component representing the determination of the obstacle position information; omegat、ωn and ωoAll represent influencing factors.
3. The method of claim 1, wherein the mobility policy comprises: the robot approach or departure strategy comprises a robot approach or departure strategy, a robot low-density motion tendency strategy, a robot high-density motion tendency strategy and a robot reverse direction motion strategy;
the method for inputting position information, target position information, obstacle position information of the current robot, obstacle information sent by other robots in the sensing area and neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting action information of the current robot comprises the following steps:
inputting the position information, the target position information, the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network, and outputting the action information of the current robot as follows:
Figure FDA0003458525720000021
wherein ,SiIndicating the position vector of the ith robot, tiUnit vector, n, representing the proximity or distance of the robot to the target strategyiUnit vector representing low-density motion tendency strategy of robot, ciUnit vector, o, representing a high-density motion trend strategy for a robotiUnit vector, r, representing a strategy for counter-directional movement of the robot1-r4Both represent weights.
4. The method of claim 1, wherein constructing an updated formula that optimizes the location and velocity of individuals in the population based on an artificial electric field algorithm comprises:
based on an artificial electric field algorithm, an updating formula for optimizing the position and the speed of individuals in the population is constructed as follows:
Figure FDA0003458525720000022
Figure FDA0003458525720000023
wherein ,
Figure FDA0003458525720000024
representing the velocity of robot i at time t +1,
Figure FDA0003458525720000025
representing the velocity of the individual i at time t,
Figure FDA0003458525720000026
representing the acceleration to which the ith individual is subjected;
Figure FDA0003458525720000027
representing the position of the individual i at time t +1,
Figure FDA0003458525720000028
representing the position of the individual i at time t.
5. The method according to any one of claims 1 to 4, further comprising:
dividing the enclosure area according to the distance between the robot and the target to obtain a plurality of sub-areas, and setting a weight value for each sub-area;
assigning values to the robot, the barrier and the blank area in the surrounding area of the target of the surrounding area;
obtaining the enclosure capturing strength according to the weight of each subarea, the assignment of the robot, the barrier and the blank area;
determining the enclosure occupancy rate according to the angle of the robot for limiting the target;
and constructing an evaluation function for evaluating the surrounding capture effect of the group robots according to the surrounding capture strength and the surrounding capture occupancy rate.
6. The method of claim 5, wherein obtaining the containment strength according to the weight of each subregion and the robot, obstacle, and blank area assignments comprises:
and according to the weight of each subregion, the robot, the barrier and the blank region assignment, obtaining the enclosure capturing strength as follows:
Figure FDA0003458525720000031
wherein λ represents a weight for each sub-region, T represents a robot, an obstacle, and a blank region assignment,
Figure FDA0003458525720000032
representing the maximum value of the trapping intensity.
7. The method of claim 5, wherein determining the containment occupancy based on the angle at which the robot acts to limit the target comprises:
according to the angle of the robot for limiting the target, determining the enclosure capture occupancy as follows:
Figure FDA0003458525720000033
wherein ,θiRepresenting the angle at which the robot acts to limit the target.
8. The method according to any one of claims 5-7, wherein the merit function comprises: a first merit function and a second merit function;
the method for constructing the evaluation function for evaluating the trapping effect of the group robots according to the trapping strength and the trapping occupancy comprises the following steps:
carrying out average weighting on the enclosure intensity and the enclosure occupancy within a preset time to obtain a first evaluation function;
and carrying out maximum value weighting on the enclosure intensity and the enclosure occupancy within preset time to obtain a second evaluation function.
9. A multi-target enclosure control device for swarm robots is characterized by comprising:
the capture output module is used for inputting the position information, the target position information and the obstacle position information of the current robot, the obstacle information sent by other robots in the sensing area and the neighbor position information of other robots into a pre-constructed adaptive cooperative gene regulation and control network and outputting the capture mode and the action information of the current robot; the trapping mode is obtained by weighting target position information, neighbor position information and obstacle position information through influence factors, and the action information is obtained by weighting each moving strategy through weights;
the updating module is used for constructing an updating formula for optimizing the position and the speed of the individual in the population based on an artificial electric field algorithm;
the optimization module is used for carrying out optimization calculation on the influence factors and the weights according to the updating formula to obtain optimized influence factors and optimized weights;
and the optimization output module is used for inputting the optimization influence factors and the optimization weights into the self-adaptive cooperative gene regulation and control network so that the self-adaptive cooperative gene regulation and control network outputs maneuvering information for target enclosure of the group robots.
10. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor implements the steps of the method of any one of claims 1 to 8 when executing the computer program.
CN202210013119.7A 2022-01-06 2022-01-06 Multi-target capture control method and device for group robot and computer equipment Active CN114326747B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210013119.7A CN114326747B (en) 2022-01-06 2022-01-06 Multi-target capture control method and device for group robot and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210013119.7A CN114326747B (en) 2022-01-06 2022-01-06 Multi-target capture control method and device for group robot and computer equipment

Publications (2)

Publication Number Publication Date
CN114326747A true CN114326747A (en) 2022-04-12
CN114326747B CN114326747B (en) 2023-10-24

Family

ID=81024051

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210013119.7A Active CN114326747B (en) 2022-01-06 2022-01-06 Multi-target capture control method and device for group robot and computer equipment

Country Status (1)

Country Link
CN (1) CN114326747B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115455812A (en) * 2022-08-25 2022-12-09 临涣水务股份有限公司 Water supply pump station optimization method and system
CN115629600A (en) * 2022-08-01 2023-01-20 北方工业大学 Multi-machine cooperative trapping method in complex dynamic security environment based on buffer voronoi diagram

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016045615A1 (en) * 2014-09-25 2016-03-31 科沃斯机器人有限公司 Robot static path planning method
CN112462779A (en) * 2020-11-30 2021-03-09 汕头大学 Group robot dynamic capture control method and system based on gene regulation network
CN112527020A (en) * 2020-11-30 2021-03-19 汕头大学 Target capture control method and system based on group robots
WO2021065501A1 (en) * 2019-09-30 2021-04-08 日本電気株式会社 Passage possibility determining method, passage possibility determining device, and travel route generation system
CN112684700A (en) * 2020-11-30 2021-04-20 汕头大学 Multi-target searching and trapping control method and system for swarm robots
CN113172626A (en) * 2021-04-30 2021-07-27 汕头大学 Intelligent robot group control method based on three-dimensional gene regulation and control network
CN113485340A (en) * 2021-07-12 2021-10-08 汕头大学 Distributed enclosure control method and system for group robots
CN113580129A (en) * 2021-07-19 2021-11-02 中山大学 Multi-target cooperative trapping method, device and medium based on robot

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016045615A1 (en) * 2014-09-25 2016-03-31 科沃斯机器人有限公司 Robot static path planning method
WO2021065501A1 (en) * 2019-09-30 2021-04-08 日本電気株式会社 Passage possibility determining method, passage possibility determining device, and travel route generation system
CN112462779A (en) * 2020-11-30 2021-03-09 汕头大学 Group robot dynamic capture control method and system based on gene regulation network
CN112527020A (en) * 2020-11-30 2021-03-19 汕头大学 Target capture control method and system based on group robots
CN112684700A (en) * 2020-11-30 2021-04-20 汕头大学 Multi-target searching and trapping control method and system for swarm robots
CN113172626A (en) * 2021-04-30 2021-07-27 汕头大学 Intelligent robot group control method based on three-dimensional gene regulation and control network
CN113485340A (en) * 2021-07-12 2021-10-08 汕头大学 Distributed enclosure control method and system for group robots
CN113580129A (en) * 2021-07-19 2021-11-02 中山大学 Multi-target cooperative trapping method, device and medium based on robot

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张红强;吴亮红;周游;章兢;周少武;刘朝华: "复杂环境下群机器人自组织协同多目标围捕", 控制理论与应用, no. 005, pages 1054 - 1062 *
范衠;王诏君;石泽;王琛;王柳;蔡堉伟;黄华兴;宁为博;董朝晖;李文姬;朱贵杰;李恪;李兵;谷敏强;朱晓敏;包卫东: "基于合作自主定位的群体模式自动生成方法", 汕头大学学报(自然科学版), no. 002, pages 14 - 28 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115629600A (en) * 2022-08-01 2023-01-20 北方工业大学 Multi-machine cooperative trapping method in complex dynamic security environment based on buffer voronoi diagram
CN115629600B (en) * 2022-08-01 2023-12-12 北方工业大学 Multi-machine collaborative trapping method based on buffer Wino diagram in complex dynamic security environment
CN115455812A (en) * 2022-08-25 2022-12-09 临涣水务股份有限公司 Water supply pump station optimization method and system

Also Published As

Publication number Publication date
CN114326747B (en) 2023-10-24

Similar Documents

Publication Publication Date Title
Das et al. Multi-robot path planning using improved particle swarm optimization algorithm through novel evolutionary operators
CN114326747A (en) Multi-target enclosure control method and device for group robots and computer equipment
US10589368B2 (en) Machine learning device having function of adjusting welding positions of core in wire electric discharge machine
Ni et al. An improved spinal neural system-based approach for heterogeneous AUVs cooperative hunting
JP5448841B2 (en) Method for computer-aided closed-loop control and / or open-loop control of technical systems, in particular gas turbines
US11897066B2 (en) Simulation apparatus
Kuo et al. A migrant-inspired path planning algorithm for obstacle run using particle swarm optimization, potential field navigation, and fuzzy logic controller
CN113609786A (en) Mobile robot navigation method and device, computer equipment and storage medium
Tan et al. Multi-type task allocation for multiple heterogeneous unmanned surface vehicles (USVs) based on the self-organizing map
CN113778127A (en) Unmanned aerial vehicle reconnaissance path planning method and device based on genetic algorithm
Ordaz-Rivas et al. Autonomous foraging with a pack of robots based on repulsion, attraction and influence
Jiang et al. Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle
Allison et al. Uav collaborative search using probabilistic finite state machines
Veselov et al. Management of Behavior of a Swarm of Robots Applicable to the Tasks of Monitoring a Some Territory
Wu et al. Multi-robot target entrapment using cooperative hierarchical gene regulatory network
Peng et al. Pattern formation in constrained environments: A swarm robot target trapping method
CN113790729B (en) Unmanned overhead traveling crane path planning method and device based on reinforcement learning algorithm
CN111882577B (en) Group obstacle avoidance and consistent traveling method and device based on field-of-view perception
Jain et al. Peregrine preying pattern-based differential evolution
Valdez et al. Nature-Inspired Optimization Algorithms for Neuro-Fuzzy Models in Real-World Control and Robotics Applications.
Blinov et al. Deep q-learning algorithm for solving inverse kinematics of four-link manipulator
Al-Hussaini et al. A policy synthesis-based framework for robot rescue decision-making in multi-robot exploration of disaster sites
Edwards et al. Stochastic nonlinear ensemble modeling and control for robot team environmental monitoring
Rao et al. A trust based navigation control for multi-robot to avoid deadlock in a static environment using improved Krill Herd
da Silva Tchilian et al. Assisted teleoperation for a human-swarm interaction system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant