CN114692397A - Cluster distributed capture method based on multi-mechanism combination strategy - Google Patents
Cluster distributed capture method based on multi-mechanism combination strategy Download PDFInfo
- Publication number
- CN114692397A CN114692397A CN202210256828.8A CN202210256828A CN114692397A CN 114692397 A CN114692397 A CN 114692397A CN 202210256828 A CN202210256828 A CN 202210256828A CN 114692397 A CN114692397 A CN 114692397A
- Authority
- CN
- China
- Prior art keywords
- escaper
- chaser
- max
- strategy
- person
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 75
- 239000002245 particle Substances 0.000 claims abstract description 40
- 238000005457 optimization Methods 0.000 claims abstract description 35
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 33
- 230000008569 process Effects 0.000 claims abstract description 21
- 238000004891 communication Methods 0.000 claims abstract description 17
- 230000007246 mechanism Effects 0.000 claims description 120
- 230000006870 function Effects 0.000 claims description 59
- 239000013598 vector Substances 0.000 claims description 41
- 230000015572 biosynthetic process Effects 0.000 claims description 33
- 230000009467 reduction Effects 0.000 claims description 30
- 230000006399 behavior Effects 0.000 claims description 15
- 230000003542 behavioural effect Effects 0.000 claims description 14
- 230000008602 contraction Effects 0.000 claims description 13
- 230000008447 perception Effects 0.000 claims description 10
- 238000004364 calculation method Methods 0.000 claims description 9
- 239000011159 matrix material Substances 0.000 claims description 9
- 230000000694 effects Effects 0.000 claims description 7
- 238000009826 distribution Methods 0.000 claims description 4
- 239000003795 chemical substances by application Substances 0.000 claims description 3
- 230000008030 elimination Effects 0.000 claims description 3
- 238000003379 elimination reaction Methods 0.000 claims description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 3
- VWDWKYIASSYTQR-UHFFFAOYSA-N sodium nitrate Chemical compound [Na+].[O-][N+]([O-])=O VWDWKYIASSYTQR-UHFFFAOYSA-N 0.000 claims description 2
- 238000009795 derivation Methods 0.000 abstract description 5
- 238000012549 training Methods 0.000 abstract description 5
- 238000012545 processing Methods 0.000 abstract description 4
- 238000004088 simulation Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 10
- 238000004590 computer program Methods 0.000 description 8
- 238000003860 storage Methods 0.000 description 7
- 230000009471 action Effects 0.000 description 5
- 238000011156 evaluation Methods 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000004140 cleaning Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002787 reinforcement Effects 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2111/00—Details relating to CAD techniques
- G06F2111/06—Multi-objective optimisation, e.g. Pareto optimisation using simulated annealing [SA], ant colony algorithms or genetic algorithms [GA]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a cluster distributed capture method based on a multi-mechanism combination strategy, which comprises the steps of constructing an confrontation scene and a kinematics model of a chaser and an escaper, constructing a multi-mechanism capture strategy of the chaser according to the confrontation scene and the kinematics model, constructing a multi-target optimization function according to the multi-mechanism capture strategy of the chaser, introducing an analytic hierarchy process to distribute weights for the multi-target optimization function, solving the multi-target optimization function by using a particle swarm algorithm, outputting a solution result as the strategy of the chaser, and the like. The method has simple and distinct behavior strategies, does not need complex modeling derivation, has simple and convenient algorithm, does not need complex hyper-parameter adjustment and long-time training process, and has rapid deployment capability; in the case of limited communication and processing capabilities, task capabilities better than centralized control may be exerted; the constructed model is fast in solving speed under the resolving of the particle swarm algorithm, and the real-time requirement can be met. The invention is widely applied to the technical field of intelligent agent pursuit and escape control.
Description
Technical Field
The invention relates to the technical field of intelligent agent pursuit and escape control, in particular to a cluster distributed capture method based on a multi-mechanism combination strategy.
Background
Tasks such as unmanned aerial vehicle group search and rescue tasks, contaminated target object cleaning tasks, missile or satellite interception tasks and the like can correspond to the pursuit model, for example, an unmanned aerial vehicle in the unmanned aerial vehicle group search and rescue tasks is equivalent to a pursuit catcher, a searched and rescued object is equivalent to an escaper, an intercepted missile or satellite is equivalent to an escaper, and a missile which is launched out for interception is equivalent to a pursuit catcher, so that a good pursuit or catching algorithm is designed, and the completion efficiency of the tasks is favorably improved, such as the unmanned aerial vehicle group search and rescue efficiency, the contaminated target object cleaning efficiency and the missile or satellite interception efficiency.
At present, most of research on the pursuit escape problem is concentrated on a small number of individuals, namely one individual and two individuals, but the pursuit escape problem in reality generally comprises a plurality of individuals, for example, a plurality of unmanned aerial vehicle search and rescue missing persons are dispatched, and a plurality of pursuits and escapers are involved, at the moment, the research on the pursuit escape problem is easy to fall into a multidimensional trap, and the corresponding analysis countermeasure is difficult to solve in the prior art.
Currently, methods such as dynamic programming, gradient algorithm, reinforcement learning, etc. are used to obtain numerical solutions or approximate solutions of multiple body pursuit problems, and methods based on reinforcement learning are gradually attracting attention in the research of pursuit problems, and intelligent bodies learn to high-yield strategies using algorithms such as Deep Q-network (DQN), Fuzzy Actor Criticality Learning (FACL), Deep Deterministic Policy Gradient (DDPG), and double delayed Deep deterministic policy gradient (TD 3). However, applying the above-mentioned technology related to artificial intelligence will face complex modeling derivation, hyper-parameter tuning, and long-time training process, and have high requirements on hardware performance such as data processing capability, communication delay, bandwidth, and the like, so that the problems of poor real-time performance, slow deployment, and the like easily occur.
Disclosure of Invention
Aiming at least one technical problem of poor real-time performance, slow deployment and the like of the existing escape problem solving technology, the invention aims to provide a cluster distributed capture method based on a multi-mechanism combination strategy, which comprises the following steps:
constructing a confrontation scene of a chaser and an escaper; the chaser and the escaper are both intelligent agents;
establishing a kinematic model of the chaser and the escaper;
constructing a multi-mechanism capture strategy of a chaser according to the confrontation scene and the kinematics model;
constructing a multi-objective optimization function according to the multi-mechanism capture strategy of the chaser, and introducing an analytic hierarchy process to distribute weights for the multi-objective optimization function;
and solving the multi-objective optimization function by using a particle swarm algorithm, and outputting a solving result as a strategy of the chaser.
Further, the cluster distributed capture method based on the multi-mechanism combination strategy further includes:
constructing an optimized model of an escaper by an artificial potential field method;
and solving the optimized model of the escaper by using a particle swarm algorithm to obtain a solution result as the strategy output of the escaper.
Further, the confrontation scenario includes:
catch up person P i1,2,3,.., N, escaper E, starting point (x)start,ystart) Target point (x)target,ytarget) And a countermeasure area ΩA;
The maximum speed v of the chaserp,maxThe maximum velocity of the escaper is ve,maxBoth running at maximum speed and having vp,max<ve,max;
The number of the chaser individuals is N, the number of the escaper individuals is 1, and N is more than 1;
the perception radius of the chaser is rp,maxRadius of perception of escaper is re,maxHas r ofp,max=re,maxAnd both the pursuit and the evasion parties can acquire the position coordinates and the speed vectors of the individuals of the opposite party in the self-perception range through perception;
the most effective is the chasing oneLarge angular velocity limit omegap,maxThe maximum angular velocity of the escaper is limited to ωe,maxHas omegap,max=ωe,max;
The chaser has communication distance limitation with r as the communication radiusp,comHas r ofp,com=2·rp,maxAnd the chaser can mutually acquire position coordinates and speed vectors with neighbors in a communication range;
the catching radius of the chaser is rp,capThe warning radius of the escaper is ralert。
Further, the kinematic model includes:
wherein x ispi=(xpi,ypi) To catch person PiCoordinate of (v)piTo catch person PiSpeed of (e), ωpiTo catch person PiThe angular velocity of (a) of (b),to catch person PiYaw angle of, chasing person PiIs expressed asvpi∈[0,vp,max],|ωpi|≤ωp,max;xe=(xe,ye) As the coordinate of the escaper E, veSpeed of the escaper E, ωeThe angular velocity of the escaper E, phi the yaw angle of the escaper E, and the state quantity of the escaper E is represented as Xe={xe,ye,φ},ve∈[0,ve,max],|ωe|≤ωe,max。
Further, the pursuit multi-mechanism catching strategy comprises a core mechanism and an auxiliary mechanism; the core mechanism includes an angle reduction mechanism, a distance reduction mechanism, and a structure contraction mechanism, and the auxiliary mechanism includes a collision mechanism, an overlay mechanism, and an edge mechanism.
Further, the angle reduction mechanism includes:
behavior trend function Vθ(Xe,xpi)=minθepi;
Wherein, thetaepiThe included angle between the connecting line of the chaser and the escaper and the speed direction of the escaper is formed;
wherein upsilon isee'Is the velocity direction vector of the escaper, upsilonepiConnecting line vector for both chaser and escaperVelocity of (d);
the distance reduction mechanism comprises:
Wherein d isepi=|xe'-xpi||2To catch person PiThe distance to the predicted position E' of the escaper at the next moment;
the structural contraction mechanism comprises:
Wherein,to catch person PgState quantity of (P)gTo be able to react with PiNeighbor individuals, r, of escapes are simultaneously perceivedkThe radius of the circumscribed circle of the kth triangle formed by any three chasers,is a radius vector, KrIs the total number of circumscribed circles, and E (r) and σ (r) are the mean and standard deviation, respectively, of the radii of the circumscribed circles, where r is the mean and standard deviationkThe calculation method is as follows:
wherein, ak、bk、ckThree sides of the kth triangle, AkIs an edge akThe internal angles i, P and q are three vertexes P of a triangle respectivelyi、Pp、PqSubscript of (1), NsIs PgTotal number of individuals of (S)kIs the area of the triangle, CkThe perimeter of the triangle.
Further, the collision mechanism comprises:
Wherein d iscolFor a safe distance, dig=||xpi-xpg||2To catch person PiAnd PgDistance of (m)cIs in the power of the day;
the overlay mechanism includes:
Wherein,in order to be a repulsive force,in order to have a strong attraction force,is weak attractive force, xifleTo determine the character, xi, for attractive repulsive forcesswMu as a strong or weak attraction determinerrepScaling factor, mu, for repulsive forcesattScaling factor for attractive force, dfleIs the formation spacing;
in the coverage mechanism, the stable coverage structure is that a chaser cluster takes a regular triangle as a basic coverage unit to contrast an area omegaAUniform coverage is performed at known ΩAArea S ofAAnd, in case one of the formation spacing or the number of individual clusters is known, another parameter is calculated by the following formula:
wherein, FsumIn order to cover the required number of faces,to get the integer operator, SAIs omegaAArea of (S)ΔIs the area of the base unit for area coverage;
the edge mechanism includes:
the potential collaborators are judged by the following calculation:
wherein, thetaigTo catch person PiAnd PgAngle v based on escaper EepgTo catch person PgVector of line connecting with escaper E Is a vector vepiAnd upsilonee'The outer product of the two phases is,is vector upsilonepiAnd upsilonepiThe outer product between; for PiIn other words, e.g. thetaig>θepi+ pi/2 andwhen is, PgIs PiFor P at this timegIn other words, e.g. thetaig<N andwhen is, PiIs also PgPotential collaborators.
Further, the multi-objective optimization function includes:
wherein,to catch person PiThe optimum yaw angle of the aircraft is determined,is a cooperative trend function, where κ is a scale factor and λ ═ λ (λ)r,λd,λθ,λf) Is a weight vector, V ═ Vr,Vd,Vθ,Vf) As a behavioral trend vector, ΩcoAs a cooperation area, Ωco,tIn the wake region, the flow of the water is controlled,for the trend function based on the elimination of wake effects,is a trend function based on an enhanced distance reduction mechanism, wherein Vd=exp(depi/rp,cap) -1 is a trend function of the behaviour of the enhanced distance reduction mechanism,is a region coverage trend function based on the formation distance;
the introduction of the analytic hierarchy process to distribute the weight to the multi-objective optimization function comprises the following steps:
according to the behavior trend vector V ═ V (V)r,Vd,Vθ,Vf) The sequence of interest of the mechanism involved, lambdac>λr>λθ=λdConstructing a decision matrix U, the elements U of whichijRepresents the influence factor AiIn AjThe degree of importance of;
maximum eigenvalue λ of the calculation matrix UmaxAnd its corresponding feature vector v, as the consistency ratio CR<When 0.1, the constructed matrix U passes consistency check;
after normalizing the feature vector, the obtained trend vector V is equal to (V)r,Vd,Vθ,Vf) The weight distribution scheme of each element is v ═ w1,w2,w3,w4]T=[λc,λr,λθ,λd]T。
Further, the escaper optimization model includes:
wherein phi is*Optimum yaw angle for the escaper, det=||xe-xt||2The distance between the escaper and the target point,the escape person and the pursuit person PjAnd (4) estimating the distance between the positions at the next moment, wherein eta is a relaxation coefficient, and beta is a judgment coefficient of the existence of the chaser.
Further, the particle swarm algorithm comprises:
setting the particle size N, the dimension d of the parameter to be optimized and the maximum iteration number Niter(ii) a The velocity and position are updated by the following formula:
wherein,is the velocity of the d-th dimension in which the particle h is located at time instant t,is the position of the d-th dimension of the particle h at time τ, w is the inertial weight, c1Learning factors for individuals, c2As a group learning factor, r1And r2Random numbers, pbest, all 0 to 1hdIs the local extreme value of the d-th dimension of the particle h, gbestdIs the global extremum for the d-th dimension.
The invention has the beneficial effects that: the cluster distributed capture method based on the multi-mechanism combination strategy in the embodiment realizes a mechanism modularized capture algorithm taking behavior trend as guidance, has simpler and more vivid behavior strategy compared with the existing differential chess playing and geometric method, does not need complex modeling derivation, has simple and convenient algorithm and rapid operation, does not need complex hyper-parameter adjustment and long-time training process, and has rapid deployment capability; the method has higher robustness, adaptivity and expansibility, and can exert task capability superior to centralized control under the condition of limited communication and processing capability; the method has better practical application potential, and the constructed model has higher solving speed under the resolving of the particle swarm algorithm, and can meet the real-time requirement.
Drawings
FIG. 1 is a flowchart of a cluster distributed capture method based on a multi-mechanism combination policy in an embodiment;
FIG. 2 is a detailed flowchart of the confrontation process between the catcher and the escaper in the embodiment;
FIG. 3 is a schematic diagram of the confrontation scenario of both the chaser and the escaper in the embodiment
Fig. 4 is a schematic core mechanism diagram of a cluster distributed capture method based on a multi-mechanism combination policy in the embodiment;
FIG. 5 is a schematic diagram of a wake mechanism of a cluster distributed capture method based on a multi-mechanism combination policy in the embodiment;
FIG. 6 is a diagram illustrating an overlay mechanism and a collaborative enclave mode in an embodiment;
FIG. 7 is a diagram illustrating a driving function of the overlay mechanism in an embodiment;
fig. 8 is an edge mechanism diagram of a cluster distributed capture method based on a multi-mechanism combination policy in the embodiment;
FIG. 9 is a diagram showing a simulation result of the countermeasure process in the embodiment;
FIG. 10 is a graph of convergence efficiency of particle swarm optimization applied in the examples;
FIG. 11(a) is a graph of the success rate based on the speed ratio and the formation pitch obtained by the simulation process in the example;
FIG. 11(b) is a graph of collision rate based on speed ratio and formation pitch obtained by the simulation process in the example;
fig. 12 is a comprehensive performance thermodynamic diagram of the chaser cluster obtained by the simulation process in the embodiment.
Detailed Description
In this embodiment, referring to fig. 1, the cluster distributed capture method based on the multi-mechanism combination policy includes the following steps:
s1, constructing a confrontation scene of a chaser and an escaper;
s2, establishing a kinematics model of the chaser and the escaper;
s3, constructing a multi-mechanism catching strategy of the chaser according to the confrontation scene and the kinematics model;
s4, constructing a multi-objective optimization function according to a multi-mechanism catching strategy of a chaser, and distributing weights for the multi-objective optimization function by introducing an analytic hierarchy process;
s5, solving the multi-target optimization function by using a particle swarm algorithm, and outputting a solving result as a strategy of the chaser;
s6, constructing an escaper optimization model by a manual potential field method;
and S7, solving the optimized model of the escaper by using a particle swarm algorithm, and outputting a solving result as a strategy of the escaper.
In this embodiment, the steps S1-S7 may be executed by a computer. Specifically, all of the steps S1-S7 may be performed by the same computer, where the computer may perform all of the steps S1-S7, may perform only the steps S1-S5, or may perform only the steps S6-S7 after performing the steps S1-S5. Since both the catcher and the escaper are intelligent, the steps related to the catcher in steps S1 to S7 may be executed by the catcher, and the steps related to the escaper in steps S1 to S7 may be executed by the escaper.
In step S1, a confrontation scenario between the chaser and the escaper is constructed. Specifically, the confrontation scene may be formed of the following set parameters:
pursuing person P i1,2,3,.., N, escaper E, starting point (x)start,ystart) Target point (x)target,ytarget) And a countermeasure area ΩA;
The maximum speed v of the chaserp,maxThe maximum velocity of the escaper is ve,maxBoth running at maximum speed and having vp,max<ve,max;
The number of the chaser individuals is N, the number of the escaper individuals is 1, and N is more than 1;
the perception radius of the chaser is rp,maxRadius of perception of escaper is re,maxHas r ofp,max=re,maxAnd both the pursuit and the evasion parties can acquire the position coordinates and the speed vectors of the individuals of the opposite party in the self-perception range through perception;
the maximum angular velocity of the chaser is limited to omegap,maxThe maximum angular velocity of the escaper is limited to ωe,maxHas omegap,max=ωe,max;
The chaser has communication distance limit with r as communication radiusp,comHas r ofp,com=2·rp,maxAnd the chaser can mutually acquire position coordinates and speed vectors with neighbors in a communication range;
the arresting radius of the chaser is rp,capThe warning radius of the escaper is ralert。
In executing step S1, (x) may be generated in the memory of the computerstart,ystart)、(xtarget,ytarget) And the variables can store specific numerical values, so that parameters of the chaser and the escaper are described through a plurality of variables, and a confrontation scene is formed.
In step S2, kinematic models of the chaser and the escaper are established. Specifically, the kinematic model may refer to the motion constraint conditions in the confrontation scenario established by the chaser and the escaper at step S1, including the equation between their positions, velocities, angular velocities, and the like. In this embodiment, step S2 may establish the following kinematic model:
wherein x ispi=(xpi,ypi) To catch person PiV coordinates ofpiTo catch person PiSpeed of (e), ωpiTo catch person PiThe angular velocity of (a) of (b),to catch person PiYaw angle of, chasing person PiIs expressed asvpi∈[0,vp,max],|ωpi|≤ωp,max;xe=(xe,ye) As the coordinate of the escaper E, veSpeed of the escaper E, ωeThe angular velocity of the escaper E, phi is the yaw angle of the escaper E, and the state quantity of the escaper E is represented as Xe={xe,ye,φ},ve∈[0,ve,max],|ωe|≤ωe,max。
In the following steps S3-S7, steps S3-S5 are actually detailed procedures for decision-making on the side of the chaser under the motion condition constrained by step S2 in the confrontation scene constructed in step S1; steps S6-S7 are actually detailed procedures for decision-making on the side of the escaper under the motion condition constrained by step S2 in the confrontation scene constructed by step S2. The principle of steps S3-S7 is shown in FIG. 2.
Referring to fig. 2, for the catcher cluster, the specific execution strategy when executing steps S3-S5 is:
detecting whether an escaper exists, if so, executing an edge mechanism, and if not, executing a covering mechanism;
when executing the edge mechanism, firstly judging whether the edge mechanism is positioned at the cluster boundary, if so, continuously judging whether a potential partner exists, if so, starting to enter a core mechanism, and if not, executing a covering mechanism;
when a core mechanism is executed, firstly, judging whether the number of collaborators in the collaboration area is more than or equal to two so as to form a polygon with the collaborators, if the number of the collaborators is less than two, not executing a structure contraction mechanism, but only executing a distance reduction mechanism, an angle reduction mechanism and a collision avoidance mechanism, if the number of the collaborators is not less than two, further judging whether the collaborators are in a wake flow area, if the collaborators are in the wake flow area, not executing the distance reduction mechanism, but only executing the structure contraction mechanism, the angle reduction mechanism and the collision avoidance mechanism, otherwise, executing all the mechanisms;
after passing through the core mechanism, weight distribution is carried out on each sub-mechanism by adopting an AHP method, a comprehensive objective function is constructed, the PSO algorithm is utilized for solving, and the optimal yaw angle is generated to be output as a real-time strategy.
Referring to fig. 2, the catching effect of the chaser can be verified by the escaper, and the specific execution strategy when the escaper executes steps S6-S7 is:
detecting whether a chaser exists or not, if so, entering an escape mode, constructing an artificial potential field environment according to the detected chaser, constructing a target function, solving by utilizing a PSO (particle swarm optimization) algorithm, generating an optimal yaw angle as a real-time strategy output, and if not, continuing moving towards a target point.
In this embodiment, the confrontation scenario established in step S1 may be as shown in fig. 3, and steps S3-S7 are specifically described based on the confrontation scenario shown in fig. 3.
In step S3, a multi-mechanism capture strategy is constructed for the chaser according to the confrontation scene and the kinematic model. Specifically, the constructed catcher multi-mechanism catching strategy comprises a core mechanism and an auxiliary mechanism.
In this embodiment, to meet the requirement of the active capture function of the chaser cluster, the core mechanism includes three sub-mechanisms, namely an angle reduction mechanism, a distance reduction mechanism, and a structure contraction mechanism. The principle of the core mechanism is shown in fig. 4. In FIG. 4(a), the angle reduction mechanism drives the chaser individual to move to the front of the escaper for blocking, so that the included angle θ is formedepiReducing; the distance reduction mechanism drives the pursuit individual to reduce the distance between the pursuit individual and the escaper, so that the distance depiReducing; while FIGS. 4(b) and 4(c) show that the mechanism of structural contraction will be driven at ΩcoThe chasers in the area have a tendency to change from 4(b) to 4 (c); FIG. 4(d) shows the action of the entire core mechanism. OmegacoThe area indicates that all chasers in the area can sense the escapers, and the chasers communicate information.
(A1) In this embodiment, the sub-mechanism of the angle reduction mechanism is described by the following equation:
behavior trend function Vθ(Xe,xpi)=minθepi;
Wherein, thetaepiThe included angle between the connecting line of the chaser and the escaper and the speed direction of the escaper is formed;
wherein upsilon isee'Is the velocity direction vector of the escaper, upsilonepiConnecting line vector for both chaser and escaperA speed of the motor;
(A2) in this embodiment, the distance reduction mechanism, i.e., the sub-mechanism, is described by the following equation:
Wherein d isepi=||xe'-xpi||2To catch person PiThe distance to the predicted position E' of the escaper at the next moment;
(A3) in this embodiment, the sub-mechanism of the structure contraction mechanism is described by the following equation:
Wherein,to catch person PgQuantity of state of (P)gTo be able to react with PiNeighbor individuals, r, of escapes are simultaneously perceivedkThe radius of the circumscribed circle of the kth triangle formed by any three chasers,is a radius vector, KrIs the total number of circumscribed circles, and E (r) and σ (r) are the mean and standard deviation, respectively, of the radii of the circumscribed circles, where r is the mean and standard deviationkThe calculation method is as follows:
wherein, ak、bk、ckThree sides of the kth triangle, AkIs an edge akThe internal angles i, P and q are three vertexes P of a triangle respectivelyi、Pp、PqSubscript of (1), NsIs PgTotal number of individuals of (1), SkIs the area of the triangle, CkThe perimeter of the triangle.
As shown in fig. 4(b), when it is at ΩcoIn the cooperative chasers in the area, any three chasers can form a triangle and must have a circumcircle. The structure contraction mechanism makes the circumscribed circle of each triangle tend to the mean value of the circumscribed circle set and show a reduction trend by minimizing the radius of the circumscribed circle of all the triangles, eliminates the difference between the circumscribed circles of each triangle, namely reduces the standard deviation of the circumscribed circle set as much as possible, and finally can ensure that the value is in omegacoWherein randomly scattered chasers tend to form a regular polygonal enclosure with the escapers as the center, and continuously shrink to complete the cooperative catching task.
In addition, in order to improve the dynamic stability of the structure contraction mechanism, a wake mechanism is designed to accelerate the cooperative chaser in a cooperative region omegacoInner update iteration as shown in fig. 5. When the chaser is in the wake region omegaco,tI.e. thetaepi>θtailThe chaser will not have the distance reduction mechanism and will be expelled through the arc AA' into the region omega under the action of the speed difference between the opposing partiescoThe problem that the chaser clusters are gathered at the tail of the escaper due to the speed difference is solved, and the omega is favorable forcoThe internal structure contracts the mechanism.
In this embodiment, in order to give good cooperative capture conditions to the core mechanism, the auxiliary mechanism includes three sub-mechanisms, namely, a collision mechanism, an overlay mechanism, and an edge mechanism. Wherein, the collision prevention mechanism prevents the collision inside the chaser cluster, especially the collision prevention when the core mechanism acts; the covering mechanism drives the catcher cluster to be converted from the initial random position state to the uniform covering omegaAThe state of the area, the coverage density of which is controlled by the formation pitch; the edge mechanism judges whether potential collaborators exist at the edge of the cluster, if so, the edge mechanism enters the core mechanism, if not, the queue form interval is maintained, and evacuees are induced to go deep into the cluster to capture.
(B1) In this embodiment, the sub-mechanism of the collision mechanism is described by the following equation:
Wherein d iscolFor a safe distance, dig=||xpi-xpg||2To catch person PiAnd PgDistance of (m)cIs in the power of the day;
(B2) in this embodiment, the detailed design of the sub-mechanism, i.e. the collision mechanism, is as follows:
based on the same virtual force, the coverage of the area by the cluster should be uniform, i.e. the area tends to be tiled and filled with the same basic polygon module. When the perimeter of the basic polygon is fixed, namely the cluster density/formation space is fixed, the coverage area of the regular polygon is maximum. Assuming that the regular polygon of the base has M sides, the internal angle is pi (M-2)/M, and since the planar tiling is required, the internal angle and the circumferential angle 2 pi should be integer division, i.e. 2M/(M-2) should also be integer. Thus, M can take on values of 3, 4, 6. When M is 6, the polygon is a regular hexagon, and actually is a combination of six regular hexagons; when M is 4, the distance between any two vertexes is inconsistent, and the stability is insufficient; thus, the base polygon for planar overlay is a regular triangle. Fig. 6(a) shows a cluster area coverage pattern. Aiming at the problem that one vertex can be added with two sides of one surface or two surfaces and three sides of two surfaces, the core module and the combined module are arranged, the core module (6 vertexes, 4 surfaces and 8 sides) are used as the center, and the whole spliced combined module (2 vertexes, 3 surfaces and 5 sides) covers the game area. When the area of the game area and the preset formation space are known, the required number of individuals can be calculated; or the required formation spacing can be obtained when the area and cluster size are known.
In the area coverage mode, the calculation relationship between the vertex and the surface is as follows:
in the above formula, VsumAnd FsumTotal number of vertices and faces, V, respectivelycAnd FcRespectively the number of vertices and the number of facets in the core module, Vs,addAnd Fs,addNumber of peaks and faces, Q, added separately to a single combined modulesThe number of the combined modules is the same as the number of the combined modules,to take down the integer operator. In addition, Vc=6,Fc=4,Vs,add=2,Fs,add=3。
In the covering mechanism, the stable covering structure is that the catcher cluster takes regular triangles as basic covering units to contrast the area omegaAUniform coverage is performed at known ΩAArea S ofAAnd, in case one of the formation spacing or the number of individual clusters is known, another parameter is calculated by the following formula:
in the above formula, FsumIn order to cover the required number of faces,to get the integer operator, SAIs omegaAArea of (S)ΔIs the area of the base unit for area coverage.
In area coverage, when the cluster density is the formation spacing dfleIn case of change, the cooperation region ΩcoThe number of individuals inside will change as shown in fig. 6 (b). Can be combined withfleIs specifically divided into [1,3 ]]、[3,4]And [4,6]Three cases for each partner, calculated as:andand use it as dfleOf three segmentation points, wherein rcoIs omegacoOf (c) is used.
The overlay mechanism behavior trend function is as follows:
in the above formula, the first and second carbon atoms are,in order to be a repulsive force,in order to have a strong attraction force,is weak attractive force, xifleTo determine the character, xi, for attractive repulsive forcesswMu as a strong or weak attraction determinerrepAnd muattScaling factors for repulsive and attractive forces, respectively, dfleIs the formation pitch. Each virtual force drive function is shown in fig. 7.
(B3) In this embodiment, the sub-mechanism is an edge mechanismIntended as shown in fig. 8. For PiIn other words, when P isgIn the potential cooperation area omegapciWhen is, PgIs PiThe potential collaborators can execute corresponding core mechanisms to capture the escapers; when P isgAt omegaco\ΩpciAnd in the process, the escapers are considered to be still beyond the edge of the cluster, and each chaser keeps the formation distance, so that the self threat is reduced, and the escapers are attracted to go deep into the cluster.
The potential partner judgment is calculated by the following formula:
in the above formula, [ theta ]igTo catch person PiAnd PgAngle v based on escaper EepgTo catch person PgVector of line connecting with escaper E Andare respectively vector upsilonepiAnd upsilonee'、υepiThe outer product between. For PiIn other words, e.g. thetaig>θepi+ π/2 andwhen is, PgIs PiPotential collaborators of (a); at this time for PgIn other words, e.g. thetaig<N andwhen is, PiIs also PgPotential collaborators.
In step S4, a multi-objective optimization function of the chaser, that is, a multi-objective optimization function, is constructed according to the multi-mechanism capture strategy of the chaser, which is constructed in step S3 and composed of the angle reduction mechanism, the distance reduction mechanism, the structure contraction mechanism, the collision mechanism, the coverage mechanism, the edge mechanism, and the like. Specifically, the multi-objective optimization function may be represented by the following formula:
wherein,to catch person PiThe optimum yaw angle of the aircraft is determined,is a cooperative trend function, where κ is a scale factor and λ ═ λ (λ)r,λd,λθ,λf) Is a weight vector, V ═ Vr,Vd,Vθ,Vf) As a behavioral trend vector, ΩcoAs a cooperation area, Ωco,tIn the wake region, the flow of the water is controlled,for the trend function based on the elimination of wake effects,is a trend function based on an enhanced distance reduction mechanism, wherein Vd=exp(depi/rp,cap) -1 is a trend function of the behaviour of the enhanced distance reduction mechanism,is a function of the area coverage trend based on the formation distance.
In step S4, when the introduced analytic hierarchy process is executed to assign weights to the multi-objective optimization function, the following steps may be specifically executed:
according to the behavior trend vector V ═ V (V)r,Vd,Vθ,Vf) The sequence of interest of the containing mechanism lambdac>λr>λθ=λdConstructing a decision matrix U, the elements U of whichijRepresents the influence factor AiIn AjThe degree of importance of;
maximum eigenvalue λ of the calculation matrix UmaxAnd its corresponding feature vector v, as the consistency ratio CR<When 0.1, the constructed matrix U passes consistency check;
after normalizing the feature vector, the obtained trend vector V is equal to (V)r,Vd,Vθ,Vf) The weight distribution scheme of each element is v ═ w1,w2,w3,w4]T=[λc,λr,λθ,λd]T。
And executing the step S5 aiming at the multi-objective optimization function, and solving the multi-objective optimization function by using a particle swarm optimization algorithm to obtain a solution result as the strategy output of the chaser. Specifically, the pellets used in step S5A subgroup algorithm, wherein the particle size N, the dimension d of a parameter to be optimized and the maximum iteration number N are setiterAfter waiting for the parameters, the speed and position are updated by the following formula:
wherein,is the velocity of the d-th dimension in which the particle h is located at time instant tau,is the position of the d-th dimension of the particle h at time τ, w is the inertial weight, c1Learning factors for individuals, c2As a group learning factor, r1And r2Random numbers, pbest, all 0 to 1hdIs the local extreme value of the d-th dimension of the particle h, gbestdIs the global extremum for the d-th dimension.
The result of performing the particle swarm algorithm in step S5 is to obtain the optimal yaw angle of the multiple-objective function phi (-) of the chaser at each momentAnd (4) outputting the strategy as the pursuit party. Specifically, the computer obtains the optimal yaw angle of the chaser at each momentThereafter, the movement of the chaser can be simulated accordingly in the simulation environment, and the movement of the chaser can be controlled accordingly in the actual environment.
In step S6, according to the pursuit multi-mechanism catching strategy composed of the angle reduction mechanism, the distance reduction mechanism, the structure contraction mechanism, the collision mechanism, the coverage mechanism, the edge mechanism, and the like, which is constructed in step S3, a multi-objective escape person optimization function, that is, an escape person optimization model, is constructed through an artificial potential field method. Specifically, the escaper optimization model can be represented by the following equation:
wherein phi is*Optimum yaw angle for the escaper, det=||xe-xt||2The distance between the escaper and the target point,the escape person and the pursuit person PjAnd (4) estimating the distance between the positions at the next moment, wherein eta is a relaxation coefficient, and beta is a judgment coefficient of the existence of the chaser.
And executing the step S7 aiming at the optimum model of the escaper, and solving the multi-objective optimization function by using a particle swarm optimization so as to obtain a solution result as the strategy output of the chaser. Specifically, the particle swarm algorithm used in step S7 is the same as the particle swarm algorithm used in step S5, and is also set for the particle size N, the dimension d of the parameter to be optimized, and the maximum iteration number NiterAfter waiting for the parameters, the speed and position are updated by the following formula:
wherein,is the velocity of the d-th dimension in which the particle h is located at time instant tau,is the position of the d-th dimension of the particle h at time τ, w is the inertial weight, c1Learning factors for individuals, c2As a group learning factor, r1And r2Random numbers, pbest, all 0 to 1hdIs the local extreme value of the d-th dimension of the particle h, gbestdIs the global extremum for the d-th dimension.
The result of executing the particle swarm optimization in step S7 is to obtain the optimal yaw angle phi of the multi-target function psi (-) of the escaper at each moment*And the strategy is output as the strategy of the escaper side. Specifically, the computer obtains the optimum yaw angle φ of the escaper at each moment*Then, the motion of the escaper can be simulated under the simulation environment, and the motion of the escaper can be controlled under the actual environment.
By executing steps S1-S5 or S1-S7, the cluster distributed capture method based on the multi-mechanism combination policy in this embodiment can achieve the following technical effects:
(1) compared with the existing differential game and geometric method, the technical means in the embodiment has simpler and more distinct behavior strategy, does not need complex modeling derivation, has simple and convenient algorithm and quick operation, does not need complex hyper-parameter adjustment and long-time training process, and has quick deployment capability;
(2) the distributed interaction mechanism is utilized to utilize local sensing and communication capacity, and by means of the designed simple rule and the local interaction mechanism, the group has higher robustness, self-adaptability and expansibility, and can exert task capacity superior to centralized control under the condition of limited communication and processing capacity;
(3) the intelligent sensing range, the communication range and the steering maneuverability limit are considered by combining with actual conditions, the method has better practical application potential, and the constructed model is fast in solving speed under the resolving of the particle swarm algorithm, so that the real-time requirement can be met.
In this embodiment, in order to test the effectiveness of the cooperative capture algorithm based on the multi-mechanism combination strategy, relevant simulation verification is performed, and a 20 × 20m barrier-free scene is established in the simulation environment, wherein the game area is a 10 × 10m area framed by a red wire frame, and a chaser cluster is randomly generated in the area. The coordinates of the starting point of the escaper are (2.5 ), and the coordinates of the target point are (17.5 ).
The basic parameter settings of both parties are shown in table 1:
TABLE 1 run-on simulation parameter settings
From the parameters in the table, the speed ratio α ═ v between the chaser and the escaperp,max/ve,maxThe ratio of the number of individuals is 15:1, the maximum angular velocity, the radius of perception and the radius of the device are all the same, and the catch radius of the chaser is equal to the alert radius of the escaper. In addition, the hyper-parameter setting in the algorithm is as follows: k is set to 1, λ ═ λr,λd,λθ,λf]=[0.2624,0.1411,0.1411,0.4554]The maximum iteration number of the PSO algorithm is 35, the particle population size is 50, and the speed updating related parameter is [ w, c ]1,c2]=[0.73,1.5,1.5]。
In this embodiment, the algorithm simulation process is as shown in fig. 9, and the result shows that the chaser cluster can finally successfully catch a single high-speed escaper. Specifically, the method comprises the following steps: when t is 0s, the chaser cluster is randomly generated in the game area, and the escaper starts from the starting position; when t is 5s to 15s, the chaser starts to cover the game area under the action of the covering mechanism to prepare for catching; when t is 15s to 20s, the escaper finds that more chasers are in the sensing range of the escaper and searches for the weak direction of the chaser to escape, and the escaper does not enter the inside of the chaser cluster but moves around the edge of the cluster at the moment, so that the maneuvering flexibility of the escaper strategy is embodied; when t is 25s, under the action of an edge mechanism, a chaser at the edge of the cluster does not directly catch an escaper and let the escaper go deep into the cluster; when t is 30s, the core mechanism and the collision avoidance mechanism start to execute, but the escaper has escaped to cooperatively catch the edge of the enclosure ring because the speed of the escaper is high; when t is 35s, the surrounding structure in the previous period smoothly transits to the stage under the action of a wake mechanism, and the surrounding structure in the stage stably and accurately catches the high-speed escaper. Through the analysis, the multi-mechanism combination strategy provided by the work can activate different sub-mechanism combinations under different conditions, so that a chaser cluster can successfully cooperate to catch a single high-speed escaper under the framework of distributed local area communication, meanwhile, a model constructed by the invention has higher convergence efficiency under the resolving of a particle swarm algorithm, and a single-step resolving convergence algebraic statistical graph is shown in a figure 10.
Preferably, in the simulation example, the speed ratio and the formation pitch setting are respectively equal to α and 0.5, and dfleThe predetermined task can be successfully completed as 3.69. In order to find out the performance domain of the proposed method, the section develops tests in two dimensions of speed ratio and formation spacing based on success rate and collision rate indexes, each data point is subjected to 100 repeated tests, and the results are shown in fig. 11.
In the success rate graph 11(a), the speed ratio ranges from 0.4 to 1.0, with an interval of 0.1; the formation spacing is from 2.0 to 5.5, and the interval is 0.5. Analysis shows that the cluster capture success rate increases with the increase of the speed ratio and the power increase decreases with the increase of the speed ratio under the same formation spacing; under the same speed ratio, on the scale of the formation spacing, the success rate is higher in the middle and lower on two sides, and is in a single-peak shape, and along with the increase of the speed ratio, the disadvantage state of the formation spacing of the chaser clusters on two sides or too large or too small is gradually compensated by the increase of the speed. Therefore, when the formation distance is properly selected, the cluster has better capturing capacity. In the collision rate graph 11(b), the speed ratio range is set in accordance with the success rate curve. The collision rate indicates that any individual has a collision event during a single test, and is recorded as a collision. As can be seen from the data diagram, under the same formation pitch, the collision rate increases with the increase of the speed ratio; the smaller the formation pitch, the higher the collision rate at the same speed ratio. Therefore, the larger the formation pitch, the smaller the speed ratio, and the lower the collision rate, but a higher success rate cannot be ensured. Therefore, a comprehensive evaluation index is needed to evaluate the cluster performance, and the optimal formation spacing setting under different speed ratio conditions is found according to the evaluation index.
The formula of the constructed comprehensive evaluation index is shown as follows:
in the above formula, Eh,scFor comprehensive evaluation of the index value, Rs,RcRespectively indicating success rate and collision rate. Eh,scMeans based on RcTo RsThe discount of (1). In particular, when RcWhen equal to 0, Eh,sc=Rs。
The cluster performance thermodynamic diagrams presented by the composite evaluation index in the speed ratio and formation pitch dimensions are shown in fig. 12. The darker the color in the graph indicates that the cluster is better in comprehensive performance, i.e., high in power and low in collision rate, and is not good enough otherwise. According to analysis, when d isfleWhen the speed ratio is 4, the cluster has better performance under each speed ratio; when α is 0.7, the chaser cluster has a wide excellent performance area without a large speed capability. Meanwhile, areas A, B and C in the graph represent a comprehensive performance stable area, namely when data points are located in the area, even if the speed ratio and the formation spacing fluctuate within a certain range in the actual situation, the cluster comprehensive performance does not fluctuate obviously, and the robustness is good. In summary, the best performance region of the strategy proposed in this section is the B region, which is the most suitable for the case where the speed ratio is about 0.7 and the formation pitch is set to 4, and when facing other speed ratio cases, the formation pitch may also be selected to be 4, so as to ensure that the cluster has better comprehensive performance. In addition, in consideration of the bounding mode shown in fig. 6(b), the specific formation interval can be calculated asTherefore, when ΩcoWhen there are three or four collaborators in the area, the cluster performance is better, and too many or too few collaborators can influence the comprehensive performance of the cluster.
Through the description of the steps and the simulation results shown in fig. 9-12, the invention establishes a distributed capture method based on a multi-mechanism combination strategy by taking the arrival-rejection game in the pursuit game as the scene cut-in and emphasizing the pursuit player cluster, various sub-mechanisms driven by behavior guidance are designed in a modularized way, the cluster capture problem under the manufacture of each sub-mechanism is converted into a multi-objective optimization problem, and the output strategy is solved in real time by utilizing a particle swarm algorithm. The method has simple and clear behavior strategy, does not need complicated modeling derivation, hyper-parameter adjustment and long-time training process, has rapid deployment capability, higher robustness, adaptivity and expansibility, has high algorithm solving speed, and can meet the real-time requirement.
The cluster distributed capture method based on the multi-mechanism combination strategy in the embodiment can be implemented by writing a computer program for implementing the cluster distributed capture method based on the multi-mechanism combination strategy in the embodiment, writing the computer program into a computer device or a storage medium, and when the computer program is read out and operated, implementing the cluster distributed capture method based on the multi-mechanism combination strategy in the embodiment, thereby achieving the same technical effect as the cluster distributed capture method based on the multi-mechanism combination strategy in the embodiment.
It should be noted that, unless otherwise specified, when a feature is referred to as being "fixed" or "connected" to another feature, it can be directly fixed or connected to the other feature or indirectly fixed or connected to the other feature. Furthermore, the descriptions of upper, lower, left, right, etc. used in the present disclosure are only relative to the mutual positional relationship of the constituent parts of the present disclosure in the drawings. As used in this disclosure, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. In addition, unless defined otherwise, all technical and scientific terms used in this example have the same meaning as commonly understood by one of ordinary skill in the art. The terminology used in the description of the embodiments herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this embodiment, the term "and/or" includes any combination of one or more of the associated listed items.
It will be understood that, although the terms first, second, third, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element of the same type from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element, without departing from the scope of the present disclosure. The use of any and all examples, or exemplary language ("e.g.," such as "or the like") provided with this embodiment is intended merely to better illuminate embodiments of the invention and does not pose a limitation on the scope of the invention unless otherwise claimed.
It should be recognized that embodiments of the present invention can be realized and implemented in computer hardware, a combination of hardware and software, or by computer instructions stored in a non-transitory computer readable memory. The methods may be implemented in a computer program using standard programming techniques, including a non-transitory computer-readable storage medium configured with the computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner, according to the methods and figures described in the detailed description. Each program may be implemented in a high level procedural or object oriented programming language to communicate with a computer system. However, the program(s) can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language. Furthermore, the program can be run on a programmed application specific integrated circuit for this purpose.
Further, operations of processes described in this embodiment can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The processes described in this embodiment (or variations and/or combinations thereof) may be performed under the control of one or more computer systems configured with executable instructions, and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) collectively executed on one or more processors, by hardware, or combinations thereof. The computer program includes a plurality of instructions executable by one or more processors.
Further, the method may be implemented in any type of computing platform operatively connected to a suitable interface, including but not limited to a personal computer, mini computer, mainframe, workstation, networked or distributed computing environment, separate or integrated computer platform, or in communication with a charged particle tool or other imaging device, and the like. Aspects of the invention may be embodied in machine-readable code stored on a non-transitory storage medium or device, whether removable or integrated into a computing platform, such as a hard disk, optically read and/or write storage medium, RAM, ROM, or the like, such that it may be read by a programmable computer, which when read by the storage medium or device, is operative to configure and operate the computer to perform the procedures described herein. Further, the machine-readable code, or portions thereof, may be transmitted over a wired or wireless network. The invention described in this embodiment includes these and other different types of non-transitory computer-readable storage media when such media include instructions or programs that implement the steps described above in conjunction with a microprocessor or other data processor. The invention also includes the computer itself when programmed according to the methods and techniques described herein.
A computer program can be applied to input data to perform the functions described in the present embodiment to convert the input data to generate output data that is stored to a non-volatile memory. The output information may also be applied to one or more output devices, such as a display. In a preferred embodiment of the invention, the transformed data represents physical and tangible objects, including particular visual depictions of physical and tangible objects produced on a display.
The above description is only a preferred embodiment of the present invention, and the present invention is not limited to the above embodiment, and any modifications, equivalent substitutions, improvements, etc. within the spirit and principle of the present invention should be included in the protection scope of the present invention as long as the technical effects of the present invention are achieved by the same means. The invention is capable of other modifications and variations in its technical solution and/or its implementation, within the scope of protection of the invention.
Claims (10)
1. The cluster distributed capture method based on the multi-mechanism combination strategy is characterized by comprising the following steps:
constructing a confrontation scene of a chaser and an escaper; the chaser and the escaper are both intelligent agents;
establishing a kinematic model of the chaser and the escaper;
constructing a multi-mechanism capture strategy of a chaser according to the confrontation scene and the kinematics model;
constructing a multi-objective optimization function according to the multi-mechanism capture strategy of the chaser, and introducing an analytic hierarchy process to distribute weights for the multi-objective optimization function;
and solving the multi-objective optimization function by using a particle swarm algorithm, and outputting a solving result as a strategy of the chaser.
2. The cluster distributed capture method based on multi-mechanism combination strategy as claimed in claim 1, wherein the cluster distributed capture method based on multi-mechanism combination strategy further comprises:
constructing an optimized model of an escaper by an artificial potential field method;
and solving the optimized model of the escaper by using a particle swarm algorithm to obtain a solution result as the strategy output of the escaper.
3. The multi-mechanism combination strategy-based cluster distributed capture method according to claim 1 or 2, wherein the countermeasure scenario comprises:
catch up person Pi1,2,3,.., N, escaper E, starting point (x)start,ystart) Target point (x)target,ytarget) And a countermeasure area ΩA;
The maximum speed v of the chaserp,maxThe maximum velocity of the escaper is ve,maxBoth running at maximum speed and having vp,max<ve,max;
The number of the chaser individuals is N, the number of the escaper individuals is 1, and N is more than 1;
the perception radius of the chaser is rp,maxRadius of perception of escaper is re,maxIs at a distance of rp,max=re,maxAnd both the pursuit and the evasion parties can acquire the position coordinates and the speed vectors of the individuals of the opposite party in the self-perception range through perception;
the maximum angular velocity of the chaser is limited to omegap,maxThe maximum angular velocity of the escaper is limited to ωe,maxHas omegap,max=ωe,max;
The chaser has communication distance limitation with r as the communication radiusp,comHas r ofp,com=2·rp,maxAnd the chaser can mutually acquire position coordinates and speed vectors with neighbors in a communication range;
the arresting radius of the chaser is rp,capThe warning radius of the escaper is ralert。
4. The multi-mechanism combination strategy-based cluster distributed capture method according to claim 1 or 2, wherein the kinematic model comprises:
wherein x ispi=(xpi,ypi) To catch person PiCoordinate of (v)piTo catch person PiSpeed of (e), ωpiTo catch person PiThe angular velocity of (a) of (b),to catch person PiYaw angle of, chasing person PiIs expressed asvpi∈[0,vp,max],|ωpi|≤ωp,max;xe=(xe,ye) As the coordinate of the escaper E, veSpeed of the escaper E, ωeThe angular velocity of the escaper E, phi the yaw angle of the escaper E, and the state quantity of the escaper E is represented as Xe={xe,ye,φ},ve∈[0,ve,max],|ωe|≤ωe,max。
5. The multi-mechanism combination strategy-based cluster distributed capture method according to claim 1 or 2, wherein the chaser multi-mechanism capture strategy comprises a core mechanism and an auxiliary mechanism; the core mechanism includes an angle reduction mechanism, a distance reduction mechanism, and a structure contraction mechanism, and the auxiliary mechanism includes a collision mechanism, an overlay mechanism, and an edge mechanism.
6. The cluster distributed capture method based on the multi-mechanism combination strategy as claimed in claim 5, characterized in that:
the angle reduction mechanism includes:
behavior trend function Vθ(Xe,xpi)=minθepi;
Wherein, thetaepiThe included angle between the connecting line of the chaser and the escaper and the speed direction of the escaper;
wherein upsilon isee'Is the velocity direction vector of the escaper, upsilonepiConnecting line vector for both chaser and escaperVelocity of (d);
the distance reduction mechanism comprises:
Wherein d isepi=||xe'-xpi||2To catch person PiThe distance to the predicted position E' of the escaper at the next moment;
the structural contraction mechanism comprises:
Wherein,to catch person PgQuantity of state of (P)gTo be combined with PiNeighbor individuals, r, of escapes are simultaneously perceivedkThe radius of the circumscribed circle of the kth triangle formed by any three chasers,is a radius vector, KrIs the total number of circumscribed circles, and E (r) and σ (r) are the mean and standard deviation, respectively, of the radii of the circumscribed circles, where r is the mean and standard deviationkThe calculation method is as follows:
wherein, ak、bk、ckThree sides of the kth triangle, AkIs an edge akThe internal angles i, P and q are three vertexes P of a triangle respectivelyi、Pp、PqSubscript of (2), NsIs PgTotal number of individuals of (1), SkIs the area of the triangle, CkThe perimeter of the triangle.
7. The cluster distributed capture method based on the multi-mechanism combination strategy according to claim 5, characterized in that:
the collision mechanism includes:
Wherein d iscolFor a safe distance, dig=||xpi-xpg||2To catch person PiAnd PgM, mcIs in the power of the day;
the overlay mechanism includes:
Wherein,in order to act as a repulsive force,in order to have strong attraction force, the air conditioner is provided with a fan,is weak attractive force, xifleIs an attractive repulsive force determiner, ξswMu as a strong or weak attraction determinerrepScaling factor, mu, for repulsive forcesattScaling factor for attractive force, dfleIs the formation spacing;
in the coverage mechanism, the stable coverage structure is that a chaser cluster takes a regular triangle as a basic coverage unit to contrast an area omegaAUniform coverage is performed at known ΩAArea S ofAAnd, in case one of the formation spacing or the number of individual clusters is known, another parameter is calculated by the following formula:
wherein, FsumIn order to cover the required number of faces,to get the integer operator, SAIs omegaAArea of (S)ΔIs the area of the base unit for area coverage;
the edge mechanism includes:
the potential collaborators are judged by the following calculation:
wherein, thetaigTo catch person PiAnd PgAngle v based on escaper EepgTo catch person PgTo escapers ELine vector Is a vector vepiAnd upsilonee'The outer product of the two phases is,is a vector vepiAnd upsilonepiThe outer product between; for PiIn other words, e.g. thetaig>θepi+ pi/2 andwhen is, PgIs PiFor P at this timegIn other words, e.g. thetaig<N andwhen P is presentiIs also PgPotential collaborators.
8. The cluster distributed capture method based on the multi-mechanism combination strategy as claimed in claim 1 or 2, characterized in that:
the multi-objective optimization function comprises:
wherein,to catch person PiThe optimum yaw angle of the aircraft is determined,is a cooperative trend function, where κ is a scale factor and λ ═ λ (λ)r,λd,λθ,λf) Is a weight vector, V ═ Vr,Vd,Vθ,Vf) As a behavioral trend vector, ΩcoAs a cooperation area, Ωco,tIn the wake region, the flow of the water is controlled,for the trend function based on the elimination of wake effects,is a trend function based on an enhanced distance reduction mechanism, wherein Vd=exp(depi/rp,cap) -1 is a trend function of the behaviour of the enhanced distance reduction mechanism,is a region coverage trend function based on the formation distance;
the introduction of the analytic hierarchy process to distribute the weight to the multi-objective optimization function comprises the following steps:
according to the behavior trend vector V ═ V (V)r,Vd,Vθ,Vf) The sequence of interest of the mechanism involved, lambdac>λr>λθ=λdConstructing a judgment matrix U, the elements U of whichijRepresents the influence factor AiTo AjThe degree of importance of;
maximum eigenvalue λ of the calculation matrix UmaxAnd its corresponding feature vector v, as the consistency ratio CR<When 0.1, the constructed matrix U passes consistency check;
after the feature vector is normalized, the obtained trend vector V is equal to (V)r,Vd,Vθ,Vf) The weight distribution scheme of each element is v ═ w1,w2,w3,w4]T=[λc,λr,λθ,λd]T。
9. The multi-mechanism combination strategy-based cluster distributed capture method according to claim 2, wherein the escaper optimization model comprises:
wherein phi*Optimum yaw angle for the escaper, det=||xe-xt||2The distance between the escaper and the target point,the escape person and the pursuit person PjAnd (4) estimating the distance between the positions at the next moment, wherein eta is a relaxation coefficient, and beta is a judgment coefficient of the existence of the chaser.
10. The multi-mechanism combination strategy based cluster distributed capture method according to claim 1 or 2, wherein the particle swarm algorithm comprises:
setting the particle size nDimension d of parameter to be optimized, maximum number of iterations Niter(ii) a The velocity and position are updated by the following formula:
wherein,is the velocity of the d-th dimension in which the particle h is located at time instant tau,is the position of the d-th dimension of the particle h at time τ, w is the inertial weight, c1Learning factors for individuals, c2As a group learning factor, r1And r2Random numbers, pbest, all 0 to 1hdIs the local extreme value of the d-th dimension of the particle h, gbestdIs the global extremum for the d-th dimension.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210256828.8A CN114692397A (en) | 2022-03-16 | 2022-03-16 | Cluster distributed capture method based on multi-mechanism combination strategy |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210256828.8A CN114692397A (en) | 2022-03-16 | 2022-03-16 | Cluster distributed capture method based on multi-mechanism combination strategy |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114692397A true CN114692397A (en) | 2022-07-01 |
Family
ID=82138212
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210256828.8A Pending CN114692397A (en) | 2022-03-16 | 2022-03-16 | Cluster distributed capture method based on multi-mechanism combination strategy |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114692397A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116378897A (en) * | 2023-05-04 | 2023-07-04 | 华北电力大学 | Wind farm yaw angle control method and device |
CN117687322A (en) * | 2024-02-04 | 2024-03-12 | 青岛哈尔滨工程大学创新发展中心 | AUV cluster countermeasure simulation system and method considering individual faults |
-
2022
- 2022-03-16 CN CN202210256828.8A patent/CN114692397A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116378897A (en) * | 2023-05-04 | 2023-07-04 | 华北电力大学 | Wind farm yaw angle control method and device |
CN116378897B (en) * | 2023-05-04 | 2023-12-26 | 华北电力大学 | Wind farm yaw angle control method and device |
CN117687322A (en) * | 2024-02-04 | 2024-03-12 | 青岛哈尔滨工程大学创新发展中心 | AUV cluster countermeasure simulation system and method considering individual faults |
CN117687322B (en) * | 2024-02-04 | 2024-05-03 | 青岛哈尔滨工程大学创新发展中心 | AUV cluster countermeasure simulation system and method considering individual faults |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Chen et al. | Parallel planning: A new motion planning framework for autonomous driving | |
Dang et al. | Visual saliency-aware receding horizon autonomous exploration with application to aerial robotics | |
Kebria et al. | Deep imitation learning for autonomous vehicles based on convolutional neural networks | |
CN104881043B (en) | A kind of multiple no-manned plane for many dynamic objects is intelligent coordinated to examine printing method | |
CN114692397A (en) | Cluster distributed capture method based on multi-mechanism combination strategy | |
CN108983823B (en) | Plant protection unmanned aerial vehicle cluster cooperative control method | |
JP2021176109A (en) | Room layout estimating method and technique | |
CN110795833B (en) | Crowd evacuation simulation method, system, medium and equipment based on cat swarm algorithm | |
Hamandi et al. | Deepmotion: Learning to navigate like humans | |
CN104331917B (en) | A kind of panic crowd's escape analogy method | |
CN114489144B (en) | Unmanned aerial vehicle autonomous maneuver decision method and device and unmanned aerial vehicle | |
Hamdi et al. | SADA: semantic adversarial diagnostic attacks for autonomous applications | |
CN112512755A (en) | Robotic manipulation using domain-invariant 3D representations predicted from 2.5D visual data | |
Yu et al. | Review of deep reinforcement learning | |
CN114387462A (en) | Dynamic environment sensing method based on binocular camera | |
Bouwmeester et al. | Nanoflownet: Real-time dense optical flow on a nano quadcopter | |
Li et al. | Vg-swarm: A vision-based gene regulation network for uavs swarm behavior emergence | |
CN114815891A (en) | PER-IDQN-based multi-unmanned aerial vehicle enclosure capture tactical method | |
CN113255893A (en) | Self-evolution generation method of multi-agent action strategy | |
CN109117571B (en) | Automatic control method for interconnected cooperative motion of unmanned aerial vehicle cluster | |
Rojas-Perez et al. | A temporal CNN-based approach for autonomous drone racing | |
Sanket | Active vision based embodied-ai design for nano-uav autonomy | |
Li et al. | YOLO-A2G: An air-to-ground high-precision object detection algorithm based on YOLOv5 | |
CN113589810A (en) | Intelligent dynamic autonomous obstacle avoidance movement method and device, server and storage medium | |
DeLima et al. | Toward developing an optimal cooperative search algorithm for multiple unmanned aerial vehicles |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |