CN112327923A - Multi-unmanned aerial vehicle collaborative path planning method - Google Patents

Multi-unmanned aerial vehicle collaborative path planning method Download PDF

Info

Publication number
CN112327923A
CN112327923A CN202011303855.3A CN202011303855A CN112327923A CN 112327923 A CN112327923 A CN 112327923A CN 202011303855 A CN202011303855 A CN 202011303855A CN 112327923 A CN112327923 A CN 112327923A
Authority
CN
China
Prior art keywords
unmanned aerial
path
point
aerial vehicle
particle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011303855.3A
Other languages
Chinese (zh)
Other versions
CN112327923B (en
Inventor
周志浪
刘小波
郑可心
张超超
杨建峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN202011303855.3A priority Critical patent/CN112327923B/en
Publication of CN112327923A publication Critical patent/CN112327923A/en
Application granted granted Critical
Publication of CN112327923B publication Critical patent/CN112327923B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/10Simultaneous control of position or course in three dimensions
    • G05D1/101Simultaneous control of position or course in three dimensions specially adapted for aircraft
    • G05D1/104Simultaneous control of position or course in three dimensions specially adapted for aircraft involving a plurality of aircrafts, e.g. formation flying

Landscapes

  • Engineering & Computer Science (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)

Abstract

The invention provides a multi-unmanned aerial vehicle collaborative path planning method, which comprises the following steps: the unmanned aerial vehicle group obtains a starting point position, an obstacle point and existing radar coordinates in a map environment, and establishes a two-dimensional grid map of a combat environment; according to the obtained two-dimensional grid map, establishing an optimization target of the multiple unmanned aerial vehicles according to the path length of each unmanned aerial vehicle and the threat strength; initializing the population by combining random generation path points and Q learning to obtain an initial population of each unmanned aerial vehicle path; optimizing the initial population by adopting an improved particle swarm algorithm according to the optimization target; the method takes the overall benefits of the multiple unmanned aerial vehicles as an optimization target, and ensures the safety and the high efficiency of the flight path of each unmanned aerial vehicle.

Description

Multi-unmanned aerial vehicle collaborative path planning method
Technical Field
The invention relates to the technical field of unmanned aerial vehicles, in particular to a multi-unmanned aerial vehicle collaborative path planning method.
Background
The cooperation of many unmanned aerial vehicles is the main style of future unmanned aerial vehicle application, and many unmanned aerial vehicles cooperate the performance of route planning has decided many unmanned aerial vehicles synergy effect to a great extent, is the focus of unmanned aerial vehicle route planning research at present.
The cooperative path planning of the multiple unmanned aerial vehicles refers to planning paths for the multiple unmanned aerial vehicles under the condition of considering cooperative constraint, so that the multiple unmanned aerial vehicles can cooperatively complete tasks at minimum cost. Compared with a single unmanned aerial vehicle, the implementation of path planning of multiple unmanned aerial vehicles is more limited and more difficult, which is a key point and a difficult point of current path planning.
The traditional algorithms such as A, Voronoi and the like have high calculation efficiency and simple planning. MA P B et al use Voronoi diagrams and collaborative strategies to achieve collaborative track planning that satisfies time constraints. Lidewei et al, by improving the search sequence and optimizing the valuation function, changes the undirected search in the A-algorithm into a directed search, changes the global valuation into the local valuation, and improves the algorithm efficiency. However, the traditional algorithm has many limitations in solving the path planning problem, for example, the precision method is only suitable for small-scale path planning, and when the objective function and the constraint condition are complicated, the precision method is difficult to provide an effective solution. The heuristic algorithm is easy to fall into local optimization and is also not suitable for the problem of large scale.
In view of the limitation of solving the path planning of multiple unmanned aerial vehicles by the traditional algorithm, more and more learners utilize the intelligent optimization algorithm to solve the unmanned aerial vehicle cluster path planning, the intelligent optimization algorithm has strong self-organization, strong robustness and strong expandability, and can be used for solving the unmanned aerial vehicle cooperation problem under complex constraint. The ant colony algorithm, the particle swarm algorithm and the genetic algorithm are the most widely used 3 types of methods.
All quantum ants complete path search according to tabu search and quantum pheromone updating node selection probability, quantum rotation angles are updated according to comprehensive cost of optimal paths, quantum pheromone is updated by using simulated quantum rotating gates, the output optimal paths are stored in a path set, whether the number of the paths in the path set reaches the maximum number of the paths is judged, and the paths in the path set are sorted according to length for selection of the unmanned aerial vehicle. Yu et al generated a solution sequence of the unmanned aerial vehicle cluster path using the track point numbers, and the multiple sequences divided by the division points represented multiple unmanned aerial vehicle paths, and introduced differential evolution operation to maintain the diversity of particles for solving the problem of particle precocity, and solved by a hybrid particle swarm algorithm in combination with an adaptive adjustment inertial weight strategy. XIAO et al, in order to solve the problem of path planning of multiple unmanned aerial vehicles traversing multiple target points with obstacle area restriction, aiming at the lowest total flight cost, design a new crossover operator in the process of solving by using a genetic algorithm, randomly select a sub-path from a parent generation and lead the sub-path, and generate paths traversing other target points according to a certain rule, thereby improving the convergence speed while retaining the excellent chromosomes of the parent generation.
In the invention, the particle swarm algorithm based on single route point evaluation can realize learning more potential better route points in the unmanned aerial vehicle cluster, but cross route points can be generated, and in this case, the obtained routes need to be subjected to conflict processing, so that the convergence speed is slowed.
The UUV path planning method based on the particle swarm algorithm does not consider threats possibly encountered in the running process of the unmanned ship, and is a single-target path planning method.
The unmanned aerial vehicle cluster path planning based on the improved Q learning algorithm only plans a path for each unmanned aerial vehicle independently, and does not consider the overall benefit of the unmanned aerial vehicle cluster.
The path planning method based on the ant colony algorithm judges whether the cooperation can be achieved or not by smoothing the initial path of each unmanned aerial vehicle, does not consider the cooperation problem among the unmanned aerial vehicles in the algorithm process, and only plans the path for each unmanned aerial vehicle independently.
Aiming at the current existing unmanned aerial vehicle path planning method, the characteristics of multi-target and cooperative optimization in the multi-unmanned aerial vehicle path planning in the actual situation are combined, the cooperative nature of the multi-unmanned aerial vehicle path planning needs to be deeply researched, and the overall benefit of the unmanned aerial vehicle cluster can be increased under the condition that the unmanned aerial vehicle can reach the destination smoothly.
The invention provides a multi-unmanned aerial vehicle collaborative path planning method based on an improved particle swarm algorithm. In the initialization of particle swarm, the invention adopts a mode of combining random initialization and Q learning initialization, thereby ensuring the diversity of the initial population, accelerating the convergence speed of the algorithm, and improving the position and speed updating mode of the particle swarm algorithm, so that the method is suitable for the path planning problem in a coordinate form; the path lengths and threat strengths of the multiple unmanned aerial vehicles are considered in the flight process of the unmanned aerial vehicles, so that the high efficiency and safety of the multiple unmanned aerial vehicles for executing tasks are ensured; the weighted path length and the threat intensity of the multiple unmanned aerial vehicles are used as optimization targets, and the overall benefit of the unmanned aerial vehicle cluster is increased.
Disclosure of Invention
In view of the above, the present invention provides a method for planning a collaborative path of multiple drones, including the following steps:
s1, acquiring radar coordinates including a starting point position, an obstacle point and an existing radar in a map environment by the unmanned aerial vehicle cluster, and establishing a two-dimensional grid map of a battle environment;
s2, constructing an optimization target of the multiple unmanned aerial vehicles according to the two-dimensional grid map established in the S1 through the path length of each unmanned aerial vehicle and the threat strength;
s3, initializing the population by combining random generation path points and Q learning to obtain an initial population of each unmanned aerial vehicle path;
s4, optimizing the initial population obtained in the S3 by adopting an improved particle swarm optimization according to the optimization target constructed in the step S2;
and S5, obtaining the optimal particles of the population, and outputting the air routes of all unmanned aerial vehicles in the unmanned aerial vehicle cluster.
The technical scheme provided by the invention has the beneficial effects that: 1. the mixed mode of the random generation path and the Q learning generation path is adopted, so that the superiority of the initial solution of the population is ensured;
2. the variation operation is based on the distance and the included angle between the point to be selected and the end point, different selection probabilities are given to the points to be selected, and the convergence speed of the algorithm is accelerated;
3. the overall benefit of the multiple unmanned aerial vehicles is taken as an optimization target, and the safety and the high efficiency of flight paths of the unmanned aerial vehicles are guaranteed.
Drawings
Fig. 1 is a flow chart of a multi-drone collaborative path planning method of the present invention;
FIG. 2 is a grid environment diagram of a multi-UAV collaborative path planning method according to the present invention;
fig. 3 is a schematic diagram illustrating an action variation of a multi-drone collaborative path planning method according to the present invention;
fig. 4 is a cross operation mode 1 of optimal particle and individual for the multi-drone collaborative path planning method of the present invention;
fig. 5 is a cross operation mode 2 of the optimal particle and individual for the multi-unmanned aerial vehicle collaborative path planning method of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be further described with reference to the accompanying drawings.
Referring to fig. 1, the present invention provides a method for planning a collaborative path of multiple drones, including the following steps:
s1, acquiring radar coordinates including a starting point position, an obstacle point and an existing radar in a map environment by the unmanned aerial vehicle cluster, and establishing a two-dimensional grid map of a battle environment;
the path points are represented in a form of coordinates in a two-dimensional grid graph, and the flight environment of the unmanned aerial vehicle is described as follows:
{(x,y)|xmin≤x≤xmax,ymin≤y≤ymax};xmin、xmax、yminand ymaxRepresented as the boundary of the drone flight;
as shown in fig. 2, the environment is divided into 20 × 20 cells having the same area, each cell carries different parameter information of 0, -1, 1, and 2, and when the cell parameter is 0, it indicates that the area has no obstacle, when the cell parameter is 1, it indicates that the area contains an obstacle or a threat source, when the cell parameter is-1, and it indicates that the area has an end point, and by constructing a two-dimensional grid map, it is possible to obtain environment information well.
S2, constructing an optimization target of the multiple unmanned aerial vehicles according to the two-dimensional grid map established in the S1 through the path length of each unmanned aerial vehicle and the threat strength;
number of steps f taken by i unmanned aerial vehiclesstepConsidering the constraint of the turning angle of the unmanned aerial vehicle as step, if the unmanned aerial vehicle has oblique flight in the path, a penalty f is imposedpenaltyAnd obtaining the path length expression of the single unmanned aerial vehicle:
fi=fstep+fpenalty
the path comprehensive weighting length of the multiple unmanned aerial vehicles is calculated by the following formula:
Figure BDA0002787681360000051
wherein m is the number of unmanned aerial vehicles;
dividing a complete path from a starting point to a target point into three segments, and taking each threat point to 1/4, 2/4 and 3/4 of each segment for summation to obtain the threat intensity of the threat point j to the unmanned aerial vehicle i as represented by
Figure BDA0002787681360000052
Where t is the three parts of the path, itFor the t-th segment of the drone path,
Figure BDA0002787681360000053
and
Figure BDA0002787681360000054
for the fourth power of the distance from threat point j to the t-th paths 1/4, 2/4, and 3/4, K is the adjustment factor, and the threat strength of a single drone under the path is:
Figure BDA0002787681360000055
wherein N is the number of threat points, and the comprehensive threat intensity that the unmanned aerial vehicle received of definition is:
Figure BDA0002787681360000056
constructing an overall optimization target f of multiple unmanned aerial vehicles: f ═ w1*f1+w2*f2Wherein w is1And w2Respectively, the total weighted path length and the weight of the total weighted threat strength.
S3, initializing the population by combining random generation path points and Q learning to obtain an initial population of each unmanned aerial vehicle path;
the invention adopts a random path point generation mode to obtain part of initial population, the random path point generation mode is that when the initial species group is generated, if a continuous path is obtained, the unmanned aerial vehicle can select a free grid from 8 adjacent grids around the unmanned aerial vehicle as a next grid, the selection method can ensure that the path is continuous, the generated path has diversity, but the generated path is longer, the path threat intensity is higher, and simultaneously, the evolution algebra is increased to influence the convergence speed.
Therefore, the invention obtains another part of the initial population by using a Q learning algorithm, the purpose of the Q learning algorithm is to obtain the maximum accumulated reward value in the interaction with the environment, and therefore, the construction of a reasonable reward function is very important. In the present invention, the reward function is designed as shown in table 1;
setting of table-return function
Unmanned plane status Starting point Target point Obstacle Threat point Feasible point
Reward value
0 20 -1 1 r
In order to accelerate the convergence rate of Q learning, a feasible point reward function based on a sparse function is constructed in Q learning:
Figure BDA0002787681360000061
wherein r is0Based on the prize value, rmaxTo take into account the maximum reward value at the time of threat, rthreatAs a threat coefficient, xUAVAnd yUAVCoordinates for unmanned aerial vehicles, xgoalAnd ygoalCoordinate of end point, xithreatAnd yithreatAs coordinates of each threat source, kgoalAnd kthreatTo adjust the control coefficient of the reward function, r is guaranteed during the parameter setting>0;
In order to ensure good exploration, the action selection strategy needs to give consideration to exploration and application, the action selection strategy adopted by Q learning in the invention is a Boltzmann distribution method, and the expression of the action selection strategy is as follows:
Figure BDA0002787681360000071
T=λkT0
wherein ViRepresents a state s or a state-action pair (s,ai) Any of the value functions of p (a)i| s) represents action a under state siThe probability of selection is T as a control parameter, lambda is an attenuation coefficient, the smaller the value of lambda is, the faster the attenuation speed of the value of T is, 0<λ<1, k is iteration times;
each unmanned aerial vehicle path obtained through the Q learning algorithm has the characteristics of shorter path length and smaller threat intensity compared with the path obtained in a random generation mode.
Selecting a path as a candidate initial solution of a population from the unmanned aerial vehicle path solutions obtained by the Q learning algorithm, prescribing the number of path solutions of each unmanned aerial vehicle in different initial directions by using priori knowledge, and combining the initial solution obtained by the Q learning and the initial solution obtained by randomly generating path points, namely obtaining the initial population of the particle swarm algorithm, wherein the form of the initial solution of the population is a coordinate.
S4, optimizing the initial population obtained in the S3 by adopting an improved particle swarm optimization according to the optimization target constructed in the step S2;
aiming at the problem that a speed and position updating operator of a basic particle swarm algorithm is difficult to express a discrete domain such as coordinate-form-based collaborative path planning, therefore, a position and speed updating formula of the particle swarm algorithm is analyzed and an improvement strategy is proposed;
the updating formula of the position and the speed of the particle swarm algorithm is as follows:
Figure BDA0002787681360000072
wherein xi,j(t)=(xi,j(1),xi,j(2),Λ,xi,j(t)) is the position of the ith particle at the t-th iteration, pi,j,best(t)=(pi,j,best(1),pi,j,best(2),Λ,pi,j,best(3) G) is the optimum value of the particle in t iterationsi,j,best(t)=(gi,j,best(1),gi,j,best(2),Λ,gi,j,best(3) Is the global optimum of the particle in t iterations, w is the inertial weight, c1And c2Is a non-negative acceleration coefficient, f1As a function of the influence of the particle on itself, f2Is a particle xi,j(t) learning of the optimal value of its individual history, f3Is a particle xi,j(t) learning of its global historical optimum;
in an evolution operation F1Defined as a mutation operation, a particle i mutates itself with a probability of w, with a probability of 1-w remaining unchanged, expressed as:
Figure BDA0002787681360000081
the convergence of the algorithm is accelerated by adopting a linear time-varying inertia weight strategy:
Figure BDA0002787681360000082
w has a value of from wmaxLinearly decreasing to a final value wminT is the current iteration number of the algorithm, itermaxThe maximum number of iterations allowed for the algorithm to continue;
the mutation operation comprises the following specific steps:
s41, randomly selecting a point in the path coordinate, and removing the starting point and the end point;
s42, as shown in fig. 3, the next action of the unmanned aerial vehicle has 8 choices according to the point where the unmanned aerial vehicle is located, and if the original coordinate point in the path is 1, the coordinate point to be selected is 2 to 8, and there are 7 coordinate points to be selected. In order to reduce the search blindness, the distance D between each point i to be selected and the target point is calculatediCalculating the probability p that the point i is selectedi
Figure BDA0002787681360000083
Wherein k isiThe adjustment coefficient is determined according to the included angle between the action point to be selected and the target point, and particularly, when the point to be selected is an obstacle point, the probability of the point to be selected is 0;
s43, randomly generating oneA number p of 0 to 1, if p1+p2+Λpi-1+<p≤p1+p2+ΛpiSelecting the point i as the next action point;
s44, adding the current grid into a search tabu table to avoid repeated selection, and replacing the current grid serial number with the next grid serial number;
s45, if all the coordinate points to be selected are obstacle points, canceling the variation operation of the point, and directly jumping to S41 to reselect the variation coordinates;
s46, judging whether the new action point is continuous with the original path, if not, inserting a new non-obstacle coordinate point at the discontinuous position, and judging whether the path is continuous according to the following formula;
δ=max{abs(xi+1-xi),abs(yi+1-yi)}
wherein (x)i+1,yi+1) Is the coordinate point of the original path, (x)i,yi) If δ is 1, the path is continuous, and the mutation operation is finished; when delta>When 1, the path is discontinuous, new coordinates are directly inserted into the two coordinate points through an average value method, and the coordinate expression of the inserted points is as follows;
Figure BDA0002787681360000091
and judging whether the coordinates of the insertion point have obstacles or not, if so, sequentially selecting the coordinates in the order of the upper right part, the lower left part and the lower right part of the insertion point to judge whether the coordinates are the obstacles or not until the free coordinate point is selected, judging whether the inserted point is continuous with the original path point again, and if not, repeating the step S45 until the path is continuous.
F is to be2Defined as particles i and c1Is interleaved with its individual optimum values by 1-c1The probability of (c) remains unchanged, which is expressed as:
Figure BDA0002787681360000092
f is to be3Defined as particles i and c2Is interleaved with its global optimum by 1-c2The probability of (c) remains unchanged, which is expressed as:
Figure BDA0002787681360000093
step F of cross operation of particles in population and globally optimal particles2Similarly;
and obtaining the optimal particles of each sub-population through the evolution operation, judging whether the optimal particles meet the cycle ending condition, if so, storing the optimal particles of each sub-population, terminating the evolution operation, and if not, re-performing the evolution operation.
S401, judging whether the particle i and the individual optimal particle have the same coordinate point, if so, carrying out the next step, otherwise, ending the mutation operation;
s402, if there is only one identical point, exchanging coordinates of a section from the point to the end point of the individual optimal particle with the particle i, as shown in fig. 4, if there are two or more identical points, randomly selecting two points, and exchanging a section between the two points of the individual optimal particle with the particle i, as shown in fig. 5.
And S5, obtaining the optimal particles of the population, and outputting the air routes of all unmanned aerial vehicles in the unmanned aerial vehicle cluster.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (7)

1. A multi-unmanned aerial vehicle collaborative path planning method is characterized by comprising the following steps:
s1, acquiring radar coordinates including a starting point position, an obstacle point and an existing radar in a map environment by the unmanned aerial vehicle cluster, and establishing a two-dimensional grid map of a battle environment;
s2, constructing an optimization target of the multiple unmanned aerial vehicles according to the two-dimensional grid map established in the S1 through the path length of each unmanned aerial vehicle and the threat strength;
s3, initializing the population by combining random generation path points and Q learning to obtain an initial population of each unmanned aerial vehicle path;
s4, optimizing the initial population obtained in the S3 by adopting an improved particle swarm optimization according to the optimization target constructed in the step S2;
and S5, obtaining the optimal particles of the population, and outputting the air routes of all unmanned aerial vehicles in the unmanned aerial vehicle cluster.
2. The method for planning the collaborative path for multiple unmanned aerial vehicles according to claim 1, wherein the S1 is specifically as follows:
the path points are represented in a form of coordinates in a two-dimensional grid graph, and the flight environment of the unmanned aerial vehicle is described as follows: { (x, y) | xmin≤x≤xmax,ymin≤y≤ymax};xmin、xmax、yminAnd ymaxRepresented as the boundary of the drone flight;
dividing the flight environment into 20 × 20 grids with the same area, wherein each grid carries different parameter information of 0, -1, 1 and 2, and when the grid parameter is 0, the grid parameter represents the position information of a starting point; when the grid parameter is 1, the area is free of obstacles, when the grid parameter is-1, the area contains obstacles or threat sources, and when the grid parameter is 2, the area represents the position information of the end point.
3. The method for planning the collaborative path for the multiple drones according to claim 1, wherein the specific method for constructing the optimization target for the multiple drones in S2 is as follows:
number of steps f taken by i unmanned aerial vehiclesstepStep is taken as the path length, and the constraint of the turning angle of the unmanned aerial vehicle is considered, if the unmanned aerial vehicle is in the pathIf there is an oblique flight, a penalty f is imposedpenaltyAnd obtaining the path length expression of the single unmanned aerial vehicle:
fi=fstep+fpenalty
the specific calculation formula of the path comprehensive weighting length of the multiple unmanned aerial vehicles is as follows:
Figure FDA0002787681350000021
wherein m is the number of unmanned aerial vehicles;
dividing a complete path from a starting point to a target point into three segments, and taking each threat point to 1/4, 2/4 and 3/4 of each segment for summation to obtain the threat intensity of the threat point j to the unmanned aerial vehicle i as represented by
Figure FDA0002787681350000022
Where t is the three parts of the path, itFor the t-th segment of the drone path,
Figure FDA0002787681350000023
and
Figure FDA0002787681350000024
for the fourth power of the distance from threat point j to the t-th paths 1/4, 2/4, and 3/4, K is the adjustment factor, and the threat strength of a single drone under the path is:
Figure FDA0002787681350000025
wherein N is the number of threat points, and the comprehensive threat intensity that the unmanned aerial vehicle received of definition is:
Figure FDA0002787681350000026
build the overall optimization objective of multiple unmanned aerial vehiclesMarking f: f ═ w1*f1+w2*f2Wherein w is1And w2Respectively, the total weighted path length and the weight of the total weighted threat strength.
4. The method for planning the collaborative path for the multiple unmanned aerial vehicles according to claim 1, wherein the specific method for obtaining the initial population in S3 is as follows:
firstly, obtaining a part of initial population by adopting a random path point generation mode, then obtaining a part of initial population by adopting Q learning, and constructing a feasible point reward function based on a sparse function in the Q learning:
Figure FDA0002787681350000031
wherein r is0Based on the prize value, rmaxTo take into account the maximum reward value at the time of threat, rthreatAs a threat coefficient, xUAVAnd yUAVCoordinates for unmanned aerial vehicles, xgoalAnd ygoalCoordinate of end point, xithreatAnd yithreatAs coordinates of each threat source, kgoalAnd kthreatTo adjust the control coefficient of the reward function, r is guaranteed during the parameter setting>0;
The action selection strategy adopted in Q learning is a Boltzmann distribution method, and the expression of the action selection strategy is as follows:
Figure FDA0002787681350000032
T=λkT0
wherein ViRepresenting state s or state-action pairs (s, a)i) Any of the value functions of p (a)i| s) represents action a under state siThe probability of selection is T as a control parameter, lambda is an attenuation coefficient, the smaller the value of lambda is, the faster the attenuation speed of the value of T is, 0<λ<1, k is iteration times;
selecting a path as a candidate initial solution of a population in the unmanned aerial vehicle path solutions obtained by the Q learning algorithm, prescribing the number of path solutions of each unmanned aerial vehicle in different initial directions by using prior knowledge, combining the initial population obtained by the Q learning and the initial population obtained by randomly generating path points, namely the initial population of the particle swarm algorithm, wherein the form of the initial solution of the population is a coordinate.
5. The method for planning the collaborative path of the multiple unmanned aerial vehicles according to claim 1, wherein an initial population obtained in step S3 is optimized by using an improved particle swarm algorithm according to the optimization target constructed in step S2, and the update formula of the position and the speed of the particle swarm algorithm is as follows:
Figure FDA0002787681350000033
wherein xi,j(t)=(xi,j(1),xi,j(2),Λ,xi,j(t)) is the position of the ith particle at the t-th iteration, pi,j,best(t)=(pi,j,best(1),pi,j,best(2),Λ,pi,j,best(3) G) is the optimum value of the particle in t iterationsi,j,best(t)=(gi,j,best(1),gi,j,best(2),Λ,gi,j,best(3) Is the global optimum of the particle in t iterations, w is the inertial weight, c1And c2Is a non-negative acceleration coefficient, f1As a function of the influence of the particle on itself, f2Is a particle xi,j(t) learning of the optimal value of its individual history, f3Is a particle xi,j(t) learning of its global historical optimum;
in an evolution operation F1Defined as a mutation operation, a particle i mutates itself with a probability of w, with a probability of 1-w remaining unchanged, expressed as:
Figure FDA0002787681350000041
the convergence of the algorithm is accelerated by adopting a linear time-varying inertia weight strategy:
Figure FDA0002787681350000042
w has a value of from wmaxLinearly decreasing to a final value wminT is the current iteration number of the algorithm, itermaxThe maximum number of iterations allowed for the algorithm to continue;
f is to be2Defined as particles i and c1Is interleaved with its individual optimum values by 1-c1The probability of (c) remains unchanged, which is expressed as:
Figure FDA0002787681350000043
f is to be3Defined as particles i and c2Is interleaved with its global optimum by 1-c2The probability of (c) remains unchanged, which is expressed as:
Figure FDA0002787681350000044
particle and global optimum particle cross operation step and F2Similarly;
and obtaining the optimal particles of each sub-population through the evolution operation, judging whether the optimal particles meet the cycle ending condition, if so, storing the optimal particles of each sub-population, terminating the evolution operation, and if not, re-performing the evolution operation.
6. The method for planning the collaborative path of multiple unmanned aerial vehicles according to claim 5, wherein the mutation operation comprises the following steps:
s41, randomly selecting a point in the path coordinates;
s42, calculating the distance D between each point i to be selected and the target pointiCalculating the probability p that the point i is selectedi
Figure FDA0002787681350000045
Wherein k isiAn adjustment coefficient determined according to prior knowledge, in particular, when the point to be selected is an obstacle point, the probability is 0;
s43, randomly generating a number p of 0-1, if p1+p2+Λpi-1+<p≤p1+p2+ΛpiSelecting the point i as the next action point;
s44, adding the current grid into a search tabu table to avoid repeated selection, and replacing the current grid serial number with the next grid serial number;
s45, if all the coordinate points to be selected are obstacle points, canceling the variation operation of the point, and directly jumping to S41 to reselect the variation coordinates;
s46, judging whether the new action point is continuous with the original path, if not, inserting a new non-obstacle coordinate point at the discontinuous position, and judging whether the path is continuous according to the following formula;
δ=max{abs(xi+1-xi),abs(yi+1-yi)}
wherein (x)i+1,yi+1) Is the coordinate point of the original path, (x)i,yi) If δ is 1, the path is continuous, and the mutation operation is finished; when delta>When 1, the path is discontinuous, new coordinates are directly inserted into the two coordinate points through an average value method, and the coordinate expression of the inserted points is as follows;
Figure FDA0002787681350000051
and judging whether the coordinates of the insertion point have obstacles or not, if so, sequentially selecting the coordinates in the order of the upper right part, the lower left part and the lower right part of the insertion point to judge whether the coordinates are the obstacles or not until the free coordinate point is selected, judging whether the inserted point is continuous with the original path point again, and if not, repeating the step S45 until the path is continuous.
7. The method of claim 5, wherein the particle i is expressed as c1The specific steps of the cross operation of the probability and the individual optimal particles are as follows:
s401, judging whether the particle i and the individual optimal particle have the same coordinate point, if so, carrying out the next step, otherwise, ending the mutation operation;
s402, if only one identical point exists, the coordinates of the interval from the point to the end point of the individual optimal particle are exchanged with the particle i, if two or more identical points exist, two points are randomly selected, and the interval between the two points of the individual optimal particle is exchanged with the particle i.
CN202011303855.3A 2020-11-19 2020-11-19 Multi-unmanned aerial vehicle collaborative path planning method Active CN112327923B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011303855.3A CN112327923B (en) 2020-11-19 2020-11-19 Multi-unmanned aerial vehicle collaborative path planning method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011303855.3A CN112327923B (en) 2020-11-19 2020-11-19 Multi-unmanned aerial vehicle collaborative path planning method

Publications (2)

Publication Number Publication Date
CN112327923A true CN112327923A (en) 2021-02-05
CN112327923B CN112327923B (en) 2022-04-01

Family

ID=74321588

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011303855.3A Active CN112327923B (en) 2020-11-19 2020-11-19 Multi-unmanned aerial vehicle collaborative path planning method

Country Status (1)

Country Link
CN (1) CN112327923B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113342008A (en) * 2021-05-21 2021-09-03 武汉理工大学 Path planning system and method for tracking sea-air cooperative underwater target
CN113421171A (en) * 2021-03-12 2021-09-21 国网山东省电力公司临沂供电公司 MOEA/D evolution multi-objective optimization-based path navigation system and method
CN113639750A (en) * 2021-07-20 2021-11-12 中国地质大学(武汉) Unmanned aerial vehicle monitoring path planning method and device considering time-varying demand during peak period
CN114138005A (en) * 2021-10-26 2022-03-04 南京航空航天大学 Urban mass logistics unmanned aerial vehicle track planning method and device based on improved A-star algorithm
CN114326792A (en) * 2021-12-09 2022-04-12 中国电子科技集团公司第五十四研究所 Unmanned aerial vehicle collaborative track control method based on mixed group intelligent algorithm
CN115220478A (en) * 2022-08-16 2022-10-21 哈尔滨逐宇航天科技有限责任公司 Aircraft path planning method based on cluster evolution reinforcement learning
CN115308233A (en) * 2022-09-29 2022-11-08 中国科学院过程工程研究所 Radiographic imaging apparatus, radiographic imaging method, and storage medium
CN116088396A (en) * 2023-03-06 2023-05-09 北京理工大学 Unmanned cluster double-layer cooperative task control method and system
WO2023142316A1 (en) * 2022-01-25 2023-08-03 南方科技大学 Flight decision generation method and apparatus, computer device, and storage medium
CN116542468A (en) * 2023-05-06 2023-08-04 中国人民解放军32370部队 Unmanned aerial vehicle cluster task planning method
CN117494919A (en) * 2023-11-13 2024-02-02 广州力生机器人技术有限公司 Path planning method and device based on multi-robot collaborative stacking operation
CN115220478B (en) * 2022-08-16 2024-05-31 哈尔滨逐宇航天科技有限责任公司 Aircraft path planning method based on cluster evolution reinforcement learning

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036332A (en) * 2014-06-17 2014-09-10 中国地质大学(武汉) Average gradient value and improved multi-objective particle swarm optimization based robust optimization system
CN105700549A (en) * 2016-01-21 2016-06-22 北京理工大学 Unmanned plane multi-track planning method based on sequence ecological niche PSO (particle swarm optimization) algorithm
CN107479575A (en) * 2017-08-25 2017-12-15 中国地质大学(武汉) A kind of multi-rotor unmanned aerial vehicle flight control method and system
CN108021144A (en) * 2017-12-29 2018-05-11 中国地质大学(武汉) A kind of unmanned aerial vehicle flight path planning and dynamic threats evade emulator
CN108153331A (en) * 2017-12-29 2018-06-12 中国地质大学(武汉) Unmanned plane dynamic threats bypassing method, equipment and storage device based on Kalman filter
US20180308371A1 (en) * 2017-04-19 2018-10-25 Beihang University Joint search method for uav multiobjective path planning in urban low altitude environment
US20190086938A1 (en) * 2015-07-27 2019-03-21 Genghiscomm Holdings, LLC Airborne Relays in Cooperative-MIMO Systems
CN109992000A (en) * 2019-04-04 2019-07-09 北京航空航天大学 A kind of multiple no-manned plane path collaborative planning method and device based on Hierarchical reinforcement learning
CN110544296A (en) * 2019-07-31 2019-12-06 中国矿业大学 intelligent planning method for three-dimensional global flight path of unmanned aerial vehicle in environment with uncertain enemy threat

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036332A (en) * 2014-06-17 2014-09-10 中国地质大学(武汉) Average gradient value and improved multi-objective particle swarm optimization based robust optimization system
US20190086938A1 (en) * 2015-07-27 2019-03-21 Genghiscomm Holdings, LLC Airborne Relays in Cooperative-MIMO Systems
CN105700549A (en) * 2016-01-21 2016-06-22 北京理工大学 Unmanned plane multi-track planning method based on sequence ecological niche PSO (particle swarm optimization) algorithm
US20180308371A1 (en) * 2017-04-19 2018-10-25 Beihang University Joint search method for uav multiobjective path planning in urban low altitude environment
CN107479575A (en) * 2017-08-25 2017-12-15 中国地质大学(武汉) A kind of multi-rotor unmanned aerial vehicle flight control method and system
CN108021144A (en) * 2017-12-29 2018-05-11 中国地质大学(武汉) A kind of unmanned aerial vehicle flight path planning and dynamic threats evade emulator
CN108153331A (en) * 2017-12-29 2018-06-12 中国地质大学(武汉) Unmanned plane dynamic threats bypassing method, equipment and storage device based on Kalman filter
CN109992000A (en) * 2019-04-04 2019-07-09 北京航空航天大学 A kind of multiple no-manned plane path collaborative planning method and device based on Hierarchical reinforcement learning
CN110544296A (en) * 2019-07-31 2019-12-06 中国矿业大学 intelligent planning method for three-dimensional global flight path of unmanned aerial vehicle in environment with uncertain enemy threat

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
WANG GUOSHI,等: "Multiple UAVs Routes Planning Based on particle swarm optimization Algorithm", 《IEEE》 *
凌富园,等: "基于不规则障碍物环境下无人机的改进几何路径规划算法", 《航空电子技术》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113421171A (en) * 2021-03-12 2021-09-21 国网山东省电力公司临沂供电公司 MOEA/D evolution multi-objective optimization-based path navigation system and method
CN113342008A (en) * 2021-05-21 2021-09-03 武汉理工大学 Path planning system and method for tracking sea-air cooperative underwater target
CN113342008B (en) * 2021-05-21 2023-12-26 武汉理工大学 Path planning system and method for sea-air cooperative underwater target tracking
CN113639750A (en) * 2021-07-20 2021-11-12 中国地质大学(武汉) Unmanned aerial vehicle monitoring path planning method and device considering time-varying demand during peak period
CN114138005B (en) * 2021-10-26 2023-10-13 南京航空航天大学 Urban mass logistics unmanned aerial vehicle flight path planning method and device based on improved A-algorithm
CN114138005A (en) * 2021-10-26 2022-03-04 南京航空航天大学 Urban mass logistics unmanned aerial vehicle track planning method and device based on improved A-star algorithm
CN114326792A (en) * 2021-12-09 2022-04-12 中国电子科技集团公司第五十四研究所 Unmanned aerial vehicle collaborative track control method based on mixed group intelligent algorithm
WO2023142316A1 (en) * 2022-01-25 2023-08-03 南方科技大学 Flight decision generation method and apparatus, computer device, and storage medium
CN115220478A (en) * 2022-08-16 2022-10-21 哈尔滨逐宇航天科技有限责任公司 Aircraft path planning method based on cluster evolution reinforcement learning
CN115220478B (en) * 2022-08-16 2024-05-31 哈尔滨逐宇航天科技有限责任公司 Aircraft path planning method based on cluster evolution reinforcement learning
CN115308233B (en) * 2022-09-29 2023-05-05 中国科学院过程工程研究所 Radiographic imaging apparatus, radiographic imaging method, and storage medium
CN115308233A (en) * 2022-09-29 2022-11-08 中国科学院过程工程研究所 Radiographic imaging apparatus, radiographic imaging method, and storage medium
CN116088396A (en) * 2023-03-06 2023-05-09 北京理工大学 Unmanned cluster double-layer cooperative task control method and system
CN116088396B (en) * 2023-03-06 2023-06-02 北京理工大学 Unmanned cluster double-layer cooperative task control method and system
CN116542468A (en) * 2023-05-06 2023-08-04 中国人民解放军32370部队 Unmanned aerial vehicle cluster task planning method
CN116542468B (en) * 2023-05-06 2023-10-20 中国人民解放军32370部队 Unmanned aerial vehicle cluster task planning method
CN117494919A (en) * 2023-11-13 2024-02-02 广州力生机器人技术有限公司 Path planning method and device based on multi-robot collaborative stacking operation
CN117494919B (en) * 2023-11-13 2024-04-19 广州力生机器人技术有限公司 Path planning method and device based on multi-robot collaborative stacking operation

Also Published As

Publication number Publication date
CN112327923B (en) 2022-04-01

Similar Documents

Publication Publication Date Title
CN112327923B (en) Multi-unmanned aerial vehicle collaborative path planning method
CN109945881B (en) Mobile robot path planning method based on ant colony algorithm
CN107515618B (en) Heterogeneous unmanned aerial vehicle cooperative task allocation method considering time window
CN107330560B (en) Heterogeneous aircraft multi-task cooperative allocation method considering time sequence constraint
CN113741508B (en) Unmanned aerial vehicle task allocation method based on improved wolf pack algorithm
CN110766254A (en) Multi-unmanned aerial vehicle cooperative task allocation method based on improved genetic algorithm
CN113159432A (en) Multi-agent path planning method based on deep reinforcement learning
CN111337931A (en) AUV target searching method
CN113917925A (en) Mobile robot path planning method based on improved genetic algorithm
CN115454067A (en) Path planning method based on fusion algorithm
Sallam et al. A hybrid differential evolution with cuckoo search for solving resource constrained project scheduling problems
Yuhang et al. An adaptive evolutionary multi-objective estimation of distribution algorithm and its application to multi-UAV path planning
CN116164753B (en) Mine unmanned vehicle path navigation method and device, computer equipment and storage medium
Li et al. Mobile Robot Navigation Algorithm Based on Ant Colony Algorithm with A* Heuristic Method
Zhou et al. A novel mission planning method for UAVs’ course of action
Wang et al. An improved NSGA-II algorithm for UAV path planning problems
CN115963731A (en) Command control system network structure optimization method based on improved genetic algorithm
CN113141272B (en) Network security situation analysis method based on iteration optimization RBF neural network
CN112148030B (en) Underwater glider path planning method based on heuristic algorithm
Li et al. Improved genetic algorithm for multi-agent task allocation with time windows
CN111796597A (en) Robot multi-target path optimization method based on novel firefly group optimization
Chittineni et al. A parallel hybridization of clonal selection with shuffled frog leaping algorithm for solving global optimization problems (P-AISFLA)
CN114510876B (en) Multi-platform weapon target distribution method based on symbiotic search biological geography optimization
CN113485119B (en) Heterogeneous homogeneous population coevolution method for improving swarm robot evolutionary capability
CN114020022B (en) Heterogeneous unmanned aerial vehicle collaborative hit task planning method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant