CN115202400A

CN115202400A - Unmanned aerial vehicle cluster task planning method based on self-adaptive penalty TAEA

Info

Publication number: CN115202400A
Application number: CN202210930554.6A
Authority: CN
Inventors: 郑纪彬; 张晓杰; 苏涛
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2022-08-04
Filing date: 2022-08-04
Publication date: 2022-10-18

Abstract

The invention provides an unmanned aerial vehicle (UAV) cluster task planning method based on an adaptive-penalty two-archive evolutionary algorithm (TAEA), comprising the following steps: constructing a UAV cluster cooperative search-and-tracking task planning scene; establishing a multi-objective, multi-constraint optimization problem model for searching by the UAV search formations; obtaining an optimal search task planning scheme; establishing a multi-objective, multi-constraint optimization problem model for searching and tracking by the UAV search formations; initializing the parameters of the adaptive-penalty TAEA; updating the convergence population and the diversity population; and obtaining the planning result of the UAV cluster cooperative search-and-tracking task. The method solves the UAV cluster search-and-tracking task planning problem with the adaptive-penalty TAEA, which improves task planning efficiency. After a UAV search formation finds a target, a new task planning problem model is established and optimized to obtain a globally optimal task planning scheme, which helps the UAV cluster perform to its full effectiveness.

Description

Unmanned aerial vehicle cluster task planning method based on self-adaptive penalty TAEA

Technical Field

The invention belongs to the technical field of UAV clusters and relates to a UAV cluster task planning method, in particular to a UAV cluster task planning method for dynamic scene environments based on an adaptive-penalty two-archive evolutionary algorithm (TAEA). It can be used in fields such as urban security, environmental monitoring, field search and rescue, and counter-terrorism surveillance.

Background

UAV cluster task planning refers to allocating suitable UAV formations to dynamic tasks and determining the order in which the formations execute those tasks. Depending on how the cluster is organized, UAV task allocation is mainly either centralized or distributed. In the centralized structure, all UAVs in the formation communicate through a single control center, which handles signal transmission and control. Each UAV sends the external information it acquires and its own state information to the central processing center; the center processes this information, makes decisions, forms control instructions, and sends them back to the UAVs in the cluster, thereby realizing task allocation planning for the cluster. In the distributed structure, all UAVs in a formation can communicate with one another.
Compared with a centralized system, a distributed UAV cluster system has the following advantages: each UAV is a computing node and can make decisions autonomously, which reduces the risk posed by damage to a central node and lets the cluster respond quickly to scene changes. However, when a distributed system has too many communication nodes, the communication volume becomes extremely large and severely challenges the system's communication capacity; moreover, only a locally optimal solution can be obtained rather than a globally optimal one, which reduces the efficiency with which the UAVs execute tasks.

According to the working scene of the UAV cluster, task planning methods can be divided into methods for dynamic scene environments and methods for static scene environments. The basic principle of planning in a dynamic scene environment is to redo the task allocation planning after the tasks change dynamically, so that the cluster can still complete its tasks efficiently; the key is to improve the efficiency of solving for a globally optimal task allocation planning scheme once a new task is found. For example, the patent application with publication No. CN111199360A, entitled "unmanned aerial vehicle task allocation planning method", handles a new task P that arises while the UAVs are executing tasks by dividing task allocation into two stages, pre-allocation and negotiation. In the pre-allocation stage, the central node selects a satisfaction set of UAVs able to execute the task according to an individual satisfaction function and an individual rejection function, and determines whether the set is empty. If the set is not empty, the UAVs in the set negotiate with one another to select the most suitable UAV to execute the new task; if the set is empty, several UAVs are combined to execute the task jointly. That method mainly addresses the excessive computation time of the central node during task allocation in the prior art, ensures real-time dynamic allocation of tasks and load balance among UAV tasks, and improves the overall performance of UAV task execution. However, it has the following defects:
1. After a new task appears, the new task planning scheme is obtained through the mutual negotiation mechanism of the UAVs; the negotiation process is time-consuming, so the efficiency of solving for the task planning scheme is low. 2. The mutual negotiation mechanism can only yield a locally optimal task planning scheme; when the cluster executes tasks according to that scheme it cannot obtain the maximum benefit at the minimum cost, so the cluster cannot exert its maximum effectiveness.

Disclosure of Invention

The invention aims to overcome the defects of the prior art by providing a UAV cluster task planning method based on an adaptive-penalty two-archive evolutionary algorithm (TAEA), which addresses the technical problem of low planning efficiency in the prior art.

In order to achieve the purpose, the technical scheme adopted by the invention comprises the following steps:

(1) Constructing an unmanned aerial vehicle cluster collaborative search tracking task planning scene:

Construct a UAV cluster cooperative search-and-tracking task planning scene comprising $M$ square grid task areas $T = \{T_1, T_2, \ldots, T_m, \ldots, T_M\}$ distributed on the ground, $K$ area targets $T' = \{T'_1, T'_2, \ldots, T'_k, \ldots, T'_K\}$ distributed within the grid task area $T$, $N$ UAV search formations $F = \{F_1, F_2, \ldots, F_n, \ldots, F_N\}$ distributed in the same horizontal plane in space, and a central processor $\Theta$. The center point coordinates, threat factor, and threat level parameter of each grid task area $T_m$ are $(x_m, y_m)$, $w_m$, and $\lambda_m$ respectively; the threat factor and threat level parameter of each target $T'_k$ are $w'_k$ and $\lambda'_k$; and the flight speeds within each UAV search formation $F_n$ all take the same value. All UAV search formations start executing the search task at time $t_0$, the maximum flight time is $T^{\max}$, the search time within each task area $T_m$ is $\phi$, the task planning time interval is $\tau$, and the number of task areas covered by each UAV search formation $F_n$ in one search task is $L$, where $M \ge 100$, $T_m$ denotes the $m$-th task area, $K \ge 1$, $T'_k$ denotes the $k$-th area target, $N \ge 2$, $F_n$ denotes the $n$-th UAV search formation, and $L \ge 2$;

(2) Establishing a multi-objective, multi-constraint optimization problem model for searching by the UAV search formations:

(2a) Take the assignment relationship between UAV search formation $F_n$ and grid task area $T_m$ as the allocation decision variable $\mathrm{assign}[m, n]$, and take the order in which each UAV search formation executes its set of grid task areas $T^n$ as the order decision variable $\mathrm{order}[n]$. The value of $\mathrm{assign}[m, n]$ determines the set of grid task areas $T^n$ assigned to each UAV search formation $F_n$; $\mathrm{order}[n]$ then sorts all grid task areas in $T^n$ to obtain the ordered set of grid task areas $T^{nL}$;

(2b) From the flight speed of each UAV search formation $F_n$ and the ordered set of grid task areas $T^{nL}$, compute the time at which $F_n$ completes task area $T^{nl}$;

(2c) Establish the multi-objective, multi-constraint optimization problem model $\Psi_1$ for task planning in which the UAV search formations $F$ search the grid task area $T$. The objective functions are $f_1$, minimizing the integral over the time interval $\tau$ of the sum $S_t$ of the threat values of all grid task areas, and $f_2$, minimizing the maximum time for the $N$ UAV search formations to complete the $L$ grid task areas assigned to each of them. The constraints are the safety constraint $c_1$ on the mutual distance $d_{t_i}(F_a, F_b)$ between UAV search formations at the sampling time points, the maximum flight time constraint $c_2$ of each UAV search formation $F_n$, and the threat value constraint $c_3$ of each grid task area $T_m$, wherein:

$\min f_1$

$\min f_2$

$c_3: s(m, t) \le S_{m,\max}$

where $s(m, t)$ denotes the threat value of each grid task area $T_m$ at time $t$, $e$ denotes the base of the natural logarithm, $t$ is the time variable with $t_0$ as the start time, $t'$ is a time offset, $d_{\min}$ denotes the minimum safe distance between UAV search formations, and $S_{m,\max}$ denotes the upper limit of the threat value of grid task area $T_m$;
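As a rough illustration of the decision variables in step (2a), the following Python sketch decodes an allocation and an ordering into one ordered route per formation. It assumes that `assign[m][n]` is a 0/1 matrix and that `order[n]` is a permutation of positions within formation n's assigned set; the text does not fix these representations, so both, along with the function name `decode_plan`, are hypothetical.

```python
def decode_plan(assign, order):
    """Turn assign[m][n] and order[n] into one ordered cell list per formation.

    assign[m][n] == 1 means grid cell m is assigned to formation n (assumed encoding).
    order[n] is a permutation of positions within formation n's assigned set (assumed).
    """
    num_cells = len(assign)
    num_formations = len(assign[0])
    routes = []
    for n in range(num_formations):
        # T^n: the cells assigned to formation n, in index order
        cells_n = [m for m in range(num_cells) if assign[m][n] == 1]
        # T^{nL}: the same cells rearranged into execution order
        routes.append([cells_n[i] for i in order[n]])
    return routes


# Four cells, two formations: cells 0 and 2 go to formation 0, cells 1 and 3 to formation 1.
assign = [[1, 0], [0, 1], [1, 0], [0, 1]]
order = [[1, 0], [0, 1]]
routes = decode_plan(assign, order)
print(routes)  # [[2, 0], [1, 3]]
```

Keeping allocation and ordering as separate variables lets an evolutionary operator change which formation visits a cell without disturbing the visiting order of the other formations.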

(3) Obtaining an optimal search task planning scheme:

Using decimal encoding of the allocation decision variable $\mathrm{assign}[m, n]$ and the order decision variable $\mathrm{order}[n]$, randomly generate a convergence population CA and a diversity population DA of initial individuals $x$. Solve the multi-objective, multi-constraint optimization problem model $\Psi_1$ with the constraint-oriented two-archive evolutionary algorithm by updating CA and DA, obtaining the convergence population $CA_0$ and the diversity population $DA_0$, and then select any one individual $\gamma_1$ from the Pareto front of $CA_0$ as the optimal search task planning scheme;
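Step (3) takes its result from the Pareto front of the convergence population. The sketch below shows the standard non-dominated filter for minimization objectives; it is a generic illustration rather than the patent's implementation, and the sample objective vectors are made up.

```python
def dominates(a, b):
    """True if objective vector a dominates b (all objectives minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(objs):
    """Indices of the objective vectors that no other vector dominates."""
    return [
        i for i, a in enumerate(objs)
        if not any(dominates(b, a) for j, b in enumerate(objs) if j != i)
    ]


# Hypothetical (f1, f2) values for four candidate plans.
objs = [(1.0, 5.0), (2.0, 3.0), (3.0, 4.0), (4.0, 1.0)]
front = pareto_front(objs)
print(front)  # [0, 1, 3]; (3.0, 4.0) is dominated by (2.0, 3.0)
```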

(4) Establishing a multi-objective, multi-constraint optimization problem model for searching and tracking by the UAV search formations:

(4a) The central processor $\Theta$ selects a UAV search formation $F_\mu$ to fly to grid task area $T_\nu$ and, together with UAV search formation $F_\eta$, form the UAV tracking formation $F''_{\mu\eta}$, which establishes tracking of the target $T'_k$ found when a search formation $F_n$ searched grid task area $T_\nu$ under search task planning scheme $\gamma_1$. For the remaining $N-2$ UAV search formations $F' = \{F'_1, F'_2, \ldots, F'_{n'}, \ldots, F'_{N-2}\}$, that is, all formations other than $F_\mu$ and $F_\eta$, the assigned grid task areas and their search order are determined by the decision variables $\mathrm{assign}[m, n']$ and $\mathrm{order}[n']$, where $\mu = \mathrm{sent}[n'']$, $\mathrm{sent}[n'']$ is the assignment decision variable, and $n'' \in \{1, \ldots, N\} \setminus \{\eta\}$;

(4b) Establish the multi-objective, multi-constraint optimization problem model $\Psi_2$ for task planning in which the UAV search formations $F'$ and the UAV tracking formation search and track within the grid task area $T$. The objective functions are $f'_1$, minimizing the integral over the time interval $\tau$ of the sum $S'_t$ of the threat values of all grid task areas; $f'_2$, minimizing the maximum time for the $N-2$ UAV search formations to complete the $L$ grid task areas assigned to each of them; and $f'_3$, minimizing the time for the tracking UAV formation $F_\mu$ to reach grid task area $T_\nu$. The constraints are the safety constraint $c'_1$ on the mutual distance between UAV search formations at the sampling time points, the maximum flight time constraint $c'_2$ of each UAV search formation, the threat value constraint $c'_3$ of each grid task area $T_m$, and the threat value constraint $c'_4$ of the found target $T'_k$, wherein:

$\min f'_1$

$\min f'_2$

$\min f'_3$

$c'_3: s(m, t) \le S_{m,\max}$

$c'_4: s(k, t) \le S_{k,\max}$

where $s(k, t)$ denotes the threat value of the found target $T'_k$ at time $t$, $t'$ is a time offset, and $S_{k,\max}$ denotes the upper limit of the threat value of the found target $T'_k$;

(5) Initializing the parameters of the adaptive-penalty two-archive evolutionary algorithm TAEA:

Take the convergence population $CA_0$ obtained in step (3) as the initial generation of the convergence population CA, and take randomly generated initial individuals $x$ as the initial generation of the diversity population DA; let the iteration count be $r$ and the maximum number of iterations be $R$, and set $r = 0$;

(6) Updating the convergence population and the diversity population:

(6a) Combine the $r$-th generation convergence population $CA_r$ and the $r$-th generation diversity population $DA_r$ to obtain the population $Hm_r$, and compute the proportions $Pc$ and $Pd$ of the non-dominated individuals of $Hm_r$ that belong to $CA_r$ and $DA_r$ respectively. Judge whether $Pc > Pd$ holds; if so, randomly select individuals from $CA_r$ as the population $P_{1r}$, otherwise randomly select individuals from $DA_r$ as the population $P_{1r}$. Compute the proportion $PC$ of non-dominated individuals in the population $CA_r$ and generate a vector $pf$ of random numbers. Judge whether the $f$-th value $pf(f) > PC$ holds; if so, randomly select one individual from $CA_r$ as the $f$-th individual of $P_{2r}$, otherwise randomly select one individual from $DA_r$ as the $f$-th individual of $P_{2r}$. $P_{1r}$ and $P_{2r}$ form the groups of parent individuals $P_r$. Apply uniform crossover to each group of parents in $P_r$ to obtain the groups $P_{ru}$; apply partially matched crossover to each group in $P_{ru}$ to obtain the groups $P_{rp}$; randomly select one individual from each group of $P_{rp}$ to form $P_{rhalf}$; apply uniform mutation to $P_{rhalf}$ to obtain $P_{rum}$; and apply insertion mutation to each individual of $P_{rum}$ to obtain the offspring population $Q_r$;

(6b) The $r$-th generation convergence population $CA_r$ and the offspring population $Q_r$ form the population $HC_r$, and the $r$-th generation diversity population $DA_r$ and the offspring population $Q_r$ form the population $HD_r$; the feasible and infeasible individuals of $HC_r$ form the populations $Fs_r$ and $Is_r$ respectively;

(6c) Update the convergence population $CA_r$ and the diversity population $DA_r$ through $HC_r$ and $HD_r$ respectively;
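The parent selection rule of step (6a) can be sketched as follows. For brevity each individual is represented only by its objective vector, the number of parent pairs is passed in as `num_pairs`, and $Pc$ and $Pd$ are normalized by the size of the merged population; all three choices are assumptions, since the counts are not given in the text. The comparison `pf[f] > PC` follows the wording of step (6a).

```python
import random


def dominates(a, b):
    """True if objective vector a dominates b (all objectives minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def nondominated(objs):
    """Indices of the non-dominated objective vectors in objs."""
    return [
        i for i, a in enumerate(objs)
        if not any(dominates(b, a) for j, b in enumerate(objs) if j != i)
    ]


def select_parents(ca, da, num_pairs, rng):
    """Return num_pairs (p1, p2) parent tuples drawn from CA_r and DA_r."""
    merged = ca + da                                        # Hm_r
    nd = nondominated(merged)
    pc = sum(1 for i in nd if i < len(ca)) / len(merged)    # non-dominated share from CA_r
    pd = sum(1 for i in nd if i >= len(ca)) / len(merged)   # non-dominated share from DA_r
    first_pool = ca if pc > pd else da
    p1 = [rng.choice(first_pool) for _ in range(num_pairs)]  # P_1r
    big_pc = len(nondominated(ca)) / len(ca)                # PC inside CA_r
    pf = [rng.random() for _ in range(num_pairs)]
    # P_2r: the f-th parent comes from CA_r when pf[f] > PC, otherwise from DA_r
    p2 = [rng.choice(ca) if pf[f] > big_pc else rng.choice(da) for f in range(num_pairs)]
    return list(zip(p1, p2))


ca = [(1.0, 1.0), (2.0, 2.0)]
da = [(3.0, 3.0), (4.0, 4.0)]
pairs = select_parents(ca, da, num_pairs=4, rng=random.Random(1))
```

In this toy input the only non-dominated individual of the merged population belongs to `ca`, so every first parent is drawn from `ca`.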

(7) Acquiring a planning result of a collaborative search tracking task of the unmanned aerial vehicle cluster:

Judge whether $r = R$ holds. If so, select one individual $\gamma_2$ from the Pareto front of the $r$-th generation convergence population $CA_r$ according to the weight method, and take the grid task areas assigned to the UAV search formations as determined by the values of its decision variables, the order in which the assigned grid task areas are searched, and the designated search UAV formation as the task planning scheme for UAV cluster search and tracking; otherwise, let $r = r + 1$ and return to step (6);

Compared with the prior art, the invention has the following advantages:

1. When solving the UAV cluster search-and-tracking task planning problem, the invention uses an adaptive-penalty two-archive evolutionary algorithm that exploits valuable infeasible individuals to accelerate population convergence and obtain a globally optimal task planning scheme. This overcomes the time-consuming negotiation process of the prior art and improves task planning efficiency.

2. After a UAV search formation finds a target, the invention establishes a new task planning problem model and optimizes it to obtain a globally optimal task planning scheme, which helps the UAV cluster perform to its full effectiveness.

Drawings

FIG. 1 is a flow chart of an implementation of the present invention.

FIG. 2 is a schematic diagram of the effective search range of a UAV search formation of the present invention.

FIG. 3 is a schematic diagram of the square grid task areas and the threat values of each grid task area at time t according to the present invention.

Detailed Description

The invention is described in further detail below with reference to the figures and specific examples.

Referring to fig. 1, the present invention includes the steps of:

Step 1) Constructing a UAV cluster cooperative search-and-tracking task planning scene:

Construct a UAV cluster cooperative search-and-tracking task planning scene comprising $M$ square grid task areas $T = \{T_1, T_2, \ldots, T_m, \ldots, T_M\}$ distributed on the ground, $K$ area targets $T' = \{T'_1, T'_2, \ldots, T'_k, \ldots, T'_K\}$ distributed within the grid task area $T$, $N$ UAV search formations $F = \{F_1, F_2, \ldots, F_n, \ldots, F_N\}$ distributed in the same horizontal plane in space, and a central processor $\Theta$. The center point coordinates, threat factor, and threat level parameter of each grid task area $T_m$ are $(x_m, y_m)$, $w_m$, and $\lambda_m$ respectively; the threat factor and threat level parameter of each target $T'_k$ are $w'_k$ and $\lambda'_k$; and the flight speeds within each UAV search formation $F_n$ all take the same value. All UAV search formations start executing the search task at time $t_0$, the maximum flight time is $T^{\max}$, the search time within each task area $T_m$ is $\phi$, the task planning time interval is $\tau$, and the number of task areas covered by each UAV search formation $F_n$ in one search task is $L$, where $M \ge 100$, $T_m$ denotes the $m$-th task area, $K \ge 1$, $T'_k$ denotes the $k$-th area target, $N \ge 2$, $F_n$ denotes the $n$-th UAV search formation, and $L \ge 2$.

In this example, $M = 400$, $K = 1$, $N = 5$, $w_m$ is a random number in the range $[1 \times 10^{-5}, 1 \times 10^{-4}]$, $\lambda_m$ is a random number in the range $[1 \times 10^{-11}, 1 \times 10^{-5}]$, $w'_k = 1 \times 10^{-1}$, $\lambda'_k = 1 \times 10^{-5}$, $t_0 = 0$, $T^{\max} = 1 \times 10^{4}$ seconds, $\phi = 10$ seconds, $\tau = 200$ seconds, and $L = 7$. Each grid task area $T_m$ is the square inscribed in the circular effective search range of a UAV search formation, and the side length of each grid area is 100 meters. As shown in FIG. 2, the dashed circle represents the effective search range of a UAV search formation; to simplify the scene, this range is represented by the gray square inscribed in the circle.
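The example scene can be set up in a few lines. The sketch below builds the 400 grid cells with the side length and parameter ranges quoted above; the row-major layout with the origin at one corner, the 20-cells-per-side arrangement of the 400 cells, and the names `GridCell` and `build_grid` are assumptions made for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class GridCell:
    x: float    # center x-coordinate in meters
    y: float    # center y-coordinate in meters
    w: float    # threat factor w_m
    lam: float  # threat level parameter lambda_m


def build_grid(cells_per_side=20, side_length=100.0, seed=0):
    """Create cells_per_side**2 square cells with random threat parameters."""
    rng = random.Random(seed)
    cells = []
    for row in range(cells_per_side):
        for col in range(cells_per_side):
            cells.append(GridCell(
                x=(col + 0.5) * side_length,       # assumed origin at the lower-left corner
                y=(row + 0.5) * side_length,
                w=rng.uniform(1e-5, 1e-4),         # range quoted in the example
                lam=rng.uniform(1e-11, 1e-5),      # range quoted in the example
            ))
    return cells


grid = build_grid()
print(len(grid))  # 400
```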

Step 2) Establishing a multi-objective, multi-constraint optimization problem model for searching by the UAV search formations:

Step 2a) Take the assignment relationship between UAV search formation $F_n$ and grid task area $T_m$ as the allocation decision variable $\mathrm{assign}[m, n]$, and take the order in which each UAV search formation executes its set of grid task areas $T^n$ as the order decision variable $\mathrm{order}[n]$. The value of $\mathrm{assign}[m, n]$ determines the set of grid task areas $T^n$ assigned to each UAV search formation $F_n$; $\mathrm{order}[n]$ then sorts all grid task areas in $T^n$ to obtain the ordered set of grid task areas $T^{nL}$;

Step 2b) From the flight speed of each UAV search formation $F_n$ and the ordered set of grid task areas $T^{nL}$, compute the time at which $F_n$ completes task area $T^{nl}$. The calculation uses $(x_{nl}, y_{nl})$ and $(x_{n(l+1)}, y_{n(l+1)})$, the center coordinates of two grid task areas adjacent in the execution order of $T^{nL}$;
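One plausible reading of step 2b) is that a formation flies in a straight line between the centers of consecutive grid task areas and spends the search time $\phi$ in each. The sketch below computes completion times under that assumption, and under the further assumption that the formation starts at the center of its first area at $t_0$; the speed of 10 m/s is a hypothetical value, not one taken from the example.

```python
import math


def completion_times(centers, speed, dwell, t0=0.0):
    """Completion time of each cell along an ordered route.

    centers: list of (x, y) cell centers in execution order.
    speed:   formation flight speed in m/s (hypothetical value in the demo).
    dwell:   search time spent inside each cell (phi).
    """
    times = []
    t = t0
    for l, (x, y) in enumerate(centers):
        if l > 0:
            prev_x, prev_y = centers[l - 1]
            t += math.hypot(x - prev_x, y - prev_y) / speed  # straight-line transit
        t += dwell                                           # search inside the cell
        times.append(t)
    return times


route = [(50.0, 50.0), (150.0, 50.0), (150.0, 250.0)]
times = completion_times(route, speed=10.0, dwell=10.0)
print(times)  # [10.0, 30.0, 60.0]
```

The last entry of `times` for each formation is what an objective such as $f_2$ would take the maximum over.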

Step 2c) Establish the multi-objective, multi-constraint optimization problem model $\Psi_1$ for task planning in which the UAV search formations $F$ search the grid task area $T$. The objective functions are $f_1$, minimizing the integral over the time interval $\tau$ of the sum $S_t$ of the threat values of all grid task areas, and $f_2$, minimizing the maximum time for the $N$ UAV search formations to complete the $L$ grid task areas assigned to each of them. The constraints are the safety constraint $c_1$ on the mutual distance $d_{t_i}(F_a, F_b)$ between UAV search formations at the sampling time points, the maximum flight time constraint $c_2$ of each UAV search formation $F_n$, and the threat value constraint $c_3$ of each grid task area $T_m$, wherein:

$\min f_1$

$\min f_2$

$c_3: s(m, t) \le S_{m,\max}$

where $s(m, t)$ denotes the threat value of each grid task area $T_m$ at time $t$, $e$ denotes the base of the natural logarithm, $t$ is the time variable with $t_0$ as the start time, $t'$ is a time offset, $d_{\min}$ denotes the minimum safe distance between UAV search formations, $S_{m,\max}$ denotes the upper limit of the threat value of grid task area $T_m$, $t_i$ is the sampling time series, $f_s$ is the time sampling rate, and the position of UAV search formation $F_n$ is evaluated at each time $t_i$. In this example, $d_{\min} = 20$ meters, $S_{m,\max} = 1 \times 10^{5}$, and $f_s = 1$. Each side of the grid task area $T$ contains 20 square grid task areas, as shown in FIG. 3, where $\alpha = 20$; a darker grid task area indicates a larger threat value.
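The safety constraint on formation spacing can be checked by sampling every formation's position at the times $t_i$ and comparing each pairwise distance with $d_{\min}$. The sketch below assumes the constraint has the form $d_{t_i}(F_a, F_b) \ge d_{\min}$ and measures a violation as the shortfall below $d_{\min}$; neither form is spelled out in the text, and the sample tracks are made up.

```python
import math
from itertools import combinations


def min_pairwise_distance(tracks):
    """Smallest distance between any two formations at any sampling time.

    tracks[n][i] is the (x, y) position of formation n at sampling time t_i.
    """
    num_samples = len(tracks[0])
    best = math.inf
    for a, b in combinations(range(len(tracks)), 2):
        for i in range(num_samples):
            (xa, ya), (xb, yb) = tracks[a][i], tracks[b][i]
            best = min(best, math.hypot(xa - xb, ya - yb))
    return best


def safety_violation(tracks, d_min=20.0):
    """Shortfall below d_min (assumed violation measure); 0.0 when satisfied."""
    return max(d_min - min_pairwise_distance(tracks), 0.0)


# Two formations sampled at two time points.
tracks = [
    [(0.0, 0.0), (10.0, 0.0)],
    [(0.0, 30.0), (10.0, 15.0)],
]
print(min_pairwise_distance(tracks))  # 15.0
print(safety_violation(tracks))       # 5.0
```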

Step 3) obtaining an optimal search task planning scheme:

Using decimal encoding of the allocation decision variable $\mathrm{assign}[m, n]$ and the order decision variable $\mathrm{order}[n]$, randomly generate a convergence population CA and a diversity population DA of initial individuals $x$. Solve the multi-objective, multi-constraint optimization problem model $\Psi_1$ with the constraint-oriented two-archive evolutionary algorithm by updating CA and DA, obtaining the convergence population $CA_0$ and the diversity population $DA_0$, and then select any one individual $\gamma_1$ from the Pareto front of $CA_0$ as the optimal search task planning scheme.

Step 4) Establishing a multi-objective, multi-constraint optimization problem model for searching and tracking by the UAV search formations:

Step 4a) The central processor $\Theta$ selects a UAV search formation $F_\mu$ to fly to grid task area $T_\nu$ and, together with UAV search formation $F_\eta$, form the UAV tracking formation $F''_{\mu\eta}$, which establishes tracking of the target $T'_k$ found when a search formation $F_n$ searched grid task area $T_\nu$ under search task planning scheme $\gamma_1$. For the remaining $N-2$ UAV search formations $F' = \{F'_1, F'_2, \ldots, F'_{n'}, \ldots, F'_{N-2}\}$, that is, all formations other than $F_\mu$ and $F_\eta$, the assigned grid task areas and their search order are determined by the decision variables $\mathrm{assign}[m, n']$ and $\mathrm{order}[n']$, where $\mu = \mathrm{sent}[n'']$, $\mathrm{sent}[n'']$ is the assignment decision variable, and $n'' \in \{1, \ldots, N\} \setminus \{\eta\}$;

Step 4b) Establish the multi-objective, multi-constraint optimization problem model $\Psi_2$ for task planning in which the UAV search formations $F'$ and the UAV tracking formation search and track within the grid task area $T$. The objective functions are $f'_1$, minimizing the integral over the time interval $\tau$ of the sum $S'_t$ of the threat values of all grid task areas; $f'_2$, minimizing the maximum time for the $N-2$ UAV search formations to complete the $L$ grid task areas assigned to each of them; and $f'_3$, minimizing the time for the tracking UAV formation $F_\mu$ to reach grid task area $T_\nu$. The constraints are the safety constraint $c'_1$ on the mutual distance between UAV search formations at the sampling time points, the maximum flight time constraint $c'_2$ of each UAV search formation, the threat value constraint $c'_3$ of each grid task area $T_m$, and the threat value constraint $c'_4$ of the found target $T'_k$, wherein:

$\min f'_1$

$\min f'_2$

$\min f'_3$

$c'_3: s(m, t) \le S_{m,\max}$

$c'_4: s(k, t) \le S_{k,\max}$

where $s(k, t)$ denotes the threat value of the found target $T'_k$ at time $t$, $t'$ is a time offset, $S_{k,\max}$ denotes the upper limit of the threat value of the found target $T'_k$, $S'_t$ denotes the sum of the threat values of all grid task areas and the found target $T'_k$ at time $t$, and the time for UAV formation $F_\mu$ to reach grid task area $T_\nu$ is the quantity minimized by $f'_3$. In this example, $S_{k,\max} = 1 \times 10^{7}$.

Step 5) Initializing the parameters of the adaptive-penalty two-archive evolutionary algorithm TAEA:

Take the convergence population $CA_0$ obtained in step 3) as the initial generation of the convergence population CA, and take randomly generated initial individuals $x$ as the initial generation of the diversity population DA; let the iteration count be $r$ and the maximum number of iterations be $R$, and set $r = 0$. In this example, $R = 300$.

Step 6) Updating the convergence population and the diversity population:

Step 6a) Combine the $r$-th generation convergence population $CA_r$ and the $r$-th generation diversity population $DA_r$ to obtain the population $Hm_r$, and compute the proportions $Pc$ and $Pd$ of the non-dominated individuals of $Hm_r$ that belong to $CA_r$ and $DA_r$ respectively. Judge whether $Pc > Pd$ holds; if so, randomly select individuals from $CA_r$ as $P_{1r}$, otherwise randomly select individuals from $DA_r$ as $P_{1r}$. Compute the proportion $PC$ of non-dominated individuals in the population $CA_r$ and generate a vector $pf$ of random numbers. Judge whether the $f$-th value $pf(f) > PC$ holds; if so, randomly select one individual from $CA_r$ as the $f$-th individual of $P_{2r}$, otherwise randomly select one individual from $DA_r$ as the $f$-th individual of $P_{2r}$. $P_{1r}$ and $P_{2r}$ form the groups of parent individuals $P_r$. Apply uniform crossover to each group of parents in $P_r$ to obtain the groups $P_{ru}$; apply partially matched crossover to each group in $P_{ru}$ to obtain the groups $P_{rp}$; randomly select one individual from each group of $P_{rp}$ to form $P_{rhalf}$; apply uniform mutation to $P_{rhalf}$ to obtain $P_{rum}$; and apply insertion mutation to each individual of $P_{rum}$ to obtain the offspring population $Q_r$;
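Three of the variation operators named in step 6a) are standard and can be sketched directly: uniform crossover, partially matched crossover (PMX), and insertion mutation. Which part of the chromosome each operator acts on is not stated in the text; a natural assumption, used here, is that uniform crossover acts on position-independent genes (such as allocation values), while PMX and insertion mutation act on permutation genes (such as the visiting order), because both preserve permutations.

```python
import random


def uniform_crossover(a, b, rng):
    """Swap each gene between the two parents with probability 0.5."""
    c1, c2 = list(a), list(b)
    for i in range(len(a)):
        if rng.random() < 0.5:
            c1[i], c2[i] = c2[i], c1[i]
    return c1, c2


def pmx(p1, p2, a, b):
    """Partially matched crossover: keep p1[a:b], fill the rest from p2."""
    child = [None] * len(p1)
    child[a:b] = p1[a:b]
    for i in range(a, b):
        gene = p2[i]
        if gene in child[a:b]:
            continue                      # already present via the copied segment
        pos = i
        while a <= pos < b:               # follow the mapping out of the segment
            pos = p2.index(p1[pos])
        child[pos] = gene
    for i in range(len(child)):
        if child[i] is None:
            child[i] = p2[i]
    return child


def insertion_mutation(perm, rng):
    """Remove one gene and reinsert it at another position."""
    out = list(perm)
    i, j = rng.sample(range(len(out)), 2)
    out.insert(j, out.pop(i))
    return out


child = pmx([1, 2, 3, 4, 5, 6, 7, 8], [3, 7, 5, 1, 6, 8, 2, 4], 3, 6)
print(child)  # [3, 7, 8, 4, 5, 6, 2, 1]
mutated = insertion_mutation(child, random.Random(0))
c1, c2 = uniform_crossover([0, 0, 0, 0], [1, 1, 1, 1], random.Random(0))
```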

Step 6b) The $r$-th generation convergence population $CA_r$ and the offspring population $Q_r$ form the population $HC_r$, and the $r$-th generation diversity population $DA_r$ and the offspring population $Q_r$ form the population $HD_r$; the feasible and infeasible individuals of $HC_r$ form the populations $Fs_r$ and $Is_r$ respectively;

Step 6c) Update the convergence population $CA_r$ through the populations $HC_r$ and $Is_r$, and update the diversity population $DA_r$ through $HD_r$. The update method is as follows: when [condition missing], $CA_r = X_r$; when [condition missing], $CA_r = S_r$;

where $X_r$ is the set of $N - |Fs_r|$ best individuals selected from the population $Is_r$ according to the two-objective optimization problem formed by each individual's constraint violation value and its constraint-based modified Chebyshev decomposition value, and consists of $X^{hr}$ and $\chi^{hr+1}$; $S_{lr}$ denotes the best individuals selected from the population $HC_r$ by the adaptive-penalty two-archive evolutionary algorithm TAEA; and $x^b$ denotes the best individual selected from the population $HD_r$ by the adaptive-penalty TAEA. They are expressed respectively as:

$X_r = \{X^{hr}, \chi^{hr+1}\}$

$x^b = \arg\min \{ g^{tch}(x \mid w^i, z^*) \}$

$v_3 = \max(s(m, t) - S_{m,\max}, 0)$

$v_4 = \max(s(k, t) - S_{k,\max}, 0)$

$X^{hr}$ is formed as follows: the individuals of the population $Is_r$ are divided into levels by fast non-dominated sorting on the two-objective optimization problem $\Gamma(x)$, and the individuals of the first $hr$ levels are taken, where $hr$ is the largest integer satisfying [condition missing]. $\chi^{hr+1}$ denotes the set of individuals of level $hr+1$ with the smaller normalized constraint violation. $S_{lr}$ is formed as follows: the individuals of the population $Fs_r$ are divided into levels by fast non-dominated sorting on the modified objective function values, and the individuals of the first $lr$ levels are taken, where $lr$ is the smallest value satisfying [condition missing]. The worst individuals $x^w$ of level $lr$ are selected by the penalty-based modified Chebyshev decomposition method $g^{mtch\text{-}penalty}(x \mid w^i, z^*)$. $X_{CAr}$ denotes a set of some individuals of $CA_r$ used as the reference set. The best individual $x^b$ is selected from $HD_r$ by $g^{mtch\text{-}penalty}(x \mid w^i, z^*)$. $\Gamma(x)$ denotes the constructed two-objective optimization problem comprising the constraint violation value and the adaptive-penalty-based modified Chebyshev decomposition value. The modified objective function value of an individual $x$ combines its normalized objective function values with the adaptive penalty values $p_1$, $p_2$, $p_3$, which are the adaptive penalty values of the objective functions $f_1$, $f_2$, $f_3$ respectively. A weighted average [operands missing] and the normalized values of the violation values $v_1$, $v_2$, $v_3$, $v_4$ are also used, and $r_f$ denotes the proportion of feasible individuals in the population;
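Two of the formulas above are simple enough to sketch: the violation values $v_3$ and $v_4$, and the selection $x^b = \arg\min g^{tch}(x \mid w^i, z^*)$. The code below uses the standard weighted Chebyshev (Tchebycheff) scalarization, $\max_j w_j |f_j(x) - z^*_j|$; the patent's modified and penalty-based variants are not recoverable from the text and are not reproduced here. All numeric inputs are made up.

```python
def violation(value, upper):
    """Amount by which a threat value exceeds its upper limit, as in v_3 and v_4."""
    return max(value - upper, 0.0)


def tchebycheff(f, w, z_star):
    """Standard weighted Chebyshev scalarization (assumed form of g^tch)."""
    return max(wj * abs(fj - zj) for fj, wj, zj in zip(f, w, z_star))


def best_individual(objs, w, z_star):
    """Index of the individual minimizing the scalarized value (x^b)."""
    return min(range(len(objs)), key=lambda i: tchebycheff(objs[i], w, z_star))


# Hypothetical objective vectors, weight vector w^i, and ideal point z*.
objs = [(1.0, 4.0), (2.0, 2.0), (4.0, 1.0)]
best = best_individual(objs, w=(0.5, 0.5), z_star=(0.0, 0.0))
print(best)                     # 1
print(violation(1.2e5, 1e5))    # 20000.0
print(violation(5e4, 1e5))      # 0.0
```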

Step 7) Obtaining the planning result of the UAV cluster cooperative search-and-tracking task:

Judge whether $r = R$ holds. If so, select one individual $\gamma_2$ from the Pareto front of the $r$-th generation convergence population $CA_r$ according to the weight method, and take the grid task areas assigned to the UAV search formations as determined by the values of its decision variables, the order in which the assigned grid task areas are searched, and the designated search UAV formation as the task planning scheme for UAV cluster search and tracking; otherwise, let $r = r + 1$ and return to step 6).
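The text does not define the "weight method" used to pick one individual from the Pareto front. A common interpretation, assumed here, is a weighted sum of min-max normalized objectives, with the lowest score winning. The weights and objective values below are hypothetical.

```python
def select_by_weights(front_objs, weights):
    """Index of the front member with the lowest weighted normalized score."""
    num_obj = len(weights)
    lo = [min(o[j] for o in front_objs) for j in range(num_obj)]
    hi = [max(o[j] for o in front_objs) for j in range(num_obj)]

    def score(o):
        total = 0.0
        for j in range(num_obj):
            span = hi[j] - lo[j]
            # An objective with no spread across the front contributes nothing.
            norm = (o[j] - lo[j]) / span if span > 0 else 0.0
            total += weights[j] * norm
        return total

    return min(range(len(front_objs)), key=lambda i: score(front_objs[i]))


# Hypothetical (f1', f2', f3') values of three Pareto-front individuals.
front_objs = [(1.0, 9.0, 5.0), (5.0, 5.0, 5.0), (9.0, 1.0, 5.0)]
chosen = select_by_weights(front_objs, weights=(0.6, 0.3, 0.1))
print(chosen)  # 0
```

Shifting weight toward the second objective, for example `weights=(0.2, 0.7, 0.1)`, moves the choice to the third individual.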

Claims

1. An unmanned aerial vehicle cluster task planning method based on the adaptive-penalty two-archive evolutionary algorithm (TAEA) in a dynamic scene environment, characterized by comprising the following steps:

(1) Construct an unmanned aerial vehicle cluster collaborative search and tracking task planning scene, wherein the scene comprises $M$ square grid task areas $T = \{T_1, T_2, \dots, T_m, \dots, T_M\}$ distributed on the ground, $K$ regional targets $T' = \{T'_1, T'_2, \dots, T'_k, \dots, T'_K\}$ distributed within the grid task areas $T$, $N$ unmanned aerial vehicle search formations $F = \{F_1, F_2, \dots, F_n, \dots, F_N\}$ distributed in the same horizontal plane in space, and a central processor $\Theta$; the center point coordinates, threat factor and threat level parameter of each grid task area $T_m$ are $(x_m, y_m)$, $w_m$ and $\lambda_m$ respectively; the threat factor and threat level parameter of each target $T'_k$ are $w'_k$ and $\lambda'_k$ respectively; the flight speed of each unmanned aerial vehicle search formation $F_n$ is

the start time at which all unmanned aerial vehicle search formations begin executing the search task is $t_0$, the maximum flight time is $T^{\max}$, the search time within each task area $T_m$ is $\phi$, the task planning time interval is $\tau$, and the number of task areas covered by each unmanned aerial vehicle search formation $F_n$ in executing one search task is $L$, where $M \ge 100$, $T_m$ denotes the $m$-th task area, $K \ge 1$, $T'_k$ denotes the $k$-th regional target, $N \ge 2$, $F_n$ denotes the $n$-th unmanned aerial vehicle search formation, and $L \ge 2$;

(2) Establishing a multi-target multi-constraint optimization problem model for unmanned aerial vehicle search formation search task allocation planning:

(2a) Take the allocation relation between unmanned aerial vehicle search formation $F_n$ and grid task area $T_m$ as the allocation decision variable assign[m, n], and determine from the values of assign[m, n] the set $T^n$ of grid task areas allocated to each unmanned aerial vehicle search formation $F_n$; take the order in which the unmanned aerial vehicle search formation visits the task area set $T^n$ as the order decision variable order[n], and then sort all allocated grid task areas in $T^n$ according to order[n] to obtain the ordered grid task area set $T^{nL}$;

(2b) From the flight speed of each unmanned aerial vehicle search formation $F_n$

(2c) Establish the task planning multi-objective multi-constraint optimization problem model $\Psi_1$ for $F$ searching the grid task areas $T$, taking as objective functions the minimization of the integral $f_1$ over time of the sum $S_t$ of the threat values of all grid task areas and the minimization of the maximum time $f_2$ for the $N$ unmanned aerial vehicle search formations to complete the $L$ grid task areas allocated to each of them, and taking as constraints the safety constraint $c_1$ on the mutual distance $d_{t_i}(F_a, F_b)$ between unmanned aerial vehicle search formations at the sampling time points, the maximum flight time constraint $c_2$ of each unmanned aerial vehicle search formation $F_n$, and the threat value constraint $c_3$ of each grid task area $T_m$, wherein:

$\min f_1$

$\min f_2$

$c_3: s(m,t) \le S_{m,\max}$

where $s(m,t)$ denotes the threat value of each grid task area $T_m$ at time $t$, $e$ denotes the base of the natural logarithm, $t$ is the time variable with start time $t_0$, $t'$ is a time offset, $d_{\min}$ denotes the minimum safe distance between unmanned aerial vehicle search formations, and $S_{m,\max}$ denotes the upper limit of the threat value of grid task area $T_m$;

(3) Obtaining an optimal search task allocation planning scheme:

Using decimal encoding of the allocation decision variable assign[m, n] and the order decision variable order[n], randomly generate a convergence population CA and a diversity population DA, each containing

initial individuals $x$; solve the multi-objective multi-constraint optimization problem model $\Psi_1$ with the constraint-oriented two-archive evolutionary algorithm by updating CA and DA, obtaining the convergence population $CA_0$ and the diversity population $DA_0$; then arbitrarily select one individual $\Upsilon_1$ from the Pareto front of $CA_0$ as the optimal search task planning scheme;

(4) Establishing a multi-target multi-constraint optimization problem model for searching and tracking task allocation planning of unmanned aerial vehicle search formation:

(4a) The central processor $\Theta$ selects unmanned aerial vehicle search formation $F_\mu$ to fly to grid task area $T_\nu$ and, together with unmanned aerial vehicle search formation $F_\eta$, form the unmanned aerial vehicle tracking formation $F_{\mu\eta}$, which establishes tracking of the target $T'_k$ found by $F_n$ when searching grid task area $T_\nu$ under search task planning scheme $\Upsilon_1$; the remaining $N-2$ unmanned aerial vehicle search formations $F'$, other than $F_\mu$ and $F_\eta$, determine the grid task areas allocated to them and their search order according to the decision variables assign[m, n'] and order[n'], where $F' = \{F'_1, F'_2, \dots, F'_{n'}, \dots, F'_{N-2}\}$, $\mu = \text{sent}[n'']$, sent[n''] is the designation decision variable, and $n'' \in \{1, \dots, N\} \setminus \eta$;

(4b) Establish the task allocation planning multi-objective multi-constraint optimization problem model $\Psi_2$ for the unmanned aerial vehicle search formations $F'$ and the unmanned aerial vehicle tracking formation performing search and tracking in the grid task areas $T$, taking as objective functions the minimization of the integral $f'_1$ over the time $\tau$ of the sum $S'_t$ of the threat values of all grid task areas and the discovered target $T'_k$, the minimization of the maximum time $f'_2$ for the $N-2$ unmanned aerial vehicle search formations to complete the $L$ grid task areas allocated to each of them, and the minimization of the time $f'_3$ for unmanned aerial vehicle search formation $F_\mu$ to reach grid task area $T_\nu$, and taking as constraints the maximum flight time constraint $c'_2$ of each unmanned aerial vehicle search formation, the threat value constraint $c'_3$ of each grid task area $T_m$, the threat value constraint $c'_4$ of the discovered target $T'_k$, and the safety constraint $c'_1$ on the mutual distance $d'_{t_i}(F'_a, F'_b)$ between unmanned aerial vehicle search formations at the sampling time points and on the mutual distance of the unmanned aerial vehicle formations at the sampling time points

wherein:

$\min f'_1$

$\min f'_2$

$\min f'_3$

$c'_3: s(m,t) \le S_{m,\max}$

$c'_4: s(k,t) \le S_{k,\max}$

(6) Updating the convergence population and the diversity population:

(6a) Combine the $r$-th generation convergence population $CA_r$ and the $r$-th generation diversity population $DA_r$ to obtain population $Hm_r$; compute the proportions $Pc$ and $Pd$ of the non-dominated individuals of $Hm_r$ that belong to $CA_r$ and $DA_r$ respectively, and judge whether $Pc > Pd$ holds; if so, randomly select from $CA_r$

individuals as population $P_{1r}$; otherwise, randomly select from $DA_r$

individuals as population $P_{1r}$; compute the proportion $PC$ of non-dominated individuals in population $CA_r$ and generate a vector $pf$ containing

random numbers; for the $f$-th value $pf(f)$ of the vector $pf$, judge whether $pf(f) > PC$ holds; if so, randomly select one individual from $CA_r$ as the $f$-th individual of $P_{2r}$; otherwise, randomly select one individual from $DA_r$ as the $f$-th individual of $P_{2r}$; $P_{1r}$ and $P_{2r}$ form

groups of parent individuals $P_r$; perform a uniform crossover operation on each group of parent individuals in $P_r$ to obtain

groups $P_{rp}$; randomly select one individual from each group of individuals in $P_{rp}$ to form $P_{rhalf}$; then perform uniform mutation on $P_{rhalf}$ to obtain $P_{rum}$; and perform an insertion mutation operation on each individual in $P_{rum}$ to obtain the offspring population $Q_r$;

(6b) The $r$-th generation convergence population $CA_r$ and the offspring population $Q_r$ form population $HC_r$; the $r$-th generation diversity population $DA_r$ and the offspring population $Q_r$ form population $HD_r$; the feasible individuals and the infeasible individuals of population $HC_r$ form population $Fs_r$ and population $Is_r$ respectively;

(6c) Update the convergence population $CA_r$ through populations $HC_r$, $Fs_r$ and $Is_r$, and update the diversity population $DA_r$ through $HD_r$;

(7) Acquiring an unmanned aerial vehicle cluster collaborative search tracking task allocation planning result:

Judge whether $r = R$ holds. If so, arbitrarily select one individual $\Upsilon_2$ from the Pareto front of the $r$-th generation convergence population $CA_r$, and take the order of the grid task areas allocated to the unmanned aerial vehicle search formations and the designated unmanned aerial vehicle search formation, as determined by the values of its decision variables, as the task allocation planning scheme for unmanned aerial vehicle cluster search and tracking; otherwise, set $r = r + 1$ and execute step (6).
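The variation operators named in step (6a), uniform crossover and insertion mutation, can be sketched in Python as follows. This is only an illustrative reading: the claim does not specify the swap probability (0.5 is assumed here), how crossover is kept valid on the permutation-valued order[n] genes (no repair step is shown), or the details of the intermediate uniform mutation (omitted). The function names and the `rng` parameter are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def uniform_crossover(p1: Sequence[int], p2: Sequence[int],
                      rng: random.Random) -> Tuple[List[int], List[int]]:
    """Exchange each gene between the two parents with probability 0.5 (assumed)."""
    c1, c2 = list(p1), list(p2)
    for i in range(len(c1)):
        if rng.random() < 0.5:
            c1[i], c2[i] = c2[i], c1[i]
    return c1, c2


def insertion_mutation(order: Sequence[int], rng: random.Random) -> List[int]:
    """Remove one element of a visiting order and reinsert it at a random position."""
    child = list(order)
    if len(child) < 2:
        return child
    gene = child.pop(rng.randrange(len(child)))
    # After the pop there are len(child) + 1 possible insertion slots.
    child.insert(rng.randrange(len(child) + 1), gene)
    return child
```

Insertion mutation keeps the order genes a valid permutation, which is one reason it suits the order[n] part of the encoding.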

2. The method according to claim 1, wherein the grid task area set $T^n$ and the ordered grid task area set $T^{nL}$ in step (2a) are acquired as follows:

(2a1) Randomly assign a binary value to the allocation decision variable assign[m, n], where assign[m, n] = 1 indicates that unmanned aerial vehicle search formation $F_n$ and grid task area $T_m$ have an allocation relation and assign[m, n] = 0 indicates that $F_n$ and $T_m$ have no allocation relation; all grid task areas having an allocation relation with $F_n$ then form the grid task area set $T^n$ of $F_n$;

(2a2) Randomly assign to the order decision variable order[n] $L$ non-repeating integers in $[1, L]$, and sort, according to order[n], the sequence numbers of all grid task areas in $T^n$

then use the sorted set of sequence numbers

to determine the order in which $F_n$ executes the grid task areas of its allocated set $T^n$, obtaining the set $T^{nL}$ arranged in execution order.
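The decoding in steps (2a1) and (2a2) can be illustrated with a short Python sketch. It assumes 0-based indices, an `assign` matrix stored as a list of rows (one row per grid task area), and that the i-th entry of order[n] gives the 1-based visiting position of the i-th allocated area; the claim's exact sorting rule is not reproduced in this text, so this interpretation and the name `decode_formation` are hypothetical.

```python
from typing import List, Tuple


def decode_formation(assign: List[List[int]], order: List[List[int]],
                     n: int) -> Tuple[List[int], List[int]]:
    """Return (T^n, T^nL) for formation n as lists of grid-area indices.

    assign[m][n] == 1 means grid task area m is allocated to formation n.
    order[n][i] is assumed to be the visiting position (1..L) of the
    i-th allocated area.
    """
    allocated = [m for m, row in enumerate(assign) if row[n] == 1]
    if sorted(order[n]) != list(range(1, len(allocated) + 1)):
        raise ValueError("order[n] must be a permutation of 1..L")
    ranked = sorted(zip(order[n], allocated))  # sort by visiting position
    return allocated, [m for _, m in ranked]
```

For example, with areas 0, 2 and 4 allocated to formation 0 and order[0] = [2, 3, 1], the execution order is area 4, then area 0, then area 2.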

3. The method of claim 2, wherein in step (2b) the time at which unmanned aerial vehicle search formation $F_n$ completes task area $T^{nl}$,

is calculated by the following formula:

where $(x_{nl}, y_{nl})$ and $(x_{n(l+1)}, y_{n(l+1)})$ are respectively the coordinates of two grid task areas of $T^{nL}$ that are adjacent in execution order.
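The completion-time formula itself is not reproduced in this text, so the following Python sketch is only one plausible reading built from the quantities the claims define: straight-line flight between the centers of consecutive areas at the formation's flight speed, plus the search time $\phi$ spent in each area, starting at $t_0$. It further assumes the formation starts at the first area of its ordered set. The name `completion_times` and all parameter names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def completion_times(centers: Sequence[Tuple[float, float]], speed: float,
                     t0: float, phi: float) -> List[float]:
    """Cumulative time at which each area of the ordered set is finished.

    Assumptions: Euclidean travel between consecutive centers, constant
    speed, search time phi per area, formation already at the first area.
    """
    times = []
    t = t0
    for l, (x, y) in enumerate(centers):
        if l > 0:
            px, py = centers[l - 1]
            t += math.hypot(x - px, y - py) / speed  # travel time
        t += phi                                     # search time in the area
        times.append(t)
    return times
```

Under these assumptions, the objective $f_2$ would be the largest final completion time over all formations, for example `max(completion_times(c, v, t0, phi)[-1] for c, v in plans)` for a hypothetical list `plans` of (centers, speed) pairs.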

4. The method of claim 3, wherein the integral $f_1$ of the sum of threat values $S_t$, the maximum time $f_2$, and the mutual distance $d_{t_i}(F_a, F_b)$ between unmanned aerial vehicle search formations at the sampling time points in step (2c) are calculated respectively as follows:

where $t_i$ is the sampling time sequence, $f_s$ is the time sampling rate, and

is the position of unmanned aerial vehicle search formation $F_n$ at time $t_i$.
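A minimal Python sketch of the sampled distance check follows. It assumes sampling instants $t_i = t_0 + i / f_s$, Euclidean distance between formation positions in the common horizontal plane, and that the safety constraint requires every pairwise distance to be at least $d_{\min}$; the claim's own formulas are not reproduced in this text, and the function names are hypothetical.

```python
import math
from itertools import combinations
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def sample_times(t0: float, t_end: float, fs: float) -> List[float]:
    """Sampling instants t_i = t0 + i / fs that do not exceed t_end."""
    count = int(math.floor((t_end - t0) * fs)) + 1
    return [t0 + i / fs for i in range(count)]


def min_pairwise_distance(tracks: Sequence[Sequence[Point]]) -> float:
    """tracks[n][i] is the (x, y) position of formation n at sampling instant i."""
    best = math.inf
    for a, b in combinations(tracks, 2):
        for (xa, ya), (xb, yb) in zip(a, b):
            best = min(best, math.hypot(xa - xb, ya - yb))
    return best


def satisfies_safety(tracks: Sequence[Sequence[Point]], d_min: float) -> bool:
    """Assumed form of the safety constraint: all sampled distances >= d_min."""
    return min_pairwise_distance(tracks) >= d_min
```

Checking only at the sampling instants means a higher $f_s$ gives a tighter check at a higher evaluation cost.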

5. The method of claim 4, wherein the sum of threat values $S'_t$ and the time $f'_3$ for unmanned aerial vehicle search formation $F_\mu$ to reach grid task area $T_\nu$ in step (4b) are calculated respectively as follows:

where $S'_t$ denotes the sum of the threat values of all grid task areas and the discovered target $T'_k$ at time $t$, and

denotes the time at which unmanned aerial vehicle search formation $F_\mu$ reaches grid task area $T_\nu$.

6. The method according to claim 5, wherein in step (6c) the convergence population $CA_r$ is updated through $HC_r$, $Fs_r$ and $Is_r$, and the diversity population $DA_r$ is updated through $CA_r$ and $HD_r$; the updating method is as follows: when

holds, $CA_r = X_r$; when

holds, $CA_r = S_r$;

where $X_r$ is the set of the $N - |Fs_r|$ optimal individuals selected from population $Is_r$ according to the two-objective optimization problem formed by the constraint violation value of each individual and its adaptive-penalty-based modified Chebyshev decomposition value; $S_r$ denotes the $N$ optimal individuals selected from population $Fs_r$ by the adaptive-penalty-based modified Chebyshev decomposition method,

denotes the set of optimal individuals obtained from populations $HD_r$ and $CA_r$ by the adaptive-penalty-based modified Chebyshev decomposition method.