CN114205778B

CN114205778B - Heterogeneous task-oriented unmanned aerial vehicle cluster cooperative target selection method

Info

Publication number: CN114205778B
Application number: CN202111346648.0A
Authority: CN
Inventors: 姚昌华; 安蕾; 韩贵真; 高泽郃; 程康; 胡程程
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2021-11-15
Filing date: 2021-11-15
Publication date: 2022-08-26
Anticipated expiration: 2041-11-15
Also published as: CN114205778A

Abstract

The invention discloses a heterogeneous task-oriented unmanned aerial vehicle (UAV) cluster cooperative target selection method. Taking into account the task values and requirements of different targets, as well as the cooperative gains and constraints among multiple UAVs, the method constructs a Stackelberg game model in which the upper-layer UAV is the game leader and the lower-layer UAVs are the game followers, and provides a distributed iterative strategy-update algorithm. This improves the efficiency with which the UAV cluster system completes multiple tasks simultaneously and achieves efficient cooperation across heterogeneous task values in different environments. The method addresses the problems that existing UAV cluster task allocation is limited to homogeneous tasks, neglects differences in task value, and relies on centralized scheduling with insufficient flexibility and high computational complexity.

Description

Heterogeneous task-oriented unmanned aerial vehicle cluster cooperative target selection method

Technical Field

The invention relates to intelligent optimization technology for unmanned aerial vehicle (UAV) cluster systems, and in particular to a heterogeneous task-oriented UAV cluster cooperative target selection method.

Background

In recent years, with the rapid development of artificial intelligence, UAVs have become increasingly intelligent. Large numbers of UAVs are formed into clusters and applied across many fields of social life. With their high flexibility, wide adaptability, and controllable cost, UAV clusters have growing application potential and have attracted close attention at home and abroad. Because UAV equipment is simple, flexible, and reliable, UAVs can carry out selective, targeted observation of and communication with ground targets at close range.

UAV cluster systems offer strong fault tolerance and good adaptability, which makes them better suited to executing tasks in complex environments. Cooperative task completion by UAV clusters is an important development trend. To improve the benefit a UAV cluster obtains from executing tasks, tasks must be allocated efficiently across the cluster; task allocation is one of the key technologies of cooperative UAV control. Task planning for UAV clusters means scheduling the target tasks comprehensively according to task requirements, the UAVs' own characteristics, and similar factors, so as to establish a reasonable cooperative mapping between UAVs and tasks. Although the prior art includes some research on multi-UAV task allocation, most of it does not consider heterogeneous task values, treats the tasks as homogeneous, and does not consider the simultaneous presence of multi-point reconnaissance tasks and communication services. In terms of method, most existing techniques are centralized allocation algorithms, in which a central control entity allocates tasks to all members of the cluster; this mode does not help improve the robustness and environmental responsiveness of the UAV cluster. It is therefore necessary to study cooperative target selection techniques for UAV clusters that face multiple task modes and multiple task values when executing actual tasks.

Disclosure of Invention

Purpose: To overcome the shortcomings of the prior art, the invention provides a heterogeneous task-oriented UAV cluster cooperative target selection method. Through cooperation within the cluster and algorithm iteration, the UAVs allocate each UAV's task targets in a distributed decision-making manner according to the requirements and value attributes of the target tasks in the region, which improves the overall task capability of the UAV cluster. The method addresses the problems that task allocation in current UAV cluster systems is limited to homogeneous tasks and ignores task values, and that centralized scheduling is insufficiently flexible and computationally complex.

Technical solution: To achieve the above purpose, the invention provides the following technical solution: a heterogeneous task-oriented UAV cluster cooperative target selection method, comprising the following steps:

Step 1, initializing the transmit power and task scheduling of the leading UAV, collecting the relevant channel state information and task values of the schedulable task targets, and setting the initial round number and the upper limit on the number of rounds; let 0 denote the index of the leading UAV; the set of cooperative UAVs distributed around it is denoted $A_m = [1, 2, \dots, N]$; the set of schedulable communication task targets of the leading UAV and the cooperative UAVs is denoted $UE = [0, 1, 2, \dots, m]$; the set of schedulable reconnaissance task targets is denoted $OE = [0, 1, 2, \dots, n]$; and the channel gain is $g_{i,j}$, with $j \in UE_i \cup OE_i \cup \{0\}$, $i \in A$, $A = A_m \cup \{0\}$;

Step 2, first updating and adjusting the strategies of the cooperative UAVs in the lower-layer sub-game: for the communication or reconnaissance task target iteratively selected by each cooperative UAV, calculating the utility value of the selected task target in combination with the environmental requirements;

the lower-layer sub-game is defined as:

$G = \{A_m, \{\Phi_i\}, \{U_i\}\}$

where the game participants $A_m$ are the set of cooperative UAVs; $\{\Phi_i\}$ is the set of the participants' strategies, $\Phi_i = \{p_i, c_i\}$, where $p_i$ is the transmit power with which each UAV serves its target task and $c_i$ is each UAV's target task selection; and $\{U_i\}$ is the utility function value of the target task selected by each UAV;

Step 3, repeating step 2 up to a preset number of iterations, allocating the lower-layer task target scheduling, and outputting the optimal strategy set of the lower-layer sub-game; in the iterative update of the lower-layer sub-game, when each cooperative UAV, after k iterations, satisfies

the sub-game is stable; that is, for any cooperative UAV, the difference between the utility values of the (k+1)-th target task selection $c_i(k+1)$ and the k-th target task selection $c_i(k)$ is smaller than a fixed constant $\zeta$; the optimal strategy set $\Phi_m$ of the N cooperative UAVs of the lower-layer sub-game is then output, where $\Phi_m = \{\Phi_1, \Phi_2, \dots, \Phi_N\}$;

Step 4, updating and adjusting the strategy of the leading UAV in the upper-layer game according to the strategies of the lower-layer sub-game, and calculating the utility value of the allocated target task for the communication or reconnaissance task target selected by the leading UAV;

the upper-layer sub-game is defined as: $G = \{A_0, \{\Phi_0\}, \{U_0\}\}$

where the game participant $A_0$ is the set containing the leading UAV; $\{\Phi_0\}$ is the set of the participant's strategies, $\Phi_0 = \{p_0, c_0\}$, where $p_0$ is the transmit power with which the leading UAV serves its target task and $c_0$ is the target task selection of the leading UAV; and $\{U_0\}$ is the utility function value of the target task selected by the leading UAV;

Step 5, repeating step 4 up to a preset number of iterations, allocating the upper-layer task target scheduling, and outputting the optimal strategy set of the upper-layer sub-game; when the leading UAV, after k iterations, satisfies

the sub-game is stable; that is, for the leading UAV, the difference between the utility values of the (k+1)-th task target selection $c_0(k+1)$ and the k-th task target selection $c_0(k)$ is smaller than a fixed constant $\zeta$; the upper-layer task target scheduling is then allocated, and the optimal strategy set $\Phi_0$ of the leading UAV of the upper-layer sub-game is output;

Step 6, repeating steps 2 to 5, iteratively updating the optimal strategies of the upper- and lower-layer sub-games, and solving for the equilibrium solution of the constructed Stackelberg game,

so as to allocate the target tasks of the leading UAV and the cooperative UAVs, where

denotes the best response strategy that maximizes the utility function of the upper-layer game, and

denotes the best response strategy of the lower-layer game.

In a preferred embodiment of the present invention, in step 1, the channel gain $g_{i,j}$ remains constant within a task scheduling and power adjustment period.

In a preferred embodiment of the present invention, in step 6, the strategy combination

satisfies the following conditions:

where the lower-layer game strategy is $\Phi_m = \{\Phi_1, \Phi_2, \dots, \Phi_N\}$, and $\Phi_{-i} = \{\Phi_0, \Phi_1, \dots, \Phi_{i-1}, \Phi_{i+1}, \dots, \Phi_N\}$ denotes the strategy combination of the upper-layer leading UAV and the other lower-layer cooperative UAVs; the optimal strategy of the upper-layer leading UAV,

is obtained by maximizing its own utility function, given the best response strategy of the lower-layer game; and the optimal strategy of each lower-layer cooperative UAV,

is obtained by maximizing its own utility function, given the best response strategy of the upper-layer game and the best response strategies of the other cooperative UAVs.
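For reference, the conditions described in words above match the usual form of Stackelberg equilibrium inequalities. The following is only a sketch in assumed notation: the starred symbols $\Phi_0^*$, $\Phi_m^*$, $\Phi_i^*$ and $\Phi_{-i}^*$ are introduced here for illustration and are not taken from the original formulas.

```latex
\begin{aligned}
U_0\left(\Phi_0^*, \Phi_m^*\right) &\ge U_0\left(\Phi_0, \Phi_m^*\right), && \forall\, \Phi_0, \\
U_i\left(\Phi_i^*, \Phi_{-i}^*\right) &\ge U_i\left(\Phi_i, \Phi_{-i}^*\right), && \forall\, \Phi_i,\ \forall\, i \in A_m .
\end{aligned}
```

Read this way, neither the leader nor any follower can raise its own utility by deviating unilaterally from the equilibrium strategy combination.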

Compared with the prior art, the invention has the following beneficial effects: the invention provides a heterogeneous task-oriented UAV cluster cooperative target selection method in which, through cooperation within the cluster and algorithm iteration, the UAVs allocate each UAV's task targets in a distributed decision-making manner according to the requirements and value attributes of the target tasks in the region, thereby improving the overall task capability of the UAV cluster.

Furthermore, the method takes into account the heterogeneous and value characteristics of multiple tasks in the region. Aiming at improving the overall task capability of the cluster, the design of the utility functions and of the iterative algorithm allows the UAVs to allocate each UAV's task targets in a distributed decision-making manner according to the requirements and value attributes of the target tasks in the region.

Drawings

FIG. 1 is a diagram of a reconnaissance task allocation system model based on UAV communication according to an embodiment;

FIG. 2 is a flow chart of the Stackelberg equilibrium solution algorithm according to an embodiment;

FIG. 3 is a diagram of a simulation scenario according to an embodiment;

FIG. 4 is a convergence diagram of the utility functions of the leading UAV and the cooperative UAVs according to an embodiment;

FIG. 5 is a target task allocation diagram of the leading UAV and the cooperative UAVs according to an embodiment;

FIG. 6 is a network utility comparison graph according to an embodiment;

FIG. 7 is a network utility comparison graph for different numbers of cooperative UAVs according to an embodiment.

Detailed Description

The present invention is further described below with reference to the accompanying drawings and specific embodiments. It should be understood that these embodiments are intended only to illustrate the invention and not to limit its scope. After reading this specification, modifications of various equivalent forms made by those skilled in the art all fall within the scope defined by the appended claims.

Example:

FIG. 1 shows, by way of example, a reconnaissance task allocation system model based on UAV communication, comprising a leading UAV, cooperative UAVs, and targets. Based on this model, this embodiment provides a heterogeneous task-oriented UAV cluster cooperative target selection method, whose flow is shown in FIG. 2 and which comprises the following steps:

Step 1, initializing the transmit power and task scheduling of the leading UAV, collecting the relevant channel state information and task values of the schedulable task targets, and setting the initial round number and the upper limit on the number of rounds.

Specifically, let 0 denote the index of the leading UAV. The set of cooperative UAVs distributed around it is denoted $A_m = [1, 2, \dots, N]$, where $N$ is the total number of cooperative UAVs. The set of schedulable communication task targets of the leading UAV and the cooperative UAVs is denoted $UE = [0, 1, 2, \dots, m]$, where $m$ is the total number of schedulable communication task targets, and the set of schedulable reconnaissance task targets is denoted $OE = [0, 1, 2, \dots, n]$, where $n$ is the total number of schedulable reconnaissance task targets. Assuming that the channel gain remains constant within a task scheduling and power adjustment period, the channel gain between the leading UAV or a cooperative UAV and a target task is denoted $g_{i,j}$, where $i \in A$, $A = A_m \cup \{0\}$, so that $i$ may refer to the leading UAV or a cooperative UAV, and $j \in UE \cup OE \cup \{0\}$, so that $j$ may refer to a communication or reconnaissance task target or to the leading UAV.

In step 2, the strategies of the cooperative UAVs in the lower-layer sub-game are updated first: for the communication or reconnaissance task target iteratively selected by each cooperative UAV, the utility value of the selected task target is calculated in combination with the environmental requirements.

The lower-layer sub-game updates and adjusts the target task strategies. In communication task scheduling, given the scheduling strategies of the leading UAV and the other cooperative UAVs, the downlink signal-to-noise ratio with which the i-th cooperative UAV $CD_i$ serves the k-th communication target task,

is:

The signal-to-noise ratio when uploading communication information,

is:

where

is the total interference when the i-th cooperative UAV $CD_i$ serves the k-th communication target task, comprising the cross-layer interference generated by the leading UAV serving the k-th target task, the same-layer interference generated by the other cooperative UAVs serving the k-th target task, and noise; $p_0$ is the transmit power of the leading UAV; $g_{0,k}$ is the channel gain obtained when the leading UAV serves the k-th target task;

is the total interference generated by the other cooperative UAVs; $p_{-i} = [p_0, p_1, \dots, p_{i-1}, p_{i+1}, \dots, p_N]$ is the power allocation vector of all UAVs other than $CD_i$; and $\sigma^2$ is the background noise. $p_{-i,0} = [p_1, p_2, \dots, p_{i-1}, p_{i+1}, \dots, p_N]$ is the power allocation vector of the cooperative UAVs other than $CD_i$;

is the total interference generated by the cooperative UAVs other than $CD_i$ when uploading information; $p_{j,0}$ is the upload power of a cooperative UAV other than $CD_i$; and $g_{j,0}$ is the channel gain of a cooperative UAV other than $CD_i$ when uploading information. $p_{i,0}$ is the communication upload power vector of $CD_i$. Assuming that the communication upload rate from $CD_i$ to the leading UAV (LD) is $R_i$, the allocated bandwidth

can be obtained from

For a UAV executing a communication service task, the utility function takes into account both the satisfaction of the target task and the power consumption. When $CD_i$ serves the k-th communication target task, the utility function of $CD_i$ can be expressed as:

where the communication utility $U_i^k$ of $CD_i$ is the difference between the revenue of the UAV performing the communication task

and the cost

The revenue part is modeled as an S-shaped (sigmoid) function representing the satisfaction of the target task, and takes into account both the satisfaction with the downlink communication signal-to-noise ratio

and the communication upload signal-to-noise ratio

Here $\theta$ is a constant that trades off the downlink signal-to-noise ratio against the uplink signal-to-noise ratio. In addition,

and $\beta_i^k$ are the steepness and center value of the function when the i-th cooperative UAV $CD_i$ serves the k-th communication target task. $val(k)$ is the value of the k-th task target. The cost part of $CD_i$ takes into account the power consumption of executing the target task, $\mu_i p_i$; the power consumption of uploading communication information, $\kappa_i p_{i,0}$; and the interference penalty that the lower-layer $CD_i$ imposes on the upper-layer LD's communication service, $\lambda_i g_{i,0} p_i$. $\mu_i$ is a constant used to trade off power consumption, $\kappa_i$ is the power consumption coefficient for uploading communication information, and $\lambda_i$ is an interference penalty parameter used to adjust the impact of cross-layer interference on the upper-layer service target task. When $CD_i$ increases its transmit power $p_i$, the satisfaction of the service target task increases, but higher cross-layer interference is also imposed on the upper-layer LD, which affects the QoS of the LD's service target task; $CD_i$ therefore needs a trade-off optimization. For a UAV executing a reconnaissance service task, the reconnaissance utility function likewise consists of two parts: the satisfaction of the target task and the power consumption. In reconnaissance task scheduling, the resolution $r$ of each cooperative UAV for each target task is a fixed value, from which a resolution matrix is constructed. When $CD_i$ serves reconnaissance target task $x$, the reconnaissance utility function of $CD_i$ is expressed as:

where the reconnaissance utility $U_i^x$ of $CD_i$ is the difference between the revenue of the UAV performing the reconnaissance task

and the cost

The revenue part is modeled as a sine function, with the weighting coefficient

keeping the revenue between 0 and 1; $r_i^x$ is the resolution of $CD_i$ for reconnaissance target task $x$, and

is the distance between $CD_i$ and reconnaissance target task $x$. The cost part takes into account both the power consumption of uploading the reconnaissance image

and the power consumption of identification by the LD, $\delta_i p'_i$. $p'_i$ is the total power of the reconnaissance task of each $CD_i$; $\tau_i$ is the proportion of the total power used for the identification computation; and $1 - \tau_i$ is the proportion of power consumed in uploading the reconnaissance information after identification is finished.

is the interference penalty parameter for uploading reconnaissance information, used to balance the interference that uploading reconnaissance information causes to the leading UAV, and a further constant balances the power consumption of $CD_i$ for photographing.
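The communication utility described earlier in this step can be illustrated numerically. The sketch below is a minimal illustration under stated assumptions, not the patent's exact formulas: it assumes the signal-to-interference-plus-noise ratio is the served power over the sum of cross-layer interference, same-layer interference, and noise, and that the revenue is the task value multiplied by a sigmoid of a combined signal-to-noise value `gamma`. The three cost terms follow the text ($\mu_i p_i$, $\kappa_i p_{i,0}$, $\lambda_i g_{i,0} p_i$). All function and parameter names are hypothetical.

```python
import math


def sinr(p_i, g_ik, p_leader, g_0k, others, noise):
    """Downlink SINR of one cooperative UAV serving task k (assumed form).

    others: iterable of (p_j, g_jk) pairs for the other cooperative UAVs.
    """
    # Cross-layer interference + same-layer interference + background noise.
    interference = p_leader * g_0k + sum(p * g for p, g in others) + noise
    return p_i * g_ik / interference


def sigmoid(x, alpha, beta):
    """S-shaped satisfaction with steepness alpha and center value beta."""
    return 1.0 / (1.0 + math.exp(-alpha * (x - beta)))


def comm_utility(gamma, val_k, p_i, p_up, g_i0, alpha, beta, mu, kappa, lam):
    """Assumed utility: value-weighted satisfaction minus three cost terms."""
    benefit = val_k * sigmoid(gamma, alpha, beta)
    # Service power cost, upload power cost, cross-layer interference penalty.
    cost = mu * p_i + kappa * p_up + lam * g_i0 * p_i
    return benefit - cost


gamma = sinr(p_i=1.0, g_ik=2.0, p_leader=1.0, g_0k=0.5,
             others=[(1.0, 0.5)], noise=1.0)
utility = comm_utility(gamma, val_k=1.0, p_i=1.0, p_up=0.5, g_i0=0.1,
                       alpha=1.0, beta=1.0, mu=0.1, kappa=0.2, lam=0.5)
print(gamma, utility)
```

Raising `p_i` in this sketch increases `gamma` and therefore the revenue, but it also raises the penalty term `lam * g_i0 * p_i`, which is the trade-off the text describes for $CD_i$.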

Given the target task selection and transmit power of the upper-layer LD, each CD independently selects its best strategy to maximize its own utility function. The lower-layer sub-game is therefore defined as:

$G = \{A_m, \{\Phi_i\}, \{U_i\}\}$

The lower-layer game $G$ comprises three elements: participants, strategy sets, and utility functions. The game participants $A_m$ are the set of cooperative UAVs; $\{\Phi_i\}$ is the set of the participants' strategies, $\Phi_i = \{p_i, c_i\}$, where $p_i$ is the transmit power with which each UAV serves its target task and $c_i$ is each UAV's target task selection; and $\{U_i\}$ is the utility function value of the target task selected by each UAV. Given the strategies $\Phi_{-i}$ of the other UAVs, the optimal communication target task selection of $CD_i$ is

where

is the interference generated when serving the k-th communication target task;

is the interference generated by the cooperative UAVs other than $CD_i$ when uploading information; $g_{i,k}$ is the channel gain obtained when cooperative UAV $CD_i$ serves the k-th target task; and $g_{i,0}$ is the channel gain when cooperative UAV $CD_i$ uploads information. $p_{i,0} = \varepsilon_i p_i$, where $\varepsilon_i$ is the ratio between the upload power and the transmit power with which $CD_i$ serves the communication target task, and $p_{i,0}$ is the communication upload power vector. $\theta$ is a proportionality coefficient, and $val(k)$ is the value of target task $k$. Having determined the optimal communication target task to serve, the CD further optimizes its transmit power to maximize the communication utility function. Taking the partial derivative of the communication utility function with respect to the transmit power $p_i$ of $CD_i$, and using the inverse relation of the S-shaped function, the optimal transmit power

and the utility function

are:

is the interference generated when serving the $t_i$-th communication target task under the strategies of the leading UAV and the other cooperative UAVs;

is the interference generated by the cooperative UAVs other than $CD_i$ when uploading information under their strategies;

is the channel gain obtained when cooperative UAV $CD_i$ serves the $t_i$-th target task; and $g_{i,0}$ is the channel gain when cooperative UAV $CD_i$ uploads information. $p_{i,0} = \varepsilon_i p_i$, where $\varepsilon_i$ is the ratio between the upload power and the transmit power with which $CD_i$ serves the communication target task. Furthermore, $\alpha_i$ and $\beta_i$ are the steepness and center value of the sigmoid function,

$\Gamma_i$ is the sum of the uplink and downlink signal-to-noise ratios of the service target task, and $\theta$ is a proportionality coefficient.

Here $\mu_i$ is a constant used to trade off power consumption; $\kappa_i$ is the power consumption coefficient for uploading communication information; $\varepsilon_i$ is the ratio between the upload power and the transmit power with which $CD_i$ serves the communication target task; $\lambda_i$ is the interference penalty parameter; $g_{i,0}$ is the channel gain when cooperative UAV $CD_i$ uploads information; and $val(t_i)$ is the value of target task $t_i$. Similarly, letting the total power of the reconnaissance target task be $p'_i = p_i$, the reconnaissance network utility is computed, the utility values of all reconnaissance target tasks are compared, and the optimal reconnaissance target is selected

If the utility value is negative, then $p'_i = 0$ and the reconnaissance task for that target is abandoned. The communication and reconnaissance network utilities are then analyzed to determine the optimal task target selection

The service target task iteration strategy under the final strategy update iteration is:

is, for the optimal communication target task

the network utility;

is, for the optimal reconnaissance target task

the utility; the target task selection is adjusted by comparing these utility values.
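The selection rule just described (compare the best communication utility with the best reconnaissance utility, and abandon reconnaissance targets whose utility is negative) can be sketched as follows. This is an illustrative reading of the text, not the patent's exact update formula; the function name `select_target` and the dictionary-based interface are assumptions.

```python
def select_target(comm_utils, scout_utils):
    """Pick the task with the highest utility across both task types.

    comm_utils / scout_utils: dicts mapping a task id to its utility value.
    Returns (task_type, task_id, utility), or None if nothing is selectable.
    """
    # Reconnaissance targets with negative utility are abandoned
    # (the text sets the reconnaissance power p'_i to 0 in that case).
    viable_scout = {x: u for x, u in scout_utils.items() if u >= 0}

    candidates = [("comm", k, u) for k, u in comm_utils.items()]
    candidates += [("scout", x, u) for x, u in viable_scout.items()]
    if not candidates:
        return None
    # Compare utility values and keep the best task target.
    return max(candidates, key=lambda c: c[2])


print(select_target({1: 0.4, 2: 0.6}, {7: 0.7, 8: -0.2}))
```

With these hypothetical utilities, reconnaissance target 8 is dropped because its utility is negative, and reconnaissance target 7 is chosen over the best communication target.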

In step 3, step 2 is repeated up to a preset number of iterations, the lower-layer task scheduling is allocated, and the optimal strategy set of the lower-layer sub-game is output.

In the iterative update of the lower-layer sub-game, when each cooperative UAV, after k iterations, satisfies

the sub-game is stable. That is, for any cooperative UAV, the difference between the utility values of the (k+1)-th target task selection $c_i(k+1)$ and the k-th target task selection $c_i(k)$ is smaller than a fixed constant $\zeta$. The lower-layer task target scheduling is then allocated, and the optimal strategy set $\Phi_m$ of the N cooperative UAVs of the lower-layer sub-game is output, where $\Phi_m = \{\Phi_1, \Phi_2, \dots, \Phi_N\}$.
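The stopping rule above can be sketched in a few lines. The sketch assumes that "difference" means the absolute difference between successive utility values, and that the preset number of iterations acts as a hard cap; the names `is_stable`, `iterate_until_stable`, and `update` are hypothetical.

```python
def is_stable(prev_utils, curr_utils, zeta):
    """True when every UAV's utility changed by less than zeta (assumed: absolute change)."""
    return all(abs(c - p) < zeta for p, c in zip(prev_utils, curr_utils))


def iterate_until_stable(update, utils, zeta, max_rounds):
    """Apply `update` until the utilities stabilize or max_rounds is reached.

    update: function mapping the current list of utilities to the next one.
    Returns (final_utilities, rounds_used).
    """
    for k in range(max_rounds):
        new_utils = update(utils)
        if is_stable(utils, new_utils, zeta):
            return new_utils, k + 1
        utils = new_utils
    return utils, max_rounds


# Toy update rule that moves each utility halfway toward 1.0.
final, rounds = iterate_until_stable(
    lambda us: [0.5 * u + 0.5 for u in us], [0.0], zeta=0.1, max_rounds=20)
print(final, rounds)
```

The same check applies to the leading UAV in the upper-layer sub-game, with a single utility value instead of a list.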

In step 4, the strategy of the leading UAV in the upper-layer game is updated and adjusted according to the strategies of the lower-layer sub-game, and the utility value of the allocated target task is calculated for the communication or reconnaissance task target selected by the leading UAV.

The upper-layer sub-game updates and adjusts the target task strategy. In communication task scheduling, given the scheduling strategies of the lower-layer cooperative UAVs (CDs), the downlink signal-to-noise ratio with which the LD serves the l-th user

can be expressed as:

where

is the total interference generated by the lower-layer cooperative UAVs on the LD's service target task $l$; $p_0$ is the transmit power of the leading UAV; and $g_{0,l}$ is the channel gain with which the leading UAV serves target task $l$. $p_j$ and $g_{j,l}$ are the transmit power of the other cooperative UAVs and their channel gain to target task $l$, and $\sigma^2$ is the noise. For a UAV executing a communication service task, the utility function takes into account both the satisfaction of the target task and the power consumption. For a given communication target task $k$, the utility function of the leading UAV LD can be expressed as:

The utility function $U_0^k$ comprises two parts. The first part is the revenue of serving the communication target task,

modeled as an S-shaped function representing the revenue derived from the satisfaction of the served communication target task. The second part is the cost function

representing the dynamic power overhead. The parameters $\alpha_0$ and $\beta_0$ are the steepness and center value of the sigmoid function, respectively. $val(k)$ is the value of communication task target $k$ of the LD, and

is the downlink signal-to-noise ratio with which the LD serves target task $k$. $p_0$ is the transmit power of the leading UAV, and $\mu_0$ is a constant used to balance the satisfaction of the service task target against the power consumption. For a UAV executing a reconnaissance service task, the reconnaissance utility function likewise consists of two parts: the satisfaction of the target task and the power consumption. In reconnaissance task scheduling, the resolution $r$ of each cooperative UAV or the leading UAV for each task target is a fixed value, from which a resolution matrix is constructed.

The utility function $U_0^x$ comprises two parts:

represents the revenue of serving the reconnaissance task target, and

represents the cost of serving the reconnaissance task target, that is, the power consumption of LD image recognition, where

is the resolution of the LD for task target $x$,

is the distance between the LD and task target $x$, and

is the weighting coefficient that keeps the revenue between 0 and 1. $\delta_0$ is the image recognition power consumption proportionality constant, and $p'_0$ is the power of the leading UAV when performing reconnaissance.

Based on the optimal strategy set $\Phi_m$ of the lower-layer sub-game, the upper-layer leading UAV LD independently selects its optimal strategy to maximize its own utility function. The upper-layer sub-game is therefore defined as:

$G = \{A_0, \{\Phi_0\}, \{U_0\}\}$

Similarly, the upper-layer game $G$ has three elements: participants, strategy sets, and utility functions. The game participant $A_0$ is the set containing the leading UAV; $\{\Phi_0\}$ is the set of the participant's strategies, $\Phi_0 = \{p_0, c_0\}$, where $p_0$ is the transmit power with which the leading UAV serves its target task and $c_0$ is the target task selection of the leading UAV; and $\{U_0\}$ is the utility function value of the target task selected by the leading UAV. Given the strategies of the other UAVs, the optimal communication task target selection of the LD is

where

is the interference generated when serving the k-th communication task target; $g_{0,k}$ is the channel gain obtained when the leading UAV LD serves the k-th target task; and $val(k)$ is the value of target task $k$. Having determined the optimal communication target task to serve, the LD further optimizes its transmit power to maximize the communication utility function. Using the inverse relation of the S-shaped function, the optimal transmit power

and the utility function

are:

where $\alpha_0$ and $\beta_0$ are the steepness and center value of the sigmoid function; $\gamma_0$ is the downlink signal-to-noise ratio of the served communication target; $val(t_0)$ is the value of task target $t_0$; and $\mu_0$ is a constant that balances the satisfaction of the service target against the power consumption. Similarly, the network utility of the upper-layer reconnaissance task targets is computed, the utility values of all reconnaissance task targets are compared, and the optimal reconnaissance task is selected

The communication and reconnaissance network utilities are then analyzed to determine the optimal task target selection

The service task target iteration strategy under the final strategy update iteration is:

is, for the optimal communication task target

the network utility, and

is that for the optimal reconnaissance task target

In step 5, step 4 is repeated up to a preset number of iterations, the upper-layer task target scheduling is allocated, and the optimal strategy set of the upper-layer sub-game is output.

In the iterative update of the upper-layer sub-game, when the leading UAV, after k iterations, satisfies

the sub-game is stable. That is, for the leading UAV, the difference between the utility values of the (k+1)-th target task selection $c_0(k+1)$ and the k-th target task selection $c_0(k)$ is smaller than a fixed constant $\zeta$. The upper-layer task target scheduling is then allocated, and the optimal strategy set $\Phi_0$ of the leading UAV of the upper-layer sub-game is output.

In step 6, steps 2-5 are repeated, the optimal strategies of the upper- and lower-layer sub-games are updated iteratively, the Stackelberg game equilibrium is established, and the target tasks of the leading drone and the cooperative drones are reasonably allocated.

denotes the best response strategy of the upper-layer game, which maximizes its utility function, and

denotes the best response strategy of the lower-layer game. For any strategy combination, the following conditions are satisfied:

where the lower-layer game strategy is Φ_m = {Φ_1, Φ_2, …, Φ_N}, and Φ_{-i} = {Φ_0, Φ_1, …, Φ_{i-1}, Φ_{i+1}, …, Φ_N} denotes the strategy combination of the upper-layer leading drone and the other lower-layer cooperative drones. The strategy combination

is called the Stackelberg equilibrium. The optimal strategy of the upper-layer leading drone

is obtained, given the best response strategy of the lower-layer game, by maximizing its own utility function. Similarly, the optimal strategy of each lower-layer cooperative drone

is obtained, given the best response strategy of the upper-layer game and the best response strategies of the other cooperative drones, by maximizing its own utility function.
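For reference, the textbook Stackelberg equilibrium condition for one leader and N followers can be written as below. This is a sketch of the standard form using the strategy symbols defined above; the utility symbols U_0 (leader) and U_i (follower i) are assumed names, and the expression is not claimed to reproduce the original formula term by term.

```latex
\begin{aligned}
U_0\left(\Phi_0^{*}, \Phi_m^{*}\right) &\ge U_0\left(\Phi_0, \Phi_m^{*}\right), \\
U_i\left(\Phi_i^{*}, \Phi_{-i}^{*}\right) &\ge U_i\left(\Phi_i, \Phi_{-i}^{*}\right), \quad \forall i \in A_m .
\end{aligned}
```

Read together, the two inequalities state that neither the leading drone nor any cooperative drone can raise its own utility by deviating unilaterally from the equilibrium strategy.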

Fig. 2 shows the algorithm flow chart of the method. The sub-games are solved cyclically by a general iterative algorithm, and the Stackelberg equilibrium iteration ends when the upper- and lower-layer target task allocations no longer change. The equilibrium of the upper- and lower-layer sub-games is sought mainly by backward induction, which solves the task allocation problem in the multi-drone system.
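The outer loop of the flow described above can be sketched as an alternating best-response iteration: the followers update first given the leader's current choice, the leader then responds, and the loop stops when no allocation changes. The function names, the congestion-style toy payoff (task value divided by the number of drones on the target), and the task labels are hypothetical and stand in for the utility functions of the method.

```python
def iterate_to_equilibrium(leader_update, follower_update,
                           leader0, followers0, max_rounds=30):
    """Alternate lower-layer and upper-layer updates until allocations stop changing.

    Returns (leader_choice, follower_choices, rounds_used).
    """
    leader, followers = leader0, dict(followers0)
    for round_no in range(1, max_rounds + 1):
        new_followers = dict(followers)
        for i in sorted(new_followers):  # lower layer: followers update in turn
            new_followers[i] = follower_update(i, leader, new_followers)
        new_leader = leader_update(new_followers)  # upper layer responds
        if new_leader == leader and new_followers == followers:
            return leader, followers, round_no
        leader, followers = new_leader, new_followers
    return leader, followers, max_rounds


# Hypothetical toy payoff: a target's value is shared by the drones serving it.
VALUES = {"a": 1.0, "b": 0.9, "c": 0.8}


def follower_update(i, leader, followers):
    others = [leader] + [c for j, c in followers.items() if j != i]
    return max(VALUES, key=lambda t: VALUES[t] / (1 + others.count(t)))


def leader_update(followers):
    taken = list(followers.values())
    return max(VALUES, key=lambda t: VALUES[t] / (1 + taken.count(t)))


leader, followers, rounds = iterate_to_equilibrium(
    leader_update, follower_update, "a", {1: "a", 2: "a"})
```

Starting with all three drones on target "a", the toy run spreads the drones over distinct targets and then detects that nothing changes in the following round. Note that this loop is plain iterative best response; it only approximates backward induction in the sense that the lower layer is resolved before the leader moves.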

As shown in Fig. 3, the radius of the area in which the LD can serve task targets is 300 m. 15 CDs are randomly distributed within the LD scheduling range, each with a serviceable task target radius of 30 m, and the communication and reconnaissance tasks are randomly distributed within the LD and CD service ranges. The LD can serve 3 communication task targets and reconnaissance task targets. The numbers of communication and reconnaissance task targets that the CDs can serve are 4, 5, 4, 5, 4, 4, 5, 3, 4, 3 and 3, 4, 2, 4, 3, 2, 4, 1, 2, 2, respectively. The value val of the communication and reconnaissance task targets served by the LD is 1; the values of the communication and reconnaissance task targets served by the CDs are somewhat lower and are generated randomly within [0.9, 0.95]. The channel gain from CD_i to task target j is

where the distance term denotes the corresponding distance, and the signal attenuation is 25 dB. The target signal-to-noise ratio of the communication task served by the LD is γ_0 = 30 dB; the signal-to-noise ratio of the communication task targets served by the CDs and the upload signal-to-noise ratio are both generated randomly within [10, 20] dB. The noise power is σ² = 10^-8 mW. The parameters are α_i = 1 and θ = 1. The communication interference penalty and interference cost parameters are set to λ_i = 10^8 and μ_i = 1/mV, and the upload power consumption parameter is κ_i = 1/mV. In the reconnaissance task, the LD image recognition power consumption parameter is δ_0 = 1, the parameter

the CD image recognition power consumption parameter is δ_i = 1, and the upload interference penalty and upload power ratio parameters are set to

τ_i = 0.6. The trade-off parameters

The resolution of the LD and the CDs for the reconnaissance task targets is given in Table 1.

TABLE 1 Resolution of the LD and CDs for reconnaissance targets
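The simulation setup described above can be captured in a small scenario generator. This is a sketch under stated assumptions: positions are drawn uniformly over the disc (the text only says "randomly distributed"), one CD is created per entry of the two listed count sequences (ten entries), and the dictionary keys and function names are hypothetical.

```python
import math
import random

LD_RADIUS_M = 300.0   # LD serviceable area radius
CD_RADIUS_M = 30.0    # CD serviceable radius
NOISE_MW = 1e-8       # noise power
LD_SNR_DB = 30.0      # target SNR of the LD communication task
COMM_COUNTS = [4, 5, 4, 5, 4, 4, 5, 3, 4, 3]
RECON_COUNTS = [3, 4, 2, 4, 3, 2, 4, 1, 2, 2]


def sample_in_disc(rng, radius):
    """Uniform point in a disc centred on the LD (assumed distribution)."""
    r = radius * math.sqrt(rng.random())
    phi = 2.0 * math.pi * rng.random()
    return (r * math.cos(phi), r * math.sin(phi))


def build_scenario(seed=0):
    """Return a dict describing one random scenario (hypothetical layout)."""
    rng = random.Random(seed)
    cds = []
    for n_comm, n_recon in zip(COMM_COUNTS, RECON_COUNTS):
        cds.append({
            "position": sample_in_disc(rng, LD_RADIUS_M),
            "comm_values": [rng.uniform(0.9, 0.95) for _ in range(n_comm)],
            "recon_values": [rng.uniform(0.9, 0.95) for _ in range(n_recon)],
            "snr_db": rng.uniform(10.0, 20.0),
        })
    ld = {"position": (0.0, 0.0), "task_value": 1.0, "snr_db": LD_SNR_DB}
    return {"ld": ld, "cds": cds, "noise_mw": NOISE_MW}


scenario = build_scenario(seed=0)
```

Fixing the seed makes a run repeatable, which is convenient when comparing utility curves across different numbers of cooperative drones.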

Fig. 4 shows the iterative network utility update curves of the leading drone and the cooperative drones under the method of the present invention, where each iteration is set to run 30 rounds, up to the preset number of times. The utility curves show that the interaction between the upper- and lower-layer games eventually reaches a converged state, which verifies the convergence of the algorithm.

Fig. 5 shows the optimal communication and reconnaissance task allocation of the drones when the upper- and lower-layer sub-games reach the Stackelberg equilibrium under the method of the present invention. Each drone can independently make the optimal choice among the communication or reconnaissance target tasks it serves.

Fig. 6 shows how the system utility value changes during target task scheduling under the method of the present invention in three cases: jointly considering communication and reconnaissance tasks, considering only communication tasks, and considering only reconnaissance tasks. Fig. 7 shows the system utility values with 8, 9, 10, 11, 12 and 13 cooperative drones, respectively. The upper- and lower-layer games eventually converge to the equilibrium point, and the system utility of the proposed algorithm is in every case larger than the system utility obtained when only a single index, communication or reconnaissance, is considered.

Communication and reconnaissance task allocation among multiple drones is of significant research interest in unmanned cluster network optimization. The method focuses on the joint optimization of target task scheduling and power control in a drone network. It uses a hierarchical game framework to analyze the decision behavior of the leading drone and the cooperative drones, and adopts a distributed iterative strategy update algorithm to reach the Stackelberg equilibrium and thereby the optimal target task scheduling of the drones. Simulation analysis over several scenarios verifies that the proposed algorithm achieves convergence of distributed task allocation and system stability in a multi-drone system, and effectively improves the overall effectiveness of the tasks executed by the multi-drone system.

The above description is only of the preferred embodiments of the present invention, and it should be noted that: it will be apparent to those skilled in the art that various modifications and adaptations can be made without departing from the principles of the invention and these are intended to be within the scope of the invention.

Claims

1. A heterogeneous task-oriented unmanned aerial vehicle cluster cooperative target selection method, characterized by comprising the following steps:

step 1, initializing the transmit power and task scheduling of the leading drone, collecting the relevant channel state information and the task values of the schedulable task targets, and setting the initial round number and the maximum number of rounds; letting 0 denote the index of the leading drone, the set of cooperative drones distributed around it is denoted A_m = [1, 2, …, N], where N is the total number of cooperative drones; the set of communication task targets schedulable by the leading drone and the cooperative drones is denoted UE = [0, 1, 2, …, m], where m is the total number of schedulable communication task targets; the set of schedulable reconnaissance task targets is denoted OE = [0, 1, 2, …, n], where n is the total number of schedulable reconnaissance task targets; and the channel gain is g_{i,j}, j ∈ UE_i ∪ OE_i ∪ {0}, i ∈ A, A = A_m ∪ {0};

step 2, first performing the strategy update of the cooperative drones in the lower-layer sub-game: each cooperative drone iteratively selects a communication or reconnaissance task target in light of the environmental requirements, and the utility value corresponding to the selected task target is calculated;

the lower-layer sub-game updates the target task strategy as follows: in communication task scheduling, given the scheduling strategies of the leading drone and the other cooperative drones, the downlink signal-to-noise ratio of the i-th cooperative drone CD_i serving the k-th communication target task

is:

and the signal-to-noise ratio when uploading communication information

is:

wherein

is the total interference when the i-th cooperative drone CD_i serves the k-th communication target task, comprising the cross-layer interference from the leading drone at the k-th target task, the same-layer interference from the other cooperative drones at the k-th target task, and noise; p_0 is the transmit power of the leading drone; g_{0,k} is the channel gain obtained when the leading drone serves the k-th target task;

is the sum of the interference generated by the other cooperative drones; p_{-i} = [p_0, p_1, …, p_{i-1}, p_{i+1}, …, p_N] denotes the power allocation vector of all drones except CD_i; σ² is the background noise; p_{-i,0} = [p_1, p_2, …, p_{i-1}, p_{i+1}, …, p_N] denotes the power allocation vector of the cooperative drones other than CD_i;

denotes the sum of the interference generated when the cooperative drones other than CD_i upload information; p_{j,0} denotes the power with which a cooperative drone other than CD_i uploads information; g_{j,0} denotes the channel gain when a cooperative drone other than CD_i uploads information; p_{i,0} is the communication upload power vector of CD_i; assuming that the communication upload rate from CD_i to the leading drone LD is R_i, the allocated bandwidth

is obtained from

; for a drone performing a communication service task, the utility function is designed to take into account both the satisfaction of the target task and the power consumption; when CD_i is given to serve the k-th communication target task, the utility function of CD_i is expressed as:

wherein the communication utility of CD_i

is the difference between the revenue obtained by the drone from performing the communication task

and the cost consumption

Satisfaction and communication upload signal-to-noise ratio

θ is a constant used to trade off the communication downlink signal-to-noise ratio against the communication uplink signal-to-noise ratio; in addition, α_i^k and β_i^k are the steepness and center value of the sigmoid function for the i-th cooperative drone CD_i serving the k-th communication target task; val(k) is the value of the k-th task target; the cost function of CD_i simultaneously considers the power consumption μ_i p_i of executing the target task, the power consumption κ_i p_{i,0} of uploading communication information, and the interference penalty λ_i g_{i,0} p_i imposed on the lower-layer CD_i for interfering with the communication service of the upper-layer leading drone LD; μ_i is a constant used to trade off power consumption, κ_i denotes the power consumption coefficient of uploading communication information, and λ_i denotes an interference penalty parameter used to adjust the effect of cross-layer interference on the upper-layer service target task; when CD_i increases its transmit power p_i, the satisfaction of the served target task increases, but higher cross-layer interference is imposed on the upper-layer leading drone LD, affecting the QoS of the target task served by the leading drone LD, so CD_i must make a compromise; for a drone executing a reconnaissance service task, the reconnaissance utility function likewise comprises two parts, the satisfaction of the target task and the power consumption; in reconnaissance task scheduling, the resolution r of each cooperative drone for each target task is a fixed value, and a resolution matrix is constructed; when CD_i is given to serve reconnaissance target task x, the reconnaissance utility function of CD_i is expressed as:

wherein the reconnaissance utility of CD_i

is the difference between the revenue obtained by the drone from performing the reconnaissance task

and the cost consumption

; the revenue function is modeled as a sine function, in which the trade-off coefficient

keeps the revenue stably between 0 and 1; r_i^x is the resolution value of CD_i for reconnaissance target task x;

is the distance between CD_i and reconnaissance target task x; val(x) denotes the task value; the cost function simultaneously considers the power consumption of uploading the reconnaissance image

and the identification power consumption δ_i p'_i at the leading drone LD; p'_i is the total reconnaissance task power of each CD_i; τ_i is the proportion of the total power used for identification computation, and 1-τ_i is the proportion of power consumed in uploading the reconnaissance information after identification is complete;

is the interference penalty parameter for uploading reconnaissance information, used to balance the interference that the upload imposes on the leading drone; δ_i is a constant that weights the power consumed by CD_i in taking pictures;

given the target task selection and transmit power of the upper-layer leading drone LD, each CD independently selects the optimal strategy that maximizes its own utility function, and the lower-layer sub-game is defined as follows:

wherein the game participants A_m are the set of cooperative drones; the strategy set of the participants is {Φ_i}, Φ_i = {p_i, c_i}, where p_i is the transmit power with which each drone serves its target task and c_i is each drone's target task selection;

is the utility function value of the target task selected by each drone;

given the strategies Φ_{-i} of the other drones, the optimal communication target task selection of CD_i is

wherein

is the interference generated when serving the k-th communication target task;

is the interference generated when the cooperative drones other than CD_i upload information; g_{i,k} is the channel gain obtained when cooperative drone CD_i serves the k-th target task; g_{i,0} denotes the channel gain when cooperative drone CD_i uploads information; p_{i,0} = ε_i p_i, where ε_i is the ratio coefficient relating the transmit power and the upload power with which CD_i serves the communication target task, and p_{i,0} is the communication upload power vector; θ is a proportionality coefficient; val(k) is the value of target task k; the CD determines the optimal communication target task to serve and then further optimizes its transmit power to maximize the communication utility function; for the transmit power p_i of CD_i, combining the reciprocal relation of the S-shaped (sigmoid) function yields the optimal transmit power

and the utility function

as:

is the channel gain obtained when cooperative drone CD_i serves the t_i-th target task; g_{i,0} denotes the channel gain when cooperative drone CD_i uploads information; p_{i,0} = ε_i p_i, where ε_i is the ratio coefficient relating the transmit power and the upload power with which CD_i serves the communication target task; in addition, α_i and β_i are the steepness and center value of the sigmoid function;

Γ_i is the sum of the uplink and downlink signal-to-noise ratios of the served target task, and θ is a proportionality coefficient;

wherein μ_i is a constant used to trade off power consumption, κ_i denotes the power consumption coefficient of uploading communication information, ε_i is the ratio coefficient relating the transmit power and the upload power with which CD_i serves the communication target task, λ_i denotes the interference penalty parameter, g_{i,0} denotes the channel gain when cooperative drone CD_i uploads information, and val(t_i) is the value of target task t_i; similarly, letting the total power of the reconnaissance target task be p'_i = p_i, the reconnaissance network utility is computed, the utility values of all reconnaissance target tasks are compared, and the optimal reconnaissance target is selected

; if the utility value is negative, p'_i = 0, that is, the reconnaissance task for that target is abandoned; the communication and reconnaissance network utilities are analyzed, and the optimal task target selection is determined

is the network utility of the optimal communication target task

and

is the network utility of the optimal reconnaissance target task

; the utility values are compared to adjust the target task selection;

after which the sub-game is stable: for any cooperative drone, the difference between the utility values of the (k+1)-th target task selection c_i(k+1) and the k-th target task selection c_i(k) is smaller than a fixed constant ζ, and the optimal strategy set of the N cooperative drones of the lower-layer sub-game is output as Φ_m, where Φ_m = {Φ_1, Φ_2, …, Φ_N};

the upper-layer sub-game updates the target task strategy; in communication task scheduling, given the scheduling strategy of each lower-layer cooperative drone CD, the downlink signal-to-noise ratio of the leading drone LD serving the l-th user

can be expressed as:

wherein

denotes the sum of the interference imposed by the lower-layer cooperative drones on target task l served by the leading drone LD; p_0 is the transmit power of the leading drone; g_{0,l} is the channel gain of the leading drone serving target task l; p_j and g_{j,l} are the transmit power of the other cooperative drones and their channel gain to target task l; σ² is the noise; for a drone executing a communication service task, the utility function is designed to take into account the satisfaction of the target task and the power consumption; for a given communication target task k, the utility function of the leading drone LD can be expressed as:

the utility function

comprises two parts: the first part is the revenue from serving the communication target task

, modeled as a sigmoid function, representing the revenue obtained from the satisfaction of the served communication target task; the second part is the cost function

, representing the dynamic power overhead; the parameters α_0 and β_0 are the steepness and center value of the sigmoid function, respectively; val(k) denotes the value of communication task target k served by the leading drone LD;

is the downlink signal-to-noise ratio of the leading drone LD serving target task k; p_0 is the transmit power of the leading drone; μ_0 is a constant used to balance the satisfaction of the service task target against power consumption; for a drone executing a reconnaissance service task, the reconnaissance utility function likewise comprises two parts, the satisfaction of the target task and the power consumption; in reconnaissance task scheduling, the resolution r of each cooperative drone or the leading drone for each task target is a fixed value, and a resolution matrix is constructed; when the leading drone LD serves reconnaissance task target x, the reconnaissance utility function of the leading drone LD is expressed as:

the utility function

comprises two parts:

denotes the revenue from serving the reconnaissance task target, and

denotes the cost of serving the reconnaissance task target, that is, the power consumption of image recognition by the leading drone LD; wherein

is the resolution of the leading drone LD for task target x;

is the distance between the leading drone LD and task target x;

is the trade-off coefficient that keeps the revenue stably between 0 and 1; δ_0 is the image recognition power consumption proportionality constant; p'_0 is the power of the leading drone during reconnaissance;

according to the optimal strategy set Φ_m of the lower-layer sub-game, the upper-layer leading drone LD independently selects the optimal strategy that maximizes its own utility function, and the upper-layer sub-game is defined as follows:

the game participant A_0 is the set of leading drones; the strategy set of the participant is {Φ_0}, Φ_0 = {p_0, c_0}, where p_0 is the transmit power with which the leading drone serves its target task and c_0 is the target task selection of the leading drone;

is the utility function value of the target task selected by the leading drone;

given the strategies of the other drones, the optimal communication task target selection of the leading drone LD is

wherein

is the interference generated when serving the k-th communication task target; g_{0,k} is the channel gain obtained when the leading drone LD serves the k-th target task; val(k) is the value of target task k; the leading drone LD determines the optimal communication target task to serve, then further optimizes its transmit power to maximize the communication utility function, and, combining the reciprocal relation of the S-shaped (sigmoid) function, obtains the optimal transmit power

and the utility function

as:

wherein α_0 and β_0 are the steepness and center value of the sigmoid function, γ_0 is the downlink signal-to-noise ratio for serving the communication target, val(t_0) is the value of task target t_0, and μ_0 is a constant used to trade off the satisfaction of the service target against power consumption; similarly, the network utility of each upper-layer reconnaissance task target is computed, the utility values of all reconnaissance task targets are compared, and the optimal reconnaissance task is selected

is the network utility of the optimal communication task target

and

is the network utility of the optimal reconnaissance task target

; the utility values are compared to adjust the target task selection;

after which the sub-game is stable: for the leading drone, the difference between the utility values of the (k+1)-th task target selection c_0(k+1) and the k-th task target selection c_0(k) is smaller than a fixed constant ζ, the upper-layer task target scheduling is reasonably allocated, and the optimal strategy set Φ_0 of the leading drone in the upper-layer sub-game is output;

denotes the best response strategy of the lower-layer game; the strategy combination

satisfies the following conditions:

and is called the Stackelberg equilibrium; the optimal strategy of the upper-layer leading drone

is obtained, given the best response strategy of the lower-layer game, by maximizing its own utility function, and the optimal strategy of each lower-layer cooperative drone

2. The heterogeneous task-oriented unmanned aerial vehicle cluster cooperative target selection method according to claim 1, wherein in step 1 the channel gain g_{i,j} remains stable and unchanged during the task scheduling and power adjustment period.