CN115507852A

CN115507852A - Multi-unmanned aerial vehicle path planning method based on block chain and attention-enhancing learning

Info

Publication number: CN115507852A
Application number: CN202211086751.0A
Authority: CN
Inventors: 鲁仁全; 陈建焰; 徐雍; 饶红霞; 彭慧; 刘畅
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2022-09-07
Filing date: 2022-09-07
Publication date: 2022-12-23
Anticipated expiration: 2042-09-07
Also published as: CN115507852B

Abstract

The invention relates to the field of unmanned aerial vehicle path planning, and discloses a multi-unmanned aerial vehicle path planning algorithm based on a block chain and enhanced attention. Meanwhile, a safety mechanism of a block chain is introduced, so that the information safety of the multi-computer system is improved, some malicious information attacks are prevented, and meanwhile, the safety and the stability of the system are greatly improved by adding an asymmetric key mechanism. Meanwhile, a block chain knowledge sharing mechanism is used, and a public key and a private key are issued to achieve the function of sharing the multi-unmanned aerial vehicle path planning knowledge. The generalization ability of many unmanned aerial vehicle path planning can be improved.

Description

Multi-unmanned aerial vehicle path planning method based on block chain and attention-enhancing learning

Technical Field

The invention relates to the field of unmanned aerial vehicle path planning, in particular to a multi-unmanned aerial vehicle path planning method based on a block chain and attention-enhancing learning.

Background

The application of the unmanned aerial vehicle brings brand-new development paths for a plurality of industries, the unmanned aerial vehicle has excellent application prospect, the safety risk coefficient is extremely low in the using process, the consumed resources are less in operation, and tasks can be completed quickly and quickly. When the unmanned aerial vehicle moves, how to plan the path is the core content of the application of the unmanned aerial vehicle, and generally, the path planning of the unmanned aerial vehicle is the use of reducing resources and time to the maximum while reaching the place. Under the condition of an obstacle, the unmanned aerial vehicle finishes the movement to a target point by simulating the behavior of human avoiding the obstacle, and for the condition that a plurality of target points exist, people convert the behavior into a traveler problem to search, and provide an algorithm based on a self-organizing mapping network. The collaborative path planning method for multiple drones can be used as an NP combinatorial optimization problem with multiple constraints. At present, the main classical models of the multi-unmanned aerial vehicle cooperative reconnaissance problem comprise a multi-traveler model, a mixed linear integer programming model and a vehicle scheduling and path programming model. Conventional solving methods solve through heuristic algorithms, such as genetic algorithms, simulated annealing algorithms, and evolutionary algorithms. However, the traditional model cannot fully describe the constraints of the multi-drone cooperative reconnaissance mission. Under complex environmental conditions, in the face of constantly changing terrain data, the heuristic algorithm needs to be re-optimized and solved, so that the adaptability is poor, and a corresponding solution cannot be quickly given. Most of the traditional heuristic learning can not avoid the problem that iteration needs to be optimized again, and the instantaneity is not high. And some important flight route information is often related to in traditional multi-unmanned aerial vehicle path planning, the risk that the unmanned aerial vehicle is attacked by cyber physics in the coming years is increased, and traditional unmanned aerial vehicle crowd is easily attacked by other malicious unmanned aerial vehicles, so that some important path information is lost, the multi-unmanned aerial vehicle path planning method is damaged, and other problems occur in the current multi-unmanned aerial vehicle application, and some aerial collisions often occur due to the coordination of the unmanned aerial vehicles or the communication of the multi-unmanned aerial vehicles is hijacked. In addition, most of the existing systems only rely on local communication between adjacent member unmanned aerial vehicles in member unmanned aerial vehicles of an unmanned aerial vehicle network at present, and lack knowledge sharing capable of trusting unmanned aerial vehicles globally, so that a multi-unmanned aerial vehicle path planning method based on block chains and strengthened attention learning is provided.

Disclosure of Invention

Technical problem to be solved

Aiming at the defects of the prior art, the invention provides a multi-unmanned aerial vehicle path planning method based on a block chain and strengthened attention learning, and solves the problems.

(II) technical scheme

In order to achieve the above purpose, the invention provides the following technical scheme: a multi-unmanned aerial vehicle path planning method based on block chains and enhanced attention learning comprises the following steps:

the first step is as follows: using decision variables

Decision variables representing cooperative reconnaissance tasks of multiple unmanned aerial vehicles;

the second step is that: different proportionality coefficients are used for combining each objective function in the process that the unmanned aerial vehicle executes the reconnaissance mission;

the third step: introducing a node variance to adjust the load of each drone;

the fourth step: establishing an end-to-end training model of an attention mechanism to plan the path of the unmanned aerial vehicle;

the fifth step: training a model;

and a sixth step: a multi-unmanned aerial vehicle system carrying a block chain;

the seventh step: and multi-unmanned aerial vehicle cross-domain knowledge sharing and common learning.

Preferably, the objective function includes: access by each drone to a node;

the unmanned aerial vehicle leaves the access of the node;

the distance constraint of each drone cannot exceed its maximum distance;

total flight mileage of all unmanned aerial vehicles;

survival coverage function of drone.

Preferably, the survival coverage function of the whole path of the multi-drone scout mission is an accumulated sum of the survival coverage functions of the nodes.

Preferably, the path planning in the fourth step is obtained by the following method:

s1: mapping coordinates of the nodes to a 128-dimensional embedding layer by using different parameters, and distinguishing the base and the nodes by using a linear mapping layer of parameter sums;

s2: the embedding of the input node is updated through attention layers, each attention layer consists of two sublayers, a multi-head attention layer and a node full-connection feedforward layer are connected, and all sublayers are connected by adopting a jump connection structure;

s3: calculating graph embedding by calculating an average value of network output layer embedding, and then transmitting the output layer embedding and the graph embedding to a decoder;

s4: the decoder generates a path plan by receiving the embedding, the graph embedding and the context embedding of the encoder output layer nodes.

Preferably, the decoding process is performed step by step, and at each time point t, the decoder generates context embedding based on the input nodes and the generated path plan at the time point t-1.

Preferably, the method for training the model in the fifth step includes the following steps:

the first step is as follows: defining a model and defining an optimization objective;

the second step: generating simulation data, then performing pre-training, judging whether the standard is valid, and if the standard is invalid, performing the pre-training again, and effectively extracting the model;

the third step: inputting terrain data, solving a model, and training the model;

the fourth step: and obtaining the maximum optimal solution of the path, judging whether the effectiveness in the model is improved or not, if not, ending, and if so, effectively extracting the model again.

Preferably, the blockchain multi-drone system includes the following: each unmanned aerial vehicle has a public key and a private key in a multi-unmanned aerial vehicle system, when the unmanned aerial vehicle needs to send information, the public key is needed to encrypt the information, and then other unmanned aerial vehicles need to obtain the information and need to decrypt the encrypted information by using the private keys of the other unmanned aerial vehicles.

Preferably, the cross-domain knowledge sharing includes the following:

the first step is as follows: finding out a newest block in the channel A, wherein the newest block stores the global model, and initializing parameters of the model;

the second step: the regional multi-unmanned aerial vehicle path planning system needs to update model parameters, firstly a channel application needs to be sent to a blockchain network, then the blockchain network returns a channel list to the regional multi-unmanned aerial vehicle path planning system, a monitoring system needs to register on a corresponding channel, then the regional multi-unmanned aerial vehicle path planning system obtains a public key and a private key, and the regional multi-unmanned aerial vehicle path planning system sends the model parameters and needs to use the public key to encrypt the parameters.

The third step: updating global model parameters of each channel, and downloading the latest block model of the channel by the local monitoring system to update the model parameters;

the fourth step: when the model loss function in the channel tends to be stable, updating the global model state Trie, and generating blocks;

the fifth step: the latest block is written in each channel, and the Trie is updated.

(III) advantageous effects

Compared with the prior art, the invention provides a multi-unmanned aerial vehicle path planning method based on a block chain and strengthened attention learning, which has the following beneficial effects:

1. this many unmanned aerial vehicle route planning method based on block chain and intensive attention study, this patent are in order to solve the learning method that exists in the many unmanned aerial vehicle route planning of tradition and mostly can't avoid optimizing iterative problem again, and adaptability is relatively poor, can't give corresponding solution fast. By using a novel end-to-end model enhanced attention mechanics training method in deep learning, the automatic learning capability of path planning of multiple unmanned aerial vehicles can be enhanced aiming at terrain data which changes continuously.

2. According to the multi-unmanned-plane path planning method based on the block chain and the attention-enhancing learning, the block chain is adopted for data storage and data management. Meanwhile, an asymmetric encryption mechanism in the block chain is introduced to provide safe one-to-one communication in the unmanned aerial vehicle network, and a knowledge sharing mechanism in the block chain is used to share knowledge in data collected by a plurality of unmanned aerial vehicles, so that the problem that the unmanned aerial vehicles only communicate locally can be well solved.

3. According to the multi-unmanned aerial vehicle path planning method based on the block chain and the attention-enhanced learning, the sequence model of the attention mechanism is added, so that the learning capacity of the unmanned aerial vehicle for coping with the data of the changed terrain can be enhanced, and the changeable terrain can be coped with more effectively. Meanwhile, a safety mechanism of a block chain is introduced, so that the information safety of a multi-computer system is improved, malicious information attack is prevented, and meanwhile, an asymmetric key mechanism is added, so that the safety and the stability of the system are greatly improved. Meanwhile, a block chain knowledge sharing mechanism is used, and the function of sharing the path planning knowledge of the multiple unmanned aerial vehicles is achieved by issuing a public key and a private key. The generalization ability of many unmanned aerial vehicle route planning can be improved.

Drawings

Fig. 1 is a schematic diagram of a multi-drone path planning model;

FIG. 2 is a schematic diagram of an encoder structure;

FIG. 3 is a schematic diagram of a decoder network;

fig. 4 is a schematic diagram of a multi-drone path training process;

fig. 5 is a schematic diagram of a multi-drone system based on a blockchain;

FIG. 6 is a block chain based asymmetric encryption information exchange algorithm;

fig. 7 is a schematic diagram of a multi-drone blockchain knowledge sharing model;

fig. 8 is a timing diagram illustrating knowledge sharing of a blockchain.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1 to 8, a method for planning paths of multiple drones based on block chains and enhanced attention learning includes the following steps:

for the multi-unmanned aerial vehicle cooperative reconnaissance mission, the threat and relevant position information of a target area are roughly evaluated before investigation, and the multi-unmanned aerial vehicle needs to reconnaissance on a plurality of target areas of an affected area in detail. Suppose N _V Unmanned aerial vehicle needs detailed detection with multiple N _T The target area of (1). At present, the cruising distance of the unmanned aerial vehicle and the threat degree of a target are comprehensively considered, and a model is established.

The patent describes the path planning problem of multi-unmanned aerial vehicle cooperation as the base, wherein the coordinates of the position where the unmanned aerial vehicle leaves and the position returned after completing the reconnaissance task are (x) ₀ ,y ₀ ) (ii) a Unmanned aerial vehicle assuming N exists _v Unmanned plane, through set V =

To show that the maximum cruising distance of the unmanned aerial vehicle is L. Detecting a target: suppose there is N _T Objects to be detected, using sets

Indicating that the set T = {0} < u > T is used ₀ Where 0 represents the initial position and landing position of the drone and the set T is used to represent nodes used locally for detection. The coordinate of any one target node i in the set T is (x) _i ,y _i ) D when the object i is detected,survival probability of unmanned aerial vehicle is P _i ∈[0.7,0.9](ii) a Set of paths, using pi ^v And { (i, j) | i, j ∈ T } represents the unmanned aerial vehicle reconnaissance path, wherein v represents the basic principle of safe flight.

Using decision variables

Decision variables representing a multi-drone cooperative reconnaissance mission:

the cruising distance, the survival probability and the execution capacity of the unmanned aerial vehicle are considered. This patent uses a relatively simple linear combination method, and this method uses different proportionality coefficients to combine each objective function in the unmanned aerial vehicle carries out the reconnaissance task, and the objective function is as follows:

min L(π)＝a*f ₁ +b*f ₂ +c*f ₃ ,0≤a,b,c≤a+b+c＝1 (2)

wherein f is ₁ ,f ₂ And f and ₃ the method is characterized in that three optimization objective functions in an unmanned aerial vehicle reconnaissance model are adopted, and a, b and c respectively correspond to loss function proportionality coefficients of the three objective functions.

Where equation (3) represents one visit by each drone to the node and equation (4) represents one visit by a drone to leave the node. When multiple drones are flying at high altitude for a long time, many limiting factors can occur. For example, if the cruising distance is too long, the mission may directly fail, first due to the limited power of the on-board battery. It may also cause communication failures due to interference from various external uncertainties, resulting in the inability to transmit valid data to the ground station. Therefore, to reduce the flight risk, the shorter the total flight distance of the drone, the better. Equation (5) indicates that the distance constraint of each drone cannot exceed its maximum distance.

Equation (6) describes the total flight range of all drones, which is one of the optimization objectives of the reconnaissance mission

Since the threat level of each node to be detected to the drone is not consistent, the drone must select the safest flight path when detecting multiple nodes, and equation (7) defines the survival coverage function of the drone

Wherein a is ^v Is the starting node of the unmanned aerial vehicle v reconnaissance path, b ^v Is the end node, P _i Is the survival probability of drone v when detecting node i. Assuming that drones pass through

nodes

1, 2 and 3, their survival probabilities are 0.9, 0.8 and 0.7, the survival coverage function of the drone at node 1 is 0.9, the survival coverage function of node 2 is 0.9 x 0.8=0.72, node 3 is 0.9 x 0.8 x 0.7=0.5, and the survival coverage function of the entire path of the multi-drone scout task is the cumulative sum of the survival coverage functions of the nodes, as shown in equation (8)

Equation (9) represents the survival coverage function f ₂ The optimization is to take the maximum value.

In the process of executing the actual task, in order to avoid the overload of the whole unmanned aerial vehicle, the load of each unmanned aerial vehicle is adjusted by introducing the node variance, as shown in formula (10).

min f ₃ ＝var(π ^v ) (10)

This patent establishes an end-to-end training model based on attention mechanism and solves many unmanned aerial vehicle path planning problem.

Wherein

π _t E T pi is a path plan given by constraints on the local terrain, s is an example of the current plan. In the end-to-end model, an encoder processes encoding of an input coordinate node, a decoder receives the input of the encoder, and the decoding process is constrained by a masking mechanism to give a reasonable path plan pi.

The encoder structure that this patent adopted is shown in fig. 2:

mapping the coordinates of node e to the 128-dimensional embedding layer using different parameters, by using parameter w ^x And b ^x To distinguish between bases and nodes.

The embedding of the input nodes is updated by N layers of attention layers, each consisting of two sublayers: the multi-head attention layer and the nodes are all connected with the feedforward layer. The sub-layers are connected by adopting a hop connection structure, so that the outstanding problem that the learning efficiency is reduced along with the increase of the number of network layers is solved. The calculation formulas are shown in formula (13) and formula (14).

Wherein

Defining as node embedding, wherein l belongs to { 1.,. N }; calculating image embedding by calculating the average value of the network output layer embedding, and transmitting the output layer embedding and the image embedding to a decoder;

the decoder generates a path plan pi by receiving the embedding, graph embedding and context embedding of the encoder output layer nodes, the structure is shown in fig. 3:

the decoding process is performed step by step, and as shown in fig. 3, at each time point t, the decoder generates context embedding based on the input nodes and the generated path plan at the time point t-1:

wherein

Is the current cruising distance of the drone, wherein

Is represented at pi _t The distance, the symbol, that the time performs this task.]Describing a horizontal join operator, this means to join three vectors horizontally into one and then use a multi-headed attention tier, where q is _(c) By

Calculation of k _i And v _i By

Computing

Similarity u _(c)i Can be calculated by the above formula, wherein d _k To set the similarity distance to 16 while masking nodes that do not satisfy the constraint and setting compatibility to- ∞, a single attention tier is used at the last tier of the network in order to derive the policy in equation (11)

And finally, obtaining the access probability of each node, and carrying out probability sampling on the current best-fit node through the following equation in the decoding process.

The model is pre-trained, and the strengthening algorithm used in the patent is a strategy gradient algorithm represented by the following formula:

π _θ ＝P(a|s,θ)≈π(a|s) (19)

to find the optimal strategy, a gradient formula is obtained by differentiating the objective function

Wherein

Is a scoring function, Q _π And (s, a) are operation state variable values. After the model is constructed, a loss function equation (21) is defined and trained. And modifying the target function of reinforcement learning to ensure that L (pi) achieves the purpose of optimizing planning:

in the enhancement algorithm, the base line b(s) can reduce the variance and improve the training speed of the model. In this patent, the baseline of the enhancement algorithm is M ← β M + (1- β) L (pi). The training process is shown in fig. 4:

multi-drone path training first defines a model and an optimization objective in a first step, and then trains it using a large amount of simulation data so that the model can mine the connection between the data and the optimization path. If the criteria are valid, the model can be extracted and its parameters fixed, then the real data of the terrain can be input to solve the unmanned aerial vehicle reconnaissance problem in real time. Meanwhile, an online training mechanism is introduced into the model, and the model is trained by using real data of the terrain, so that the effect is continuously improved, and the self-evolution of the model is realized.

Many unmanned aerial vehicle systems based on block chain

Some air collisions can occur in the current multi-unmanned aerial vehicle application, which is often caused by problems occurring in the coordination of unmanned aerial vehicles or hijacking of multi-unmanned aerial vehicle communication. Moreover, most current existing systems rely on local communication between adjacent member drones at the member drones of the drone network, and lack the global knowledge sharing that can trust drones.

To this end, this patent proposes a multi-drone system based on block chain storage, as shown in fig. 5:

the system uses the block chain as the unit for storing the information of unmanned aerial vehicle path planning, and simultaneously uses the asymmetric encryption method in the block chain as the encryption algorithm for unmanned aerial vehicle information exchange, as shown in fig. 6:

each unmanned aerial vehicle has a public key and a private key in a multi-unmanned aerial vehicle system, when the unmanned aerial vehicle needs to send information, the information needs to be encrypted by using the public key, and then other unmanned aerial vehicles need to obtain the information and need to decrypt the encrypted information by using the private keys of the other unmanned aerial vehicles. Such encryption mechanism can effectively prevent malicious unmanned aerial vehicle to obtain the information, has improved the security performance of whole unmanned aerial vehicle system, can prevent that unmanned aerial vehicle path planning information from being maliciously stolen, uses the memory function of block chain with, can effectively trace to the source to the information.

Meanwhile, a block chain knowledge sharing learning method is established, and the problem that the traditional multi-unmanned aerial vehicle lacks the functions of cross-domain knowledge sharing and common learning is solved.

The process of the multi-unmanned aerial vehicle path planning model for knowledge sharing is as follows:

the first step is as follows: and finding the latest block in the channel A, wherein the latest block stores the global model and initializes the parameters of the model.

The second step is that: the regional multi-unmanned aerial vehicle path planning system needs to update model parameters, firstly needs to send a channel application to the blockchain network, and then the blockchain network returns a channel list to the regional multi-unmanned aerial vehicle path planning system. The monitoring system needs to register on a corresponding channel, then the regional multi-unmanned-aerial-vehicle path planning system obtains a public key and a private key, and the regional multi-unmanned-aerial-vehicle path planning system sends model parameters and needs to use public key encryption parameters for the parameters.

The third step: and each channel updates the global model parameters, and the local monitoring system downloads the latest block model of the channel to update the model parameters.

The fourth step: and when the model loss function in the channel tends to be stable, updating the global model state Trie and generating the block.

The fifth step: the latest block is written in each channel, and the Trie is updated. The specific timing diagram is shown in fig. 8:

the patent provides a many unmanned aerial vehicle path planning algorithm based on block chain and intensive attention, through the sequence model who adds the attention mechanism, can strengthen the learning ability that unmanned aerial vehicle reply changes the topography data, can effectively reply changeable topography more. Meanwhile, a safety mechanism of a block chain is introduced, so that the information safety of the multi-computer system is improved, some malicious information attacks are prevented, and meanwhile, the safety and the stability of the system are greatly improved by adding an asymmetric key mechanism. Meanwhile, a block chain knowledge sharing mechanism is used, and the function of sharing the path planning knowledge of the multiple unmanned aerial vehicles is achieved by issuing a public key and a private key. The generalization ability of many unmanned aerial vehicle route planning can be improved.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A multi-unmanned aerial vehicle path planning method based on a block chain and attention-enhancing learning is characterized by comprising the following steps:

the first step is as follows: using decision variables

A decision variable representing a multi-drone cooperative scout mission;

the second step: different proportionality coefficients are used for combining each objective function in the process of executing the reconnaissance task by the unmanned aerial vehicle;

the third step: introducing a node variance to adjust the load of each unmanned aerial vehicle;

the fifth step: training a model;

the seventh step: and sharing cross-domain knowledge of multiple unmanned aerial vehicles and learning together.

2. The method for planning the path of multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 1, characterized in that: the objective function includes:

access by each drone to a node;

the unmanned aerial vehicle leaves the access of the node;

the distance constraint of each drone cannot exceed its maximum distance;

the total flight mileage of all unmanned aerial vehicles;

survival coverage function of drone.

3. The method for planning the path of the multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 2, wherein: the survival coverage function of the whole path of the multi-unmanned aerial vehicle reconnaissance task is the cumulative sum of the survival coverage functions of the nodes.

4. The method for planning the path of multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 1, characterized in that: the path planning in the fourth step is obtained by the following method:

s1: mapping coordinates of the nodes to a 128-dimensional embedding layer by using different parameters, and distinguishing bases and nodes by using a linear mapping layer of parameter sums;

5. The method for planning the path of the multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 1, wherein: the decoding process is carried out step by step, and at each time point t, the decoder generates context embedding based on the input nodes and the generated path plan at the time point t-1.

6. The method for planning the path of the multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 1, wherein: the method for training the model in the fifth step comprises the following steps:

the second step is that: generating simulation data, then performing pre-training, judging whether the standard is valid, and if the standard is invalid, performing the pre-training again, and effectively extracting the model;

7. The method for planning the path of the multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 1, wherein: the multi-drone system of the blockchain comprises the following: each unmanned aerial vehicle has a public key and a private key in a multi-unmanned aerial vehicle system, when the unmanned aerial vehicle needs to send information, the information needs to be encrypted by using the public key, and then other unmanned aerial vehicles need to obtain the information and need to decrypt the encrypted information by using the private keys of the other unmanned aerial vehicles.

8. The method for planning the path of multiple unmanned aerial vehicles based on the block chain and the enhanced attention learning of claim 7, wherein: cross-domain knowledge sharing includes the following:

the second step is that: the regional multi-unmanned aerial vehicle path planning system needs to update model parameters, firstly needs to send a channel application to a blockchain network, then the blockchain network returns a channel list to the regional multi-unmanned aerial vehicle path planning system, a monitoring system needs to register on a corresponding channel, then the regional multi-unmanned aerial vehicle path planning system obtains a public key and a private key, and the regional multi-unmanned aerial vehicle path planning system sends the model parameters and needs to use public key encryption parameters for the parameters.