CN112650290B - High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation - Google Patents

High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation Download PDF

Info

Publication number
CN112650290B
CN112650290B CN202011552686.7A CN202011552686A CN112650290B CN 112650290 B CN112650290 B CN 112650290B CN 202011552686 A CN202011552686 A CN 202011552686A CN 112650290 B CN112650290 B CN 112650290B
Authority
CN
China
Prior art keywords
unmanned aerial
aerial vehicle
disturbance
vehicle formation
optimal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011552686.7A
Other languages
Chinese (zh)
Other versions
CN112650290A (en
Inventor
李鸿一
郭子杰
鲁仁全
周琪
刘洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN202011552686.7A priority Critical patent/CN112650290B/en
Publication of CN112650290A publication Critical patent/CN112650290A/en
Application granted granted Critical
Publication of CN112650290B publication Critical patent/CN112650290B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/10Simultaneous control of position or course in three dimensions
    • G05D1/101Simultaneous control of position or course in three dimensions specially adapted for aircraft
    • G05D1/104Simultaneous control of position or course in three dimensions specially adapted for aircraft involving a plurality of aircrafts, e.g. formation flying
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Abstract

The utility model discloses a high-rise fire control unmanned aerial vehicle formation optimization control method and system with disturbance compensation, including: collecting operation state information of unmanned aerial vehicle formation; and inputting the acquired running state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimal control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimal control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is acquired through the optimal cooperative control strategy under no disturbance and the estimated external disturbance. The performance index is minimized while the formation of the unmanned aerial vehicle is maintained.

Description

High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation
Technical Field
The application relates to the technical field of artificial intelligence and control, in particular to a high-rise firefighting unmanned aerial vehicle formation optimization control method and system with disturbance compensation.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
Along with the continuous expansion of urban construction scale, how to extinguish fire in high-rise buildings and escape people in complex and changeable fire scene conditions is attracting more and more attention. The unmanned aerial vehicle is used for on-site fire investigation and personnel escape guidance, so that the unmanned aerial vehicle becomes an effective means. However, high-rise buildings often have complex building structures, rescue tasks are heavy, and the requirement of actual tasks cannot be met by using one unmanned aerial vehicle alone. Compared with a single unmanned aerial vehicle, the unmanned aerial vehicle cluster has the characteristics of cooperative control, capability complementation and the like, and the execution efficiency of tasks is greatly improved. Therefore, cooperative control of multiple unmanned aerial vehicle systems is a research hotspot in the field of unmanned aerial vehicles today. While multi-drone formation control is the most attractive direction of research among them.
How to obtain the optimal cooperative control law of unmanned aerial vehicle cluster formation, so that the unmanned aerial vehicle cluster can realize formation and simultaneously minimize performance indexes, and the unmanned aerial vehicle cluster formation method is a technical problem which is not solved at present.
Disclosure of Invention
In order to solve the problems, the method and the system for optimizing and controlling the formation of the high-rise fire-fighting unmanned aerial vehicle with disturbance compensation are provided, firstly, the optimal cooperative control strategy under the condition of no disturbance is obtained by taking the minimum performance index as a target, the external disturbance is estimated, the cooperative control strategy of the formation of the unmanned aerial vehicle is obtained through the optimal cooperative control strategy under the condition of no disturbance and the estimated external disturbance, and the minimum performance index is realized while the formation of the unmanned aerial vehicle is realized.
In order to achieve the above purpose, the present disclosure adopts the following technical scheme:
in a first aspect, a method for optimizing control of a high-rise unmanned aerial vehicle formation with disturbance compensation is provided, including:
collecting operation state information of unmanned aerial vehicle formation;
and inputting the acquired running state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimal control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimal control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is acquired through the optimal cooperative control strategy under no disturbance and the estimated external disturbance.
In a second aspect, a high-rise unmanned fire-fighting vehicle formation optimization control system with disturbance compensation is provided, including:
the data acquisition module is used for acquiring the running state information of the unmanned aerial vehicle formation;
the cooperative control strategy generation module is used for inputting the acquired operation state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimization control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimization control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under the condition of no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is obtained through the optimal cooperative control strategy under the condition of no disturbance and the estimated external disturbance.
In a third aspect, an electronic device is provided, including a memory and a processor, and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the steps described in the method for optimizing control of a high-rise fire unmanned aerial vehicle formation with disturbance compensation.
In a fourth aspect, a computer readable storage medium is provided for storing computer instructions that, when executed by a processor, perform the steps described in the method for optimizing control of a high-rise unmanned aerial vehicle formation with disturbance compensation.
Compared with the prior art, the beneficial effects of the present disclosure are:
1. according to the method, firstly, the optimal cooperative control strategy under the condition of no disturbance is obtained by taking the minimum performance index as a target, external disturbance is estimated, and the cooperative control strategy of unmanned aerial vehicle formation is obtained through the optimal cooperative control strategy under the condition of no disturbance and the estimated external disturbance, and the control strategy achieves the formation of unmanned aerial vehicle formation and simultaneously enables the performance index to be minimum.
2. The method and the device ensure that the unmanned aerial vehicle cluster is stable in consistency, the optimal control problem is also considered, the control precision and the control energy consumption can be balanced at the same time, and the method and the device have practical value.
3. The method adopts the disturbance observer to estimate external disturbance, realizes accurate compensation of the external disturbance, and can better improve control precision.
4. The method and the system adopt an evaluation network and a weight updating mode of an execution network, are simpler, overcome the limitation that most of optimal control methods based on reinforcement learning are required to meet continuous excitation conditions, and are easier to apply to engineering practice.
Additional aspects of the application will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the application.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the application.
Fig. 1 is a horizontal structural view of a tie machine-server cooperative formation involved in embodiment 1 of the present disclosure;
fig. 2 is a V-shaped formation and communication topology diagram of a drone cluster involved in embodiment 1 of the present disclosure;
fig. 3 is a formation trace diagram of a cluster of unmanned aerial vehicles involved in embodiment 1 of the present disclosure;
fig. 4 is an estimation error curve of the external disturbance involved in embodiment 1 of the present disclosure.
The specific embodiment is as follows:
the disclosure is further described below with reference to the drawings and examples.
It should be noted that the following detailed description is illustrative and is intended to provide further explanation of the application. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the present application. As used herein, the singular is also intended to include the plural unless the context clearly indicates otherwise, and furthermore, it is to be understood that the terms "comprises" and/or "comprising" when used in this specification are taken to specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof.
In the present disclosure, terms such as "upper", "lower", "left", "right", "front", "rear", "vertical", "horizontal", "side", "bottom", and the like indicate an azimuth or a positional relationship based on the azimuth or the positional relationship shown in the drawings, are merely relational terms determined for convenience in describing structural relationships of the various components or elements of the present disclosure, and do not denote any one of the components or elements of the present disclosure, and are not to be construed as limiting the present disclosure.
In the present disclosure, terms such as "fixedly coupled," "connected," and the like are to be construed broadly and refer to either a fixed connection or an integral or removable connection; can be directly connected or indirectly connected through an intermediate medium. The specific meaning of the terms in the disclosure may be determined according to circumstances, and should not be interpreted as limiting the disclosure, for relevant scientific research or a person skilled in the art.
Example 1
In recent years, when solving unmanned aerial vehicle cluster formation control, it is always desirable to design an optimal cooperative control law so as to minimize performance indexes while the unmanned aerial vehicle cluster realizes formation. In a physical sense, optimal cooperative control considers how each drone reaches a formation state with minimal energy. From a mathematical point of view, the optimal control problem eventually translates into solving the hamilton-jacobian-bellman equation, which is a sufficient condition for the existence of an optimal solution. However, for nonlinear systems such as unmanned aerial vehicles, the hamilton-jacobian-bellman equation is effectively a nonlinear partial differential equation that is generally difficult to solve mathematically. The reinforcement learning method provides an effective method for solving the nonlinear partial differential equation.
In a complex fire scene environment, external disturbance suffered by the unmanned aerial vehicle is more severe than that in a normal environment, so that the performance of the whole unmanned aerial vehicle cluster is often reduced, the system stability is affected, and unpredictable loss is caused. Therefore, it is particularly important to improve the external disturbance resistance of the unmanned aerial vehicle cluster. Estimating unknown external disturbances suffered by a system using disturbance observers has become a common approach in the control arts. The disturbance observer is combined to construct the anti-disturbance compensation controller to compensate the influence of external disturbance on the system, so that the system is not only feasible in theory, but also easier to realize in practice.
In this embodiment, a high-rise fire unmanned aerial vehicle formation optimization control method with disturbance compensation is disclosed, including:
collecting operation state information of unmanned aerial vehicle formation;
and inputting the acquired running state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimal control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimal control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is acquired through the optimal cooperative control strategy under no disturbance and the estimated external disturbance.
Further, the construction process of the distributed Hamiltonian-Jacobian-Belman equation under no disturbance is as follows:
establishing a mathematical model of an unmanned aerial vehicle nominal system under the condition of no disturbance for each unmanned aerial vehicle in unmanned aerial vehicle formation;
determining a communication topological structure of the unmanned aerial vehicle in unmanned aerial vehicle formation;
constructing a cooperative consistency error function of the plane in unmanned aerial vehicle formation through a nominal system mathematical model of the unmanned aerial vehicle and a communication topological structure of the unmanned aerial vehicle;
defining a performance index function of the plane by a collaborative consistency error function of the plane, and defining an optimal performance index function by taking the minimum performance index as a target;
defining a system Hamiltonian function through a performance index function of a wing plane to obtain a distributed optimal cooperative control law;
and constructing a distributed Hamiltonian-Jacobian-Bellman equation under no disturbance through the optimal performance index function and the distributed optimal cooperative control law.
Further, the performance index function of the wing plane comprehensively considers the system consistency error of the wing plane and the control input of the wing plane.
Further, a reinforcement learning method of an execution-evaluation dual-network structure is adopted to solve a distributed Hamiltonian-Jacobian-Bellman equation under the condition of no disturbance, and an optimal cooperative control strategy under the condition of no disturbance is obtained.
Furthermore, according to the best principle of Bellman, the Hamiltonian is solved, and the distributed best cooperative control law is obtained.
Further, a directed graph is used to describe the communication topology between the drones.
Further, the operation state information of the unmanned aerial vehicle formation includes a position, a speed, a heading angle and a yaw rate of the unmanned aerial vehicle.
The construction process of the distributed Hamiltonian-Jacobian-Bellman equation without disturbance is described in detail, and the construction process comprises the following steps:
s1, establishing a mathematical model of a nominal system of the high-rise firefighting four-rotor unmanned aerial vehicle under the condition of no disturbance;
s2, determining a communication topological structure among all unmanned aerial vehicles in unmanned aerial vehicle formation;
s3, defining proper cooperative consistency errors and performance index functions aiming at the i-th plane nominal system;
and S4, deducing a distributed Hamiltonian-Jacobian-Belman equation.
Step S1, establishing a mathematical model of a nominal system of the high-rise firefighting four-rotor unmanned aerial vehicle under the condition of no disturbance, wherein the mathematical model is specifically as follows:
first, to simplify the presentation of the problem, attention is paid only to the queuing behavior of four-rotor unmanned aerial vehicles, assuming that each unmanned aerial vehicle is equipped with a throttle, a rudder and a homoautopilot. Complex aerodynamic characteristics of the aircraft can be omitted, and the unmanned aerial vehicle model is simplified. Assuming that each unmanned aerial vehicle is on the same horizontal plane, the simplified mathematical model of the single unmanned aerial vehicle is obtained as follows:
wherein x is i ,y i Representing the position in the east and north directions of the established coordinate system. v i ,θ i ,ω i Speed, heading angle and yaw rate, respectively. u (u) i,1 ,u i,2 Is a control input. d, d i,1 ,d i,2 Is an external disturbance suffered by the system.
Let X i =[x i ,y ii ,v ii ] T ,f(X i )=[v i cosθ i ,v i sinθ ii ,0,0] T ,
g(X i )=[0,0;0,0;0,0;1,0;0,1],u i =[u i,1 ,u i,2 ] T ,d i =[d i,1 ,d i,2 ] T The mathematical model can be further reduced to:
neglecting external disturbance, the nominal system mathematical model of the unmanned aerial vehicle under the condition of no disturbance is described as follows:
step S2, determining the communication topological structure among the unmanned aerial vehicles in the unmanned aerial vehicle formation as follows:
describing a communication connection relationship between unmanned aerial vehicles in formation by using a directed graph G= { V, E, A };
where v= {0,1,..n } represents the set of nodes in graph G,represents a set of directed edges in the graph, a= [ a ] ij ]∈R n×n A weight matrix representing a directed graph G;
if the plane i can receive information from plane j, then a ij =1 (i+.j), otherwise, a ij =0; defining the neighbor node of node i as N i = { j E v| (i, j) E, i+.j }, the ingress matrix D is d=diag { h } 1 ,...h N }, whereinSuppose a ii =0, directed graph is a strict join; the connection between the assistant i and the collar is represented as a diagonal matrix b=diag { B 1 ,...,b N If the bureau i is able to receive information from the leader, b i =1, otherwise b i =0。
Step S3, defining a proper cooperative consistency error function and a performance index function specifically as follows for an ith plane nominal system:
definition of the coordinated coherence error delta of the plane i i The method comprises the following steps:
wherein, kappa ij ∈R n Representing predefined formation status information between the plane i and the plane j. Kappa (kappa) i0 ∈R n Representing pre-defined formation status information between the leader i and the leader. Two pairs ofAll are constants. Then a consistency error dynamic systemThe method comprises the following steps:
wherein,
defining a performance index function J of a plane i ii (t)) is:
the performance index function comprehensively considers the consistency error delta i And control inputs of the plane i by finding the optimal control strategy u ii ) The minimization of the performance index function is realized, and the consistent stability of the unmanned aerial vehicle cluster under the condition of no disturbance can be ensured.
Step S4 derives the distributed hamilton-jacobian-bellman equation as follows:
defining optimal performance index function with minimum performance index as targetThe method comprises the following steps:
defining a synergistic hamiltonian H ii ,u i ) The method comprises the following steps:
according to BellmanPrinciple of optimality, byCan obtain distributed optimal cooperative control law>The following are provided:
the corresponding distributed hamilton-jacobian-bellman equation is then:
the distributed hamilton-jacobian-bellman equation is in fact a nonlinear partial differential equation, whose analytical solution is difficult to obtain. The reinforcement learning method of the execution-evaluation dual-network structure provides an effective means for overcoming the difficulty.
The execution-evaluation dual-network structure is adopted to approximate the optimal cooperative control law and the performance index function respectively, and the distributed Hamiltonian-Jacobian-Bellman equation is solved to obtain the optimal cooperative control strategy under the condition of no disturbance, and the method comprises the following steps:
can be decomposed into two parts
γ i >0 is a design constant that is set to be equal to the design constant,then
The evaluation network is used to approximate the performance index function:
is to evaluate the network weight,/-, for>Is an excitation function. The evaluation network weight update law is designed as follows:
wherein, kappa ci >0 is a design parameter.
Executing a network to approximate optimal control inputs
To perform network weights. The execution network weight update law is:
wherein, kappa ai >0 is a design parameter.
Solving the obtained optimal control inputIs the optimal cooperative control strategy under the condition of no disturbance.
The disturbance observer is designed to estimate the external disturbance, and thus the disturbance compensator is obtained to cancel the influence of the external disturbance as follows:
the design of the distributed cooperative control law is carried out under the condition of no external disturbance, and then the distributed optimal cooperative robust control law based on disturbance compensation of a disturbance observer is designed to ensure the stability of the unmanned aerial vehicle cluster under the condition of suffering from the external disturbance. The disturbance observer is designed as follows:
wherein,is an estimated value of external disturbance, Z i Is an auxiliary variable, K 0 >0 is the disturbance observer gain matrix.
Thus, the distributed optimal cooperative robust control law concrete form u i The following are provided:
solved u i And a cooperative control strategy for unmanned aerial vehicle formation.
In order to confirm the effectiveness of this example, simulation experiments were performed as follows:
in the simulation experiment, the control target is to design a cooperative control strategy of unmanned aerial vehicle formation, namely an optimal distributed cooperative control law of unmanned aerial vehicle formation, so that a bureau can track a lead machine in a certain formation. The control inputs of the collar machine adopted in the example are as follows:
wherein the time constant τ v =3,τ ω =2,v com =10m/s sumThe initial value of the system state is selected as Formation information k 10 =[-3,3,0,0,0],κ 21 =[-3,3,0,0,0],κ 30 =[-3,-3,0,0,0],κ 43 =[-3,-3,0,0,0]. External disturbance of d 1 =[sin(t);cos(t)],d 2 =[2sin(t);2cos(t)],d 3 =[1.5sin(t);1.5cos(t)],d 4 =[2.5sin(t);2.5cos(t)]. Design parameter gamma 1 =γ 2 =γ 3 =γ 4 =40,κ c1 =κ c2 =κ c3 =κ c4 =1,κ a1 =κ a2 =κ a3 =κ a4 =0.6。
Analysis of results:
selecting a Liapunov function:
deriving available->According to the lisapunov stability theorem, the parameter gamma is adjusted i ,κ ci ,κ ai Can make the system allThe signals are semi-globally consistent and bounded, and the unmanned aerial vehicle cluster consistency error can be converged into a neighborhood centered on the origin.
From fig. 3 it can be seen that the bureau can track the track of the upper leader very well in case of external disturbance and can maintain the preset formation shape. As can be seen from fig. 4, the disturbance observer is able to estimate the external disturbance of the system well, the estimation error being consistent and eventually bounded.
The reinforcement learning method based on the execution-evaluation dual-network structure realizes the formation optimization control of the high-rise fire unmanned aerial vehicle under the directed communication network, and considers the influence of external disturbance. Based on the co-operation mode of the lead-the-plane, the communication topological structure and the formation information shown in fig. 1 and the communication topological structure and the formation information shown in fig. 2, an unmanned plane cluster co-operation consistency error dynamic equation is established. And selecting a proper performance index function, so that the scheme can comprehensively consider the balance between control precision and control energy consumption. Based on the reinforcement learning idea, the network approximation control input is executed by adopting the evaluation network approximation performance index function, and a simplified weight updating law is designed, so that the limitation that the continuous excitation condition must be met is overcome. Based on a disturbance observer, a compensation controller is provided to achieve the purpose of counteracting the influence of external disturbance. And finally, applying the control method of the cooperative distributed optimal control and disturbance compensation control under the condition of no disturbance to the formation optimal control of the high-rise unmanned aerial vehicle, and verifying the effectiveness of the method.
The embodiment discloses a high-rise fire-fighting unmanned aerial vehicle formation optimization control method with disturbance compensation, and a distributed self-adaptive optimal cooperative robust control scheme is designed aiming at unmanned aerial vehicle formation in a tractor-plane cooperative mode and considering the situation that unmanned aerial vehicles suffer external disturbance. The proposed controller can be divided into two parts: (1) an optimal cooperative control strategy in the absence of disturbances; (2) a disturbance compensator. Firstly, solving a distributed Hamiltonian-Jacobian-Bellman equation by adopting a reinforcement learning method of an execution-evaluation dual-network structure aiming at each bureau, and further obtaining a distributed optimal cooperative control strategy. Subsequently, a disturbance compensator is derived based on the disturbance observer to eliminate the influence of external disturbance. The whole controller not only ensures that all signals in the unmanned aerial vehicle system are finally consistent and bounded, but also ensures that the cooperative cost function is minimized.
The method and the device for controlling the unmanned aerial vehicle cluster have the advantages that on the basis of guaranteeing that the unmanned aerial vehicle cluster keeps consistent and stable, the optimal control problem is also considered, the control precision and the control energy consumption can be balanced at the same time, and the method and the device have practical values.
The disturbance observer is adopted to estimate external disturbance, so that accurate compensation of the external disturbance is realized, and the control precision can be better improved.
The evaluation network and the weight updating mode of the execution network are simpler, and the limitation that the continuous excitation condition is met necessary for most of the optimal control methods based on reinforcement learning is overcome, so that the method is easier to apply to engineering practice.
Example 2
In this embodiment, a high-rise fire unmanned aerial vehicle formation optimization control system with disturbance compensation is disclosed, comprising:
the data acquisition module is used for acquiring the running state information of the unmanned aerial vehicle formation;
the cooperative control strategy generation module is used for inputting the acquired operation state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimization control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimization control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under the condition of no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is obtained through the optimal cooperative control strategy under the condition of no disturbance and the estimated external disturbance.
Example 3
In this embodiment, an electronic device is disclosed that includes a memory and a processor, and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the steps described in the method for optimizing control of a high-rise fire unmanned aerial vehicle formation with disturbance compensation disclosed in embodiment 1.
Example 4
In this embodiment, a computer readable storage medium is disclosed for storing computer instructions that, when executed by a processor, perform the steps described in the high-rise fire unmanned aerial vehicle formation optimization control method with disturbance compensation disclosed in embodiment 1.
The above is only a preferred embodiment of the present application, and is not intended to limit the present application, but various modifications and variations can be made to the present application by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the protection scope of the present application.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical aspects of the present application and not for limiting the same, and although the present application has been described in detail with reference to the above embodiments, it should be understood by those of ordinary skill in the art that: modifications and equivalents may be made to the specific embodiments of the application without departing from the spirit and scope of the application, which is intended to be covered by the claims.

Claims (9)

1. The high-rise unmanned aerial vehicle formation optimization control method with disturbance compensation is characterized by comprising the following steps of:
collecting operation state information of unmanned aerial vehicle formation;
inputting the acquired running state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimal control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimal control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is obtained by the optimal cooperative control strategy under no disturbance and the estimated external disturbance, wherein the distributed Hamiltonian-Jacobian equation under no disturbance is solved by adopting a reinforcement learning method for executing and evaluating a double-network structure to obtain the optimal cooperative control strategy under no disturbance, and the method is as follows:
is decomposed into two parts:
wherein, gamma i >0 is a design constant, ">δ i For synergistic consistency error, ++>a ij Indicating whether or not the plane i can receive information from plane j, b i Indicating whether the bureau i can receive information from the leader->Is an optimal performance index function;
the optimal cooperative control law isWherein the system state value X i =[x i ,y ii ,v ii ] T ,x i ,y i Representing the positions of the established coordinate system in the east and north directions; v i ,θ i ,ω i Speed, heading angle and yaw rate, respectively; g T (X i ) Representing a transpose of the system control gain matrix;
the evaluation network is used to approximate the performance index function:wherein (1)>Is to evaluate the network weight,/-, for>For the excitation function +.>Delta as an approximately optimal performance index function i (t) represents a collaborative consistency error function;
the evaluation network weight update law is designed as follows:
wherein, kappa ci >0 is a design parameter;
executing a network to approximate optimal control inputs
Wherein (1)>To perform network weights;
the execution network weight update law is:
wherein, kappa ai >0 is a design parameter;
solving the obtained optimal control inputIs the optimal cooperative control strategy under the condition of no disturbance.
2. The method for optimizing control of high-rise unmanned aerial vehicle formation with disturbance compensation according to claim 1, wherein the construction process of the distributed hamilton-jacobian-bellman equation without disturbance is as follows:
establishing a mathematical model of an unmanned aerial vehicle nominal system under the condition of no disturbance for each unmanned aerial vehicle in unmanned aerial vehicle formation;
determining a communication topological structure of the unmanned aerial vehicle in unmanned aerial vehicle formation;
constructing a performance index function of the unmanned aerial vehicle in unmanned aerial vehicle formation through a nominal system mathematical model of the unmanned aerial vehicle and a communication topological structure of the unmanned aerial vehicle;
and constructing a distributed Hamiltonian-Jacobian-Bellman equation under no disturbance through a performance index function of the plane.
3. The method for optimizing control of a high-rise fire-fighting unmanned aerial vehicle formation with disturbance compensation according to claim 2, wherein the performance index function of the plane comprehensively considers the system consistency error of the plane and the control input of the plane.
4. The method for optimizing control of high-rise unmanned aerial vehicle formation with disturbance compensation according to claim 2, wherein the Hamiltonian function is solved according to the principle of optimality of Belman to obtain a distributed optimal cooperative control law.
5. The method for optimized control of high-rise fire unmanned aerial vehicle formation with disturbance compensation according to claim 1, wherein the communication topology between unmanned aerial vehicles is described by using a directed graph.
6. The method for optimized control of high-rise fire unmanned aerial vehicle formation with disturbance compensation according to claim 1, wherein the operation state information of unmanned aerial vehicle formation includes position, speed, heading angle and yaw rate of unmanned aerial vehicle.
7. High-rise fire control unmanned aerial vehicle formation optimal control system with disturbance compensation, its characterized in that includes:
the data acquisition module is used for acquiring the running state information of the unmanned aerial vehicle formation;
the cooperative control strategy generation module is used for inputting the acquired operation state information of the unmanned aerial vehicle formation into an unmanned aerial vehicle formation optimization control model to acquire a cooperative control strategy of the unmanned aerial vehicle formation, wherein the unmanned aerial vehicle formation optimization control model comprises a distributed Hamiltonian-Jacobian-Belman equation under no disturbance and a disturbance observer model, the distributed Hamiltonian-Jacobian-Belman equation under no disturbance aims at the minimum performance index, the optimal cooperative control strategy under the condition of no disturbance is obtained by solving, the external disturbance is estimated through the disturbance observer model, and the cooperative control strategy of the unmanned aerial vehicle formation is obtained through the optimal cooperative control strategy under the condition of no disturbance and the estimated external disturbance, and the cooperative control strategy of the unmanned aerial vehicle formation is specifically:is decomposed into two parts:
wherein, gamma i >0 is a design constant, ">δ i For synergistic consistency error, ++>a ij Indicating whether or not the plane i can receive information from plane j, b i Indicating whether the bureau i can receive information from the leader->Is an optimal performance index function;
the optimal cooperative control law isWherein the system state value X i =[x i ,y ii ,v ii ] T ,x i ,y i Representing the positions of the established coordinate system in the east and north directions; v i ,θ i ,ω i Speed, heading angle and yaw rate, respectively; g T (X i ) Representing a transpose of the system control gain matrix;
the evaluation network is used to approximate the performance index function:wherein (1)>Is to evaluate the network weight,/-, for>Is an excitation function; />Delta as an approximately optimal performance index function i (t) represents a collaborative consistency error function;
the evaluation network weight update law is designed as follows:
wherein, kappa ci >0 is a design parameter;
executing a network to approximate optimal control inputs
Wherein (1)>To perform network weights;
the execution network weight update law is:
wherein, kappa ai >0 is a design parameter;
solving the obtained optimal control inputIs the optimal cooperative control strategy under the condition of no disturbance.
8. An electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the steps of the high-rise fire unmanned aerial vehicle formation optimization control method with disturbance compensation of any one of claims 1-6.
9. A computer readable storage medium storing computer instructions which, when executed by a processor, perform the steps of the high-rise unmanned fire aircraft formation optimization control method with disturbance compensation of any one of claims 1-6.
CN202011552686.7A 2020-12-24 2020-12-24 High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation Active CN112650290B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011552686.7A CN112650290B (en) 2020-12-24 2020-12-24 High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011552686.7A CN112650290B (en) 2020-12-24 2020-12-24 High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation

Publications (2)

Publication Number Publication Date
CN112650290A CN112650290A (en) 2021-04-13
CN112650290B true CN112650290B (en) 2023-12-08

Family

ID=75360257

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011552686.7A Active CN112650290B (en) 2020-12-24 2020-12-24 High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation

Country Status (1)

Country Link
CN (1) CN112650290B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113359733B (en) * 2021-06-15 2022-08-02 东北大学秦皇岛分校 Finite time vehicle formation control method with preset transient and steady state performance
CN114020038B (en) * 2021-12-07 2023-08-15 广东工业大学 Self-adaptive sliding mode attitude tracking control method and system for multi-quad-rotor unmanned aerial vehicle
CN115268481B (en) * 2022-07-06 2023-06-20 中国航空工业集团公司沈阳飞机设计研究所 Unmanned aerial vehicle countermeasure policy decision-making method and system thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108828949A (en) * 2018-07-20 2018-11-16 南京航空航天大学 A kind of optimally in distributed mode collaboration fault tolerant control method based on adaptive Dynamic Programming
CN110782011A (en) * 2019-10-21 2020-02-11 辽宁石油化工大学 Networked multi-agent system distributed optimization control method based on reinforcement learning
CN111308896A (en) * 2015-05-25 2020-06-19 中国科学院自动化研究所 Nonlinear system self-adaptive optimal control method based on variable error

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111308896A (en) * 2015-05-25 2020-06-19 中国科学院自动化研究所 Nonlinear system self-adaptive optimal control method based on variable error
CN108828949A (en) * 2018-07-20 2018-11-16 南京航空航天大学 A kind of optimally in distributed mode collaboration fault tolerant control method based on adaptive Dynamic Programming
CN110782011A (en) * 2019-10-21 2020-02-11 辽宁石油化工大学 Networked multi-agent system distributed optimization control method based on reinforcement learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
无人机编队的自适应最优协同容错控制律研究;戴姣;《中国优秀硕士学位论文全文数据库(电子期刊)》;第C031-732页 *

Also Published As

Publication number Publication date
CN112650290A (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN112650290B (en) High-rise fire unmanned aerial vehicle formation optimization control method and system with disturbance compensation
Fu et al. Distributed formation navigation of constrained second-order multiagent systems with collision avoidance and connectivity maintenance
CN109508035B (en) Multi-region hierarchical unmanned aerial vehicle formation path planning method based on distributed control
CN111638726A (en) Event trigger communication-based multi-unmanned aerial vehicle formation consistency control method
CN107992090A (en) A kind of adaptive formation method applied to networking swarm intelligence system system
CN113342037B (en) Multi-rotor unmanned aerial vehicle time-varying formation control method and system with input saturation
CN111522361A (en) Multi-unmanned aerial vehicle formation consistency control method in master-slave mode
CN112925346A (en) High-rise fire-fighting multi-unmanned aerial vehicle fault-tolerant control method and system based on man-in-the-loop
CN110658821A (en) Multi-robot anti-interference grouping time-varying formation control method and system
Mehrabian et al. Distributed formation recovery control of heterogeneous multiagent euler–lagrange systems subject to network switching and diagnostic imperfections
Yang et al. Event-triggered finite-time formation control for multiple unmanned aerial vehicles with input saturation
CN112947086B (en) Self-adaptive compensation method for actuator faults in formation control of heterogeneous multi-agent system consisting of unmanned aerial vehicle and unmanned vehicle
CN112859913B (en) Multi-quad-rotor unmanned helicopter attitude consistency optimal control method considering output constraint
CN115220472B (en) Fault-tolerant control method of air-ground heterogeneous formation system under switching topology
Ali et al. A leader-follower formation control of multi-UAVs via an adaptive hybrid controller
Zaidi et al. Distributed observer-based leader following consensus tracking protocol for a swarm of drones
CN114237041A (en) Air-ground cooperative fixed time fault-tolerant control method based on preset performance
Xu et al. Anti-disturbance fault-tolerant formation containment control for multiple autonomous underwater vehicles with actuator faults
Wang et al. Adaptive consensus tracking control of uncertain nonlinear systems: A first-order example
Huang et al. Decentralized direct adaptive output feedback fuzzy H∞ tracking design of large-scale nonaffine nonlinear systems
Jain et al. Self-triggered cooperative path following control of fixed wing unmanned aerial vehicles
Gkesoulis et al. PdI regulation for consensus: Application to unknown pure-feedback agents with state and communication delays
CN116466588A (en) Finite time-varying formation tracking control method and system for multi-agent system
Liu et al. Distributed adaptive bipartite containment control of linear multi-agent systems with structurally balanced graph
CN111176324B (en) Method for avoiding dynamic obstacle by multi-unmanned aerial vehicle distributed collaborative formation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant