CN111580389A - Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning - Google Patents

Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning Download PDF

Info

Publication number
CN111580389A
CN111580389A CN202010433327.3A CN202010433327A CN111580389A CN 111580389 A CN111580389 A CN 111580389A CN 202010433327 A CN202010433327 A CN 202010433327A CN 111580389 A CN111580389 A CN 111580389A
Authority
CN
China
Prior art keywords
control
deep learning
training
neural network
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010433327.3A
Other languages
Chinese (zh)
Other versions
CN111580389B (en
Inventor
张聚
施超
吴崇坚
牛彦
潘伟栋
陈德臣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202010433327.3A priority Critical patent/CN111580389B/en
Publication of CN111580389A publication Critical patent/CN111580389A/en
Application granted granted Critical
Publication of CN111580389B publication Critical patent/CN111580389B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/10Geometric CAD
    • G06F30/15Vehicle, aircraft or watercraft design
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Automation & Control Theory (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Hardware Design (AREA)
  • Pure & Applied Mathematics (AREA)
  • Biophysics (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Optimization (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Feedback Control In General (AREA)

Abstract

The three-degree-of-freedom helicopter explicit model prediction control based on deep learning comprises the following steps: step 1) firstly, analyzing and dynamically modeling a helicopter system; step 2) data acquisition and processing are carried out through an explicit model predictive control algorithm; step 3), constructing and training an EMPC-based neural network; and 4) model verification and deep learning network application. The invention solves the problems of high calculation resource requirement and long calculation time of the traditional model predictive control, effectively improves the control precision and the prediction accuracy rate and improves the calculation efficiency by combining with deep learning.

Description

Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning
Technical Field
The invention relates to a helicopter control method, in particular to a three-degree-of-freedom helicopter model prediction control method based on deep learning.
Background
With the rapid development of modern science and technology and artificial intelligence, the unmanned helicopter is one of the unique creations, and the application range of the aircraft is greatly expanded. The unmanned helicopter has the advantages that the unmanned helicopter can fly in any direction, can vertically take off and land, can hover to monitor a preset target at a fixed point, and can also rotate for 360 degrees at the fixed point; the overall dimension is small, the radar transmitting area is small, and the flight safety is high; the requirement on take-off and landing sites is low; the ground equipment is simple, the erection and withdrawal time is short, and the like, so that the unmanned helicopter which is reliable to research and has low cost becomes a main trend particularly along with the development of microelectronic technology and micro sensors. Due to the characteristics, the compound has wide application and development prospect. The method has wide application in military field such as ground attack, landing, weapon transportation, logistics support and the like. The method is applied to short-distance transportation, medical rescue, disaster relief and lifesaving, emergency rescue, hoisting equipment, geological exploration, forest protection and fire extinguishing, aerial photography and the like in the civil aspect. When the helicopter is controlled, a good control strategy is selected to be crucial, under the condition that the system is normal, the normal operation of the system is ensured, the calculation cost is reduced, and meanwhile, the improvement of the performance index of the system is the key of system evaluation and analysis.
For many years, experts and scholars in the flight control industry have conducted extensive research on the control strategy of an airplane and proposed some more typical advanced control methods, and a Model Predictive Control (MPC) method is a very effective method and mainly comprises three elements: however, MPC is generally applied to systems with slow dynamic characteristics, and for nonlinear systems (such as helicopters), MPC needs to solve nonlinear performance indexes at each sampling transient moment, and has large demand on computing resources and low computational efficiency.
Explicit predictive control is a fast MPC algorithm for small-scale control proposition proposed by Bemporad et al in 2002. The main idea is to put the online calculation of the optimized solution into offline operation through the idea of parameter planning, thereby improving the speed of online calculation. When solving a multi-parameter planning (mp-QP) problem, the conventional explicit model predictive control is usually a geometric-based solution, i.e., convex division is performed on a state region of a system, an explicit functional relationship (which is a linear control law of states) between an optimal control law corresponding to an optimization problem on each state partition and the state is established, and a finally obtained parameter solution is the control law corresponding to each state critical domain; however, the method has limitations, and on one hand, whether the properties of adjacent surfaces between critical domains are satisfied or not must be considered, on the other hand, the step length must be set reasonably, and all constraint sets must be distinguished one by one, so that the calculation efficiency for solving the problem is not high, and the calculation amount is large.
The model prediction control based on deep learning effectively combines the model prediction control and the neural network, and the most outstanding characteristic of the neural network is that the neural network has excellent mapping approximation capability and excellent learning generalization capability, so that the model prediction part in the prediction control can more accurately approximate to an actual process object by training and learning self-adaptive adjustment parameters and adopting a gradient descent algorithm and continuously updating a weight value and a bias value, and the control precision of the whole prediction control algorithm is improved. Neural networks are ideal choices for processing control objects into higher-order, multivariate, time-varying, nonlinear, strongly coupled models. By utilizing the characteristics of the model prediction algorithm that the control result is predicted first and then the behavior is controlled, and by means of the advantages of deep learning in multi-parameter optimization, the model prediction control algorithm for deep learning is provided, the requirements of low computing resources and optimized computation are met, the computing efficiency is improved, and the computation of the past model prediction control is compensated. And after the training is finished, the three-degree-of-freedom helicopter is controlled by using a deep learning neural network instead of model prediction.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides an explicit model prediction control method based on deep learning, model prediction control and a neural network are effectively combined by introducing a deep learning concept, and the most prominent characteristic of the neural network is that the neural network has excellent mapping approximation capability and excellent learning generalization capability, so that a model prediction part in prediction control can more accurately approximate to an actual process object by training and learning self-adaptive adjustment parameters and adopting a gradient descent algorithm and continuously updating a weight value and a bias value, and the control precision of the whole prediction control algorithm is improved.
When the control is carried out by using the deep learning explicit model predictive control, except that the explicit model predictive control is required to be used for carrying out online solving and optimizing problems for the first time, the control laws of all state partitions are calculated offline, the partition where the current system state is located is found online, and the optimal control quantity of the current parameter is determined. And once the training is finished, the deep learning neural network is used for replacing model prediction to control the three-degree-of-freedom helicopter. Neural networks are ideal choices for processing control objects into higher-order, multivariate, time-varying, nonlinear, strongly coupled models. The model predictive control algorithm for deep learning is provided by using the characteristics of the model predictive algorithm that the control result is predicted first and then the behavior is controlled, and by means of the advantages of deep learning in multi-parameter optimization, the requirements of low computing resources and optimized computation are met, the computing efficiency is improved, and the computing burden of the past model predictive control is made up.
The control method of the explicit prediction model based on the deep learning is divided into three parts of data acquisition, model training and model prediction in an algorithm. The method comprises the steps of obtaining data through two working processes of off-line calculation and on-line calculation by using an explicit model prediction control method during data acquisition, establishing a model for model training to finally obtain an ideal model, predicting the model in the last part, and controlling the three-degree-of-freedom helicopter by using a deep learning neural network instead of the original model prediction when the expected target is achieved.
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention are further described below. The three-degree-of-freedom helicopter explicit model prediction control method based on deep learning comprises the following steps:
step 1) firstly, analyzing and dynamically modeling a helicopter system: and analyzing the stress condition corresponding to each shaft and component in the operation process of the helicopter system.
The three-degree-of-freedom helicopter system space state equation is as follows:
Figure BDA0002501306120000031
according to the modeling analysis of the helicopter, the altitude angle, the pitch angle p, the rotation angle r and their respective differential altitude angular velocities are selected
Figure BDA0002501306120000032
Pitch angular velocity
Figure BDA0002501306120000033
And rotational angular velocity
Figure BDA0002501306120000034
As state vectors, i.e.
Figure BDA0002501306120000035
The voltage of front and rear motors of the three-freedom-degree helicopter is selected as an input vector, namely u ═ VfVb]TAltitude, pitch angle p, rotation angle r as output vector y ═ p r]T. Substituting the values in the specific relevant parameter table to obtain the coefficients of the A, B and C state equations as follows:
Figure BDA0002501306120000036
step 2) data acquisition and processing are carried out through an explicit model predictive control algorithm: the main idea of off-line computation of explicit predictive control is multi-parameter planning, thus constraining the optimal control problem:
Figure BDA0002501306120000037
s.t.Exk+Luk≤M,k=0,1,···N-1 (1b)
xk+1=Axk+Bxk,k≥0 (1c)
x0=x(0),xN∈χf(1d)
wherein P, Q is not less than 0, R is more than 0, xNIs terminal state, ukAnd xkRespectively, the control variable and the state variable, U, of each sampling pointNIn order to optimize the control vector,
Figure BDA0002501306120000041
for the set of all feasible solutions x (0), xN∈χfIs the final state constraint region χfIs a convex set.
However, the optimal control vector U in the optimization problem (1)NDepending only on the current system state x (0), since (1c) is linear, it can be translated into
Figure BDA0002501306120000042
Substituting (2) into (1) can transform the constrained optimal control problem into the following optimization problem:
Figure BDA0002501306120000043
s.t.GUN≤W+Ex(0) (3b)
wherein Y, H, F, G, W, E can be obtained from (1). The multi-parameter quadratic programming problem solution is in the form of a piecewise affine function:
u(k)=fix(k)+gi,x(k)∈CRi(4)
and (4) performing convex division on the system state space by using mp-QP (mp-QP) in the explicit model prediction control to obtain the corresponding optimal explicit control law on the state partition, and storing the data. The online calculation is that the solved state partition and control law data are stored on a computer, and the state partition and the corresponding control law of the current state are determined as the control voltage of the three-degree-of-freedom helicopter only by looking up a table. The input (control voltage) and output (altitude, pitch, rotation angle) data are integrated into a training set, a test set and a verification set.
Step 3), constructing and training an EMPC-based neural network: the model prediction control algorithm of the deep neural network is designed by means of the advantages of deep learning on multi-parameter optimization, the model prediction algorithm of the deep learning is a deep learning network which takes explicit model prediction control as a prophase algorithm and carries out online training through data of a control result, and therefore reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes is achieved. The deep neural network phi (x | theta) is composed of L band parameters (w)j,bj) Affine function λ ofj(x) Wherein L is the number of layers of the neural network.
λj(x)=wjx+bj(5)
By the last affine function lambdaj(x) In addition, each of the rest needs to have the function of a nonlinear activation function h, because each node in the hidden layer of the neural network is associated with an activation function, so the deep neural network is expressed as:
φ(x|θ)=λL·h+λl-1·h···h+λ1(6)
wherein
Figure BDA0002501306120000051
Is the set of all parameters that need to be optimized, wjIs the connection weight of the jth, bjIs a threshold value. The ReLU is selected as the activation function because the deep network with the ReLU as the activation function has good feasibility and can obtain better results to reduce the influence of the approximation error. The simultaneous activation function ReLU and corresponding network parameters can accurately represent mp-QP, and the solution of mp-QP is continuous piecewise simulation on polyhedronThe neural network with the activation function of ReLU is a continuous and segmented affine function on polyhedron.
The ReLU activation function is defined as: the piecewise linear function of y max {0, x } (7) activation function is expressed as:
Figure BDA0002501306120000052
the deep learning network constructed on the basis can accurately represent any explicit model predictive control law u (k), which is also determined by two aspects, namely that each piecewise affine function f (x) can be written into the difference of two convex piecewise affine functions
Figure BDA0002501306120000053
The method comprises the following steps:
f(x)=γ(x)-η(x) (9)
wherein γ (x) is represented in
Figure BDA0002501306120000054
Thereon is provided with rγAn area, η (x) is shown in
Figure BDA0002501306120000055
Thereon is provided with rηAnd (4) a region. Second convex piecewise affine function f (x) in
Figure BDA0002501306120000056
The maximum value of the upper N points is
Figure BDA0002501306120000057
After the neural network structure is constructed, the next step is to optimize the network parameters-this is dependent on the training set data. Therefore, a plurality of different points are randomly selected in the state space, and the corresponding optimal control input u (k) is obtained by repeatedly solving to obtain the explicit model predictive control law, so that the generated training set data is used for training the neural network.
This training process can be viewed as a process of continuously iteratively updating the weights and bias values of each layer of neurons to improve their accuracy using an optimization algorithm and a loss function, wherein methods including a stochastic gradient descent optimizer method, such as Adam optimizer method and automatic parameter adjustment method, are used, and these methods are performed by training a subset (small lot) of the dataset and calculating the loss. And replacing the whole data set with a small batch, and then calculating a loss gradient by using a back propagation method to update the neural network parameter theta until the data set is completely sampled to obtain an ideal model. The standard for finishing the training model is whether the convergence degree of the loss function meets the requirement or not and whether the accuracy reaches the expectation or not, the loss function is the mean square error of a predicted value and an actual value, and the accuracy can be more intuitively and effectively seen in the relation between the two. Training thus provides a basis for how deep learning networks map inputs to outputs.
Step 4), model verification and application of a deep learning network: the model designed by the controller is only an approximation of the actual system, and to ensure that the trained model is an accurate representation of the data set and to prevent overfitting, a validation set is required to validate the model with constant tuning, validation techniques such as data partitioning, cross validation and analysis of residual maps. After the deep learning network is trained for the first time, when the output error is smaller than a set value, the actual deep learning network control parameter is not trained to be optimal, multiple times of subsequent training are still needed, and the training parameters, the optimizer and the loss function are adjusted to optimize the program, so that the error accuracy of the deep learning network is further trained and improved, and the deep learning network can reach the expected value. For a trained neural network, u (k) slave can be accurately represented
Figure BDA0002501306120000061
The feedback law of the model predictive control is displayed, the reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes can be realized, and the control of the flight attitudes of the three-degree-of-freedom helicopter can be realized quickly and accurately.
The invention has the following advantages:
1. the invention solves the problems of high calculation resource requirement and long calculation time of the traditional model predictive control, effectively improves the control precision and the prediction accuracy rate and improves the calculation efficiency by combining with deep learning.
2. The method is applied to the control problem of the three-degree-of-freedom helicopter and has a good prediction effect.
3. The method has firm theoretical basis of steps, simple and clear steps and perfect theoretical support.
Drawings
Fig. 1 is a schematic view of a three-degree-of-freedom helicopter to which the method of the present invention relates.
Fig. 2 is a model diagram of a three-degree-of-freedom helicopter to which the method of the present invention relates.
FIG. 3 is an overall flow chart of the method of the present invention.
FIG. 4 is a state partition diagram of the present invention.
FIG. 5 is a diagram of the PWA function of the present invention.
FIG. 6 is a cost function graph of the present invention.
FIG. 7 is the structure of the neuron-layer-network of the present invention.
Fig. 8 is a graph of the loss function convergence of the present invention.
Fig. 9 is a loss function visualization of the present invention.
FIG. 10 is a structural analysis diagram of a visualization module of the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings:
the three-degree-of-freedom helicopter model prediction control method based on deep learning disclosed by the invention comprises the following steps of:
step 1) firstly, analyzing and dynamically modeling a helicopter system: and analyzing the stress condition corresponding to each shaft and component in the operation process of the helicopter system.
The three-degree-of-freedom helicopter system space state equation is as follows:
Figure BDA0002501306120000071
according to the modeling analysis of the helicopter, the altitude angle, the pitch angle p, the rotation angle r and their respective differential altitude angular velocities are selected
Figure BDA0002501306120000072
Pitch angular velocity
Figure BDA0002501306120000073
And rotational angular velocity
Figure BDA0002501306120000074
As state vectors, i.e.
Figure BDA0002501306120000075
The voltage of front and rear motors of the three-freedom-degree helicopter is selected as an input vector, namely u ═ VfVb]TAltitude, pitch angle p, rotation angle r as output vector y ═ p r]T. Substituting the values in the specific relevant parameter table to obtain the coefficients of the A, B and C state equations as follows:
Figure BDA0002501306120000076
step 2) data acquisition and processing are carried out through an explicit model predictive control algorithm: the main idea of off-line computation of explicit predictive control is multi-parameter planning, thus constraining the optimal control problem:
Figure BDA0002501306120000077
s.t.Exk+Luk≤M,k=0,1,···N-1 (1b)
xk+1=Axk+Bxk,k≥0 (1c)
x0=x(0),xN∈χf(1d)
wherein P, Q is not less than 0, R is more than 0, xNIs terminal state, ukAnd xkRespectively the control variable and the state of each sampling pointVariable, UNIn order to optimize the control vector,
Figure BDA0002501306120000081
for the set of all feasible solutions x (0), xN∈χfIs the final state constraint region χfIs a convex set.
However, the optimal control vector U in the optimization problem (1)NDepending only on the current system state x (0), since (1c) is linear, it can be translated into
Figure BDA0002501306120000082
Substituting (2) into (1) can transform the constrained optimal control problem into the following optimization problem:
Figure BDA0002501306120000083
s.t.GUN≤W+Ex(0) (3b)
wherein Y, H, F, G, W, E can be obtained from (1). The multi-parameter quadratic programming problem solution is in the form of a piecewise affine function:
u(k)=fix(k)+gi,x(k)∈CRi(4)
and (4) performing convex division on the system state space by using mp-QP (mp-QP) in the explicit model prediction control to obtain the corresponding optimal explicit control law on the state partition, and storing the data. The online calculation is that the solved state partition and control law data are stored on a computer, and the state partition and the corresponding control law of the current state are determined as the control voltage of the three-degree-of-freedom helicopter only by looking up a table. The input (control voltage) and output (altitude, pitch, rotation angle) data are integrated into a training set, a test set and a verification set.
Step 3), constructing and training an EMPC-based neural network: the model prediction control algorithm of the deep neural network is designed by means of the advantages of deep learning on multi-parameter optimization, and the model prediction algorithm of the deep learning takes explicit model prediction control as a prophase algorithm and carries out on-line training through the data of the control resultAnd (4) learning the network, thereby realizing the reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes. The deep neural network phi (x | theta) is composed of L band parameters (w)j,bj) Affine function λ ofj(x) Wherein L is the number of layers of the neural network.
λj(x)=wjx+bj(5)
By the last affine function lambdaj(x) In addition, each of the rest needs to have the function of a nonlinear activation function h, because each node in the hidden layer of the neural network is associated with an activation function, so the deep neural network is expressed as:
φ(x|θ)=λL·h+λl-1·h···h+λ1(6)
wherein
Figure BDA0002501306120000091
Is the set of all parameters that need to be optimized, wjIs the connection weight of the jth, bjIs a threshold value. The ReLU is selected as the activation function because the deep network with the ReLU as the activation function has good feasibility and can obtain better results to reduce the influence of the approximation error. Meanwhile, the activation function ReLU and corresponding network parameters can accurately represent the mp-QP, and as the solution of the mp-QP is continuous piecewise affine on a polyhedron and is positively determined to be unique, the neural network with the activation function ReLU is also continuous piecewise affine on the polyhedron.
The ReLU activation function is defined as: the piecewise linear function of y max {0, x } (7) activation function is expressed as:
Figure BDA0002501306120000092
the deep learning network constructed on the basis can accurately represent any explicit model predictive control law u (k), which is also determined by two aspects, namely that each piecewise affine function f (x) can be written into the difference of two convex piecewise affine functions
Figure BDA0002501306120000093
The method comprises the following steps:
f(x)=γ(x)-η(x) (9)
wherein γ (x) is represented in
Figure BDA0002501306120000094
Thereon is provided with rγAn area, η (x) is shown in
Figure BDA0002501306120000095
Thereon is provided with rηAnd (4) a region. Second convex piecewise affine function f (x) in
Figure BDA0002501306120000096
The maximum value of the upper N points is
Figure BDA0002501306120000097
After the neural network structure is constructed, the next step is to optimize the network parameters-this is dependent on the training set data. Therefore, a plurality of different points are randomly selected in the state space, and the corresponding optimal control input u (k) is obtained by repeatedly solving to obtain the explicit model predictive control law, so that the generated training set data is used for training the neural network.
This training process can be viewed as a process of continuously iteratively updating the weights and bias values of each layer of neurons to improve their accuracy using an optimization algorithm and a loss function, wherein methods including a stochastic gradient descent optimizer method, such as Adam optimizer method and automatic parameter adjustment method, are used, and these methods are performed by training a subset (small lot) of the dataset and calculating the loss. And replacing the whole data set with a small batch, and then calculating a loss gradient by using a back propagation method to update the neural network parameter theta until the data set is completely sampled to obtain an ideal model. The standard for finishing the training model is whether the convergence degree of the loss function meets the requirement or not and whether the accuracy reaches the expectation or not, the loss function is the mean square error of a predicted value and an actual value, and the accuracy can be more intuitively and effectively seen in the relation between the two. Training thus provides a basis for how deep learning networks map inputs to outputs.
Step (ii) of4) Application of model verification and deep learning network: the model designed by the controller is only an approximation of the actual system, and to ensure that the trained model is an accurate representation of the data set and to prevent overfitting, a validation set is required to validate the model with constant tuning, validation techniques such as data partitioning, cross validation and analysis of residual maps. After the deep learning network is trained for the first time, when the output error is smaller than a set value, the actual deep learning network control parameter is not trained to be optimal, multiple times of subsequent training are still needed, and the training parameters, the optimizer and the loss function are adjusted to optimize the program, so that the error accuracy of the deep learning network is further trained and improved, and the deep learning network can reach the expected value. For a trained neural network, u (k) slave can be accurately represented
Figure BDA0002501306120000101
The feedback law of the model predictive control is displayed, the reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes can be realized, and the control of the flight attitudes of the three-degree-of-freedom helicopter can be realized quickly and accurately.
Case analysis
The invention aims at the three-degree-of-freedom helicopter, because of the self MIM0, higher order and nonlinear characteristics, the aircraft is controlled by utilizing the explicit model prediction to obtain output data, the data is used as training data, then the trained deep learning network is used for respectively carrying out experiments on the height axis, the rotating axis and the pitching axis, the performance of the deep learning combined with the explicit model prediction control method in the specific application of the three-degree-of-freedom helicopter is shown, and the superior performance of the invention is embodied by comparing the experimental results. The control signal can be fed back more quickly under the control of the deep learning network than the control of the explicit model prediction, the response time is shortened, and the stability of the system in the control process is improved due to the autonomous learning capability of the control system.
And obtaining output data of the system by an explicit model predictive control method, wherein the value of the input quantity and the correspondingly selected output quantity are correspondingly combined into data. Then, the table is analyzed to delete unnecessary data, correct abnormal data and repair the missing data. And finally, constructing a data set meeting the training requirement, converting the data set into a csv format, and dividing the data set into a training set, a verification set and a test set. The overall steps can be seen from the overall flow chart of fig. 2, a neural network is built on the tenserflow, data is imported, and then normalization processing is performed. Then 500 rounds of data training are set with a learning rate of 0.01, defining a mean square error loss function and creating an optimizer. As shown in the specification and the attached figure 5, after 500 training, the error of 0.12 still exists between the predicted value and the actual value, although the error exists between the finally obtained result and the calculation of the explicit model prediction control, the flight stability of the helicopter is not affected, and the helicopter can still keep stable flight within the error range. Compared with the solution time, the deep learning network has higher control efficiency than the common model prediction under the condition of the same parameters. The burden of the computer is greatly reduced on the storage space and the computation.
According to the experimental results and the program operation, under the same conditions and on the premise of stable operation, compared with common explicit model prediction control, the model control based on deep learning has the advantages of higher solving speed and better explicit calculation performance. In the aspect of control effect, the altitude angle, the rotation angle and the pitch angle of the three-freedom-degree helicopter can be effectively adjusted, an ideal stable state is quickly achieved, and the three-freedom-degree helicopter has good control performance.
The embodiments described in this specification are merely illustrative of implementations of the inventive concept and the scope of the present invention should not be considered limited to the specific forms set forth in the embodiments but rather by the equivalents thereof as may occur to those skilled in the art upon consideration of the present inventive concept.

Claims (1)

1. The three-degree-of-freedom helicopter explicit model prediction control based on deep learning comprises the following steps:
step 1) firstly, analyzing and dynamically modeling a helicopter system: the method comprises the following steps of analyzing stress conditions corresponding to all shafts and components of the helicopter system in the operation process, and specifically comprises the following steps:
the three-degree-of-freedom helicopter system space state equation is as follows:
Figure FDA0002501306110000011
according to the modeling analysis of the helicopter, the altitude angle, the pitch angle p, the rotation angle r and their respective differential altitude angular velocities are selected
Figure FDA0002501306110000012
Pitch angular velocity
Figure FDA0002501306110000013
And rotational angular velocity
Figure FDA0002501306110000014
As state vectors, i.e.
Figure FDA0002501306110000015
The voltage of front and rear motors of the three-freedom-degree helicopter is selected as an input vector, namely u ═ VfVb]TAltitude, pitch angle p, rotation angle r as output vector y ═ p r]T(ii) a Substituting the values in the specific relevant parameter table to obtain the coefficients of the A, B and C state equations as follows:
Figure FDA0002501306110000016
step 2) data acquisition and processing are carried out through an explicit model predictive control algorithm: the main idea of off-line computation of explicit predictive control is multi-parameter planning, thus constraining the optimal control problem:
Figure FDA0002501306110000017
s.t.Exk+Luk≤M,k=0,1,···N-1 (1b)
xk+1=Axk+Bxk,k≥0 (1c)
x0=x(0),xN∈χf(1d)
wherein P, Q is not less than 0, R is more than 0, xNIs terminal state, ukAnd xkRespectively, the control variable and the state variable, U, of each sampling pointNIn order to optimize the control vector,
Figure FDA0002501306110000018
for the set of all feasible solutions x (0), xN∈χfIs the final state constraint region χfIs a convex set;
however, the optimal control vector U in the optimization problem (1)NDepending only on the current system state x (0), since (1c) is linear, it can be translated into
Figure FDA0002501306110000021
Substituting (2) into (1) can transform the constrained optimal control problem into the following optimization problem:
Figure FDA0002501306110000022
s.t.GUN≤W+Ex(0) (3b)
wherein Y, H, F, G, W, E are obtained from (1); the multi-parameter quadratic programming problem solution is in the form of a piecewise affine function:
u(k)=fix(k)+gi,x(k)∈CRi(4)
performing convex division on a system state space by using mp-QP through explicit model predictive control to obtain an optimal explicit control law corresponding to the state partition, and storing the lower data; the online calculation is that the solved state partition and control law data are stored on a computer, and the state partition and the corresponding control law of the current state are determined to be used as the control voltage of the three-degree-of-freedom helicopter only by looking up a table; integrating input (control voltage) and output (altitude angle, pitch angle and rotation angle) data into a training set, a test set and a verification set;
step 3), constructing and training an EMPC-based neural network: the model prediction control algorithm of the deep neural network is designed by means of the advantage of deep learning on multi-parameter optimization, the model prediction algorithm of the deep learning is the deep learning network which takes explicit model prediction control as a prophase algorithm and carries out online training through data of a control result of the deep learning network, and therefore the reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes is realized; the deep neural network phi (x | theta) is composed of L band parameters (w)j,bj) Affine function λ ofj(x) Wherein L is the number of layers of the neural network;
λj(x)=wjx+bj(5)
by the last affine function lambdaj(x) In addition, each of the rest needs to have the function of a nonlinear activation function h, because each node in the hidden layer of the neural network is associated with an activation function, so the deep neural network is expressed as:
φ(x|θ)=λL·h+λl-1·h···h+λ1(6)
wherein
Figure FDA0002501306110000023
Is the set of all parameters that need to be optimized, wjIs the connection weight of the jth, bjIs a threshold value; ReLU is selected as the activation function because the deep network taking ReLU as the activation function has good feasibility and can obtain better results so as to reduce the influence of approximation error; meanwhile, the activation function ReLU and corresponding network parameters can accurately represent mp-QP, and as the solution of mp-QP is continuous piecewise affine on a polyhedron and is positively determined to be unique, the neural network with the activation function ReLU is also continuous piecewise affine on the polyhedron;
the ReLU activation function is defined as: y max {0, x } (7)
The piecewise linear function of the activation function is represented as: y is equal to x,
Figure FDA0002501306110000031
the deep learning network constructed on the basis can accurately represent any explicit model predictive control law u (k), which is also determined by two aspects, namely that each piecewise affine function f (x) can be written into the difference of two convex piecewise affine functions
Figure FDA0002501306110000032
The method comprises the following steps:
f(x)=γ(x)-η(x) (9)
wherein γ (x) is represented in
Figure FDA0002501306110000033
Thereon is provided with rγAn area, η (x) is shown in
Figure FDA0002501306110000034
Thereon is provided with rηAn area; second convex piecewise affine function f (x) in
Figure FDA0002501306110000035
The maximum value of the upper N points is
Figure FDA0002501306110000036
After the neural network structure is constructed, the next step is to optimize the network parameters, which depend on the training set data; therefore, a plurality of different points are randomly selected in a state space, and the corresponding optimal control input u (k) is obtained by repeatedly solving to obtain an explicit model predictive control law, so that the generated training set data is used for training the neural network;
this training process can be seen as a process of continuously iteratively updating the weights and bias values of each layer of neurons to continuously improve their accuracy using an optimization algorithm and a loss function, wherein methods including a stochastic gradient descent optimizer method, such as Adam optimizer method and automatic parameter adjustment method, are used, and these methods are performed by training a subset (small lot) of the dataset and calculating the loss; replacing the whole data set with a small batch, and then calculating a loss gradient by using a back propagation method to update a neural network parameter theta until the data set is completely sampled to obtain an ideal model; the standard for finishing the training model is whether the convergence degree of the loss function meets the requirement or not and whether the accuracy reaches the expectation or not, the loss function is the mean square error of a predicted value and an actual value, and the accuracy can be more intuitively and effectively seen in the relation between the two; training therefore provides a basis for how deep learning networks map inputs to outputs;
step 4), model verification and application of a deep learning network: the model designed by the controller is only approximate to an actual system, and in order to ensure that the trained model is accurately represented by a data set and prevent overfitting, a verification set is needed to verify the model and continuously adjust parameters, and verification technologies comprise data partitioning, cross verification and residual error map analysis; after the deep learning network is trained for the first time, when the output error is smaller than a set value, the actual deep learning network control parameter is not trained to be optimal, multiple times of subsequent training are still needed, and the training parameters, an optimizer and a loss function are adjusted to optimize a program, so that the error precision of the deep learning network is further trained and improved, and the deep learning network reaches an expected value; for a trained neural network, u (k) slave can be accurately represented
Figure FDA0002501306110000041
The feedback law of the model predictive control is displayed, the reconstruction of the control law of the three-degree-of-freedom helicopter under different flight attitudes can be realized, and the control of the flight attitudes of the three-degree-of-freedom helicopter can be realized quickly and accurately.
CN202010433327.3A 2020-05-21 2020-05-21 Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning Active CN111580389B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010433327.3A CN111580389B (en) 2020-05-21 2020-05-21 Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010433327.3A CN111580389B (en) 2020-05-21 2020-05-21 Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning

Publications (2)

Publication Number Publication Date
CN111580389A true CN111580389A (en) 2020-08-25
CN111580389B CN111580389B (en) 2022-05-24

Family

ID=72125170

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010433327.3A Active CN111580389B (en) 2020-05-21 2020-05-21 Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning

Country Status (1)

Country Link
CN (1) CN111580389B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114115302A (en) * 2021-06-08 2022-03-01 中国人民解放军国防科技大学 Spacecraft attitude control method based on deep neural network approximation MPC
CN114626509A (en) * 2022-03-28 2022-06-14 杭州师范大学 Method for reconstructing explicit model prediction control based on deep learning
CN117131784A (en) * 2023-10-20 2023-11-28 北京航空航天大学 Global agent optimization-based accelerated degradation test design method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2672340A1 (en) * 2012-06-04 2013-12-11 Universita Degli Studi Di Genova Method for creating digital circuits of a feedback control system that implements an approximation technique for model predictive control (MPC)
CN110471280A (en) * 2019-07-30 2019-11-19 浙江工业大学 The connected graph explicit model forecast Control Algorithm of Three Degree Of Freedom helicopter

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2672340A1 (en) * 2012-06-04 2013-12-11 Universita Degli Studi Di Genova Method for creating digital circuits of a feedback control system that implements an approximation technique for model predictive control (MPC)
CN110471280A (en) * 2019-07-30 2019-11-19 浙江工业大学 The connected graph explicit model forecast Control Algorithm of Three Degree Of Freedom helicopter

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
BAIHONG JIN 等: "Detecting and Diagnosing Incipient Building Faults Using Uncertainty Information from Deep Neural Networks", 《2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM)》 *
DANIELE MASTI 等: "Learning binary warm starts for multiparametric mixed-integer quadratic programming", 《2019 18TH EUROPEAN CONTROL CONFERENCE (ECC)》 *
JU ZHANG 等: "Control of a laboratory 3-DOF helicopter_Explicit model predictive approach", 《INTERNATIONAL JOURNAL OF CONTROL, AUTOMATION AND SYSTEMS》 *
刘景: "一类受限非线性系统显示模型预测控制器", 《计算机系统应用》 *
张聚 等: "汽车悬挂系统显式模型预测控制", 《信息与控制》 *
王仑等: "四旋翼飞行器姿态的显式模型预测控制", 《电光与控制》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114115302A (en) * 2021-06-08 2022-03-01 中国人民解放军国防科技大学 Spacecraft attitude control method based on deep neural network approximation MPC
CN114626509A (en) * 2022-03-28 2022-06-14 杭州师范大学 Method for reconstructing explicit model prediction control based on deep learning
CN117131784A (en) * 2023-10-20 2023-11-28 北京航空航天大学 Global agent optimization-based accelerated degradation test design method and device
CN117131784B (en) * 2023-10-20 2024-02-23 北京航空航天大学 Global agent optimization-based accelerated degradation test design method and device

Also Published As

Publication number Publication date
CN111580389B (en) 2022-05-24

Similar Documents

Publication Publication Date Title
CN111580389B (en) Three-degree-of-freedom helicopter explicit model prediction control method based on deep learning
Yan et al. Extended state observer‐based sliding mode fault‐tolerant control for unmanned autonomous helicopter with wind gusts
CN107807521B (en) Fixed time-oriented formation aircraft cooperative control method and system
CN107065897B (en) Three-degree-of-freedom helicopter explicit model prediction control method
CN111813146B (en) Reentry prediction-correction guidance method based on BP neural network prediction voyage
CN112947592B (en) Reentry vehicle trajectory planning method based on reinforcement learning
Ardil Fighter Aircraft Evaluation and Selection Process Based on Triangular Fuzzy Numbers in Multiple Criteria Decision Making Analysis Using the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS)
Abdollahi et al. Simplified fuzzy‐Padé controller for attitude control of quadrotor helicopters
Zhang et al. Recurrent neural network-based model predictive control for multiple unmanned quadrotor formation flight
Ghasemi Hamed et al. Statistical prediction of aircraft trajectory: regression methods vs point-mass model
Bierling Comparative analysis of adaptive control techniques for improved robust performance
Wu et al. Learning-based fixed-wing UAV reactive maneuver control for obstacle avoidance
CN117055605A (en) Multi-unmanned aerial vehicle attitude control method and system
Wang Adaptive sliding mode robust control based on multi‐dimensional Taylor network for trajectory tracking of quadrotor UAV
Bian et al. A fitness sharing based ant clustering method for multimodal optimization of the aircraft longitudinal automatic carrier landing system
Zhou Multi-variable adaptive high-order sliding mode quasi-optimal control with adjustable convergence rate for unmanned helicopters subject to parametric and external uncertainties
Emaletdinova et al. Methods of constructing the neural network models of regulators for controlling a dynamic object with smooth monotonous behavior
Ajwad et al. Distributed time-varying formation tracking of multi-quadrotor systems with partial aperiodic sampled data
CN114488815A (en) Air-to-air missile active disturbance rejection control parameter setting method and device and electronic equipment
CN113848726A (en) System controller design method and device and electronic equipment
Li et al. A path planning for one UAV based on geometric algorithm
Mozelli et al. Attitude of quadrotor-like vehicles: Fuzzy modeling and control with prescribed rate of convergence
Chen et al. Real-time UAV Flight Path Prediction Using A Bi-directional Long Short-term Memory Network with Error Compensation [J]
CN117111629B (en) Multi-unmanned aerial vehicle fixed time optimal control method based on self-adaptive dynamic programming
Millidere et al. SAAB 340B Aerodynamic Model Development Using Binary Particle Swarm Optimization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant