CN106033189B

CN106033189B - Flying robot's pose network response surface device

Info

Publication number: CN106033189B
Application number: CN201510113369.8A
Authority: CN
Inventors: 樊兆峰
Original assignee: Xuzhou University of Technology
Current assignee: Xuzhou University of Technology
Priority date: 2015-03-16
Filing date: 2015-03-16
Publication date: 2019-02-22
Anticipated expiration: 2035-03-16
Also published as: CN106033189A

Abstract

The invention discloses a kind of flying robot's pose network response surface devices, the Pose Control especially suitable for more rotor flying robots.Controller includes: rolling optimization device, neural network prediction model, inverse neural network model, delay link, feedback element, addition comparator, subtraction comparator；Controller models flying robot's dynamic process using neural network, predicts；The initial value of iteration optimization algorithms is determined using inverse neural network；Local optimum is carried out using Newton-Raphson method；Controller also has feedback control function.Compared to modelling by mechanism, neural network model has higher precision, and the PREDICTIVE CONTROL based on this model has higher control performance；Using the method for objective function optimization, power consumption can be reduced to the greatest extent under the premise of guaranteeing control performance；Compared with PID control, control overshoot is small, and control effect is more steady；Compared with sliding formwork control, it will not lead to the problem of and tremble shake.

Description

Flying robot's pose network response surface device

Technical field

The present invention relates to the Pose Control devices of flying robot a kind of, are especially used for the pose of more rotor flying robots Controller belongs to robotic technology field.

Background technique

Flying robot generally refers to be equipped with various sensors, the nolo flight with certain perception and intelligence Device.There is the successful application case of some flying robots in recent years with the development of science and technology in succession, for example, plus Alan's (Aeryon) laboratory development Scout of Waterloo of putting on airs is a kind of novel flying machine that offender can be enabled to quail Device people, may replace monitoring camera；In Japan, flying robot helps peasant to manage rice field, monitors corn growing, sows agriculture Medicine；In Africa, the people of wild animal is catched and killed in flying robot's tracking with video camera.Currently, every profession and trade is to flying robot Demand it is larger, after having scholarly forecast 2015, the business application of flying robot explosively increases appearance, extensively Application prospect also promotes the research of flying robot's technology, since current flying robot generally uses multi-rotor aerocraft, Therefore hot spot is become to the control technology research of multi-rotor aerocraft；Multi-rotor aerocraft belongs to multi input, multi output, has by force The nonlinear control system of coupling, lag etc., modelling by mechanism use Newton-Euller method mostly, and this model, which often exists, to be lost With the higher problem of degree；Control to multi-rotor aerocraft, most commonly used at present is still PID control, due to PID control sheet That control is realized according to the error generated in matter, therefore, it is difficult to error generate before just provide one for eliminate mistake The control amount of difference, control precision be not high；There is document proposition to use sliding formwork control, but " trembles shake " present in sliding formwork control and ask Topic is difficult to solve；Also there is document to propose PREDICTIVE CONTROL, but by using model, there are the higher problem of mismatch, control effects Fruit is unsatisfactory.In addition, therefore, battery continues since flying robot is to be powered using battery, and battery is all heavier Working time is always the bottleneck of flying robot's technology, how under the premise of guaranteeing to control precision, is farthest reduced Power consumption is also larger problem present in control, and PID control and sliding formwork control are difficult to solve this problem.

In various intelligent control methods, it is based on prediction model that Model Predictive Control (MPC), which derives from industrial practice, Finite time-domain optimal control.MPC is applied widely in recent years, but from the point of view of application, is mainly also limited to linear or quasi- Linear process causes this phenomenon mainly due to expend very big cost in Nonlinear Mechanism modeling, and hardly results in standard True model.

Artificial neural network can sufficiently approach complicated Nonlinear Mapping relationship, have study and adapt to uncertain system Dynamic characteristic and stronger robustness and the characteristics of fault-tolerance, this make its become to nonlinear system establish prediction model and One of key technology of optimal control.If artificial neural network is combined and can be suitable for predictive control algorithm The network response surface of Control of Nonlinear Systems.

Summary of the invention

The invention discloses a kind of flying robot's pose network response surface devices, especially suitable for more rotor flyings The Pose Control of robot；Main purpose is the relatively Accurate Model for solving the problems, such as flying robot, and is guaranteeing control performance Under the premise of reduce power consumption the problem of.

The technical scheme adopted by the invention is that: flying robot's pose network response surface device, comprising: roll excellent Change device, neural network prediction model, inverse neural network model, n_uRank delay link, n_wIt is rank delay link, first-order lag link, anti- Feedback link, addition comparator, subtraction comparator；Network response surface device is using neural network prediction model to flying machine People's dynamic process is modeled, is predicted；The initial value of iteration optimization algorithms is determined using inverse neural network；Using Newton-Raphson Algorithm optimizes；Controller has feedback control function.

The dynamic process of flying robot may be expressed as: with discrete nonlinear equation

w(k+1) = f(u(k),u(k-1),…,u(k-n_u+1),w(k) ,w(k-1),…,w(k-n_w+1))

Wherein: w (k+1), w (k), w (k-1) ..., w (k-n_w+ 1) kth+1 is respectively indicated, k, k-1 ..., k-n_w+ 1 moment The pose parameter value of flying robot；u(k),u(k-1),…,u(k-n_u+ 1) kth is respectively indicated, k-1 ..., k-n_u+ 1 moment flew The control magnitude of row robot；n_w, n_uThe order of delay is respectively indicated, these numerical value can be measured by the method for experiment； Function f indicates a unknown Nonlinear Mapping relationship.

By R_p=[u(k),u(k-1),…,u(k-n_u+1),w(k),w(k-1),…,w(k-n_w+ 1)] as neural network Input, the output by w (k+1) as neural network only need to be according to the numerical value training neural network measured above, so that it may determine The weight of neural network a, so that it is determined that neural network enables the dynamic of the neural network flyby robot special Property.

For the controller of flying robot, at the kth moment, u (k-1) ..., u (k-n_u+1), w(k) ,w(k- 1),…,w(k-n_wIt+1) is obviously all known constant, as long as utilizing the neural network at this point, provide a control amount u (k) The pose parameter w (k+1) at next moment can be calculated, so, this trained neural network can serve as one The prediction model of a flying robot's pose.

In order to realize the purpose for reducing power consumption under the premise of guaranteeing control performance, rolling optimization can be technically used Method, the objective function of optimization determines are as follows:

J = (w_m(k+1)–w_r’(k+1))²+λu(k)²

In formula: w_m(k+1) it indicates by the pose value at the k+1 moment of Neural Network model predictive, w_r' (k+1) indicate it is calibrated It is externally input afterwards to refer to pose value, (w_m(k+1)–w_r’(k+1))²Value it is smaller, control performance is better, u (k)²Value get over Small, then the electric energy consumed is fewer, and λ is weight factor, for compromising between control performance and power consumption.Target letter Several minimum values corresponds to the optimum control amount that power consumption can be reduced under the premise of guaranteeing control performance.

For the real-time of Guarantee control system, a kind of local value optimization algorithm, that is, ox is used to the optimizing of objective function - the inferior algorithm of pressgang, iterative formula are as follows:

In formula: u (k)^j+1,u(k)^jRespectively indicate the control amount of jth+1 time and iteration j, last point of the right Subrepresentation objective function J is to the first-order partial derivative of control amount u (k) in u (k)=u (k)^jWhen value, denominator indicate objective function J To the second-order partial differential coefficient of control amount u (k) in u (k)=u (k)^jWhen value.

Known by iterative formula, iteration needs to set the i.e. u (k) of initial value⁰, ideal scheme is to choose to have most The control amount of excellent control performance is initial value, the advantage of doing so is that can prevent algorithm from falling into other minimums, to guarantee The control performance of system；For this purpose, can be realized using an inverse neural network, the inverse kinematics process of flying robot can be indicated For discrete nonlinear equation:

u(k) = g(u(k-1),…,u(k-n_u+1), w(k+1),w(k) ,w(k-1),…,w(k-n_w+1))

Wherein: the inverse Nonlinear Mapping relationship of function g expression f；As long as at this time by R_i=[ u(k-1),…,u(k-n_u+1), w_r’(k +1),w(k), w(k-1),…,w(k-n_w+ 1)] as the input of neural network, by u (k) as the defeated of neural network Out, w_r' (k+1) be taken as experimental data w (k+1), it can use the data of experiment measurement still just to train inverse neural network, this The output of sample neural network enables to w_m(k+1) = w_r' (k+1), so corresponding to optimal control performance.

Feedback compensation part obtains error with actually detected value to realize using prediction pose value, and the error at k moment is w (k)-w_m(k), after correction are as follows: e=δ (w (k)-w_m(k)), wherein δ is feedback correcting coefficient；The realization of feedback compensation passes through amendment Reference input is realized, if k moment external reference input is w_r(k+1), the then reference input after correcting are as follows: w_r ^’(k+1) = w_r (k+1)+ δ(w(k)-w_m(k))。

To the order n of delay_w, n_u, the memory-aided method of implementation use benefit, such as to n_uRank delay, uses n_uA storage unit stores kth, k-1 ..., k-n in kth sampling instant respectively_uThe value of+1 sampling instant is adopted in kth+1 At the sample moment, updating storage unit is k+1 → k, k → k-1 ..., k-n_u+2→k-n_u+ 1, abandon kth-n_uThe value at+1 moment.

In conclusion flying robot's network response surface device is completed by following steps:

Step 1: initialization, network response surface device needs to initialize following parameter: feedback compensation when system electrification Coefficient δ, weight factor λ, terminal parameter α, maximum number of iterations ξ, delay link order n_u 、n_w, the number of iterations j=0, prolong The value of slow link storage unit, w_m(1)；.

Step 2: feedback compensation, network response surface device measures the pose parameter w (k) at k moment, according to w_r ^’(k+1) = w_r(k+1)+ δ(w(k)-w_m(k)) to reference pose signal w_r(k+1) it is corrected, the reference pose w after being corrected_r ^’ (k+1)。

Step 3: calculating the initial value of rolling optimization, the initial value u (k) of rolling optimization is calculated⁰, neural network prediction control Device processed calls inverse neural network model, according to formula: u (k)⁰= lw_i×a(iw_i×R_i+b_1i)+ b_2iCalculate the first of rolling optimization Initial value u (k)⁰。

Step 4: carrying out rolling optimization according to Newton-Raphson iterative formula obtains optimum control amount,

(1) by formula:

It calculates u (k)^j+1；

(2) if ︱ u (k)^j+1 -u(k)^j︱<α or j>ξ then turns the 5th step, otherwise, j+1 → j, u (k) is arranged^j+1→ u(k)^j, turn (1).

Step 5: being controlled with optimum control amount obtained in the previous step, optimum control amount, u* (k)=(u (k) are calculated^j+1 + u(k)^j)/2, and control flying robot's controlled system is gone with u* (k), network response surface device calls nerve net Network prediction model obtains the pose predicted value w of next sampling instant_m(k+1), the storage unit values for updating delay link are u (k +1)→u(k), u(k)→u(k-1),…, u(k-n_u+2)→u(k-n_u+ 1) the value u (k-n at earliest moment, is abandoned_u+1)；w (k)→w(k-1), w(k-1)→w(k-2),…, w(k-n_w+2)→w(k -n_w+ 1) the value w (k-n at earliest moment, is abandoned_w+ 1)。

The beneficial effects of the present invention are: neural network model has higher precision, to flying machine compared with modelling by mechanism The dynamic process Approximation effect of device people is good, but also the PREDICTIVE CONTROL based on this model has higher control performance；It uses The method of objective function optimization can reduce power consumption to the greatest extent under the premise of guaranteeing control performance；Compared with PID control, by It can be controlled in advance in front of following error generates, therefore control overshoot is small, control effect is more steady；With sliding formwork control It compares, will not lead to the problem of and tremble shake.

Detailed description of the invention

Fig. 1 is controller structure diagram of the invention, in figure: 1001, network response surface device；1002, rolling optimization Device；1003, neural network prediction model；1004, inverse neural network model；1005,n_uRank delay link；1006,n_wRank ring retard Section；1007, first-order lag link；1008, feedback element；1009, addition comparator；1010, subtraction comparator；1011, it flies Robot controlled system.

Fig. 2 is neural network structure figure of the present invention, in figure: 2001, input vector；2002, input layer biases Vector；2003, weight matrix is inputted；2004, accumulator；2005, neuron activation functions；2006, output layer bias vector； 2007, weight matrix is exported；2008, output vector.

Fig. 3 is quadrotor flying robot pose schematic diagram.

Fig. 4 is controller work flow diagram.

Specific embodiment

Embodiment 1: illustrate specific real-time mode, quadrotor flying robot position for selection quadrotor flying robot For appearance as shown in figure 3, input control quantity is the speed control signal of four rotor motors, output quantity is robot pose parameter, In, the control signal at k moment are as follows: u (k)=[u₁(k),u₂(k),u₃(k),u₄(k)], current quadrotor control signal is generally adopted With duty ratio, i.e. u₁(k),u₂(k),u₃(k),u₄(k) duty ratio of the control square wave of four motors, pose parameter are respectively indicated For w (k)=[ψ (k), φ (k), θ (k)], ψ (k), φ (k), θ (k) is respectively the yaw angle at k moment, roll angle, pitch angle.

It should be noted that since neural network is that the general mapping relations of one kind approach device, so proposed by the present invention Method is suitble to various flying robots, the difference is that element number and physics dimension that dominant vector u (k) includes are different； Although this illustrates for sentencing quadrotor flying robot, versatility of the invention is had no effect on.

Control system architecture as shown in Figure 1, in dotted line frame 1001 be network response surface device, specifically include: 1002, rolling optimization device；1003, neural network prediction model；1004, inverse neural network model；1005,n_uRank delay link； 1006、n_wRank delay link；1007, first-order lag link；1008, feedback element；1009, addition comparator；1010, subtraction ratio Compared with device；In Fig. 1,1011 be flying robot's controlled system.

The pose data of experiment measurement quadrotor flying robot: 800 are randomly generated within the scope of the permissible value of control amount A data, u₁(1),u₂(1),u₃(1),u₄(1), u₁(2),u₂(2),u₃(2),u₄(2),…,u₁(200),u₂(200),u₃ (200),u₄(200), these control amounts are inputted in corresponding sampling instant respectively, measures corresponding output pose parameter: ψ (2),φ(2),θ(2), ψ(3) ,φ(3),θ(3),…, ψ(201) ,φ(201),θ(201)。

According to the data of the above experiment measurement, delay order n is determined with the method for parameter identification_u,n_w。

Neural network prediction model is established as shown in Fig. 2, the input vector (2001) of neural network is R_p= [u(k), u (k-1),…, u(k-n_u+1), w(k), w(k-1),…, w(k-n_w+1)]=[u₁(k),u₂(k),u₃(k),u₄(k), u₁ (k-1),u₂(k-1),u₃(k-1),u₄(k-1),…, u₁(k-n_u+1),u₂(k-n_u+1),u₃(k-n_u+1),u₄(k-n_u+1),ψ (k),φ(k),θ(k),ψ(k-1),φ(k-1),θ(k-1),…, ψ(k-n_w+1),φ(k-n_w+1),θ(k-n_w+ 1) it], inputs The number of vector (2001) is 4 n_u +3n_w；Output vector (2008) is the prediction pose w at k+1 moment_m(k+1) = [ψ_m (k+ 1),φ_m (k+1),θ_m(k+1)]；Neuron activation functions (2005) use sigmoid function: a (t)=1/ (1+e^-t)；Input Layer bias vector (2002) b_1m, input weight matrix (2003) iw_m, output layer bias vector (2006) b_2m, output weight square Battle array (2007) lw_mIt can be obtained according to the data of the above experiment measurement by neural network training method；Neural network prediction The relational expression of model are as follows: w_m(k+1) = lw_m×a(iw_m×R_p+b_1m)+ b_2m。

The structure of inverse neural network model is also as shown in Fig. 2, the input vector (2001) of network is R_i= [ w_r’(k+1), u(k-1),…, u(k-n_u+1), w(k), w(k-1),…, w(k-n_w+1)]=[ψ(k+1),φ(k+1),θ(k+1), u₁ (k-1),u₂(k-1),u₃(k-1),u₄(k-1),…, u₁(k-n_u+1),u₂(k-n_u+1),u₃(k-n_u+1),u₄(k-n_u+1), ψ (k),φ(k),θ(k),ψ(k-1),φ(k-1),θ(k-1),…, ψ(k-n_w+1),φ(k-n_w+1),θ(k-n_w+ 1) it], inputs The number of vector (2001) is 4 n_u +3n_w-1；Output vector (2008) is control amount u (k)=[u at k moment₁(k),u₂ (k),u₃(k),u₄(k)]；Neuron activation functions (2005) still use sigmoid function: a (t)=1/ (1+e^-t)；Input layer Bias vector (2002) b_1i, input weight matrix (2003) iw_i, output layer bias vector (2006) b_2i, output weight matrix (2007) lw_iIt can be obtained according to the data of the above experiment measurement by neural network training method；Inverse neural network prediction The relational expression of model are as follows: u (k)=lw_i×a(iw_i×R_i+b_1i)+ b_2i。

The implementation of delay link (1005,1006,1007) uses the memory-aided method of benefit, such as to n_uRank ring retard It saves (1005), uses n_uA storage unit stores kth, k-1 ..., k-n in kth sampling instant respectively_u+ 1 sampling instant Value, in+1 sampling instant of kth, updating storage unit is k+1 → k, k → k-1 ..., k-n_u+2→k-n_u+ 1, abandon kth-n_u+ The value at 1 moment.

Feedback fraction is realized by subtraction comparator (1010), feedback element (1008), addition comparator (1009), according to figure 1, feedback quantity are as follows: δ (w (k)-w_m(k)), wherein δ is feedback correcting coefficient, the reference input after feedback compensation are as follows: w_r ^’(k +1) = w_r (k+1)+ δ(w(k)-w_m(k)), feedback compensation is achieved.

The objective function form of rolling optimization device (1002) are as follows: J=(w_r ^’(k+1)- w_m(k+1))²+λu(k)², In, λ is weight factor, is optimized using Newton-Raphson numerical value Local Optimization Algorithm, iterative formula are as follows:

Wherein, u (k)^j+1,u(k)^jRespectively indicate the control amount of jth+1 time and iteration j, last point of the right Subrepresentation objective function J is to the first-order partial derivative of control amount u (k) in u (k)=u (k)^jWhen value, denominator indicate objective function J To the second-order partial differential coefficient of control amount u (k) in u (k)=u (k)^jWhen value.

The initial value u (k) of iteration⁰It is determined by inverse neural network, i.e. u (k)⁰= lw_i×a(iw_i×R_i+b_1i)+ b_2i, repeatedly For termination condition are as follows: ︱ u (k)^j+1 -u(k)^j︱ < α, α are a lesser normal number；In addition, for the real-time for guaranteeing algorithm, Maximum number of iterations ξ is set, if the number of iterations is higher than the value, terminates iterative process；Optimum control amount is by public affairs after the completion of iteration Formula u* (k)=(u (k)^j+1 + u(k)^j)/2 obtain.

The working principle of network response surface device (1001): illustrating by taking the k sample moment as an example, neural network prediction control Device (1001) processed measures the pose parameter w (k) at k moment first, according to w_r ^’(k+1) = w_r(k+1)+ δ(w(k)-w_m(k)) right With reference to pose signal w_r(k+1) it is corrected, the reference pose w after being corrected_r ^’(k+1)；Secondly, network response surface Device (1001) calls inverse neural network model (1004), according to formula: u (k)⁰= lw_i×a(iw_i×R_i+b_1i)+ b_2iCalculate rolling The initial value u (k) of dynamic optimization⁰, optimum control amount u* (k) is obtained according to Newton-Raphson iterative formula, goes to control with u* (k) Flying robot's controlled system (1011)；Finally, updating the storage unit values of delay link, network response surface device (1001) neural network prediction model (1003) are called to obtain the pose predicted value w of next sampling instant_m(k+1), in order to Next sampling instant is used for feedback compensation.

Further, as shown in figure 4, the detailed work steps of network response surface device (1001) are as follows:

Step 1: initialization, network response surface device (1001) needs to initialize following parameter when system electrification: anti- Present the order n of correction coefficient δ, weight factor λ, terminal parameter α, maximum number of iterations ξ, delay link_u 、n_w, the number of iterations j= 0, the value of delay link storage unit, w_m(1)。

Step 2: feedback compensation, network response surface device (1001) measures the pose parameter w (k) at k moment first, According to w_r ^’(k+1) = w_r(k+1)+ δ(w(k)-w_m(k)) to reference pose signal w_r(k+1) it is corrected, after obtaining correction Reference pose w_r ^’(k+1)。

Step 3: calculating the initial value of rolling optimization:

Calculate the initial value u (k) of rolling optimization⁰, the inverse neural network model of network response surface device (1001) calling (1004), according to formula: u (k)⁰= lw_i×a(iw_i×R_i+b_1i)+ b_2iCalculate the initial value u (k) of rolling optimization⁰。

Step 4: carrying out rolling optimization according to Newton-Raphson iterative formula obtains optimum control amount:

(1) by formula:

It calculates u (k)^j+1；

Step 5: being controlled with optimum control amount obtained in the previous step, optimum control amount, u* (k)=(u (k) are calculated^j+1 + u(k)^j)/2, and control flying robot's controlled system (1011), network response surface device are removed with u* (k) (1001) neural network prediction model (1003) are called to obtain the pose predicted value w of next sampling instant_m(k+1), update is prolonged The storage unit values of slow link are u (k+1) → u (k), u (k) → u (k-1) ..., u (k-n_u+2)→u(k -n_u+ 1) it, abandons Value u (k-the n at earliest moment_u+1)；w(k)→w(k-1), w(k-1)→w(k-2),…, w(k-n_w+2)→w(k –n_w+ 1), Abandon the value w (k-n at earliest moment_w+1)。

Claims

1. flying robot's pose network response surface device, comprising: rolling optimization device, neural network prediction model, inverse mind Through network model, n_uRank delay link, n_wRank delay link, first-order lag link, feedback element, addition comparator, subtraction compare Device；Network response surface device models flying robot's dynamic process using neural network prediction model, predicts；Make The initial value of iteration optimization algorithms is determined with inverse neural network；It is optimized using Newton-Raphson method；Neural network prediction control Device processed has feedback control function, and the objective function of rolling optimization device is determined as J=(w_r ^’(k+1)- w_m(k+1))²+λu(k)², Wherein, λ is weight factor, w_r ^’(k+1)、w_m(k+1) be respectively+1 sampling instant of kth correction after with reference to pose and prediction bits Appearance, pose parameter are w (k+1)=[ψ (k+1), φ (k+1), θ (k+1)], and ψ (k+1), φ (k+1), θ (k+1) is respectively k+1 The yaw angle at moment, roll angle, pitch angle, u (k) are the control amount of kth sampling instant；It is carried out using Newton-Raphson method Local optimum, iterative formula are as follows:

In formula: u (k)^j+1, u(k)^jRespectively indicate the control amount of jth+1 time and iteration j, last molecule table of the right Show objective function J to the first-order partial derivative of control amount u (k) in u (k)=u (k)^jWhen value, denominator indicate J pairs of objective function The second-order partial differential coefficient of control amount u (k) is in u (k)=u (k)^jWhen value；Stopping criterion for iteration are as follows: ︱ u (k)^j+1 -u(k)^j︱ < α, α is terminal parameter；For the real-time for guaranteeing algorithm, maximum number of iterations ξ is set, if the number of iterations is higher than the value, termination changes For process；Optimum control amount presses formula u* (k)=(u (k) after the completion of iteration^j+1 + u(k)^j)/2 obtain；Neural network is pre- It surveys model and inverse neural network model includes input vector, input layer bias vector, input weight matrix, accumulator, neuron Activation primitive, output layer bias vector, output weight matrix, output vector；Neuron activation functions use sigmoid function: a (t)=1/(1+e^-t)；The input vector of neural network prediction model is R_p= [u(k), u(k-1),…, u(k-n_u+1), w (k), w(k-1),…, w(k-n_w+ 1)], wherein n_w, n_uThe order of delay is respectively indicated, the number of input vector is 4 n_u + 3n_w；Output vector is the prediction pose w at k+1 moment_m(k+1)；The input vector of inverse neural network model is R_i= [ w_r’(k+ 1), u(k-1),…, u(k-n_u+1), w(k), w(k-1),…, w(k-n_w+ 1)], the number of input vector is 4 n_u + 3n_w-1；Output vector is the control amount u (k) at k moment.

2. flying robot's pose network response surface device according to claim 1, it is characterised in that: n_uRank delay Link, n_wRank delay link, first-order lag link use benefit memory-aided method when implementation, to n_uRank delay link, makes Use n_uA storage unit stores kth, k-1 ..., k-n in kth sampling instant respectively_uThe value of+1 sampling instant, in kth+1 Sampling instant, updating storage unit is k+1 → k, k → k-1 ..., k-n_u+2→k-n_u+ 1, abandon kth-n_uThe value at+1 moment.

3. flying robot's pose network response surface device according to claim 1, it is characterised in that: feedback loop Section, addition comparator, subtraction comparator constitute the feedback fraction of network response surface device, feedback quantity are as follows: δ (w (k)-w_m (k)), wherein δ is feedback correcting coefficient, the reference input after feedback compensation are as follows: w_r ^’(k+1) = w_r (k+1)+ δ(w (k)-w_m(k)), w_r(k+1) pose parameter before being expressed as k+1 time correction.

4. flying robot's pose network response surface device according to claim 1, it is characterised in that: work step Are as follows:

Step 1: initialization, network response surface device needs to initialize following parameter: feedback correcting coefficient when system electrification δ, weight factor λ, terminal parameter α, maximum number of iterations ξ, delay link order n_u、n_w, the number of iterations j=0, delay link The value w of storage unit_m(1)；

Step 2: feedback compensation, network response surface device measures the pose parameter w (k) at k moment, according to w_r ^’(k+1) = w_r(k+1)+ δ(w(k)-w_m(k)) to reference pose signal w_r(k+1) it is corrected, the reference pose w after being corrected_r ^’(k+ 1)；

Step 3: calculating the initial value of rolling optimization:

Calculate the initial value u (k) of rolling optimization⁰, network response surface device calls inverse neural network model, according to formula: u (k)⁰= lw_i×a(iw_i×R_i+b_1i)+ b_2iCalculate the initial value u (k) of rolling optimization⁰, wherein lw_iIndicate output weight square Battle array, iw_iIndicate input weight matrix, b_1iIndicate input layer bias vector, b_2iIndicate output layer bias vector；

(1) by formula:

It calculates u (k)^j+1；

(2) if ︱ u (k)^j+1 -u(k)^j︱<α or j>ξ then turns the 5th step, otherwise, j+1 → j, u (k) is arranged^j+1→u(k)^j, turn (1)；

Step 5: being controlled with optimum control amount obtained in the previous step, optimum control amount, u* (k)=(u (k) are calculated^j+1 + u(k)^j)/2, and control flying robot's controlled system is gone with u* (k), network response surface device calls neural network Prediction model obtains the pose predicted value w of next sampling instant_m(k+1), the storage unit values for updating delay link are u (k+ 1)→u(k), u(k)→u(k-1),…, u(k-n_u+2)→u(k-n_u+ 1) the value u (k-n at earliest moment, is abandoned_u+1)；w(k) →w(k-1), w(k-1)→w(k-2),…, w(k-n_w+2)→w(k -n_w+ 1) the value w (k-n at earliest moment, is abandoned_w+1)。