CN113219832A

CN113219832A - Design method of adaptive iterative learning non-uniform target tracking controller

Info

Publication number: CN113219832A
Application number: CN202110507272.0A
Authority: CN
Inventors: 张春丽; 田旭; 严雷; 钱富才; 谢国; 王文卿; 赵永红
Original assignee: Xian University of Technology
Current assignee: Xian University of Technology
Priority date: 2021-05-10
Filing date: 2021-05-10
Publication date: 2021-08-06

Abstract

The invention discloses a design method of a self-adaptive iterative learning non-uniform target tracking controller, which specifically comprises the following steps: the method comprises the following steps: step 1: performing system description aiming at a type of continuous time nonlinear system; step 2: describing a time-varying boundary layer and a radial basis function neural network; and step 3: designing a self-adaptive learning controller by respectively applying a Backstepping method and a radial basis function neural network in the conditions of asymmetric dead zone input and continuous nonlinear input; and 4, step 4: analyzing the stability and convergence of the designed adaptive learning controller; and 5: a simulation example is given to illustrate the feasibility and effectiveness of the method of the present invention. When the initial state error and the unknown input nonlinearity exist simultaneously, the problem of inconsistent target tracking control of self-adaptive iterative learning of a strict feedback nonlinear system is solved.

Description

Design method of adaptive iterative learning non-uniform target tracking controller

Technical Field

The invention belongs to the technical field of adaptive iterative learning control, and relates to a design method of an adaptive iterative learning non-uniform target tracking controller.

Background

The self-adaptive iterative learning control method is a powerful method for solving the problem of trajectory tracking in a finite time interval of a nonlinear system, and the self-adaptive iterative learning control is combined with iterative learning control. According to the design of the adaptive learning law, the AILC theory can be divided into two main categories: along the iteration domain direction and the time domain direction, are also called discrete AILC and continuous AILC, of course. Adaptive control is often used to deal with uncertainties that can be estimated by learning, and most adaptive controllers involve some type of function approximator, such as Neural Networks (NNs) and Fuzzy Logic Systems (FLSs), in their learning mechanism. The self-adaptive NN and FLS controllers can solve the problem of stability of closed-loop systems of various control systems, and the problem of tracking variable control tasks is solved by applying the self-adaptive fuzzy ILC to an unknown nonlinear system. The AILC is designed with a variable wavelet decomposition, and the wavelet decomposition terms are increased with the increase of the iteration number, so that the tracking error is converged uniformly in a limited time interval. The AILC based on a complex energy function or a Lyapunov function plays an important role in processing the learning estimation of time-varying parameters by connecting the time domain and the iterative domain. The AILC of a non-linear parameterized system remains a disclosed problem. No research efforts have been found for the aicc of non-linear parametric systems. Although traditional D-type and P-type ILCs have good research results on this problem, it remains a public problem for AILC. Due to the physical limitations of actuators, non-linearities are always present in control inputs, such as dead band inputs and continuous non-linear inputs. The presence of non-linearities in the control input tends to undermine system performance. At present, the performance of such control systems is improved by combining different methods for some nonlinear systems such as the tracking control problem of nonlinear uncertain systems with input saturation, very general nonlinear input uncertainty, class of time-lag systems with matched nonlinear time-lag disturbance and asymmetric dead zone saturation input, and uncertain nonlinear parameterized systems with unknown input nonlinearity (NLP-systems). However, there is currently no mature research effort for AILC for nonlinear systems with unknown input nonlinearities and initial state errors. When a possibly large initial state error exists, how to use the adaptive iterative learning control to solve the problem of tracking the non-uniform target track of a nonlinear strict feedback system on [0, T ] is a challenging problem.

Disclosure of Invention

The invention aims to provide a design method of a self-adaptive iterative learning non-uniform target tracking controller, which can solve the problem of self-adaptive iterative learning non-uniform target tracking control of a strict feedback non-linear system when an initial state error and unknown input non-linearity exist simultaneously.

The technical scheme adopted by the invention is that a design method of a self-adaptive iterative learning non-uniform target tracking controller specifically comprises the following steps:

step 1, the following formula (1) is adopted to represent a continuous time nonlinear system:

wherein the content of the first and second substances,

Γ(u_k) E R represents the input-output characteristics of the actuator, y_kE, R refers to system output;

is a smooth unknown non-linear function; k represents an iteration index;

step 2, the time-varying boundary layer network is expressed by the following formula (2):

wherein phi is_i，k(t)＝ε_i，ke^-ηt，z_i,kAnd z_iφ,kIs a function variable of time t, epsilon_i,kIs a convergence series sequence, sat is a saturation function, defined as follows:

wherein phi is_i,k(t) is a time-varying boundary layer,. phi_i,k(t) decreasing along the time axis, selecting an initial state error phi at the kth iteration_i,k(0)＝ε_i,kThen 0 < epsilon_i,ke^-ηT≤φ_i,k(t)≤ε_i,k，

And 3, expressing the radial basis function neural network by adopting the following formula (4):

wherein the content of the first and second substances,

is a known smooth vector function, wherein the number l of NN nodes is more than 1; radial basis function

Is a gaussian function and is expressed by the following equation (5):

wherein, mu_jE.omega and eta > 0 are radial basis functions, respectively

Of (2) centerSum width, optimal weight vector W ═ ω₁,...,ω_l]^TIs defined as:

wherein the content of the first and second substances,

is an approximation error inherent to the NN;

and 4, designing the self-adaptive learning controller by respectively applying a Backstepping method and a radial basis function neural network in the conditions of asymmetric dead zone input and continuous nonlinear input.

The invention is also characterized in that:

the specific process of the step 4 is as follows:

step 4.1, use function Γ (u)_k) The actuator output with an asymmetric dead band is represented as shown in equation (7) below:

wherein the parameter m_rAnd m_lA right slope and a left slope representing the dead zone characteristic, respectively, parameter b_rAnd b_lA breakpoint representing a dead-zone input of the actuator;

and 4.2, modeling the asymmetric dead zone of the actuator into a form of the sum of a straight line and a disturbance-like term, wherein the form is shown in the following formula (8):

Γ(u_k)＝m(t)u_k+d(t) (8)；

wherein:

4.3, designing a controller for the uncertain nonlinear system with unknown asymmetric dead zone input and initial state error based on a self-adaptive iterative learning Backstepping frame;

and (4) performing a step (4.4),in the control input of a continuous-time nonlinear system of the type represented by equation (1), there is a nonlinearity which is assumed in the sector region s₁ s₂]A continuous non-linear function N (u)_k) And N (0) ═ 0, where s is₁And s₂Represents two straight lines (t)₁,t₂) Of (2) i.e.

And 4.5, designing a controller for the condition of continuous nonlinear input.

The specific process of the step 4.3 is as follows:

step A) let z_1,k＝x_1,k-y_r,k，z_2,k＝x_2,k-α_1,kIn which α is_1,kIs a virtual controller; because of the initial state error, according to the step 2 of representing the time-varying boundary layer, the following formula is adopted to respectively carry out the error function z_1φ,kAnd z_2φ,kTo show that:

wherein the content of the first and second substances,

wherein the content of the first and second substances,

ε_1,kand ε_2,kIs a positive series of convergent numbers, η₁And η₂Is a designed normal number;

because:

then z is_1φ,kThe derivative with respect to time is shown in the following equation (13):

step B) according to the representation of the radial basis function neural network in the step 3, the uniform approximation performance of the RBFNN shows that the unknown nonlinear function is

Approximated by RBFNN on a tight set to the following form, and with reconstruction error

Wherein the content of the first and second substances,

is the approximation error, W₁Is the optimal weight vector; for any real number a > 0 and positive integer l ≧ 2, take

Definition of

The virtual controller is taken as:

substituting equations (14) and (15) into (13) yields the following equation (16):

wherein the content of the first and second substances,

and

are respectively a parameter W₁And N₁Is estimated by the estimation of (a) a,

and

is the parameter estimation error, the last two terms to the right of the equal sign of equation (16) are rewritten as:

on the basis of the formula (17), the formula (16) is rewritten as the following formula (18):

order to

Then equation (18) is transformed into equation (19) as follows:

in step 4.3, define: z is a radical of_n,k＝x_n,k-α_n-1,kAccording to the step 2 representation of the time-varying boundary layer network, the following error function z is introduced_nφ,k：

Wherein the content of the first and second substances,

ε_n,kis a positive series of convergent numbers, η_nIs a designed normal number;

then z is_nφ,kThe derivative of (c) is:

wherein the content of the first and second substances,

defining:

z_nφ,kthe derivative with respect to time is:

according to the representation of the radial basis function neural network in the step 3, the RBFNN unified approximation performance shows that the unknown nonlinear function is represented

Wherein the content of the first and second substances,

is the approximation error, W_nIs the optimal weight vector;

get

Selecting the actual controller as

u_k＝u_1,k+u_2,k (26)；

Wherein u is_2,kTo compensate for the unknown input gain m (t); substituting equations (25) and (26) into (24) yields the following equation (27):

wherein the content of the first and second substances,

and

are respectively a parameter W_nAnd N_nIs estimated by the estimation of (a) a,

and

is the parameter estimation error, the last two terms to the right of the equality sign of equation (27) are rewritten as:

get

Wherein

Is an uncertain parameter

(ii) an estimate of (d);

order to

Then, the formula (29) becomes the following formula (30):

the specific process of the step 4.5 is as follows: with N (u)_k) As the output of the system instead of Γ (u)_k) Definition of z_n,k＝x_n,k-α_n-1,kAccording to the step 2 representation of the time-varying boundary layer network, the following error function z is introduced_nφ,k：

Wherein the content of the first and second substances,

ε_n,kis a positive series of convergent numbers, η_nIs a designed normal number; then z is_nφ,kIs a derivative of

Defining:

then equation (32) is rewritten as equation (36) below:

according to the representation of the radial basis function neural network in the step 3, the uniform approximation performance of the RBFNN shows that the unknown nonlinear function is

Wherein the content of the first and second substances,

is the approximation error, W_nIs the optimal weight vector; according to the formulas (36) and (37), the following formula (38) is obtained:

the method for designing the self-adaptive iterative learning non-uniform target tracking controller has the advantages that the radial basis function neural network is introduced to learn the performance of unknown dynamics, and a typical series is used for effectively eliminating approximation errors. The problem of initial state errors is solved by applying a time-varying boundary layer. The problem of non-linearity of two types of input, namely asymmetric dead zone input and continuous input is effectively solved. It can be shown that all signals of the closed loop system are bounded over a given time interval 0, T and that the state tracking error will asymptotically converge to an adjustable set of residuals as the iteration goes to infinity.

Drawings

FIG. 1 shows the I/Z for asymmetric dead zone input in simulation research of the design method of adaptive iterative learning inconsistent target tracking controller_1,kA graph of | | change with iteration index;

FIG. 2 is | | u for asymmetric dead zone input condition in simulation research of the design method of adaptive iterative learning inconsistent target tracking controller of the present invention_kA graph of | | change with iteration index;

FIG. 3 is a graph of asymmetric dead zone input conditions in simulation studies of a method for designing a non-uniform target tracking controller for adaptive iterative learning according to the present invention

A graph of variation with iteration index;

FIG. 4 shows asymmetric dead-zone input conditions in simulation studies of a method for designing a non-uniform target tracking controller according to the present invention

A graph of variation with iteration index;

FIG. 5 shows asymmetric dead-zone input conditions in simulation studies of a method for designing a non-uniform target tracking controller according to the present invention

A graph of variation with iteration index;

FIG. 6 is a graph of asymmetric dead zone input conditions in simulation studies of a method for designing a non-uniform target tracking controller for adaptive iterative learning according to the present invention

A graph of variation with iteration index;

FIG. 7 shows asymmetric dead-zone input conditions in simulation studies of a method for designing a non-uniform target tracking controller according to the present invention

A graph of variation with iteration index;

FIG. 8 is a plot of z for continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning according to the present invention_1,kA graph of | | change with iteration index;

FIG. 9 is a graph of | | | u for continuous nonlinear input conditions in simulation studies of a method for designing a non-uniform target tracking controller for adaptive iterative learning according to the present invention_kA graph of | | change with iteration index;

FIG. 10 is a graph of continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning in accordance with the present invention

A graph of variation with iteration index;

FIG. 11 is a graph of continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning in accordance with the present invention

A graph of variation with iteration index;

FIG. 12 is a graph of continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning in accordance with the present invention

A graph of variation with iteration index;

FIG. 13 is a graph of continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning in accordance with the present invention

A graph of variation with iteration index;

FIG. 14 is a graph of continuous nonlinear input conditions in simulation studies of a method of designing a non-uniform target tracking controller for adaptive iterative learning in accordance with the present invention

Graph of variation with iteration index.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

The invention relates to a design method of a self-adaptive iterative learning non-uniform target tracking controller, which specifically comprises the following steps: step 1, carrying out system description on a continuous time nonlinear system; consider a class of continuous-time nonlinear systems, as shown in equation (1) below:

wherein the content of the first and second substances,

is the state vector of the system that can be tested. Gamma (u)_k) E R represents the input-output characteristics of the actuator, y_kE R is the system output.

Is a smooth unknown non-linear function. k denotes an iteration index. The target is at [0, T]Upper design adaptive iterative learning control law u_k(t) such that the system output y is repeated_k(t) tracking the upper target trajectory y_r,k(t) and for some small positive error bound δ

And all signals of the closed loop system are guaranteed to be bounded. y is_r,k(t) represents a sufficiently smooth target trajectory.

Step 2, a time-varying boundary layer and a radial basis function neural networkThe description is carried out; 1) a time-varying boundary layer; to overcome the uncertainty of the initial state error, a new function z is introduced as follows_iφ,k：

Wherein z is_i,kAnd z_iφ,kIs a function variable of time t, epsilon_i,kIs a convergence series sequence, sat is a saturation function, defined as follows:

wherein phi is_i,k(t) is a time-varying boundary layer. Notice phi_i,k(t) decreasing along the time axis, selecting an initial state error phi at the kth iteration_i,k(0)＝ε_i,kThen 0 < epsilon_i,ke^-ηT≤φ_i,k(t)≤ε_i,k，

Assuming 1, since the initial state error for each iteration of the initial state error is not necessarily zero, small and fixed, epsilon is constant for some known normality _i,k1, m, assuming that the initial state error satisfies | z_i,k(0)||≤ε_i,kWherein z is_i,k(0) Is the initial state error of the state error at each iteration. Since z is easily obtained from equation (2) if it is assumed that 1 holds_iφ,k(0)＝0,

During the whole time interval [0, T]When η is increased properly, phi_i,k(t) will be as small as possible. If can prove that

Then the error function will satisfy that when k → ∞,

wherein

Is an arbitrarily small normal number, i.e., the control target is completed.

2) A radial basis function neural network; unknown smooth non-linear function

The approximation on the tight set Ω will be done by the following Radial Basis Function Neural Network (RBFNN):

wherein the content of the first and second substances,

is a known smooth vector function, where the number of NN nodes, l > 1. Radial basis function

Is selected as a commonly used gaussian function and

wherein mu_jE.omega and eta > 0 are radial basis functions, respectively

The center and the width of (c). The optimal weight vector W ═ ω₁,...,ω_l]^TIs defined as:

wherein the content of the first and second substances,

is an approximation error inherent to the NN, which can be arbitrarily reduced by increasing the value of the NN network node number l. Following approximation error

The following assumptions are made.

Suppose 2, on tight set Ω, the error is approximated

Is assumed to be bounded and

wherein the unknown parameter theta_i(1. ltoreq. i. ltoreq.n) represents

A minimum upper bound of where θ_i≥0。

And 3, respectively applying a Backstepping method and a radial basis function neural network to design the self-adaptive learning controller under the conditions of asymmetric dead zone input and continuous nonlinear input. Firstly, asymmetric dead zone input and continuous nonlinear input are described, and then corresponding controllers are respectively designed. 1) Asymmetric dead zone input; actuator dead band is common in mechanical connections, hydraulic servo valves, piezoelectric sensors, and electric servomotors. The presence of such non-linearities often undermines system performance. Step 3 will introduce an asymmetric dead zone input characteristic. Function Γ (u)_k) Representing actuator output with asymmetric dead band, as expressed below

Wherein the parameter m_rAnd m_lRespectively representing the right and left slopes of the dead zone characteristic. Parameter b_rAnd b_lRepresenting a breakpoint of the actuator dead band input. The asymmetric dead zone of the actuator can be modeled as a straight line sumThe sum of one class perturbation term is in the form of:

Γ(u_k)＝m(t)u_k+d(t) (7)；

wherein:

let 3, parameter m_l,m_r,b_lAnd b_rIs an unknown normal number. There is an unknown parameter v small enough that 0 < v ≦ min { m_l,m_r}. Upper bound of disturbance d (t)

Are also unknown constants.

2) Continuous non-linear input; due to the physical limitations of the actuators, the output of the actuators acting on the system is not accurate, i.e., there is a non-linearity in the control input to the system (equation 1). These non-linearities are defined by the fact that the field of the device is a sector [ s ]₁ s₂]A continuous non-linear function N (u)_k) And N (0) ═ 0, where s is₁And s₂Represents two straight lines (t)₁,t₂) Of (2) i.e.

Let 4, parameter s₁And s₂Is an unknown non-zero normal number.

3) Designing a controller under the condition of asymmetric dead zone input; and designing a controller for the uncertain nonlinear system with unknown asymmetric dead zone input and initial state error based on the adaptive iterative learning Backstepping framework.

Step 1, let z_1,k＝x_1,k-y_r,k，z_2,k＝x_2,k-α_1,kIn which α is_1,kIs a virtual controller. Because of the initial state error, the following error function z is introduced according to step 2 for the description of the time-varying boundary layer_1φ,kAnd z_2φ,k：

Wherein epsilon_1,kAnd ε_2,kIs a positive series of convergent numbers, η₁And η₂Is a design normal number. Because:

z_1φ,kthe derivative with respect to time is as follows:

according to the description of the radial basis function neural network in the step 2, the uniform approximation performance of the RBFNN shows that the unknown nonlinear function is

Can be approximated by RBFNN as the following form on a tight set, and has reconstruction error

Wherein the content of the first and second substances,

is an approximation error and W₁Is the optimal weight vector. For any real number a > 0 and positive integer l ≧ 2, take

Definition of

The virtual controller is taken as:

substituting equations (13) and (14) into (12) can give:

wherein the content of the first and second substances,

and

are respectively a parameter W₁And N₁Is estimated.

And

is the parameter estimation error. The last two terms on the right of the equation of equation (15) can be rewritten as:

according to equation (16), equation (15) is rewritten as follows:

order to

Then equation (17) becomes equation (18) as follows:

suppose 5, the residue term ω₁Is bounded and | ω₁|≤ω_M1Wherein ω is_M1Is a normal number. Assuming 5 is reasonable, there are two reasons: 1) according to the assumption of 2, the method,

is bounded; 2) as long as eta₂Is suitably large and is suitably large enough to be,

it can be small enough. Take the following non-negative function:

wherein, gamma is₁₁And Γ₂₁Is a symmetric positive definite matrix. V is given below_1,kAlong the derivative of system equation (18) with time:

step i (i is more than or equal to 2 and less than or equal to n-1), define

As will be given later. Let z_i+1,k＝x_i+1,k-α_i,kAnalogously to step 1, the following error function z is introduced according to the description of step 2 for the time-varying boundary layer_iφ,kAnd z_(i+1)φ,k：

Wherein epsilon_i,kAnd ε_(i+1),kIs a positive series of convergent numbers, η_iAnd η_i+1Is a design normal number. z is a radical of_iφ,kThe derivative with respect to time is as follows:

wherein the content of the first and second substances,

equation (23) can be rewritten as:

Wherein the content of the first and second substances,

is an approximation error and W_iIs the optimal weight vector. The virtual controller is taken as:

substituting equation (25) and equation (26) into equation (24) yields the following equation (27):

wherein the content of the first and second substances,

and

are respectively a parameter W_iAnd N_iIs estimated.

And

is the parameter estimation error. The last two terms to the right of the equal sign of equation (27) can be rewritten as:

order to

Then, the formula (27) becomes the following formula (29):

suppose 6, the residue term ω_iIs bounded and | ω_i|≤ω_MiWherein ω is_MiIs an unknown normal number. The following non-negative function was chosen:

v is given below_i,kDerivative with respect to time along system equation (29):

step n, define z_n,k＝x_n,k-α_n-1,kThe following error function z is introduced according to step 2 description of the time-varying boundary layer_nφ,k：

Wherein epsilon_n,kIs a positive series of convergent numbers, η_nIs a design normal number. Then z_nφ,kThe derivative of (c) is:

defining:

then z_nφ,kThe derivative with respect to time is:

Wherein the content of the first and second substances,

is an approximation error and W_nIs the optimal weight vector.

Get

Selecting the actual controller as

u_k＝u_1,k+u_2,k (38)；

Wherein u is_2,kTo compensate for the unknown input gain m (t). Substituting equation (37) and equation (38) into equation (36) yields the following equation (39):

wherein the content of the first and second substances,

and

are respectively a parameter W_nAnd N_nIs estimated.

And

is the parameter estimation error. The last two terms to the right of the equality sign of equation (39) can be rewritten as:

get

Wherein

Is an uncertain parameter

Is estimated.

Order to

Then the formula (41) becomes the following formula (42):

suppose 7, the residue term ω_nIs bounded and | ω_n|≤ω_Mn1Wherein ω is_Mn1Is an unknown normal number. The following adaptive iterative learning law is taken:

wherein, gamma is_1i,Γ_2i,Γ₃Is a learning law that needs to be designed. The following non-negative function was chosen:

v is given below_n,kAlong with the derivative of system equation (42) with respect to time, in conjunction with equations (43) through (45), the following equation (47) results:

for the setting of the initial estimate at each iteration, the following assumptions are given.

Assumption 8 for

When t is equal to 0, the first step is,

theorem 1, if controller equation (38) and parameter adaptation law equations (43) - (45) are applied to system equation (1) assuming 1-8 are satisfied, then all signals of the closed-loop system are bounded at [0, T ] and there are the following equations:

that is to say that the position of the first electrode,

then when k → ∞ is reached,

wherein

Is an arbitrarily small normal number. Here, the whole time interval [0, T]If let η₁Suitably large then phi_1,∞(t) may be arbitrarily small.

4) Designing a controller for the condition of continuous nonlinear input; in selecting a controller in an actual physical system, there is nonlinearity in the control input due to the limitations of the actuator. For controller design, the following arguments are used:

theorem 1, under the condition that assumption 4 is satisfied, for controller u_kAnd error z_nφ,kThe following inequality holds:

z_nφ,kN(u_k)≤sz_nφ,ku_k,s∈{s₁,s₂} (49)；

and (3) proving that: if u is multiplied simultaneously on both sides of equation (9)_kThen, the following formula (50) is obtained:

further, the inequality in the formula (50) is multiplied by each side

The following formula (51) is obtained:

the formula (51) is rewritten as the following formula (52):

s₁(z_nφ,ku_k)²≤z_nφ,k(z_nφ,ku_k)N(u_k)≤s₂(z_nφ,ku_k)² (52)；

if the controller u_kSatisfies z_nφ,ku_k0 or less, the following inequality holds:

z_nφ,kN(u_k)≤s₁z_nφ,ku_k (53)；

if the controller u_kSatisfies z_nφ,ku_kGreater than or equal to 0, the following inequality holds:

z_nφ,kN(u_k)≤s₂z_nφ,ku_k (54)；

obviously, the conclusion of lemma 1 holds. After the syndrome is confirmed. During the whole design process, use N (u)_k) As the output of the system instead of Γ (u)_k). Step 1, step i, i 2, n-1 is the same as in part 3) of step 3.

Wherein epsilon_n,kIs a positive series of convergent numbers, η_nIs a design normal number. Then z is_nφ,kThe derivative of (c) is:

defining:

equation (56) is rewritten as:

according to the description of the radial basis function neural network in the step 2, the uniform approximation performance of the RBFNN (radial basis function neural network) shows that the unknown nonlinear function is

Wherein the content of the first and second substances,

is an approximation error and W_nIs the optimal weight vector. From equation (57), equation (58) is rewritten as:

the following non-negative function was chosen:

wherein the content of the first and second substances,

and

are respectively a parameter W_n、N_nAnd

is estimated.

And

is the parameter estimation error. Gamma-shaped_1i,Γ_2i,Γ₄Is a learning gain that needs to be designed.

V is given below_n,kAlong the derivative of system equation (59) with respect to time, and substituting equation (49), then there is:

get

Selecting an actual controller as follows:

u_k＝u_1,k+u_2,k (62)；

wherein u is_2,kTo compensate for the unknown input gain s. Order to

Then there are:

get

Equation (63) is changed to the following equation:

suppose 9, the residue term ω_nIs bounded and | ω_n|≤ω_Mn2Wherein ω is_Mn2Is an unknown normal number. The following adaptive iterative learning law is taken:

in order to set the initial estimation at each iteration, the following assumptions are specifically given: assuming 10, for any k, when t is 0,

theorem 2, if hypothesis 2, hypothesis 4-hypothesis 6, hypothesis 9, and hypothesis 10 are satisfied, and the controller equation (62) and the parametric adaptation rate equations (65) - (67) are applied to the system equation (1), then all signals of the closed-loop system are bounded at [0, T ] and have:

that is to say that the position of the first electrode,

then when k → ∞ is reached,

wherein

Step 4, analyzing the stability and convergence of the self-adaptive learning controller designed in the step 3; 1) proof theorem 1;

it is proved that according to the description of the time-varying boundary layer in step 2, there are

Wherein z is_φ,k＝[z_1φ,k,z_2φ,k,...,z_nφ,k]^T. From equation (46), the following holds:

wherein:

substituting equation (45) into equation (70) yields:

defining:

then equation (71) can be rewrittenComprises the following steps:

by

To obtain

Then V_0,kIs bounded. And due to the fact that

By the formula (71) to

Is bounded, therefore

Is also bounded. And also

Therefore, the method comprises the following steps:

from equation (46), for any k,

substituting equation (47) results in the following equation (74):

from the formula (73),

is bounded. Due to Delta_kIs bounded, and T ∈ [0, T ]]Thus, therefore, it is

Is also bounded. From the above discussion, for any k, V_n,k(t) is bounded, then there is x_i,k，

And

is bounded. From equation (39), uk is bounded. As can be seen from the formula (29),

is bounded, therefore z_iφ,kIs consistently continuous, then by the Barbalt theorem, we get when k → ∞, z_iφ,k→ 0, that is, when k → ∞, z_1φ,k→ 0, by definition of Limit, for

Such that:

when k is greater than N, the number of the transition metal,

and is also provided with

Then when k → ∞ is reached,

so theorem proves that the method is good. 2) Theorem 2 is the same as the certification process of theorem 1.

Example (b): a simulation example is given to illustrate the feasibility and effectiveness of the method of the present invention.

Considering a class of mass-spring mechanical systems to illustrate what is presented in this sectionThe effectiveness of the controller is learned adaptively and iteratively. A mass

Attached to the wall by a spring and a slide on a horizontal smooth surface, i.e. the resistance caused by friction is assumed to be zero. The mass being influenced by an external force u_kCan be regarded as a control variable. Let y_kIs a displacement from a reference position. In the presence of asymmetric dead-zone inputs, the dynamic equations for the system are given below

Wherein t is ∈ [0,1 ]]，F_ms(. is) the restoring force of the spring. k denotes an iteration index. Definition of x_1,k＝y_k，

And is

Converting the system into a state space form:

the restoring force of the spring can be modeled as:

in the system, the selected parameter k is 1, a₀＝0，a₁＝a₂＝a₃＝a₄＝1，q＝4。

1. The case of asymmetric dead-zone input; the asymmetric dead zone is described as follows:

the control target is such that the output of the system (formula 74) is at [0, π → ∞ when k → ∞]Upper trace upper reference track y_r,k. Choose to haveThe same amplitude reference track y_r,k＝g_ksin (2 π t), choosing g when k is even for the case of non-uniform traces_k-0.1, when k is an odd number, g_k0.1. According to theorem 1, the adaptive iterative learning controller is selected as:

wherein the content of the first and second substances,

the parameter adaptive iterative learning law is given by equation (43), where η₁＝20，η₂＝50，

Γ₂₁＝1，

Γ₂₂＝1，Γ ₃1. The neural network consists of 31 neurons, with the centers of the basis functions uniformly covered [ -1,1 ]]The width of the basis function is chosen to be 1. The following parameters and initial values of the states and estimated parameters are selected:

the iteration number k is 30, and the simulation result is shown in fig. 1-fig. 7. It can be seen from fig. 1 that the tracking error can converge to zero. Further, fig. 2-7 show the control signal | | | u_k||，

In the interval [0, pi]Upper is bounded. The simulation results shown in fig. 1 to fig. 7 further prove the effectiveness of the control method of the controller designed by the design method of the adaptive iterative learning non-uniform target tracking controller provided by the invention.

2. Continuous non-linear outputThe case of entry; here, a continuous non-linear input N (u) is used_k) Replacing asymmetric dead-zone input Γ (u)_k). Function N (u)_k) Is described as N (u)_k)＝(0.5+0.1sin(u_k))u_k. Selecting reference tracks y having different amplitudes_r,k＝g_ksin (2 π t), choosing g when k is even for the case of non-uniform traces_k-0.1, when k is an odd number, g_k0.1. According to theorem 2, the adaptive iterative learning controller is selected as:

wherein the content of the first and second substances,

the parameter adaptive iterative learning law is given by equation (65), where η₁＝20，η₂＝50，

Γ₂₁＝1，

the simulation result is shown in fig. 8-14 by taking the iteration number k as 30. It can be seen from fig. 8 that the tracking error can converge toAnd (4) zero. Further, fig. 9-14 show the control signal | | | u_k||，

In the interval [0,1]Upper is bounded. The simulation results shown in fig. 8-fig. 14 further prove the effectiveness of the control method of the controller designed by the design method of the adaptive iterative learning non-uniform target tracking controller provided by the invention. In summary, the invention solves the problem of non-uniform target tracking control of adaptive iterative learning of a strict feedback nonlinear system when an initial state error and unknown input nonlinearity coexist. A backstepping method and a radial basis function neural network are utilized to solve the problem of trajectory tracking of a strict feedback nonlinear system. A radial basis function neural network is introduced to learn the performance of unknown dynamics, a typical series is utilized to effectively cancel approximation errors, and the problem of inconsistent target tracking is solved. The problem of initial state errors is solved by applying a time-varying boundary layer. The problem of non-linearity of two types of input, namely asymmetric dead zone input and continuous input is effectively solved. It can be shown that in a given time interval 0, T]All signals of the upper closed loop system are bounded and the state tracking error will asymptotically converge to an adjustable set of residuals as the iteration goes to infinity.

Claims

1. A design method of a self-adaptive iterative learning non-uniform target tracking controller is characterized by comprising the following steps: the method specifically comprises the following steps:

wherein the content of the first and second substances,

is a smooth unknown non-linear function; k represents an iteration index;

wherein phi is_i，k(t)＝εi_，ke^-ηt，z_i,kAnd z_iφ,kIs a function variable of time t, epsilon_i,kIs a convergence series sequence, sat is a saturation function, defined as follows:

wherein the content of the first and second substances,

is known lightA sliding vector function, wherein the number l of NN nodes is more than 1; radial basis function

Is a gaussian function and is expressed by the following equation (5):

wherein, mu_jE.omega and eta > 0 are radial basis functions, respectively

The optimal weight vector W ═ ω₁,...,ω_l]^TIs defined as:

wherein the content of the first and second substances,

is an approximation error inherent to the NN;

2. The design method of the adaptive iterative learning non-uniform target tracking controller according to claim 1, characterized in that: the specific process of the step 4 is as follows:

Γ(u_k)＝m(t)u_k+d(t) (8)；

wherein:

step 4.4, there is a non-linearity in the control input of a class of continuous time non-linear systems represented by equation (1), which is taken to be in the sector region [ s ]₁ s₂]A continuous non-linear function N (u)_k) And N (0) ═ 0, where s is₁And s₂Represents two straight lines (t)₁,t₂) Of (2) i.e.

3. The design method of the adaptive iterative learning non-uniform target tracking controller according to claim 2, characterized in that: the specific process of the step 4.3 is as follows:

step A) let z_1,k＝x_1,k-y_r,k，z_2,k＝x_2,k-α_1,kIn which α is_1,kIs a virtual controller; time ticks according to step 2 because of initial state errorsThe expression of the varying boundary layer is applied to the error function z by the following formula_1φ,kAnd z_2φ,kTo show that:

wherein the content of the first and second substances,

wherein the content of the first and second substances,

because:

Wherein the content of the first and second substances,

is the approximation error, W₁Is the optimal weight vector;

for any real number a > 0 and positive integer l ≧ 2, take

Definition of

The virtual controller is taken as:

wherein the content of the first and second substances,

and

and

order to

Then equation (18) is transformed into equation (19) as follows:

4. the design method of the adaptive iterative learning non-uniform target tracking controller according to claim 3, characterized in that: in the step 4.3:

defining: z is a radical of_n,k＝x_n,k-α_n-1,kAccording to the step 2 representation of the time-varying boundary layer network, the following error function z is introduced_nφ,k：

Wherein the content of the first and second substances,

then z is_nφ,kThe derivative of (c) is:

wherein:

defining:

z_nφ,kthe derivative with respect to time is:

Wherein the content of the first and second substances,

is the approximation error, W_nIs the optimal weight vector;

get

Selecting an actual controller as follows:

u_k＝u_1,k+u_2,k (26)；

wherein u is_2,kTo compensate for the unknown input gain m (t);

substituting equations (25) and (26) into (24) yields the following equation (27):

wherein the content of the first and second substances,

and

are respectively a parameter W_nAnd N_nIs estimated by the estimation of (a) a,

and

then:

get

Wherein

Is an uncertain parameter

(ii) an estimate of (d); order to

Then, the formula (29) becomes the following formula (30):

5. the design method of the adaptive iterative learning non-uniform target tracking controller according to claim 4, characterized in that: the specific process of the step 4.5 is as follows:

with N (u)_k) As the output of the system instead of Γ (u)_k) Definition of z_n,k＝x_n,k-α_n-1,kAccording to the step 2 representation of the time-varying boundary layer network, the following error function z is introduced_nφ,k：