CN111459031A

CN111459031A - Learning-oriented disturbance observer design method

Info

Publication number: CN111459031A
Application number: CN202010314920.6A
Authority: CN
Inventors: 池荣虎; 魏阳春; 姚文龙; 林娜; 惠宇
Original assignee: Qingdao University of Science and Technology
Current assignee: Qingdao University of Science and Technology
Priority date: 2020-04-21
Filing date: 2020-04-21
Publication date: 2020-07-28

Abstract

The invention discloses a learning-oriented disturbance observer design method, and belongs to the field of intelligent control. The invention mainly aims at the problem of total disturbance estimation of a repeatable system with non-repetitive external disturbance in a data-driven framework, and the scheme of a disturbance observer is as follows: establishing a system with non-repetitive disturbances; the system iteration is linearized into a linear data model, and the non-repetitive disturbance is converted into total disturbance; designing an iterative update algorithm to estimate a gradient matrix in the linear data model; aiming at a system with a measurable state, a learning disturbance observer based on the state is designed; and aiming at the system with an unmeasurable state, designing a learning disturbance observer based on output. The learning-oriented disturbance observer design method disclosed by the invention can estimate the total disturbance in the system under a data-driven framework, only utilizes a small amount of model information, has good adaptability, can learn from the running data of the previous batch, and can estimate the total disturbance more accurately.

Description

Learning-oriented disturbance observer design method

Technical Field

The invention belongs to the field of intelligent control, and particularly relates to a design method of a learning disturbance observer based on data driving.

Background

In industrial applications, the presence of disturbances is inevitable. The system disturbance comprises non-linear disturbance, time lag, sensor measurement noise, external disturbance and unknown input disturbance. Disturbances in the system can severely affect control performance. To account for disturbances in the system, a disturbance observer for estimating uncertainty can be designed and then incorporated into the controller to compensate for its effect on control performance.

At present, linear disturbance observers are well developed, and have many theoretical achievements and practical applications, such as frequency domain-based disturbance observers, reduced order disturbance observers, time-lag disturbance observers, generalized PI type observers and the like. On the other hand, since a nonlinear system is more common than a linear system in practical application, in recent years, many researches have been made for scholars at home and abroad on a nonlinear disturbance observer, such as a high-order disturbance observer, an extended high-gain state observer, and the like.

However, most disturbance observers, whether linear or non-linear, require known model information as a priori. In other words, these methods are model-based. However, as the actual process becomes more complex and larger, modeling the control object using physicochemical principles or recognition methods is actually too difficult. Therefore, the above-described model-based observer may encounter challenges and difficulties when applied to practical problems, and thus data-driven modeling, control, and optimization methods become more popular. On the other hand, many practical systems operate repeatedly at fixed times. Such as industrial robots, high speed trains, flow systems, multi-agent systems, etc. For such a repetitive system, the iterative learning control can use the control information in the previous trial for perfect tracking. How to estimate the non-repetitive disturbance in iterative learning control in a data-driven framework is a learning disturbance observer method based on data driving, which is a problem to be solved urgently in the field at present.

In order to estimate the non-repetitive disturbances in repeatable systems, it is necessary to design a learning disturbance observer that can estimate the repetitive disturbances independent of the system model.

Disclosure of Invention

The invention discloses a learning-oriented disturbance observer design method which aims to solve the technical problem that a designed learning disturbance observer can estimate the total disturbance of a system by using system operation data along an iteration axis under the condition that non-repeated disturbance exists in a repeatable system.

The purpose of the invention is realized by the following technical scheme:

the invention discloses a learning-oriented disturbance observer design method, which mainly aims at the problem of total disturbance estimation of a repeatable system with non-repetitive external disturbance in a data-driven framework.

The invention discloses a learning-oriented disturbance observer design method, which comprises the following steps:

step 1, establishing a system with non-repetitive disturbance:

(1) consider a discrete multiple-input multiple-output linear system with perturbations as follows:

wherein the content of the first and second substances,

x_k(t+1)∈Rⁿrepresenting the state of the system at the kth iteration time t + 1;

u_k(t)∈R^lrepresenting the control input of the system at the time t of the kth iteration;

d_k(t)∈Rⁿrepresenting the disturbance in the system at the time t of the kth iteration;

y_k(t)∈R^mrepresenting the output of the system at the time t of the kth iteration;

A(t)∈Rⁿ×ⁿ，B(t)∈Rⁿ×^l，C(t)∈R^m×ⁿrepresenting a state transition matrix of the system at the time t;

the system satisfies the following conditions:

initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀；

Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_d；

The system meets the condition of broad sense L ipschitz when △ u_k(t)≠0、△d_k(t)≠0、△x_k(t) ≠ 0, | | △ x_k(t+1)||≤b₁||△u_k(t)||+b₂||△d_k(t)||+b₃||△x_k(t)||；

Wherein the content of the first and second substances,

x₀is a constant vector;

b_d、b₁、b₂、b₃are all positive numbers;

△u_k(t)＝u_k(t)-u_k-1(t)，△d_k(t)＝d_k(t)-d_k-1(t)，△x_k(t)＝x_k(t)-x_k-1(t)；

(2) considering a discrete multiple-input multiple-output non-linear non-affine system with perturbation, the system is given as follows:

wherein the content of the first and second substances,

f(·)∈Rⁿrepresenting an unknown non-linear function;

the system satisfies the following conditions:

initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀；

Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_d；

Step 2, the system iteration with the non-repeated disturbance is linearized into a linear data model, and the non-repeated disturbance is converted into total disturbance in the linear data model:

wherein the content of the first and second substances,

Φ_k(t)∈R^n×m·(t+1)is an unknown gradient matrix;

U_k(t)＝[u_k(0)^T,u_k(1)^T,...,u_k(t)^T]^T∈R^m·(t+1)；

△U_k(t)＝U_k(t)-U_k-1(t)；

_k(t)∈Rⁿis the total perturbation due to non-repetitive perturbations;

step 3, designing an iterative update algorithm to estimate a gradient matrix in the linear data model:

(1) for a state measurable system, the iterative update algorithm is as follows:

wherein the content of the first and second substances,

is phi_k(t) an estimate of;

η∈ (0,2) is a step factor;

μ is a positive weighting factor;

(2) for a state-unmeasured system, the iterative update algorithm is as follows:

wherein the content of the first and second substances,

(MC(t+1))^L+represents the left inverse of the matrix MC (t +1), satisfies (MC (t +1))^L+MC(t+1)＝I；

M∈R^m×mIs a constant matrix; i represents an identity matrix;

and 4, designing a learning disturbance observer based on the state aiming at a system with a measurable state:

wherein the content of the first and second substances,

is that_k(t) an estimate of;

z_k(t)∈Rⁿis an intermediate state variable;

K＝(I_n-)∈Rⁿ×ⁿparameters of a disturbance observer are obtained;

＝diag{γ₁,γ₂,…,γ_nis a diagonal matrix; | gamma_i|<1；i＝1,2,…,n；

And 5, designing a learning disturbance observer based on output aiming at a system with an unmeasured state:

wherein the content of the first and second substances,

C(t+1)⁺representing momentsPseudo-inverse of the matrix C;

is a compensation term;

ξ_k(t) is an intermediate state variable;

W(t+1)＝V-Q(t+1)C(t+1)；

the matrix S (t), Q (t), R (t) satisfy (I)_h-R(t))(V-Q(t)C(t))-S(t)C(t)＝0。

Further, the iterative linearization of the system used in step 2 mainly includes the following steps:

step 2.1, using iterative dynamic linearization on the linear system described in step 1 to obtain:

the system is differentiated along an iteration axis and a differential median theorem is used, so that a linear data model with total disturbance can be obtained:

wherein the content of the first and second substances,

is an unknown gradient matrix;

is the total disturbance;

||Φ_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ，bare all positive numbers;

step 2.2, for the nonlinear system described in step 1, iterative dynamic linearization is used, so that:

wherein the content of the first and second substances,

g^t(. h) is a state transfer function, which is a complex function of f (·);

wherein the content of the first and second substances,

Φ_k(t) is an unknown gradient matrix;_k(t) total perturbation;

||Φ_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ、bare all positive numbers;

still further, in step 3, the design of the iterative update algorithm for estimating the unknown gradient matrix in the linear data model of the state-unmeasured system is mainly realized by the following method:

and 3.1, expressing the state in an output form because the state of the system which cannot be measured cannot be directly obtained:

wherein, (MC (t +1))^L+Represents the left inverse of the matrix MC (t +1), satisfying:

(MC(t+1))^L+MC(t+1)＝I；

and 3.2, converting the state in the iterative update algorithm aiming at the state measurable system in the step 3 into an output form by using the method, thereby obtaining the iterative update algorithm aiming at the state undetectable system:

in a still further aspect of the present invention,

the design of the learning disturbance observer based on the output aiming at the state-immeasurable system in the step 5 mainly comprises the following steps:

and 5.1, aiming at the system with the unmeasured state, representing the state in the learning disturbance observer based on the state in the step 4 in an output form:

wherein the content of the first and second substances,

C(t+1)⁺y_k-1(t +1) is used instead of the system state x_k(t)；

C(t+1)⁺Represents the pseudo-inverse of matrix C;

KN(t+1)x_k-1(t +1) is a compensation term;

N(t+1)＝I_n-C(t+1)⁺C(t+1)；

step 5.2, the unknown quantity KN (t +1) x is added_k-1(t +1) matrix-transforming to estimate:

KN(t+1)＝H(t+1)V；

wherein, V ∈ R^h×nIs a constant matrix, H (t +1) ∈ R^n×h；h＝rank(KN(t+1))；

Thus, unknown KN (t +1) x_k-1(t +1) transforms:

KN(t+1)x_k-1(t+1)＝H(t+1)Vx_k-1(t+1)；

unknown quantity Vx_k-1(t +1) is defined as an unknown state variable β_k(t)∈R^h；

Step 5.3, designing an estimator for the unknown variables β_k(t) estimating;

wherein the content of the first and second substances,

representative β_k(t) estimation;

ξ_k(t)∈R^his an intermediate state variable;

W(t+1)＝V-Q(t+1)C(t+1)；

matrix S (t) ∈ R^h×m，Q(t)∈R^h×m，R(t)∈R^h×hSatisfies the following conditions:

(I_h-R(t))(V-Q(t)C(t))-S(t)C(t)＝0；

thus, an output-based learning disturbance observer is obtained:

has the advantages that:

1. the invention discloses a learning-oriented disturbance observer design method, which utilizes a data driving method, particularly establishes a linear data model containing total disturbance by using input, state and output data of a repetitive system, wherein the total disturbance comprises all influences of the disturbance from an initial moment to a current moment between two continuous iterations on the state and the output of the system, and the dependence of the method on the system model is reduced.

2. The invention discloses a learning-oriented disturbance observer design method, which considers a state measurable system and a state immeasurable system, provides a learning disturbance observer based on a system state and a learning disturbance observer based on system output, and both the two proposed disturbance observers are pertinently executed by using system data of previous tests along an iteration direction, so that the running data of previous batches can be effectively utilized, and the estimation of the total disturbance in a linear data model is more accurate along with the increase of iteration times.

Other features and advantages of the present invention will become more apparent from the following detailed description of the invention when taken in conjunction with the accompanying drawings.

Drawings

FIG. 1 is a flow chart of a learning-oriented disturbance observer design method proposed by the present invention;

FIG. 2 is a tracking curve of a state-based learning disturbance observer proposed by the present invention;

FIG. 3 is an estimation error curve of a state-based learning disturbance observer according to the present invention;

FIG. 4 is a comparison curve of the estimation error of the learning disturbance observer and the reduced order disturbance observer based on the state proposed by the present invention;

FIG. 5 is a comparison curve of the estimation error of the learning disturbance observer and the reduced order disturbance observer based on the state proposed by the present invention;

FIG. 6 is a tracking curve of the learning disturbance observer based on output proposed by the present invention;

FIG. 7 is an estimated error curve of the learning disturbance observer based on output proposed by the present invention;

FIG. 8 is a comparison curve of the estimated error of the learning disturbance observer and the reduced order disturbance observer based on the output proposed by the present invention;

FIG. 9 is a comparison curve of the estimated error of the learning disturbance observer and the reduced order disturbance observer based on the output proposed by the present invention;

Detailed Description

To better illustrate the objects and advantages of the present invention, the following is a comparison with the conventional reduced order disturbance observer method, and the detailed description is further provided in conjunction with the embodiments and the accompanying drawings.

The invention mainly aims at the problem of total disturbance estimation of a repeatable system with non-repeated external disturbance, and establishes a linear data model containing the total disturbance by using a data-driven method and particularly by using input, state and output data of the repeatable system under a data-driven framework, wherein the non-repeated disturbance is expressed by the total disturbance in the linear data model. In order to estimate the total disturbance, two conditions of measurable state and non-measurable state are considered, and a learning disturbance observer based on the system state and a learning disturbance observer based on the system output are provided. Next, a detailed description will be given of a specific embodiment of the disturbance observer design method for the orientation learning.

Referring to fig. 1, a learning-oriented disturbance observer design method disclosed in this embodiment includes the following steps:

step S1: a system with non-repetitive perturbations is established.

(1) For a discrete multiple-input multiple-output linear system with perturbation, the following system is given:

wherein x is_k(t+1)∈RⁿRepresenting the state of the system at the kth iteration time t + 1; u. of_k(t)∈R^lRepresenting the control input of the system at the time t of the kth iteration; d_k(t)∈RⁿRepresenting the disturbance in the system at time t of the kth iteration yk (t) ∈ R^mRepresenting the output of the system at time t of the kth iteration A (t) ∈ Rⁿ×ⁿ，B(t)∈Rⁿ×^l，C(t)∈R^m×nRepresenting the state transition matrix of the system at time t.

The system isThe system satisfies the following conditions: initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀(ii) a Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_dThe system meets the general L ipschitz condition when △ u_k(t)≠0、△d_k(t)≠0、△x_k(t) ≠ 0, | | △ x_k(t+1)||≤b₁||△u_k(t)||+b₂||△d_k(t)||+b₃||△x_k(t)||。

Wherein x is₀Is a constant vector; b_d、b₁、b₂、b₃Are positive numbers △ u_k(t)＝u_k(t)-u_k-1(t)，△d_k(t)＝d_k(t)-d_k-1(t)，△x_k(t)＝x_k(t)-x_k-1(t)。

(2) For a discrete multiple-input multiple-output non-linear non-affine system with perturbation, the following system is given:

wherein, f (-) ∈ RⁿRepresenting an unknown non-linear function.

The system satisfies the following conditions: initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀(ii) a Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_dThe system meets the general L ipschitz condition when △ u_k(t)≠0、△d_k(t)≠0、△x_k(t) ≠ 0, | | △ x_k(t+1)||≤b₁||△u_k(t)||+b₂||△d_k(t)||+b₃||△x_k(t)||。

Step S2: and (3) iteratively linearizing the system with the non-repeated disturbance into a linear data model, and converting the non-repeated disturbance into total disturbance in the linear data model.

Specifically, the method comprises the following steps:

step S21: for the linear system described in step 1, iterative dynamic linearization is used, such that:

wherein the content of the first and second substances,

is an unknown gradient matrix;

is the total disturbance; i phi_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ、bAre all positive numbers.

Step S22: for the nonlinear system described in step 1, iterative dynamic linearization is used such that:

wherein, g^t(. cndot.) is a state transfer function that is a complex function of f (. cndot.).

wherein phi_k(t) is an unknown gradient matrix;_k(t) total perturbation; i phi_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ、bAre all positive numbers.

Step S3: an iterative update algorithm is designed to estimate the gradient matrix in the linear data model.

Specifically, the method comprises the following steps:

s31: for a state measurable system, the iterative update algorithm is as follows:

wherein the content of the first and second substances,

is phi_k(t) estimate, η∈ (0,2) is a step factor, and μ is a positive weighting factor.

S32: for a state-undetectable system, since the state of the state-undetectable system is not directly available, the state is represented in output form:

wherein, (MC (t +1))^L+Represents the left inverse of the matrix MC (t +1), satisfies (MC (t +1))^L+MC (t +1) ═ I; i denotes an identity matrix.

S33: transforming states in an iterative update algorithm for a state-measurable system to an output form, thereby resulting in an iterative update algorithm for a state-untestable system:

step S4: aiming at a system with a measurable state, a learning disturbance observer based on the state is designed to estimate the total disturbance.

Wherein the content of the first and second substances,

is that_k(t) an estimate of; z is a radical of_k(t)∈RⁿIs an intermediate state variable; k ═ I (I)_n-)∈R^n×nParameters of a disturbance observer are obtained; biag { γ ═₁,γ₂,...,γ_nIs a diagonal matrix; | gamma_i|<1；i＝1,2,...,n。

Step S5: and aiming at the system with an unmeasurable state, designing a learning disturbance observer based on output.

Specifically, the method comprises the following steps:

step S51: for a system whose state is not measurable, the state in the state-based learning disturbance observer in step 4 is represented in output form:

wherein, C (t +1)⁺y_k-1(t +1) is used instead of the system state x_k(t)；C(t+1)⁺Represents the pseudo-inverse of matrix C; KN (t +1) x_k-1(t +1) is a compensation term; n (t +1) ═ I_n-C(t+1)⁺C(t+1)。

Step S52: mixing the unknown quantity KN (t +1) x_k-1(t +1) performing matrix transformation:

KN(t+1)＝H(t+1)V (14)

wherein, V ∈ R^h×nIs a constant matrix, H (t +1) ∈ Rⁿ×^h；h＝rank(KN(t+1))。

Thus, unknown KN (t +1) x_k-1(t +1) transforms:

KN(t+1)x_k-1(t+1)＝H(t+1)Vx_k-1(t+1) (15)

unknown quantity Vx_k-1(t +1) is defined as an unknown state variable β_k(t)∈R^h。

Step S53, designing an estimator for unknown variables β_k(t) estimating:

wherein the content of the first and second substances,

representative β_k(t) estimation ξ_k(t)∈R^hIs an intermediate state variable;

W(t +1) ═ V-Q (t +1) C (t +1), matrix S (t) ∈ R^h×m，Q(t)∈R^h×m，R(t)∈R^h×hSatisfies the following conditions:

(I_h-R(t))(V-Q(t)C(t))-S(t)C(t)＝0 (18)

thus, an output-based learning disturbance observer is obtained:

the system with perturbations employed in the examples is as follows:

where, t ∈ {0., N }, N ═ 80, the state transition matrix of the system is:

the system disturbance is set as:

since the total disturbance of the system is difficult to obtain, the system state with the estimated disturbance is used, the estimated value of the output is compared with the actual value for verification, and a linear data model with the estimated gradient parameter vector and the estimated total disturbance is used as an estimation model of the system state and the output, as follows:

wherein the content of the first and second substances,

is to x_k(t) estimating the value of the average of the measured values,

is as a pair y_k(t) estimation.

In the described embodiment, the control input is set to u_k(t)＝0.5sin(t/k)。

When the system state is available, the parameters are set to η ═ 1, μ ═ 1, ═ diag {0.1,0.3}, K ═ diag {0.9,0.7}, and M ═ I_nThe proposed state-based learning disturbance observer (fig. 2 and 3) is used. For comparison, the same parameters were used with a state-based learning disturbance observer and a state-based reduced order disturbance observer (fig. 4 and 5).

Wherein, the state-based reduced order disturbance observer is:

when the system state is not available, the parameters are set to η ═ 1, μ ═ 1, diag {0.1,0.3}, K ═ diag {0.9,0.7}, and M ═ I_n，V＝[1 -1]The proposed output-based learning disturbance observer (fig. 6 and 7) is used. For comparison, the same parameters were used with the output-based learning disturbance observer and the state-based reduced order disturbance observer (fig. 8 and 9).

Wherein the output-based reduced order disturbance observer is:

H₁(t)V＝KN(t)，H₂(t)V＝K(A-I_n)N(t)，W_u(t)＝(V-Q(t)C(t))B(t)，W_d(t) V-q (t) c (t), q (t), r (t), s (t) satisfy:

(V-Q(t)C(t))B(t)-R(t)(V-Q(t)C(t))-S(t)C(t)＝0 (31)

as can be seen from fig. 2 and 3: when the iteration times are small, the state-based learning disturbance observer has a poor estimation effect on the total disturbance and a large estimation error, but as the iteration times are increased, the state estimation error of the state-based learning disturbance observer is smaller and smaller, and the performance in estimating the total disturbance is good.

As can be seen from fig. 4 and 5: when the iteration times are smaller, the estimation effect of the state-based reduced order disturbance observer is better than that of the state-based learning disturbance observer, however, the estimation effect of the state-based learning disturbance observer can be improved along with the increase of the iteration times, and after a certain iteration times is reached, the estimation effect of the state-based learning disturbance observer is better than that of the reduced order disturbance observer.

As can be seen from fig. 6 and 7: when the iteration times are small, the estimation effect of the learning disturbance observer based on the output is poor, the maximum estimation error of each iteration is large, however, with the increase of the iteration times, the estimation error of the learning disturbance observer based on the output is smaller and smaller, and the total disturbance can be effectively estimated.

As can be seen from fig. 8 and 9: when the iteration times are smaller, the estimation effect of the reduced order disturbance observer is better than that of the learning disturbance observer based on the output, however, the learning disturbance observer based on the output can utilize the running data of the previous batch, so that after a certain iteration times is reached, the estimation effect of the learning disturbance observer based on the output is better than that of the reduced order disturbance observer.

In the embodiment, the system state of the estimated disturbance, the output estimated value and the estimation error are mainly used as performance parameters, and fig. 2 shows the estimation effect curves of the learning disturbance observer based on the state in the

iterations

4, 10, 30 and 80. FIG. 3 shows a maximum estimation error curve for each iteration using a state-based learning disturbance observer. Fig. 4 and 5 show maximum estimation error contrast curves using a state-based learning disturbance observer and a reduced order disturbance observer. Fig. 6 shows the estimated effect curves at

iterations

4, 10, 30, 80 using an output-based learning perturbation observer. FIG. 7 presents a maximum estimation error curve using an output-based learning disturbance observer. Fig. 8 and 9 present maximum estimation error contrast curves using an output-based learning disturbance observer and a reduced order disturbance observer. The figure shows that the learning-oriented disturbance observer method provided by the invention can well estimate the total disturbance in a data-driven framework, and compared with a reduced-order disturbance observer method, the learning disturbance observer provided by the invention can reduce the estimation error along with the increase of iteration times, is more suitable for a repeatable system, only utilizes a small amount of model information, and has better adaptability.

The above detailed description further illustrates the objects, technical solutions and advantages of the present invention, and it should be understood that the embodiments are only used for explaining the present invention and not for limiting the scope of the present invention, and modifications, equivalent substitutions, improvements and the like under the same principle and concept of the present invention should be included in the scope of the present invention.

Claims

1. A learning-oriented disturbance observer design method is characterized by comprising the following steps: the method comprises the following steps:

step 1, establishing a system with non-repetitive disturbance:

(1) consider a discrete multiple-input multiple-output linear time-varying system with perturbations as follows:

wherein the content of the first and second substances,

A(t)∈R^n×n，B(t)∈R^n×l，C(t)∈R^m×nrepresenting a state transition matrix of the system at the time t;

the system satisfies the following conditions:

initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀；

Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_d；

Wherein the content of the first and second substances,

x₀is a constant vector;

b_d、b₁、b₂、b₃are all positive numbers;

(2) consider a discrete multiple-input multiple-output non-linear non-affine system with perturbation as follows:

wherein the content of the first and second substances,

f(·)∈Rⁿrepresenting an unknown non-linear function;

the system satisfies the following conditions:

initial state x of system_k(0) Not changing, i.e. x_k(0)＝x₀；

Disturbance d of the system_k(t) is bounded, i.e. | | d_k(t)||≤b_d；

wherein the content of the first and second substances,

Φ_k(t)∈R^n×m·(t+1)is an unknown gradient matrix;

U_k(t)＝[u_k(0)^T,u_k(1)^T,...,u_k(t)^T]^T∈R^m·(t+1)；

△U_k(t)＝U_k(t)-U_k-1(t)；

_k(t)∈Rⁿis the total perturbation due to non-repetitive perturbations;

wherein the content of the first and second substances,

is phi_k(t) an estimate of;

η∈ (0,2) is a step factor;

μ is a positive weighting factor;

wherein the content of the first and second substances,

M∈R^m×mis a constant matrix;

I represents an identity matrix;

wherein the content of the first and second substances,

is that_k(t) an estimate of;

z_k(t)∈Rⁿis an intermediate state variable;

K＝(I_n-)∈R^n×nparameters of a disturbance observer are obtained;

wherein the content of the first and second substances,

C(t+1)⁺represents the pseudo-inverse of matrix C;

is a compensation term;

ξ_k(t) is an intermediate state variable;

W(t+1)＝V-Q(t+1)C(t+1)；

the matrix S (t), Q (t), R (t) satisfy (I)_h-R(t))(V-Q(t)C(t))-S(t)C(t)＝0。

2. The method of claim 1, further comprising: the iterative linearization process of the step 2 mainly comprises the following steps:

step 2.1, for the linear system described in step 1, iterative dynamic linearization is used, so that:

the system is differentiated along an iteration axis and a linear data model can be obtained by using a differential median theorem:

wherein the content of the first and second substances,

is an unknown gradient matrix;

is the total disturbance;

||Φ_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ、bare all positive numbers;

wherein the content of the first and second substances,

g^t(. h) is a state transfer function, which is a complex function of f (·);

wherein the content of the first and second substances,

Φ_k(t) is an unknown gradient matrix;_k(t) total perturbation;

||Φ_k(t)||≤b_Φ、||_k(t)||≤b；b_Φ，bare all positive numbers.

3. The method of claim 1, further comprising: in step 3, for the state-undetectable system, the design of the iterative update algorithm for estimating the gradient matrix in the linear data model mainly comprises the following steps:

wherein the content of the first and second substances,

I represents an identity matrix;

and 3.2, converting the state in the iterative update algorithm aiming at the state measurable system in the step 3 into an output form, thereby obtaining the iterative update algorithm aiming at the state undetectable system:

。

4. the method of claim 1, further comprising: for the state-undetectable system in step 5, the design of the learning disturbance observer based on output mainly comprises the following steps: