CN117872753A - Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming - Google Patents

Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming Download PDF

Info

Publication number
CN117872753A
CN117872753A CN202410022633.6A CN202410022633A CN117872753A CN 117872753 A CN117872753 A CN 117872753A CN 202410022633 A CN202410022633 A CN 202410022633A CN 117872753 A CN117872753 A CN 117872753A
Authority
CN
China
Prior art keywords
layered
formula
follows
vector
constructing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410022633.6A
Other languages
Chinese (zh)
Inventor
张颖伟
张浩岩
袁平
Original Assignee
东北大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 东北大学 filed Critical 东北大学
Priority to CN202410022633.6A priority Critical patent/CN117872753A/en
Publication of CN117872753A publication Critical patent/CN117872753A/en
Pending legal-status Critical Current

Links

Landscapes

  • Feedback Control In General (AREA)

Abstract

The invention provides an underactuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming, which comprises the steps of firstly establishing an underactuated system dynamics model under a nominal form; secondly, constructing a layered sliding film surface containing state information; then constructing a cost function and an HJB equation based on the layered synovial membrane surface and solving an optimization control strategy by a gradient descent method; finally, constructing an evaluation network and an execution network to evaluate the optimization control law executed each time so as to realize the optimization control target of the underactuated system; the effectiveness of the proposed control method is verified through numerical simulation of the underactuated system; an advantageous tool is provided for analyzing the problems of the optimal control series of the underactuated system in the control field, and the reliability of the controlled system can be enhanced to a certain extent.

Description

Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming
Technical Field
The invention belongs to the technical field of under-actuated system optimal control, and particularly relates to an under-actuated system layered synovial membrane optimal control method based on self-adaptive dynamic programming.
Background
An under-actuated system is a typical unstable dynamic system. Because of the flexible operation space, the method has the characteristics of underactuation, nonlinearity, unstable open loop and the like, the method becomes a hot spot research problem in the control field and is widely applied to aerospace, robot control systems and the like. In recent years, controllers designed based on various classical control theory and modern control theory methods make the dynamic characteristics of underactuated systems well reflected.
As a special nonlinear control method, synovial membrane variable structure control has been widely studied in recent years. It is unique in that the structure is not fixed when constructing the synovial surface, but rather the controlled system state can be forced to move according to a predetermined sliding mode and in the process is insensitive to parametric perturbation and external disturbances. Therefore, the synovial membrane variable structure control has the characteristics of good robustness and the like. However, with the continuous development of industrial processes, conventional single-layer synovial membrane variable structure control has failed to meet practical requirements, and thus, synovial membrane control having a layered structure has been attracting attention in recent years. Each sub-synovial surface of the layered synovial surface is constructed according to the state of each subsystem, and the topmost synovial surface is composed of the sub-synovial surfaces of each layer, i.e. the topmost synovial surface contains all system state information. In the case of an under-actuated system, each layer of sub-slip film face can properly correspond to each subspace of the under-actuated system, and thus, layered slip film control can effectively solve under-actuated system related control problems.
Adaptive dynamic programming provides an efficient technique for dealing with optimal decision and control problems without linear assumptions and uncertain random conditions by employing approximation structures, such as neural networks, to produce an online approximation of the optimal solution by recursive numerical methods. In particular, adaptive dynamic programming combines reinforcement learning ideas with traditional dynamic programming to approximate the solution of the Hamilton-Jacobi-Bellman (HJB) equation. For linear time-varying cases, the HJB equation can evolve into an algebraic Riccati equation, and the well-known iterative solution strategy is proposed by converting the algebraic Riccati equation into a series of linear lisapunov equations. Along this direction, adaptive dynamic programming iterative solution strategies have gradually been extended and applied to nonlinear time-varying situations.
Disclosure of Invention
In order to solve the problem of optimal control of the underactuated system, the invention provides an underactuated system layered synovial membrane optimal control method based on self-adaptive dynamic programming. By introducing a layered slip film surface, the system response rate is improved. And (3) designing an optimization controller for enhancing the dynamic performance of the under-actuated system, and ensuring the optimization performance of the under-actuated system by adopting a self-adaptive dynamic programming method.
The underactuated system layered sliding film optimal control method based on the self-adaptive dynamic programming specifically comprises the following steps:
s1, establishing an under-actuated system dynamics model under a nominal form;
an augmented state variable x= [ X ] defining an underactuated system 1 ,x 2 ,x 3 ,x 4 ] T And the order state components are x respectively 1 ,x 2 ,x 3 And x 4 The under-actuated system dynamics model is formulated as:
wherein p is 1 (X) and p 2 (X) is a system dynamic function; q 1 (X) and q 2 (X) is a control gain function; u is a control input;
s2, constructing a layered sliding film surface containing state information based on an underactuated system dynamics model;
s21, constructing a sub-synovial surface vector S as follows:
s=[s 1 ,0,s 2 ,0] T =HX o +MX e (2)
wherein s is i =h i x 2i-1 +x 2i I=1, 2, the i-th layer sub-slip film surface; h=diag (H 1 ,1,h 2 1) is a diagonal matrix, h 1 And h 2 Are all diagonal elements; vector X o And X e Respectively defined as X o =[x 1 ,0,x 3 ,0] T And X e =[0,x 2 ,0,x 4 ] T The method comprises the steps of carrying out a first treatment on the surface of the The matrix M is:
s22, based on the formula (2), constructing a layered slide film surface vector S as follows:
S=[S 1 ,0,S 2 ,0] T =s+ZS (3)
wherein S is i =s i +z i-1 S i-1 I=1, 2, indicating the i-th layered slide film surface; z=diag (Z) 0 ,1,z 1 1) is a diagonal matrix, z 0 And z 1 Are all diagonal elements; vector quantitySIs defined asS=[S 0 ,0,S 1 ,0] T ,S 0 And S is 1 Are all vectorsSIs an element of (2);
s23, obtaining an ith layered slide film surface S through iterative calculation based on formulas (2) and (3) i The following is shown:
wherein,if->Parameters->Otherwise, go (L)>
S24, based on equation (2), equation (3) and equation (4), the layered synovial surface vector S is re-expressed as follows:
wherein the matrixThe method comprises the following steps:
s25, deriving the following result from the formula (5):
wherein the vector isP (X) and U (t) are each +.>P(X)=[0,p 1 (X),0,p 2 (X)] T And u= [0, U ]] T The method comprises the steps of carrying out a first treatment on the surface of the Vector Q (X) is:
s26, based on a hierarchical structure, the control input u is designed as follows:
wherein i is less than or equal to i; u (u) eq,i And u sw,i Respectively representing an equivalent control law and a switching control law of the ith layered slide film surface;
s27, based on the formula (7), the vector U is re-formulated as:
U=Ξ(u eq +u sw ) (8)
wherein the vector u eq And u sw Respectively isAnd->The matrix xi is:
s3, constructing a cost function and an HJB equation based on the layered sliding film surface based on the step S2, and solving an optimal control strategy through a gradient descent method;
s31, constructing a cost function based on the layered sliding film surface by utilizing the principle of the optimality of BellmanThe following are provided:
wherein A and B are positive arrays; alpha is a normal constant; the |x|| represents the norm of the augmented state variable X; f (u) sw ) Is a positive definite function and is defined as:
wherein β is a normal constant; omega i Represents an integral variable;is a monotonic odd function;
s32, constructing a Hamiltonian based on a layered sliding film surface based on the formula (9), wherein the Hamiltonian is as follows:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2);
s33, an HJB equation of the structural formula (11) is as follows:
wherein,a cost function for optimization; />Is->Is a gradient of (2); />Represents->Is a transpose of (2); />And->Respectively an optimized equivalent control law vector and a switching control law vector;
s34, based on the formulas (11) and (12), and by solving partial differential equationAnd->Obtaining:
s4, constructing an evaluation network and an execution network, and evaluating an optimization control strategy executed each time to realize an optimization control target of the underactuated system;
s41, optimizing a cost function by using general approximation performance of a radial basis function neural networkIs approximated as follows:
wherein W is * Is a bounded weight, (W) * ) T Is W * Is a transpose of (2);representing an activation function; />Is an approximation error;
s42, based on the formula (15), an optimized cost functionGradient of->Is formulated as:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2); />Is->Is a gradient of (2);
s43, based on the formula (16),and->Is re-expressed as follows:
s44 due to bounded weight W * Is unknown, and therefore,is estimated as follows:
wherein,is->Is determined by the estimation of (a); />Is W * Is determined by the estimation of (a); />Is->Is a transpose of (2);
s45, based on the formula (19),is formulated as:
wherein,is->Is a gradient of (2);
s46, based on the formula (20),and->The estimated form of (a) is:
wherein,and->Respectively->And->Is determined by the estimation of (a); />To perform the weights of the network;
s47, based on the formula (21) and the formula (22), the estimation form of the HJB equation is as follows:
wherein, to evaluate the weight of the network;
s48, based on the formula (12) and the formula (23), the bellman residual is defined as follows:
s49, in order to ensure minimum Belman residual error, an objective function e is defined as follows:
s50, designing and evaluating a network update law based on the formula (25) by using a standardization principle and a gradient descent method, wherein the network update law is as follows:
wherein, xi c To evaluate the network update rate;represents->Is a transpose of (2);
s51, to ensure system stability, the execution network update law is designed to:
wherein epsilon and delta are both normal constants;J T represents the transpose of J; gamma ray a To perform network update rates.
The invention has the beneficial technical effects that:
aiming at the underactuated system, the invention provides a layered synovial membrane optimization control method based on self-adaptive dynamic programming. By introducing a layered synovial surface containing all state information, system robustness is improved. And constructing a cost function and an HJB equation based on the layered synovial membrane surface by utilizing the Bellman optimality principle, and solving a non-explicit optimization control strategy by a gradient descent method. The neural network is adopted to effectively approximate the non-explicit optimization control strategy, the execution network is constructed to tune the control strategy, the evaluation network is constructed to evaluate the control strategy after each tuning, and then the evaluation network is fed back to the execution network to realize the optimization control target of the underactuated system.
Drawings
FIG. 1 is a flow chart diagram of an underactuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming;
FIG. 2 is a graph of state variable response in a simulation of an embodiment of the present invention;
FIG. 3 is a graph of response of a layered synovial membrane variable in a simulation of an embodiment of the present invention;
FIG. 4 is a diagram showing the implementation of network weight updating in a simulation of an embodiment of the present invention;
FIG. 5 is a diagram showing the update of the evaluation network weight in the simulation according to the embodiment of the present invention.
Detailed Description
The invention will be further described with reference to the accompanying drawings and examples;
the under-actuated system layered sliding film optimal control method based on the self-adaptive dynamic programming is shown in the attached figure 1, and specifically comprises the following steps:
s1, establishing an under-actuated system dynamics model under a nominal form;
an augmented state variable x= [ X ] defining an underactuated system 1 ,x 2 ,x 3 ,x 4 ] T And the order state components are x respectively 1 ,x 2 ,x 3 And x 4 The under-actuated system dynamics model is formulated as:
wherein p is 1 (X) and p 2 (X) is a system dynamic function; q 1 (X) and q 2 (X) is a control gain function; u is a control input;
s2, constructing a layered sliding film surface containing state information based on an underactuated system dynamics model;
s21, constructing a sub-synovial surface vector S as follows:
s=[s 1 ,0,s 2 ,0] T =HX o +MX e (2)
wherein s is i =h i x 2i-1 +x 2i I=1, 2, the i-th layer sub-slip film surface; h=diag (H 1 ,1,h 2 1) is a diagonal matrix, h 1 And h 2 Are all diagonal elements; vector X o And X e Respectively defined as X o =[x 1 ,0,x 3 ,0] T And X e =[0,x 2 ,0,x 4 ] T The method comprises the steps of carrying out a first treatment on the surface of the The matrix M is:
s22, based on the formula (2), constructing a layered slide film surface vector S as follows:
S=[S 1 ,0,S 2 ,0] T =s+ZS (3)
wherein S is i =s i +z i-1 S i-1 I=1, 2, indicating the i-th layered slide film surface; z=diag (Z) 0 ,1,z 1 1) is a diagonal matrix, z 0 And z 1 Are all diagonal elements; vector quantitySIs defined asS=[S 0 ,0,S 1 ,0] T ,S 0 And S is 1 Are all vectorsSIs an element of (2);
s23, obtaining an ith layered slide film surface S through iterative calculation based on formulas (2) and (3) i The following is shown:
wherein,if->Parameters->Otherwise, go (L)>
S24, based on equation (2), equation (3) and equation (4), the layered synovial surface vector S is re-expressed as follows:
wherein the matrixThe method comprises the following steps:
s25, deriving the following result from the formula (5):
wherein the vector isP (X) and U (t) are each +.>P(X)=[0,p 1 (X),0,p 2 (X)] T And u= [0, U ]] T The method comprises the steps of carrying out a first treatment on the surface of the Vector Q (X) is:
s26, based on a hierarchical structure, the control input u is designed as follows:
wherein i is less than or equal to i; u (u) eq,i And u sw,i Respectively representing an equivalent control law and a switching control law of the ith layered slide film surface;
s27, based on the formula (7), the vector U is re-formulated as:
U=Ξ(u eq +u sw ) (8)
wherein the vector u eq And u sw Respectively isAnd->The matrix xi is:
s3, constructing a cost function and an HJB equation based on the layered sliding film surface based on the step S2, and solving an optimal control strategy through a gradient descent method;
s31, constructing a cost function based on the layered sliding film surface by utilizing the principle of the optimality of BellmanThe following are provided:
wherein A and B are positive arrays; alpha is a normal constant; the |x|| represents the norm of the augmented state variable X; f (u) sw ) Is a positive definite function and is defined as:
wherein β is a normal constant; omega i Represents an integral variable;is a monotonic odd function;
s32, constructing a Hamiltonian based on a layered sliding film surface based on the formula (9), wherein the Hamiltonian is as follows:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2);
s33, an HJB equation of the structural formula (11) is as follows:
wherein,a cost function for optimization; />Is->Is a gradient of (2); />Represents->Is a transpose of (2); />And->Respectively an optimized equivalent control law vector and a switching control law vector;
s34, based on the formulas (11) and (12), and by solving partial differential equationAnd->Obtaining:
s4, constructing an evaluation network and an execution network, and evaluating an optimization control strategy executed each time to realize an optimization control target of the underactuated system;
s41, optimizing a cost function by using general approximation performance of a radial basis function neural networkIs approximated as follows:
wherein W is * Is a bounded weight, (W) * ) T Is W * Is a transpose of (2);representing an activation function; />Is an approximation error;
s42, based on the formula (15), an optimized cost functionGradient of->Is formulated as:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2); />Is->Is a gradient of (2);
s43, based on the formula (16),and->Is re-expressed as follows:
s44 due to bounded weight W * Is unknown, and therefore,is estimated as follows:
wherein,is->Is determined by the estimation of (a); />Is W * Is determined by the estimation of (a); />Is->Is a transpose of (2);
s45, based on the formula (19),is formulated as:
wherein,is->Is a gradient of (2);
s46, based on the formula (20),and->The estimated form of (a) is:
wherein,and->Respectively->And->Is determined by the estimation of (a); />To perform the weights of the network;
s47, based on the formula (21) and the formula (22), the estimation form of the HJB equation is as follows:
wherein, to evaluate the weight of the network;
s48, based on the formula (12) and the formula (23), the bellman residual is defined as follows:
s49, in order to ensure minimum Belman residual error, an objective function e is defined as follows:
s50, designing and evaluating a network update law based on the formula (25) by using a standardization principle and a gradient descent method, wherein the network update law is as follows:
wherein, xi c To evaluate the network update rate;represents->Is a transpose of (2);
s51, to ensure system stability, the execution network update law is designed to:
wherein epsilon and delta are both normal constants;J T represents the transpose of J; gamma ray a To perform network update rates.
In order to verify the effectiveness of the control method, the under-actuated system is subjected to numerical simulation verification. Wherein the state variable initial value is given as X (0) = [ X ] 1 (0),x 2 (0),x 3 (0),x 4 (0)] T =[0.03,1,0.002,0.24] T . Parameter mu 1 Sum mu 2 The value of (1) is set to mu 1 =2.1 and μ 2 =4.9. Cost functionThe medium parameters are set to α=0.3 and β=2. The correlation matrix is given as:
the executive network and evaluation network weights are defined asAndits initial value is +.>Andactivation function->Is selected as a Gaussian type function and +.>Its center and widthThe degrees are c respectively NN,l = { -2, -1,0,1,2} and w NN,l = {1,2,1,2,1}. The executive network and evaluation network weight update law parameters are given as ζ c =0.95,γ a =1,ε=0.6,δ=1.5。
The simulation results are shown in fig. 2-5. FIG. 2 is a system state x 1 ,x 2 ,x 3 And x 4 It can be seen that the underactuated system reaches a steady state as the system state gradually converges and reaches the equilibrium point. FIG. 3 shows a layered slip film variable S 1 And S is 2 Can be seen as S 1 And S is 2 The trajectory of (2) is initially oscillating more and then begins to stabilize and gradually converge to the equilibrium point. Fig. 4 and 5 are graphs of the update of the execution network and evaluation network weights, respectively, in the embodiment, it can be seen that the execution network and evaluation network weights converge and reach a steady state after training in a short time.

Claims (7)

1. The underactuated system layered sliding film optimal control method based on the self-adaptive dynamic programming is characterized by comprising the following steps of:
s1, establishing an under-actuated system dynamics model under a nominal form;
s2, constructing a layered sliding film surface containing state information based on an underactuated system dynamics model;
s3, constructing a cost function and an HJB equation based on the layered sliding film surface based on the step S2, and solving an optimal control strategy through a gradient descent method;
and S4, constructing an evaluation network and an execution network, and evaluating the optimization control strategy executed each time to realize the optimization control target of the underactuated system.
2. The under-actuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming as claimed in claim 1, wherein the step S1 is specifically:
an augmented state variable x= [ X ] defining an underactuated system 1 ,x 2 ,x 3 ,x 4 ] T And the order state components are x respectively 1 ,x 2 ,x 3 And x 4 The under-actuated system dynamics model is formulated as:
wherein p is 1 (X) and p 2 (X) is a system dynamic function; q 1 (X) and q 2 (X) is a control gain function; u is the control input.
3. The under-actuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming as claimed in claim 1, wherein the step S2 is specifically:
s21, constructing a sub-synovial surface vector S as follows:
s=[s 1 ,0,s 2 ,0] T =HX o +MX e (2)
wherein s is i =h i x 2i-1 +x 2i I=1, 2, the i-th layer sub-slip film surface; h=diag (H 1 ,1,h 2 1) is a diagonal matrix, h 1 And h 2 Are all diagonal elements; vector X o And X e Respectively defined as X o =[x 1 ,0,x 3 ,0] T And X e =[0,x 2 ,0,x 4 ] T The method comprises the steps of carrying out a first treatment on the surface of the The matrix M is:
s22, based on the formula (2), constructing a layered slide film surface vector S as follows:
S=[S 1 ,0,S 2 ,0] T =s+ZS (3)
wherein S is i =s i +z i-1 S i-1 I=1, 2, indicating the i-th layered slide film surface; z=diag (Z) 0 ,1,z 1 1) is a diagonal matrix, z 0 And z 1 Are all diagonal elements; vector quantitySIs defined asS=[S 0 ,0,S 1 ,0] T ,S 0 And S is 1 Are elements of the vector S;
s23, obtaining an ith layered slide film surface S through iterative calculation based on formulas (2) and (3) i The following is shown:
wherein,if->Parameters->Otherwise, go (L)>
S24, based on equation (2), equation (3) and equation (4), the layered synovial surface vector S is re-expressed as follows:
wherein the matrixThe method comprises the following steps:
s25, deriving the following result from the formula (5):
wherein the vector isP (X) and U (t) are each +.>P(X)=[0,p 1 (X),0,p 2 (X)] T And u= [0, U ]] T The method comprises the steps of carrying out a first treatment on the surface of the Vector Q (X) is:
s26, based on a hierarchical structure, the control input u is designed as follows:
wherein iota is less than or equal to i; u (u) eq,ι And u sw,ι Respectively representing an equivalent control law and a switching control law of the laminated sliding film surface of the first layer;
s27, based on the formula (7), the vector U is re-formulated as:
U=Ξ(u eq +u sw ) (8)
wherein the vector u eq And u sw Respectively isAnd->The matrix xi is:
4. the under-actuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming as claimed in claim 1, wherein the step S3 is specifically:
s31, constructing a cost function based on the layered sliding film surface by utilizing the principle of the optimality of Bellman
S32, constructing a Hamiltonian based on a layered sliding film surface based on the formula (9), wherein the Hamiltonian is as follows:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2);
s33, constructing an HJB equation of the formula (11);
s34, based on the formulas (11) and (12), and by solving partial differential equationAndobtaining:
5. the method for optimizing control of an underactuated system layered slide film based on adaptive dynamic programming as claimed in claim 4, wherein the cost function based on layered slide film surfaceThe method comprises the following steps:
wherein A and B are positive arrays; alpha is a normal constant; the |x|| represents the norm of the augmented state variable X; f (u) sw ) Is a positive definite function and is defined as:
wherein β is a normal constant; omega i Represents an integral variable;is a monotonic odd function.
6. The under-actuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming as claimed in claim 4, wherein the HJB equation is specifically:
wherein,a cost function for optimization; />Is->Is a gradient of (2); />Represents->Is a transpose of (2); />And->The optimized equivalent control law vector and the switching control law vector are respectively.
7. The under-actuated system layered synovial membrane optimization control method based on self-adaptive dynamic programming as claimed in claim 1, wherein the step S4 is specifically:
s4-1, optimizing cost function by using general approximation performance of radial basis function neural networkIs approximated as follows:
wherein W is * Is a bounded weight, (W) * ) T Is W * Is a transpose of (2);representing an activation function; />Is an approximation error;
s4-2, based on the formula (15), an optimized cost functionGradient of->Is formulated as:
wherein,is->Is a gradient of (2); />Represents->Is a transpose of (2); />Is->Is a gradient of (2);
s4-3, based on formula (16),and->Is re-expressed as follows:
s4-4 due to bounded weight W * Is unknown, and therefore,is estimated as follows:
wherein,is->Is determined by the estimation of (a); />Is W * Is determined by the estimation of (a); />Is->Is a transpose of (2);
s4-5, based on formula (19),is formulated as:
wherein,is->Is a gradient of (2);
s4-6, based on formula (20),and->The estimated form of (a) is:
wherein,and->Respectively->And->Is determined by the estimation of (a); />To perform the weights of the network;
s4-7, based on the formula (21) and the formula (22), the estimation form of the HJB equation is as follows:
wherein, to evaluate the weight of the network;
s4-8, based on formula (12) and formula (23), defining a Belman residual as follows:
s4-9, in order to ensure that the Belman residual error is minimum, an objective function e is defined as follows:
s5-10, designing and evaluating a network update law based on the formula (25) and by using a standardization principle and a gradient descent method as follows:
wherein, xi c To evaluate the network update rate;represents->Is a transpose of (2);
s5-11, in order to ensure the stability of the system, the execution network update law is designed as follows:
wherein epsilon and delta are both normal constants;J T represents the transpose of J; gamma ray a To perform network update rates.
CN202410022633.6A 2024-01-08 2024-01-08 Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming Pending CN117872753A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410022633.6A CN117872753A (en) 2024-01-08 2024-01-08 Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410022633.6A CN117872753A (en) 2024-01-08 2024-01-08 Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming

Publications (1)

Publication Number Publication Date
CN117872753A true CN117872753A (en) 2024-04-12

Family

ID=90578706

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410022633.6A Pending CN117872753A (en) 2024-01-08 2024-01-08 Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming

Country Status (1)

Country Link
CN (1) CN117872753A (en)

Similar Documents

Publication Publication Date Title
Cho et al. Composite model reference adaptive control with parameter convergence under finite excitation
Wang et al. Dynamic learning from neural control for strict-feedback systems with guaranteed predefined performance
CN111580392B (en) Finite frequency range robust iterative learning control method of series inverted pendulum
CN111459051B (en) Discrete terminal sliding mode model-free control method with disturbance observer
Shao et al. Event-based adaptive fuzzy fixed-time control for nonlinear interconnected systems with non-affine nonlinear faults
Cheng et al. Block backstepping controllers design for a class of perturbed non-linear systems with m blocks
Zhang et al. Decentralized adaptive output feedback dynamic surface control for stochastic nonstrict‐feedback interconnected nonlinear systems with actuator failures and input quantization via command filter
Zerkaoui et al. Stable adaptive control with recurrent neural networks for square MIMO non-linear systems
Ghavidel et al. Synchronization adaptive fuzzy gain scheduling PID controller for a class of mimo nonlinear systems
CN116880191A (en) Intelligent control method of process industrial production system based on time sequence prediction
CN117872753A (en) Underactuated system layered sliding film optimal control method based on self-adaptive dynamic programming
Kusumoputro et al. Performance characteristics of an improved single neuron PID controller using additional error of an inversed control signal
Chavoshi et al. Experimental comparison of STR and PI controllers on a nonlinear liquid-level networked control system
CN112346342B (en) Single-network self-adaptive evaluation design method of non-affine dynamic system
Ghezelayagh et al. Intelligent predictive control of a power plant with evolutionary programming optimizer and neuro-fuzzy identifier
CN113947202A (en) Design method of industrial process multi-state perception prediction controller
Jeyachandran et al. Comparitive performance analysis of various training algorithms for control of CSTR process using narma-L2 control
Kohler et al. PID tuning using cross-entropy deep learning: A Lyapunov stability analysis
CN116954086B (en) Intelligent prediction control method and device for adjusting system of pumped storage unit
M'SAHLI et al. Application of adaptive controllers for the temperature control of a semi-batch reactor
Eaglin et al. Leveraging conventional control to improve performance of systems using reinforcement learning
Pazera et al. A fault-tolerant iterative learning control for Takagi-Sugeno fuzzy systems
JPH0736505A (en) Identifying/controlling method for controlled system
Hush et al. Model following using multilayer perceptrons
Mastorocostas et al. On stable learning of block-diagonal recurrent neural networks, Part 1: the RENNCOM algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination