CN112286052A

CN112286052A - Method for solving industrial control optimal tracking control by using linear system data

Info

Publication number: CN112286052A
Application number: CN202011015464.1A
Authority: CN
Inventors: 李金娜; 张一晗; 林玉英; 王春彦; 闫立鹏
Original assignee: Liaoning Shihua University
Current assignee: Liaoning Shihua University
Priority date: 2020-09-24
Filing date: 2020-09-24
Publication date: 2021-01-29

Abstract

The invention discloses a method for solving the optimal tracking control of industrial control by utilizing linear system data, which relates to an optimal tracking control method of industrial control. The invention mainly aims at the problem that the optimal controller needs an accurate system model, and designs a method for learning the optimal tracking controller gain without system dynamics knowledge. The invention has the main advantages that the error is tracked by using input, output and measured augmentation variables such as past values of the input and the output, and a state observer does not need to be designed while an approximate optimal tracking controller is obtained. And the system under consideration can track the reference signal by an optimal method when the controller is used under the controller, and the technology can design the controller under the conditions that the controlled system of the industrial process is difficult to accurately model and the system state measurement cost is high or is not measurable.

Description

Method for solving industrial control optimal tracking control by using linear system data

Technical Field

The invention relates to an optimal tracking control method for industrial control, in particular to a method for solving the optimal tracking control method for industrial control by utilizing linear system data.

Background

An optimal control law is found to ensure tracking of the reference signal by minimizing certain performance indicators that contain tracking errors. The method is a basic problem of a control theory with wide application in industry, medical treatment, aviation and the like, and the rich achievement of optimal tracking control really lays a foundation for a series of future researches. As known in the open literature, the problem of modeless optimal tracking control where system conditions are not available has not been fully studied. Note, however, that when designing an optimal controller, pioneering work requires an accurate system model, and this requirement is rarely available in practice. Data-driven Reinforcement Learning (RL) has attracted increasing attention over the past decades because it is able to find an optimal solution through interaction with an unknown environment. In practice, it is difficult to obtain system model parameters or system states, it is difficult to accurately model the industrial process controlled systems, and the system state measurements are costly or non-measurable, so tracking using measured augmented variables such as inputs, outputs, and their past values is essential.

Disclosure of Invention

The invention aims to provide a method for solving the optimal tracking control of industrial control by using linear system data, and provides an intelligent design scheme of a discrete linear system optimal tracking controller driven by complete data, which utilizes input, output, past values of the input and output and other measurement augmentation variables to track, thereby solving the problems existing in the design of the industrial system controller.

The purpose of the invention is realized by the following technical scheme:

a method for solving industrial control optimal tracking control by using linear system data is characterized by comprising the following processes:

aiming at a linear discrete system, Q learning is applied to an extended form of a system state space, an improved discrete time linear system data driving optimal tracking control algorithm is provided, the algorithm is used for solving the problem of linear quadratic regulation of an extended state space model only by using input, output and measured values, and a data driving method is adopted;

firstly, proposing an optimal tracking control problem, then rewriting a system model into an extended state space model, searching an optimal tracking control strategy by designing differential input, and enabling the output of the system to follow a target reference signal under the condition of not considering system dynamics;

based on an optimal tracking control strategy, an optimal value function based on Q learning is designed;

the algorithm utilizes the measured data to obtain an approximate optimal tracking controller, under the controller, a considered system tracks a reference signal through an optimal method, an output adjusting method is adopted, and an optimal tracking control strategy is learned by directly utilizing measured differential output, differential input and tracking errors; firstly, a relation matrix is constructed, then an optimal value function and an optimal Q function are respectively defined, and a Bellman equation of the Q function is obtained according to a stability control strategy.

The method for solving the optimal tracking control of the industrial control by utilizing the linear system data comprises the following steps of:

(1) establishing a proper relation matrix;

(2) further optimizing the optimal value function according to a stability control strategy, thereby obtaining a Bellman equation of the Q function;

and designing an iterative algorithm on the basis.

The method for solving the optimal tracking control of the industrial control by utilizing the linear system data comprises the following four steps of:

(1) initializing to give a stable controller gain

；

(2) Then by solving the Q function matrix

Performing performance evaluation;

(3) and finally, strategy updating is carried out:

；

(1) (4) when

Is a very small value, the iteration is stopped.

The invention has the advantages and effects that:

because the optimal tracking controller is difficult to obtain in practical application, the method does not need to utilize system model parameters and system states, a new design method is provided, and a Q learning algorithm and an extended state space model are combined, so that for an unknown linear system, the optimal tracking control strategy can be found only by measuring input and output of states and past values of the states and adding tracking errors, and the method can be applied to an industrial system to solve the problem of controller design.

Drawings

FIG. 1 is a general scheme of a linear system data-driven tracking control method based on Q learning;

FIG. 2

The convergence result of (2);

FIG. 3Q traces of the learning algorithm;

the figure 4Q learning algorithm controls the input trajectory.

Detailed Description

The present invention will be described in detail with reference to the embodiments shown in the drawings.

The present invention employs Q-learning to solve the optimal tracking problem, which is first described as minimizing a specific cost function containing the tracking reference signal target with extended state space implementation. Then, a Q-learning algorithm is proposed, which only needs to utilize the measurement data, without the need to design a state observer, so as to obtain an approximately optimal tracking controller under which the considered system can track the reference signal in an optimal way.

The invention converts the optimal tracking control problem of a linear system into the linear quadratic regulation problem of a system expansion state space expression and provides a novel strategy Q learning algorithm based on data driving. A simple extended state-space equation is used to simplify the solution process of the proposed strategy-based Q learning method.

The invention comprises four steps:

the method comprises the following steps: optimal tracking problem for discrete time linear systems

In this section, the optimal tracking problem for DT (discrete time) linear systems is described by an extended state space expression, making the differential inputs the decision variables that drive the system to track the target reference signal. Consider a linear system of DTs described by an input-output state space model.

And establishing a DT linear system described by the input and output state space model.

(1)

Wherein

And

is the output and input of the system at the sampling time instant.

And

is a matrix of appropriate dimensions.

Now the two back shift operators

Is defined as (1), i.e.

And

. Then (1) can be rewritten as:

(2)

step two: extended state space model transformation

Defining an augmentation vector:

(3)

assuming that the target output is a constant reference signal, the output tracking error can be defined such that

And obtaining the desired extended state space model.

(4)

The difference variables are compressed by using the tracking error, and the established extended state space model adopts a method of combining dynamic programming and an RL framework to process the optimal tracking control problem instead of using the general state and output equation of the system. The goal is to find the optimal tracking control strategy by designing the differential inputs, so that the output of the system (1)

A target reference signal is tracked.

Wherein

And

is a symmetric matrix, therefore, the optimal tracking control problem can be expressed as:

(5)

this sequence finds the optimal tracking control strategy by solving the optimization problem shown in (3) without knowledge of the system.

Step three: strategy Q learning algorithm

First, initialization is performed to give a stable controller gain

Let a

Wherein

Representing an iteration index; then by solving the Q function matrix

And (4) performance evaluation is carried out:

(ii) a And finally, updating the strategy as follows:

when is coming into contact with

And is

And then stop.

Step four: example verification algorithm

In this section, the effectiveness of the proposed strategy Q-learning algorithm is illustrated by an example. For the system (1), selecting

，

，

，

，

And the target signal of reference is

Wherein

，

By adopting the method, the sum is selected, firstly, the parameter of the system is assumed to be known, and then the optimal Q function matrix and the optimal tracking controller gain are respectively obtained by using a dare command in Matlab:

the algorithm obtains the optimal Q function matrix and the optimal tracking controller gain after 10 iterations. The simulation verifies the effectiveness of the algorithm.

Case simulation:

if the system model is unknown, the generalized process model of the SC-1 ethylene cracking furnace system is as follows:

when in use

When the system model is unknown, the DT linear system described by the input-output state space model can be written as:

using the method of the invention, selection

And

firstly, assuming that the parameters of the system are known, then using the dare command in Matlab to respectively obtain the optimal Q function matrix and the optimal tracking controller gain:

the effectiveness of the method is verified through a series of simulation examples.

Claims

1. A method for solving industrial control optimal tracking control by using linear system data is characterized by comprising the following processes:

2. The method for solving the optimal tracking control problem of the industrial control by using the linear system data as claimed in claim 1, wherein the optimal value function based on the Q learning comprises the following steps:

(1) establishing a proper relation matrix;

and designing an iterative algorithm on the basis.

3. The method for solving the industrial control optimal tracking control by using the linear system data as claimed in claim 2, wherein the iterative algorithm comprises the following four steps:

(1) initializing to give a stable controller gain

；

(2) Then by solving the Q function matrix

Performing performance evaluation;

(3) and finally, strategy updating is carried out:

；

(4) when in use

Is a very small value, the iteration is stopped.