CN112905953A

CN112905953A - Unscented Kalman filtering control method for inverted pendulum

Info

Publication number: CN112905953A
Application number: CN202110303714.XA
Authority: CN
Inventors: 王庆; 张欢; 刘鹏飞
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2021-03-22
Filing date: 2021-03-22
Publication date: 2021-06-04

Abstract

The invention discloses an unscented Kalman filtering control method for an inverted pendulum, which comprises the following steps: firstly, initializing the mean value and the variance of the original state distribution of the inverted pendulum, then determining a sampling rule, then determining some Sigma points in the original state distribution according to the selected rule, enabling the determined Sigma points to be equal to the mean value and the covariance of the original state distribution, then carrying out nonlinear transformation on the Sigma points through a nonlinear system function to obtain a series of corresponding point sets, and finally determining the state mean value and the covariance after the nonlinear transformation by using the point sets obtained through the transformation. The invention solves the problems of slightly poor response time, control precision and robustness and limited active disturbance rejection capability of the existing inverted pendulum control algorithm.

Description

Unscented Kalman filtering control method for inverted pendulum

Technical Field

The invention belongs to the technical field of inverted pendulum control, and particularly relates to an unscented Kalman filtering control method for an inverted pendulum.

Background

Initial analytical studies of inverted pendulum systems began in the fifties of the twentieth century and were relatively complex, unstable, multivariable, high-order mechanical systems with nonlinear and strongly coupled characteristics. The inverted pendulum system has serious uncertainty, namely, the uncertainty of parameters of the system, and the interference of uncertain factors to the system. In recent years, many experts and scholars at home and abroad always regard the inverted pendulum as a typical research object, a plurality of control schemes are provided, a great deal of research is carried out on the stability and the stability of the inverted pendulum system, different control methods are sought to realize the control of the inverted pendulum so as to check or explain the control capability of the severely nonlinear and absolutely unstable system of the method, and the control method has wide application in the fields of military industry, aerospace, robots and general industrial processes, such as the processing of precision instruments, the balance control of the walking process of the robots, the verticality control in rocket launching, the missile interception control, the aviation docking control, the attitude control in satellite flight and the like. Therefore, from the viewpoint of control, the research on the inverted pendulum has far-reaching significance in theory and methodology. An inverted pendulum system is a typical self-unstable system in which the pendulum, as a typical vibration and motion problem, can be abstracted into many problems to study. With the development of nonlinear science, the nonlinear method is used for describing the nonlinear property, which is naturally unquestionable, but the method has a limitation that some essential characteristics of the nonlinearity are not always embodied by the linear method. Nonlinearity is a core factor causing chaos, disorder or chaos, which does not mean a complex reason, and simple nonlinearity can generate very chaos, disorder or chaos. The inverted pendulum system contains extremely rich and complex dynamics behaviors, such as bifurcation, fractal and chaotic dynamics.

In the prior art, the control of the inverted pendulum has classical, modern and various intelligent control theories, such as LQR control, fuzzy control, genetic algorithm control, sliding mode variable structure control and the like, but the influence of system noise, measurement noise, external interference and modeling error is difficult to avoid in the implementation process of various controls, so that the designed controller has limited anti-interference capability. In recent years, there are control laws designed by using KF, EKF, and the like, and although the interference rejection capability is enhanced, KF needs to calculate a state transition matrix in a linearized manner, which causes model errors. The EKF performs first-order linear approximation on the nonlinear system, and then performs Kalman filtering processing on the system, thereby achieving the purpose of expanding the Kalman filtering theory to the field of the nonlinear system. However, the EKF method only takes the first-order part of the Taylor-series expansion of the nonlinear system to perform linear approximation, so that approximation errors are inevitably generated, which is not suitable in an environment with higher precision requirement.

Disclosure of Invention

In order to solve the problems, the invention discloses an unscented Kalman filtering control method for an inverted pendulum, which solves the problems of slightly poor response time, control precision and robustness and limited active disturbance rejection capability of the existing inverted pendulum control algorithm.

The specific scheme is as follows:

an unscented Kalman filtering control method for an inverted pendulum is characterized by comprising the following steps: firstly, initializing the mean value and the variance of the original state distribution of the inverted pendulum, then determining a sampling rule, then determining some Sigma points in the original state distribution according to the selected rule, enabling the determined Sigma points to be equal to the mean value and the covariance of the original state distribution, carrying out nonlinear transformation on the Sigma points through a nonlinear system function to obtain a series of corresponding point sets, and finally determining the state mean value and the covariance after the nonlinear transformation by using the point sets obtained through the transformation.

The method comprises the following basic steps of unscented transformation based on a state variance matrix diagonal similarity decomposition sampling strategy: given the nonlinear state transformation y ═ f (x), where the state vector x is an n-dimensional random vector, assuming the system state mean is

If the variance is P, the steps of obtaining the Sigma point and the corresponding weight by UT conversion are as follows:

(1) 2n +1 sampling points are obtained through the following formula, wherein n is the dimension of the system model state;

wherein D ═ diag (ζ)₁,ζ₂,ζ₃,...,ζ_n)，ζ_i(i ═ 1, 2.. and n) is a diagonal matrix composed of characteristic values of P, (P ═ P_m)_iDenotes the ith column, f of the matrix_eigThe function is to solve all characteristic values of the matrix P to form a diagonal matrix D, and solve the characteristic vector of P to form a column vector of H;

(2) solving a corresponding weighting factor of a Sigma point of the model state;

in the formula (I), the compound is shown in the specification,

is the state mean weighting factor for the initial sample point,

is the covariance weighting factor of the initial sample point,

and

state mean and covariance weighting factors corresponding to the ith sampling point; beta ≧ 0 is a nonnegative weight coefficient used to incorporate higher-order term moments in the system equation, and the presence of beta can include the effects of the higher-order terms in the system [26 ]](optimal for the case of gaussian distribution β -2); alpha is a main scale parameter for determining the distribution breadth of Sigma points near the prior mean value, and the value range of the alpha is generally 10^-4Alpha is not less than 1, and the distribution state of sampling points can be controlled by taking different alpha values; k is a candidate parameter without specific values and boundaries, but for symmetric sampling, the value of k is usually to ensure that the matrix (n + λ) P is a semi-positive definite matrix, and usually k is 0 or 3-n; λ ═ α²(n + κ) -n is a scaling parameter used to reduce the overall prediction error of the algorithm.

Discretizing the nonlinear continuous model of the inverted pendulum, and selecting zero-mean white Gaussian noiseFor the system process noise W (k) and the measurement noise V (k), the state vector and the observation vector after the known inverted pendulum is augmented are x_eAnd z, the discretized inverted pendulum nonlinear identification system model can be described by equations (3.3) and (3.4),

in the formula (f)_eRepresents the nonlinear process function h of the inverted pendulum model after the state is expanded_eAn observation function of the inverted pendulum is shown.

Assuming that process noise W (k) and measurement noise V (k) in the inverted pendulum model respectively have covariance matrices Q and R, at different time k, the model augmentation state vector x for the inverted pendulum_eThe unscented Kalman filtering algorithm is basically realized by the following steps:

(1) determining the state sampling points (namely a Sigma point set) of the inverted pendulum model by the following formula;

(2) performing one-step prediction on the acquired sampling points through a nonlinear transformation function;

(3) calculating a one-step prediction mean and covariance matrix by first calculating a weighting factor using equation (3.2)

And

then, carrying out weighted summation on the one-step predicted values of the sampling points to obtain the mean value and covariance of the one-step prediction of the inverted pendulum state;

(4) performing UT conversion on the one-step prediction average value of the inverted pendulum state obtained in the step (3) to obtain a new sampling point;

(5) updating a measured value, namely performing measurement transformation on the sampling point set obtained in the previous step to further obtain an observation predicted value of the inverted pendulum state;

(6) calculating the mean value and the covariance of the observation predicted values, weighting the observation predicted values of the inverted pendulum sampling points obtained in the previous step by weighting factors, and then solving the corresponding mean value and covariance;

in the formula (I), the compound is shown in the specification,

is the observation variance of the inverted pendulum system model;

is the state observation covariance of the inverted pendulum system model;

(7) calculating a corresponding Kalman gain matrix by using the covariance matrix obtained in the previous step;

(8) updating the system state quantity and covariance;

and at this time, one-time updating of the state mean value and covariance of the inverted pendulum model is completed.

The invention has the beneficial effects that: according to the method, the device and the system for controlling the inverted pendulum based on unscented Kalman filtering, the unscented Kalman filtering algorithm is adopted, compared with the traditional inverted pendulum control algorithm, unscented Kalman filtering is adopted in the controller, the UKF is used for directly estimating the state of the nonlinear system, the system does not need to be subjected to linearization processing, and the method, the device and the system have more ideal control effect and anti-interference performance. And a state variance matrix diagonal similarity decomposition strategy is adopted, so that the problems that the UKF algorithm cannot be converged, the calculation load is large, the state variance matrix cannot keep positive definite and the like are solved. The motion attitude prediction of the swing rod is more accurate, the inertia and interference influence of the inverted pendulum is reduced, and the stability of the swing rod is kept. Effectively reduce inertia and interference influence, whole device has higher reliability and robustness, simple structure, low cost.

Drawings

FIG. 1 is a diagram of a classical control principle of an inverted pendulum.

FIG. 2 is a simplified second-order system block diagram of the inverted pendulum classical control.

Fig. 3 shows the modern control principle of the inverted pendulum.

FIG. 4 is a flow chart of predicting the inverted pendulum state based on unscented Kalman filtering according to the present invention.

Fig. 5 is a general configuration diagram of an XZ-iia type rotary inverted pendulum system.

Fig. 6 is a block diagram of the XZ-iia type rotary inverted pendulum system.

FIG. 7 is a kinetic model diagram.

Detailed Description

The present invention will be further illustrated with reference to the accompanying drawings and specific embodiments, which are to be understood as merely illustrative of the invention and not as limiting the scope of the invention.

As shown in fig. 1-3, the prior art is directed to classical control of an inverted pendulum: the rotary arm is driven by a direct current torque motor at the rotating shaft and can rotate around the rotating shaft in a vertical plane vertical to the rotating shaft of the motor; the rotary arm and the swing rod are connected by a movable rotating shaft of the potentiometer, and the swing rod can rotate around the rotating shaft in a vertical plane vertical to the rotating shaft; 2 angular displacement signals (the included angle between the swing arm and a lead straight line and the relative angle between the swing rod and the swing arm) obtained by measurement of the potentiometer are taken as 2 output quantities of the system and are sent to the DSP controller; an angular velocity signal can be obtained through the difference of angular displacement, then a PWM control signal is regulated according to a PID regulation algorithm and is converted into a voltage signal to be supplied to a driving circuit so as to drive the movement of a direct-current torque motor, and the movement of the swing rod is controlled by the rotation of the swing arm driven by the motor. Modern control (control based on state feedback): firstly, the system is judged to be controllable and can be observed, then a feedback control law is determined according to state feedback, and a feedback matrix K is solved by using a pole allocation method, so that the closed loop of the system is stable. Intelligent control: a control law is designed based on intelligent optimization algorithms such as an ant colony algorithm, a genetic algorithm, a neural network and the like, so that the inverted pendulum system is stable in closed loop.

As shown in fig. 4, the invention provides an unscented kalman filter control method for an inverted pendulum, comprising the following steps: firstly, initializing the mean value and the variance of the original state distribution of the inverted pendulum, then determining a sampling rule, then determining some Sigma points in the original state distribution according to the selected rule, enabling the determined Sigma points to be equal to the mean value and the covariance of the original state distribution, carrying out nonlinear transformation on the Sigma points through a nonlinear system function to obtain a series of corresponding point sets, and finally determining the state mean value and the covariance after the nonlinear transformation by using the point sets obtained through the transformation.

in the formula (I), the compound is shown in the specification,

is the state mean weighting factor for the initial sample point,

is the covariance weighting factor of the initial sample point,

and

Discretizing a nonlinear continuous model of the inverted pendulum, selecting zero-mean white Gaussian noise as system process noise W (k) and measurement noise V (k), wherein the state vector and the observation vector of the inverted pendulum after being augmented are respectively known as x_eAnd z, the discretized inverted pendulum nonlinear identification system model can be described by equations (3.3) and (3.4),

And

in the formula (I), the compound is shown in the specification,

is the observation variance of the inverted pendulum system model;

is the state observation covariance of the inverted pendulum system model;

(8) updating the system state quantity and covariance;

Examples

An XZ-IIA type rotary inverted pendulum system is a typical electromechanical integrated system, and a motion controller and a servo motor are adopted to carry out real-time motion control. The XZ-IIA type rotary inverted pendulum system is directly driven by a direct current torque motor and controlled by a built-in DSP chip. The device can be directly operated without a computer, and can also be controlled by the computer through serial port communication. Fig. 5 is a general configuration diagram of an XZ-iia type rotary inverted pendulum system. Fig. 6 is a block diagram of the XZ-iia type rotary inverted pendulum system. The system comprises a computer, a DSP controller, a driving mechanism, an inverted pendulum body and a position detection element, and forms a closed loop system.

The TMS320F240 DSP controller is adopted as a core device in the system, a real-time control algorithm can be independently executed, and the system can also be communicated with a computer through an RS-232C serial bus to debug the on-line control algorithm. Its working principle is that 2 angular displacement signals (included angle between rotary arm and lead straight line and relative angle between swing rod and rotary arm) obtained by potentiometer measurement are used as 2 output quantities of system and fed into computer. The computer calculates the control quantity according to a certain control algorithm, converts the control quantity into a corresponding voltage signal and provides the voltage signal for the driving circuit so as to drive the direct current torque motor to move, and controls the inversion of the swing rod and keeps balance by driving the rotation of the swing arm through the motor.

After the theoretical analysis of the inverted pendulum system ignores various influences such as friction and the like, a dynamic model shown in fig. 7 can be abstracted. The main mechanical parameters and variables of the system are shown in the following table.

The swing arm rotates around the shaft, the moment of inertia J1 and the corresponding friction moment coefficient f1 of the swing arm rotate around the shaft, and the moment of inertia J2 and the corresponding friction moment coefficient f2 of the swing rod rotate around the shaft. By measuring and calculating J1 ═ 0.004Kg · m²,J2＝0.001Kg·m²F1 is 0.01N · m · s, and f2 is 0.001N · m · s. Theta 1 and theta 2 are respectively included angles between the spiral arm and the swing rod and a vertical line, and the clockwise direction is positive; u is the control voltage applied to the motorThe clockwise direction is positive and the counterclockwise direction is negative. The system nonlinear model is as follows:

let θ 1 → 0, θ 2 → 0, then the linearized model is:

order to

Then there is

The state space equation for the system is as follows:

wherein

C＝[I_2×2 0_2×2]

Since rank [ B, AB, A2B, A3B ] ═ 4, rank [ C; CA; CA 2; CA3 ═ 4, the system is fully controllable and fully observable.

The corresponding inverted pendulum UKF state prediction algorithm is designed by the following main steps:

(1) UKF parameter and state quantity initialization

Firstly, the corresponding state variables are initialized by the formula (3.2)And observed quantity weighting factor

And variance weighting factor

And an associated scale factor lambda. Assuming that the process noise and the measurement noise are white gaussian noise with zero mean, wherein the process noise variance matrix is Q and the measurement noise variance matrix is R, the initialization values of the inverted pendulum augmented state vector and the variance thereof are shown in formulas (3.17) and (3.18).

In the formula (I), the compound is shown in the specification,

initializing a mean, P, for an augmented state vector⁰Initializing variance for the corresponding; x is the number of₀And η₀Respectively are the initialization values of the basic state quantity and the augmentation state quantity of the inverted pendulum,

and

the covariance is initialized accordingly.

(2) Calculating Sigma Point and time updates

The UKF algorithm accomplishes the nonlinear transfer of random state variables through a certain set of samples. Let the external input signal of the nonlinear system of the inverted pendulum at the known k moment be delta_kInverted pendulum model augmented state vector x_eThe corresponding estimated value and the estimated error variance matrix are respectively

And

then the Sigma points of 2n +1 samples at time k and their predicted values, predicted mean and state covariance predicted values can be obtained by equations (3.5) - (3.8).

(3) Observed value update

After the calculation of the Sigma point and the time update of the state quantity are completed, the observed quantity of the inverted pendulum needs to be updated. The system observation predicted value of the Sigma point is z at the moment k_k|kThe observed predicted mean value is

The corresponding observed prediction variance matrix is

The covariance matrix of the state quantity and observation quantity predictions is

Kalman filter gain of K_k+1The state variance matrix at time k +1 is P_k+1The state estimation value is

The state estimate and covariance matrix of the system can be iteratively updated by equations (3.10) - (3.16).

Selecting state quantity of system

As observed quantities, an observation equation of the object to be studied is established as:

the technical means disclosed in the invention scheme are not limited to the technical means disclosed in the above embodiments, but also include the technical scheme formed by any combination of the above technical features. It should be noted that those skilled in the art can make various improvements and modifications without departing from the principle of the present invention, and such improvements and modifications are also considered to be within the scope of the present invention.

Claims

1. An unscented Kalman filtering control method for an inverted pendulum is characterized by comprising the following steps:

firstly, initializing the mean value and the variance of the original state distribution of the inverted pendulum, then determining a sampling rule, then determining some Sigma points in the original state distribution according to the selected rule, enabling the determined Sigma points to be equal to the mean value and the covariance of the original state distribution, then carrying out nonlinear transformation on the Sigma points through a nonlinear system function to obtain a series of corresponding point sets, and finally determining the state mean value and the covariance after the nonlinear transformation by using the point sets obtained through the transformation.

2. The unscented kalman filter control method of the inverted pendulum according to claim 1, characterized in that the unscented transformation basic steps of the sampling strategy based on the state variance matrix diagonal similarity decomposition are as follows:

given the nonlinear state transformation y ═ f (x), where the state vector x is an n-dimensional random vector, assuming the system state mean is

wherein D ═ diag (ζ)₁,ζ₂,ζ₃,...,ζ_n)，ζ_i(i ═ 1, 2.. and n) is a diagonal matrix composed of characteristic values of P, (P ═ P_m)_iRepresents the ith column of the matrix，f_eigThe function is to solve all characteristic values of the matrix P to form a diagonal matrix D, and solve the characteristic vector of P to form a column vector of H;

in the formula (I), the compound is shown in the specification,

is the state mean weighting factor for the initial sample point,

is the covariance weighting factor of the initial sample point,

and

state mean and covariance weighting factors corresponding to the ith sampling point; beta is more than or equal to 0 and is a non-negative weight coefficient used for combining the moments of the high-order terms in the system equation, the influence of the high-order terms of the system can be included by the existence of beta, and the beta is 2 optimal for the situation of Gaussian distribution; alpha is a main scale parameter for determining the distribution breadth of Sigma points near the prior mean value, and the value range of the alpha is 10^-4Alpha is not less than 1, and the distribution state of sampling points can be controlled by taking different alpha values; kappa is a parameter to be selected without specific values and limits, but for symmetric sampling, the value of the matrix (n + lambda) P is a semi-positive definite matrix, and the value of kappa is 0 or 3-n; λ ═ α²(n + κ) -n is a scaling parameter used to reduce the overall prediction error of the algorithm.

3. The unscented kalman filter control method of an inverted pendulum according to claim 1, characterized in that the nonlinearity for the inverted pendulumDiscretizing the continuous model, selecting zero-mean white Gaussian noise as system process noise W (k) and measurement noise V (k), wherein the state vector and observation vector after the known inverted pendulum is augmented are x respectively_eAnd z, the discretized inverted pendulum nonlinear identification system model can be described by equations (3.3) and (3.4),

in the formula (f)_eRepresents the nonlinear process function h of the inverted pendulum model after the state is expanded_eAn observation function representing the inverted pendulum;

assuming that process noise W (k) and measurement noise V (k) in the inverted pendulum model have covariance matrices Q and R respectively; augmenting the state vector x for the model of the inverted pendulum at different times k_eThe unscented Kalman filtering algorithm is basically realized by the following steps:

(1) determining a state sampling point of the inverted pendulum model, namely a Sigma point set, through the following formula;

And

then, the one-step prediction value of the sampling point is weighted and summed to obtain the one-step prediction of the inverted pendulum stateMean and covariance;

in the formula (I), the compound is shown in the specification,

is the observation variance of the inverted pendulum system model;

is the state observation covariance of the inverted pendulum system model;

(8) updating the system state quantity and covariance;

and at this point, one-time updating of the state mean value and covariance of the inverted pendulum model is completed.