CN111505936B

CN111505936B - An Automatic Safety Tuning Method Based on Gaussian Process PID Control Parameters

Info

Publication number: CN111505936B
Application number: CN202010520485.2A
Authority: CN
Inventors: 吴坚; 陈国胜; 李帅; 杜志强; 张晶华
Original assignee: Jilin University
Current assignee: Jilin University
Priority date: 2020-06-09
Filing date: 2020-06-09
Publication date: 2021-10-01
Anticipated expiration: 2040-06-09
Also published as: CN111505936A

Abstract

The invention belongs to the technical field of automobiles, and particularly relates to an automatic safety setting method based on Gaussian process PID control parameters under the condition of ensuring safety. The invention adopts the cost function to evaluate the quality degree of the control effect, and designs the safety evaluation function to evaluate how far the current state is away from the dangerous state, thereby solving the domain range of the next function evaluation. And fitting the functional relationship between the controller parameters and the cost function by using a Gaussian process regression model, and fitting the relationship between the controller parameters and the safety evaluation function. And optimizing the cost function by using Bayesian optimization theorem to obtain the controller parameters which are required to be configured for the minimum value of the cost function. In this way, optimal values for the controller parameters can be determined safely and automatically.

Description

Automatic safety setting method based on Gaussian process PID control parameter

Technical Field

The invention belongs to the technical field of automobiles, and particularly relates to an automatic safety setting method based on Gaussian process PID control parameters under the condition of ensuring safety.

Background

With the development of automation technology in various industries, more and more repetitive operations are gradually replaced by automatic machines and automatic algorithms instead of manual operations. In the control field, adjusting control parameters is a very complicated matter, and the parameters need to be adjusted continuously, the experimental result is observed, and if the experimental result cannot meet the expectation, the work needs to be repeated. The repetitive work generally needs 5 to 10 times for a skilled operator, while for some unskilled operators, the number of repeated settings is several times that of the skilled operator, and furthermore, the unskilled calibration engineer has no much experience, and may manually debug some parameters causing unexpected situations of the machine. In this case, a great safety hazard may be caused.

In the control part of intelligent driving, longitudinal vehicle control has been a popular control topic, wherein an important application direction comprises Adaptive Cruise Control (ACC) of a vehicle, and the common longitudinal control uses a PID method to control the longitudinal acceleration. The traditional approach is to manually try out the individual parameters of the PID. This operation is repetitive and complex, requires a lot of manpower, and even causes danger if the parameters are misfitted. Many times, the parameter calibration results depend on the artificial feeling, and no specific quantitative index exists. Therefore, it is necessary to develop a calibration method capable of automatically calibrating parameters, and which is safe and effective.

Disclosure of Invention

The invention provides a method capable of safely and automatically setting PID control parameters, which can manually set a target function so that the finally calibrated PID parameters can meet the initially set ideal requirements. The scheme solves the problem of complex and repeated operation in the manual parameter trial and error control process, and avoids the occurrence of unstable system condition caused by the manual parameter trial and error process. The invention can also meet other control methods, such as PD control, PI control and other feedback control methods, and solves the problems existing in the manual fitting of each parameter of the PID.

The technical scheme of the invention is described as follows by combining the attached drawings:

an automatic safety setting method based on a Gaussian process PID control parameter comprises the following steps:

firstly, initializing a controller, namely, firstly giving a very conservative parameter to the controller, wherein the parameter is set by the controller, and only the controlled system can reach an expected ideal value;

step two, calculating the control quantity to be output according to the current vehicle state and the existing parameters of the controller and the calculation rule in the controller, and outputting the control quantity to the controlled object;

and step three, applying the control quantity to the controlled object, and outputting the state quantity at the next moment after the controlled object obtains the control quantity. The control quantity and the state quantity of the next moment are obtained;

inputting the obtained control quantity and state quantity into a cost function, and indirectly representing the performance of the parameters of the controllers by the cost function according to cost values calculated by the values;

inputting the obtained control quantity and state quantity into a safety evaluation function, calculating a safety value by the safety evaluation function according to the values, and then calculating the safety range of the parameters to be evaluated in the next step;

step six, performing Gaussian process fitting on the obtained controller parameters and the corresponding cost values to obtain a fitted functional relation between the cost values and the corresponding controller parameters;

step seven, performing Gaussian process fitting on the controller parameters and the corresponding safety values to obtain a fitting functional relation between the safety values and the corresponding controller parameters;

and step eight, a safety evaluation step, namely evaluating according to the safety function obtained in the previous step, setting the safety function to meet the Lipschitz condition, and evaluating the range of the safety value of the next step by utilizing a Lipschitz constant. The definition domain of the parameters evaluated next is within the range of the safety value, so that enough safety can be ensured;

step nine, optimizing the parameters of the controller according to the safety range obtained in the last step, obtaining the parameters which enable the cost function to be minimum in the safety range when the optimized controller parameters are still in the safety range, and inputting the parameters to the controller;

and step ten, after the controller reaches the parameters, repeating the step two until the error between the cost functions obtained by twice repeated calculation is smaller than a certain critical value. The test is stopped;

and step eleven, finishing parameter setting, stopping the test, and closing all the devices.

The specific method of the second step is as follows:

the controller calculates a control output quantity, and a calculation formula of the output quantity can be represented by the following formula:

in the formula (18), x is the actual current state quantity, x_refThe current ideal state quantity; in the adaptive cruise control, the state quantity is the current acceleration, and the ideal state quantity is the ideal acceleration; of course, in a complex control logic, the state quantity may also be a vector quantity, which is not limited to a scalar quantity; the control quantity u calculates the required control output quantity through PID according to the deviation between the current state quantity and the ideal state quantity, and u is the output driving force or braking force in the self-adaptive cruise control; here, the surface K_p、K_i、K_dAll the parameters are set parameters; for simplicity, the required tuning parameters will be denoted by the symbol θ below:

θ＝[K_p,K_i,K_d]^T (2)

in equation (18), min (), max () function is to define the control amount u not to exceed the physical limit, and to make the finally solved control amount to be in a satisfied state.

The concrete method of the third step is as follows:

the controlled object executes the input quantity, and the equation of the longitudinal control of the vehicle can be expressed by Newton's law as follows:

in the formula (20), m represents the mass of the whole vehicle,

as acceleration, F_uControl amount calculated for equation (18), F_rThe sum of various resistances in the running process of the vehicle; the vehicle execution control input quantity generates a state quantity at the next moment after execution and is output to the cost function for calculation.

The concrete method of the fourth step is as follows:

the cost function calculates the deviation between the input quantity and the state quantity and the ideal value to obtain a cost function value, and one possible cost function is expressed as follows:

in the formula (21), Q is a quadratic positive definite matrix and represents a penalty coefficient of state quantity deviation; r represents the punishment degree of the control quantity; t is_ovRepresents the overshoot time of the PID control process; g represents a penalty coefficient for overshoot time; the proportion among the penalty coefficients determines which evaluation standard is more important; this parameter is adapted by the commissioning personnel.

The concrete method of the sixth step and the seventh step is as follows:

collected control parameters

Wherein, theta_iFor parameter matching selection in each calibration process, M is the data quantity stored by the calibration times, n_θIs the dimension of the control parameter; with y representing the value of the safety function, the regression can be expressed as the following equation:

y_i＝g(θ_i)+ω_i (5)

in the formula (22), i represents the ith sample point data, ω_iThe noise is independent and identically distributed noise in the measurement process, the mean value is 0, and the standard deviation is sigma_iNoise is expressed throughout the process using vectors, namely:

ω＝[ω₁,...,ω_n]^T～N(0,∑_ω) (6)

wherein:

the noise is generally caused by instrument measurement and external interference factors;

the prior of the gaussian process regression of the measured values can then be expressed as:

y～N(μ(Θ),K_ΘΘ+σ²I) (8)

in the interest of simplicity, μ (Θ) is the mean of the measured values, and is given according to the priori knowledge of a debugger at the time of starting operation, and when the subsequent part of the experimental data exists, the mean is updated according to the obtained data; k_ΘΘAnd calculating the gram matrix of the data points by adopting a Gaussian kernel function:

l in the formula (26) represents the distance length between data points, if L is larger, the curve is smoother, and conversely, the curve is steeper; the gram matrix can then be expressed as:

when the new data value needs to be predicted at the parameter θ' is:

p(g(θ')|Θ,y)～N(μ',∑') (11)

wherein:

fitting a relational expression between the control parameters and the cost function and fitting a relational expression between the control parameters and the safety function through the Gaussian process regression;

the safety evaluation function calculates the current safety value and provides the adjustment range of the control parameter theta of the next step;

one possible security function is defined as follows:

e(θ)＝(x_max-x)(x-x_min)(T_s-T) (13)

in the formula (30), x_maxRepresents the maximum value of the state quantity, x_minRepresents the minimum value of the state quantity, T_sRepresenting the maximum allowed overshoot time of the system.

The method of the step eight comprises the following steps:

according to the safety function definition, the control parameter θ can be guaranteed to be safe only in the definition domain where the safety function is greater than 0, and therefore it is assumed that the safety function satisfies the LipSchitz condition, that is:

|e(θ₁)-e(θ₂)|≤L|θ₁-θ₂| (14)

selecting a Lipschitz constant L which is constantly larger than the slope of the safety function to ensure that the above formula is satisfied, wherein the larger the constant is, the slower the convergence is;

the secure set can then be expressed as:

θ∈{θ|e(θ₀)+L(θ-θ₀)>0} (15)

in the third diagram, when theta ranges from (x)₀,8]Within the range of (a), ensuring that e (theta) is constantly larger than 0, and ensuring the system safety; the selection range of the next sample point is thus chosen to be (x)₀,8]Within.

The concrete method of the ninth step is as follows:

the optimization parameters of the controller utilize the selection range of the next parameter obtained in the safety evaluation, the cost function which is well fitted in the Gaussian process fitting is calculated, the point where the minimum value of the cost function is most probably located is found, and Bayesian optimization is adopted; the gain function uses the desired boost, EI, as follows:

in the formula (33), α (θ) is an expected value of the obtained function, and the larger the value, the smaller the value of the function at the point θ, g_min(theta) represents the minimum of the cost functions known so far, phi is the cumulative density function of the standard normal distribution,

is a probability density function of a standard normal distribution; μ (θ), σ (θ) are obtained from equation (29), and:

therefore, the obtained function value of theta at any point can be calculated, and further the next iterative calculation is carried out.

The invention has the beneficial effects that:

1) the method is automatic operation, and parameters with the lowest cost function are automatically optimized through a program, so that the data are not analyzed by spending labor.

2) The invention can enable a debugger to set the cost function according to the target required to be reached, and can meet the diversified design of the debugger

3) The safety function considered by the invention ensures that the safety performance can be ensured in the whole scene optimization process. Without the problem of failure caused by trial error in the traditional problem

4) The invention can enable the debugging to set the safety function by self, and can set different safety functions according to different scenes so as to meet the requirement of safe operation under each scene.

5) The Bayesian optimization is adopted in the optimization process, and compared with other algorithms such as traditional gradient descent and the like, the global optimal solution can be obtained more quickly without falling into the local optimal solution.

6) The Bayes optimization adopted by the invention, more specifically the expectation-Enhancement (EI) method, can find the minimum value of the cost function more quickly to reach the optimal solution.

7) The invention adopts the Gaussian process to fit the relationship between the controller parameters and the cost function and the safety function. Because the traditional method is used, the relation between the controller parameter and the cost function value is difficult to be given explicitly, and a more complex function relation can be fitted by utilizing a Gaussian process

8) The invention adopts the Gaussian process to fit the relationship between the controller parameters and the cost function and the safety function. Compared with the traditional method, the Gaussian process can be added with a priori, and the priori can be adapted by combining the experience of a debugger.

Drawings

FIG. 1 is a complete flow chart for safely adjusting vehicle control parameters in accordance with the present invention.

FIG. 2 is a diagram of a cost function and a control parameter K of the Gaussian process and Bayesian optimization thereof_pA relational image.

FIG. 3 shows a diagram of K corresponding to FIG. 2 according to the present invention_pA security assessment function image.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In order to solve the complex and complicated repeated work of the vehicle control parameter in the calibration process, for example, the Adaptive Cruise Control (ACC) needs to continuously and repeatedly debug the parameter value in the process of setting the control parameter PID. After adjusting the trend of a parameter analysis data curve, manually allocating the next value to be tested, and if the next value is not needed to be tested, allocating a set of dangerous control parameters.

The invention forms the core framework of the algorithm by the controller, the controlled object, the manually set cost function, the Gaussian process fitting, the safety evaluation function and the optimized controller parameter. In said frame. The controller receives the optimized new controller parameter configuration from the optimized controller parameter part, calculates new control quantity and inputs the new control quantity to the controlled object, and the controlled object obtains new vehicle state quantity at the next moment after executing corresponding control input. And inputting the vehicle state quantity and the control quantity into a preset cost function, and calculating a new cost value.

In order to evaluate the influence of the controller parameters on the change trend of the cost function value, a Gaussian process is adopted for fitting, the Gaussian process is a type of regression without parameters, a plurality of initially known priori knowledge can be given by using the Gaussian process, and the whole process is fitted according to the existing priori knowledge. The gaussian process can fit complex functional graphs. In the Gaussian process fitting, two types of functions are fitted together, one is a safety function, and the other is a cost function, and the independent variables of the functions are parameter values of the controller, such as K in PID_p，K_i，K_dThese set parameters.

And the Gaussian process fitting is used for transferring the fitted function to a parameter part of the safety evaluation and optimization controller. Wherein passed to the security assessment is a security assessment function. The parameter part of the optimization controller is passed with the cost function which is well fitted. The safety evaluation function evaluates which parameters can ensure safety, and ensures that dangerous conditions can not occur. The evaluation function passes to the controller parameter range over which the optimized controller parameter portion can adjust. In the part of optimizing the parameters of the controller, a Bayesian optimization method is adopted, specifically, an expected lifting method (EI) is adopted to optimize the parameters of the controller, and a value which can minimize the cost function is found out from one controller parameter.

Referring to fig. 1, a method for automatic safety tuning based on gaussian process PID control parameters includes the following steps:

step one, initializing the controller, firstly giving the controller a very conservative parameter which is set by the controller, as long as the controlled system can reach the expected ideal value.

And step two, calculating the control quantity to be output according to the current vehicle state and the existing parameters of the controller and the calculation rule in the controller, and outputting the control quantity to the controlled object.

θ＝[K_p,K_i,K_d]^T (19)

And step three, applying the control quantity to the controlled object, and outputting the state quantity at the next moment after the controlled object obtains the control quantity. The control quantity and the state quantity at the next moment are obtained.

in the formula (20), m represents the mass of the whole vehicle,

And step four, inputting the obtained control quantity and state quantity into a cost function, and indirectly representing the performance of the parameters of the controller by the cost function according to cost values calculated by the values.

in the formula (21), Q is a quadratic positive definite matrix and represents a penalty coefficient of state quantity deviation; r represents the punishment degree of the control quantity; t is_ovRepresents the overshoot time of the PID control process; g represents a penalty coefficient for overshoot time; the size proportion among the penalty coefficients determines which evaluation standard is more important; this parameter can be adapted by the commissioning personnel.

And fifthly, inputting the obtained control quantity and state quantity into a safety evaluation function, calculating a safety value by the safety evaluation function according to the values, and then calculating the safety range of the parameters required to be evaluated in the next step.

And sixthly, performing Gaussian process fitting on the obtained controller parameters and the corresponding cost values to obtain a fitted functional relation between the cost values and the corresponding controller parameters.

And seventhly, performing Gaussian process fitting on the controller parameters and the corresponding safety values to obtain a fitting functional relation between the safety values and the corresponding controller parameters.

And fitting two functions together in the Gaussian process regression 4, wherein the independent variables of the two functions are parameters of the controller. Since the road resistance is difficult to calibrate and also varies from scene to scene, the influence of the controller parameters on the cost function is difficult to solve through the equation explicitly, so that a curve is obtained by adopting Gaussian process regression fitting.

Control parameters acquired during a previous data acquisition

Wherein theta is_iFor parameter matching selection in each calibration process, M is the data quantity stored by the calibration times, n_θIs the dimension of the control parameter. With y representing the value of the safety function, the regression can be expressed as the following equation:

y_i＝g(θ_i)+ω_i (22)

ω＝[ω₁,...,ω_n]^T～N(0,∑_ω) (23)

wherein:

noise is generally caused by instrumental measurements, external interference factors.

y～N(μ(Θ),K_ΘΘ+σ²I) (25)

for simplicity, μ (Θ) is the mean of the measured values, and can be given according to the priori knowledge of the debugger at the start of the run, and when the subsequent part of the experimental data exists, the mean can be updated according to the obtained data. K_ΘΘAnd calculating the gram matrix of the data points by adopting a Gaussian kernel function:

l in equation (26) represents the distance length between data points, and if L is larger, the curve is smoother, and conversely, the curve is steeper. The gram matrix can then be expressed as:

when the new data value needs to be predicted at the parameter θ' is:

p(g(θ')|Θ,y)～N(μ',∑') (28)

wherein:

through the Gaussian process regression, the relational expression between the control parameters and the cost function can be fitted, and the relational expression between the control parameters and the safety function can be fitted. As shown in fig. 2, the fitting mean of gaussian process is the relationship between the control parameter θ and the cost function fitted by using the sampling points. Fig. 3 is a graph fitting a relation between the control parameter θ and the safety evaluation function using gaussian process regression.

The safety evaluation function calculates the current safety value and gives the adjustment range of the control parameter theta of the next step. One possible security function is defined as follows:

e(θ)＝(x_max-x)(x-x_min)(T_s-T) (30)

in the formula (30), x_maxRepresents the maximum value of the state quantity, x_minRepresents the minimum value of the state quantity, T_sRepresenting the maximum allowed overshoot time of the system. Regression analysis was performed on the control parameter θ and the safety function value e using a gaussian process, and the resulting image is shown in fig. 3.

And step eight, a safety evaluation step, namely evaluating according to the safety function obtained in the previous step, assuming that the safety function meets the Lipschitz condition, and evaluating the range of the safety value of the next step by using a Lipschitz constant. The domain of the parameters to be evaluated next is then within the range of the safety value, ensuring sufficient safety.

|e(θ₁)-e(θ₂)|≤L|θ₁-θ₂| (31)

a relatively large Lipschitz constant L may be chosen to ensure that the above equation is satisfied, but a larger constant will result in slower convergence.

The secure set can then be expressed as:

θ∈{θ|e(θ₀)+L(θ-θ₀)>0} (32)

as in FIG. 3, it can be seen that when θ ranges from (x)₀,8]Within the range of (A), the constant larger than 0 of e (theta) can be ensured, and the system safety can be ensured. The selection range of the next sample point is thus chosen to be (x)₀,8]Within.

And step nine, optimizing the parameters of the controller according to the safety range obtained in the last step, obtaining the parameters which enable the cost function to be minimum in the safety range when the optimized parameters of the controller are still in the safety range, and inputting the parameters to the controller.

And (3) the optimized parameters 6 of the controller utilize the selection range of the next parameter obtained in the safety evaluation 5 to calculate the fitted cost function in the step (4), find the point where the minimum value of the cost function is most likely to be located, and adopt Bayesian optimization. Where the gain function takes the desired lift (EI), as follows:

in the formula (33), α (θ) is an expected value of the obtained function, and the larger the value, the smaller the value of the function at the point θ, g_min(θ) represents the minimum of the cost function known at present, and Φ is the normA cumulative density function of the quasi-normal distribution,

is a probability density function of a standard normal distribution. μ (θ), σ (θ) are obtained from equation (29), and:

therefore, the obtained function value of theta at any point can be calculated, and further the next iterative calculation is carried out. FIG. 2 is an exemplary acquisition function, with the next evaluation point being at the maximum of the acquisition function, and further, since at [0, x ]₀]Within the range, the safety evaluation function is less than 0, so the coordinates of the evaluation point are selected to be (x)₀,8]Within the range. The parameter theta required to be evaluated next time is obtained_new

And step ten, after the controller reaches the parameters, repeating the step two until the error between the cost functions obtained by twice repeated calculation is smaller than a certain critical value. The continuation of the test is stopped.

The invention quantifies the control performance by setting the cost function, and utilizes the Gaussian process regression to fit the cost function and the safety evaluation function. Firstly, the safety evaluation function is used for solving which areas are safe, then the Bayesian optimization method is used for optimizing the cost function, and the minimum value of the cost function in the safety range is found out. The optimization strategy can automatically and safely solve the problem of difficult calibration, reduce the operation of manually analyzing data and reduce the dangerous fault of manually setting data.

Claims

1. an automatic safety setting method based on Gaussian process PID control parameters, is characterized in that, this setting method comprises the following steps:

Step 1. Initialize the controller. First, give the controller a very conservative parameter. This parameter is set by yourself, as long as the controlled system can reach the desired ideal value;

Step 2: According to the current vehicle state and the parameters of the existing controller, according to the internal calculation rule of the controller, calculate the output control quantity that should be output, and output it to the controlled object;

Step 3: Applying the control amount to the controlled object, after the controlled object obtains the control amount, the state amount of the next moment is output; also the control amount and the state amount of the next moment are obtained;

Step 4: Input the obtained control quantity and state quantity into the cost function, and the cost value calculated by the cost function indirectly represents the performance of these controller parameters;

Step 5. Input the obtained control quantity and state quantity into the safety evaluation function, and the safety evaluation function calculates the safety value according to these values, and then calculates the safety range of the required evaluation parameters in the next step;

Step 6: After obtaining the controller parameters and the corresponding cost values, perform Gaussian process fitting to obtain the functional relationship between the cost values and the corresponding controller parameters;

Step 7: Perform Gaussian process fitting on the controller parameters and the corresponding safety values, and obtain a functional relationship between the safety values and the fitting of the corresponding controller parameters;

Step 8, the safety evaluation step, evaluate according to the fitting function relationship between the safety value obtained in the previous step and the corresponding controller parameters. Assuming that the safety function satisfies the Lipschitz condition, the Lipschitz constant is used to evaluate the safety range of the next step; The definition domain of the parameters to be evaluated in the next step is within the range of safe values, which can ensure sufficient safety;

Step 9: Based on the safety range obtained in the previous step, optimize the parameters of the controller. The optimized controller parameters are still in the safe range. The parameters that minimize the cost function are obtained within the safe range, and the parameter input to the controller;

Step 10. After the controller obtains the parameters, repeat step 2 until the error between the cost functions obtained by the two repetitions is less than a certain critical value; then stop continuing the test;

Step 11. After parameter setting is completed, stop the test and turn off all equipment;

The specific method of the second step is as follows:

The controller calculates the control output, and the calculation formula of the output is expressed by the following formula:

In formula (1), x is the actual current state quantity, and _xref is the current ideal state quantity; in adaptive cruise control, the state quantity is the current acceleration, and the ideal state quantity is the ideal acceleration; of course, in complex control In the logic, the state quantity is a vector or a scalar; the control quantity u calculates the required control output quantity through PID according to the deviation between the current state quantity and the ideal state quantity. In adaptive cruise control, u is the output driving force or Braking force; here K _p , K _i , K _d are all set parameters; the symbol θ will be used below to represent the required set parameters:

θ=[K _p ,K _i ,K _d ] ^T (2)

In formula (1), the min() and max() functions are used to limit the control variable u not to exceed the physical limit, so that the final solution of the control variable can reach a satisfactory state;

The specific method of the third step is as follows:

The controlled object executes the input quantity, according to Newton's law, the equation of vehicle longitudinal control is expressed as:

In formula (3), m represents the vehicle mass,

is the acceleration, F _u is the control amount calculated by the formula (1), F _r is the sum of various resistances during the driving process of the vehicle; the vehicle execution control output amount generates the state amount at the next moment after execution, which is output to the cost calculated in the function.

2. a kind of automatic safety tuning method based on Gaussian process PID control parameter according to claim 1, is characterized in that, the concrete method of described step 4 is as follows:

The cost function calculates the deviation between the input quantity and the state quantity and the ideal value and then obtains the cost function value. The cost function is expressed as follows:

In formula (4), Q is a quadratic positive definite matrix, which represents the penalty coefficient of the state variable deviation; R represents the penalty degree for the control variable; T _ov represents the overshoot time of the PID control process; G represents the overshoot time The penalty coefficient; the ratio between the penalty coefficients determines which evaluation standard is paid more attention to; this parameter is adapted by the debugging personnel.

3. a kind of automatic safety tuning method based on Gaussian process PID control parameter according to claim 1, is characterized in that, the concrete method of described step 6, step 7 is as follows:

The collected control parameters

Among them, θ _i is the parameter matching selection in each calibration process, M is the amount of data stored in the calibration times, and n _θ is the dimension of the control parameter; with y representing the value of the safety function, the regression can be expressed as the following equation:

y _i =g(θ _i )+ω _i (5)

In formula (5), i represents the data of the ith sample point, ω _i is the independent and identically distributed noise in the measurement process, its mean is 0, the standard deviation is σ _i , and the noise is expressed by a vector in the whole process, namely:

ω=[ω ₁ ,...,ω _n ] ^T ～N(0,∑ _ω ) (6)

in:

Noise is generally caused by instrument measurement and external interference factors;

Thus, the prior for the Gaussian process regression of the measured value is expressed as:

y～N(μ(Θ),K _ΘΘ +σ ² I) (8)

μ(Θ) is the mean value of the measured value, which is given according to the prior knowledge of the debugger at the beginning of the operation, and when the subsequent part of the experimental data is available, the mean value is updated according to the obtained data; K _ΘΘ is the data point The Gram matrix of , is calculated using the Gaussian kernel function to calculate the Gram matrix:

In formula (9), L represents the distance between data points. If L is larger, the curve will be smoother, otherwise, the curve will be steeper; then the Gram matrix is expressed as:

When it is necessary to predict the parameter θ', the new data value is:

p(g(θ')|Θ,y)～N(μ',∑') (11)

in:

Through the above Gaussian process regression, the relationship between the control parameters and the cost function is fitted, and the relationship between the control parameters and the safety function is fitted;

The safety function calculates the current safety value and gives the adjustment range of the next control parameter θ; the safety function is defined as follows:

e(θ)=(x _max -x)(xx _min )(T _s -T) (13)

In formula (13), x _max represents the maximum value of the state quantity, x _min represents the minimum value of the state quantity, and T _s represents the maximum overshoot time allowed by the system.

4. a kind of automatic safety tuning method based on Gaussian process PID control parameter according to claim 3, is characterized in that, the method of described step 8 is as follows:

According to the definition of the safety function, the control parameter θ can be guaranteed to be safe only when the safety function is greater than 0 in the definition domain. Therefore, it is assumed that the safety function satisfies the LipSchitz condition, namely:

|e(θ ₁ )-e(θ ₂ )|≤L|θ ₁ -θ ₂ | (14)

The LipSchitz constant L, which is always greater than the slope of the safety function, is selected to ensure that the above formula is satisfied, but the larger the constant, the slower the convergence;

So the safe set is expressed as:

θ∈{θ|e(θ ₀ )+L(θ-θ ₀ )>0} (15)

When the range of θ is within the range of (x ₀ , 8], e(θ) is guaranteed to be always greater than 0, which ensures the safety of the system; therefore, the selection range of the next sampling point is selected within (x ₀ , 8].

5. a kind of automatic safety tuning method based on Gaussian process PID control parameter according to claim 3, is characterized in that, the concrete method of described step 9 is as follows:

The controller optimization parameters use the selection range of the next parameter obtained in the safety evaluation, calculate the cost function that has been fitted in the Gaussian process fitting, and find the point where the minimum value of the cost function is most likely to be located. Bayesian optimization is used. ; where the acquisition function adopts the expectation improvement, namely EI, as follows:

In formula (16), α(θ) is the expected value of the obtained function. The larger the value is, the smaller the function value at the point θ is, and g _min (θ) represents the minimum value of the currently known cost function. , Φ is the cumulative density function of the standard normal distribution,

is the probability density function of the standard normal distribution; μ(θ), σ(θ) are obtained from equation (12), and:

Thus, the obtained function value of θ at any point is calculated, and then the next iterative calculation is performed.