CN104345735A

CN104345735A - Robot walking control method based on foothold compensator

Info

Publication number: CN104345735A
Application number: CN201410521894.9A
Authority: CN
Inventors: 陈启军; 刘成菊; 许涛
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2014-09-30
Filing date: 2014-09-30
Publication date: 2015-02-11

Abstract

The invention relates to a robot walking control method based on a foothold compensator. The robot walking control method comprises the following steps of firstly, establishing a constraint dynamic model of a robot; secondly, designing the foothold compensator based on the constraint dynamic model according to the constraint dynamic model; thirdly, establishing a heteroscedastic sparse Gauss process regression model and realizing the mapping calculation from input to output of the foothold compensator; fourthly, locally updating the heteroscedastic sparse Gauss process regression model; fifthly, establishing the foothold compensator based on the heteroscedastic sparse Gauss process regression model; sixthly, performing prediction control over the walking of the robot according to the foothold compensator based on the heteroscedastic sparse Gauss process regression model. Compared with the prior art, the robot walking control method disclosed by the invention has the advantages of accurate prediction, high learning speed and the like.

Description

Robot walking control method based on foothold compensator

Technical Field

The invention relates to the field of robot motion control, in particular to a robot walking control method based on a foothold compensator.

Background

The dynamic characteristic and controllability of the humanoid robot can be adjusted by dynamically adjusting the foot-falling point of the robot, so that the suppression capability of the robot on unknown disturbance can be improved. At present, the implementation of the foothold compensator is mostly based on a simple linear controller, and for a humanoid robot, the walking process of the humanoid robot is a complex nonlinear system. Therefore, even if the linear foot-drop point compensator can obtain better gain parameters of the foot-drop point compensator through a learning method, a considerable error still exists between the final foot-drop point compensator strategy and the dynamic characteristics of the robot. In addition, if the learning process of the foothold compensator of the humanoid robot is long, the foothold compensator is not suitable for fast and real-time foothold compensation skill learning of the robot in an unknown environment. Therefore, the invention introduces the nonlinear characteristic and designs the foothold compensator based on the different variance sparse Gaussian process (HspGP) model. In order to realize real-time online adjustment of control of the foothold compensator, a local updating method based on an heteroscedastic Gaussian process model is provided. The local updating method has the advantages of small calculation complexity and high learning rate, and provides a mathematical basis for the online learning of the humanoid robot foothold compensator.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provide a robot walking control method based on a foothold compensator.

The purpose of the invention can be realized by the following technical scheme:

a robot walking control method based on a foothold compensator comprises the following steps:

1) establishing a constraint dynamic model of the robot;

2) designing a constraint dynamic model foot point compensator according to the constraint dynamic model;

3) establishing a different variance sparse Gaussian process regression model to realize the mapping calculation from input to output of the foothold compensator;

4) locally updating a different variance sparse Gaussian process model;

5) building a foot point compensator based on a variance sparse Gaussian process model;

6) and performing predictive control on the robot walking according to the footfall compensator based on the heteroscedastic sparse Gaussian process model.

The step 1) comprises the following steps:

11) establishing a three-dimensional linear inverted pendulum mechanical model of the robot, wherein the expression of the mechanical model is as follows:

x_z＝Cx_c

wherein,

A = [\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{matrix}],

B = [\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}],

C = [\begin{matrix} 1 & 0 & - \frac{z_{c} - z_{z}}{g} \end{matrix}],

position, velocity and acceleration along the x-axis of the robot centroid in the world coordinate system, u_xIs the variation of the acceleration of the center of mass and is used to control the acceleration of the center of mass, p_c＝[x_c，y_c，y_c]^TIs the three-dimensional position of the centroid in the world coordinate system, and p_z＝[x_z，y_z，y_z]^TThe position of the ZMP in a world coordinate system, and g is the gravity acceleration;

12) according to the balance principle that the virtual ZMP is positioned in the sole support polygon, the controllable acceleration constraint formula of the mass center is obtained as follows:

<math><mrow> <msup> <mrow> <mo>[</mo> <msub> <mover> <mi>x</mi> <mrow> <mo>·</mo> <mo>·</mo> </mrow> </mover> <mi>c</mi> </msub> <mo>,</mo> <msub> <mover> <mi>y</mi> <mrow> <mo>·</mo> <mo>·</mo> </mrow> </mover> <mi>c</mi> </msub> <mo>]</mo> </mrow> <mi>T</mi> </msup> <mo>=</mo> <mo>{</mo> <mfrac> <mi>g</mi> <mrow> <msub> <mi>z</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>z</mi> <mi>z</mi> </msub> </mrow> </mfrac> <msup> <mrow> <mo>[</mo> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mo>,</mo> <mrow> <mo>(</mo> <msub> <mi>y</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>y</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mo>]</mo> </mrow> <mi>T</mi> </msup> <mo>|</mo> <msup> <mrow> <mo>[</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <msub> <mi>y</mi> <mi>i</mi> </msub> <mo>]</mo> </mrow> <mi>T</mi> </msup> <msub> <mrow> <mo>&Element;</mo> <mi>S</mi> </mrow> <mi>c</mi> </msub> <mo>}</mo> </mrow></math>

wherein S is_cFor supporting polygonal convex hulls, x_i，y_iIs S_cCoordinates of the inner points.

The step 2) comprises the following steps:

21) establishing a single-foot supporting foot drop point compensator, and defining the output of the compensator as follows:

π＝[△t_ss，Δx_f，Δy_f]^T

wherein, Δ t_ssFor the modification of the remaining duration of the monopod support phase, Δ p, from the time of the landing compensation trigger_f＝[Δx_f，Δy_f，0]^TThe amount of modification to the landing point for the next step,

from the output of the compensator, the modified trajectory of the reference ZMP is:

wherein γ is (t-t)_ss)/t_Ds，p_fTo determine the position of the landing foot point, t_ssDetermining the time from the moment to the end of the monopod support for the output of the foothold_DsFor both feet support time;

22) the inputs defining the one-foot support foothold compensator are:

23) the performance indexes of the single-foot supporting foot-drop point compensator are defined as follows:

wherein,for reference to the ZMP, the method comprises the steps of,for measurement of VZMP, N is the number of steps taken, t_NTime taken for N steps, Δ p_f(k) And Δ t_ss(k) For the kth step footprint modification strategy, the corresponding coefficient is set to Q_z＝1，Q_p100 and Q_t＝1。

The step 3) comprises the following steps:

31) establishing an heteroscedastic sparse Gaussian process model according to the constant variance sparse Gaussian process model and the sparse Gaussian process model, wherein an output prediction equation of the heteroscedastic sparse Gaussian process model is as follows:

<math><mrow> <msub> <mi>λ</mi> <mi>i</mi> </msub> <mo>=</mo> <mi>κ</mi> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mo>-</mo> <mi>κ</mi> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <mover> <mi>x</mi> <mo>&OverBar;</mo> </mover> <mo>)</mo> </mrow> <msubsup> <mi>K</mi> <mi>M</mi> <mrow> <mo>-</mo> <mn>1</mn> </mrow> </msubsup> <mi>κ</mi> <mrow> <mo>(</mo> <mover> <mi>x</mi> <mo>&OverBar;</mo> </mover> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> </mrow></math>

wherein x is_tFor the input state, y is the standard Gaussian process model target output, y ═ y₁，y₂，…，y_N]^TK is the kernel function, k_·＝[κ(x_t，x₁)，κ(x_t，x₂)，…，κ(x_t，x_N)]^T，κ_·＝κ(x₁，x_t) K is a square matrix, and each element is K (i, j) ═ K (x)_i，x_j)，Is the signal variance, W is the weight-diagonal matrix, Λ ═ diag (λ),Q_M＝K_M+K_MN(Λ+diag(r))^-1K_NM，for the uncertainty at each pseudo-input,a pseudo input quantity of a sparse Gaussian process model;

32) and obtaining a final prediction equation according to the unchanged variance at the input state:

wherein,

and is

The step 4) comprises the following steps:

41) firstly, constructing a single pseudo sample point obtained by training into a KD tree in an off-line manner;

42) given an online instance, finding a nearest pseudo sample point according to a KD tree;

43) updating the prediction vector element corresponding to the nearest pseudo sample point;

44) and updating the heteroscedastic sparse Gaussian process model by adopting a one-time batch updating method.

The step 5) comprises the following steps:

51) the output of the different variance sparse Gaussian process model foot point compensator obtained according to the nonparametric mapping capability of the different variance sparse Gaussian process model is as follows:

π＝[min(Δt_ssx，△t_ssy)，Δx_f，Δy_f]^T

wherein, Δ t_ssx＝G_α(s₁) And Δ t_ssx＝G_α(s₁) Is an independent HSpGP model of the heteroscedastic sparse Gaussian process model footpoint compensator along the x-axis, Δ t_ssy＝G_ty(s₂) And Δ y_f＝G_y(s₂) Is an independent HSpGP model of the heteroscedastic sparse Gaussian process model foot point compensator along the x axis,

52) training a different variance sparse Gaussian process model foot point compensator in an off-line mode according to the kinetic model foot point compensator and updating the foot point compensator of the different variance sparse Gaussian process model on line;

53) using the vector from the reference ZMP position to the measured ZMP position as an update output by the foot-drop compensator, calculating the desired foot-drop position as:

y_t=Δy_f+βΔy_z

x_t=Δx_f+βΔx_z

wherein x is_tΔ p for a given predicted input state_f＝[Δx_f，Δy_f]^TThe current output foot-drop point position modifier of the foot-drop point compensator of the heteroscedastic sparse Gaussian process model is obtained.

Compared with the prior art, the invention has the following advantages:

firstly, prediction is accurate, a nonlinear heteroscedastic sparse Gaussian process model foot point compensator is adopted, real-time online adjustment is achieved, and the method is more accurate.

Secondly, the learning speed is high, a local updating method based on the heteroscedastic Gaussian process model is adopted, the calculation complexity is reduced, and the learning speed is improved.

And thirdly, the model is convenient to train, and the overfitting problem of the parameter regression model is avoided.

And fourthly, the model stability is high, and the prediction accuracy of the whole model cannot be damaged due to local updating.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

FIG. 2 is a tracking trajectory diagram of a constrained dynamics model.

FIG. 3 is a tracking trajectory diagram of an ideal linear inverted pendulum model.

FIG. 4 is a diagram of the effect of a landing point compensator based on a constrained dynamical model.

FIG. 5 is a block diagram of a humanoid robot walking control employing an HSpGP foothold compensator.

FIG. 6 is a Y-axis trajectory tracking diagram using the HSpGP foothold compensator.

FIG. 7 is a plan trace using HSpGP landing point compensators.

Detailed Description

The invention is described in detail below with reference to the figures and specific embodiments.

Example (b):

as shown in fig. 1, a robot walking control method based on a foothold compensator includes the following steps:

1) establishing a constraint dynamic model of the robot;

4) locally updating a different variance sparse Gaussian process model;

The step 1) comprises the following steps:

x_z＝Cx_c

wherein,

A = [\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{matrix}],

B = [\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}],

C = [\begin{matrix} 1 & 0 & - \frac{z_{c} - z_{z}}{g} \end{matrix}],

As shown in figures 2 and 3, the role of the constrained dynamic model in the walking control of the humanoid robot is represented vividly, the humanoid robot is subjected to simulation comparison based on the track tracking conditions of the constrained linear inverted pendulum model and the unconstrained linear inverted pendulum model when being impacted, the impact is about 2.5s, and the micro impact is resisted successfully through pre-observation control when the constraint condition of a supporting polygon is not considered. However, in the practical case of considering support for polygon constraints, the robot will not be able to obtain sufficient acceleration of the centroid to control the centroid deviating from the desired trajectory, and therefore the robot inevitably loses balance.

The step 2) comprises the following steps:

π＝[△t_ss，Δx_f，△y_f]^T

22) the inputs defining the one-foot support foothold compensator are:

Such asDrawing (A)4, given the input, output definition, and performance metrics of the footprint compensator, the optimal footprint compensator output can be calculated from each footprint compensator input using a constrained nonlinear optimization Calculation (CNOP) method, since the constrained dynamics model is directly calculable. It can be seen that after the foothold compensator is used, the robot can successfully resist the disturbance and continue to stably advance.

The step 3) comprises the following steps:

wherein x is_tFor the input state, y is the standard Gaussian process model target output, y ═ y₁，y₂，…，y_N]^TK is the kernel function, k_·＝[κ(x_t，x₁)，κ(x_t，x₂)，…，κ(x_t，x_N)]^T，κ_·＝κ(x_t，x_t) K is a square matrix, and each element is K (i, j) ═ K (x)_i，x_j)，Is the signal variance, W is the weight-diagonal matrix, Λ ═ diag (λ),Q_M＝K_M+K_MN(Λ+diag(r))^-1K_NM，for the uncertainty at each pseudo-input,a pseudo input quantity of a sparse Gaussian process model;

wherein,

and is

The step 4) comprises the following steps:

Step 5) comprises the following steps:

π＝[min(Δt_ssx，Δt_ssy)，Δx_f，Δy_f]^T

y_t＝Δy_f+βΔy_z

wherein x is_fΔ p for a given predicted input state_f＝[Δx_f，Δy_f]^TThe current output foot-drop point position modifier of the foot-drop point compensator of the heteroscedastic sparse Gaussian process model is obtained.

The human-simulated robot walking control framework adopting the HSpGP model as the foot-landing point compensator model is shown in fig. 5, the subsequent foot-landing position is planned according to the walking instruction sent by the robot decision layer by the gait planning, the ZMP track is planned according to the foot-landing position, and the expected centroid track is calculated by using the pre-observation controller. And the walking of the robot is realized through a reverse motion model and a PD joint controller. After the robot finishes the current motion instruction, the current state of the mass center is estimated through airborne inertial unit (IMU) measurement information and joint feedback values. The drop point compensator re-modifies subsequent drop points based on the centroid state and the drop point modifier predicted by HspGP and the single-foot support phase time.

To illustrate the role of the updated landing point compensation controller in disturbance rejection, the robot is here made to use the final updated HSpGP model in the simulation software to control the landing point compensation, as shown in FIGS. 6 and 7. In the process of robot walking, a small ball with the mass of 0.4kg is adopted to impact the shoulder of the robot at the speed of 4 m/s. Because gravity does work on the small balls in the flying process of the small balls, the actual impulse generated by the small balls acting on the robot is more than 2 Ns. The impact occurs at around 2.5s and the robot experiences a large displacement of the ZMP after impact. The robot takes a large step to the left under the action of the foothold compensator and most impact disturbances are suppressed. The robot then performs a second step of right foot adjustment to accomplish the suppression of the amount of residual disturbance. From the trajectory diagram in the x-y plane, it can be seen that after the robot has suffered an impact, its ZMP has moved far out of the robot's sole support polygon, if nothing is done the robot will lose balance and fall.

Claims

1. A robot walking control method based on a foothold compensator is characterized by comprising the following steps:

1) establishing a constraint dynamic model of the robot;

3) establishing a different variance sparse Gaussian process regression model, and realizing the mapping calculation from input to output of the constraint dynamic model foot point compensator;

4) locally updating a variance sparse Gaussian process regression model;

5) establishing a different variance sparse Gaussian process regression model-based footpoint compensator;

6) and performing predictive control on the robot walking according to the different-variance-based sparse Gaussian process regression model foot-drop point compensator.

2. The robot walking control method based on the foothold compensator according to claim 1, wherein the step 1) specifically comprises the following steps:

{\dot{x}}_{c} = {Ax}_{c} + {Bu}_{x}

x_z＝Cx_c

wherein,

A = [\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{matrix}], B = [\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}], C = [\begin{matrix} 1 & 0 & - \frac{z_{c} - z_{z}}{g} \end{matrix}], x_{c} = {[x_{c}, {\dot{x}}_{c}, {\overset{. .}{x}}_{c}]}^{T}

<math> <mrow> <msup> <mrow> <mo>[</mo> <msub> <mover> <mi>x</mi> <mrow> <mo>.</mo> <mo>.</mo> </mrow> </mover> <mi>c</mi> </msub> <mo>,</mo> <msub> <mover> <mi>y</mi> <mrow> <mo>.</mo> <mo>.</mo> </mrow> </mover> <mi>c</mi> </msub> <mo>]</mo> </mrow> <mi>T</mi> </msup> <mo>=</mo> <mo>{</mo> <mfrac> <mi>g</mi> <mrow> <msub> <mi>z</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>z</mi> <mi>z</mi> </msub> </mrow> </mfrac> <mo>[</mo> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <mo>,</mo> <mrow> <mo>(</mo> <msub> <mi>y</mi> <mi>c</mi> </msub> <mo>-</mo> <msub> <mi>y</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> <msup> <mo>]</mo> <mi>T</mi> </msup> <mo>|</mo> <msup> <mrow> <mo>[</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <msub> <mi>y</mi> <mi>i</mi> </msub> <mo>]</mo> </mrow> <mi>T</mi> </msup> <mo>&Element;</mo> <msub> <mi>S</mi> <mi>c</mi> </msub> <mo>}</mo> </mrow> </math>

3. The robot walking control method based on the foothold compensator according to claim 1, wherein the step 2) specifically comprises the following steps:

21) establishing a constraint dynamic model foot point compensator, and defining the output of the compensator as follows:

π＝[Δt_ss，Δx_f，Δy_f]^T

22) the inputs defining the constraint dynamical model footpoint compensator are:

s = {[x_{c}, {\dot{x}}_{c}, y_{c}, {\dot{y}}_{c}]}^{T};

23) defining the performance indexes of the constraint dynamic model foot point compensator as follows:

4. The robot walking control method based on the foothold compensator of claim 1, wherein the step 3) specifically comprises the following steps:

<math> <mrow> <msub> <mi>λ</mi> <mi>i</mi> </msub> <mo>=</mo> <mi>κ</mi> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <msub> <mi>x</mi> <mi>j</mi> </msub> <mo>)</mo> </mrow> <mo>-</mo> <mi>κ</mi> <mrow> <mo>(</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>,</mo> <mover> <mi>x</mi> <mo>&OverBar;</mo> </mover> <mo>)</mo> </mrow> <msubsup> <mi>K</mi> <mi>M</mi> <mrow> <mo>-</mo> <mn>1</mn> </mrow> </msubsup> <mi>κ</mi> <mrow> <mo>(</mo> <mover> <mi>x</mi> <mo>&OverBar;</mo> </mover> <mo>,</mo> <msub> <mi>x</mi> <mi>i</mi> </msub> <mo>)</mo> </mrow> </mrow> </math>

wherein x is_tFor the input state, y is the standard Gaussian process model target output, y ═ y₁，y₂，…，y_N]^TK is the kernel function, k_*＝[κ(x_t，x₁)，κ(x_t，x₂)，…，κ(x_t，x_N)]^T，κ_*＝κ(x_t，x_t) K is a square matrix, and each element is K (i, j) ═ K (x)_i，x_j)，Is the signal variance, W is the weight-diagonal matrix, Λ ═ diag (λ),Q_M＝K_M+K_MN(Λ+diag(r))^-1K_NM，for the uncertainty at each pseudo-input,a pseudo input quantity of a sparse Gaussian process model;

wherein,

and is

5. The robot walking control method based on the foothold compensator according to claim 1, wherein the step 4) specifically comprises the following steps:

6. The robot walking control method based on the foothold compensator according to claim 1, wherein the step 5) specifically comprises the following steps:

π＝[min(Δt_ssx，Δt_ssy)，Δx_f，Δy_f]^T

wherein, Δ t_ssx＝G_ts(s_t) And Δ t_ssx＝G_tx(s₁) Is a heteroscedastic sparse Gaussian processIndependent HSpGP model, Δ t, of model footpoint compensator along the x-axis_ssy＝G_ty(s₂) And Δ y_f＝G_y(s₂) Is an independent HSpGP model of the heteroscedastic sparse Gaussian process model foot point compensator along the x axis,

y_t＝Δy_f+βΔy_z

x_t＝Δx_f+βΔx_z