CN113110062A - Robot control system based on deep physical network

Info

Publication number: CN113110062A
Application number: CN202110497945.9A
Authority: CN
Inventors: 宋子豪
Original assignee: Hunan Taiguan Technology Co ltd
Current assignee: Hunan Taiguan Technology Co ltd
Priority date: 2021-05-08
Filing date: 2021-05-08
Publication date: 2021-07-13

Abstract

The invention discloses a robot control system based on a deep physical network and relates to the technical field of robot systems. The system operates in the following steps: first, position and velocity information is acquired; second, the position information is passed through a kinetic-energy (T) GNN to compute the kinetic energy, and the velocity information is passed through a potential-energy (V) MLP to compute the potential energy; third, the Lagrangian is computed from the kinetic energy data and converted into current state data using the potential energy data, and the current state data is evaluated by a preset control-policy MLP. The system learns to compensate its kinematic, dynamic, and disturbance models: learning the dynamics gives neural-network-based nonlinear control a physical meaning; learning the kinematics lets the robot learn its own model so that it can adapt to changes in its own condition; and learning the disturbance lets the robot compensate for unknown disturbances. This improves the system's proprioception, exploits prior knowledge of its own structure during learning, reduces the effect of disturbances that cannot be identified, and improves disturbance rejection.

Description

Robot control system based on deep physical network

Technical Field

The invention relates to the technical field of robot systems, in particular to a robot control system based on a deep physical network.

Background

Bionic robots mainly operate in complex, unknown environments. Controlling the robot body is an under-actuated control problem, which is harder to solve and places higher demands on perception and planning. Existing numerical solvers for the equations of motion struggle with most under-actuated control problems, and deep neural networks are often not stable enough when deployed on a robot. A framework based on a deep physical network is therefore proposed to improve the control stability of complex systems. Deep reinforcement learning (DRL) methods use a neural-network-based black-box control system to learn the control equations directly from a large number of samples, for example learning the acceleration of the system from the particle positions, the particle velocities, and the control signal. The learned result is unstable, mainly in the following respects: most numerical solutions have no physical meaning and do not exist in the real world; the system lacks proprioception; prior knowledge of the system's own structure is difficult to learn; and the influence of disturbances cannot be distinguished, so disturbance rejection is poor.

Disclosure of Invention

The object of the present invention is to provide a robot control system based on a deep physical network that solves the problems described in the background above.

To achieve this object, the invention adopts the following technical solution: a robot control system based on a deep physical network, comprising the following steps:

Step one: acquire position and velocity information;

Step two: pass the position information through the kinetic-energy (T) GNN to compute the kinetic energy, and pass the velocity information through the potential-energy (V) MLP to compute the potential energy;

Step three: compute the Lagrangian from the kinetic energy data, convert it into current state data using the potential energy data, and evaluate the current state data with the preset control-policy MLP;

Step four: evaluate the current state with the preset disturbance-model BNN and compensate the control input;

Step five: determine the acceleration of the robot from the Lagrangian and the control input.

Further, according to the operation in step one, a forward model is solved based on the position, velocity, and control output:

Further, according to the operation in step one, the generalized acceleration is obtained:

Runge-Kutta (RK) numerical integration can be used to obtain the future state of the dynamic system.

The control variable $\tau$ is calculated using a model predictive control (MPC) algorithm.

Further, according to the operation in step two, the robot obtains a low-dimensional representation $s_t$ of $x_t$ through $p(s_t \mid o_t, s_{t-1}, a_{t-1})$. We can assume that $s_t$ is a coarse-grained representation of $x_t$ that obeys the same laws, and substitute accordingly:

A multi-layer perceptron (MLP) learns the kinetic and potential energies:

$V_{MLP} = \mathrm{MLP}(q)$.

Further, according to the operation in step three, the robot's mechanical structure is unknown to the robot itself, so the control problem must first solve a pattern-recognition problem. The robot can learn a kinematic model of its own mechanical structure through a graph neural network (GNN). The input of the GNN is a graph $G = (V, E)$ describing the particle dynamics; it contains a variable number of vertices and edges, where the vertices are the particles inferred by the system and the edges are the inter-particle interactions inferred by the system. The output of the GNN is the kinematic feature of the system. Using the GNN framework:

That is, the approximated kinetic energy:

We then have the control equation with kinematics:

Further, according to the operation in step four, the non-conservative force is further decomposed into the effect of the system's mechanical output on the whole system and the disturbances acting on the system, such as friction and load: $\tau = B(q)\,a + \epsilon$, where $a = \pi(a_t \mid s_t)$ is the control policy, which the Jacobian matrix $B(q)$ converts into the control input of the whole system, and the disturbances $\epsilon \sim p(\epsilon \mid s_t)$ acting on the system depend on the current state. The Jacobian matrix $B(q)$ is learned by an MLP, and the disturbance model $p(\epsilon \mid s_t)$ is learned by a Bayesian neural network (BNN).

Further, according to the operation in step one, Lagrangian mechanics is used to describe the dynamic system $x_t$, whose generalized coordinates are:

During the motion of the dynamic system, all particles move from state $x_t$ to the next state $x_{t+1}$. The particles may follow different paths from $x_t$ to $x_{t+1}$, and each path has a Lagrangian $\mathcal{L} \equiv T - V$,

where $T$ and $V(q)$ are the kinetic energy and the potential energy, respectively.

Further, according to the operation in step three, the robot has multiple possible paths and changes the system state through its control output. This includes holding the system in a particular non-steady state, and an external force may also knock the system out of a steady state, so the Lagrangian satisfies the non-conservative Euler-Lagrange equation:

Further, according to the operation in step three, $\tau$ represents the non-conservative force in the system, such that:

We obtain:

Further, according to the operation in step three, the chain rule further expands the time derivative:

The invention provides a robot control system based on a deep physical network, which has the following beneficial effects:

The system learns to compensate its kinematic, dynamic, and disturbance models. Learning the dynamics gives neural-network-based nonlinear control a physical meaning; learning the kinematics lets the robot learn its own model so that it can adapt to changes in its own condition; and learning the disturbance lets the robot compensate for unknown disturbances. This improves the system's proprioception, exploits prior knowledge of its own structure during learning, reduces the effect of disturbances that cannot be identified, and improves disturbance rejection.

Drawings

FIG. 1 is a control schematic diagram of a robot control system based on a deep physical network according to the present invention;

FIG. 2 is a schematic diagram of a GNN architecture of the robot control system based on a deep physical network according to the present invention;

FIG. 3 is a schematic diagram of a control model of the robot control system based on a deep physical network according to the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention.

Examples of the embodiments are illustrated in the accompanying drawings, in which like reference numerals denote the same or similar elements, or elements having the same or similar functions, throughout. The embodiments described below with reference to the drawings are illustrative; they are intended to explain the invention and are not to be construed as limiting it.

The invention is further described below with reference to the embodiments and drawings.

Referring to FIGS. 1-3, the present invention provides the following technical solution: a robot control system based on a deep physical network, comprising the following steps:

Step one: acquire position and velocity information;

Specifically, according to the operation in step one, a forward model is solved based on the position, velocity, and control output:

Specifically, according to the operation in step one, the generalized acceleration is obtained:
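One way to picture the forward model is as a function that returns the generalized acceleration for a given position, velocity, and generalized force. The sketch below assumes the standard Euler-Lagrange relation with a non-conservative force, H q̈ + C q̇ − ∂L/∂q = τ, where H and C are second derivatives of the Lagrangian. It approximates those derivatives by central finite differences instead of a learned network. The names `lagrangian_accel` and `pendulum_lagrangian` and the step size `h` are illustrative assumptions, not taken from the patent.

```python
import numpy as np

def lagrangian_accel(L, q, qd, tau, h=1e-3):
    """Generalized acceleration from a scalar Lagrangian L(q, qd).

    Assumes H @ qdd + C @ qd - dL/dq = tau, with
    H[i, j] = d2L / (dqd_i dqd_j) and C[i, j] = d2L / (dqd_i dq_j),
    all estimated by central finite differences (illustrative only).
    """
    q = np.asarray(q, dtype=float)
    qd = np.asarray(qd, dtype=float)
    tau = np.asarray(tau, dtype=float)
    n = q.size
    eye = np.eye(n)

    def dL_dqd(q_, qd_):
        # Gradient of L with respect to the generalized velocities.
        return np.array([(L(q_, qd_ + h * eye[i]) - L(q_, qd_ - h * eye[i])) / (2 * h)
                         for i in range(n)])

    # Gradient of L with respect to the generalized coordinates.
    dL_dq = np.array([(L(q + h * eye[i], qd) - L(q - h * eye[i], qd)) / (2 * h)
                      for i in range(n)])
    # Column j holds the derivative of dL/dqd with respect to qd_j (or q_j).
    H = np.stack([(dL_dqd(q, qd + h * eye[j]) - dL_dqd(q, qd - h * eye[j])) / (2 * h)
                  for j in range(n)], axis=1)
    C = np.stack([(dL_dqd(q + h * eye[j], qd) - dL_dqd(q - h * eye[j], qd)) / (2 * h)
                  for j in range(n)], axis=1)
    return np.linalg.solve(H, tau + dL_dq - C @ qd)

def pendulum_lagrangian(q, qd, g=9.81):
    # Unit mass and unit length: T = 0.5 * qd^2, V = -g * cos(q).
    return 0.5 * qd[0] ** 2 + g * np.cos(q[0])
```

For the unit pendulum the result should match the textbook expression q̈ = τ − g sin(q), which gives a quick sanity check on any learned Lagrangian plugged into the same function.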

Runge-Kutta (RK) numerical integration can be used to obtain the future state of the dynamic system.

The control variable $\tau$ is calculated using a model predictive control (MPC) algorithm.
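The Runge-Kutta integration mentioned above can be sketched as a single classical fourth-order (RK4) step. The sketch assumes a forward model `accel(q, qd, tau)` that returns the generalized acceleration and holds the control variable constant over the step. The function names and the unit-mass spring used as an example are illustrative assumptions, not part of the patent.

```python
import numpy as np

def rk4_step(accel, q, qd, tau, dt):
    """Advance (q, qd) by one RK4 step of length dt, holding tau constant."""
    q = np.asarray(q, dtype=float)
    qd = np.asarray(qd, dtype=float)
    n = q.size

    def f(x):
        # State x = [q, qd]; its time derivative is [qd, qdd].
        pos, vel = x[:n], x[n:]
        return np.concatenate([vel, accel(pos, vel, tau)])

    x = np.concatenate([q, qd])
    k1 = f(x)
    k2 = f(x + 0.5 * dt * k1)
    k3 = f(x + 0.5 * dt * k2)
    k4 = f(x + dt * k3)
    x_next = x + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return x_next[:n], x_next[n:]

def spring_accel(q, qd, tau):
    # Hypothetical forward model: unit mass on a unit spring, qdd = tau - q.
    return tau - q
```

An MPC loop would call such a step repeatedly to roll candidate control sequences forward over its prediction horizon before choosing the next control variable.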

Specifically, according to the operation in step two, the robot obtains a representation $s_t$ of $x_t$ through $p(s_t \mid o_t, s_{t-1}, a_{t-1})$. We can assume that $s_t$ is a coarse-grained representation of $x_t$ that obeys the same laws, and substitute accordingly:

A multi-layer perceptron (MLP) learns the kinetic and potential energies:

$V_{MLP} = \mathrm{MLP}(q)$.
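As a minimal illustration of the relation V_MLP = MLP(q), the sketch below evaluates a small fully connected network that maps generalized coordinates to a scalar potential energy. The layer sizes, the `tanh` activation, the random initialization, and the function names are assumptions made for the example. The weights here are untrained, so the output only demonstrates the data flow, not a learned energy.

```python
import numpy as np

def init_mlp(sizes, seed=0):
    """Random weights for layers of the given sizes, e.g. [2, 16, 1]."""
    rng = np.random.default_rng(seed)
    return [(rng.normal(0.0, 1.0 / np.sqrt(m), (n, m)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp(params, x):
    """Forward pass: tanh hidden layers followed by a linear output layer."""
    h = np.asarray(x, dtype=float)
    for W, b in params[:-1]:
        h = np.tanh(W @ h + b)
    W, b = params[-1]
    return W @ h + b

def potential_energy(params, q):
    """Scalar V(q), assuming the last layer has a single output unit."""
    return float(mlp(params, q)[0])
```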

Specifically, according to the operation in step three, the robot's mechanical structure is unknown to the robot itself, so the control problem must first solve a pattern-recognition problem. The robot can learn a kinematic model of its own mechanical structure through a graph neural network (GNN). The input of the GNN is a graph $G = (V, E)$ describing the particle dynamics; it contains a variable number of vertices and edges, where the vertices are the particles inferred by the system and the edges are the inter-particle interactions inferred by the system. The output of the GNN is the kinematic feature of the system. Using the GNN framework:

That is, the approximated kinetic energy:

We then have the control equation with kinematics:
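As a rough illustration of how a graph of particles and interactions can be mapped to a scalar kinetic-energy estimate, the sketch below runs one round of message passing over the edges and sums a non-negative per-vertex readout. The architecture (one round, `tanh` updates, softplus readout), the weight shapes, and the name `gnn_kinetic_energy` are assumptions for the example and are not taken from the patent. The weights would normally be learned.

```python
import numpy as np

def gnn_kinetic_energy(nodes, edges, params):
    """One message-passing round followed by a summed scalar readout.

    nodes:  (N, F) array of per-particle features (e.g. position, velocity).
    edges:  list of (sender, receiver) index pairs.
    params: (W_e, W_n, w_r) with shapes (M, 2F), (H, F + M), (H,).
    """
    W_e, W_n, w_r = params
    agg = np.zeros((nodes.shape[0], W_e.shape[0]))
    for s, r in edges:
        # Message from sender s to receiver r, summed at the receiver.
        agg[r] += np.tanh(W_e @ np.concatenate([nodes[s], nodes[r]]))
    # Vertex update from its own features and the aggregated messages.
    hidden = np.tanh(np.concatenate([nodes, agg], axis=1) @ W_n.T)
    # Softplus keeps each vertex contribution non-negative.
    per_node = np.logaddexp(0.0, hidden @ w_r)
    return float(per_node.sum())
```

Because the messages are summed per receiver and the readout is summed over vertices, the result does not depend on how the particles are numbered, which is the property that lets the same network handle a variable number of vertices and edges.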

Specifically, according to the operation in step four, the non-conservative force is further decomposed into the effect of the system's mechanical output on the whole system and the disturbances acting on the system, such as friction and load: $\tau = B(q)\,a + \epsilon$, where $a = \pi(a_t \mid s_t)$ is the control policy, which the Jacobian matrix $B(q)$ converts into the control input of the whole system, and the disturbances $\epsilon \sim p(\epsilon \mid s_t)$ acting on the system depend on the current state. The Jacobian matrix $B(q)$ is learned by an MLP, and the disturbance model $p(\epsilon \mid s_t)$ is learned by a Bayesian neural network (BNN).
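The decomposition τ = B(q)·a + ε can be sketched numerically as follows. The example assumes that B(q) has already been evaluated as a matrix and that the disturbance model is represented by samples drawn from p(ε | s_t), whose mean serves as the disturbance estimate. Compensating by subtracting that mean and solving for the action in the least-squares sense is one plausible reading of "compensating the control input", not the patent's stated procedure. All function names are hypothetical.

```python
import numpy as np

def generalized_force(B, action, eps_samples):
    """tau = B(q) a + eps, with eps taken as the mean of the disturbance samples."""
    eps_mean = np.mean(eps_samples, axis=0)
    return B @ action + eps_mean

def compensated_action(B, tau_desired, eps_samples):
    """Action a such that B a + E[eps] is closest to tau_desired (least squares)."""
    eps_mean = np.mean(eps_samples, axis=0)
    action, *_ = np.linalg.lstsq(B, tau_desired - eps_mean, rcond=None)
    return action
```

With `B = [[1, 0], [0, 2]]`, a desired force of `[1, 2]`, and disturbance samples averaging `[0.5, -1.0]`, the compensated action is `[0.5, 1.5]`, and feeding it back through `generalized_force` recovers the desired force.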

Specifically, according to the operation in step one, Lagrangian mechanics is used to describe the dynamic system $x_t$, whose generalized coordinates are:

$T$ and $V(q)$ are the kinetic energy and the potential energy, respectively.
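For reference, claim 7 gives the Lagrangian as the difference T − V. Written out with the usual arguments, and assuming the kinetic energy depends on both the generalized coordinates and their velocities (the standard convention, which is not stated explicitly in the surviving text), this reads:

```latex
\mathcal{L}(q, \dot{q}) \equiv T(q, \dot{q}) - V(q)
```

Here $q$ denotes the generalized coordinates and $\dot{q}$ their time derivatives.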

Specifically, according to the operation in step three, the robot has multiple possible paths and changes the system state through its control output. This includes holding the system in a particular non-steady state, and an external force may also knock the system out of a steady state, so the Lagrangian satisfies the non-conservative Euler-Lagrange equation:
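In standard notation, the Euler-Lagrange equation with a non-conservative generalized force has the textbook form below. It is given as a reference form, with $\mathcal{L}$ the Lagrangian and $\tau$ the non-conservative force in the sense used in this description. The patent's own typeset equation is not reproduced here.

```latex
\frac{\mathrm{d}}{\mathrm{d}t}\,\frac{\partial \mathcal{L}}{\partial \dot{q}}
  - \frac{\partial \mathcal{L}}{\partial q} = \tau
```

Setting $\tau = 0$ recovers the conservative case, in which no control output or disturbance acts on the system.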

Specifically, according to the operation in step three, $\tau$ represents the non-conservative force in the system, such that:

We obtain:

Specifically, according to the operation in step three, the chain rule further expands the time derivative:
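A sketch of this expansion, assuming the Lagrangian $\mathcal{L}(q, \dot{q})$ has no explicit time dependence and that its Hessian with respect to $\dot{q}$ is invertible: applying the chain rule to the time derivative in the Euler-Lagrange equation and solving for the generalized acceleration gives the usual forward-model form.

```latex
\frac{\partial^2 \mathcal{L}}{\partial \dot{q}\,\partial \dot{q}^{\top}}\,\ddot{q}
  + \frac{\partial^2 \mathcal{L}}{\partial \dot{q}\,\partial q^{\top}}\,\dot{q}
  - \frac{\partial \mathcal{L}}{\partial q} = \tau
\quad\Longrightarrow\quad
\ddot{q} = \left(\frac{\partial^2 \mathcal{L}}{\partial \dot{q}\,\partial \dot{q}^{\top}}\right)^{-1}
  \left[\tau + \frac{\partial \mathcal{L}}{\partial q}
  - \frac{\partial^2 \mathcal{L}}{\partial \dot{q}\,\partial q^{\top}}\,\dot{q}\right]
```

This is the sense in which step five determines the acceleration from the Lagrangian and the control input.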

The above is only a preferred embodiment of the present invention. It should be noted that those skilled in the art can make many variations and modifications without departing from the inventive concept of the present invention, all of which fall within the protection scope of the present invention.

Claims

1. A robot control system based on a deep physical network, characterized by comprising the following steps:

S1, acquiring position and velocity information;

S2, passing the position information through a kinetic-energy (T) GNN to compute the kinetic energy, and passing the velocity information through a potential-energy (V) MLP to compute the potential energy;

S3, computing the Lagrangian from the kinetic energy data, converting it into current state data using the potential energy data, and evaluating the current state data with a preset control-policy MLP;

S4, evaluating the current state with a preset disturbance-model BNN and compensating the control input;

S5, determining the acceleration of the robot from the Lagrangian and the control input.

2. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step one, a forward model is solved based on the position, velocity, and control output:

3. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step one, the generalized acceleration is obtained:

and the control variable $\tau$ is calculated using a model predictive control (MPC) algorithm.

4. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step two, the robot obtains a representation $s_t$ of $x_t$ through $p(s_t \mid o_t, s_{t-1}, a_{t-1})$; $s_t$ can be assumed to be a coarse-grained representation of $x_t$ that obeys the same laws, and is substituted accordingly:

A multi-layer perceptron (MLP) learns the kinetic and potential energies:

$V_{MLP} = \mathrm{MLP}(q)$.

5. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step three, the robot's mechanical structure is unknown to the robot itself, so the control problem must first solve a pattern-recognition problem; the robot can learn a kinematic model of its own mechanical structure through a graph neural network (GNN); the input of the GNN is a graph $G = (V, E)$ describing the particle dynamics, which contains a variable number of vertices and edges, where the vertices are the particles inferred by the system and the edges are the inter-particle interactions inferred by the system; and the output of the GNN is the kinematic feature of the system. Using the GNN framework:

That is, the approximated kinetic energy:

We then have the control equation with kinematics:

6. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step four, the non-conservative force is further decomposed into the effect of the system's mechanical output on the whole system and the disturbances acting on the system, such as friction and load: $\tau = B(q)\,a + \epsilon$, where $a = \pi(a_t \mid s_t)$ is the control policy, which the Jacobian matrix $B(q)$ converts into the control input of the whole system, and the disturbances $\epsilon \sim p(\epsilon \mid s_t)$ acting on the system depend on the current state; the Jacobian matrix $B(q)$ is learned by an MLP, and the disturbance model $p(\epsilon \mid s_t)$ is learned by a Bayesian neural network (BNN).

7. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step one, Lagrangian mechanics is used to describe the dynamic system $x_t$, whose generalized coordinates are:

During the motion of the dynamic system, all particles move from state $x_t$ to the next state $x_{t+1}$; the particles may follow different paths from $x_t$ to $x_{t+1}$, and each path has a Lagrangian $\mathcal{L} \equiv T - V$,

where $T$ and $V(q)$ are the kinetic energy and the potential energy, respectively.

8. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step three, the robot has multiple possible paths and changes the system state through its control output, which includes holding the system in a particular non-steady state, and an external force may also knock the system out of a steady state, so the Lagrangian satisfies the non-conservative Euler-Lagrange equation:

9. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step three, $\tau$ represents the non-conservative force in the system, such that:

We obtain:

10. The robot control system based on a deep physical network according to claim 1, characterized in that, according to the operation in step three, the chain rule further expands the time derivative: