CN108656119A - A kind of control method of humanoid robot - Google Patents

A kind of control method of humanoid robot Download PDF

Info

Publication number
CN108656119A
CN108656119A CN201810774005.8A CN201810774005A CN108656119A CN 108656119 A CN108656119 A CN 108656119A CN 201810774005 A CN201810774005 A CN 201810774005A CN 108656119 A CN108656119 A CN 108656119A
Authority
CN
China
Prior art keywords
robot
action
model
data
gaussian process
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810774005.8A
Other languages
Chinese (zh)
Inventor
宓建
林艺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201810774005.8A priority Critical patent/CN108656119A/en
Publication of CN108656119A publication Critical patent/CN108656119A/en
Pending legal-status Critical Current

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1602Programme controls characterised by the control system, structure, architecture
    • B25J9/1605Simulation of manipulator lay-out, design, modelling of manipulator

Landscapes

  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Manipulator (AREA)

Abstract

A kind of control method of humanoid robot, includes the following steps:Step 1:The general joint action data feature that the humanoid robot is extracted using principal component analysis carries out dimension-reduction treatment to data;Step 2:Pass through the action model of Gaussian process Dynamic Model robot in lower dimensional space;Step 3:The humanoid robot is controlled by the action model of robot to be moved.Directly the action of robot learn by using Gaussian process dynamic model and effectively establishes action model, to the effective solution excessively high problem of robot motion data dimension, can learning training effectively be carried out to the action of robot by dimension-reduction treatment, the feature of robot motion is showed in lower dimensional space.

Description

Control method of humanoid robot
Technical Field
The invention relates to the field of robots, in particular to a control method of a humanoid robot.
Background
In recent years, research and development of a humanoid robot have been actively carried out, and there are many problems to be overcome in the research of a humanoid robot motion model. For the research of human motion models, there are many researchers, and various human motion simulation systems have been established using humanoid robots. Human motion models are not directly applicable to humanoid robots.
And, with the increase of the degree of freedom of the robot, the control difficulty to the robot will increase correspondingly, this leads to the robot motion accuracy to be lower. In addition, the action amplitude of the robot is small, various interference factors cannot be avoided along with the action of the robot, and the factors increase the complexity of establishing a robot action model and the difficulty of learning.
Disclosure of Invention
Aiming at the problems, the invention provides a method for controlling the robot-like robot to move by establishing a robot action model after carrying out dimensionality reduction on high-dimensional data through a Gaussian process dynamic model. The control method can effectively simplify the complexity of establishing the robot action model, and the control of the robot is more stable and accurate.
In order to achieve the purpose, the invention adopts the following technical scheme:
a method for controlling a humanoid robot, comprising the steps of:
step 1: adopting principal component analysis to extract the characteristics of the whole body joint action data of the humanoid robot, and performing dimensionality reduction on the data;
step 2: establishing an action model of the robot in a low-dimensional space through a Gaussian process dynamic model;
and step 3: controlling the humanoid robot to move through an action model of the robot;
specifically, the dimension reduction processing in step 1 specifically comprises the following steps:
1-1: inputting a data setOutput setWhereinAn observation set representing the pose of the robot in continuous time,representing a low-dimensional data set after principal component analysis dimensionality reduction processing;
1-2: carrying out zero mean normalization processing on Y and calculating a covariance matrix
Calculating the eigenvector U and the eigenvalue lambda according to the eigenvaluesIn the order of (1) toIs recombined intoWherein
1-3: to the output setPerforming a calculation in which
Specifically, the specific steps of establishing the robot motion model through the gaussian process dynamic model in the step 2 are as follows:
2-1: input data Y, output parameters
2-2: the initialization of the X is carried out,executing the dynamic model of the Gaussian process from i =1 to i = Iteration, wherein Iteration is the learning times of the dynamic model of the whole Gaussian process;
2-3: pairAndget the same time to distributeTaking its negative logarithmic probability to obtain
Starting from j =1 to j = D, the formula is executed
After solving W, ending the j loop;
executing the formula starting from M =1 to M = M
To obtainEnding the m-cycle after the optimal solution;
2-4: the i loop is ended.
The invention has the beneficial effects that: the Gaussian process dynamic model is adopted to directly learn the motion of the robot and effectively establish the motion model, so that the problem of overhigh dimension of motion data of the robot is effectively solved, the motion of the robot can be effectively learned and trained through dimension reduction processing, and the motion characteristics of the robot are displayed in a low-dimensional space. Meanwhile, the action of the robot at the next moment can be predicted, and the action prediction and planning of the robot are realized, so that the robot is controlled more effectively, and the self-control of the robot in an unmanned environment and under the condition of no instruction can be realized. The control method can be widely applied to the establishment of action models of industrial robots such as humanoid robots and mechanical arms, and has great significance for realizing more stable and accurate realization of various actions of the robots.
Drawings
Fig. 1 shows a humanoid robot controlled by the present invention.
FIG. 2 is a flow chart of principal component analysis dimension reduction processing according to the present invention.
Fig. 3 is a walking diagram of the humanoid robot of the present invention.
Fig. 4 is a walking model of the humanoid robot of the present invention.
Fig. 5 is a comparison graph of the restored data after the learning of the humanoid robot of the present invention and the original robot joint data.
Detailed Description
The present invention will be described in more detail with reference to the accompanying drawings and specific embodiments, but the present invention is not limited to the following embodiments, and can be modified and implemented as desired within a scope not departing from the gist of the present invention.
As shown in figure 1, the invention adopts an NAO robot to perform modeling processing after collecting the motion data. As shown in fig. 1, the NAO robot has 25 degrees of freedom of joints throughout the body, and the motor controlling the 25 joints can drive the NAO robot to perform various motions.
And (3) extracting the characteristics of the whole body joint action data of the NAO robot by adopting principal component analysis, and performing dimensionality reduction on the data. Through dimension reduction processing, learning can be simplified, and the calculation cost of the whole robot action data learning is reduced.
As shown in fig. 2, the vectorIs an observation set of the robot postures in continuous time,comprisesNA state, each stateIs as followsRepresents an instantaneous state value of the robot motion trajectory. Each state is defined asWhereinIs the dimension of each state. XIs a low-dimensional data set after principal component analysis and dimension reduction processing. At this time. Dimension reduction from D-dimensional data to 3D data can be realized by performing dimension reduction on principal component analysis ()。
The principal component analysis dimension reduction treatment comprises the following specific steps:
inputting a data setOutput set(ii) a Carrying out zero mean normalization processing on Y and calculating a covariance matrix(ii) a Calculating the eigenvector U and the eigenvalue lambda according to the eigenvaluesIn the order of (1) toIs recombined intoWherein(ii) a To the output setPerforming a calculation in which
After the principal component analysis and the dimension reduction processing, an action model of the robot is established in a low-dimensional space through a Gaussian process dynamic model. In the learning process of the Gaussian process dynamic model, the learning parameters are
Wherein,for the observation set and the low-dimensional state set,in order to be a hyper-parameter,is a scale parameter. At any timeThe state equation is:
(1)
(2)
is at the moment of timeThe state value of the space after the dimension reduction,it is the observed value corresponding to the time of day,andin order to be a noise, the noise is,andis formed by a non-linear basis functionAndas defined.
(3)
(4)
Here, ,,a and B follow a normal distribution.
Observation ofThe probability distribution with respect to X is as follows
(5)
Wherein,is formed byA defined core matrix. In GPDM, we use RBF kernel functions,
(6)
wherein,in order to be a scale parameter,to control the parameters of the kernel function width,is a noise variance parameter.
(7)
Here, the number of the first and second electrodes,
(8)
is defined by a linear kernel function and an RBF kernel function together.The part of the RBF kernel function is controlled,the linear kernel function portion is controlled.
During the learning process, some parameters are processed approximately. The method comprises the following specific steps:
observation ofThe X initial value is a known condition and can also be initialized by principal component analysis.
It needs to be estimated by learning. By the formulae (5) to (8), the compounds can be obtainedAndwhile being distributed as
(9)
The negative log probability is:
(10)
wherein,are constant parameters.
According to the formulae (5) and (7), the compounds are obtained
(11)
(12)
By minimizing the negative logarithmic probability of the formula (10) by the formulas (11) and (12), it can be estimatedThe value of (c). And (5) iterating for M times by adopting a scaling conjugate gradient method to obtain an optimal solution.
The specific algorithm of the gaussian process dynamic model is as follows:
input data Y, output parameters(ii) a Initializing X using principal component analysis dimension reduction processing, initializingThe gaussian process dynamic model is executed from i =1 to i = Iteration, where Iteration is the number of times the entire gaussian process dynamic model is learned.
Equation (10) is executed starting from j =1 to j = D, and the cycle of j is ended after W is solved.
Equation (10) is calculated from M =1 to M = MAnd ending the m-loop after the optimal solution.
The i loop is ended.
After the robot action model is established, the NAO robot can be controlled to move through the action model. As shown in fig. 3, the NAO robot is controlled to walk like a human, and data of 25 joints of the whole body in the walking process of the robot are collected simultaneously. Each frame has 25 joints of motion data, which represents the state of the robot at a certain moment in the walking process. And randomly recording data of 1000 continuous frames in the walking of the robot, and learning and modeling the walking action of the robot by using a Gaussian process dynamic model. The whole learning takes 5 minutes, and the robot walking action model is effectively learned and established.
As shown in fig. 4, the robot continuously performs repetitive motions during walking, so the models obtained by learning are also overlapped. In the 3D space, the walking motion of the robot can be visually and clearly shown. And the action state of the robot at the next moment can be predicted by using the model obtained by learning along with the change of time.
As shown in fig. 5, the NAO robot 25 joint data were restored to verify the validity of the created motion model. The restored data is highly consistent with the original data, the data stability is enhanced, and the robot can walk more stably by using the learned data. In this case, a joint (e.g., a LEIbowRoll joint) is randomly selected and visualized with the joint data, as can be seen in fig. 5, the fitting of the original joint data to the learned restored joint data is very high, which also demonstrates the effectiveness of the motion model created using the gaussian process dynamics model.
The present invention has been described in detail in order to enable those skilled in the art to understand the invention and to practice it, and it is not intended to limit the scope of the invention, and all equivalent changes and modifications made according to the spirit of the present invention should be covered by the present invention.

Claims (1)

1. A method for controlling a humanoid robot, characterized by: the method comprises the following steps:
step 1: adopting principal component analysis to extract the whole body joint action data characteristics of the humanoid robot,
performing dimensionality reduction on the data;
step 2: establishing an action model of the robot in a low-dimensional space through a Gaussian process dynamic model;
and step 3: controlling the humanoid robot to move through an action model of the robot;
the dimension reduction treatment in the step 1 comprises the following specific steps:
1-1: inputting a data setOutput setWhereinAn observation set representing the pose of the robot in continuous time,representing a low-dimensional data set after principal component analysis dimensionality reduction processing;
1-2: carrying out zero mean normalization processing on Y and calculating a covariance matrix
Calculating the eigenvector U and the eigenvalue lambda according to the eigenvaluesIn the order of (1) toIs recombined intoWherein
1-3: to the output setPerforming a calculation in which
The specific steps of establishing the robot motion model through the Gaussian process dynamic model in the step 1 are as follows:
2-1: input data Y, output parameters
2-2: initializing X using principal component analysis dimension reduction processing, initializingExecuting the dynamic model of the Gaussian process from i =1 to i = Iteration, wherein Iteration is the learning times of the dynamic model of the whole Gaussian process;
2-3: pairAndget the same time to distributeTaking its negative logarithmic probability to obtain
Starting from j =1 to j = D, the formula is executed
After solving W, ending the j loop;
executing the formula starting from M =1 to M = M
To obtainEnding the m-cycle after the optimal solution;
2-4: the i loop is ended.
CN201810774005.8A 2018-07-15 2018-07-15 A kind of control method of humanoid robot Pending CN108656119A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810774005.8A CN108656119A (en) 2018-07-15 2018-07-15 A kind of control method of humanoid robot

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810774005.8A CN108656119A (en) 2018-07-15 2018-07-15 A kind of control method of humanoid robot

Publications (1)

Publication Number Publication Date
CN108656119A true CN108656119A (en) 2018-10-16

Family

ID=63789081

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810774005.8A Pending CN108656119A (en) 2018-07-15 2018-07-15 A kind of control method of humanoid robot

Country Status (1)

Country Link
CN (1) CN108656119A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109732609A (en) * 2019-02-21 2019-05-10 电子科技大学中山学院 Redundant degree of freedom mechanical arm motion planning method and device
CN111571588A (en) * 2020-05-15 2020-08-25 深圳国信泰富科技有限公司 Robot whole-body action planning method and system
CN112605973A (en) * 2020-11-19 2021-04-06 广东省科学院智能制造研究所 Robot motor skill learning method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011032207A1 (en) * 2009-09-15 2011-03-24 The University Of Sydney A method and system for multiple dataset gaussian process modeling
CN102122391A (en) * 2010-12-13 2011-07-13 中国人民解放军国防科学技术大学 Automatic partitioning method for motion capture data
CN106227037A (en) * 2016-07-29 2016-12-14 宁波工程学院 A kind of humanoid robot action planning parameter optimization method based on FPGA

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011032207A1 (en) * 2009-09-15 2011-03-24 The University Of Sydney A method and system for multiple dataset gaussian process modeling
CN102122391A (en) * 2010-12-13 2011-07-13 中国人民解放军国防科学技术大学 Automatic partitioning method for motion capture data
CN106227037A (en) * 2016-07-29 2016-12-14 宁波工程学院 A kind of humanoid robot action planning parameter optimization method based on FPGA

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109732609A (en) * 2019-02-21 2019-05-10 电子科技大学中山学院 Redundant degree of freedom mechanical arm motion planning method and device
CN111571588A (en) * 2020-05-15 2020-08-25 深圳国信泰富科技有限公司 Robot whole-body action planning method and system
CN111571588B (en) * 2020-05-15 2021-05-18 深圳国信泰富科技有限公司 Robot whole-body action planning method and system
CN112605973A (en) * 2020-11-19 2021-04-06 广东省科学院智能制造研究所 Robot motor skill learning method and system
WO2022105635A1 (en) * 2020-11-19 2022-05-27 广东省科学院智能制造研究所 Robot motor-skill learning method and system

Similar Documents

Publication Publication Date Title
Chebotar et al. Path integral guided policy search
Juang et al. Wall-following control of a hexapod robot using a data-driven fuzzy controller learned through differential evolution
Ficuciello et al. Synergy-based policy improvement with path integrals for anthropomorphic hands
CN108656119A (en) A kind of control method of humanoid robot
Park et al. Inverse optimal control for humanoid locomotion
Castillo et al. Reinforcement learning meets hybrid zero dynamics: A case study for rabbit
Singla et al. Realizing learned quadruped locomotion behaviors through kinematic motion primitives
Bischoff et al. Policy search for learning robot control using sparse data
Zhai et al. An intelligent control system for robot massaging with uncertain skin characteristics
Jandaghi et al. Motion dynamics modeling and fault detection of a soft trunk robot
Ma et al. An efficient robot precision assembly skill learning framework based on several demonstrations
Dong et al. Passive bimanual skills learning from demonstration with motion graph attention networks
Chao et al. Learning robotic hand-eye coordination through a developmental constraint driven approach
CN117733870A (en) Mechanical arm target capturing method, device and medium based on track prediction control
Kuo et al. Fuzzy deep deterministic policy gradient-based motion controller for humanoid robot
Koropouli et al. Generalization of Force Control Policies from Demonstrations for Constrained Robotic Motion Tasks: A Regression-Based Approach
Beltran-Hernandez et al. Learning to grasp with primitive shaped object policies
Ting et al. Locally Weighted Regression for Control.
Mosbach et al. Learning generalizable tool use with non-rigid grasp-pose registration
Matsubara et al. Spatiotemporal synchronization of biped walking patterns with multiple external inputs by style–phase adaptation
Jia et al. Learning-based feedback controller for deformable object manipulation
Ariki et al. Latent Kullback-Leibler control for dynamic imitation learning of whole-body behaviors in humanoid robots
Pavlichenko et al. Flexible-joint manipulator trajectory tracking with learned two-stage model employing one-step future prediction
Ruud Reinforcement learning with the TIAGo research robot: manipulator arm control with actor-critic reinforcement learning
Luo et al. Human-agent joint learning for efficient robot manipulation skill acquisition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20181016

WD01 Invention patent application deemed withdrawn after publication