WO2020118730A1 - Compliance control method and apparatus for robot, device, and storage medium - Google Patents

Compliance control method and apparatus for robot, device, and storage medium Download PDF

Info

Publication number
WO2020118730A1
WO2020118730A1 PCT/CN2018/121338 CN2018121338W WO2020118730A1 WO 2020118730 A1 WO2020118730 A1 WO 2020118730A1 CN 2018121338 W CN2018121338 W CN 2018121338W WO 2020118730 A1 WO2020118730 A1 WO 2020118730A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
teaching
motion
variable impedance
movement
Prior art date
Application number
PCT/CN2018/121338
Other languages
French (fr)
Chinese (zh)
Inventor
欧勇盛
段江哗
徐升
王志扬
金少堃
田超然
王煜睿
熊荣
江国来
吴新宇
Original Assignee
中国科学院深圳先进技术研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国科学院深圳先进技术研究院 filed Critical 中国科学院深圳先进技术研究院
Priority to PCT/CN2018/121338 priority Critical patent/WO2020118730A1/en
Publication of WO2020118730A1 publication Critical patent/WO2020118730A1/en

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/42Recording and playback systems, i.e. in which the programme is recorded from a cycle of operations, e.g. the cycle of operations being manually controlled, after which this record is played back on the same machine
    • G05B19/423Teaching successive positions by walk-through, i.e. the tool head or end effector being grasped and guided directly, with or without servo-assistance, to follow a path
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/39Robotics, robotics to robotics hand
    • G05B2219/39319Force control, force as reference, active compliance

Definitions

  • the invention belongs to the field of computer technology, and particularly relates to a robot compliance control method, device, equipment and storage medium.
  • the trajectory of the robotic arm is generally pre-defined by the user, or a certain task environment is preset, and then the robot or the robotic arm can be repeatedly executed according to the plan.
  • the robotic arm operating in this mode cannot face environmental changes or sudden disturbances.
  • this mode also requires more arduous manual programming.
  • the use threshold is high (for example: to be able to program robots). More importantly, this robot control mode does not imply human operation habits, nor is it as flexible as human hands.
  • the robotic arm or robot should have learning capabilities and be more flexible and compliant.
  • the robot "Imitation Learning” (Imitation Learning) or “Teaching Learning” (Programming by Demonstration) is an important method to solve this problem.
  • the compliant behavior of a robot includes two aspects of action and force, so the learning of compliant behavior also includes two aspects of action learning and force learning.
  • the existing robot compliance control method independently models and learns the motion trajectory and force, and the learning effect is not good, which leads to inaccurate control results; based on Gaussian mixture model, Gaussian process and other offline regression methods to For imitation learning, the training time required is relatively long, and the training efficiency is low; the stability of the control cannot be guaranteed, and there may be situations where the robot interaction force is too large and hurts people.
  • the object of the present invention is to provide a robot compliance control method, device, equipment and storage medium, aiming to solve the problems of inaccurate control results and poor control effects caused by poor compliance of the existing robot compliance control methods.
  • the present invention provides a robot compliance control method, which includes the following steps:
  • the teaching data includes at least the movement data and interaction force data of the teaching movement
  • variable impedance Calculating the motion equation of the teaching motion based on the motion data in the teaching data, and simultaneously calculating the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, wherein the variable impedance
  • the parameters include at least variable stiffness parameters and variable damping parameters
  • the operation is controlled according to the equation of motion and the variable impedance parameter.
  • the present invention provides a robot compliance control device, the device including:
  • a data acquisition unit for acquiring teaching data of the teaching movement, wherein the teaching data includes at least the movement data and the interaction force data of the teaching movement;
  • a parameter calculation unit configured to calculate the motion equation of the teaching motion based on the motion data in the teaching data, and at the same time calculate the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data,
  • the variable impedance parameter includes at least a variable stiffness parameter and a variable damping parameter;
  • the operation control unit is used for controlling operation according to the motion equation and the variable impedance parameter.
  • the present invention also provides a computing device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, which is implemented when the processor executes the computer program The steps of the robot compliance control method as described.
  • the present invention also provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, implements the steps of the robot compliance control method.
  • the invention obtains the teaching data of the teaching movement, calculates the motion equation of the teaching movement based on the movement data in the teaching data, and simultaneously calculates the teaching according to the interaction force data in the teaching data
  • the variable impedance parameter of the movement controls the operation according to the motion equation and the variable impedance parameter, thereby reducing manual programming during the robot compliance control process, lowering the threshold for robot use, and improving the flexibility and accuracy of robot control , And further improve the robot's generalization ability, intelligence and control effect.
  • Embodiment 1 is an implementation flowchart of a robot compliance control method provided in Embodiment 1 of the present invention
  • FIG. 2 is a schematic structural diagram of a robot compliance control device according to Embodiment 2 of the present invention.
  • FIG. 3 is a schematic structural diagram of a robot compliance control device according to Embodiment 3 of the present invention.
  • FIG. 4 is a schematic structural diagram of a computing device according to Embodiment 4 of the present invention.
  • FIG. 5 is a schematic structural diagram of a robot compliance control device according to Embodiment 2 of the present invention.
  • FIG. 6 is a schematic structural diagram of a computing device according to Embodiment 3 of the present invention.
  • FIG. 1 shows the implementation flow of the robot compliance control method provided in Embodiment 1 of the present invention. For convenience of description, only the parts related to the embodiment of the present invention are shown. The details are as follows:
  • step S101 the teaching data of the teaching movement is acquired.
  • the embodiments of the present invention are suitable for automatic control of robots.
  • Robots include a series of robot products that are not limited to robotic arms, humanoid robots, etc. with joints, links, and other structures, and can achieve telescopic and grasping actions.
  • the teaching data may include at least motion data and interaction force data of the teaching movement. Therefore, learning of the teaching movement may include action learning and force learning (ie, variable stiffness parameter and variable damping parameter learning).
  • the motion data may include position data and speed data of a preset point of the robot (eg, end effector), or the motion data may include angle and angle of a preset angle of the robot (eg, joint angle) Acceleration.
  • the motion data may also include one or more other parameters that can be used to completely describe the teaching motion, which is not limited by the present invention.
  • FIG. 2 shows a diagram of teaching a robot.
  • the teacher grasps the end effector of the robot with one hand and moves in a plane or space. Make a trajectory, and the other hand exerts a teaching force at the end.
  • the robot collects teaching data through its own motion capture system and a six-dimensional force sensor mounted on the wrist.
  • the motion data includes position data and velocity data of a preset sampling point of the robot (eg, end effector or end, etc.)
  • the position data and interaction force data of the end are sampled at time intervals to obtain a series of samples
  • the teacher controls the robot through the remote control or the teach pendant to perform the teaching operation, or teaches by hand.
  • the robot records the teaching data according to the teaching operation.
  • the instructor personally completes the teaching movement task.
  • the teaching data is collected by the robot's motion catcher, data glove, and force sensor according to the teaching movement.
  • the motion data includes position data and speed data of a preset point of the robot (for example, an end effector)
  • the position data related to the teaching motion may be obtained first, Interaction force data and time data, and then calculate the speed data related to the teaching movement based on the position data and the time data, thereby obtaining the movement data of the teaching movement.
  • the motion data includes the angle and angular acceleration of the preset angle of the robot (for example, the joint angle)
  • the angle data and interaction force related to the teaching motion may be obtained first Data and time data, and then calculate the angular acceleration data related to the teaching movement based on the angle data and the time data, thereby obtaining the movement data of the teaching movement.
  • step S102 the motion equation of the teaching motion is calculated based on the motion data in the teaching data, and at the same time, the variable impedance parameter of the teaching motion is calculated based on the interaction force data in the teaching data.
  • variable impedance parameter may include at least a variable stiffness parameter and a variable damping parameter.
  • trajectory and force are learned, so as to improve the learning effect and thus the accuracy of the control results.
  • the preset neural network model can be trained using the motion data to obtain the motion equation of the teaching motion, and the neural network can be trained according to the motion equation
  • the model is updated online, thereby improving the calculation efficiency of the motion equation, facilitating the subsequent use of the motion equation, and adapting to the needs of real-time online learning, thereby improving the learning effect.
  • the motion data when using the motion data to train the preset neural network model, can be incrementally learned one by one or block by block to obtain the motion equation of the teaching motion, thereby improving the accuracy of the motion equation Sex, thereby improving learning effectiveness.
  • the neural network model can be a support vector machine (Support Vector Machine, SVM), online sequence over-limit learning machine and other models that can be incrementally online learning, or other incremental online learning models, such as incremental support vector machine (ISVM), etc., the present invention does not limit this.
  • SVM Support Vector Machine
  • the online sequence over-limit learning machine has the characteristics of fast learning speed, strong generalization ability, and simple implementation. Therefore, preferably, the neural network model is an online sequence over-limit learning machine, that is, use The motion data trains the online sequence overrun learning machine, thereby improving the training efficiency.
  • the input and output are the position and speed (or the angle and angular acceleration of the joint angle) of the sampling point (for example, the robot end effector), so the online
  • FIG. 3 shows an exemplary structure of an online sequence overrun learning machine.
  • the activation function of the hidden layer of the online sequence overrun learning machine is g
  • the online sequence overrun learning we want to learn
  • the machine ie, the model to be learned
  • the number of hidden layer neurons is For the offset of the hidden layer, Is the weight of the hidden layer, the dimension is Is the weight of the output layer, the dimension is
  • W and b are randomly generated and fixed.
  • the training process only needs to determine the weight of the output layer. Optimization process to achieve.
  • the activation function g generally selects the sigmoid function (sigmoid function) or the hyperbolic tangent function (tanh function), the modified sigmoid function can also be used, for example, However, as long as it is satisfied And the monotonically increasing continuous and continuously differentiable functions all meet the requirements of the activation function, and are not limited here.
  • the training goal of the online sequence overrun learning machine is to find a set of optimal output layer weights
  • H + is the Moore-Penrose generalized inverse matrix of the matrix H.
  • the output layer weights can be obtained without iteration.
  • the constraints are added, the problem of solving the output layer weights becomes a constrained optimization problem.
  • the training process of the online sequence overrun learning machine includes an initial ELM batch learning process and a continuous sequential learning process, as follows:
  • N 1 is the newly arrived data, by The calculated initial output weight is among them, Whenever a new training sample is obtained When Recursively calculate output weights. among them,
  • variable stiffness parameter and the variable damping parameter may be calculated based on the interaction force data.
  • variable stiffness parameter when calculating the variable stiffness parameter, let Represents the collected interaction force (F) and corresponding time (q) information, where is the number of disturbance data samples obtained.
  • the variable stiffness parameter at time q is calculated from the force information in the time window [q-(w-1), q].
  • the length of the sliding time window is w, and the upper and lower bounds of the data points in the window are represented by L q and U q ,
  • the stiffness matrix K q is among them, And eigenvalues Proportional to the expression
  • the interactive force data will be continuously collected, and the new data will be sorted according to the time information and the values in the window will be taken to solve the stiffness. For example, when the data at time q+1 enters, the online update of the covariance is among them,
  • variable damping parameter B since the damping ratio is constant, the square root of the damping and the stiffness is linear, so it can be based on the formula To calculate the variable damping parameter B. Among them, ⁇ is a constant greater than 0.
  • the preset stability constraints and the interaction force data can be used to predict
  • the variable impedance model is trained to obtain the variable impedance parameters of the teaching movement, and the variable impedance model is updated according to the variable impedance parameters, so as to ensure the stability of the variable impedance control and avoid excessive robot interaction force that may cause injury. happening.
  • step S103 the operation is controlled according to the equation of motion and the variable impedance parameter.
  • the operation can be controlled according to the motion equation and the variable impedance parameter, thereby controlling the robot to reproduce the movement trajectory and interactive force of the teaching movement.
  • the trained neural network model for example, online sequence overrun learning machine
  • variable impedance models to control the trajectory and interaction force of the robot to reproduce the teaching movement.
  • FIG. 4 shows an exemplary diagram of teaching learning and reproduction of robot compliance control.
  • the instructor grasps the robot with one hand for teaching, and the robot collects Track information And force information F q , then according to the trajectory information Perform motion learning to obtain f( ⁇ ), and learn variable stiffness parameters and variable damping parameters according to the force information F q to get ⁇ B q ,K q ⁇ , and finally generate motion according to f( ⁇ ) and according to ⁇ B q ,K q ⁇ Variable impedance control to control the trajectory and interaction force of the robot to reproduce the teaching movement.
  • the motion equation of the teaching movement is calculated according to the movement data in the teaching data, and at the same time, the variation of the teaching movement is calculated according to the interaction force data in the teaching data Impedance parameters, according to the motion equation and variable impedance parameter control operation, thereby reducing the manual programming in the robot compliance control process, lowering the threshold for robot use, improving the flexibility and accuracy of robot control, and thus improving the robot's universal Ability, degree of intelligence and control effect.
  • FIG. 5 shows the structure of the robot compliance control device provided in Embodiment 2 of the present invention. For ease of explanation, only parts related to the embodiment of the present invention are shown, including: a data acquisition unit 51, a parameter calculation unit 52 and Operation control unit 53.
  • the data acquiring unit 51 is configured to acquire teaching data of the teaching movement, wherein the teaching data includes at least the movement data and the interaction force data of the teaching movement.
  • the learning of the teaching movement may include action learning and force learning (ie, variable stiffness parameter and variable damping parameter learning).
  • the motion data may include position data and speed data of a preset point of the robot (eg, end effector), or the motion data may include angle and angle of a preset angle of the robot (eg, joint angle) Acceleration.
  • the motion data may also include one or more other parameters that can be used to completely describe the teaching motion, which is not limited by the present invention.
  • the data acquisition unit 51 may include:
  • the first acquiring unit is used to acquire position data, interaction force data and time data related to the teaching movement
  • the first calculation unit is used to calculate the motion data according to the position data and the time data.
  • the speed data related to the teaching movement is calculated based on the position data and the time data, thereby obtaining the movement data of the teaching movement.
  • the data acquisition unit 51 may further include:
  • the second acquisition unit is used to acquire angle data, interaction force data and time data related to the teaching movement.
  • the second calculation unit is used to calculate the motion data according to the angle data and the time data.
  • the angular acceleration data related to the teaching movement may be calculated according to the angle data and the time data, thereby obtaining the movement data of the teaching movement.
  • the parameter calculation unit 52 is configured to calculate the motion equation of the teaching motion based on the motion data in the teaching data, and at the same time calculate the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, where the variable impedance parameter includes at least Variable stiffness parameters and variable damping parameters.
  • the trajectory and the force are simultaneously learned, thereby improving the learning effect, and thereby improving the accuracy of the control result.
  • the parameter calculation unit 52 may include:
  • the first training unit is used to train the preset neural network model using motion data to obtain the motion equation of the teaching movement, and update the neural network model online according to the motion equation, thereby improving the calculation efficiency of the motion equation and facilitating movement
  • the model training unit may include:
  • the incremental learning unit is used to incrementally learn the motion data in a one-by-one or block-by-block manner to obtain the motion equation of the teaching motion, thereby improving the accuracy of the motion equation and thereby improving the learning effect.
  • the neural network model is an online sequence overrun learning machine.
  • the parameter calculation unit 52 may include:
  • the second training unit is used to train the preset variable impedance model according to the preset stability constraints and interaction force data to obtain the variable impedance parameter of the teaching movement, and update the variable impedance model according to the variable impedance parameter, Therefore, the stability of the variable impedance control is ensured, and the situation that the interaction force of the robot is too large to cause injury is avoided.
  • the operation control unit 53 is used to control the operation according to the equation of motion and the variable impedance parameter.
  • the teaching data of the teaching movement is acquired by the data acquiring unit 51, the motion equation of the teaching movement is calculated according to the movement data in the teaching data by the parameter calculating unit 52, and at the same time according to the teaching data
  • the interactive force data calculates the variable impedance parameters of the teaching movement, and the operation is controlled by the operation control unit 53 according to the motion equation and the variable impedance parameters, thereby reducing the manual programming in the robot compliance control process, lowering the robot's use threshold, and improving the robot The flexibility and accuracy of the control, thereby improving the robot's generalization ability, intelligence and control effect.
  • each unit of the robot compliance control device may be implemented by a corresponding hardware or software unit, and each unit may be an independent software and hardware unit, or may be integrated into one software and hardware unit, which is not limited here. this invention.
  • FIG. 6 shows the structure of the computing device provided in Embodiment 4 of the present invention. For ease of description, only parts related to the embodiment of the present invention are shown.
  • the computing device 6 of the embodiment of the present invention includes a processor 60, a memory 61, and a computer program 62 stored in the memory 61 and executable on the processor 60.
  • the processor 60 executes the computer program 62
  • the steps in the above embodiments of the robot compliance control method are implemented, for example, steps S101 to S103 shown in FIG. 1.
  • the processor 60 executes the computer program 62
  • the functions of the units in the above device embodiments are realized, for example, the functions of the units 51 to 53 shown in FIG.
  • the teaching data of the teaching motion is acquired, and the teaching data is calculated according to the motion data in the teaching data Teaching the motion equation of motion, and at the same time calculating the variable impedance parameters of the teaching motion based on the interactive force data in the teaching data, and controlling the operation according to the motion equations and variable impedance parameters, thereby reducing the manual programming and reducing the robot's compliance control process.
  • the use threshold of the robot is improved, and the flexibility and accuracy of the robot control are improved, thereby improving the robot's generalization ability, intelligence, and control effect.
  • a computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps in the embodiments of the foregoing robot compliance control methods are implemented. For example, steps S101 to S103 shown in FIG. 1.
  • the functions of the units in the foregoing device embodiments are realized, for example, the functions of the units 51 to 53 shown in FIG. 5.
  • the teaching data of the teaching motion is acquired, the motion equation of the teaching motion is calculated according to the motion data in the teaching data, and the variable impedance of the teaching motion is calculated according to the interaction force data in the teaching data
  • the parameters control the operation according to the equations of motion and variable impedance parameters, thereby reducing manual programming during robot compliance control, lowering the threshold for robot use, improving the flexibility and accuracy of robot control, and thus improving the generalization of the robot Ability, degree of intelligence and control effect.
  • the robot compliance control method implemented when the computer program is executed by the processor may further refer to the description of the steps in the foregoing method embodiments, and details are not described herein again.
  • the computer-readable storage medium in the embodiments of the present invention may include any entity or device capable of carrying computer program code, and a recording medium, such as ROM/RAM, magnetic disk, optical disk, flash memory, and other memories.

Landscapes

  • Engineering & Computer Science (AREA)
  • Robotics (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Manipulator (AREA)

Abstract

A compliance control method for a robot. The method comprises: acquiring demonstration data of a demonstration motion; calculating a motion equation of the demonstration motion according to motion data in the demonstration data, and calculating variable impedance parameters of the demonstration motion according to interaction force data in the demonstration data at the same time; and controlling the operation according to the motion equation and variable impedance parameters, so that the manual programming during robot compliance control is omitted, the difficulty for using robots is lowered, and the compliance and accuracy of the robot control are improved, thereby improving the generalization ability, intelligence, and control effect of robots. Also related is a compliance control apparatus for a robot, a device, and a storage medium.

Description

机器人柔顺性控制方法、装置、设备及存储介质Robot compliance control method, device, equipment and storage medium 技术领域Technical field
本发明属于计算机技术领域,尤其涉及一种机器人柔顺性控制方法、装置、设备及存储介质。The invention belongs to the field of computer technology, and particularly relates to a robot compliance control method, device, equipment and storage medium.
背景技术Background technique
在现阶段机器人的应用中,尤其是工业应用中,机械臂的运动轨迹一般是通过用户预先定义的,或者预先设定某种任务环境,然后让机器人或机械臂按照计划重复执行即可。这种模式运行的机械臂无法面对环境的变化,或者突如其来的扰动。对于复杂场景下或较困难任务的实现,这种模式也需要较为繁重的人工编程。对普通工人来讲,使用门槛要求高(例如:要会机器人编程)。更重要的是,这种机器人控制模式没有隐含人的操作习惯更没有像人手那样的具有柔顺性。为了有效降低机器人的使用门槛、更好地实现人机协同交互,机械臂或机器人应该具有学习能力,并更加灵活和柔顺的特性。机器人“模仿学习”(Imitation Learning)或者“示教学习”(Programming by Demonstration)便是解决这一问题的重要方法。In the application of robots at this stage, especially in industrial applications, the trajectory of the robotic arm is generally pre-defined by the user, or a certain task environment is preset, and then the robot or the robotic arm can be repeatedly executed according to the plan. The robotic arm operating in this mode cannot face environmental changes or sudden disturbances. For the realization of complex scenarios or more difficult tasks, this mode also requires more arduous manual programming. For ordinary workers, the use threshold is high (for example: to be able to program robots). More importantly, this robot control mode does not imply human operation habits, nor is it as flexible as human hands. In order to effectively lower the threshold for the use of robots and better achieve human-machine collaborative interaction, the robotic arm or robot should have learning capabilities and be more flexible and compliant. The robot "Imitation Learning" (Imitation Learning) or "Teaching Learning" (Programming by Demonstration) is an important method to solve this problem.
通常机器人的柔顺性行为包含动作和力两个方面,因此柔顺性行为的学习也是包括动作学习和力学习两个方面的。Generally, the compliant behavior of a robot includes two aspects of action and force, so the learning of compliant behavior also includes two aspects of action learning and force learning.
在机器人柔顺性控制领域,之前的研究工作主要集中在控制器的人为设计领域(例如:力位混合控制,阻抗控制,碰撞检测反馈控制器等)以及被动柔顺机构设计。上述的柔顺性控制器设计方法具有复杂的调参过程,且不具有泛化能力不能适应新的情况。机器人通过学习人类柔顺性行为而获得柔顺控制策略的研究能简化复杂的调参过程和减低机器人的使用门槛(工人只需提供正确的人类示教即可让机器人具有相应的柔顺性行为,而不需要使用者具有编程和 机器人控制的相关技术基础)。In the field of robot compliance control, previous research work has focused on the artificial design of controllers (such as force-position hybrid control, impedance control, collision detection feedback controller, etc.) and passive compliance mechanism design. The above-mentioned design method of the compliant controller has a complicated parameter adjustment process, and does not have the generalization ability to adapt to the new situation. The study of robots to obtain compliance control strategies by learning human compliance behavior can simplify the complex parameter adjustment process and lower the threshold for robot use (workers only need to provide correct human teaching to allow robots to have corresponding compliance behavior, without Users need to have the relevant technical foundation of programming and robot control).
机器人通过学习人类柔顺性行为而获得柔顺控制策略的研究属于前沿领域,示教学习控制中大都将运动轨迹学习和力的学习独立研究。例如,Seyed Mohammad Khansari-Zadeh提出一种学习运动轨迹的方法(发表于2011年IEEE Transactions on Robotics上的文章《Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models》)。在该方法的最初提出时,动态系统是通过高斯混合模型(Gaussian Mixture Models)来建模的,并且基于李雅普诺夫稳定性的约束也被推导出来用于保证运动收敛到目标。在接下来几年的发展中也出现了其他一些轨迹动作的模仿学习方法,但使用动态系统建模、使用李雅普诺夫稳定性进行约束这两大特征,基本上是各种方法的共同特征。Calinon提出了一种根据示教位置扰动的协方差推导出不同交互力的学习方法,但是这种方法示教怪异且不利于与轨迹一起学习。The study of robots' compliance control strategies obtained by learning human compliance behavior belongs to the frontier field. Most of the teaching learning control will independently study motion trajectory learning and force learning. For example, Seyed Mohammad Khansari-Zadeh proposed a method to learn the trajectory of movement (the article "Learning Stable Nonlinear Dynamics Systems With Gaussian Mixture Models" published in 2011 IEEE Transactions on Robotics). When this method was first proposed, the dynamic system was modeled by Gaussian Mixture Models, and constraints based on Lyapunov stability were also derived to ensure that the motion converged to the target. Other imitation learning methods of trajectory movements have emerged in the development in the following years, but the use of dynamic system modeling and the use of Lyapunov stability to constrain these two characteristics are basically the common characteristics of various methods. Calinon proposed a learning method for deriving different interaction forces based on the covariance of the perturbation of the teaching position, but this method is weird in teaching and is not conducive to learning together with the trajectory.
从现有的资料来看,将运动轨迹和力看做柔顺性行为两个组成部分,并将两者用于机器人柔顺性行为学习控制的相关成熟方案甚少。2017年发表于Autonomous Robots的文章《Learning potential functions from human demonstrations with encapsulated dynamic and compliant behaviors》提出了基于势函数和耗散场的联合变阻抗控制策略,该方法的需要人为通过先验知识设计多组基于任务的参数,这种方法具有强构造性,且只能离线训练无法,效率低。From the existing data, the motion trajectory and force are regarded as two components of compliant behavior, and there are very few mature schemes that use the two for robot learning and control of compliant behavior. An article published in 2017 by Autonomous Robots, "Learning potentials from human demonstrations with encapsulated dynamics and compliant behaviors," proposes a joint variable impedance control strategy based on potential functions and dissipative fields. This method requires artificially designing multiple groups through prior knowledge. Based on the parameters of the task, this method is strongly constructed, and can only be trained offline, which is inefficient.
已申请或已授权的专利中,也有一些与所述领域有关。在名称为“一种基于高斯过程的机器人模仿”的专利文件中,公开了一种基于高斯过程的机器人模仿学习方法。高斯过程也是一种回归算法,与高斯混合模型类似,该方案使用高斯过程对机器人运动进行建模学习。在名称为“一种基于轨迹模仿的机器人汉字书写学习方法”的专利文件中,公开了一种将基于轨迹匹配的模仿学习引入到机器人书写技能的学习中,将汉字的比划进行分割,并通过多个高斯混合模型对示教数据进行编码学习和重构的方法。在名称为“具有模仿学习机制的手把手示教机械臂系统及方法”的专利文件中,公开了一种带有模仿学习功 能的机械臂系统,并给出了基于前馈神经网络的模仿学习建模方法。在名称为“一种机器人力控示教模仿学习的装置及方法”的专利文件中,公开了一种在示教数据中引入了力反馈信息,并使用隐马尔科夫模型对示教数据进行建模编码的方法。Some of the patents that have been applied for or granted are also related to the mentioned fields. In the patent document titled "A Robotic Imitation Based on Gaussian Process", a robotic imitation learning method based on Gaussian process is disclosed. The Gaussian process is also a regression algorithm, similar to the Gaussian mixture model. This scheme uses the Gaussian process to model and learn the robot motion. In the patent document entitled "A Robot Chinese Character Writing Learning Method Based on Trajectory Imitation", a method of imitation learning based on trajectory matching is introduced into the learning of robot writing skills, and the strokes of Chinese characters are divided and passed A method of coding learning and reconstruction of teaching data by multiple Gaussian mixture models. In the patent document entitled "Hand-teaching robotic arm system and method with imitation learning mechanism", a robotic arm system with imitation learning function is disclosed, and imitation learning based on feedforward neural network is given.模方法。 Modal method. In the patent document titled "A device and method for robotic force-control teaching imitation learning", it is disclosed that force feedback information is introduced into the teaching data, and the hidden Markov model is used to perform the teaching data. Modeling coding method.
综上所述,现有的机器人柔顺性控制方法,对运动轨迹和力进行独立建模学习,学习效果不佳,进而导致控制结果不精确;基于高斯混合模型、高斯过程等离线的回归方法来进行模仿学习,需要的训练时间比较长,训练效率较低;控制的稳定性无法保证,可能出现机器人交互力过大而出现伤人的情况。In summary, the existing robot compliance control method independently models and learns the motion trajectory and force, and the learning effect is not good, which leads to inaccurate control results; based on Gaussian mixture model, Gaussian process and other offline regression methods to For imitation learning, the training time required is relatively long, and the training efficiency is low; the stability of the control cannot be guaranteed, and there may be situations where the robot interaction force is too large and hurts people.
发明内容Summary of the invention
本发明的目的在于提供一种机器人柔顺性控制方法、装置、设备及存储介质,旨在解决现有的机器人柔顺性控制方法的控制结果不精确、柔顺性较差导致的控制效果不佳问题。The object of the present invention is to provide a robot compliance control method, device, equipment and storage medium, aiming to solve the problems of inaccurate control results and poor control effects caused by poor compliance of the existing robot compliance control methods.
一方面,本发明提供了一种机器人柔顺性控制方法,所述方法包括下述步骤:In one aspect, the present invention provides a robot compliance control method, which includes the following steps:
获取示教运动的示教数据,其中,所述示教数据至少包括所述示教运动的运动数据和交互力数据;Acquiring teaching data of a teaching movement, wherein the teaching data includes at least the movement data and interaction force data of the teaching movement;
根据所述示教数据中的运动数据计算所述示教运动的运动方程,并且同时根据所述示教数据中的交互力数据计算所述示教运动的变阻抗参数,其中,所述变阻抗参数至少包括变刚度参数和变阻尼参数;Calculating the motion equation of the teaching motion based on the motion data in the teaching data, and simultaneously calculating the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, wherein the variable impedance The parameters include at least variable stiffness parameters and variable damping parameters;
根据所述运动方程和所述变阻抗参数控制操作。The operation is controlled according to the equation of motion and the variable impedance parameter.
另一方面,本发明提供了一种机器人柔顺性控制装置,所述装置包括:In another aspect, the present invention provides a robot compliance control device, the device including:
数据获取单元,用于获取示教运动的示教数据,其中,所述示教数据至少包括所述示教运动的运动数据和交互力数据;A data acquisition unit for acquiring teaching data of the teaching movement, wherein the teaching data includes at least the movement data and the interaction force data of the teaching movement;
参数计算单元,用于根据所述示教数据中的运动数据计算所述示教运动的运动方程,并且同时根据所述示教数据中的交互力数据计算所述示教运动的变 阻抗参数,其中,所述变阻抗参数至少包括变刚度参数和变阻尼参数;以及A parameter calculation unit, configured to calculate the motion equation of the teaching motion based on the motion data in the teaching data, and at the same time calculate the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, Wherein, the variable impedance parameter includes at least a variable stiffness parameter and a variable damping parameter; and
操作控制单元,用于根据所述运动方程和所述变阻抗参数控制操作。The operation control unit is used for controlling operation according to the motion equation and the variable impedance parameter.
另一方面,本发明还提供了一种计算设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如所述机器人柔顺性控制方法的步骤。On the other hand, the present invention also provides a computing device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, which is implemented when the processor executes the computer program The steps of the robot compliance control method as described.
另一方面,本发明还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如所述机器人柔顺性控制方法的步骤。On the other hand, the present invention also provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, implements the steps of the robot compliance control method.
本发明通过获取示教运动的示教数据,根据所述示教数据中的运动数据计算所述示教运动的运动方程,并且同时根据所述示教数据中的交互力数据计算所述示教运动的变阻抗参数,根据所述运动方程和所述变阻抗参数控制操作,从而减少了机器人柔顺性控制过程中的手动编程,降低了机器人的使用门槛,提高了机器人控制的柔顺性和精确性,进而提高了机器人的泛化能力、智能化程度和控制效果。The invention obtains the teaching data of the teaching movement, calculates the motion equation of the teaching movement based on the movement data in the teaching data, and simultaneously calculates the teaching according to the interaction force data in the teaching data The variable impedance parameter of the movement controls the operation according to the motion equation and the variable impedance parameter, thereby reducing manual programming during the robot compliance control process, lowering the threshold for robot use, and improving the flexibility and accuracy of robot control , And further improve the robot's generalization ability, intelligence and control effect.
附图说明BRIEF DESCRIPTION
图1是本发明实施例一提供的机器人柔顺性控制方法的实现流程图;1 is an implementation flowchart of a robot compliance control method provided in Embodiment 1 of the present invention;
图2是本发明实施例二提供的机器人柔顺性控制装置的结构示意图;2 is a schematic structural diagram of a robot compliance control device according to Embodiment 2 of the present invention;
图3是本发明实施例三提供的机器人柔顺性控制装置的结构示意图;以及3 is a schematic structural diagram of a robot compliance control device according to Embodiment 3 of the present invention; and
图4是本发明实施例四提供的计算设备的结构示意图。4 is a schematic structural diagram of a computing device according to Embodiment 4 of the present invention.
[根据细则91更正 01.01.2019] 
图5是本发明实施例二提供的机器人柔顺性控制装置的结构示意图;以及
[Correction based on Rule 91 01.01.2019]
5 is a schematic structural diagram of a robot compliance control device according to Embodiment 2 of the present invention; and
[根据细则91更正 01.01.2019] 
图6是本发明实施例三提供的计算设备的结构示意图。
[Correction based on Rule 91 01.01.2019]
6 is a schematic structural diagram of a computing device according to Embodiment 3 of the present invention.
具体实施方式detailed description
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, and are not intended to limit the present invention.
以下结合具体实施例对本发明的具体实现进行详细描述:The following describes the specific implementation of the present invention in detail with reference to specific embodiments:
实施例一:Example one:
图1示出了本发明实施例一提供的机器人柔顺性控制方法的实现流程,为了便于说明,仅示出了与本发明实施例相关的部分,详述如下:FIG. 1 shows the implementation flow of the robot compliance control method provided in Embodiment 1 of the present invention. For convenience of description, only the parts related to the embodiment of the present invention are shown. The details are as follows:
在步骤S101中,获取示教运动的示教数据。In step S101, the teaching data of the teaching movement is acquired.
本发明实施例适用于对机器人的自动控制。机器人包括并不局限于机械臂、人形机器人等一系列带有关节、连杆等结构,并可实现伸缩、抓取等动作的机器人产品。其中,示教数据至少可包括示教运动的运动数据和交互力数据,因此,对示教运动的学习可包括动作学习和力学习(即,变刚度参数和变阻尼参数学习)。The embodiments of the present invention are suitable for automatic control of robots. Robots include a series of robot products that are not limited to robotic arms, humanoid robots, etc. with joints, links, and other structures, and can achieve telescopic and grasping actions. Among them, the teaching data may include at least motion data and interaction force data of the teaching movement. Therefore, learning of the teaching movement may include action learning and force learning (ie, variable stiffness parameter and variable damping parameter learning).
在本发明实施例中,运动数据可包括机器人的预设点(例如,末端执行器)的位置数据和速度数据,或者运动数据可包括机器人的预设角(例如,关节角)的角度和角加速度,另外,运动数据还可包括其他可用于完整描述示教运动的一个或多个参数,本发明对此不进行限制。In the embodiment of the present invention, the motion data may include position data and speed data of a preset point of the robot (eg, end effector), or the motion data may include angle and angle of a preset angle of the robot (eg, joint angle) Acceleration. In addition, the motion data may also include one or more other parameters that can be used to completely describe the teaching motion, which is not limited by the present invention.
作为示例,图2示出一种对机器人进行示教的示图,如图2所示,在示教时,示教者一只手抓握住机器人的末端执行器,在平面或空间中运动出一条轨迹,另一个手在末端施加示教力。机器人通过自带的运动捕捉系统和腕部装的六维力传感器采集示教数据。As an example, FIG. 2 shows a diagram of teaching a robot. As shown in FIG. 2, during teaching, the teacher grasps the end effector of the robot with one hand and moves in a plane or space. Make a trajectory, and the other hand exerts a teaching force at the end. The robot collects teaching data through its own motion capture system and a six-dimensional force sensor mounted on the wrist.
例如,当运动数据包括机器人的预设采样点(例如,末端执行器或者末端等)的位置数据和速度数据时,按照时间间隔对末端的位置数据和交互力数据进行采样,从而获取一系列采样点数据
Figure PCTCN2018121338-appb-000001
其中i=1,...,N traj,N traj表示示教运动轨迹的数量,k=1,...,N i,N i表示示教中采样点个数(每隔一个时间间隔采一次样),
Figure PCTCN2018121338-appb-000002
即表示第i条轨迹的第k个采样点的末端位置,F k第k个采样点的示教力大小。
For example, when the motion data includes position data and velocity data of a preset sampling point of the robot (eg, end effector or end, etc.), the position data and interaction force data of the end are sampled at time intervals to obtain a series of samples Point data
Figure PCTCN2018121338-appb-000001
Where i=1,...,N traj ,N traj represents the number of teaching trajectories, k=1,...,N i ,N i represents the number of sampling points in the teaching (taken every other time interval Once),
Figure PCTCN2018121338-appb-000002
That is, the end position of the k-th sampling point of the i-th trajectory, and the teaching force of the k-th sampling point of F k .
作为另一示例,在示教时,示教者通过遥控器或示教器控制机器人进行示教操作,或者手把手示教。机器人根据示教操作记录示教数据。As another example, during teaching, the teacher controls the robot through the remote control or the teach pendant to perform the teaching operation, or teaches by hand. The robot records the teaching data according to the teaching operation.
作为又一示例,在示教时,示教者亲自完成示教运动任务。由机器人的运 动捕捉器或数据手套以及力传感器等设备根据示教运动采集示教数据。As yet another example, during teaching, the instructor personally completes the teaching movement task. The teaching data is collected by the robot's motion catcher, data glove, and force sensor according to the teaching movement.
优选地,如果运动数据包括机器人的预设点(例如,末端执行器)的位置数据和速度数据,则在获取示教运动的示教数据时,可首先获取与示教运动相关的位置数据、交互力数据和时间数据,然后根据位置数据和时间数据计算与示教运动相关的速度数据,从而得到示教运动的运动数据。Preferably, if the motion data includes position data and speed data of a preset point of the robot (for example, an end effector), when acquiring the teaching data of the teaching motion, the position data related to the teaching motion may be obtained first, Interaction force data and time data, and then calculate the speed data related to the teaching movement based on the position data and the time data, thereby obtaining the movement data of the teaching movement.
优选地,如果运动数据包括机器人的预设角(例如,关节角)的角度和角加速度,则在获取示教运动的示教数据时,可首先获取与示教运动相关的角度数据、交互力数据和时间数据,然后根据角度数据和时间数据计算与示教运动相关的角加速度数据,从而得到示教运动的运动数据。Preferably, if the motion data includes the angle and angular acceleration of the preset angle of the robot (for example, the joint angle), when acquiring the teaching data of the teaching motion, the angle data and interaction force related to the teaching motion may be obtained first Data and time data, and then calculate the angular acceleration data related to the teaching movement based on the angle data and the time data, thereby obtaining the movement data of the teaching movement.
在步骤S102中,根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数。In step S102, the motion equation of the teaching motion is calculated based on the motion data in the teaching data, and at the same time, the variable impedance parameter of the teaching motion is calculated based on the interaction force data in the teaching data.
在本发明实施例中,变阻抗参数至少可包括变刚度参数和变阻尼参数。同时对运动轨迹和力进行学习,从而提高学习效果,进而提高控制结果的精确性。In the embodiment of the present invention, the variable impedance parameter may include at least a variable stiffness parameter and a variable damping parameter. At the same time, the trajectory and force are learned, so as to improve the learning effect and thus the accuracy of the control results.
优选地,在根据示教数据中的运动数据计算示教运动的运动方程时,可使用运动数据对预设的神经网络模型进行训练,得到示教运动的运动方程,并根据运动方程对神经网络模型进行在线更新,从而提高运动方程的计算效率,方便运动方程的后续使用,并且适应实时在线学习的需要,进而提高学习效果。Preferably, when calculating the motion equation of the teaching motion based on the motion data in the teaching data, the preset neural network model can be trained using the motion data to obtain the motion equation of the teaching motion, and the neural network can be trained according to the motion equation The model is updated online, thereby improving the calculation efficiency of the motion equation, facilitating the subsequent use of the motion equation, and adapting to the needs of real-time online learning, thereby improving the learning effect.
其中,优选地,在使用运动数据对预设的神经网络模型进行训练时,可以以逐一或逐块的方式对运动数据进行增量学习,得到示教运动的运动方程,从而提高运动方程的准确性,进而提高学习效果。Among them, preferably, when using the motion data to train the preset neural network model, the motion data can be incrementally learned one by one or block by block to obtain the motion equation of the teaching motion, thereby improving the accuracy of the motion equation Sex, thereby improving learning effectiveness.
神经网络模型可以是支持向量机(Support Vector Machine,SVM)、在线序列超限学习机等能够增量在线学习的模型,也可以是其他的增量在线学习的模型,如增量支持向量机(ISVM)等,本发明对此不进行限定。其中,由于与其他在线学习算法相比,在线序列超限学习机具有学习速度快、泛化能力强、实现简单等特点,因此,优选地,神经网络模型是在线序列超限学习机,即使用运动数据对在线序列超限学习机进行训练,从而提高训练效率。The neural network model can be a support vector machine (Support Vector Machine, SVM), online sequence over-limit learning machine and other models that can be incrementally online learning, or other incremental online learning models, such as incremental support vector machine ( ISVM), etc., the present invention does not limit this. Among them, because compared with other online learning algorithms, the online sequence over-limit learning machine has the characteristics of fast learning speed, strong generalization ability, and simple implementation. Therefore, preferably, the neural network model is an online sequence over-limit learning machine, that is, use The motion data trains the online sequence overrun learning machine, thereby improving the training efficiency.
其中,在对在线序列超限学习机进行训练时,在运动方面,输入输出分别为采样点(例如,机器人末端执行器)的位置和速度(或者关节角的角度和角加速度),因此,在线序列超限学习机的输入和输出应该具有相同的维度,即具有相同的神经元个数d。如果考虑二维平面内的运动,d=2,如果考虑三维空间内的运动,d=3。Among them, when training the online sequence over-limit learning machine, in terms of motion, the input and output are the position and speed (or the angle and angular acceleration of the joint angle) of the sampling point (for example, the robot end effector), so the online The input and output of the sequence overrun learning machine should have the same dimension, that is, the same number of neurons d. If you consider the movement in the two-dimensional plane, d = 2, if you consider the movement in the three-dimensional space, d = 3.
作为示例,图3示出在线序列超限学习机的示例性结构,如图3所示,假设在线序列超限学习机的隐藏层的激活函数为g,那么我们要学习的在线序列超限学习机(即,要学习的模型)可以表达为
Figure PCTCN2018121338-appb-000003
其中,隐藏层神经元个数为
Figure PCTCN2018121338-appb-000004
为隐藏层的偏置,
Figure PCTCN2018121338-appb-000005
为隐藏层的权值,维度为
Figure PCTCN2018121338-appb-000006
为输出层的权值,维度为
Figure PCTCN2018121338-appb-000007
As an example, FIG. 3 shows an exemplary structure of an online sequence overrun learning machine. As shown in FIG. 3, assuming that the activation function of the hidden layer of the online sequence overrun learning machine is g, then the online sequence overrun learning we want to learn The machine (ie, the model to be learned) can be expressed as
Figure PCTCN2018121338-appb-000003
Among them, the number of hidden layer neurons is
Figure PCTCN2018121338-appb-000004
For the offset of the hidden layer,
Figure PCTCN2018121338-appb-000005
Is the weight of the hidden layer, the dimension is
Figure PCTCN2018121338-appb-000006
Is the weight of the output layer, the dimension is
Figure PCTCN2018121338-appb-000007
其中,在线序列超限学习机的训练过程中,W和b是随机产生并固定不变的,训练的过程只需要确定输出层的权值即可,可通过对
Figure PCTCN2018121338-appb-000008
的优化过程来实现。
Among them, in the training process of the online sequence over-limit learning machine, W and b are randomly generated and fixed. The training process only needs to determine the weight of the output layer.
Figure PCTCN2018121338-appb-000008
Optimization process to achieve.
其中,
Figure PCTCN2018121338-appb-000009
表示示教数据中的目标输出。由于w和b都是随机产生并且固定的,因此H也是固定的。训练的目标即求解最优的一组输出层权值
Figure PCTCN2018121338-appb-000010
使得
Figure PCTCN2018121338-appb-000011
取到最小值。
among them,
Figure PCTCN2018121338-appb-000009
Indicates the target output in the teaching data. Since w and b are randomly generated and fixed, H is also fixed. The goal of training is to find the optimal set of output layer weights
Figure PCTCN2018121338-appb-000010
Make
Figure PCTCN2018121338-appb-000011
Get the minimum value.
其中,虽然激活函数g一般选择S形函数(sigmoid函数)或双曲正切函数(tanh函数),也可使用的修改后的S形函数,例如,
Figure PCTCN2018121338-appb-000012
但是,只要是满足
Figure PCTCN2018121338-appb-000013
并且单调递增的连续、连续可微的函数都符合激活函数的要求,在此不做限制。
Among them, although the activation function g generally selects the sigmoid function (sigmoid function) or the hyperbolic tangent function (tanh function), the modified sigmoid function can also be used, for example,
Figure PCTCN2018121338-appb-000012
However, as long as it is satisfied
Figure PCTCN2018121338-appb-000013
And the monotonically increasing continuous and continuously differentiable functions all meet the requirements of the activation function, and are not limited here.
在线序列超限学习机的训练目标是要找到一组最佳的输出层权值
Figure PCTCN2018121338-appb-000014
使用最小二乘法可得到
Figure PCTCN2018121338-appb-000015
其中,H +是矩阵H的Moore-Penrose广义逆矩阵。使用这种方法可不经过迭代求得输出层权值,在增加约束条件时,求解输出层权值的问题就变成了一个带约束的优化问题。
The training goal of the online sequence overrun learning machine is to find a set of optimal output layer weights
Figure PCTCN2018121338-appb-000014
Use the least squares method to get
Figure PCTCN2018121338-appb-000015
Among them, H + is the Moore-Penrose generalized inverse matrix of the matrix H. Using this method, the output layer weights can be obtained without iteration. When the constraints are added, the problem of solving the output layer weights becomes a constrained optimization problem.
其中,在线序列超限学习机的训练过程包括一个初始的ELM批量学习过程和一个连续的贯序学习过程,具体如下:Among them, the training process of the online sequence overrun learning machine includes an initial ELM batch learning process and a continuous sequential learning process, as follows:
在初始化阶段,给定初始训练子集
Figure PCTCN2018121338-appb-000016
其中,N 1是新到达的数据,由式
Figure PCTCN2018121338-appb-000017
计算得到的初始输出权值为
Figure PCTCN2018121338-appb-000018
其中,
Figure PCTCN2018121338-appb-000019
每当获取到新的训练样本
Figure PCTCN2018121338-appb-000020
时,按照
Figure PCTCN2018121338-appb-000021
递归计算输出权值。其中,
Figure PCTCN2018121338-appb-000022
In the initialization phase, given the initial training subset
Figure PCTCN2018121338-appb-000016
Among them, N 1 is the newly arrived data, by
Figure PCTCN2018121338-appb-000017
The calculated initial output weight is
Figure PCTCN2018121338-appb-000018
among them,
Figure PCTCN2018121338-appb-000019
Whenever a new training sample is obtained
Figure PCTCN2018121338-appb-000020
When
Figure PCTCN2018121338-appb-000021
Recursively calculate output weights. among them,
Figure PCTCN2018121338-appb-000022
作为示例,在根据示教数据中的交互力数据计算示教运动的变阻抗参数时,可根据交互力数据计算变刚度参数和变阻尼参数。As an example, when calculating the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, the variable stiffness parameter and the variable damping parameter may be calculated based on the interaction force data.
具体地,在计算变刚度参数时,令
Figure PCTCN2018121338-appb-000023
表示采集到的交互力(F)与相应的时间(q)信息,其中是所获得扰动数据样本的个数。在q时刻的变刚度参数是由时间窗[q-(w-1),q]内的力信息计算得到。滑动时间窗的长度为w,窗内数据点的上下界分别用L q,U q表示,
Figure PCTCN2018121338-appb-000024
在q时刻窗内数据点的个数为W q=U q-L q+1,窗内力数据对应的的协方差矩阵为
Figure PCTCN2018121338-appb-000025
其中,
Figure PCTCN2018121338-appb-000026
由于协方差矩阵Σ q是对称且正定的,故其能分解成如下形式Σ q=PΛP- 1,其中,Λ是包含特征值
Figure PCTCN2018121338-appb-000027
的对角阵。刚度矩阵K q
Figure PCTCN2018121338-appb-000028
其中,
Figure PCTCN2018121338-appb-000029
与特征值
Figure PCTCN2018121338-appb-000030
成正比,表达式为
Figure PCTCN2018121338-appb-000031
Figure PCTCN2018121338-appb-000032
Figure PCTCN2018121338-appb-000033
随着示教的进行,交互力的数据会不断的被收集,并根据时间信息将新的数据进行排序并取窗内的值进行刚度求解。例如当q+1时刻的数据进入时,协方差的在线更新为
Figure PCTCN2018121338-appb-000034
其中,
Figure PCTCN2018121338-appb-000035
Specifically, when calculating the variable stiffness parameter, let
Figure PCTCN2018121338-appb-000023
Represents the collected interaction force (F) and corresponding time (q) information, where is the number of disturbance data samples obtained. The variable stiffness parameter at time q is calculated from the force information in the time window [q-(w-1), q]. The length of the sliding time window is w, and the upper and lower bounds of the data points in the window are represented by L q and U q ,
Figure PCTCN2018121338-appb-000024
The number of data points in the window at time q is W q =U q -L q +1, and the covariance matrix corresponding to the force data in the window is
Figure PCTCN2018121338-appb-000025
among them,
Figure PCTCN2018121338-appb-000026
Because the covariance matrix Σ q is symmetric and positive definite, it can be decomposed into the following form Σ q = PΛP- 1 , where Λ is the eigenvalue
Figure PCTCN2018121338-appb-000027
Diagonal array. The stiffness matrix K q is
Figure PCTCN2018121338-appb-000028
among them,
Figure PCTCN2018121338-appb-000029
And eigenvalues
Figure PCTCN2018121338-appb-000030
Proportional to the expression
Figure PCTCN2018121338-appb-000031
Figure PCTCN2018121338-appb-000032
Figure PCTCN2018121338-appb-000033
As the teaching progresses, the interactive force data will be continuously collected, and the new data will be sorted according to the time information and the values in the window will be taken to solve the stiffness. For example, when the data at time q+1 enters, the online update of the covariance is
Figure PCTCN2018121338-appb-000034
among them,
Figure PCTCN2018121338-appb-000035
具体地,在计算变阻尼参数时,由于阻尼比是恒定的,阻尼与刚度的平方根是线性关系,因此可根据公式
Figure PCTCN2018121338-appb-000036
来计算变阻尼参数B。其中,γ是大于0的常数。
Specifically, when calculating the variable damping parameter, since the damping ratio is constant, the square root of the damping and the stiffness is linear, so it can be based on the formula
Figure PCTCN2018121338-appb-000036
To calculate the variable damping parameter B. Among them, γ is a constant greater than 0.
在本发明实施例中,为保证学习出的模型具有稳定性,优选地,在根据交互力数据计算示教运动的变阻抗参数时,可根据预设的稳定性约束条件和交互力数据对预设的变阻抗模型进行训练,得到示教运动的变阻抗参数,并根据变阻抗参数对变阻抗模型进行更新,从而保证变阻抗控制的稳定性,避免出现机器人交互力过大而出现伤人的情况。In the embodiment of the present invention, in order to ensure the stability of the learned model, preferably, when calculating the variable impedance parameter of the teaching motion based on the interaction force data, the preset stability constraints and the interaction force data can be used to predict The variable impedance model is trained to obtain the variable impedance parameters of the teaching movement, and the variable impedance model is updated according to the variable impedance parameters, so as to ensure the stability of the variable impedance control and avoid excessive robot interaction force that may cause injury. Happening.
在步骤S103中,根据运动方程和变阻抗参数控制操作。In step S103, the operation is controlled according to the equation of motion and the variable impedance parameter.
在本发明实施例中,在得到运动方程和变阻抗参数之后,可根据运动方程和变阻抗参数控制操作,从而控制机器人复现示教运动的运动轨迹和交互力。In the embodiment of the present invention, after obtaining the motion equation and the variable impedance parameter, the operation can be controlled according to the motion equation and the variable impedance parameter, thereby controlling the robot to reproduce the movement trajectory and interactive force of the teaching movement.
在本发明实施例中,在预设的神经网络模型(例如,在线序列超限学习机)和变阻抗模型训练完成之后,可使用训练后的神经网络模型(例如,在线序列超限学习机)和变阻抗模型,从而控制机器人复现示教运动的运动轨迹和交互力。In the embodiment of the present invention, after the training of the preset neural network model (for example, online sequence overrun learning machine) and variable impedance model is completed, the trained neural network model (for example, online sequence overrun learning machine) can be used And variable impedance models to control the trajectory and interaction force of the robot to reproduce the teaching movement.
作为示例,图4示出机器人柔顺性控制的示教学习和复现的示例性示图,如图4所示,示教者一只手抓握住机器人进行示教,机器人采集示教运动的轨 迹信息
Figure PCTCN2018121338-appb-000037
和力信息F q,然后根据轨迹信息
Figure PCTCN2018121338-appb-000038
进行动作学习得到f(·),并且根据力信息F q进行变刚度参数和变阻尼参数学习得到{B q,K q},最后根据f(·)产生运动,并根据{B q,K q}进行变阻抗控制,从而控制机器人复现示教运动的运动轨迹和交互力。
As an example, FIG. 4 shows an exemplary diagram of teaching learning and reproduction of robot compliance control. As shown in FIG. 4, the instructor grasps the robot with one hand for teaching, and the robot collects Track information
Figure PCTCN2018121338-appb-000037
And force information F q , then according to the trajectory information
Figure PCTCN2018121338-appb-000038
Perform motion learning to obtain f(·), and learn variable stiffness parameters and variable damping parameters according to the force information F q to get {B q ,K q }, and finally generate motion according to f(·) and according to {B q ,K q }Variable impedance control to control the trajectory and interaction force of the robot to reproduce the teaching movement.
在本发明实施例中,通过获取示教运动的示教数据,根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数,根据运动方程和变阻抗参数控制操作,从而减少了机器人柔顺性控制过程中的手动编程,降低了机器人的使用门槛,提高了机器人控制的柔顺性和精确性,进而提高了机器人的泛化能力、智能化程度和控制效果。In the embodiment of the present invention, by acquiring the teaching data of the teaching movement, the motion equation of the teaching movement is calculated according to the movement data in the teaching data, and at the same time, the variation of the teaching movement is calculated according to the interaction force data in the teaching data Impedance parameters, according to the motion equation and variable impedance parameter control operation, thereby reducing the manual programming in the robot compliance control process, lowering the threshold for robot use, improving the flexibility and accuracy of robot control, and thus improving the robot's universal Ability, degree of intelligence and control effect.
实施例二:Example 2:
图5示出了本发明实施例二提供的机器人柔顺性控制装置的结构,为了便于说明,仅示出了与本发明实施例相关的部分,其中包括:数据获取单元51、参数计算单元52和操作控制单元53。FIG. 5 shows the structure of the robot compliance control device provided in Embodiment 2 of the present invention. For ease of explanation, only parts related to the embodiment of the present invention are shown, including: a data acquisition unit 51, a parameter calculation unit 52 and Operation control unit 53.
数据获取单元51,用于获取示教运动的示教数据,其中,示教数据至少包括示教运动的运动数据和交互力数据。The data acquiring unit 51 is configured to acquire teaching data of the teaching movement, wherein the teaching data includes at least the movement data and the interaction force data of the teaching movement.
在本发明实施例中,对示教运动的学习可包括动作学习和力学习(即,变刚度参数和变阻尼参数学习)。In the embodiment of the present invention, the learning of the teaching movement may include action learning and force learning (ie, variable stiffness parameter and variable damping parameter learning).
在本发明实施例中,运动数据可包括机器人的预设点(例如,末端执行器)的位置数据和速度数据,或者运动数据可包括机器人的预设角(例如,关节角)的角度和角加速度,另外,运动数据还可包括其他可用于完整描述示教运动的一个或多个参数,本发明对此不进行限制。In the embodiment of the present invention, the motion data may include position data and speed data of a preset point of the robot (eg, end effector), or the motion data may include angle and angle of a preset angle of the robot (eg, joint angle) Acceleration. In addition, the motion data may also include one or more other parameters that can be used to completely describe the teaching motion, which is not limited by the present invention.
因此,优选地,数据获取单元51可包括:Therefore, preferably, the data acquisition unit 51 may include:
第一获取单元,用于获取与示教运动相关的位置数据、交互力数据和时间数据;以及The first acquiring unit is used to acquire position data, interaction force data and time data related to the teaching movement; and
第一计算单元,用于根据位置数据和时间数据计算运动数据。The first calculation unit is used to calculate the motion data according to the position data and the time data.
具体地,根据位置数据和时间数据计算与示教运动相关的速度数据,从而得到示教运动的运动数据。Specifically, the speed data related to the teaching movement is calculated based on the position data and the time data, thereby obtaining the movement data of the teaching movement.
优选地,数据获取单元51还可包括:Preferably, the data acquisition unit 51 may further include:
第二获取单元,用于获取与示教运动相关的角度数据、交互力数据和时间数据;以及The second acquisition unit is used to acquire angle data, interaction force data and time data related to the teaching movement; and
第二计算单元,用于根据角度数据和时间数据计算运动数据。The second calculation unit is used to calculate the motion data according to the angle data and the time data.
具体地,可根据角度数据和时间数据计算与示教运动相关的角加速度数据,从而得到示教运动的运动数据。Specifically, the angular acceleration data related to the teaching movement may be calculated according to the angle data and the time data, thereby obtaining the movement data of the teaching movement.
参数计算单元52,用于根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数,其中,变阻抗参数至少包括变刚度参数和变阻尼参数。The parameter calculation unit 52 is configured to calculate the motion equation of the teaching motion based on the motion data in the teaching data, and at the same time calculate the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, where the variable impedance parameter includes at least Variable stiffness parameters and variable damping parameters.
在本发明实施例中,同时对运动轨迹和力进行学习,从而提高学习效果,进而提高控制结果的精确性。In the embodiment of the present invention, the trajectory and the force are simultaneously learned, thereby improving the learning effect, and thereby improving the accuracy of the control result.
优选地,参数计算单元52可包括:Preferably, the parameter calculation unit 52 may include:
第一训练单元,用于使用运动数据对预设的神经网络模型进行训练,得到示教运动的运动方程,并根据运动方程对神经网络模型进行在线更新,从而提高运动方程的计算效率,方便运动方程的后续使用,并且适应实时在线学习的需要,进而提高学习效果。The first training unit is used to train the preset neural network model using motion data to obtain the motion equation of the teaching movement, and update the neural network model online according to the motion equation, thereby improving the calculation efficiency of the motion equation and facilitating movement The subsequent use of equations, and to meet the needs of real-time online learning, and thus improve the learning effect.
其中,优选地,模型训练单元可包括:Among them, preferably, the model training unit may include:
增量学习单元,用于以逐一或逐块的方式对运动数据进行增量学习,得到示教运动的运动方程,从而提高运动方程的准确性,进而提高学习效果。The incremental learning unit is used to incrementally learn the motion data in a one-by-one or block-by-block manner to obtain the motion equation of the teaching motion, thereby improving the accuracy of the motion equation and thereby improving the learning effect.
其中,优选地,神经网络模型是在线序列超限学习机。Among them, preferably, the neural network model is an online sequence overrun learning machine.
优选地,参数计算单元52可包括:Preferably, the parameter calculation unit 52 may include:
第二训练单元,用于根据预设的稳定性约束条件和交互力数据对预设的变阻抗模型进行训练,得到示教运动的变阻抗参数,并根据变阻抗参数对变阻抗模型进行更新,从而保证变阻抗控制的稳定性,避免出现机器人交互力过大而 出现伤人的情况。The second training unit is used to train the preset variable impedance model according to the preset stability constraints and interaction force data to obtain the variable impedance parameter of the teaching movement, and update the variable impedance model according to the variable impedance parameter, Therefore, the stability of the variable impedance control is ensured, and the situation that the interaction force of the robot is too large to cause injury is avoided.
操作控制单元53,用于根据运动方程和变阻抗参数控制操作。The operation control unit 53 is used to control the operation according to the equation of motion and the variable impedance parameter.
在本发明实施例中,通过数据获取单元51获取示教运动的示教数据,通过参数计算单元52根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数,通过操作控制单元53根据运动方程和变阻抗参数控制操作,从而减少了机器人柔顺性控制过程中的手动编程,降低了机器人的使用门槛,提高了机器人控制的柔顺性和精确性,进而提高了机器人的泛化能力、智能化程度和控制效果。In the embodiment of the present invention, the teaching data of the teaching movement is acquired by the data acquiring unit 51, the motion equation of the teaching movement is calculated according to the movement data in the teaching data by the parameter calculating unit 52, and at the same time according to the teaching data The interactive force data calculates the variable impedance parameters of the teaching movement, and the operation is controlled by the operation control unit 53 according to the motion equation and the variable impedance parameters, thereby reducing the manual programming in the robot compliance control process, lowering the robot's use threshold, and improving the robot The flexibility and accuracy of the control, thereby improving the robot's generalization ability, intelligence and control effect.
在本发明实施例中,机器人柔顺性控制装置的各单元可由相应的硬件或软件单元实现,各单元可以为独立的软、硬件单元,也可以集成为一个软、硬件单元,在此不用以限制本发明。In the embodiment of the present invention, each unit of the robot compliance control device may be implemented by a corresponding hardware or software unit, and each unit may be an independent software and hardware unit, or may be integrated into one software and hardware unit, which is not limited here. this invention.
实施例三:Example three:
图6示出了本发明实施例四提供的计算设备的结构,为了便于说明,仅示出了与本发明实施例相关的部分。FIG. 6 shows the structure of the computing device provided in Embodiment 4 of the present invention. For ease of description, only parts related to the embodiment of the present invention are shown.
本发明实施例的计算设备6包括处理器60、存储器61以及存储在存储器61中并可在处理器60上运行的计算机程序62。该处理器60执行计算机程序62时实现上述各个机器人柔顺性控制方法实施例中的步骤,例如图1所示的步骤S101至S103。或者,处理器60执行计算机程序62时实现上述各装置实施例中各单元的功能,例如,图5所示单元51至53的功能。The computing device 6 of the embodiment of the present invention includes a processor 60, a memory 61, and a computer program 62 stored in the memory 61 and executable on the processor 60. When the processor 60 executes the computer program 62, the steps in the above embodiments of the robot compliance control method are implemented, for example, steps S101 to S103 shown in FIG. 1. Alternatively, when the processor 60 executes the computer program 62, the functions of the units in the above device embodiments are realized, for example, the functions of the units 51 to 53 shown in FIG.
在本发明实施例中,该处理器60执行计算机程序62时实现上述各个机器人柔顺性控制方法实施例中的步骤时,获取示教运动的示教数据,根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数,根据运动方程和变阻抗参数控制操作,从而减少了机器人柔顺性控制过程中的手动编程,降低了机器人的使用门槛,提高了机器人控制的柔顺性和精确性,进而提高了机器人的泛化能力、智能化程度和控制效果。In the embodiment of the present invention, when the processor 60 executes the computer program 62 to realize the steps in the above embodiments of the robot compliance control method, the teaching data of the teaching motion is acquired, and the teaching data is calculated according to the motion data in the teaching data Teaching the motion equation of motion, and at the same time calculating the variable impedance parameters of the teaching motion based on the interactive force data in the teaching data, and controlling the operation according to the motion equations and variable impedance parameters, thereby reducing the manual programming and reducing the robot's compliance control process. The use threshold of the robot is improved, and the flexibility and accuracy of the robot control are improved, thereby improving the robot's generalization ability, intelligence, and control effect.
该计算设备6中处理器60在执行计算机程序62时实现的步骤具体可参考实施例一中方法的描述,在此不再赘述。For the steps implemented by the processor 60 in the computing device 6 when executing the computer program 62, reference may be made to the description of the method in Embodiment 1, and details are not described herein again.
实施例五:Example 5:
在本发明实施例中,提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,该计算机程序被处理器执行时实现上述各个机器人柔顺性控制方法实施例中的步骤,例如,图1所示的步骤S101至S103。或者,该计算机程序被处理器执行时实现上述各装置实施例中各单元的功能,例如,图5所示单元51至53的功能。In an embodiment of the present invention, a computer-readable storage medium is provided, and the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps in the embodiments of the foregoing robot compliance control methods are implemented. For example, steps S101 to S103 shown in FIG. 1. Alternatively, when the computer program is executed by the processor, the functions of the units in the foregoing device embodiments are realized, for example, the functions of the units 51 to 53 shown in FIG. 5.
在本发明实施例中,获取示教运动的示教数据,根据示教数据中的运动数据计算示教运动的运动方程,并且同时根据示教数据中的交互力数据计算示教运动的变阻抗参数,根据运动方程和变阻抗参数控制操作,从而减少了机器人柔顺性控制过程中的手动编程,降低了机器人的使用门槛,提高了机器人控制的柔顺性和精确性,进而提高了机器人的泛化能力、智能化程度和控制效果。该计算机程序被处理器执行时实现的机器人柔顺性控制方法进一步可参考前述方法实施例中步骤的描述,在此不再赘述。In the embodiment of the present invention, the teaching data of the teaching motion is acquired, the motion equation of the teaching motion is calculated according to the motion data in the teaching data, and the variable impedance of the teaching motion is calculated according to the interaction force data in the teaching data The parameters control the operation according to the equations of motion and variable impedance parameters, thereby reducing manual programming during robot compliance control, lowering the threshold for robot use, improving the flexibility and accuracy of robot control, and thus improving the generalization of the robot Ability, degree of intelligence and control effect. The robot compliance control method implemented when the computer program is executed by the processor may further refer to the description of the steps in the foregoing method embodiments, and details are not described herein again.
本发明实施例的计算机可读存储介质可以包括能够携带计算机程序代码的任何实体或装置、记录介质,例如,ROM/RAM、磁盘、光盘、闪存等存储器。The computer-readable storage medium in the embodiments of the present invention may include any entity or device capable of carrying computer program code, and a recording medium, such as ROM/RAM, magnetic disk, optical disk, flash memory, and other memories.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above are only the preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacement and improvement made within the spirit and principle of the present invention should be included in the protection of the present invention Within range.

Claims (12)

  1. 一种机器人柔顺性控制方法,其特征在于,所述方法包括下述步骤:A robot compliance control method, characterized in that the method includes the following steps:
    获取示教运动的示教数据,其中,所述示教数据至少包括所述示教运动的运动数据和交互力数据;Acquiring teaching data of a teaching movement, wherein the teaching data includes at least the movement data and interaction force data of the teaching movement;
    根据所述示教数据中的运动数据计算所述示教运动的运动方程,并且同时根据所述示教数据中的交互力数据计算所述示教运动的变阻抗参数,其中,所述变阻抗参数至少包括变刚度参数和变阻尼参数;Calculating the motion equation of the teaching motion based on the motion data in the teaching data, and simultaneously calculating the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, wherein the variable impedance The parameters include at least variable stiffness parameters and variable damping parameters;
    根据所述运动方程和所述变阻抗参数控制操作。The operation is controlled according to the equation of motion and the variable impedance parameter.
  2. 如权利要求1所述的方法,其特征在于,获取示教运动的示教数据的步骤,包括:The method of claim 1, wherein the step of acquiring teaching data of the teaching movement includes:
    获取与示教运动相关的位置数据、交互力数据和时间数据;Obtain position data, interaction force data and time data related to teaching movement;
    根据所述位置数据和所述时间数据计算所述运动数据,其中,所述运动数据中包括速度数据。The motion data is calculated based on the position data and the time data, wherein the motion data includes speed data.
  3. 如权利要求1所述的方法,其特征在于,根据所述示教数据中的运动数据计算所述示教运动的运动方程的步骤,包括:The method according to claim 1, wherein the step of calculating the motion equation of the teaching motion based on the motion data in the teaching data includes:
    使用所述运动数据对预设的神经网络模型进行训练,得到所述示教运动的运动方程,并根据所述运动方程对所述神经网络模型进行在线更新,Training the preset neural network model using the motion data to obtain the motion equation of the teaching movement, and updating the neural network model online according to the motion equation,
    其中,使用所述运动数据对预设的神经网络模型进行训练的步骤,包括:Wherein, the step of training the preset neural network model using the motion data includes:
    以逐一或逐块的方式对所述运动数据进行增量学习,得到所述示教运动的运动方程。Perform incremental learning on the motion data in a one-by-one or block-by-block manner to obtain the motion equation of the teaching motion.
  4. 如权利要求4所述的方法,其特征在于,所述神经网络模型包括在线序列超限学习机。The method of claim 4, wherein the neural network model includes an online sequence overrun learning machine.
  5. 如权利要求1所述的方法,其特征在于,根据所述示教数据中的交互力数据计算所述示教运动的变阻抗参数的步骤,包括:The method of claim 1, wherein the step of calculating the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data includes:
    根据预设的稳定性约束条件和所述交互力数据对预设的变阻抗模型进行训练,得到所述示教运动的变阻抗参数,并根据所述变阻抗参数对所述变阻抗模 型进行更新。Train a preset variable impedance model according to preset stability constraints and the interaction force data to obtain a variable impedance parameter of the teaching movement, and update the variable impedance model according to the variable impedance parameter .
  6. 一种机器人柔顺性控制装置,其特征在于,所述装置包括:A robot compliance control device, characterized in that the device includes:
    数据获取单元,用于获取示教运动的示教数据,其中,所述示教数据至少包括所述示教运动的运动数据和交互力数据;A data acquisition unit for acquiring teaching data of the teaching movement, wherein the teaching data includes at least the movement data and the interaction force data of the teaching movement;
    参数计算单元,用于根据所述示教数据中的运动数据计算所述示教运动的运动方程,并且同时根据所述示教数据中的交互力数据计算所述示教运动的变阻抗参数,其中,所述变阻抗参数至少包括变刚度参数和变阻尼参数;以及A parameter calculation unit, configured to calculate the motion equation of the teaching motion based on the motion data in the teaching data, and at the same time calculate the variable impedance parameter of the teaching motion based on the interaction force data in the teaching data, Wherein, the variable impedance parameter includes at least a variable stiffness parameter and a variable damping parameter; and
    操作控制单元,用于根据所述运动方程和所述变阻抗参数控制操作。The operation control unit is used for controlling operation according to the motion equation and the variable impedance parameter.
  7. 如权利要求6所述的装置,其特征在于,所述数据获取单元包括:The apparatus of claim 6, wherein the data acquisition unit comprises:
    第一获取单元,用于获取与示教运动相关的位置数据、交互力数据和时间数据;以及The first acquiring unit is used to acquire position data, interaction force data and time data related to the teaching movement; and
    第一计算单元,用于根据所述位置数据和所述时间数据计算所述运动数据,其中,所述运动数据中包括速度数据。The first calculation unit is configured to calculate the motion data according to the position data and the time data, wherein the motion data includes speed data.
  8. 如权利要求6所述的装置,其特征在于,所述参数计算单元包括:The apparatus of claim 6, wherein the parameter calculation unit comprises:
    第一训练单元,用于使用所述运动数据对预设的神经网络模型进行训练,得到所述示教运动的运动方程,并根据所述运动方程对所述神经网络模型进行在线更新,The first training unit is used to train the preset neural network model using the motion data to obtain the motion equation of the teaching movement, and update the neural network model online according to the motion equation,
    其中,所述模型训练单元包括:Wherein, the model training unit includes:
    增量学习单元,用于以逐一或逐块的方式对所述运动数据进行增量学习,得到所述示教运动的运动方程。An incremental learning unit is used to incrementally learn the motion data in a one-by-one or block-by-block manner to obtain the motion equation of the teaching motion.
  9. 如权利要求8所述的装置,其特征在于,所述神经网络模型包括在线序列超限学习机。The apparatus of claim 8, wherein the neural network model includes an online sequence overrun learning machine.
  10. 如权利要求6所述的装置,其特征在于,所述参数计算单元包括:The apparatus of claim 6, wherein the parameter calculation unit comprises:
    第二训练单元,用于根据预设的稳定性约束条件和所述交互力数据对预设的变阻抗模型进行训练,得到所述示教运动的变阻抗参数,并根据所述变阻抗参数对所述变阻抗模型进行更新。The second training unit is configured to train the preset variable impedance model according to the preset stability constraint conditions and the interaction force data, to obtain the variable impedance parameter of the teaching movement, and according to the variable impedance parameter The variable impedance model is updated.
  11. 一种计算设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1至5任一项所述方法的步骤。A computing device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that, when the processor executes the computer program, it is implemented as claimed in claims 1 to 5 The steps of any of the methods described.
  12. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至5任一项所述方法的步骤。A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 5 are implemented.
PCT/CN2018/121338 2018-12-14 2018-12-14 Compliance control method and apparatus for robot, device, and storage medium WO2020118730A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/121338 WO2020118730A1 (en) 2018-12-14 2018-12-14 Compliance control method and apparatus for robot, device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/121338 WO2020118730A1 (en) 2018-12-14 2018-12-14 Compliance control method and apparatus for robot, device, and storage medium

Publications (1)

Publication Number Publication Date
WO2020118730A1 true WO2020118730A1 (en) 2020-06-18

Family

ID=71075313

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/121338 WO2020118730A1 (en) 2018-12-14 2018-12-14 Compliance control method and apparatus for robot, device, and storage medium

Country Status (1)

Country Link
WO (1) WO2020118730A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114366552A (en) * 2021-12-23 2022-04-19 威海经济技术开发区天智创新技术研究院 Exoskeleton control method and system for upper limb rehabilitation training
CN114800489A (en) * 2022-03-22 2022-07-29 华南理工大学 Mechanical arm compliance control method based on combination of definite learning and composite learning, storage medium and robot
CN114848391A (en) * 2022-04-28 2022-08-05 北京邮电大学 Compliance control method for lower limb rehabilitation robot
CN115421387A (en) * 2022-09-22 2022-12-02 中国科学院自动化研究所 Variable impedance control system and control method based on inverse reinforcement learning
WO2023124346A1 (en) * 2021-12-28 2023-07-06 广东省科学院智能制造研究所 Method and system for variable-stiffness motor skill learning and regulation for collaborative robot

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0416123A1 (en) * 1989-03-20 1991-03-13 Fanuc Ltd. Manual intervention method for industrial robot
JP2002120174A (en) * 2000-10-11 2002-04-23 Sony Corp Authoring system, authoring method and storage medium
CN101909829A (en) * 2008-02-28 2010-12-08 松下电器产业株式会社 Control apparatus and control method for a robot arm, robot, control program for a robot arm, and electronic integrated circuit for controlling a robot arm
CN104259991A (en) * 2014-09-01 2015-01-07 沈阳远大科技园有限公司 Force control module based on variable-rigidity flexible mechanism
CN206484559U (en) * 2016-11-08 2017-09-12 武汉海默机器人有限公司 Robot teaching system based on three-dimensional force sensor

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0416123A1 (en) * 1989-03-20 1991-03-13 Fanuc Ltd. Manual intervention method for industrial robot
JP2002120174A (en) * 2000-10-11 2002-04-23 Sony Corp Authoring system, authoring method and storage medium
CN101909829A (en) * 2008-02-28 2010-12-08 松下电器产业株式会社 Control apparatus and control method for a robot arm, robot, control program for a robot arm, and electronic integrated circuit for controlling a robot arm
CN104259991A (en) * 2014-09-01 2015-01-07 沈阳远大科技园有限公司 Force control module based on variable-rigidity flexible mechanism
CN206484559U (en) * 2016-11-08 2017-09-12 武汉海默机器人有限公司 Robot teaching system based on three-dimensional force sensor

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114366552A (en) * 2021-12-23 2022-04-19 威海经济技术开发区天智创新技术研究院 Exoskeleton control method and system for upper limb rehabilitation training
CN114366552B (en) * 2021-12-23 2023-10-24 威海经济技术开发区天智创新技术研究院 Exoskeleton control method and system for upper limb rehabilitation training
WO2023124346A1 (en) * 2021-12-28 2023-07-06 广东省科学院智能制造研究所 Method and system for variable-stiffness motor skill learning and regulation for collaborative robot
CN114800489A (en) * 2022-03-22 2022-07-29 华南理工大学 Mechanical arm compliance control method based on combination of definite learning and composite learning, storage medium and robot
CN114800489B (en) * 2022-03-22 2023-06-20 华南理工大学 Mechanical arm compliant control method based on combination of deterministic learning and compound learning, storage medium and robot
CN114848391A (en) * 2022-04-28 2022-08-05 北京邮电大学 Compliance control method for lower limb rehabilitation robot
CN114848391B (en) * 2022-04-28 2023-08-29 北京邮电大学 Flexible control method of lower limb rehabilitation robot
CN115421387A (en) * 2022-09-22 2022-12-02 中国科学院自动化研究所 Variable impedance control system and control method based on inverse reinforcement learning

Similar Documents

Publication Publication Date Title
WO2020118730A1 (en) Compliance control method and apparatus for robot, device, and storage medium
Gu et al. Deep reinforcement learning for robotic manipulation
Gu et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates
CN109702740B (en) Robot compliance control method, device, equipment and storage medium
CN108115681B (en) Simulation learning method and device for robot, robot and storage medium
Guenter et al. Reinforcement learning for imitating constrained reaching movements
Calinon et al. Learning and reproduction of gestures by imitation
Peters et al. Reinforcement learning by reward-weighted regression for operational space control
Stulp et al. Model-free reinforcement learning of impedance control in stochastic environments
Mitrovic et al. Adaptive optimal feedback control with learned internal dynamics models
Wang et al. A framework of hybrid force/motion skills learning for robots
Neumann et al. Neural learning of stable dynamical systems based on data-driven Lyapunov candidates
Yu et al. Global model learning for large deformation control of elastic deformable linear objects: An efficient and adaptive approach
US11565412B2 (en) Generating a robot control policy from demonstrations collected via kinesthetic teaching of a robot
Ewerton et al. Incremental imitation learning of context-dependent motor skills
Zhang et al. Robot learning system based on dynamic movement primitives and neural network
Yin et al. Learning nonlinear dynamical system for movement primitives
Zhang et al. Learning robust point-to-point motions adversarially: A stochastic differential equation approach
Kawaharazuka et al. Adaptive robotic tool-tip control learning considering online changes in grasping state
Joukov et al. Gaussian process based model predictive controller for imitation learning
Danel Reinforcement learning for humanoid robot control
Koropouli et al. Generalization of Force Control Policies from Demonstrations for Constrained Robotic Motion Tasks: A Regression-Based Approach
Guenter et al. Using reinforcement learning to adapt an imitation task
WO2019095108A1 (en) Robot imitation learning method and apparatus, robot and storage medium
Li et al. Target dynamic grasping during mobile robot movement based on learning methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18943108

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 04.11.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18943108

Country of ref document: EP

Kind code of ref document: A1