WO2020133881A1 - Learning control method for mechanical apparatus, and mechanical apparatus learning control system having learning function - Google Patents

Learning control method for mechanical apparatus, and mechanical apparatus learning control system having learning function Download PDF

Info

Publication number
WO2020133881A1
WO2020133881A1 PCT/CN2019/086712 CN2019086712W WO2020133881A1 WO 2020133881 A1 WO2020133881 A1 WO 2020133881A1 CN 2019086712 W CN2019086712 W CN 2019086712W WO 2020133881 A1 WO2020133881 A1 WO 2020133881A1
Authority
WO
WIPO (PCT)
Prior art keywords
learning
motion
action
information
control
Prior art date
Application number
PCT/CN2019/086712
Other languages
French (fr)
Chinese (zh)
Inventor
张冶
李松洋
王杰高
Original Assignee
南京埃斯顿机器人工程有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京埃斯顿机器人工程有限公司 filed Critical 南京埃斯顿机器人工程有限公司
Publication of WO2020133881A1 publication Critical patent/WO2020133881A1/en

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls

Definitions

  • the invention relates to the technical field of machine learning, and relates to machine equipment learning control, which is a machine equipment learning method with a learning function and a server system thereof.
  • Mechanical equipment learning control is the control that optimizes the processing technology.
  • the processing technology generally includes roughing, finishing, assembling, inspection, and packaging.
  • the mechanical equipment learns the above processes to achieve optimal control of the processing technology.
  • scholars and engineers have applied learning control to robot control and other occasions. As described in the following literature:
  • the Chinese patent "Robot with Learning Control Function” (CN102189550A) discloses a robot with learning control function.
  • a sensor is installed on a part of the position control object, and the learning control unit learns through the target trajectory or position detected and operated by the sensor.
  • the learning correction amount is obtained, and the data collected by the external sensor includes position, speed, acceleration, inertia, etc. This method relies on the same predetermined action, and different actions need to be re-learned, so that the previous learning results cannot be reused.
  • the Chinese patent application "Robot Device with Learning Function” discloses a robot device that calculates a predetermined motion for improving the robot based on the physical quantity detected by the sensor during the predetermined motion of the robot during the predetermined motion
  • the learning correction amount is calculated by the learning expansion unit through the relationship between the learning correction amount and the information of the learned predetermined action. For the information of the new action, the calculated relationship is used to calculate the learning correction amount for improving the new action. study again.
  • the learning expansion unit since the learning expansion unit is built into the control unit of the mechanical equipment, other similar mechanical equipment cannot use the learning result of the mechanical equipment, that is, there is a problem that the other mechanical equipment needs to re-learn to construct the learning expansion unit.
  • the problem to be solved by the present invention is that the existing mechanical equipment learning control method has a single learning object and is not universal. For new action instructions or new equipment, it is often necessary to re-learn, and the efficiency of the learning control method is not high.
  • the technical solution of the present invention is: a control learning method of mechanical equipment, a learning server, a mechanical control section, a driving section, a teaching system and a sensor are provided for the mechanical equipment,
  • the teaching system sends action commands to the machine control part, the drive part is used to drive the machine action, and the sensor is used to obtain the actual action information of the machine;
  • the mechanical control unit includes a motion analysis unit, a learning control unit, a storage unit, a learning analysis unit, and a motion control unit.
  • the motion analysis unit performs motion analysis on the motion commands issued by the teaching system, and the motion analysis unit sends the analyzed motion information to the learning
  • the amount of learning correction is obtained and stored in the storage unit, where the machine learns different predetermined actions, including action commands in the case of different positions, speeds, and accelerations; the stored action information and corresponding learning are cached in the storage unit
  • the correction amount is transmitted to the learning and analysis department; the learning and analysis department sorts out the action information, that is, the learning correction amount, and uploads the sorted information to the learning server.
  • the learning analysis unit analyzes the new action command issued by the teaching system, and obtains the learning correction amount of the corresponding or similar action from the learning server according to the analysis result.
  • the motion compensation is performed on the driving part.
  • the motion control part performs robot motion control according to the motion information of the new motion command transmitted by the motion analysis part, and transmits the motion signal to the drive part.
  • the drive part according to the motion information and the learning correction amount To make the machine execute according to the command action.
  • the learning of the learning control unit and the analysis of the learning analysis unit are as follows:
  • the learning control section obtained by learning the learning correction amount corresponding to N sets of tracks, the movement trajectory N i is divided into M i segment, the N sets of different trajectories that there are M sets operation information and the corresponding study Correction amount,
  • E pq CanonicalWarpingDistance[L p ,L q ]p,q ⁇ M,p ⁇ q
  • ⁇ , ⁇ are weight coefficients
  • the action value data W p and W q are considered to be similar, and the action value data is redundant; when the M sets of data are processed by similarity calculations, the redundant action value data is removed And organize and send to the learning server for use.
  • the corrected action information of the segmented action information is Z j
  • the similarity solution analysis is performed with Y k in the action value data W
  • the similarity G jk of the action information is obtained
  • G jk CanonicalWarpingDistance[Z j ,Y k ]
  • the learning correction amount of the new action corresponds to the learning correction amount L k in the learning server, and the above process is repeatedly solved until the learning correction amount of the segmented trajectory of the new action instruction is all obtained.
  • the similarity calculation method used when calculating the similarity between the action information and the learning correction amount, includes a time warping distance algorithm, a neural network algorithm, a fuzzy control algorithm, a genetic algorithm, and a simulated annealing algorithm.
  • the obtained action information and learning correction amount are uploaded to the learning server.
  • the method of the present invention is used to optimize the action of mechanical equipment and process optimization of mechanical equipment.
  • the present invention also proposes a mechanical equipment server system with a learning function, including a learning server and a mechanical control unit.
  • the mechanical equipment itself has an action instruction system, a drive unit, and a sensor.
  • the action instruction system is used to issue an action instruction to the machine control unit, drive
  • the part is used to drive the operation of the mechanical device, and the sensor is used to obtain the actual action information of the mechanical device;
  • the learning server and the mechanical control part are storage media provided with a computer program that implements the method of claim 1 when the computer program runs.
  • the setting method of learning server includes mechanical equipment LAN server, enterprise server and cloud server.
  • the invention provides a mechanical equipment learning method with a learning function and a server system thereof.
  • the new processing technology is analyzed, and the learning correction amount is obtained from the learning server according to the analysis result, without the need to use
  • the sensor re-learns the processing technology.
  • the present invention proposes a control learning method for mechanical equipment.
  • a learning server is constructed.
  • the motion analysis unit analyzes the new motion instructions, and based on the results of the matching learning server, can obtain the learned learning correction amount without repeated learning.
  • control learning method and server system of the mechanical equipment proposed in the present invention do not need to be bound to fixed mechanical equipment, and can be flexibly replaced and run on different mechanical equipment.
  • the learning server system of the present invention can improve the working efficiency of mechanical equipment, reduce the cost of mechanical equipment and the cost of production line systems.
  • the motion analysis unit analyzes the new motion command, and updates the learning server of the mechanical device with the new motion information and the data of the learning correction amount according to the result of the matching learning server, so that the learning server of the present invention continuously self Enrichment.
  • the method of the present invention can be used to assist in suppressing vibrations during the operation of mechanical equipment to achieve agile movements.
  • the method of the present invention can optimize the action of mechanical equipment from multiple angles, such as beat, power consumption, motion accuracy, etc., and improve production efficiency.
  • the method of the present invention can optimize the processing technology of mechanical equipment from multiple angles, casting, forging, etc., and improve the product processing quality of mechanical equipment.
  • the learning server system established by the present invention can be used by replacing equipment, and other mechanical equipment does not need to be re-learned.
  • the motion analysis unit can directly obtain the corresponding learning correction amount from the learning server for the new motion.
  • Figure 1 is a schematic diagram of the method of the present invention.
  • FIG. 2 is a schematic diagram of robot learning in the method of the present invention.
  • FIG. 3 is a flowchart of constructing a learning server in the present invention.
  • FIG. 4 is a schematic diagram of the robot in operation in the method of the present invention.
  • FIG. 5 is a flowchart of using the learning server system in the present invention.
  • the invention provides a mechanical equipment learning method with a learning function and a server system thereof.
  • the new processing technology is analyzed, and the learning correction amount is obtained from the learning server according to the analysis result, without the need to use
  • the sensor re-learns the processing technology.
  • the embodiments of the present invention take industrial robots and motion optimization control as examples, but are not limited to industrial robots, but also include other mechanical equipment such as servo drives, and are not limited to motion optimization control, but also include process optimization. control.
  • FIG. 1 shows a schematic diagram of a mechanical device learning method and a server system thereof according to an embodiment of the present invention.
  • the robot is equipped with a learning server, robot control unit, drive unit, teaching system, and sensors.
  • the teaching system sends motion commands to the robot control unit, hoping that the robot can perform the desired actions.
  • the drive unit is used to drive the robot to move.
  • the robot control unit is composed of a motion analysis unit, a learning control unit, a storage unit, a learning analysis unit, and a motion control unit.
  • the motion analysis unit mainly analyzes the motion commands transmitted from the teaching system.
  • the motion analysis includes kinematics planning and dynamic analysis.
  • the motion analysis unit sends the analyzed motion information to the learning control unit, storage unit, and learning analysis. Part, and motion control part.
  • the learning control unit learns the motion information transmitted by the motion analysis unit and the information collected by the sensor, obtains the learning correction amount, and stores it in the storage unit.
  • the robot learns with a predetermined motion in the work space.
  • the predetermined motion needs to include different positions, speeds, accelerations, etc. to ensure the quality of the robot learning.
  • the learning action information and the corresponding learning correction amount are stored in the storage unit, and when the learning is completed, it is transmitted to the learning analysis unit.
  • the learning analysis section sorts the action value information, removes redundant information, and uploads the action value information to the learning server.
  • the learning analysis part analyzes the action, and obtains the learning correction amount of the corresponding or similar action from the learning server according to the analysis result, and acts on the drive part make up.
  • the motion control unit performs robot motion control on the motion information transmitted from the motion analysis unit, and transmits the motion signal to the driving unit.
  • the driving part is mainly composed of a servo drive and a servo motor.
  • the servo drive drives the servo motor through position, speed or current feedback control to make the robot mechanism perform the expected action.
  • the sensor is installed on the target part of the robot's position control, and is used to obtain physical information such as the position, velocity, or acceleration of the target part.
  • the sensors involved in the above embodiments include encoders, position sensors, speed sensors, acceleration sensors, vision sensors, force sensors, angular velocity sensors, gyro sensors, inertial measurement units, and the like.
  • Figure 3 shows a flow chart for building a learning server.
  • the teaching system issues motion commands to the robot control unit.
  • the motion analysis unit of the robot control unit analyzes the motion commands.
  • the learning control unit obtains the motion information and the data collected by the external sensor to learn to obtain the learning correction of the overall motion. And save the motion information and learning correction amount to the storage unit.
  • the learning analysis unit takes out the action information and the learning correction amount from the storage unit, and processes the learning correction amount and the action information in segments to construct action value data, and removes the redundant result after analysis.
  • the learning analysis department uploads the final analysis results to the learning server.
  • An acceleration sensor is used as an example of the sensor installed in the target part, and the physical quantity detected by the sensor is the acceleration of the target part controlled by the robot position.
  • the velocity component is obtained after one integration, and the position component is obtained after two integrations.
  • the learning control section obtained by learning the learning correction amount corresponding to N sets of tracks, the movement trajectory N i is divided into M i segment, the N sets of different trajectories that there are M sets operation information and the corresponding study Correction amount, M i can be determined according to the length of the trajectory and the number of interpolation points.
  • the first group of trajectories is divided into M 1 segments, the second group of trajectories is divided into M 2 segments, and so on.
  • E pq CanonicalWarpingDistance[L p ,L q ] p,q ⁇ M,p ⁇ q
  • ⁇ , ⁇ are weight coefficients
  • ⁇ 1 is the similarity threshold, corresponding to the actual operating conditions of different models, different loads, different speeds, different inertias, etc., which can be adjusted in advance according to the experiment.
  • the redundant action value data is removed and sorted, and sent to the learning server for use by the robot and other robots.
  • Fig. 5 shows a flow chart of the use of the learning server system.
  • the teaching system issues motion commands to the robot control unit, and the motion analysis unit in the control unit analyzes the motion commands.
  • the learning analysis unit performs further segmentation processing on the analyzed action information, and obtains the corresponding learning correction amount from the learning server after analysis.
  • the learning correction amount is added to the motion information output by the motion control unit and transmitted to the drive unit.
  • the corrected action information of the segmented action information is Z j
  • the similarity solution analysis is performed with Y k in the action value data W
  • the similarity G jk of the action information is obtained
  • G jk CanonicalWarpingDistance[Z j ,Y k ]
  • the learning correction amount of the new action corresponds to the learning correction amount L k in the learning server, and the above process is repeatedly solved until the learning correction amount of the segmented trajectory of the new action instruction is all obtained.
  • the threshold ⁇ 2 here is also obtained based on the experimental adjustment of different models, different loads, different speeds, different inertias and other working conditions, and is preset.
  • the new motion instruction does not need to spend time and effort to install and re-install sensors, re-learn, and the learning analysis part analyzes the motion information, Get the learning correction amount directly from the learning server.
  • the present invention uses the time warping distance to construct a learning server system and use it, but it is not limited to this method, and other methods of calculating similarity may also be used. , Such as error sum of squares, least squares, correlation coefficient, neural network algorithm, fuzzy control algorithm, genetic algorithm and simulated annealing algorithm.
  • the mechanical equipment learning control of the present invention is optimized control for the processing technology, including the motion control and technological process control of the mechanical equipment. Construct a learning and analysis section in the control section of machinery and equipment. During the learning process of machinery and equipment, process and analyze the process information and learning correction amount stored in the storage section, and use time warping distance to integrate process value data and organize The post-process value data is saved to the learning server.
  • the process value data stored in the learning server can be used not only for the learned mechanical equipment, but also for other unlearned mechanical equipment.
  • the learning server can be updated according to the requirements to ensure the high efficiency of the mechanical equipment performance.

Landscapes

  • Engineering & Computer Science (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Manipulator (AREA)
  • Feedback Control In General (AREA)

Abstract

A learning control method for a mechanical apparatus, and a mechanical apparatus learning control system having a learning function. A learning server, a mechanical control unit, a driving unit, a teaching system and a sensor are provided for a mechanical apparatus; a learning analysis unit is constructed in the mechanical control unit; during a learning process of the mechanical apparatus, segmented arrangement and analysis are carried out on process information and learning correction of an action; and process value data is integrated, and the arranged process value data is saved in the learning server. Process value information of a mechanical apparatus is recorded, a new processing process is analyzed and learning correction is acquired from a learning server according to an analysis result, so that there is no need to re-use a sensor to re-learn the processing process, thereby improving production efficiency.

Description

[根据细则37.2由ISA制定的发明名称] 机械设备的控制学习方法和具备学习功能的机械设备控制学习系统[Name of invention formulated by ISA according to Rule 37.2]  Mechanical equipment control learning method and mechanical equipment control learning system with learning function 技术领域Technical field
本发明涉及机器学习技术领域,涉及机械设备学习控制,为一种具备学习功能的机械设备学习方法及其服务器系统。The invention relates to the technical field of machine learning, and relates to machine equipment learning control, which is a machine equipment learning method with a learning function and a server system thereof.
背景技术Background technique
机械设备学习控制是对加工工艺进行优化的控制。加工工艺一般包括粗加工、精加工、装配、检验、包装,机械设备通过对上述过程进行学习,从而实现对加工工艺进行优化的控制。目前学者、工程师已经将学习控制运用于机器人控制等场合下。如以下文献的介绍:Mechanical equipment learning control is the control that optimizes the processing technology. The processing technology generally includes roughing, finishing, assembling, inspection, and packaging. The mechanical equipment learns the above processes to achieve optimal control of the processing technology. At present, scholars and engineers have applied learning control to robot control and other occasions. As described in the following literature:
中国专利《具有学习控制功能的机器人》(CN102189550A),公开了一种具有学习控制功能的机器人,在位置控制对象的部位安装传感器,学习控制部通过传感器检测及动作的目标轨迹或者位置进行学习,得到学习修正量,外部传感器采集的数据有位置、速度、加速度、惯量等。这种方式依赖于相同预定的动作,不同的动作需要进行重新学习,使得之前的学习结果不能重复利用。The Chinese patent "Robot with Learning Control Function" (CN102189550A) discloses a robot with learning control function. A sensor is installed on a part of the position control object, and the learning control unit learns through the target trajectory or position detected and operated by the sensor. The learning correction amount is obtained, and the data collected by the external sensor includes position, speed, acceleration, inertia, etc. This method relies on the same predetermined action, and different actions need to be re-learned, so that the previous learning results cannot be reused.
中国专利申请《具备学习功能的机器人装置》(CN106965171A)公开了一种机器人装置,根据机器人进行预定动作时的、在预定动作中的通过传感器检测出的物理量,来计算用于改善机器人的预定动作的学习修正量,通过学习扩展部计算学习修正量与学习过的预定动作的信息之间的关系,对于新动作的信息,使用求出的关系,计算用于改善新动作的学习修正量,无需重新学习。该技术方案由于将学习扩展部内置在机械设备的控制部中,使得其他类似的机械设备不能使用该机械设备的学习结果,即存在其他机械设备需要再重新学习构建学习扩展部的问题。The Chinese patent application "Robot Device with Learning Function" (CN106965171A) discloses a robot device that calculates a predetermined motion for improving the robot based on the physical quantity detected by the sensor during the predetermined motion of the robot during the predetermined motion The learning correction amount is calculated by the learning expansion unit through the relationship between the learning correction amount and the information of the learned predetermined action. For the information of the new action, the calculated relationship is used to calculate the learning correction amount for improving the new action. study again. In this technical solution, since the learning expansion unit is built into the control unit of the mechanical equipment, other similar mechanical equipment cannot use the learning result of the mechanical equipment, that is, there is a problem that the other mechanical equipment needs to re-learn to construct the learning expansion unit.
发明内容Summary of the invention
本发明要解决的问题是:现有的机械设备学习控制方法学习对象单一,不具有通用性,对于新的动作指令或新的设备,往往需要重新学习,学习控制方法的效率不高。The problem to be solved by the present invention is that the existing mechanical equipment learning control method has a single learning object and is not universal. For new action instructions or new equipment, it is often necessary to re-learn, and the efficiency of the learning control method is not high.
本发明的技术方案为:一种机械设备的控制学习方法,对机械设备设置学习服务器、机械控制部、驱动部、示教系统和传感器,The technical solution of the present invention is: a control learning method of mechanical equipment, a learning server, a mechanical control section, a driving section, a teaching system and a sensor are provided for the mechanical equipment,
示教系统发出动作指令给机械控制部,驱动部用于驱动机械动作,传感器用于获得机械的实际动作信息;The teaching system sends action commands to the machine control part, the drive part is used to drive the machine action, and the sensor is used to obtain the actual action information of the machine;
机械控制部包括动作解析部、学习控制部、存储部、学习分析部以及动作控制部,动作解析部对示教系统发出的动作指令进行动作解析,动作解析部将解析后的动作信息发给学习控制部、存储部、学习分析部、以及动作控制部;动作控制部发出控制信息给驱动部,学习控制部对动作解析部传来的动作信息及传感器采集到的信息进行学习,由两者偏差得到学习修正量,并存储于存储部中,其中,机械对不同的预定动作进行学习,包括不同的位置、速度以及加速度情况下的动作指令;存储部中缓存所学习的动作信息及相应的学习修正量,并传输给学习分析部;学习分析部对动作信息即学习修正量进行整理,并将整理后的信息上传于学习服务器,The mechanical control unit includes a motion analysis unit, a learning control unit, a storage unit, a learning analysis unit, and a motion control unit. The motion analysis unit performs motion analysis on the motion commands issued by the teaching system, and the motion analysis unit sends the analyzed motion information to the learning The control unit, storage unit, learning analysis unit, and motion control unit; the motion control unit sends control information to the drive unit, and the learning control unit learns the motion information from the motion analysis unit and the information collected by the sensor, which deviates from the two The amount of learning correction is obtained and stored in the storage unit, where the machine learns different predetermined actions, including action commands in the case of different positions, speeds, and accelerations; the stored action information and corresponding learning are cached in the storage unit The correction amount is transmitted to the learning and analysis department; the learning and analysis department sorts out the action information, that is, the learning correction amount, and uploads the sorted information to the learning server.
当机械对指定动作指令学习完成后,针对示教系统下发的新的动作指令,学习分析部对该动作指令进行分析,根据分析结果从学习服务器中得到对应或相似动作的学习修正量,用于在新动作下对驱动部进行动作补偿,动作控制部根据动作解析部传来的新动作指令的动作信息进行机器人动作控制,将动作信号传给驱动部,驱动部根据动作信息和学习修正量,使机械按照指令动作进行执行。After the machine learns the specified action command, the learning analysis unit analyzes the new action command issued by the teaching system, and obtains the learning correction amount of the corresponding or similar action from the learning server according to the analysis result. In the new motion, the motion compensation is performed on the driving part. The motion control part performs robot motion control according to the motion information of the new motion command transmitted by the motion analysis part, and transmits the motion signal to the drive part. The drive part according to the motion information and the learning correction amount To make the machine execute according to the command action.
进一步的,学习控制部的学习及学习分析部的分析具体如下:Further, the learning of the learning control unit and the analysis of the learning analysis unit are as follows:
设机械根据预定动作指令进行N组动作轨迹,学习控制部学习得到对应N组轨迹的学习修正量,将动作轨迹N i分成M i段,则N组不同轨迹共有M组动作信息及对应的学习修正量,
Figure PCTCN2019086712-appb-000001
Provided machinery for N groups of motion track according to a predetermined operation instruction, the learning control section obtained by learning the learning correction amount corresponding to N sets of tracks, the movement trajectory N i is divided into M i segment, the N sets of different trajectories that there are M sets operation information and the corresponding study Correction amount,
Figure PCTCN2019086712-appb-000001
记一段动作信息为X k,对应的学习修正量为L k,动作信息包含相对位置、速度、加速度和惯量,k=1,2…M,对每段的轨迹数据进行修正,计算每段轨迹相对于轨迹起点的位置,得到修正后的轨迹位置数据,即得到修正后的动作信息Y k,设W k=[Y k,L k],构建得到动作价值数据W=[W 1,W 2,...W k,...W M], Record a piece of motion information as X k , the corresponding learning correction amount is L k , the motion information includes relative position, speed, acceleration and inertia, k=1, 2...M, correct the trajectory data of each section, calculate each trajectory Relative to the position of the starting point of the trajectory, the corrected trajectory position data is obtained, that is, the corrected action information Y k is obtained, and W k =[Y k ,L k ] is set to construct the action value data W=[W 1 ,W 2 ,...W k ,...W M ],
以时间翘曲距离求解两两修正后动作信息的相似度D pqSolve the similarity D pq of the motion information after pairwise correction with the time warping distance,
D pq=CanonicalWarpingDistance[Y p,Y q]p,q∈M,p≠q D pq = CanonicalWarpingDistance[Y p ,Y q ]p,q∈M,p≠q
并同样方式求解对应两两的学习修正量的相似度E pqAnd in the same way, solve the similarity E pq corresponding to the pairwise learning correction amount,
E pq=CanonicalWarpingDistance[L p,L q]p,q∈M,p≠q E pq = CanonicalWarpingDistance[L p ,L q ]p,q∈M,p≠q
则两两动作价值数据的相似度F pqThen the similarity F pq of the pairwise action value data,
F pq=αD pq+βE pq F pq = αD pq + βE pq
其中,α,β为权重系数;Among them, α, β are weight coefficients;
当F pq<ε 1,则认为动作价值数据W p、W q的信息类似,动作价值数据存在冗余;当M组数据两两经过相似度计算处理完后,对冗余的动作价值数据去除并进行整理,发送至学习服务器以供使用。 When F pq1 , the action value data W p and W q are considered to be similar, and the action value data is redundant; when the M sets of data are processed by similarity calculations, the redundant action value data is removed And organize and send to the learning server for use.
对于新的动作指令,将新动作的轨迹分段,对位置数据进行修正处理后,分段动作信息修正后的动作信息为Z j,与动作价值数据W中的Y k进行相似度求解分析,利用时间翘曲距离,得到动作信息的相似度G jkFor new action instructions, after segmenting the trajectory of the new action and correcting the position data, the corrected action information of the segmented action information is Z j , and the similarity solution analysis is performed with Y k in the action value data W, Using the time warping distance, the similarity G jk of the action information is obtained,
G jk=CanonicalWarpingDistance[Z j,Y k] G jk = CanonicalWarpingDistance[Z j ,Y k ]
当G ij<ε 2,则新动作的学习修正量对应学习服务器中的学习修正量L k,重复求解上述过程,直至新动作指令的分段轨迹的学习修正量全部得到。 When G ij2 , the learning correction amount of the new action corresponds to the learning correction amount L k in the learning server, and the above process is repeatedly solved until the learning correction amount of the segmented trajectory of the new action instruction is all obtained.
作为优选方式,计算动作信息和学习修正量的相似度时,使用的相似度计算方法包括时间翘曲距离算法、神经网络算法、模糊控制算法、遗传算法和模拟退火算法。As a preferred method, when calculating the similarity between the action information and the learning correction amount, the similarity calculation method used includes a time warping distance algorithm, a neural network algorithm, a fuzzy control algorithm, a genetic algorithm, and a simulated annealing algorithm.
进一步的对于新的动作指令,将获得的动作信息和学习修正量上传至学习服务器。Further, for the new action instruction, the obtained action information and learning correction amount are uploaded to the learning server.
本发明方法用于机械设备的动作优化控制,以及机械设备的过程优化控制。The method of the present invention is used to optimize the action of mechanical equipment and process optimization of mechanical equipment.
本发明还提出一种具备学习功能的机械设备服务器系统,包括学习服务器和机械控制部,机械设备自身具有动作指令系统、驱动部和传感器,动作指令系统用于发出动作指令给机械控制部,驱动部用于驱动机械设备动作,传感器用于获得机械设备的实际动作信息;学习服务器和机械控制部为设置有计算机程序的存储介质,所述计算机程序运行时实现权利要求1所述的方法。The present invention also proposes a mechanical equipment server system with a learning function, including a learning server and a mechanical control unit. The mechanical equipment itself has an action instruction system, a drive unit, and a sensor. The action instruction system is used to issue an action instruction to the machine control unit, drive The part is used to drive the operation of the mechanical device, and the sensor is used to obtain the actual action information of the mechanical device; the learning server and the mechanical control part are storage media provided with a computer program that implements the method of claim 1 when the computer program runs.
学习服务器的设置方式包括机械设备局域网服务器、企业服务器和云服务器。The setting method of learning server includes mechanical equipment LAN server, enterprise server and cloud server.
本发明提供一种具备学习功能的机械设备学习方法及其服务器系统,通过记录机械设备的工艺价值信息,对新的加工工艺进行分析,并根据分析结果从学习服务器获取学习修正量,无需再使用传感器对加工工艺进行重新学习。具有以下有益效果:The invention provides a mechanical equipment learning method with a learning function and a server system thereof. By recording the process value information of the mechanical equipment, the new processing technology is analyzed, and the learning correction amount is obtained from the learning server according to the analysis result, without the need to use The sensor re-learns the processing technology. Has the following beneficial effects:
1)本发明提出一种机械设备的控制学习方法,通过对预定指令的学习,学习机械动作的工艺价值信息,构建学习服务器,当遇到新动作指令或新设备时,可以根据新动作指令或新机械设备,动作分析部针对新的动作指令进行分析,根据匹配学习服务器结果,可获取已学习的学习修正量,无需重复学习。1) The present invention proposes a control learning method for mechanical equipment. By learning predetermined instructions and learning the technological value information of mechanical actions, a learning server is constructed. When a new action instruction or new device is encountered, the new action instruction or For new machinery and equipment, the motion analysis unit analyzes the new motion instructions, and based on the results of the matching learning server, can obtain the learned learning correction amount without repeated learning.
2)本发明提出的机械设备的控制学习方法及其服务器系统,无需与固定的机械设备绑定,可以灵活更换到不同的机械设备上运行,学习服务器中的工艺价值信息越多,即学习过越多的动作指令,本发明就能够适用于越多的设备。2) The control learning method and server system of the mechanical equipment proposed in the present invention do not need to be bound to fixed mechanical equipment, and can be flexibly replaced and run on different mechanical equipment. The more process value information in the learning server, the more learned The more action instructions, the more devices the invention can be applied to.
3)本发明的学习服务器系统,能够提高机械设备的工作效率、降低机械设备成本 及生产线系统成本。3) The learning server system of the present invention can improve the working efficiency of mechanical equipment, reduce the cost of mechanical equipment and the cost of production line systems.
4)本发明方法中,动作分析部针对新的动作指令进行分析,根据匹配学习服务器的结果,用新动作信息与学习修正量的数据更新机械设备的学习服务器,使得本发明的学习服务器不断自我充实。4) In the method of the present invention, the motion analysis unit analyzes the new motion command, and updates the learning server of the mechanical device with the new motion information and the data of the learning correction amount according to the result of the matching learning server, so that the learning server of the present invention continuously self Enrichment.
5)本发明方法能够用于协助抑制机械设备动作过程中的振动,实现敏捷的动作。5) The method of the present invention can be used to assist in suppressing vibrations during the operation of mechanical equipment to achieve agile movements.
6)本发明方法能够从多角度,如节拍、功耗、运动精度等,优化机械设备的动作,提高生产效率。6) The method of the present invention can optimize the action of mechanical equipment from multiple angles, such as beat, power consumption, motion accuracy, etc., and improve production efficiency.
7)本发明方法能够从多角度,铸造、锻压等,优化机械设备的加工工艺,提高机械设备的产品加工质量。7) The method of the present invention can optimize the processing technology of mechanical equipment from multiple angles, casting, forging, etc., and improve the product processing quality of mechanical equipment.
8)本发明所建立的学习服务器系统,可更换设备使用,其他机械设备无需再重新学习,动作分析部针对新动作可直接从学习服务器获取对应的学习修正量。8) The learning server system established by the present invention can be used by replacing equipment, and other mechanical equipment does not need to be re-learned. The motion analysis unit can directly obtain the corresponding learning correction amount from the learning server for the new motion.
附图说明BRIEF DESCRIPTION
图1为本发明方法示意图。Figure 1 is a schematic diagram of the method of the present invention.
图2为本发明方法中,机器人学习中的示意图。2 is a schematic diagram of robot learning in the method of the present invention.
图3为本发明构建学习服务器的流程图。FIG. 3 is a flowchart of constructing a learning server in the present invention.
图4为本发明方法中,机器人运行中的示意图。4 is a schematic diagram of the robot in operation in the method of the present invention.
图5为本发明中学习服务器系统的使用流程图。FIG. 5 is a flowchart of using the learning server system in the present invention.
具体实施方式detailed description
本发明提供一种具备学习功能的机械设备学习方法及其服务器系统,通过记录机械设备的工艺价值信息,对新的加工工艺进行分析,并根据分析结果从学习服务器获取学习修正量,无需再使用传感器对加工工艺进行重新学习。为了描述机械设备方便,本发明实施例以工业机器人和动作优化控制为例进行说明,但不局限于工业机器人,还包括伺服驱动器等其他机械设备,也不局限于动作优化控制,还包括过程优化控制。The invention provides a mechanical equipment learning method with a learning function and a server system thereof. By recording the process value information of the mechanical equipment, the new processing technology is analyzed, and the learning correction amount is obtained from the learning server according to the analysis result, without the need to use The sensor re-learns the processing technology. In order to describe the convenience of mechanical equipment, the embodiments of the present invention take industrial robots and motion optimization control as examples, but are not limited to industrial robots, but also include other mechanical equipment such as servo drives, and are not limited to motion optimization control, but also include process optimization. control.
以下,参照附图,说明本发明的实施例所涉及的机械设备学习服务器系统,其中机械设备以工业机器人为例,即下面关于机器人认为是机械设备的描述。Hereinafter, a mechanical device learning server system according to an embodiment of the present invention will be described with reference to the drawings, in which the mechanical device takes an industrial robot as an example, that is, the following description regarding the robot is regarded as a mechanical device.
图1表示本发明的实施例所涉及的机械设备学习方法及其服务器系统的示意图。对机器人备设置学习服务器、机器人控制部、驱动部、示教系统和传感器,示教系统下发动作指令传输给机器人控制部,以期望机器人能够按照期望的动作进行执行。驱动部用 于驱动机器人动作。FIG. 1 shows a schematic diagram of a mechanical device learning method and a server system thereof according to an embodiment of the present invention. The robot is equipped with a learning server, robot control unit, drive unit, teaching system, and sensors. The teaching system sends motion commands to the robot control unit, hoping that the robot can perform the desired actions. The drive unit is used to drive the robot to move.
机器人控制部由动作解析部、学习控制部、存储部、学习分析部以及动作控制部组成。动作解析部主要是对示教系统传过来的动作指令进行动作解析,动作解析包括运动学规划、动力学分析等,动作解析部将解析后的动作信息发给学习控制部、存储部、学习分析部、以及动作控制部。学习控制部对动作解析部传来的动作信息及传感器采集到的信息进行学习,得到学习修正量后存储于存储部中。机器人在工作空间中以预定的动作进行学习,预定的动作需要包含不同的位置、速度、加速度等情况下,以保证机器人学习的质量。存储部中存储学习的动作信息及相应的学习修正量,当学习完成后,传输给学习分析部。当学习完成后,学习控制部和存储部可以不需要。学习分析部对动作价值信息进行整理,去除冗余的信息,并将动作价值信息上传于学习服务器。当机器人学习完成后,针对示教系统下发的新的动作指令,学习分析部对该动作进行一定分析,根据分析结果从学习服务器中得到对应或相似动作的学习修正量,对驱动部进行动作补偿。动作控制部对动作解析部传来的动作信息进行机器人动作控制,将动作信号传给驱动部。驱动部主要由伺服驱动器、伺服电机组成,伺服驱动器通过位置、速度或者电流反馈控制驱动伺服电机动作,使机器人机构部按照预期的动作进行执行。The robot control unit is composed of a motion analysis unit, a learning control unit, a storage unit, a learning analysis unit, and a motion control unit. The motion analysis unit mainly analyzes the motion commands transmitted from the teaching system. The motion analysis includes kinematics planning and dynamic analysis. The motion analysis unit sends the analyzed motion information to the learning control unit, storage unit, and learning analysis. Part, and motion control part. The learning control unit learns the motion information transmitted by the motion analysis unit and the information collected by the sensor, obtains the learning correction amount, and stores it in the storage unit. The robot learns with a predetermined motion in the work space. The predetermined motion needs to include different positions, speeds, accelerations, etc. to ensure the quality of the robot learning. The learning action information and the corresponding learning correction amount are stored in the storage unit, and when the learning is completed, it is transmitted to the learning analysis unit. When the learning is completed, the learning control unit and the storage unit may not be required. The learning analysis section sorts the action value information, removes redundant information, and uploads the action value information to the learning server. After the robot learning is completed, for the new action command issued by the teaching system, the learning analysis part analyzes the action, and obtains the learning correction amount of the corresponding or similar action from the learning server according to the analysis result, and acts on the drive part make up. The motion control unit performs robot motion control on the motion information transmitted from the motion analysis unit, and transmits the motion signal to the driving unit. The driving part is mainly composed of a servo drive and a servo motor. The servo drive drives the servo motor through position, speed or current feedback control to make the robot mechanism perform the expected action.
传感器安装在机器人的位置控制的对象部位,用于得到对象部位的位置、速度或加速度等物理信息。The sensor is installed on the target part of the robot's position control, and is used to obtain physical information such as the position, velocity, or acceleration of the target part.
上述实施例所涉及传感器包括编码器、位置传感器、速度传感器、加速度传感器、视觉传感器、力觉传感器、角速度传感器、陀螺传感器、惯性测量单元等。The sensors involved in the above embodiments include encoders, position sensors, speed sensors, acceleration sensors, vision sensors, force sensors, angular velocity sensors, gyro sensors, inertial measurement units, and the like.
接下来叙述学习服务器系统的构建及使用过程。The following describes the construction and use of the learning server system.
图3表示构建学习服务器的流程图。首先,示教系统下发运动指令给机器人控制部,机器人控制部的动作解析部对动作指令进行动作解析,学习控制部得到动作信息及外部传感器采集得到的数据进行学习,得到整体动作的学习修正量,并将动作信息及学习修正量保存至存储部。其次,学习分析部从存储部取出动作信息及学习修正量,并将学习修正量和动作信息进行分段处理,构建动作价值数据,通过分析后去除冗余结果。最后,学习分析部将最终的分析结果上传于学习服务器。Figure 3 shows a flow chart for building a learning server. First, the teaching system issues motion commands to the robot control unit. The motion analysis unit of the robot control unit analyzes the motion commands. The learning control unit obtains the motion information and the data collected by the external sensor to learn to obtain the learning correction of the overall motion. And save the motion information and learning correction amount to the storage unit. Next, the learning analysis unit takes out the action information and the learning correction amount from the storage unit, and processes the learning correction amount and the action information in segments to construct action value data, and removes the redundant result after analysis. Finally, the learning analysis department uploads the final analysis results to the learning server.
记机器人在工作空间中动作N组位置、速度、加速度、惯量等不同的轨迹。安装在对象部位的传感器以加速度传感器为例,传感器检测得到的物理量是机器人位置控制的对象部位加速度。经过一次积分得到速度成分,二次积分得到位置成分。Remember the different trajectories of the position, speed, acceleration, inertia, etc. of N groups of robots in the working space. An acceleration sensor is used as an example of the sensor installed in the target part, and the physical quantity detected by the sensor is the acceleration of the target part controlled by the robot position. The velocity component is obtained after one integration, and the position component is obtained after two integrations.
设机械根据预定动作指令进行N组动作轨迹,学习控制部学习得到对应N组轨迹 的学习修正量,将动作轨迹N i分成M i段,则N组不同轨迹共有M组动作信息及对应的学习修正量,
Figure PCTCN2019086712-appb-000002
M i可以根据轨迹的长短、插补点的多少来确定,第一组动作轨迹分成M 1段,第二组轨迹分为M 2段,以此类推。
Provided machinery for N groups of motion track according to a predetermined operation instruction, the learning control section obtained by learning the learning correction amount corresponding to N sets of tracks, the movement trajectory N i is divided into M i segment, the N sets of different trajectories that there are M sets operation information and the corresponding study Correction amount,
Figure PCTCN2019086712-appb-000002
M i can be determined according to the length of the trajectory and the number of interpolation points. The first group of trajectories is divided into M 1 segments, the second group of trajectories is divided into M 2 segments, and so on.
记一段动作信息为X k,对应的学习修正量为L k,动作信息包含相对位置、速度、加速度和惯量,k=1,2…M,对每段的轨迹数据进行修正,计算每段轨迹相对于轨迹起点的位置,得到修正后的轨迹位置数据,即得到修正后的动作信息Y k,设W k=[Y k,L k],构建得到动作价值数据W=[W 1,W 2,...W k,...W M], Record a piece of motion information as X k , the corresponding learning correction amount is L k , the motion information includes relative position, speed, acceleration and inertia, k=1, 2...M, correct the trajectory data of each section, calculate each trajectory Relative to the position of the starting point of the trajectory, the corrected trajectory position data is obtained, that is, the corrected action information Y k is obtained, and W k =[Y k ,L k ] is set to construct the action value data W=[W 1 ,W 2 ,...W k ,...W M ],
以时间翘曲距离求解两两修正后动作信息的相似度D pqSolve the similarity D pq of the motion information after pairwise correction with the time warping distance,
D pq=CanonicalWarpingDistance[Y p,Y q]  p,q∈M,p≠q D pq = CanonicalWarpingDistance[Y p ,Y q ] p,q∈M,p≠q
并同样方式求解对应两两的学习修正量的相似度E pqAnd in the same way, solve the similarity E pq corresponding to the pairwise learning correction amount,
E pq=CanonicalWarpingDistance[L p,L q]  p,q∈M,p≠q E pq = CanonicalWarpingDistance[L p ,L q ] p,q∈M,p≠q
则两两动作价值数据的相似度F pqThen the similarity F pq of the pairwise action value data,
F pq=αD pq+βE pq F pq = αD pq + βE pq
其中,α,β为权重系数;Among them, α, β are weight coefficients;
当F pq<ε 1,则认为动作价值数据W p、W q的信息类似,动作价值数据存在冗余。ε 1是相似度阈值,对应不同机型、不同负载、不同速度、不同惯量等实际工况,可以根据实验事先调整得出。 When F pq1 , the action value data W p and W q are considered to be similar, and the action value data is redundant. ε 1 is the similarity threshold, corresponding to the actual operating conditions of different models, different loads, different speeds, different inertias, etc., which can be adjusted in advance according to the experiment.
当M组数据两两经过相似度计算处理完后,对冗余的动作价值数据去除并进行整理,发送至学习服务器以供本机器人和其他机器人使用。After the M sets of data are processed by similarity calculations, the redundant action value data is removed and sorted, and sent to the learning server for use by the robot and other robots.
图5表示学习服务器系统的使用流程图。首先,示教系统下发动作指令至机器人控制部,控制部中的动作解析部对动作指令进行解析。学习分析部对解析后的动作信息进行进一步分段处理,通过分析后从学习服务器获取对应的学习修正量。最后,将学习修正量与动作控制部输出的动作信息相加,传输给驱动部。Fig. 5 shows a flow chart of the use of the learning server system. First, the teaching system issues motion commands to the robot control unit, and the motion analysis unit in the control unit analyzes the motion commands. The learning analysis unit performs further segmentation processing on the analyzed action information, and obtains the corresponding learning correction amount from the learning server after analysis. Finally, the learning correction amount is added to the motion information output by the motion control unit and transmitted to the drive unit.
对于新的动作指令,将新动作的轨迹分段,对位置数据进行修正处理后,分段动作信息修正后的动作信息为Z j,与动作价值数据W中的Y k进行相似度求解分析,利用时间翘曲距离,得到动作信息的相似度G jkFor new action instructions, after segmenting the trajectory of the new action and correcting the position data, the corrected action information of the segmented action information is Z j , and the similarity solution analysis is performed with Y k in the action value data W, Using the time warping distance, the similarity G jk of the action information is obtained,
G jk=CanonicalWarpingDistance[Z j,Y k] G jk = CanonicalWarpingDistance[Z j ,Y k ]
当G ij<ε 2,则新动作的学习修正量对应学习服务器中的学习修正量L k,重复求解上述过程,直至新动作指令的分段轨迹的学习修正量全部得到。这里的阈值ε 2也是根据不同机型、不同负载、不同速度、不同惯量等工况的实验调整得出,预先设定。 When G ij2 , the learning correction amount of the new action corresponds to the learning correction amount L k in the learning server, and the above process is repeatedly solved until the learning correction amount of the segmented trajectory of the new action instruction is all obtained. The threshold ε 2 here is also obtained based on the experimental adjustment of different models, different loads, different speeds, different inertias and other working conditions, and is preset.
如以上说明,根据本发明方法及系统,在学习过的机器人和未经任何学习的机器人上,新的动作指令无需耗费时间和精力再安装设置传感器、重新学习,学习分析部对动作信息分析,直接从学习服务器中得到学习修正量。As explained above, according to the method and system of the present invention, on the learned robot and the robot without any learning, the new motion instruction does not need to spend time and effort to install and re-install sensors, re-learn, and the learning analysis part analyzes the motion information, Get the learning correction amount directly from the learning server.
本发明使用时间翘曲距离构建学习服务器系统并进行运用,但不局限于此种方法,其他计算相似度的方法也可以使用。,如误差平方和、最小二乘法、相关系数、神经网络算法、模糊控制算法、遗传算法和模拟退火算法。The present invention uses the time warping distance to construct a learning server system and use it, but it is not limited to this method, and other methods of calculating similarity may also be used. , Such as error sum of squares, least squares, correlation coefficient, neural network algorithm, fuzzy control algorithm, genetic algorithm and simulated annealing algorithm.
本发明机械设备学习控制是针对加工工艺进行的优化控制,包括机械设备的动作控制和工艺过程控制。在机械设备的控制部中构造学习分析部,在机械设备学习过程中,对存储在存储部的工艺信息及学习修正量进行分段整理分析,利用时间翘曲距离整合工艺价值数据,并将整理后的工艺价值数据保存至学习服务器中。存储在学习服务器的工艺价值数据不仅可以供学习的机械设备使用,也可以提供给其他未经学习的机械设备使用。学习服务器可再根据需求进行更新,以保证机械设备性能的高效性。The mechanical equipment learning control of the present invention is optimized control for the processing technology, including the motion control and technological process control of the mechanical equipment. Construct a learning and analysis section in the control section of machinery and equipment. During the learning process of machinery and equipment, process and analyze the process information and learning correction amount stored in the storage section, and use time warping distance to integrate process value data and organize The post-process value data is saved to the learning server. The process value data stored in the learning server can be used not only for the learned mechanical equipment, but also for other unlearned mechanical equipment. The learning server can be updated according to the requirements to ensure the high efficiency of the mechanical equipment performance.

Claims (8)

  1. 一种机械设备的控制学习方法,其特征是对机械设备设置学习服务器、机械控制部、驱动部、示教系统和传感器,A control learning method for mechanical equipment, which is characterized by setting a learning server, a mechanical control section, a driving section, a teaching system and sensors for the mechanical equipment,
    示教系统发出动作指令给机械控制部,驱动部用于驱动机械动作,传感器用于获得机械的实际动作信息;The teaching system sends action commands to the machine control part, the drive part is used to drive the machine action, and the sensor is used to obtain the actual action information of the machine;
    机械控制部包括动作解析部、学习控制部、存储部、学习分析部以及动作控制部,动作解析部对示教系统发出的动作指令进行动作解析,动作解析部将解析后的动作信息发给学习控制部、存储部、学习分析部、以及动作控制部;动作控制部发出控制信息给驱动部,学习控制部对动作解析部传来的动作信息及传感器采集到的信息进行学习,由两者偏差得到学习修正量,并存储于存储部中,其中,机械对不同的预定动作进行学习,包括不同的位置、速度以及加速度情况下的动作指令;存储部中缓存所学习的动作信息及相应的学习修正量,并传输给学习分析部;学习分析部对动作信息即学习修正量进行整理,并将整理后的信息上传于学习服务器,The mechanical control unit includes a motion analysis unit, a learning control unit, a storage unit, a learning analysis unit, and a motion control unit. The motion analysis unit performs motion analysis on the motion commands issued by the teaching system, and the motion analysis unit sends the analyzed motion information to the learning The control unit, storage unit, learning analysis unit, and motion control unit; the motion control unit sends control information to the drive unit, and the learning control unit learns the motion information from the motion analysis unit and the information collected by the sensor, which deviates from the two The amount of learning correction is obtained and stored in the storage unit, where the machine learns different predetermined actions, including action commands in the case of different positions, speeds, and accelerations; the stored action information and corresponding learning are cached in the storage unit The correction amount is transmitted to the learning and analysis department; the learning and analysis department sorts out the action information, that is, the learning correction amount, and uploads the sorted information to the learning server.
    当机械对指定动作指令学习完成后,针对示教系统下发的新的动作指令,学习分析部对该动作指令进行分析,根据分析结果从学习服务器中得到对应或相似动作的学习修正量,用于在新动作下对驱动部进行动作补偿,动作控制部根据动作解析部传来的新动作指令的动作信息进行机器人动作控制,将动作信号传给驱动部,驱动部根据动作信息和学习修正量,使机械按照指令动作进行执行。After the machine learns the specified action command, the learning analysis unit analyzes the new action command issued by the teaching system, and obtains the learning correction amount of the corresponding or similar action from the learning server according to the analysis result. In the new motion, the motion compensation is performed on the driving part. The motion control part performs robot motion control according to the motion information of the new motion command transmitted by the motion analysis part, and transmits the motion signal to the drive part. The drive part according to the motion information and the learning correction amount To make the machine execute according to the command action.
  2. 根据权利要求1所述的一种机械设备的控制学习方法,其特征是学习控制部的学习及学习分析部的分析具体如下:The control learning method for mechanical equipment according to claim 1, wherein the learning of the learning control unit and the analysis of the learning analysis unit are as follows:
    设机械根据预定动作指令进行N组动作轨迹,学习控制部学习得到对应N组轨迹的学习修正量,将动作轨迹N i分成M i段,则N组不同轨迹共有M组动作信息及对应的学习修正量,
    Figure PCTCN2019086712-appb-100001
    Provided machinery for N groups of motion track according to a predetermined operation instruction, the learning control section obtained by learning the learning correction amount corresponding to N sets of tracks, the movement trajectory N i is divided into M i segment, the N sets of different trajectories that there are M sets operation information and the corresponding study Correction amount,
    Figure PCTCN2019086712-appb-100001
    记一段动作信息为X k,对应的学习修正量为L k,动作信息包含相对位置、速度、加速度和惯量,k=1,2…M,对每段的轨迹数据进行修正,计算每段轨迹相对于轨迹起点的位置,得到修正后的轨迹位置数据,即得到修正后的动作信息Y k,设W k=[Y k,L k],构建得到动作价值数据W=[W 1,W 2,...W k,...W M], Record a piece of motion information as X k , the corresponding learning correction amount is L k , the motion information includes relative position, speed, acceleration and inertia, k=1, 2...M, correct the trajectory data of each section, calculate each trajectory Relative to the position of the starting point of the trajectory, the corrected trajectory position data is obtained, that is, the corrected action information Y k is obtained, and W k =[Y k ,L k ] is set to construct the action value data W=[W 1 ,W 2 ,...W k ,...W M ],
    以时间翘曲距离求解两两修正后动作信息的相似度D pqSolve the similarity D pq of the motion information after pairwise correction with the time warping distance,
    D pq=CanonicalWarpingDistance[Y p,Y q] p,q∈M,p≠q D pq = CanonicalWarpingDistance[Y p ,Y q ] p,q∈M,p≠q
    并同样方式求解对应两两的学习修正量的相似度E pqAnd in the same way, solve the similarity E pq corresponding to the pairwise learning correction amount,
    E pq=CanonicalWarpingDistance[L p,L q] p,q∈M,p≠q E pq = CanonicalWarpingDistance[L p ,L q ] p,q∈M,p≠q
    则两两动作价值数据的相似度F pqThen the similarity F pq of the pairwise action value data,
    F pq=αD pq+βE pq F pq = αD pq + βE pq
    其中,α,β为权重系数;Among them, α, β are weight coefficients;
    当F pq<ε 1,ε 1是设置的相似度阈值,则认为动作价值数据W p、W q的信息类似,动作价值数据存在冗余; When F pq1 , ε 1 is the set similarity threshold, the information of action value data W p and W q is considered to be similar, and action value data is redundant;
    当M组数据两两经过相似度计算处理完后,对冗余的动作价值数据去除并进行整理,发送至学习服务器以供使用。After the M sets of data are processed by similarity calculations, the redundant action value data is removed and sorted, and sent to the learning server for use.
  3. 根据权利要求2所述的一种机械设备的控制学习方法,其特征是对于新的动作指令,将新动作的轨迹分段,对位置数据进行修正处理后,分段动作信息修正后的动作信息为Z j,与动作价值数据W中的Y k进行相似度求解分析,利用时间翘曲距离,得到动作信息的相似度G jkA control learning method for mechanical equipment according to claim 2, characterized in that for a new motion command, the trajectory of the new motion is segmented, and after the position data is corrected, the motion information after the segmented motion information is corrected For Z j , perform a similarity solution analysis with Y k in the action value data W, and use the time warping distance to obtain the similarity of action information G jk ,
    G jk=CanonicalWarpingDistance[Z j,Y k] G jk = CanonicalWarpingDistance[Z j ,Y k ]
    当G ij<ε 2,ε 2为设置的阈值,则新动作的学习修正量对应学习服务器中的学习修正量L k,重复求解上述过程,直至新动作指令的分段轨迹的学习修正量全部得到。 When G ij2 , ε 2 is the set threshold, the learning correction amount of the new action corresponds to the learning correction amount L k in the learning server, and the above process is repeatedly solved until the learning correction amount of the segmented trajectory of the new action instruction is all get.
  4. 根据权利要求2或3所述的一种机械设备的控制学习方法,其特征是计算动作信息和学习修正量的相似度时,使用的相似度计算方法还包括误差平方和、最小二乘法、相关系数、神经网络算法、模糊控制算法、遗传算法和模拟退火算法。A control learning method for mechanical equipment according to claim 2 or 3, characterized in that when calculating the similarity between the action information and the learning correction amount, the similarity calculation method used further includes sum of squared errors, least squares, correlation Coefficient, neural network algorithm, fuzzy control algorithm, genetic algorithm and simulated annealing algorithm.
  5. 根据权利要求1所述的一种机械设备的控制学习方法,其特征是对于新的动作指令,将获得的动作信息和学习修正量上传至学习服务器。The control learning method of a mechanical device according to claim 1, wherein for the new motion instruction, the obtained motion information and the learning correction amount are uploaded to the learning server.
  6. 根据权利要求1所述的一种机械设备的控制学习方法,其特征是用于机械设备的动作优化控制,以及机械设备的过程优化控制。A control learning method for mechanical equipment according to claim 1, characterized in that it is used for optimal control of the movement of mechanical equipment and process optimization of the mechanical equipment.
  7. 一种具备学习功能的机械设备控制学习系统,其特征是包括学习服务器和机械控制部,机械设备自身具有动作指令系统、驱动部和传感器,动作指令系统用于发出动作指令给机械控制部,驱动部用于驱动机械设备动作,传感器用于获得机械设备的实际动作信息;学习服务器和机械控制部为设置有计算机程序的存储介质,所述计算机程序运行时实现权利要求1所述的方法。A mechanical equipment control learning system with a learning function is characterized by including a learning server and a mechanical control part. The mechanical equipment itself has an action command system, a driving part and a sensor. The action command system is used to issue a motion command to the mechanical control part and drive The part is used to drive the operation of the mechanical device, and the sensor is used to obtain the actual action information of the mechanical device; the learning server and the mechanical control part are storage media provided with a computer program that implements the method of claim 1 when the computer program runs.
  8. 根据权利要求6所述的机械设备控制学习系统,其特征是学习服务器的设置方式包括机械设备局域网服务器、企业服务器和云服务器。The machine equipment control learning system according to claim 6, characterized in that the learning server is arranged in a manner including a machine equipment LAN server, an enterprise server, and a cloud server.
PCT/CN2019/086712 2018-12-29 2019-05-13 Learning control method for mechanical apparatus, and mechanical apparatus learning control system having learning function WO2020133881A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811642219.6A CN109773787A (en) 2018-12-29 2018-12-29 The Schistosomiasis control method of mechanical equipment and its server system for having learning functionality
CN201811642219.6 2018-12-29

Publications (1)

Publication Number Publication Date
WO2020133881A1 true WO2020133881A1 (en) 2020-07-02

Family

ID=66499540

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/086712 WO2020133881A1 (en) 2018-12-29 2019-05-13 Learning control method for mechanical apparatus, and mechanical apparatus learning control system having learning function

Country Status (2)

Country Link
CN (1) CN109773787A (en)
WO (1) WO2020133881A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111890350A (en) * 2020-06-12 2020-11-06 深圳先进技术研究院 Robot, method of controlling the same, and computer-readable storage medium
CN112054738A (en) * 2020-09-09 2020-12-08 福安市高博特电机科技有限公司 Automatic adjusting type frequency conversion controller

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004243475A (en) * 2003-02-14 2004-09-02 Seiko Epson Corp Robot and robot system
US20150231784A1 (en) * 2012-03-23 2015-08-20 Irobot Corporation Robot controller learning system
CN106695793A (en) * 2017-01-18 2017-05-24 宁波韦尔德斯凯勒智能科技有限公司 Device and method for active compliance control for XYZR four-axis drilling robot
CN106965171A (en) * 2016-01-14 2017-07-21 发那科株式会社 Possesses the robot device of learning functionality
WO2018158601A1 (en) * 2017-03-01 2018-09-07 Omron Corporation Monitoring devices, monitored control systems and methods for programming such devices and systems
CN108687766A (en) * 2017-03-31 2018-10-23 发那科株式会社 Control device, machine learning device and the machine learning method of robot

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101661599B1 (en) * 2014-08-20 2016-10-04 한국과학기술연구원 Robot motion data processing system using motion data reduction/restoration compatible to hardware limits
US9630318B2 (en) * 2014-10-02 2017-04-25 Brain Corporation Feature detection apparatus and methods for training of robotic navigation
CN105045900A (en) * 2015-08-05 2015-11-11 石河子大学 Data extraction method and apparatus
JP6514166B2 (en) * 2016-09-16 2019-05-15 ファナック株式会社 Machine learning apparatus, robot system and machine learning method for learning robot operation program
CN107103029B (en) * 2017-03-03 2021-08-20 北京光年无限科技有限公司 Image data processing method for robot
CN107657266B (en) * 2017-08-03 2021-01-08 华北电力大学(保定) Load curve clustering method based on improved spectrum multi-manifold clustering

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004243475A (en) * 2003-02-14 2004-09-02 Seiko Epson Corp Robot and robot system
US20150231784A1 (en) * 2012-03-23 2015-08-20 Irobot Corporation Robot controller learning system
CN106965171A (en) * 2016-01-14 2017-07-21 发那科株式会社 Possesses the robot device of learning functionality
CN106695793A (en) * 2017-01-18 2017-05-24 宁波韦尔德斯凯勒智能科技有限公司 Device and method for active compliance control for XYZR four-axis drilling robot
WO2018158601A1 (en) * 2017-03-01 2018-09-07 Omron Corporation Monitoring devices, monitored control systems and methods for programming such devices and systems
CN108687766A (en) * 2017-03-31 2018-10-23 发那科株式会社 Control device, machine learning device and the machine learning method of robot

Also Published As

Publication number Publication date
CN109773787A (en) 2019-05-21

Similar Documents

Publication Publication Date Title
CN110355751B (en) Control device and machine learning device
CN106647282B (en) Six-degree-of-freedom robot trajectory planning method considering tail end motion error
Ollero et al. Predictive path tracking of mobile robots. Application to the CMU Navlab
US11092965B2 (en) Method and device for driving dynamics control for a transportation vehicle
CN110815190B (en) Industrial robot dragging demonstration method and system
Mueller et al. Iterative learning of feed-forward corrections for high-performance tracking
CN110103225B (en) Data-driven method and device for controlling repeated motion of mechanical arm
WO2020133881A1 (en) Learning control method for mechanical apparatus, and mechanical apparatus learning control system having learning function
CN108717262B (en) Special-shaped curved surface tracking method and system based on moment feature learning neural network
CN111679579A (en) Sliding mode prediction fault-tolerant control method for fault system of sensor and actuator
JP2019098439A (en) Vibration suppression device
CN110978014A (en) Control device and control system
Navarro-Alarcon et al. Visually servoed deformation control by robot manipulators
CN115122325A (en) Robust visual servo control method for anthropomorphic manipulator with view field constraint
Fahimi et al. An alternative closed-loop vision-based control approach for Unmanned Aircraft Systems with application to a quadrotor
Yu et al. Position-based visual servo control of dual robotic arms with unknown kinematic models: A cerebellum-inspired approach
CN112650217A (en) Robot trajectory tracking strategy dynamic optimization method based on evaluation function
CN114378830B (en) Robot wrist joint singular avoidance method and system
CN110788859B (en) Controller parameter universe self-adaptive adjustment system
CN114952858A (en) Industrial robot trajectory tracking method and system based on friction compensation control
CN108107882A (en) Service robot automatic Calibration and detecting system based on optical motion tracking
CN115469665B (en) Intelligent wheelchair target tracking control method and system suitable for dynamic environment
JP2020055095A (en) Control device and control system
CN115179290B (en) Mechanical arm and track control method and device thereof
Chen et al. Adaptive Stiffness Visual Servoing for Unmanned Aerial Manipulators With Prescribed Performance

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19906583

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19906583

Country of ref document: EP

Kind code of ref document: A1