CN117021118A

CN117021118A - Dynamic compensation method for digital twin track error of parallel robot

Info

Publication number: CN117021118A
Application number: CN202311287483.3A
Authority: CN
Inventors: 张宇廷; 王宗彦; 李梦龙; 高沛; 吴璞
Original assignee: North University of China
Current assignee: North University of China
Priority date: 2023-10-08
Filing date: 2023-10-08
Publication date: 2023-11-10
Anticipated expiration: 2043-10-08
Also published as: CN117021118B

Abstract

The invention belongs to the technical field of robot running track compensation, and solves the problem of poor virtual-real interaction performance of parallel robots. The method comprises the steps of generating a digital twin model of the parallel robot and constructing a parallel robot track sample data set; training and iterating the digital twin model by a dynamic coordination reinforcement learning method to obtain an optimal motion track under optimal displacement and speed; compensating the speed error between the physical entity of the parallel robot and the optimized digital twin model by a symmetrical error compensation method; and integrating the digital twin frames to realize real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model. The invention is beneficial to improving the reconstruction capability of the digital twin model and the error control performance of the parallel robot, can realize the communication synchronization between the physical entity and the digital twin model and improve the virtual-real interaction rate of the physical entity and the digital twin model.

Description

A dynamic compensation method for trajectory error of digital twins of parallel robots

技术领域Technical field

本发明属于机器人运行轨迹补偿技术领域，具体涉及一种并联机器人数字孪生轨迹误差动态补偿方法。The invention belongs to the technical field of robot running trajectory compensation, and specifically relates to a dynamic compensation method for trajectory errors of parallel robot digital twins.

背景技术Background technique

目前，结合数字孪生技术的并联机器人领域的智能装箱，搬运等生产作业已经成为智能制造业的热点。但是，对于并联机器人而言，高精度、高速运行工作是其关键所在，大多数数字孪生系统框架无法满足并联机器人高精度、高速的需求。At present, production operations such as intelligent boxing and handling in the field of parallel robots combined with digital twin technology have become a hot topic in intelligent manufacturing. However, for parallel robots, high-precision and high-speed operation are the key. Most digital twin system frameworks cannot meet the high-precision and high-speed requirements of parallel robots.

当前数字孪生技术已经将虚实同步交互应用的较为成熟，但是并联机器人由于本体的刚性误差与装配误差，导致并联机器人在移动中位移的不同步；数字孪生技术中物理实体与数字孪生模型之间存在通信延迟，通信时间会导致并联机器人在同步中产生速度误差，导致两者的虚实交互率下降。The current digital twin technology has matured the application of virtual and real synchronous interaction. However, due to the rigidity error and assembly error of the parallel robot body, the displacement of the parallel robot is not synchronized during movement; in digital twin technology, there is a gap between the physical entity and the digital twin model. Communication delay and communication time will cause speed errors in the synchronization of parallel robots, resulting in a decrease in the virtual and real interaction rate between the two.

发明内容Contents of the invention

本发明为了解决现有技术中存在的上述至少一个技术问题，提供了一种并联机器人数字孪生轨迹误差动态补偿方法。In order to solve at least one of the above technical problems existing in the prior art, the present invention provides a parallel robot digital twin trajectory error dynamic compensation method.

本发明采用如下的技术方案实现：一种并联机器人数字孪生轨迹误差动态补偿方法，包括以下步骤：S1：构建并联机器人的虚拟模型，并对虚拟模型进行杆件约束和空间约束，生成并联机器人的数字孪生模型；S2：对数字孪生模型进行动力学分析，采集并联机器人的电机转矩与动能参数将其转换为轨迹位移与速度参数，并构建并联机器人轨迹样本数据集；S3：通过动态协调强化学习方法对数字孪生模型进行训练迭代，得到最优位移和速度下的最优运动轨迹；S4：通过对称式误差补偿法对并联机器人的物理实体和优化后的数字孪生模型间的速度误差进行补偿；S5：进行数字孪生框架的整合，实现并联机器人物理实体与数字孪生模型的实时同步交互。The present invention adopts the following technical solution to realize: a parallel robot digital twin trajectory error dynamic compensation method, which includes the following steps: S1: Construct a virtual model of the parallel robot, perform rod constraints and space constraints on the virtual model, and generate a parallel robot. Digital twin model; S2: Perform dynamic analysis on the digital twin model, collect the motor torque and kinetic energy parameters of the parallel robot, convert them into trajectory displacement and speed parameters, and construct a parallel robot trajectory sample data set; S3: Strengthen through dynamic coordination The learning method performs training iterations on the digital twin model to obtain the optimal motion trajectory under optimal displacement and speed; S4: Compensate the speed error between the physical entity of the parallel robot and the optimized digital twin model through the symmetric error compensation method ; S5: Integrate the digital twin framework to achieve real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model.

优选地，步骤S2中，构建并联机器人动力学集合，得到并联机器人的轨迹位移与速度/>；建立数字孪生模型的轨迹位移与速度数学模型，采集并联机器人轨迹样本数据，共采集N个样本数据点集合作为并联机器人轨迹样本数据集。Preferably, in step S2, a parallel robot dynamics set is constructed to obtain the trajectory displacement of the parallel robot. and speed/> ; Establish a mathematical model of trajectory displacement and speed of the digital twin model, collect parallel robot trajectory sample data, and collect a total of N sample data point sets as a parallel robot trajectory sample data set.

优选地，步骤S3具体包括以下步骤：S31：构建轨迹位移DQN子网络和速度DQN子网络：最优位移模型和最优速度模型； S32：将轨迹样本数据集分为轨迹位移和速度的样本数据集，进行DQN子网络的训练，引入动态协调系数，更改轨迹策略，通过改进后的最优轨迹策略动态协调两DQN子网络的位移Q值与速度Q值；S33：构建用于获取最优运动轨迹的DQN总网络，输入位移Q值与速度Q值对其进行训练迭代，最终得到最优位移和速度下的最优运动轨迹。Preferably, step S3 specifically includes the following steps: S31: Construct a trajectory displacement DQN subnetwork and a velocity DQN subnetwork: an optimal displacement model and an optimal velocity model; S32: Divide the trajectory sample data set into trajectory displacement and velocity sample data. Set, conduct training of DQN subnetworks, introduce dynamic coordination coefficients, change trajectory strategies, and dynamically coordinate the displacement Q values and velocity Q values of the two DQN subnetworks through the improved optimal trajectory strategy; S33: Construction for obtaining optimal motion For the DQN total network of the trajectory, the displacement Q value and the velocity Q value are input for training and iteration, and finally the optimal motion trajectory under the optimal displacement and velocity is obtained.

优选地，步骤S32中，改进后的最优策略的公式为：Preferably, in step S32, the improved optimal strategy The formula is:

式中，与/>为该策略下的动态协调系数；/>为初始轨迹策略；/>为位移误差；/>为速度误差；/>为最小值求解函数；/>为执行初始轨迹策略/>的Q值。In the formula, with/> is the dynamic coordination coefficient under this strategy;/> is the initial trajectory strategy;/> is the displacement error;/> is the speed error;/> Solve the function for the minimum value;/> To execute the initial trajectory strategy/> Q value.

优选地，步骤S32中，数字孪生模型与物理实体在时刻的位移误差/>为：Preferably, in step S32, the digital twin model and the physical entity are Displacement error at time/> for:

式中，表示数字孪生模型在/>时刻的末端执行器运行的位置坐标；表示物理实体在/>时刻的末端执行器运行的位置坐标；In the formula, Indicates that the digital twin model is in/> The position coordinates of the end effector running at the moment; Indicates that the physical entity is in/> The position coordinates of the end effector running at the moment;

数字孪生模型与物理实体在时刻的速度误差/>为：Digital twin model and physical entity Time speed error/> for:

式中，表示并联机器人数学孪生模型在/>时刻的末端执行器的速度，/>表示并联机器人物理实体在/>时刻的末端执行器的速度。In the formula, Represents the mathematical twin model of the parallel robot in/> The velocity of the end effector at the moment, /> Indicates that the physical entity of the parallel robot is in/> The speed of the end effector at the moment.

优选地，作为最优位移模型的输出并反馈作用于最优位移模型中，最终输出最小的位移误差/>作为位移Q值；/>作为最优速度模型的输出并反馈作用于最优速度模型中，最终输出最小的速度误差/>作为速度Q值。Preferably, As the output of the optimal displacement model and fed back into the optimal displacement model, the smallest displacement error is finally output/> As the displacement Q value;/> As the output of the optimal speed model and fed back into the optimal speed model, the minimum speed error is finally output. As the speed Q value.

优选地，在DQN总网络，定义最小轨迹综合误差为：Preferably, in the DQN total network, the minimum trajectory comprehensive error is defined as:

式中为总体动态平衡系数。in the formula is the overall dynamic balance coefficient.

优选地，并联机器人物理实体与数字孪生模型存在时间延迟，为保证两者运行速度的一致性，需根据速度对称理念，对速度误差进行补偿，具体包括以下步骤：Preferably, there is a time delay between the physical entity of the parallel robot and the digital twin model. In order to ensure the consistency of the running speeds of the two, the speed error needs to be compensated based on the concept of speed symmetry, which specifically includes the following steps:

S41：将数字孪生模型运行周期与物理实体运动过程中的时间周期同步，数字孪生模型运行周期包括5个环节，依次为：启动加速环节、匀加速环节、匀加速到匀减速环节、匀减速环节和最终制动减速环节；将并联机器人物理实体也进行相同的运行周期的划分；S42：使物理实体按照数字孪生模型的最优运动轨迹进行周期性运动，设置状态节点对应门型轨迹运动；S43：门型运动轨迹为对称形状，采用对称式误差补偿法将周期内变换规律一致的运动进行划分；将匀减速环节和最终制动减速环节简化为匀加速环节和启动加速环节的速度对称模型，以中心线为基准，左侧为加速趋势速度模型，右侧为减速趋势速度模型，同时改变数字孪生模型与物理实体的速度策略，使得数字孪生模型与物理实体的运动趋势达到一致；S44：在启动加速环节、匀加速到匀减速环节、最终制动减速环节，误差补偿为直线误差补偿；在匀加速环节、匀减速环节，误差补偿为圆弧误差补偿。S41: Synchronize the digital twin model operation cycle with the time period during the movement of the physical entity. The digital twin model operation cycle includes 5 links, which are: startup acceleration link, uniform acceleration link, uniform acceleration to uniform deceleration link, and uniform deceleration link. and the final braking deceleration link; divide the physical entity of the parallel robot into the same operating cycle; S42: Make the physical entity perform periodic motion according to the optimal motion trajectory of the digital twin model, and set the state node to correspond to the door-shaped trajectory motion; S43 : The door-shaped motion trajectory is a symmetrical shape, and the symmetrical error compensation method is used to divide the motion with consistent transformation rules within the cycle; the uniform deceleration link and the final braking deceleration link are simplified into a speed symmetric model of the uniform acceleration link and the start-up acceleration link, Taking the center line as the benchmark, the left side is the acceleration trend speed model, and the right side is the deceleration trend speed model. At the same time, the speed strategy of the digital twin model and the physical entity is changed to make the movement trends of the digital twin model and the physical entity consistent; S44: In In the start-up acceleration link, uniform acceleration to uniform deceleration link, and final braking deceleration link, the error compensation is linear error compensation; in the uniform acceleration link and uniform deceleration link, the error compensation is arc error compensation.

优选地，步骤S5中，并联机器人物理实体与数字孪生模型的实时同步交互系统包括：物理实体层、信息交互层、误差反馈层、数据积累层、虚拟孪生层、数据处理层、动态协调层、虚实交互模块和动态协调强化学习模块；Preferably, in step S5, the real-time synchronous interaction system between the parallel robot physical entity and the digital twin model includes: physical entity layer, information interaction layer, error feedback layer, data accumulation layer, virtual twin layer, data processing layer, dynamic coordination layer, Virtual-real interaction module and dynamic coordination reinforcement learning module;

物理实体层包括并联机器人本体，通讯端口；信息交互层用于控制运行软件与数字孪生模型的通讯联动；误差反馈层进行物理实体层与虚拟孪生层的最优轨迹位移与速度的同步误差消除；数据积累建立数据库信息，保存DQN网络的训练参数以及并联机器人运动过程中的补偿参数；虚拟孪生层包括并联机器人数字孪生模型，软件平台界面，移动控制系统；动态协调层对当前轨迹误差与速度误差结合动态协调强化学习中的最优策略进行协调，负责对物理实体层与虚拟孪生层的反馈协调，形成总体的闭环交互系统。The physical entity layer includes the parallel robot body and communication ports; the information interaction layer is used to control the communication linkage between the running software and the digital twin model; the error feedback layer eliminates synchronization errors in the optimal trajectory displacement and speed of the physical entity layer and the virtual twin layer; Data accumulation establishes database information, saving the training parameters of the DQN network and the compensation parameters during the movement of the parallel robot; the virtual twin layer includes the parallel robot digital twin model, software platform interface, and mobile control system; the dynamic coordination layer controls the current trajectory error and speed error Combined with the optimal strategy in dynamic coordination reinforcement learning for coordination, it is responsible for the feedback coordination between the physical entity layer and the virtual twin layer to form an overall closed-loop interactive system.

优选地，并联机器人物理实体与数字孪生模型的实时同步交互的过程为：Preferably, the process of real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model is:

启动物理实体层，使并联机器人做周期性运动，通过通信接口进入信息交互层；启动虚拟孪生层，动态协调强化学习模块进行最优轨迹位移，将多余的数据信息在数据处理层进行剔除；Start the physical entity layer to make the parallel robot make periodic movements and enter the information interaction layer through the communication interface; start the virtual twin layer to dynamically coordinate the reinforcement learning module for optimal trajectory displacement, and eliminate redundant data information in the data processing layer;

信息交互层将物理实体层中的运动学参数进行解析，转换为数字信息传输到误差反馈层；同时动态协调强化学习模块进行奖励值的计算与累加，将当前的数据传送到数据累计层与误差反馈层；The information interaction layer analyzes the kinematic parameters in the physical entity layer, converts them into digital information and transmits them to the error feedback layer; at the same time, it dynamically coordinates the reinforcement learning module to calculate and accumulate reward values, and transmits the current data to the data accumulation layer and error feedback layer;

误差反馈层则将同步结果进行分析，在轨迹位移同步的基础上达到轨迹速度同步补偿，得到较小的速度与位移误差，实现高精度数字孪生模型与物体实体的同步控制效果；The error feedback layer analyzes the synchronization results, achieves trajectory velocity synchronization compensation based on trajectory displacement synchronization, obtains smaller velocity and displacement errors, and achieves synchronization control effects between high-precision digital twin models and object entities;

通过数据累积层中的历史信息与当前误差反馈层中的信息进行动态协调，最后以闭环的方式反馈给物理实体层与虚拟孪生层完成虚实交互过程。The historical information in the data accumulation layer is dynamically coordinated with the information in the current error feedback layer, and finally fed back to the physical entity layer and the virtual twin layer in a closed-loop manner to complete the virtual-real interaction process.

与现有技术相比，本发明的有益效果是：Compared with the prior art, the beneficial effects of the present invention are:

本发明通过构建并联机器人的虚拟模型，并生成并联机器人的数字孪生模型；采集并联机器人的扭矩与动能参数并将其转换为轨迹位移与速度参数；通过动态协调强化学习方法对数字孪生模型进行训练迭代，得到最优运动轨迹下的最小位移误差和速度误差；通过对称式误差补偿法对并联机器人的物理实体和优化后的数字孪生模型间的速度误差进行补偿；进行数字孪生框架整合，实现并联机器人物理实体与数字孪生模型的实时同步交互。有利于提升数字孪生模型重构能力与并联机器人的误差控制性能，能够实现物理实体与数字孪生模型之间通信同步，提高两者的虚实交互率。This invention builds a virtual model of a parallel robot and generates a digital twin model of the parallel robot; collects the torque and kinetic energy parameters of the parallel robot and converts them into trajectory displacement and speed parameters; and trains the digital twin model through a dynamic coordination reinforcement learning method. Iterate to obtain the minimum displacement error and speed error under the optimal motion trajectory; use the symmetrical error compensation method to compensate for the speed error between the physical entity of the parallel robot and the optimized digital twin model; integrate the digital twin framework to achieve parallel connection Real-time synchronous interaction between the robot's physical entity and the digital twin model. It is conducive to improving the digital twin model reconstruction capability and the error control performance of parallel robots, enabling communication synchronization between physical entities and digital twin models, and improving the virtual and real interaction rate between the two.

附图说明Description of the drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例中所需要使用的附图作简单的介绍，显而易见地，下面描述中的附图仅仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some of the drawings of the present invention. Embodiments, for those of ordinary skill in the art, other drawings can also be obtained based on these drawings without exerting creative efforts.

图1是本发明的整体流程示意图；Figure 1 is a schematic diagram of the overall process of the present invention;

图2是本发明的CD-DQN网络的实现流程图；Figure 2 is a flow chart of the implementation of the CD-DQN network of the present invention;

图3是改进前的DQN网络的实现流程图；Figure 3 is the implementation flow chart of the DQN network before improvement;

图4是本发明的CD-DQN网络与改进前的DQN网络的速度误差对比图；Figure 4 is a comparison diagram of the speed error between the CD-DQN network of the present invention and the DQN network before improvement;

图5是本发明的轨迹位移与速度数学模型的示意图；Figure 5 is a schematic diagram of the trajectory displacement and velocity mathematical model of the present invention;

图6是本发明的对称误差补偿结构的示意图；Figure 6 is a schematic diagram of the symmetric error compensation structure of the present invention;

图7是本发明的动态虚实交互方法的结构框图；Figure 7 is a structural block diagram of the dynamic virtual and real interaction method of the present invention;

图8是改进前的虚实交互方法的结构框图；Figure 8 is a structural block diagram of the virtual-real interaction method before improvement;

图9是本发明的动态虚实交互方法与改进前的虚实交互方法的交互同步率对比图。Figure 9 is a comparison diagram of the interaction synchronization rate between the dynamic virtual and real interaction method of the present invention and the improved virtual and real interaction method.

具体实施方式Detailed ways

结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚，完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部实施例。基于本发明的实施例，本领域的普通技术人员在没有做出创造性劳动的前提下所得到的所有其他实施方式，都属于本发明所保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other implementations obtained by those of ordinary skill in the art without any creative work fall within the scope of protection of the present invention.

须知，本说明书附图所绘示的结构、比例、大小等，均仅用以配合说明书所揭示的内容，以供熟悉此技术的人士了解与阅读，并非用以限定本发明可实施的限定条件，故不具技术上的实质意义，任何结构的修饰、比例关系的改变或大小的调整，在不影响本发明所能产生的功效及所能达成的目的下，均应落在本发明所揭示的技术内容能涵盖的范围内，需要说明的是，在本说明书中，诸如第一和第二之类的关系术语仅仅用来将一个实体与另外几个实体区分开来，而不一定要求或者暗示这些实体之间存在任何实际的关系或者顺序。It should be noted that the structures, proportions, sizes, etc. shown in the drawings of this specification are only used to coordinate with the content disclosed in the specification for the understanding and reading of those familiar with this technology, and are not used to limit the conditions for the implementation of the present invention. , so it has no technical substantive significance. Any structural modifications, changes in proportions or adjustments in size shall fall within the scope of what is disclosed in the present invention as long as it does not affect the effects that the present invention can produce and the purposes that can be achieved. To the extent that the technical content can be covered, it should be noted that in this specification, relational terms such as first and second are only used to distinguish one entity from several other entities, and do not necessarily require or imply There is no actual relationship or ordering between these entities.

本发明提供了一种实施例：The present invention provides an embodiment:

如图1所示，一种并联机器人数字孪生轨迹误差动态补偿方法，包括以下步骤：As shown in Figure 1, a dynamic compensation method for trajectory error of a parallel robot digital twin includes the following steps:

S1：构建并联机器人的虚拟模型，并对虚拟模型进行杆件约束和空间约束，生成并联机器人的数字孪生模型；S1: Construct a virtual model of the parallel robot, perform rod constraints and space constraints on the virtual model, and generate a digital twin model of the parallel robot;

S2：对数字孪生模型进行动力学分析，采集并联机器人的电机转矩与动能参数将其转换为轨迹位移与速度参数，并采集并联机器人轨迹样本数据集；S2: Perform dynamic analysis on the digital twin model, collect the motor torque and kinetic energy parameters of the parallel robot, convert them into trajectory displacement and speed parameters, and collect the parallel robot trajectory sample data set;

S3：通过动态协调强化学习方法对数字孪生模型进行训练迭代，得到最优位移和速度下的最优运动轨迹；S3: Use the dynamic coordination reinforcement learning method to conduct training iterations on the digital twin model to obtain the optimal motion trajectory under optimal displacement and speed;

S4：通过对称式误差补偿法对并联机器人的物理实体和优化后的数字孪生模型间的速度误差进行补偿；S4: Compensate the speed error between the physical entity of the parallel robot and the optimized digital twin model through the symmetrical error compensation method;

S5：进行数字孪生框架的整合，实现并联机器人物理实体与数字孪生模型的实时同步交互。S5: Integrate the digital twin framework to achieve real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model.

步骤S1中，建立并联机器人的三维模型，进行模块式划分，包含静平台，动平台，驱动杆件以及弹簧、螺丝零件，对三维模型的主要部件进行装配，以此构建并联机器人的虚拟模型。使用物理引擎对并联机器人虚拟模型固定与约束，给定并联机器人的工作空间，设置并联机器人的空间工作范围，建立并联机器人数字孪生模型。In step S1, a three-dimensional model of the parallel robot is established and divided into modules, including a static platform, a moving platform, driving rods, springs, and screw parts. The main components of the three-dimensional model are assembled to build a virtual model of the parallel robot. Use the physics engine to fix and constrain the parallel robot virtual model, give the parallel robot's work space, set the parallel robot's spatial working range, and establish a parallel robot digital twin model.

具体步骤为：使用Unity3D物理引擎对并联机器人进行固定与约束，将并联机器人中的静平台坐标系与实际的世界坐标系进行配准；固定并联机器人静平台部件，对驱动杆件进行球铰约束，对并联机器人动平台部件进行平行约束；约束驱动杆件与动平台的结构关系，完成固定与约束操作；将动平台部件赋予不同方向同等大小的应力，应力平行于静平台所在的空间坐标系中，给定并联机器人的正常工作空间，设置合理的工作范围；添加并联机器人结构映射关系，生成并联机器人的数字孪生模型。The specific steps are: use the Unity3D physics engine to fix and constrain the parallel robot, align the static platform coordinate system in the parallel robot with the actual world coordinate system; fix the static platform components of the parallel robot, and perform spherical hinge constraints on the driving rods. , carry out parallel constraints on the moving platform components of the parallel robot; constrain the structural relationship between the driving rod and the moving platform to complete the fixing and constraining operations; give the moving platform components equal stresses in different directions, and the stresses are parallel to the spatial coordinate system where the static platform is located , given the normal working space of the parallel robot, set a reasonable working range; add the structural mapping relationship of the parallel robot to generate a digital twin model of the parallel robot.

步骤S2中，并联机器人物理信息构建的运动学集合为，采集并联机器人的各项参数转换为并联机器人的运动学信息,通过静平台半径/>、动平台半径/>，主动杆长度/>，从动杆长度/>得到并联机器人末端位姿关系；通过构建的运动学模型得到将并联机器人的轨迹位移/>与速度；使用Unity3D中的ML-agent插件，使并联机器人的数字孪生模型进行虚拟空间中的运动，采集并联机器人轨迹样本数据，共采集50000个样本数据点集合作为并联机器人轨迹样本数据集。In step S2, the kinematics set constructed from the physical information of the parallel robot is , collect various parameters of the parallel robot and convert them into kinematic information of the parallel robot, through the radius of the static platform/> , moving platform radius/> , active rod length/> , driven rod length/> Obtain the end pose relationship of the parallel robot; obtain the trajectory displacement of the parallel robot through the constructed kinematic model/> with speed ; Use the ML-agent plug-in in Unity3D to make the digital twin model of the parallel robot move in the virtual space, and collect the parallel robot trajectory sample data. A total of 50,000 sample data point sets are collected as the parallel robot trajectory sample data set.

如图5所示，数字孪生模型运行时包括5个环节，将其定义为轨迹位移与速度数学模型：As shown in Figure 5, the digital twin model includes 5 links when running, which is defined as a mathematical model of trajectory displacement and velocity:

对应的运行周期；Corresponding operation cycle ;

、/>和/>分别表示在区间内/>时刻的加速度、速度和位移；/>为运行周期内的最大加速度。 ,/> and/> Respectively expressed in the interval/> Momentary acceleration, velocity and displacement;/> is the maximum acceleration during the operating cycle.

定义区间对应启动加速环节，对应图5中环节1，在此区间内：definition The interval corresponds to the startup acceleration link, which corresponds to link 1 in Figure 5. Within this interval:

、/>、 ,/> ,

； ;

定义区间对应匀加速环节，对应图5中环节2，在此区间内：definition The interval corresponds to the uniform acceleration link, which corresponds to link 2 in Figure 5. Within this interval:

、/>、 ,/> ,

； ;

定义区间对应匀加速到匀减速环节，对应图5中环节3，在此区间内：definition The interval corresponds to the period from uniform acceleration to uniform deceleration, which corresponds to link 3 in Figure 5. Within this interval:

、 ,

定义区间对应匀减速环节，对应图5中环节4，在此区间内：definition The interval corresponds to the uniform deceleration link, which corresponds to link 4 in Figure 5. Within this interval:

、/>、 ,/> ,

定义区间对应最终制动减速环节，对应图5中环节5，在此区间内：definition The interval corresponds to the final braking deceleration link, which corresponds to link 5 in Figure 5. Within this interval:

、/>、 ,/> ,

； ;

步骤S3中，对数字孪生模型使用并联机器人轨迹的样本数据集进行迭代训练，构建两个DQN子网络用于最优位移和最优速度的求解，通过动态协调过程将轨迹的样本数据集分为位移与速度的样本数据集。分别进行两个DQN子网络的模型训练，训练过程中动态改变轨迹位移与速度，求解最优轨迹下的数字孪生模型中位移和速度误差，误差最小时得到位移Q值与速度Q值。In step S3, the digital twin model is iteratively trained using the sample data set of the parallel robot trajectory, and two DQN subnetworks are constructed for solving the optimal displacement and optimal speed. The sample data set of the trajectory is divided into Sample data set of displacements and velocities. Carry out model training for two DQN sub-networks respectively. During the training process, the trajectory displacement and velocity are dynamically changed, and the displacement and velocity errors in the digital twin model under the optimal trajectory are solved. When the error is minimum, the displacement Q value and velocity Q value are obtained.

然后在DQN总网络中进行不断试错，错误状态包括运行紊乱，机器人末端执行器的奇异位姿，最终在得到最优位移与速度同时，得到最优的门型运行轨迹。两个DQN子网络和一个DQN总网络组成CD-DQN网络，特点为两个子网络偏向于探索轨迹运行的可能，而总网络减小探索的同时偏向于最优轨迹的执行活动，将探索与执行工作以串联方式进行，同时注重多参数（速度，位移）之间的协调关系。Then, continuous trial and error is carried out in the DQN total network. The error states include operating chaos and strange postures of the robot's end effector. Finally, while obtaining the optimal displacement and speed, the optimal door-shaped operating trajectory is obtained. Two DQN sub-networks and a DQN total network form a CD-DQN network. The characteristic is that the two sub-networks are biased towards exploring the possibility of trajectory operation, while the total network reduces exploration and is biased towards the execution activities of the optimal trajectory, combining exploration and execution. The work is carried out in series, paying attention to the coordination relationship between multiple parameters (velocity, displacement).

如图2至图4所示，步骤S3具体包括以下步骤：As shown in Figures 2 to 4, step S3 specifically includes the following steps:

S31：构建两个DQN子网络：最优位移模型A1和最优速度模型B1；定义协调动态网络的各项参数，agent为智能执行体，用来执行收敛的目标策略。S31: Construct two DQN sub-networks: the optimal displacement model A1 and the optimal speed model B1; define various parameters of the coordinated dynamic network, and the agent is an intelligent execution body used to execute the convergence target strategy.

具体地，S311：分别定义两种不同的学习环境：定义为/>时刻对应的总体环境状态，/>、/>分别为/>时刻对应的最优位移模型、最优速度模型的环境状态；Specifically, S311: Define two different learning environments respectively: Definition for/> The overall environmental state corresponding to the moment,/> ,/> respectively/> The environmental state of the optimal displacement model and optimal velocity model corresponding to each moment;

S312：分别定义、/>为/>时刻最优位移模型和最优速度模型中可执行的动作；S312: Define separately ,/> for/> Actions that can be performed in the optimal displacement model and optimal velocity model at all times;

S313：定义奖励机制，/>表示总奖励值；在/>时刻下总的环境状态/>与动作/>、/>下获得的总的奖励值为/>，/>为位移奖励值大小，为速度奖励值大小；S313: Define reward mechanism ,/> Indicates the total reward value; in/> Total environmental status at all times/> with action/> ,/> The total reward value obtained is/> ,/> is the displacement reward value, is the speed bonus value;

S314：数字孪生模型在时刻的末端执行器运行的当前位置为，并联机器人物理实体在/>时刻的末端执行器运行的当前位置为；数字孪生模型与物理实体在/>时刻的位移误差为：S314: Digital twin model in The current position of the end effector running at the moment is , the physical entity of the parallel robot is in/> The current position of the end effector running at the moment is ;Digital twin model and physical entity in/> The displacement error at time is:

式中，表示数字孪生模型在/>时刻的末端执行器运行的位置坐标；表示物理实体在/>时刻的末端执行器运行的位置坐标；/>表示位移误差。In the formula, Indicates that the digital twin model is in/> The position coordinates of the end effector running at the moment; Indicates that the physical entity is in/> The position coordinates of the end effector running at the moment;/> represents the displacement error.

并联机器人数学孪生模型在时刻的末端执行器的速度为/>，并联机器人物理实体在/>时刻的末端执行器的速度/>，数字孪生模型与物理实体在/>时刻的速度误差为：Mathematical twin model of parallel robot in The speed of the end effector at time is/> , the physical entity of the parallel robot is in/> Moment of end effector speed/> , the digital twin model and the physical entity are in/> The speed error at time is:

式中，表示速度误差。In the formula, Indicates speed error.

S315：初始轨迹策略函数为，式中/>为执行初始轨迹策略/>的Q值，/>为最小值函数。S315: The initial trajectory policy function is , formula in/> To execute the initial trajectory strategy/> Q value,/> is the minimum value function.

S32：引入动态协调系数，更改轨迹策略；在最优轨迹状态下，通过改进后的最优策略动态协调两个DQN子网络的目标Q值，动态协调两者关系；一开始的轨迹为S315的轨迹策略，在运行轨迹过程中按照正确的门型轨迹去运动，下一刻轨迹策略函数发生改变，变成，开始变成强调速度与位移误差的变化，速度与误差最小时为最优。S32: Introduce a dynamic coordination coefficient and change the trajectory strategy; in the optimal trajectory state, dynamically coordinate the target Q values of the two DQN subnetworks through the improved optimal strategy, and dynamically coordinate the relationship between the two; the initial trajectory is S315 The trajectory strategy moves according to the correct door-shaped trajectory during the running trajectory. The trajectory strategy function changes at the next moment and becomes , begins to emphasize the changes in speed and displacement errors, and the optimal speed and error are minimized.

动态协调过程中的最优策略公式为：Optimal strategy in dynamic coordination process The formula is:

式中，与/>为该策略下的动态协调系数；/>为初始轨迹策略；/>为位移误差；/>为速度误差；/>为执行初始轨迹策略/>的Q值。In the formula, with/> is the dynamic coordination coefficient under this strategy;/> is the initial trajectory strategy;/> is the displacement error;/> is the speed error;/> To execute the initial trajectory strategy/> Q value.

当数字孪生模型速度误差大于0.03m/s，位移误差小于0.2mm时，注重于位移误差最小化；当数字孪生模型速度误差小于0.03m/s，位移误差大于0.2mm时，注重于速度误差最小化；优点是排除较大误差的运动学模型，能够得到更好的网络评估效果。When the speed error of the digital twin model is greater than 0.03m/s and the displacement error is less than 0.2mm, focus on minimizing the displacement error; when the speed error of the digital twin model is less than 0.03m/s and the displacement error is greater than 0.2mm, focus on minimizing the speed error. ization; the advantage is that kinematic models with larger errors are eliminated and better network evaluation results can be obtained.

如果两者都符合基本误差，则保持初始轨迹策略不变；如果运行速度与轨迹误差均超过固定值，设置策略值为0并不进行奖励值的累加与计算，减小网络计算量。If both meet the basic error, the initial trajectory strategy remains unchanged; if the running speed and trajectory error exceed the fixed value, the strategy value is set to 0 and the accumulation and calculation of reward values are not performed to reduce the amount of network calculations.

训练子网络：进行CD-DQN网络的训练，作为最优位移模型的输出并反馈作用于最优位移模型中，最终输出最小的位移误差/>作为位移Q值；/>作为最优速度模型的输出并反馈作用于最优速度模型中，最终输出最小的速度误差/>作为速度Q值；原始的50000个样本点整合为更优的轨迹样本15000个，位移Q值与速度Q值作为总体DQN网络的输入Q值。Training subnetwork: train the CD-DQN network, As the output of the optimal displacement model and fed back into the optimal displacement model, the smallest displacement error is finally output/> As the displacement Q value;/> As the output of the optimal speed model and fed back into the optimal speed model, the minimum speed error is finally output. As the velocity Q value; the original 50,000 sample points are integrated into 15,000 better trajectory samples, and the displacement Q value and velocity Q value are used as the input Q value of the overall DQN network.

S33：构建用于获取最优运动轨迹的DQN总网络，输入位移Q值与速度Q值对其进行训练迭代；DQN总网络训练过程得到误差最小的运行轨迹；S33: Construct a DQN total network for obtaining the optimal motion trajectory, input the displacement Q value and the velocity Q value for training iterations; the DQN total network training process obtains the running trajectory with the smallest error;

具体地，S331：首先对数据样本集进行DQN总网络训练，离线策略完成探索与学习，进行多次的批量样本采样；定义最小轨迹综合误差为：Specifically, S331: First perform DQN total network training on the data sample set, offline strategy Complete exploration and learning, and conduct multiple batch sample samplings; define the minimum trajectory comprehensive error as:

，式中/>为总体动态平衡系数； , formula in/> is the overall dynamic balance coefficient;

总的奖励值为，式中/>为最小轨迹综合误差，/>为最大值函数。The total reward value is , formula in/> is the minimum trajectory comprehensive error,/> is the maximum value function.

S332：由位移误差与速度误差得到的最优轨迹在并联机器人运行时需要经过周期性运动；将并联机器人数字孪生模型运行周期进一步分为/>个状态节点，时间间隔为20ms，此处/>，/>个状态节点为在5个环节中的进一步划分。子网络简化后的样本数为15000，开始进行5000次的迭代计算；S332: The optimal trajectory obtained from the displacement error and speed error needs to undergo periodic motion when the parallel robot is running; the parallel robot digital twin model will run periodically further divided into/> status nodes, the time interval is 20ms, here/> ,/> Each status node is further divided into 5 links. The number of simplified samples of the subnetwork is 15,000, and 5,000 iterative calculations are started;

S333：对DQN总网络的损失函数进行求解；S333: Solve the loss function of the DQN total network;

式中，为经验回放的数学期望；/>为总权重，/>分别为位移和速度子网络中的权重；/>，/>，为接近输入Q值函数，/>为/>时刻的总体环境状态，/>为初始轨迹策略/>的Q值函数。In the formula, Mathematical expectations for empirical replay;/> is the total weight,/> are the weights in the displacement and velocity subnetworks respectively;/> ,/> , To approximate the input Q value function,/> for/> The overall environmental state of the moment,/> For the initial trajectory strategy/> Q-value function.

S334：进行总体DQN网络的训练，在权重交替过程中，先将目标Q值进行固定，随后每10次迭代训练中后将评估网络中更新的总权重/>赋予/>，进行初步迭代，总体网络训练5000次。目标Q值的计算公式为：S334: Carry out training of the overall DQN network. During the weight alternation process, first set the target Q value is fixed, and then the updated total weights in the network are evaluated after every 10 iterations of training/> Give/> , perform preliminary iterations, and train the overall network 5,000 times. The calculation formula of the target Q value is:

式中，为执行/>策略的当前的最优Q值，/>为接近最优策略的当前最优Q值函数，/>为数学期望。In the formula, for execution/> The current optimal Q value of the strategy,/> is the current optimal Q-value function close to the optimal strategy,/> for mathematical expectations.

具体地，在两个DQN子网络中获得最优位移与最优速度参数，将两者作为输入Q值代入到总网络中求得机器人的最优轨迹路线。将S2中得到的轨迹样本数据集中的位移样本参数与速度样本参数划分，经过最优位移模型与最优速度模型两个DQN子网络，样本数量由50000个变为15000个。设定最优运行轨迹为DQN总网络的（目标Q值），将DQN子网络的得到的速度Q值与位移Q值作为总网络的输入Q值；在最优策略下，DQN总网络能实时预测最优Q值，将最优Q值与/>进行比较，使得迭代过程中的损失函数/>进行收敛；在训练迭代过程中，轨迹样本集数据得以简化，机器人数字孪生模型的末端执行器在运动过程中会预测出多种轨迹，简化轨迹样本；网络迭代中每训练10次会更新的输入Q值，输入Q值将权重/>赋予目标Q值，对目标Q值完成更新；网络训练迭代中不断形成新的输入Q值与目标Q值达到动态轨迹误差补偿。Specifically, the optimal displacement and optimal speed parameters are obtained in two DQN sub-networks, and the two are substituted into the total network as input Q values to obtain the optimal trajectory of the robot. The displacement sample parameters and velocity sample parameters in the trajectory sample data set obtained in S2 are divided into two DQN subnetworks: the optimal displacement model and the optimal velocity model. The number of samples changes from 50,000 to 15,000. Set the optimal running trajectory as the total DQN network (target Q value), the speed Q value and displacement Q value obtained from the DQN subnetwork are used as the input Q value of the total network; under the optimal strategy, the DQN total network can predict the optimal Q value in real time, and use the optimal Q value to with/> Compare so that the loss function in the iteration process/> Convergence; during the training iteration process, the trajectory sample set data is simplified. The end effector of the robot digital twin model will predict a variety of trajectories during the movement and simplify the trajectory samples; the input will be updated every 10 times of training in the network iteration Q value, enter the Q value to change the weight/> The target Q value is assigned and the target Q value is updated; new input Q values and target Q values are continuously formed during network training iterations to achieve dynamic trajectory error compensation.

从原来的运动学参数通过数字孪生模型转换为轨迹坐标参数；使用Unity3D引擎中的ML-agent插件中采用经验回放原则，即agent不断进行交互数据迭代，形成参数集合，对/>时刻下总的奖励值/>进行累加，训练过程中，环境/>进行改变，变为/>，得到最小综合误差，最终同步映射得到最优路径与机器人数字孪生模型末端执行器的同步；在经过速度与轨迹位移补偿调整后，数字孪生模型中运行与最优轨迹同步，得到最小综合误差。误差越小，奖励值越高，完成同步后再将/>、/>返回经验回放中进行下一次的训练迭代。The original kinematic parameters are converted into trajectory coordinate parameters through the digital twin model; the ML-agent plug-in in the Unity3D engine is used to adopt the experience playback principle, that is, the agent continuously iterates the interactive data to form a parameter set , right/> The total reward value at the moment/> Accumulation, during training, environment/> Make changes to/> , the minimum comprehensive error is obtained, and the final synchronization mapping obtains the synchronization between the optimal path and the end effector of the robot digital twin model; after speed and trajectory displacement compensation adjustment, the operation in the digital twin model is synchronized with the optimal trajectory, and the minimum comprehensive error is obtained. The smaller the error, the higher the reward value. After synchronization is completed, // ,/> Return to the experience replay for the next training iteration.

训练完成后，将数字孪生模型数据进行实时更新，得到最优运动轨迹与孪生模型的运动轨迹的虚实同步。检测网络训练质量，对比原始DQN与改进方法CD-DQN的速度误差。如图5所示，选择60秒作为运动周期，反复循环执行轨迹，CD-DQN方法的速度误差随时间减小的较快，明显优于原始DQN方法。After the training is completed, the digital twin model data is updated in real time to obtain virtual and real synchronization of the optimal motion trajectory with the twin model's motion trajectory. Test the quality of network training and compare the speed error of the original DQN and the improved method CD-DQN. As shown in Figure 5, 60 seconds is selected as the motion period and the trajectory is repeatedly executed. The speed error of the CD-DQN method decreases faster with time, which is significantly better than the original DQN method.

如图5、图6所示，完成动态强化学习过程后，并联机器人的数字孪生模型完成最优轨迹位移与速度的运行，在虚拟空间内进行周期运动，由于物理实体与数字孪生模型存在时间延迟，需要进行并联机器人的物理实体与数字孪生模型的实际误差补偿，提出速度对称理念，使得数字孪生模型运动趋势与物理实体的运行速度一致。As shown in Figure 5 and Figure 6, after completing the dynamic reinforcement learning process, the digital twin model of the parallel robot completes the operation of optimal trajectory displacement and speed, and performs periodic motion in the virtual space. Due to the time delay between the physical entity and the digital twin model, , it is necessary to carry out actual error compensation between the physical entity and the digital twin model of the parallel robot, and propose the concept of speed symmetry to make the movement trend of the digital twin model consistent with the running speed of the physical entity.

具体地，S41：将数字孪生模型的运行周期与物理实体运动过程中的时间周期同步，数字孪生模型运行周期为，将并联机器人的物理实体也进行相同的运行周期的划分；Specifically, S41: Synchronize the running cycle of the digital twin model with the time cycle during the movement of the physical entity. The running cycle of the digital twin model is , divide the physical entity of the parallel robot into the same operating cycle;

S42：使物理实体按照数字孪生模型的最优位移轨迹进行周期性运动，设置状态节点对应门型轨迹运动；S42: Make the physical entity move periodically according to the optimal displacement trajectory of the digital twin model, and set the state node to move corresponding to the gate-shaped trajectory;

S43：门型运动轨迹为对称形状，采用对称式误差补偿法将周期内变换规律一致的运动进行划分；将匀减速环节和最终制动减速环节简化为匀加速环节和启动加速环节的速度对称模型（环节4与环节5进行速度模型简化，环节4变为环节2’，环节5变为1’。环节1’与环节2’为环节1,2的速度对称模型），以中心线为基准，左侧为加速趋势速度模型，右侧为减速趋势速度模型，同时改变数字孪生模型与物理实体中的速度策略，使得数字孪生模型与物理实体的运动趋势达到一致；S43: The door-shaped motion trajectory is a symmetrical shape, and the symmetrical error compensation method is used to divide the motion with consistent transformation rules within the cycle; the uniform deceleration link and the final braking deceleration link are simplified into a speed symmetric model of the uniform acceleration link and the start-up acceleration link. (The speed model of link 4 and link 5 is simplified, link 4 becomes link 2', and link 5 becomes 1'. Link 1' and link 2' are the speed symmetry models of links 1 and 2), based on the center line, The left side is the acceleration trend speed model, and the right side is the deceleration trend speed model. At the same time, the speed strategy in the digital twin model and the physical entity is changed to make the motion trends of the digital twin model and the physical entity consistent;

S44：具体补偿方法为：、/>、/>时，误差补偿以直线误差补偿为主；/>、/>时，误差补偿以圆弧误差补偿为主。S44: The specific compensation methods are: ,/> ,/> When , the error compensation is mainly linear error compensation;/> ,/> When , the error compensation is mainly arc error compensation.

如图7至图9所示，根据步骤S3中训练的并联机器人执行的最优轨迹和步骤S4中得到的反馈误差补偿，进行数字孪生框架整合，设定物理实体层，虚拟孪生层，信息交互层，误差反馈层，数据处理层，数据积累层，动态协调层，实现并联机器人物理实体与数字孪生模型的实时同步交互。As shown in Figures 7 to 9, based on the optimal trajectory executed by the parallel robot trained in step S3 and the feedback error compensation obtained in step S4, the digital twin framework is integrated, and the physical entity layer, virtual twin layer, and information interaction are set. layer, error feedback layer, data processing layer, data accumulation layer, and dynamic coordination layer to realize real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model.

实时同步交互系统包括：物理实体层、信息交互层、误差反馈层、数据积累层、虚拟孪生层、数据处理层、动态协调层、虚实交互模块和动态协调强化学习模块；物理实体层包括并联机器人本体，通讯端口；信息交互层用于控制运行软件与数字孪生模型的通讯联动；误差反馈层进行物理实体层与虚拟孪生层的最优轨迹位移与速度的同步误差消除；数据积累层建立数据库信息，保存DQN网络的训练参数以及并联机器人运动过程中的补偿参数；虚拟孪生层包括并联机器人数字孪生模型，软件平台界面，移动控制系统；动态协调层对当前轨迹误差与速度误差结合动态协调强化学习中的最优策略进行协调，负责对物理实体层与虚拟孪生层的反馈协调，形成总体的闭环交互系统。The real-time synchronous interaction system includes: physical entity layer, information interaction layer, error feedback layer, data accumulation layer, virtual twin layer, data processing layer, dynamic coordination layer, virtual and real interaction module and dynamic coordination reinforcement learning module; the physical entity layer includes parallel robots Ontology, communication port; the information interaction layer is used to control the communication linkage between the running software and the digital twin model; the error feedback layer is used to eliminate the synchronization error of the optimal trajectory displacement and speed of the physical entity layer and the virtual twin layer; the data accumulation layer establishes database information , saves the training parameters of the DQN network and the compensation parameters during the movement of the parallel robot; the virtual twin layer includes the parallel robot digital twin model, software platform interface, and mobile control system; the dynamic coordination layer combines dynamic coordination reinforcement learning with the current trajectory error and speed error. It coordinates the optimal strategy in the system and is responsible for the feedback coordination between the physical entity layer and the virtual twin layer to form an overall closed-loop interactive system.

并联机器人物理实体与数字孪生模型的实时同步交互的过程为：The process of real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model is:

启动物理实体层，使机器人做周期性运动，通过通信接口进入信息交互层；启动虚拟孪生层，动态协调强化学习模块进行最优轨迹位移，将多余的数据信息在数据处理层进行剔除；Start the physical entity layer to make the robot make periodic movements and enter the information interaction layer through the communication interface; start the virtual twin layer to dynamically coordinate the reinforcement learning module for optimal trajectory displacement, and eliminate redundant data information in the data processing layer;

信息交互层将物理实体层中的动力学参数进行解析，转换为数字信息传输到误差反馈层；同时动态协调强化学习模块进行奖励值的计算与累加，将当前的数据传送到数据累计层与误差反馈层；The information interaction layer analyzes the dynamic parameters in the physical entity layer, converts them into digital information and transmits them to the error feedback layer; at the same time, it dynamically coordinates the reinforcement learning module to calculate and accumulate reward values, and transmits the current data to the data accumulation layer and error feedback layer;

以上所述，仅为本发明较佳的具体实施方式，但本发明的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本发明揭露的技术范围内，可轻易想到的变化或替换，都应该涵盖在本发明的保护范围之内。因此，本发明的保护范围应以所述权利要求的保护范围为准。The above are only preferred specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any person familiar with the technical field can easily think of changes or modifications within the technical scope disclosed in the present invention. All substitutions should be within the scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims

1. The dynamic compensation method for the digital twin track error of the parallel robot is characterized by comprising the following steps of:

s1: constructing a virtual model of the parallel robot, and performing rod member constraint and space constraint on the virtual model to generate a digital twin model of the parallel robot;

s2: carrying out dynamic analysis on the digital twin model, collecting motor torque and kinetic energy parameters of the parallel robot, converting the motor torque and kinetic energy parameters into track displacement and speed parameters, and constructing a track sample data set of the parallel robot;

s3: training and iterating the digital twin model by a dynamic coordination reinforcement learning method to obtain an optimal motion track under optimal displacement and speed;

s4: compensating the speed error between the physical entity of the parallel robot and the optimized digital twin model by a symmetrical error compensation method;

s5: and integrating the digital twin frames to realize real-time synchronous interaction between the physical entity of the parallel robot and the digital twin model.

2. The method for dynamically compensating digital twin track errors of parallel robot according to claim 1, wherein the method comprises the following steps: in step S2, a parallel robot dynamics set is constructed to obtain the track displacement of the parallel robotAnd speed->The method comprises the steps of carrying out a first treatment on the surface of the And establishing a track displacement and speed mathematical model of the digital twin model, collecting parallel robot track sample data, and collecting N sample data point sets as a parallel robot track sample data set.

3. The method for dynamically compensating digital twin track errors of parallel robot according to claim 2, wherein the method comprises the following steps: the step S3 specifically comprises the following steps:

s31: constructing a track displacement DQN sub-network and a speed DQN sub-network: an optimal displacement model and an optimal velocity model;

s32: dividing a track sample data set into track displacement and speed sample data sets, training the DQN sub-networks, introducing dynamic coordination coefficients, changing track strategies, and dynamically coordinating the displacement Q values and the speed Q values of the two DQN sub-networks through the improved optimal track strategies;

s33: and constructing a DQN total network for acquiring the optimal motion trail, inputting a displacement Q value and a speed Q value, and performing training iteration on the DQN total network to finally obtain the optimal motion trail under the optimal displacement and the speed.

4. A method for dynamically compensating digital twin track error of a parallel robot according to claim 3, wherein: in step S32, the improved optimal strategyThe formula of (2) is:

in the method, in the process of the invention,and->Dynamic coordination coefficients under the strategy; />Is an initial trajectory strategy; />Is displacement error; />Is a speed error; />Solving a function for a minimum value; />To execute the initial trajectory strategy->Is a Q value of (C).

5. The method for dynamically compensating digital twin track errors of parallel robot according to claim 4, wherein the method comprises the following steps: in step S32, the digital twin model and the physical entity are inTime displacement error->The method comprises the following steps:

in the method, in the process of the invention,representing a digital twin model at +.>Position coordinates of the end effector operation at the moment;representing physical entity in->Position coordinates of the end effector operation at the moment;

digital twin model and physical entitySpeed error +.>The method comprises the following steps:

in the method, in the process of the invention,representing that the mathematical twin model of the parallel robot is +.>Speed of end effector at time, +.>Representing that the physical entity of the parallel robot is +.>The speed of the end effector at the moment in time.

6. The method for dynamically compensating digital twin track errors of parallel robot according to claim 5, wherein the method comprises the following steps:as an output of the optimal displacement model and feedback acting on the optimal displacement model, the displacement error is finally output to be the smallest>As a displacement Q value; />As an output of the optimal speed model and feedback is applied to the optimal speed model, the final output is the smallest speed error +.>As the speed Q value.

7. The method for dynamically compensating digital twin track errors of parallel robot according to claim 6, wherein the method comprises the following steps: in the DQN total network, defining the minimum track integrated error as:

in the middle ofIs the overall dynamic balance coefficient.

8. The method for dynamically compensating digital twin track errors of parallel robot according to claim 7, wherein the method comprises the following steps: the parallel robot physical entity and the digital twin model have time delay, and in order to ensure the consistency of the two operation speeds, the speed error is compensated according to the speed symmetry concept, and the method specifically comprises the following steps:

s41: synchronizing the operation period of the digital twin model with the time period in the physical entity movement process, wherein the operation period of the digital twin model comprises 5 links which are sequentially: starting an acceleration link, a uniform acceleration to uniform deceleration link, a uniform deceleration link and a final braking deceleration link; dividing the same operation period by the physical entities of the parallel robot;

s42: enabling the physical entity to perform periodic motion according to the optimal motion trail of the digital twin model, and setting a state node to correspond to the gate-type trail motion;

s43: the gate-type motion trail is in a symmetrical shape, and motion with consistent transformation rule in the period is divided by adopting a symmetrical error compensation method; the uniform deceleration link and the final braking deceleration link are simplified into speed symmetrical models of the uniform acceleration link and the starting acceleration link, the center line is taken as a reference, the left side is an acceleration trend speed model, the right side is a deceleration trend speed model, and meanwhile, the speed strategy of the digital twin model and the physical entity is changed, so that the movement trend of the digital twin model and the physical entity is consistent;

s44: in the starting acceleration link, the uniform acceleration to uniform deceleration link and the final braking deceleration link, the error compensation is linear error compensation; in the uniform acceleration link and the uniform deceleration link, the error compensation is arc error compensation.

9. The method for dynamically compensating digital twin track errors of parallel robot according to claim 8, wherein the method comprises the following steps: in step S5, the real-time synchronous interaction system of the parallel robot physical entity and the digital twin model includes: the system comprises a physical entity layer, an information interaction layer, an error feedback layer, a data accumulation layer, a virtual twin layer, a data processing layer, a dynamic coordination layer, a virtual-real interaction module and a dynamic coordination reinforcement learning module;

the physical entity layer comprises a parallel robot body and a communication port; the information interaction layer is used for controlling communication linkage between the running software and the digital twin model; the error feedback layer is used for eliminating the synchronous error of the optimal track displacement and the optimal track speed of the physical entity layer and the virtual twin layer; data accumulation establishes database information, and saves training parameters of the DQN network and compensation parameters in the parallel robot motion process; the virtual twin layer comprises a parallel robot digital twin model, a software platform interface and a mobile control system; the dynamic coordination layer coordinates the current track error and the speed error in combination with the optimal strategy in dynamic coordination reinforcement learning, and is responsible for feedback coordination of the physical entity layer and the virtual twin layer to form an overall closed-loop interaction system.

10. The method for dynamically compensating digital twin track errors of parallel robot according to claim 9, wherein the method comprises the following steps: the real-time synchronous interaction process of the physical entity of the parallel robot and the digital twin model comprises the following steps:

starting a physical entity layer to enable the parallel robot to do periodic motion, and entering an information interaction layer through a communication interface; starting a virtual twin layer, dynamically coordinating and strengthening the learning module to perform optimal track displacement, and removing redundant data information in a data processing layer;

the information interaction layer analyzes the kinematic parameters in the physical entity layer, converts the kinematic parameters into digital information and transmits the digital information to the error feedback layer; meanwhile, the dynamic coordination reinforcement learning module calculates and accumulates the rewarding value and transmits the current data to the data accumulation layer and the error feedback layer;

the error feedback layer analyzes the synchronous result, achieves synchronous compensation of the track speed on the basis of synchronous track displacement, obtains smaller speed and displacement error, and achieves synchronous control effect of the high-precision digital twin model and the object entity;

and finally, the historical information in the data accumulation layer and the information in the current error feedback layer are dynamically coordinated, and are fed back to the physical entity layer and the virtual twin layer in a closed loop mode to complete the virtual-real interaction process.