CN106444738A

CN106444738A - Mobile robot path planning method based on dynamic motion primitive learning model

Info

Publication number: CN106444738A
Application number: CN201610348356.3A
Authority: CN
Inventors: 陈洋; 姜明浩; 吴怀宇; 程磊; 李威凌; 谭艳平
Original assignee: Wuhan University of Science and Technology WHUST
Current assignee: Wuhan University of Science and Technology WHUST
Priority date: 2016-05-24
Filing date: 2016-05-24
Publication date: 2017-02-22
Anticipated expiration: 2036-05-24
Also published as: CN106444738B

Abstract

The invention discloses a path planning method for a mobile robot based on a learning model of a dynamic motion primitive. First use the handle to control the movement of the robot and record the trajectory of the robot. Then, the recorded trajectory is used as a sample of the dynamic motion primitive model, and the parameters of the dynamic motion primitive model are obtained by establishing the dynamic motion primitive model and using the trajectory samples for training, so as to realize the autonomous path planning of the robot. On this basis, the target position of the robot movement is changed to complete the generalization of the new target. The path planning method of the present invention improves the intelligence level of the mobile robot. When the target position of the robot movement changes, the robot can reach the new target position autonomously, that is, the robot can complete not for a certain specified task, but for other tasks It also has the ability of generalization; and the combination of the online learning feature of the dynamic motion primitive model and its autonomous obstacle avoidance function improves the efficiency of path planning.

Description

Path Planning Method for Mobile Robots Based on Learning Model of Dynamic Motion Primitives

技术领域technical field

本发明涉及移动机器人路径规划领域，具体是一种基于动态运动基元学习模型的移动机器人路径规划方法。The invention relates to the field of path planning for mobile robots, in particular to a path planning method for mobile robots based on a learning model of dynamic motion primitives.

背景技术Background technique

路径规划是移动机器人的关键技术之一，它在一定程度上标志着移动机器人智能水平的高低，能快速找出一条便捷、无碰撞的路径不仅保证了移动机器人自身的安全，更体现了机器人的高效性和可靠性。Path planning is one of the key technologies of mobile robots. To a certain extent, it marks the intelligence level of mobile robots. Being able to quickly find a convenient and collision-free path not only ensures the safety of the mobile robot itself, but also reflects the robot's Efficiency and reliability.

目前，常用到的机器人路径规划方法有人工势场法、模糊逻辑模型、遗传等模型。人工势场法是路径规划模型中较成熟且较高效的规划方法，以其简单的数学计算被广泛使用。但是传统的人工势场法存在局部极小点和目标不可达等问题。目前，有多种解决局部极小点的办法，如启发式搜索，随机逃走法等，但这些改进的人工势场法只是对机器人施加附加的控制力，没有从根本上解决问题。遗传模型是一种基于遗传和自然选择的多搜索模型，具有鲁棒、灵活、在种群中搜索不易落入局部最小点等优点。但遗传模型在进行机器人路径规划时存在种群规模大、搜索空间大、容易陷入局部极小点、收敛速度慢等问题。At present, the commonly used robot path planning methods include artificial potential field method, fuzzy logic model, genetic and other models. The artificial potential field method is a relatively mature and efficient planning method in the path planning model, and is widely used for its simple mathematical calculation. However, the traditional artificial potential field method has problems such as local minimum points and unreachable targets. At present, there are many ways to solve the local minimum point, such as heuristic search, random escape method, etc., but these improved artificial potential field methods only exert additional control force on the robot, and do not fundamentally solve the problem. The genetic model is a multi-search model based on genetic and natural selection, which has the advantages of robustness, flexibility, and the search in the population is not easy to fall into a local minimum point. However, the genetic model has problems such as large population size, large search space, easy to fall into local minimum point, and slow convergence speed when performing robot path planning.

以上传统的机器人路径规划模型主要存在以下两个方面的问题：The above traditional robot path planning model mainly has the following two problems:

(1)任务是特定的，仅仅针对某一任务有很好性能，而不具有泛化推广能力；(1) The task is specific, and it only has good performance for a certain task, but does not have the ability to generalize;

(2)学习往往是离线的，这就导致了对新的场景要重新训练学习，实时性很差。(2) Learning is often offline, which leads to retraining and learning for new scenes, and the real-time performance is poor.

发明内容Contents of the invention

本发明要解决的技术问题是：针对上述的移动机器人路径规划方法中所存在的实时性差，以及移动机器人完成任务单一的问题，提出一种基于动态运动基元学习模型的路径规划方法。能够实时搜索路径，与其自主避障功能结合起来能有效地提高路径规划的效率，此外，机器人在完成新的任务时，可以不用重新训练样本而保持原来样本轨迹的特性到达新的目标位置。The technical problem to be solved by the present invention is to propose a path planning method based on a learning model of dynamic motion primitives in view of the poor real-time performance and the single task of the mobile robot in the above path planning method for mobile robots. The ability to search for paths in real time, combined with its autonomous obstacle avoidance function, can effectively improve the efficiency of path planning. In addition, when the robot completes new tasks, it can maintain the characteristics of the original sample trajectory to reach the new target position without retraining samples.

为解决上述技术问题，本发明提供如下技术方案：In order to solve the above technical problems, the present invention provides the following technical solutions:

一种基于动态运动基元学习模型的移动机器人路径规划方法，其特征在于主要包括如下步骤：A method for path planning of a mobile robot based on a dynamic motion primitive learning model, characterized in that it mainly includes the following steps:

步骤1：对机器人运动的二维环境进行建模，模拟机器人运动的二维环境界面，机器人用小实心圆来代替，障碍物为各种平面图形；Step 1: Model the two-dimensional environment of the robot's movement, and simulate the two-dimensional environment interface of the robot's movement. The robot is replaced by a small solid circle, and the obstacles are various plane figures;

步骤2：利用手柄对机器人的操控，使机器人能从起点避免与障碍物碰撞而到达目标点；Step 2: Use the handle to control the robot so that the robot can avoid collision with obstacles and reach the target point from the starting point;

步骤3：在步骤2进行的过程中，采集机器人运动轨迹数据作为动态运动基元学习模型的样本点，所述机器人运动轨迹数据包括位移、速度和加速度；Step 3: During the process of step 2, collect the robot motion trajectory data as the sample points of the dynamic motion primitive learning model, and the robot motion trajectory data includes displacement, velocity and acceleration;

步骤4：根据步骤3得到的机器人运动轨迹时的位移、速度和加速度数据，将这些数据作为训练样本，通过动态运动基元算法对样本进行训练得到机器人运动轨迹所对应的最佳权重值；Step 4: According to the displacement, velocity and acceleration data of the robot's trajectory obtained in step 3, these data are used as training samples, and the samples are trained by the dynamic motion primitive algorithm to obtain the optimal weight value corresponding to the robot's trajectory;

步骤5：针对特定任务设置初始参数，所述初始参数包括机器人运动的起点和终点，根据步骤4得到的最佳权重值，规划出通过动态运动基元模型学习后的路径，该路径具有原样本轨迹的特性，即起点和终点一致，并且其运行轨迹与样本轨迹大致相同；Step 5: Set initial parameters for specific tasks, the initial parameters include the starting point and end point of robot motion, and plan the path after learning through the dynamic motion primitive model according to the optimal weight value obtained in step 4, which has the original sample The characteristics of the trajectory, that is, the starting point and the ending point are consistent, and its running trajectory is roughly the same as the sample trajectory;

步骤6：在步骤5的基础上，加入圆形障碍物，并且在原有的动力学方程中加入耦合项，从而构建带有避障功能的动力学系统，实现动态运动基元学习模型的自主避障功能；Step 6: On the basis of step 5, add circular obstacles, and add coupling terms to the original dynamic equation, so as to construct a dynamic system with obstacle avoidance function, and realize the autonomous avoidance of the dynamic motion primitive learning model dysfunction;

步骤7：在步骤5的基础上，改变机器人运动的目标位置，在不重新训练样本的前提下，仅仅改变目标位置的参数，机器人仍能自主到达新的目标点位置，即机器人可以完成不针对某一指定任务，而对于其他的任务也具有泛化推广的能力。Step 7: On the basis of step 5, change the target position of the robot movement, and only change the parameters of the target position without retraining samples, and the robot can still reach the new target point position autonomously, that is, the robot can complete untargeted A specific task, but also has the ability to generalize to other tasks.

上述技术方案中，步骤1中对机器人运动的二维环境进行建模，建模的要求为：移动机器人的活动范围在一个有限的二维空间；以移动机器人的尺寸为基准，将障碍物的尺寸向外扩展，将机器人看作一个质点；障碍物由各种平面图形组成，数目有限，并且在机器人移动过程中这些障碍物不会发生变化和移动。In the above technical solution, in step 1, the two-dimensional environment of the robot's movement is modeled, and the requirements for modeling are: the range of motion of the mobile robot is in a limited two-dimensional space; The size expands outwards, and the robot is regarded as a particle; the obstacles are composed of various plane figures, the number is limited, and these obstacles will not change and move during the movement of the robot.

上述技术方案中，步骤2具体过程如下：In the above technical solution, the specific process of step 2 is as follows:

步骤2-1：读取机器人手柄的数据，当手柄向上下或左右推动时，该界面实时的显示机器人在建模环境中运动的位移、速度和加速度；Step 2-1: Read the data of the robot handle. When the handle is pushed up and down or left and right, the interface will display the displacement, speed and acceleration of the robot in the modeling environment in real time;

步骤2-2：遥控手柄，人为的规划出一条机器人能从起始点到达终点的最优路径，考虑到机器人一般只能前后和左右运动，因此规划出来的路径也是前后或者左右运动的路径，规划出来的轨迹也叫做样本轨迹；Step 2-2: Remote control handle, artificially plan an optimal path for the robot to reach the end point from the starting point. Considering that the robot can only move back and forth and left and right, the planned path is also the path of front and back or left and right movement. Planning The resulting trajectory is also called the sample trajectory;

步骤2-3：在规划路径时，要避开障碍物，并且用数据保存的方法将样本轨迹的位移、速度和加速度的值记录下来，并作为样本数据。Step 2-3: When planning the path, avoid obstacles, and use the method of data saving to record the displacement, velocity and acceleration of the sample trajectory, and use it as sample data.

上述技术方案中，步骤4包括如下具体步骤：In the above technical solution, step 4 includes the following specific steps:

步骤4-1：建立动态运动基元的数学模型：动态运动基元一般用来形成离散的运动，对于单一的自由度位移y，引入带有恒定系数线性微分方程并称之为动力学系统，此系统作为对运动学习的基础：Step 4-1: Establish the mathematical model of dynamic motion primitives: dynamic motion primitives are generally used to form discrete motions. For a single degree of freedom displacement y, introduce a linear differential equation with constant coefficients and call it a dynamic system. This system serves as the basis for motor learning:

式中：In the formula:

x和v分别是系统的位移和速度；x₀和g分别是初始位置和目标位置；τ是时间伸缩因子；K是弹簧的弹性系数；D是系统处于临界状态下的阻尼系数；f是非线性函数，用于生成任意复杂的运动；x and v are the displacement and velocity of the system, respectively; x ₀ and g are the initial position and target position, respectively; τ is the time stretch factor; K is the elastic coefficient of the spring; D is the damping coefficient of the system in a critical state; f is the nonlinearity functions for generating arbitrarily complex movements;

步骤4-2：设置初始参数，机器人运动的起始点x₀和目标点g，时间常数τ，弹簧的弹性系数K，系统处于临界状态下的阻尼系数D；非线性函数f用于形成任意复杂的运动，定义f为：Step 4-2: Set the initial parameters, the starting point x ₀ and the target point g of the robot movement, the time constant τ, the elastic coefficient K of the spring, and the damping coefficient D when the system is in a critical state; the nonlinear function f is used to form any complex The motion of , define f as:

式中：In the formula:

ψ_i(s)是径向基核函数，i表示第i个径向基核函数ψ_i(s)，其取值范围是1到N，其中N表示径向基核函数的个数；径向基核函数定义为：ψ _i (s) is a radial basis kernel function, i represents the i-th radial basis kernel function ψ _i (s), and its value ranges from 1 to N, where N represents the number of radial basis kernel functions; The basic kernel function is defined as:

ψ_i(s)＝exp(-h_i(s-c_i)²) (4)ψ _i (s)＝exp(-h _i (sc _i ) ² ) (4)

式中：In the formula:

c_i是径向基核函数的中心，h_i>0且决定核函数的宽；其中 h_N＝h_N-1,i＝1,...N，α为任意正常数；c _i is the center of the radial basis kernel function, h _i >0 and determines the width of the kernel function; where h _N ＝h _N-1 , i=1,...N, α is any normal constant;

公式(3)中的函数f并不取决于时间参数，而是取决于相位变量s，s的表达形式为：The function f in formula (3) does not depend on the time parameter, but on the phase variable s, the expression of s is:

式中：In the formula:

s是关于时间t的函数，α为任意正常数，τ是时间伸缩因子；由方程(5)可知s是由1到0单调递减的，因此方程(5)称为正则系统；s is a function of time t, α is any normal constant, and τ is a time expansion factor; from equation (5), it can be seen that s is monotonically decreasing from 1 to 0, so equation (5) is called a regular system;

步骤4-3：将步骤3中得到的样本数据代入公式(1)和公式(2)中，因为正则系统是可积分的，即s可以根据参数τ计算出来，所以训练样本中的非线性扰动f′(s)可以表示成：Step 4-3: Substitute the sample data obtained in step 3 into formula (1) and formula (2), because the regular system is integrable, that is, s can be calculated according to the parameter τ, so the nonlinear disturbance in the training sample f'(s) can be expressed as:

根据最小误差准则函数J来求解最佳权重值w_i，其中最小误差准则函数的表达式为：According to the minimum error criterion function J to solve the optimal weight value w _i , where the expression of the minimum error criterion function is:

J＝∑_s(f′(s)-f(s))² (7)J＝∑s (f′( _s )-f(s)) ² (7)

当J取最小时的w_i就是最佳的权重值。When J is minimized, _wi is the best weight value.

上述技术方案中，步骤5包括如下具体步骤：In the above technical solution, step 5 includes the following specific steps:

步骤5-1：当机器人执行指定的任务时，设置机器人的起点位置与终点位置；Step 5-1: When the robot performs the specified task, set the start position and end position of the robot;

步骤5-2：样本数据是二维的，也即包括x轴方向上的数据和y轴方向上的数据，将x轴方向上的数据按照步骤4进行训练，得到x轴方向上的最佳权重值，代入步骤5-1中的起点和终点值，计算出x方向上通过动态运动基元模型学习后的位移、速度和加速度；Step 5-2: The sample data is two-dimensional, that is, it includes data in the x-axis direction and data in the y-axis direction. The data in the x-axis direction is trained according to step 4 to obtain the best data in the x-axis direction. The weight value is substituted into the starting point and end point value in step 5-1 to calculate the displacement, velocity and acceleration in the x direction after learning through the dynamic motion primitive model;

步骤5-3：将y轴方向上的数据按照步骤4进行训练，得到y轴方向上的最佳权重值，代入步骤5-1中的起点和终点值，计算出y方向上通过动态运动基元模型学习后的位移、速度和加速度；Step 5-3: Train the data in the y-axis direction according to step 4 to obtain the optimal weight value in the y-axis direction, substitute the starting point and end point values in step 5-1, and calculate the dynamic motion basis in the y-direction Displacement, velocity and acceleration after meta-model learning;

步骤5-4：读入步骤5-2和步骤5-3中得到的数据，分别得到x轴和y轴两个方向的运动数据，在二维平面上输出运动的轨迹仿真图，即完成基于动态运动基元学习模型对移动机器人的路径规划。Step 5-4: Read in the data obtained in step 5-2 and step 5-3, obtain the motion data in the x-axis and y-axis directions respectively, and output the motion trajectory simulation diagram on the two-dimensional plane, that is, complete the Path Planning for Mobile Robots by Learning Models of Dynamic Motion Primitives.

上述技术方案中，步骤6中所加入的障碍物是以(0.4,0.4)为圆心坐标，半径为0.1m的圆。In the above technical solution, the obstacle added in step 6 is a circle with (0.4,0.4) as the center coordinates and a radius of 0.1m.

本发明的方法开始以简单的线性动态系统(一组微分方程)开始研究，通过转换系统将简单的线性动态系统转换成非线性系统，并且通过吸引子来形成任意复杂的运动，这样就能较简单的对非线性系统进行研究。其中，用微分方程来表示的优点在于误差可以自动的被校正，而且微分方程都是以固定的格式形成的，按照这个固定的格式仅仅只需要简单的改变一个目标参数，就能适应新的环境，即可以对新目标进行泛化；基于动态运动基元学习的方法是在线学习的，对于新的情形不用重新学习，能实时的跟踪目标位置。因而，在避障方面上，通过构建带避障功能的动力学系统实现自主避障，并且动态运动基元模型的在线学习特征和其自主避障功能相结合提高了路径规划的效率。The method of the present invention begins to study with a simple linear dynamic system (a group of differential equations), converts the simple linear dynamic system into a nonlinear system through the conversion system, and forms any complex motion through the attractor, so that it can be compared Simple study of nonlinear systems. Among them, the advantage of using differential equations is that the error can be automatically corrected, and the differential equations are formed in a fixed format. According to this fixed format, only a simple change of a target parameter is required to adapt to the new environment. , that is, it can generalize new targets; the method based on dynamic motion primitive learning is online learning, and can track the target position in real time without re-learning for new situations. Therefore, in terms of obstacle avoidance, autonomous obstacle avoidance is realized by constructing a dynamic system with obstacle avoidance function, and the combination of online learning features of the dynamic motion primitive model and its autonomous obstacle avoidance function improves the efficiency of path planning.

与现有技术相比，本发明的有益效果是：Compared with prior art, the beneficial effect of the present invention is:

(1)本发明提出一种基于动态运动基元学习模型的移动机器人路径规划方法，该学习模型具有泛化推广能力，机器人在完成新的任务时，可以不用重新训练样本而保持原来样本轨迹的特性到达新的目标位置。(1) The present invention proposes a mobile robot path planning method based on a dynamic motion primitive learning model. The learning model has the ability of generalization. When the robot completes a new task, it can keep the original sample track without retraining the sample. The feature reaches the new target location.

(2)本发明提出的路径规划模型在搜索路径时是实时的，与其自主避障功能结合起来能有效地提高路径规划的效率。(2) The path planning model proposed by the present invention is real-time when searching for a path, and combining with its autonomous obstacle avoidance function can effectively improve the efficiency of path planning.

附图说明Description of drawings

图1是本发明基于动态运动基元学习模型的移动机器人路径规划过程示意图；Fig. 1 is the schematic diagram of the path planning process of the mobile robot based on the dynamic motion primitive learning model of the present invention;

图2是本发明中模拟机器人运动的二维环境；其中直线型轨迹代表样本轨迹；各种形状的图形(如矩形、圆形、椭圆形)表示二维环境中的障碍物；Fig. 2 is the two-dimensional environment of simulating robot motion among the present invention; Wherein the linear track represents the sample track; Graphics of various shapes (such as rectangle, circle, ellipse) represent the obstacle in the two-dimensional environment;

图3是本发明中动态运动基元学习模型的自主避障仿真图；Fig. 3 is the simulation diagram of autonomous obstacle avoidance of dynamic motion primitive learning model among the present invention;

图4是本发明中动态运动基元学习模型所具有泛化推广能力的仿真图；Fig. 4 is the emulation diagram of the generalization promotion ability that dynamic motion primitive learning model has among the present invention;

图5是训练的样本轨迹与通过动态运动基元模型学习后的轨迹对比图。Fig. 5 is a comparison diagram of the training sample trajectory and the trajectory learned by the dynamic motion primitive model.

具体实施方式detailed description

为了进一步说明本发明的技术方案，下面结合附图1-5对本发明进行详细的说明。In order to further illustrate the technical solution of the present invention, the present invention will be described in detail below in conjunction with accompanying drawings 1-5.

步骤1：模拟机器人运动的二维环境界面，其界面上机器人用小实心圆来代替，障碍物为各种平面图形；设置机器人运动的二维环境界面为正方形(长和宽都为1m)，机器人用一个直径为5mm的小实心圆来代替。Step 1: Simulate the two-dimensional environment interface of the robot movement, the robot on the interface is replaced by a small solid circle, and the obstacles are various plane figures; set the two-dimensional environment interface of the robot movement as a square (both length and width are 1m), The robot is replaced by a small solid circle with a diameter of 5mm.

步骤2：利用OPENCV(Open Source Computer Vision Library)实现手柄对机器人的操控，使机器人能从起点避免与障碍物碰撞到达目标点；Step 2: Use OPENCV (Open Source Computer Vision Library) to realize the control of the robot by the handle, so that the robot can avoid collision with obstacles from the starting point to reach the target point;

步骤2-1：基于MFC(Microsoft Foundation Classes)界面编写一个上位机软件，该软件可以读取机器人手柄的数据，当手柄向上下或左右推动时，该界面可以实时的显示机器人在建模环境中运动的位移、速度和加速度；Step 2-1: Write a host computer software based on the MFC (Microsoft Foundation Classes) interface. This software can read the data of the robot handle. When the handle is pushed up and down or left and right, the interface can display the robot in the modeling environment in real time. Movement displacement, velocity and acceleration;

步骤2-3：在规划路径时，要避开障碍物，并且用数据保存的方法将样本轨迹的位移、速度和加速度的值记录下来，并作为样本数据；Step 2-3: When planning the path, avoid obstacles, and use the method of data saving to record the displacement, velocity and acceleration of the sample trajectory, and use it as sample data;

步骤2中使用的是基于MFC编写的上位机软件,通过对手柄的操控就能实现对机器人的控制。其中设置手柄推杆的位移为机器人运动时的速度大小，其中控制机器人运动速度的范围为-5mm/s～5mm/s。In step 2, the upper computer software based on MFC is used, and the robot can be controlled by manipulating the handle. The displacement of the push rod of the handle is set as the speed of the robot when it is moving, and the range of controlling the moving speed of the robot is -5mm/s～5mm/s.

步骤3：在步骤2进行的过程中，采集机器人运动轨迹数据作为动态运动基元学习模型的样本点，其中机器人运动轨迹数据包括其位移、速度和加速度值的大小；Step 3: During the process of step 2, collect the robot motion trajectory data as the sample points of the dynamic motion primitive learning model, wherein the robot motion trajectory data includes the size of its displacement, velocity and acceleration values;

步骤4：根据步骤3得到的机器人运动轨迹时的位移、速度和加速度，将这些数据作为DMP学习模型的训练样本，通过对样本的训练得到机器人运动轨迹所对应的最佳权重值；Step 4: According to the displacement, velocity and acceleration of the robot trajectory obtained in step 3, these data are used as the training samples of the DMP learning model, and the optimal weight value corresponding to the robot trajectory is obtained by training the samples;

步骤4-1：建立动态运动基元的数学模型。动态运动基元一般用来形成离散的运动，对于单一的自由度位移y，引入带有恒定系数线性微分方程并称之为动力学系统，此系统作为对运动学习的基础：Step 4-1: Establish a mathematical model of dynamic motion primitives. Dynamic motion primitives are generally used to form discrete motions. For a single degree of freedom displacement y, a linear differential equation with a constant coefficient is introduced and called a dynamic system. This system serves as the basis for motion learning:

式中：In the formula:

步骤4-2：设置初始参数，机器人运动的起始点x₀和目标点g，时间常数τ，弹簧的弹性系数K，系统处于临界状态下的阻尼系数D；非线性函数f用于形成任意复杂的运动，定义为：Step 4-2: Set the initial parameters, the starting point x ₀ and the target point g of the robot movement, the time constant τ, the elastic coefficient K of the spring, and the damping coefficient D when the system is in a critical state; the nonlinear function f is used to form any complex The motion of is defined as:

式中：In the formula:

ψ_i(s)＝exp(-h_i(s-c_i)²) (4)ψ _i (s)＝exp(-h _i (sc _i ) ² ) (4)

式中：In the formula:

步骤4-3：将步骤3中得到的样本数据代入上述公式中，因为正则系统是可积分的，即s可以根据参数τ计算出来，所以训练样本中的非线性扰动f′(s)可以表示成：Step 4-3: Substitute the sample data obtained in step 3 into the above formula, because the regular system is integrable, that is, s can be calculated according to the parameter τ, so the nonlinear disturbance f'(s) in the training sample can be expressed as become:

J＝∑_s(f′(s)-f(s))² (7)J＝∑s (f′( _s )-f(s)) ² (7)

当J取最小时的w_i就是最佳的权重值；When J takes the minimum, _wi is the best weight value;

步骤5：针对特定任务设置初始参数(机器人运动的起点和终点)，根据步骤4得到的最佳权重值，规划出通过动态运动基元模型学习后的路径，该路径具有原样本轨迹的特性；Step 5: Set the initial parameters (starting point and end point of the robot motion) for a specific task, and plan the path learned by the dynamic motion primitive model according to the optimal weight value obtained in step 4, which has the characteristics of the original sample trajectory;

步骤5-2：样本数据是二维的(x轴方向上的数据和y轴方向上的数据)，将x轴方向上的数据按照步骤4进行训练，得到x轴方向上的最佳权重值，代入步骤5-1中的起点和终点值，就可以计算出x方向上通过动态运动基元模型学习后的位移、速度和加速度；Step 5-2: The sample data is two-dimensional (data in the x-axis direction and data in the y-axis direction), and the data in the x-axis direction is trained according to step 4 to obtain the optimal weight value in the x-axis direction , by substituting the start and end values in step 5-1, the displacement, velocity and acceleration in the x direction after learning through the dynamic motion primitive model can be calculated;

步骤5-3：将y轴方向上的数据按照步骤4进行训练，得到y轴方向上的最佳权重值，代入步骤5-1中的起点和终点值，就可以计算出y方向上通过动态运动基元模型学习后的位移、速度和加速度；Step 5-3: Train the data in the y-axis direction according to step 4 to obtain the optimal weight value in the y-axis direction, and substitute the starting point and end point values in step 5-1 to calculate the dynamic weight in the y-axis direction. Displacement, velocity and acceleration after learning the motion primitive model;

步骤5-4：将步骤5-2和步骤5-3中得到的数据通过MATLAB读入，得到x轴和y轴两个方向的运动数据，在二维平面上输出运动的轨迹仿真图，即完成基于动态运动基元学习模型对移动机器人的路径规划。Step 5-4: Read the data obtained in step 5-2 and step 5-3 into MATLAB to obtain the motion data in the directions of x-axis and y-axis, and output the motion trajectory simulation diagram on the two-dimensional plane, namely Complete the path planning of the mobile robot based on the dynamic motion primitive learning model.

步骤6：在步骤5的基础上，加入圆形障碍物，并且在原有的动力学方程中加入耦合项，从而构建带有避障功能的动力学系统，实现了动态运动基元学习模型的自主避障功能；Step 6: On the basis of step 5, add circular obstacles, and add coupling terms to the original dynamic equation, so as to construct a dynamic system with obstacle avoidance function, and realize the autonomy of the dynamic motion primitive learning model Obstacle avoidance function;

步骤6-1：在步骤5-4的基础上，加入圆形障碍物，其中障碍物是以(0.4,0.4)为圆心坐标，半径为0.1m的圆；Step 6-1: On the basis of step 5-4, add a circular obstacle, where the obstacle is a circle with (0.4,0.4) as the center coordinates and a radius of 0.1m;

步骤6-2：在步骤4-1给出的动力学系统方程的基础上加入耦合项P(x,v)来构建带有避障功能的动力学系统，其中耦合项P(x,v)的表达式为：Step 6-2: On the basis of the dynamic system equation given in step 4-1, add the coupling item P(x,v) to construct a dynamic system with obstacle avoidance function, where the coupling item P(x,v) The expression is:

式中：In the formula:

是以为轴，为旋转角的旋转矩阵，矢量是障碍物的位置，γ与β是常量，θ是轨迹上的点与障碍物的距离矢量与轨迹上那一点的相对速度之间的夹角； so for the axis, is the rotation matrix for the rotation angle, vector is the position of the obstacle, γ and β are constants, θ is the angle between the distance vector between the point on the trajectory and the obstacle and the relative velocity of that point on the trajectory;

步骤6-3：给定耦合项P(x,v)公式中的常数项初值，其中γ＝8，旋转矩阵R表示成：Step 6-3: Given the initial value of the constant term in the coupling term P(x, v) formula, where γ=8, The rotation matrix R is expressed as:

步骤6-4：通过构建带有避障功能的动力学系统，加入步骤6-1中所述的障碍物，机器人仍能避开障碍物到达目标点，其中带有避障功能的动力学系统的数学表达式为：Step 6-4: By building a dynamic system with obstacle avoidance function and adding the obstacles described in step 6-1, the robot can still avoid obstacles to reach the target point, and the dynamic system with obstacle avoidance function The mathematical expression of is:

由图3可以看出，动态运动基元模型具有自主避障的功能；It can be seen from Figure 3 that the dynamic motion primitive model has the function of autonomous obstacle avoidance;

步骤7：在步骤5的基础上，改变机器人运动的目标位置，在不重新训练样本的前提下，仅仅改变目标位置的参数，机器人仍能自主的到达新的目标点位置，即机器人可以完成不针对某一指定任务，而对于其他的任务也具有泛化推广的能力。Step 7: On the basis of step 5, change the target position of the robot movement, and only change the parameters of the target position without retraining samples, the robot can still reach the new target point position autonomously, that is, the robot can complete different For a specific task, it also has the ability to generalize to other tasks.

步骤7-1：在步骤5-4的基础上，改变机器人目标点的位置为(0.5,0.5)，代入步骤4，得到在不重新训练样本的前提下，动态运动基元模型学习后的轨迹；Step 7-1: On the basis of step 5-4, change the position of the robot target point to (0.5,0.5), substitute into step 4, and obtain the trajectory after learning the dynamic motion primitive model without retraining samples ;

步骤7-2：在步骤5-4的基础上，改变机器人目标点的位置为(0.8,0.8)，代入步骤4，得到在不重新训练样本的前提下，动态运动基元模型学习后的轨迹；Step 7-2: On the basis of step 5-4, change the position of the robot target point to (0.8,0.8), and substitute it into step 4 to obtain the trajectory after learning the dynamic motion primitive model without retraining samples ;

步骤7-3：由图4可知，在步骤7-1和步骤7-2中，机器人可以到达新的目标位置，并且保持原样本轨迹的特性，因此证明了动态运动基元学习模型所具有的泛化推广能力；Step 7-3: As can be seen from Figure 4, in steps 7-1 and 7-2, the robot can reach the new target position and maintain the characteristics of the original sample trajectory, thus proving that the dynamic motion primitive learning model has Generalization ability;

综上，本发明基于动态运动基元学习模型实现对机器人的路径规划，该学习模型的在线学习特征和其自主避障功能相结合提高了路径规划的效率，并且该模型具有泛化推广能力。本发明的提出，提高了移动机器人的智能性，为移动机器人在路径规划、避障与导航等相关领域提供了参考。In summary, the present invention implements path planning for robots based on a dynamic motion primitive learning model. The combination of online learning features of the learning model and its autonomous obstacle avoidance function improves the efficiency of path planning, and the model has generalization capabilities. The proposal of the present invention improves the intelligence of the mobile robot, and provides reference for the mobile robot in related fields such as path planning, obstacle avoidance and navigation.

Claims

1. the method for planning path for mobile robot based on dynamic motion primitive learning model, it is characterised in that mainly include Following steps：

Step 1：The two-dimensional environment of robot motion is modeled, the two-dimensional environment interface of dummy robot's motion, robot Replacing by little filled circles, barrier is various planar graphs；

Step 2：Utilize the manipulation to robot for the handle, make robot can avoid colliding with barrier from starting point and reach target Point；

Step 3：During step 2 is carried out, gather robot motion trace data as dynamic motion primitive learning model Sample point, described robot motion's track data includes displacement, speed and acceleration；

Step 4：These data are made by the displacement during robot motion's track obtaining according to step 3, speed and acceleration information It for training sample, is trained obtaining the best weights corresponding to robot motion's track to sample by dynamic motion primitive algorithm Weight values；

Step 5：Arranging initial parameter for particular task, described initial parameter includes the beginning and end of robot motion, root The best weights weight values obtaining according to step 4, the path after cooking up by the study of dynamic motion basic-element model, this path has former state The characteristic of this track, i.e. beginning and end are consistent, and its running orbit is roughly the same with sample trace；

Step 6：On the basis of step 5, add circular barrier, and in original kinetics equation, add coupling terms, Thus build the dynamic system with barrier avoiding function, it is achieved the automatic obstacle avoiding function of dynamic motion primitive learning model；

Step 7：On the basis of step 5, change the target location of robot motion, on the premise of not re-training sample, Only changing the parameter of target location, robot remains to independently reach new aiming spot, i.e. robot can complete not pin To a certain appointed task, and other task is also had to the ability of extensive popularization.

2. the method for planning path for mobile robot based on dynamic motion primitive learning model according to claim 1, its It is characterised by：In step 1 being modeled the two-dimensional environment of robot motion, the requirement of modeling is：The activity of mobile robot Scope is at a limited two-dimensional space；With move robot size as benchmark, by the size outward expansion of barrier, by machine Device people regards a particle as；Barrier is made up of various planar graphs, Limited Number, and in robot moving process these Barrier will not change and move.

3. the method for planning path for mobile robot based on dynamic motion primitive learning model according to claim 1, its It is characterised by：Step 2 detailed process is as follows：

Step 2-1：The data of read machine people's handle, when handle to up and down or left and right promote when, this interface shows machine in real time Displacement, speed and the acceleration that device people moves in modeling environment；

Step 2-2：Remote-control handle, artificial cooks up the optimal path that a robot can reach home from starting point, it is considered to To robot typically can only before and after and side-to-side movement, before and after the path therefore planning out is also or the path of side-to-side movement, Planning track out is also referred to as sample trace；

Step 2-3：When path planning, avoiding obstacles, and by the method for data preservation by the displacement of sample trace, speed The value of degree and acceleration is recorded, and as sample data.

4. the method for planning path for mobile robot based on dynamic motion primitive learning model according to claim 1, its It is characterised by：Step 4 comprises the following specific steps that：

Step 4-1：Set up the Mathematical Modeling of dynamic motion primitive：Dynamic motion primitive is generally used to be formed discrete motion, right In single free degree displacement y, introduce with constant coefficients linear differential equation referred to as dynamic system, this system conduct Basis to motor learning：

τ \overset{\cdot}{v} = K (g - x) - D v - K (g - x_{0}) s + K f (s) - - - (1)

τ \overset{\cdot}{x} = v - - - (2)

In formula：

X and v is displacement and the speed of system respectively；x₀It is initial position and target location respectively with g；τ is time contraction-expansion factor；K It is the coefficient of elasticity of spring；D is the damped coefficient that system is under critical condition；F is nonlinear function, is used for generating arbitrarily multiple Miscellaneous motion；

Step 4-2：Initial parameter is set, starting point x of robot motion₀With impact point g, timeconstantτ, the elastic system of spring Number K, system is in the damped coefficient D under critical condition；Nonlinear function f is for forming arbitrarily complicated motion, and definition f is：

f (s) = \frac{Σ_{i = 1}^{N} ψ_{i} (s) w_{i}}{Σ_{i = 1}^{N} ψ_{i} (s)} s - - - (3)

In formula：

ψ_iS () is Radial basis kernel function, i represents i-th Radial basis kernel function ψ_iS (), its span is 1 to N, and wherein N represents The number of Radial basis kernel function；Radial basis kernel function is defined as：

ψ_i(s)=exp (-h_i(s-c_i)²) (4)

In formula：

c_iIt is the center of Radial basis kernel function, h_i>0 and determine kernel function width；Wherein h_N=h_N-1, i=1 ... N, α are any normal number；

Function f in formula (3) is not dependent on time parameter, and is depending on phase variant s, and the expression-form of s is：

τ \overset{\cdot}{s} = - α s - - - (5)

In formula：

S is the function with regard to time t, and α is any normal number, and τ is time contraction-expansion factor；Equation (5) is understood that s is single by 1 to 0 Tune successively decreases, and therefore equation (5) is referred to as cannoncial system；

Step 4-3：The sample data obtaining in step 3 is substituted in formula (1) and formula (2), because cannoncial system is to amass Point, i.e. s can calculate according to parameter τ, so nonlinear disturbance f ' (s) in training sample can be expressed as：

f^{'} (s) = \frac{τ \overset{\cdot}{v} + D v}{K} - (g - x) + (g - x_{0}) s - - - (6)

Solve best weights weight values w according to minimum error principle function J_i, wherein the expression formula of minimum error principle function is：

J=∑_s(f′(s)-f(s))²(7)

W when J takes minimum_iIt is exactly optimal weighted value.

5. the method for planning path for mobile robot based on dynamic motion primitive learning model according to claim 1, its It is characterised by：Step 5 comprises the following specific steps that：

Step 5-1：When robot performs specifying of task, start position and the final position of robot are set；

Step 5-2：Sample data is two-dimentional, namely includes the data on x-axis direction and the data on y-axis direction, by x-axis side Data upwards are trained according to step 4, obtain the best weights weight values on x-axis direction, substitute into the starting point in step 5-1 and end Point value, calculates x side upwardly through the displacement after the study of dynamic motion basic-element model, speed and acceleration；

Step 5-3：Data on y-axis direction are trained according to step 4, obtain the best weights weight values on y-axis direction, substitute into Beginning and end value in step 5-1, calculate y side upwardly through dynamic motion basic-element model study after displacement, speed and Acceleration；

Step 5-4：Read in the data obtaining in step 5-2 and step 5-3, respectively obtain the motion number of x-axis and y-axis both direction According to exporting the track emulation figure of motion on two dimensional surface, i.e. complete based on dynamic motion primitive learning model to mobile machine The path planning of people.

6. the method for planning path for mobile robot based on dynamic motion primitive learning model according to claim 1, its It is characterised by：The barrier being added in step 6 is to be central coordinate of circle with (0.4,0.4), and radius is the circle of 0.1m.