CN115016506A

CN115016506A - Heterogeneous multi-agent network cooperative scheduling planning method based on autonomous obstacle avoidance

Info

Publication number: CN115016506A
Application number: CN202210854751.4A
Authority: CN
Inventors: 程翔; 李鹏; 陈仕韬; 魏平; 郑南宁
Original assignee: Peking University; Xian Jiaotong University
Current assignee: Peking University; Xian Jiaotong University
Priority date: 2022-07-18
Filing date: 2022-07-18
Publication date: 2022-09-06
Anticipated expiration: 2042-07-18
Also published as: CN115016506B; WO2024016457A1

Abstract

The invention discloses a heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance, which comprises the following steps: carrying out system configuration and transportation task management of multi-agent networking cooperative scheduling planning; each intelligent agent which is going to execute the transportation task in the system carries out collaborative path search to obtain a corresponding path list; the intelligent agent carries out motion planning to obtain an execution path; calculating to obtain a driving speed according to the execution path; the intelligent agent performs autonomous obstacle detouring in the operation process; and adjusting the time window. The invention realizes the cooperative solution of conflict between the intelligent agents and the autonomous obstacle avoidance by adopting a time window algorithm and introducing an autonomous obstacle avoidance technology of the intelligent agents, and compared with the prior art, the invention has the advantages of higher efficiency, flexibility and robustness, and more intelligent agents can be accommodated by the system.

Description

Heterogeneous multi-agent network cooperative scheduling planning method based on autonomous obstacle avoidance

技术领域technical field

本发明属于多智能体调度及路径规划技术领域，具体涉及一种基于自主绕障技术的异构型多智能体网联协同调度规划方法，是一种引入智能体自主绕障技术的中心式与分布式融合的异构型多智能体网联协同调度规划方法，通过发掘智能体的自主智能能力，实现高效、灵活、鲁棒且可扩展性强的异构型多智能体调度规划，可用于智慧物流、智慧仓储等复杂场景下对多智能体的调度及路径规划。The invention belongs to the technical field of multi-agent scheduling and path planning, and in particular relates to a heterogeneous multi-agent network-connected collaborative scheduling planning method based on autonomous obstacle avoidance technology. The distributed and integrated heterogeneous multi-agent network-connected collaborative scheduling planning method realizes efficient, flexible, robust and highly scalable heterogeneous multi-agent scheduling planning by exploring the autonomous intelligence capabilities of the agents, which can be used for Scheduling and path planning of multi-agents in complex scenarios such as smart logistics and smart warehousing.

背景技术Background technique

近年来，由于柔性制造系统的出现、定制机器人需求的增加以及中小企业工业自动化需求的增加，移动智能体在制造业、医疗及智能物流等领域发展迅速，得到了广泛的应用。In recent years, due to the emergence of flexible manufacturing systems, the increasing demand for customized robots, and the increasing demand for industrial automation in small and medium-sized enterprises, mobile agents have developed rapidly in the fields of manufacturing, medical care, and intelligent logistics, and have been widely used.

以最大限度地提高生产效率为目的，使用者往往期望多台智能体同时在场景中运行，这对场景中智能体的安全性提出了挑战，保证多台智能体在场景中无冲突地、高效地执行运输任务是多智能体系统需要解决的首要问题。完成这一任务的就是调度规划系统，它需要协调场景中多台智能体的运行，在保证安全性的前提下，尽可能地提升系统的运行效率。In order to maximize production efficiency, users often expect multiple agents to run in the scene at the same time, which poses a challenge to the security of the agents in the scene, ensuring that multiple agents operate in the scene without conflict and efficiently. Performing transportation tasks efficiently is the primary problem that multi-agent systems need to solve. To accomplish this task is the scheduling and planning system, which needs to coordinate the operation of multiple agents in the scene, and improve the operating efficiency of the system as much as possible on the premise of ensuring safety.

目前，现有技术主要采用两种不同架构的调度规划系统，即中心式架构系统和分布式架构系统。分布式架构系统相较于中心式架构系统有着灵活性强、可扩展性强等优势，但在此种架构的系统中，各智能体“各自为政”，这为系统的安全性埋下了隐患，且不利于使用者对场景中智能体的统一管理与控制，所以目前分布式架构的系统在智慧物流、智慧仓储等真实环境中鲜有应用。中心式架构系统目前应用非常广泛，它具有便于控制、易于获得最优解等优势，但现有的中心式架构系统中智能体的智能能力未被充分利用，控制中心需要完成系统中所有的计算任务，且系统中所有智能体需要不断地与控制中心进行信息交换，这造成了现有的中心式架构系统的控制中心计算压力大、网络通信负担大，进一步限制了系统可容纳智能体数目的提升，且此种系统存在着“单点故障”的缺陷，灵活性及鲁棒性差。At present, the prior art mainly adopts two scheduling planning systems with different architectures, namely, a central architecture system and a distributed architecture system. Compared with the centralized architecture system, the distributed architecture system has the advantages of strong flexibility and scalability. However, in this architecture system, each agent "governs its own affairs", which lays a hidden danger for the security of the system. And it is not conducive to the user's unified management and control of the agents in the scene, so the current distributed architecture system is rarely used in real environments such as smart logistics and smart warehousing. The central architecture system is currently widely used. It has the advantages of being easy to control and easy to obtain the optimal solution. However, the intelligent capabilities of the agents in the existing central architecture system are not fully utilized, and the control center needs to complete all calculations in the system. task, and all the agents in the system need to exchange information with the control center continuously, which causes the control center of the existing central architecture system to have high computing pressure and heavy network communication burden, which further limits the number of agents that the system can accommodate. Moreover, this system has the defect of "single point of failure", and the flexibility and robustness are poor.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种基于自主绕障技术的中心式与分布式融合的异构型多智能体网联协同调度规划方法，该方法充分利用了智能体的自主智能能力，结合了中心式架构与分布式架构的优势，实现了一种更加灵活、高效、鲁棒且可容纳智能体数目更多的异构型调度规划方法。The invention provides a heterogeneous multi-agent network-connected collaborative scheduling planning method based on autonomous obstacle circumvention technology, which integrates central and distributed integration. The advantages of the distributed architecture realize a more flexible, efficient, robust and heterogeneous scheduling planning method that can accommodate a larger number of agents.

本发明包括控制中心、智能体和数据共享端，智能体是指应用场景下执行运输任务的智能机器人，控制中心算法部分部署在一台计算机上，数据共享端部署在一台服务器上。其中，由控制中心进行任务分配及协同路径搜索，任务分配采用基础的就近分配逻辑，协同路径搜索基于dijkstra算法实现，本发明同时考虑了路径中重复路段及对向行驶路段为智能体行驶带来的安全隐患，若两智能体的路径中均包含同一段路段，则该路段是两台智能体的重复路段，若两台智能体的路径中均包含同一段路段且其经过该路段的方向相反，则该路段是两台智能体的重复路段和对向行驶路段；智能体根据控制中心路径搜索的结果采用现有的时间窗算法进行运动规划并循迹行驶，在智能体遇到障碍物时，智能体利用自身搭载的现有的局部路径规划算法进行绕障，绕障结束后，执行本发明设计的时间窗检查与调整策略以保障后续运行过程中场景中智能体之间的协同关系；数据共享端用于存放智能体间碰撞避免所需的时间窗数据结构，各智能体均可以独立访问或修改数据共享端中存放的数据。The invention includes a control center, an intelligent body and a data sharing terminal. The intelligent body refers to an intelligent robot that performs transportation tasks in an application scenario. The algorithm part of the control center is deployed on a computer, and the data sharing terminal is deployed on a server. Among them, task assignment and cooperative path search are performed by the control center, the task assignment adopts the basic nearby assignment logic, and the cooperative path search is realized based on the Dijkstra algorithm. If the paths of the two agents contain the same road segment, the road segment is a duplicate road segment of the two agents. If the paths of the two agents both contain the same road segment and the directions passing through the road segment are opposite , then the road section is a repeated road section and an opposite driving section of the two agents; the agent uses the existing time window algorithm to plan the motion according to the result of the path search of the control center and follows the path. When the agent encounters an obstacle , the agent uses the existing local path planning algorithm carried by itself to bypass the obstacle, and after the obstacle is circumvented, the time window check and adjustment strategy designed by the present invention is executed to ensure the collaborative relationship between the agents in the scene in the subsequent operation process; The data sharing terminal is used to store the time window data structure required for collision avoidance between agents, and each agent can independently access or modify the data stored in the data sharing terminal.

本发明方法主要包括五个部分：任务管理部分、协同路径搜索部分、运动规划部分、自主绕障部分及时间窗调整部分；其中任务管理部分和协同路径搜索部分在控制中心实现，而运动规划部分、自主绕障部分及时间窗调整部分在各智能体端分别实现。The method of the invention mainly includes five parts: a task management part, a cooperative path search part, a motion planning part, an autonomous obstacle circumvention part and a time window adjustment part; wherein the task management part and the cooperative path search part are realized in the control center, and the motion planning part , The autonomous obstacle circumvention part and the time window adjustment part are implemented separately at each agent end.

任务管理部分实现动态添加任务和任务分配；协同路径搜索部分基于dijkstra算法实现，同时考虑了路径中重复路段及对向行驶路段为智能体行驶带来的安全隐患，使得本发明方法为智能体规划出的路径更加安全可靠；在运动规划部分中，各智能体根据控制中心协同路径搜索的结果，对其中的每条路径采用时间窗算法来避免与场景中其他智能体的冲突，获得具体到速度层面的执行指令；本发明创新性地将智能体的自主规划引入到了多智能体调度与规划方法及多智能体协同应用系统中，当智能体在行驶过程中遇到障碍物时，智能体采用自主绕障部分的局部路径规划算法进行自主绕障；自主绕障会打破运动规划部分规划好的行驶行为，所以在自主绕障结束后需要由时间窗调整部分进行时间窗检查与调整。The task management part realizes the dynamic addition of tasks and task allocation; the collaborative path search part is realized based on the Dijkstra algorithm, and at the same time, the safety hazards brought by the repeated sections and the opposite driving sections in the path for the driving of the intelligent agent are considered, so that the method of the present invention is an intelligent agent planning. The resulting path is safer and more reliable; in the motion planning part, each agent uses the time window algorithm for each path according to the results of the cooperative path search of the control center to avoid conflicts with other agents in the scene, and obtain specific speed. The present invention innovatively introduces the autonomous planning of the agent into the multi-agent scheduling and planning method and the multi-agent cooperative application system. When the agent encounters an obstacle during driving, the agent adopts the The local path planning algorithm of the autonomous obstacle avoidance part performs autonomous obstacle avoidance; autonomous obstacle avoidance will break the driving behavior planned by the motion planning part, so the time window adjustment part needs to check and adjust the time window after the autonomous obstacle avoidance.

本发明提供的基于自主绕障技术的异构型多智能体网联协同调度规划方法包括如下具体步骤：The heterogeneous multi-agent networked collaborative scheduling planning method based on the autonomous obstacle circumvention technology provided by the present invention includes the following specific steps:

步骤1进行多智能体网联协同调度规划的系统配置和运输任务管理；Step 1: Carry out system configuration and transportation task management of multi-agent network-connected collaborative scheduling planning;

系统包括控制中心、智能体和数据共享端；任务管理部分用于向系统中添加运输任务并进行任务分配。The system includes a control center, an agent and a data sharing terminal; the task management part is used to add transportation tasks to the system and assign tasks.

智能体为场景中执行运输任务的智能机器人；控制中心用于进行任务分配及协同路径搜索；数据共享端用于存放智能体间碰撞避免所需的时间窗数据结构；各智能体均可以独立访问或修改数据共享端中存放的数据。The agent is an intelligent robot that performs transportation tasks in the scene; the control center is used for task assignment and collaborative path search; the data sharing terminal is used to store the time window data structure required for collision avoidance between agents; each agent can be accessed independently Or modify the data stored in the data sharing terminal.

多智能体网联协同调度规划系统配置具体包括控制中心及各智能体的启动、控制中心和各智能体中场景地图的设置及数据共享端时间窗数据结构的初始化。场景地图中包括节点信息和路段信息，节点需根据场景的长度、宽度以固定间距(如场景宽度的10％)设定位置，场景中的一些关键点(如取货点、投货点等)也应被设定为节点，若场景中存在即有的行驶路线，则应在即有的行驶路线上以固定间距设定节点的位置；路段为两相邻节点相邻所形成的线段，多条路段相连形成一条路径。The configuration of the multi-agent networked cooperative scheduling planning system specifically includes the startup of the control center and each agent, the setting of the scene map in the control center and each agent, and the initialization of the time window data structure of the data sharing terminal. The scene map includes node information and road section information. Nodes need to be positioned at fixed intervals (such as 10% of the scene width) according to the length and width of the scene, and some key points in the scene (such as pickup points, delivery points, etc.) It should also be set as a node. If there is an existing driving route in the scene, the position of the node should be set at a fixed interval on the existing driving route; a road segment is a line segment formed by two adjacent nodes. The road segments are connected to form a path.

运输任务可以以地图中任一节点为任务终点，以执行任务的相应智能体的位置为任务起点；具体实施时，可根据任务添加的顺序进行任务分配。The transportation task can take any node in the map as the task end point, and the position of the corresponding agent performing the task as the task start point; during the specific implementation, the task assignment can be carried out according to the order in which the tasks are added.

步骤2协同路径搜索部分——为每一台即将执行任务的智能体进行协同路径搜索，得到相应的路径列表。Step 2: Collaborative path search part - perform a collaborative path search for each agent that is about to perform a task, and obtain a corresponding path list.

基于dijkstra算法获得任务起点与任务终点间所有无环路的路径，存储在路径列表PATH中，同时本发明考虑了重复路段、对向行驶路段为智能体行驶带来的安全隐患，若当前智能体与其他智能体的路径中均包含同一段路段，则该路段是当前智能体的重复路段，若当前智能体与其他智能体的路径中均包含同一段路段且其经过该路段的方向相反，则该路段是当前智能体的重复路段和对向行驶路段；具体实施时，检查路径列表PATH中的每一条路径，设路径列表中的某一路径为path₀，若path₀与其他任一智能体正在执行的路径的重复路段数与path₀上总路段数的比值大于设定阈值(如0.4)或与其他任一智能体正在执行的路径的对向行驶路段数与path₀上总路段数的比值大于设定阈值(如0.3)，则path₀为不可用路径，将path₀从路径列表PATH中移除。Based on the dijkstra algorithm, all loop-free paths between the task starting point and the task ending point are obtained and stored in the path list PATH. At the same time, the present invention considers the safety hazards brought by repeated road sections and opposite driving sections for the driving of the agent. If the current agent If the paths of the current agent and other agents contain the same road segment, the road segment is a duplicate road segment of the current agent. If the paths of the current agent and other agents contain the same road segment and the direction of passing through the road segment is opposite, then This road segment is the current agent’s repeating road segment and the opposite driving road segment; in the specific implementation, check each path in the path list PATH, and set a certain path in the path list as path ₀ , if path ₀ is associated with any other agent The ratio of the number of repeated segments of the path being executed to the total number of segments on path ₀ is greater than the set threshold (such as 0.4) or the ratio of the number of opposite travel segments of the path being executed by any other agent to the total number of segments on path ₀ . If the ratio is greater than the set threshold (such as 0.3), then path ₀ is an unavailable path, and path ₀ is removed from the path list PATH.

步骤3运动规划部分——进行运动规划得到执行路径；Step 3: Motion planning part - execute motion planning to get the execution path;

智能体获得控制中心进行协同路径搜索的结果并采用现有的时间窗算法进行运动规划，对其中的每一条路径进行时间窗插入操作，从中选取能够最早完成任务的一条路径作为执行路径。The agent obtains the result of the cooperative path search performed by the control center and uses the existing time window algorithm for motion planning, performs time window insertion operation on each path, and selects the path that can complete the task earliest as the execution path.

步骤4运动规划部分——根据执行路径计算得到行驶速度；Step 4: Motion planning part - calculating the driving speed according to the execution path;

对于步骤3中获得的执行路径，根据路径上每条路段的长度及时间窗插入结果计算出在该条路段上的行驶速度，并下发给智能体的底盘，智能体按照此速度进行行驶。For the execution path obtained in step 3, according to the length of each road segment on the path and the time window insertion result, the driving speed on the road segment is calculated, and sent to the chassis of the agent, and the agent travels at this speed.

步骤5自主绕障部分——在智能体运行的过程中，若智能体遇到障碍物，则由智能体按照自主绕障部分的局部路径规划算法执行障碍物的绕开操作，具体实施时，局部路径规划算法采用了现有的动态时间窗口算法(Dynamic Window Approach，DWA)。Step 5: Autonomous Obstacle Deviance Part - During the operation of the agent, if the agent encounters an obstacle, the agent will execute the obstacle avoidance operation according to the local path planning algorithm of the autonomous obstacle avoidance part. The local path planning algorithm adopts the existing dynamic time window algorithm (Dynamic Window Approach, DWA).

步骤6时间窗调整部分——由于智能体自主绕障的时间无法确定，所以在智能体自主绕障结束后，需要采用本发明方法设计的时间窗检查与调整策略以保障后续运行过程中场景中智能体之间的协同关系。Step 6: Time window adjustment part - Since the time for the agent to bypass the obstacle autonomously cannot be determined, after the autonomous obstacle avoidance of the agent is completed, the time window inspection and adjustment strategy designed by the method of the present invention needs to be adopted to ensure the subsequent operation in the scene. Collaborative relationship between agents.

通过以上步骤，即可实现多台智能体灵活、高效地在场景中执行任务。Through the above steps, multiple agents can flexibly and efficiently perform tasks in the scene.

与现有技术相比，本发明的有益效果：Compared with the prior art, the beneficial effects of the present invention:

本发明提供一种基于自主绕障技术的中心式与分布式融合的异构型多智能体网联协同调度规划方法，通过采用时间窗算法和引入智能体的自主绕障技术，实现智能体之间冲突的协同解决和自主绕障，本方法相较于传统的中心式架构系统更加高效、灵活、鲁棒且系统可容纳的智能体数目更多。The invention provides a heterogeneous multi-agent network-connected collaborative scheduling planning method based on autonomous obstacle avoidance technology, which integrates central and distributed integration. Compared with the traditional central architecture system, this method is more efficient, flexible, and robust, and the system can accommodate a larger number of agents.

本发明提供的异构型多智能体网联协同调度规划方法具有以下优点：The heterogeneous multi-agent networked cooperative scheduling planning method provided by the present invention has the following advantages:

1)引入了智能体的自主局部路径规划，显著降低了控制中心的计算压力。1) The autonomous local path planning of the agent is introduced, which significantly reduces the computational pressure of the control center.

2)智能体具备自主绕障能力，控制中心无需时刻监控智能体的运行状态，显著降低了网络通信的负担。2) The agent has the ability to bypass obstacles autonomously, and the control center does not need to monitor the running status of the agent at all times, which significantly reduces the burden of network communication.

3)控制中心计算压力及网络通信负担的降低进一步促进了系统可容纳智能体数目的提升。3) The reduction of the control center's computational pressure and network communication burden further promotes the increase in the number of agents that the system can accommodate.

4)系统的运行不再仅仅依赖于控制中心，即消除了传统的中心式架构系统“单点故障”的缺陷，系统鲁棒性更强。4) The operation of the system no longer only depends on the control center, that is, the defect of the "single point of failure" of the traditional centralized architecture system is eliminated, and the system is more robust.

5)智能体具备自主应对突发状况及异常状况的能力，相较于传统的中心式架构系统中由控制中心通过重规划应对突发状况的策略灵活性更强。5) The agent has the ability to autonomously respond to emergencies and abnormal situations, which is more flexible than the traditional central architecture system where the control center responds to emergencies through re-planning.

附图说明Description of drawings

图1是本发明方法具体实施采用的系统架构，包括控制中心、智能体和数据共享端。FIG. 1 is a system architecture adopted for the specific implementation of the method of the present invention, including a control center, an intelligent body and a data sharing terminal.

图2是本发明任务管理部分任务分配方法的流程框图。Fig. 2 is a flow chart of the task assignment method of the task management part of the present invention.

图3是本发明说明具体实施方式时所采用示例的场景示意图。FIG. 3 is a schematic diagram of an example used in explaining the specific implementation of the present invention.

图4是本发明数据共享端时间窗数据结构的一种示例。FIG. 4 is an example of the time window data structure of the data sharing side of the present invention.

图5是本发明协同路径搜索部分进行路径检查的算法流程框图Fig. 5 is the algorithm flow chart of the cooperative path search part of the present invention to check the path

图6本发明运动规划部分进行时间窗插入操作的一种示例。FIG. 6 is an example of the time window insertion operation performed by the motion planning part of the present invention.

图7是本发明局部路径规划部分速度采样空间示意图。FIG. 7 is a schematic diagram of the velocity sampling space of the local path planning part of the present invention.

图8是本发明自主绕障部分智能体自主规划的绕障路径示意图。FIG. 8 is a schematic diagram of an obstacle bypass path autonomously planned by a part of the intelligent body for autonomous obstacle bypass according to the present invention.

具体实施方式Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂，下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

本发明提供一种基于自主绕障技术的中心式与分布式融合的异构型多智能体网联协同调度规划方法，实现多台智能体在智慧物流、智慧仓储等场景下高效、灵活、无冲突地执行任务，本发明方法具体实施采用的系统架构参考图1所示。本发明方法主要包括五个部分：任务管理部分、协同路径搜索部分、运动规划部分、自主绕障部分及时间窗调整部分；其中协同路径搜索部分充分考虑了重复路段、对向行驶路段为智能体行驶带来的安全隐患，使得系统为智能体规划出的路径更加安全可靠；此外，本发明方法是一种中心式架构与分布式架构融合的异构型方法，充分利用了智能体的自主智能能力，结合了中心式架构与分布式架构的技术优势，相较于传统的中心式架构系统，灵活性、鲁棒性、可扩展性均得到了较大的提升。The present invention provides a heterogeneous multi-agent network-connected collaborative scheduling planning method based on autonomous obstacle bypassing technology, which integrates central and distributed integration, and realizes efficient, flexible, and seamless operation of multiple agents in scenarios such as smart logistics and smart warehousing. Conflicting tasks are performed, and the system architecture adopted for the specific implementation of the method of the present invention is shown in FIG. 1 . The method of the invention mainly includes five parts: a task management part, a cooperative path search part, a motion planning part, an autonomous obstacle avoidance part and a time window adjustment part; wherein the cooperative path search part fully considers the repeated road sections and the opposite driving section as the intelligent body The potential safety hazard brought by driving makes the path planned by the system for the agent more safe and reliable; in addition, the method of the present invention is a heterogeneous method integrating a central architecture and a distributed architecture, making full use of the autonomous intelligence of the agent Compared with the traditional centralized architecture system, the flexibility, robustness and scalability have been greatly improved.

任务管理部分实现动态添加任务和任务分配。任务分配的策略流程参考图2所示。协同路径搜索部分基于dijkstra算法实现，同时考虑了路径中重复路段、对向行驶路段为智能体行驶带来的安全隐患，使得系统为智能体规划出的路径更加安全可靠。运动规划部分实现智能体之间冲突的避免，由各智能体分别访问数据共享端并进行时间窗插入操作，获得具体到速度层面的执行指令。自主绕障部分实现智能体在遇到障碍物时的自主绕开，即利用局部路径规划算法规划出绕障路径。时间窗调整部分实现在自主绕障结束后的时间窗检查与调整，以保障后续运行过程中场景中智能体之间的协同关系。The task management part realizes dynamic addition of tasks and assignment of tasks. The strategy flow of task assignment is shown in Figure 2. The collaborative path search is partially implemented based on the Dijkstra algorithm, and at the same time, the safety hazards brought by the repeated sections and opposite driving sections in the path are considered for the driving of the agent, so that the path planned by the system for the agent is safer and more reliable. The motion planning part realizes the avoidance of conflicts between agents. Each agent accesses the data sharing terminal and performs time window insertion operations to obtain execution instructions specific to the speed level. The autonomous obstacle avoidance part realizes the autonomous avoidance of the agent when it encounters an obstacle, that is, a local path planning algorithm is used to plan a route around the obstacle. The time window adjustment part realizes the inspection and adjustment of the time window after the autonomous obstacle avoidance is completed, so as to ensure the cooperative relationship between the agents in the scene during the subsequent operation.

为便于理解，在介绍本发明方法的具体实施方式时，将以图3所示的场景为例，由图可见，场景中共存在4台智能体，分别为AGENT₁、AGENT₂、AGENT₃及AGENT₄，其中只有AGENT₃处于空闲状态。For ease of understanding, when introducing the specific implementation of the method of the present invention, the scene shown in FIG. 3 will be taken as an example. As can be seen from the figure, there are 4 agents in the scene, namely AGENT ₁ , AGENT ₂ , AGENT ₃ and AGENT ₄ , of which only AGENT ₃ is idle.

本发明方法的具体实施包括如下步骤：The specific implementation of the method of the present invention comprises the following steps:

步骤1进行多智能体网联协同调度规划系统配置，系统包括控制中心、智能体和数据共享端；任务管理部分——向系统中添加任务并进行任务分配。Step 1: Configure the multi-agent networked collaborative scheduling planning system, the system includes a control center, an agent and a data sharing terminal; task management part - adding tasks to the system and assigning tasks.

多智能体网联协同调度规划系统配置具体包括控制中心及各智能体的启动、控制中心和各智能体中场景地图的设置及数据共享端时间窗数据结构的初始化。控制中心及各智能体的启动即启动控部署控制中心算法部分的计算机和各台智能体；控制中心和各智能体中场景地图的设置是指向控制中心和各智能体输入相同的地图文件，保证控制中心和各智能体使用相同的场景地图；时间窗数据结构为地图中每一条路段维护一个时间窗向量，时间窗向量中的每一个元素对应着一台智能体占用该路段的一段时间，初始时时间窗数据结构为空，图4所示为系统运行过程中的时间窗数据结构示例。The configuration of the multi-agent networked cooperative scheduling planning system specifically includes the startup of the control center and each agent, the setting of the scene map in the control center and each agent, and the initialization of the time window data structure of the data sharing terminal. The startup of the control center and each agent starts the control and deployment of the computer and each agent in the algorithm part of the control center; the setting of the scene map in the control center and each agent refers to the input of the same map file to the control center and each agent to ensure that The control center and each agent use the same scene map; the time window data structure maintains a time window vector for each road segment in the map, and each element in the time window vector corresponds to a period of time that an agent occupies the road segment. The time and time window data structure is empty. Figure 4 shows an example of the time window data structure during the system operation.

向系统中添加以地图中任一节点为终点的任务，并根据任务添加的顺序进行任务分配，参考图2所示，具体的任务分配策略为：Add a task with any node in the map as the end point to the system, and assign tasks according to the order in which the tasks are added. Referring to Figure 2, the specific task assignment strategy is:

1)若当前系统中无空闲智能体，则该任务暂缓分配；1) If there is no idle agent in the current system, the assignment of the task is suspended;

2)若当前系统中有空闲智能体，则按照就近原则将该任务分配给当前位置距离任务终点最近的智能体。2) If there is an idle agent in the current system, the task is assigned to the agent whose current position is closest to the end point of the task according to the principle of proximity.

以图3所示场景为例，若添加了终点为节点21的任务，根据任务分配策略，该任务将被分配给AGENT₃执行。Taking the scenario shown in FIG. 3 as an example, if a task whose end point is node 21 is added, the task will be assigned to AGENT ₃ for execution according to the task assignment policy.

基础的dijkstra算法的执行步骤如下：The execution steps of the basic Dijkstra algorithm are as follows:

1)获取任务起点s和任务终点e；1) Obtain the task start point s and the task end point e;

2)定义两个集合：S和U。S用来记录地图中所有已求出最短路径的节点，U用来记录还未求出最短路径的节点。定义两个向量：D和P，向量D中以节点编号顺序存储起点s到地图中其它节点的最短距离，向量P中以节点编号顺序存储任务起点s到地图中其它某一节点z的最短路由中节点z的上一节点。初始时，S中只有任务起点s，U中包含除任务起点s之外的地图中所有节点；D中各元素的值是任务起点s到地图中其它各节点的距离(若U中某节点v与任务起点s不相邻，则D中对应的元素值为∞)；2) Define two sets: S and U. S is used to record all nodes in the map for which the shortest path has been found, and U is used to record the nodes for which the shortest path has not yet been found. Define two vectors: D and P. The vector D stores the shortest distance from the starting point s to other nodes in the map in the node number order, and the vector P stores the shortest route from the task starting point s to another node z in the map in the node number order. The previous node of node z in the middle. Initially, there is only the task starting point s in S, and U contains all the nodes in the map except the task starting point s; the value of each element in D is the distance from the task starting point s to other nodes in the map (if a node v in U If it is not adjacent to the task starting point s, the corresponding element value in D is ∞);

3)从U中选出距离任务起点s最近的节点k，并将节点k加入到S中，同时，从U中移除节点k；3) Select the node k closest to the task starting point s from U, add node k to S, and remove node k from U at the same time;

4)更新U中各个节点到起点s的距离(考虑以节点k为中间点起点s到U中某个节点v的距离可能减小)，同时更新向量D和向量P；4) Update the distance from each node in U to the starting point s (considering that the distance from the starting point s of the node k as the intermediate point to a node v in U may decrease), and update the vector D and the vector P at the same time;

5)重复步骤3)和步骤4)，直到遍历完地图中所有节点；5) Repeat steps 3) and 4) until all nodes in the map are traversed;

6)从终点e开始，反向检索向量P，获取起点s到终点e的最短路径。6) Starting from the end point e, reversely retrieve the vector P to obtain the shortest path from the start point s to the end point e.

设系统中共有A台智能体，除当前智能体外其他A-1台智能体中第i台智能体目前正在执行的路径长度为l_i(即路径上共有l_i个节点)，则其他A-1台智能体中第i台智能体正在执行的路径可表示为

本发明基于dijkstra算法进行协同路径搜索的步骤如下：Assuming that there are A agents in the system, except for the current agent, the _i -th agent in the other A-1 agents is currently executing a path length of li (that is, there are _li nodes on the path), then the other A- The path being executed by the ith agent in one agent can be expressed as

The steps of the present invention based on the Dijkstra algorithm to search for a collaborative path are as follows:

1)采用基础的dijkstra算法求出一条任务起点和任务终点的两点之间的最短路径：s→k₁→k₂→…→k_n→e；其中k₁～k_n均为路径上的节点；1) Use the basic Dijkstra algorithm to find the shortest path between the two points of a task starting point and a task ending point: s→k ₁ →k ₂ →…→k _n →e; where k ₁ ～k _n are all on the path node;

2)令k₀＝s，k_n+1＝e，对于路径上除任务终点e外的每一个节点k_m(m＝0，1，2…，n)，设k_m共有r个相邻节点，遍历k_m所相邻的每一个节点v_q(q＝1，2，…，r)；若v_q≠k_m+1，以v_q为任务起点，以e为任务终点，采用基础的dijkstra算法求出v_q与e之间的最短路径：v_q→n₁→n₂→…→n_p→e，n₁～n_p均为路径上的节点，则得到另一条s与e之间的路径：k₀→…→k_m→v_q→…→e，将其加入路径列表PATH。2) Let k ₀ =s, k _n+1 =e, for each node k _m (m=0, 1, 2..., n) on the path except the task end point e, let k _m have r adjacent nodes in total. Node, traverse each node _v _q (q=1, 2,..., r) adjacent to km; if v _q ≠km ₊₁ , take v _q as the starting point of the task, and take e as the end of the task, using the basic Find the shortest path between v _q and e using the Dijkstra algorithm: v _q →n ₁ →n ₂ →…→n _p →e, where n ₁ ～n _p are all nodes on the path, then another s and e are obtained. The path between: k ₀ →…→k _m →v _q →…→e, add it to the path list PATH.

3)步骤2)执行结束后，路径列表PATH中存储了任务起点s到任务终点e之间所有无环路路径，设路径列表PATH中共有S条路径，对于其中的每一条路径PATH_z(z＝1，2，…，S)，分别使用β_z和γ_z来记录路径PATH_z与其他智能体正在执行的路径的重复路段及对向行驶路段占路径PATH_z总路段数的比值的最大值。算法流程参考图5所示。3) After the execution of step 2), all loop-free paths between the task start point s and the task end point e are stored in the path list PATH. Suppose there are S paths in the path list PATH, and for each path PATH _z (z = 1, 2, ..., S), respectively use β _z and γ _z to record the maximum value of the ratio of the repeated sections of the path PATH _z and the paths being executed by other agents and the opposite driving sections to the total number of sections of the path PATH _z . The algorithm flow is shown in Figure 5.

设路径PATH_z的长度为l(即路径PATH_z上共有l个节点)，对于h＝1，2，…，l-1，检查系统中除当前智能体外其他A-1台智能体路径N_i(i＝1，2，…A-1)，则对于w＝1，2，…，l_i-1，l_i为路径N_i的长度(即路径上共有l_i个节点)；从地图信息中分别查取由节点

和节点

连接而成的路段记为arc_h，及节点

和节点

连接而成的路段记为arc_w，若arc_h＝arc_w，则路径PATH_z与路径N_i的重复路段数累加1，若arc_h＝arc_w且

则路径PATH_z与路径N_i的对向行驶路段数也累加1。对于其他智能体当前执行路径的集合N，求得路径PATH_z与其中A-1条路径重复路段、对向行驶路段的最大值，进而求得β_z与γ_z，若β_z≥0.4或γ_z≥0.3，则认为路径PATH_z与当前系统中正在执行任务的其他智能体有较大的产生冲突的风险，则将路径PATH_z从路径列表中移除。Let the length of the path PATH _z be l (that is, there are l nodes on the path PATH _z ), for h=1, 2, ..., l-1, check the paths N _i of other A-1 agents in the system except the current agent ( _i =1, 2,...A-1), then for _w =1, 2,..., li -1, _li is the length of the path Ni (that is, there are _li nodes on the path); from the map information Query by node respectively

and node

The connected road segment is recorded as arc _h , and the node

and node

The connected road section is denoted as arc _w , if arc _h =arc _w , then the number of repeated road sections of path PATH _z and path _Ni is accumulated by 1, if arc _h =arc _w and

Then, the number of opposite travel sections of the path PATH _z and the path N _i is also accumulated by 1. For the set N of current execution paths of other agents, obtain the maximum value of the path PATH _z and the repeated sections and opposite driving sections of the A-1 paths, and then obtain β _z and γ _z , if β _z ≥ 0.4 or γ _z ≥ 0.3, it is considered that the path PATH _z has a greater risk of conflict with other agents that are executing tasks in the current system, and the path PATH _z is removed from the path list.

在图3所示的场景中，假设AGENT₁、AGENT₂及AGENT₄当前的执行路径分别为：10→3→24→12→11→18→19，19→15→13→14及16→21→26→23→24→25，为AGENT₃规划出的路径列表PATH_E包含以下3条路径：In the scenario shown in Figure 3, it is assumed that the current execution paths of AGENT ₁ , AGENT ₂ and AGENT ₄ are: 10→3→24→12→11→18→19, 19→15→13→14 and 16→21 →26→23→24→25, the path list PATH _E planned for AGENT ₃ includes the following three paths:

①22→24→3→2→9→16→21①22→24→3→2→9→16→21

②22→18→14→13→12→19→24→23→26→21②22→18→14→13→12→19→24→23→26→21

③22→24→12→11→18→23→26→21③22→24→12→11→18→23→26→21

对这3条路径进行路经检查后，路径列表PATH_E中仅存在一条路径可行，即路径①，路径②被移除的原因是其与AGENT₄的路径之间的对向行驶路段与路径总路段数的比值大于0.3，路径③被移除的原因是其与AGENT₁的路径之间的重复路段与路径总路段数的比值大于0.4。After the path inspection of these three paths, there is only one feasible path in the path list PATH _E , that is, path ① and path ② are removed because the opposite driving section and the total path between them and the path of AGENT ₄ are. The ratio of the number of road segments is greater than 0.3, and the reason for the removal of route ③ is that the ratio of the duplicate road segments between it and the route of AGENT ₁ to the total number of road segments of the route is greater than 0.4.

步骤3运动规划部分——智能体获得控制中心进行协同路径搜索的结果(步骤2得到的路径列表)并采用现有的时间窗算法进行运动规划，对其中的每一条路径进行时间窗插入操作，从中选取能够最早完成任务的一条路径作为执行路径。Step 3: Motion planning part - the agent obtains the result of the collaborative path search performed by the control center (the path list obtained in step 2) and uses the existing time window algorithm for motion planning, and performs a time window insertion operation for each path. The path that can complete the task earliest is selected as the execution path.

控制中心将协同路径搜索的结果发送给智能体后，智能体首先需要访问数据共享端获取当前时间窗结构，时间窗数据结构为地图中每一条路段维护一个时间窗向量，时间窗向量中的每一个元素对应着一台智能体占用该路段的一段时间，而后对于智能体收到的每一条路径，执行如下操作：After the control center sends the results of the collaborative path search to the agent, the agent first needs to access the data sharing terminal to obtain the current time window structure. The time window data structure maintains a time window vector for each road segment in the map. An element corresponds to a period of time that an agent occupies the road segment, and then for each path received by the agent, the following operations are performed:

对于每条路径上的每一段路段α_m(m＝1，2，…，n)：For each segment α _m (m=1, 2, . . . , n) on each path:

1)以最短长度的时间窗(最短时间窗长度为路段a_m的长度与智能体最大行驶速度的比值)进行插入。寻找满足以下两个条件的插入位置：1) Insert the time window with the shortest length (the length of the shortest time window is the ratio of the length of the road segment a _m to the maximum driving speed of the agent). Finds the insertion position that satisfies both of the following conditions:

①在路段a_m的时间窗向量中，若某个位置的后一个时间段的起始时间与该位置的前一个时间段的终止时间之差足以容纳此最短长度的时间窗的长度，则该位置可作为插入位置。① In the time window vector of road segment a _m , if the difference between the start time of the next time period of a certain position and the end time of the previous time period of the position is enough to accommodate the length of the time window of the shortest length, then the A position can be used as an insertion position.

②若m≠1，进入此段路段的时间应在智能体释放其路径上的上一段路段的时间之后。②If m≠1, the time to enter this road segment should be after the time when the agent releases the previous road segment on its path.

2)若m≠1，判断智能体释放其路径上的上一段路段的时间与进入此段路段的时间是否相等。若不相等，则延长上一段路段的释放时间，使其等于此段路段的进入时间。2) If m≠1, judge whether the time for the agent to release the previous section on its path is equal to the time for entering this section. If they are not equal, extend the release time of the previous section to make it equal to the entry time of this section.

3)检查时间窗之间是否存在重叠。若不重叠，插入成功，继续对下一段路段进行操作。若重叠，返回路径上的上一段路段，并返回到步骤1)。3) Check for overlap between time windows. If there is no overlap, the insertion is successful, and the operation is continued to the next segment. If it overlaps, go back to the previous segment on the route and go back to step 1).

对一条路径进行时间窗插入操作具体是，一条路径包含多个路段a_m，对路径上的所有路段在时间窗向量中找到满足条件的插入位置后，这条路径的时间窗插入操作成功。The time window insertion operation on a path is specifically, a path contains multiple road _segments am , and the time window insertion operation of this path is successful after finding the insertion position that satisfies the condition in the time window vector for all the road segments on the path.

对一条路径进行时间窗插入操作成功后，智能体释放该条路径上最后一段路段的时间即为智能体采用该条路径执行任务时预计完成任务的时间。对智能体收到的每一条路径均进行时间窗插入操作，而后选取预计能够最早完成任务的一条路径作为智能体执行任务的最终路径。After the time window insertion operation on a path is successful, the time when the agent releases the last segment on the path is the estimated time when the agent uses the path to perform the task. The time window insertion operation is performed on each path received by the agent, and then a path that is expected to complete the task at the earliest is selected as the final path for the agent to perform the task.

通过对智能体收到的每一条路径进行时间窗插入操作，选取出最终的执行路径后，智能体需要向数据共享端发送消息，将选取的执行路径的时间窗插入结果向数据共享端同步更新。By performing the time window insertion operation on each path received by the agent, after selecting the final execution path, the agent needs to send a message to the data sharing side, and update the time window insertion result of the selected execution path to the data sharing side synchronously .

仍以图3所示场景为例，在步骤2中的协同路径搜索部分求出AGENT₃只有一条候选路径，即22→24→3→2→9→16→21，设路径上的路段依次为a₉→a₁₆→α₇→α₁→a₁₇→a₂₂，设AGENT₃收到控制中心发送的候选路径后，在进行运动规划(也即进行时间窗插入操作)前，上述路段的时间窗数据结构为图6中(1)所示，则进行运动规划后，上述路段的时间窗数据结构如图6中(2)所示。Still taking the scene shown in Figure 3 as an example, in the collaborative path search part in step 2, it is found that there is only one candidate path for AGENT ₃ , that is, 22→24→3→2→9→16→21, and the sections on the path are set as follows: a ₉ →a ₁₆ →α ₇ →α ₁ →a ₁₇ →a ₂₂ , suppose that after AGENT ₃ receives the candidate path sent by the control center, before performing motion planning (that is, performing the time window insertion operation), the time of the above road segment The window data structure is shown in (1) in FIG. 6 , and after the motion planning is performed, the time window data structure of the above road section is shown in (2) in FIG. 6 .

步骤4运动规划部分——对于步骤3中获得的执行路径，根据路径上每条路段的长度及时间窗插入结果计算出在该条路段上的行驶速度，并下发给智能体的底盘，智能体按照此速度进行行驶。Step 4: Motion planning part - for the execution path obtained in step 3, according to the length of each road section on the path and the time window insertion result to calculate the driving speed on this road section, and send it to the chassis of the agent, intelligent The body travels at this speed.

智能体根据当前时间窗结构完成时间窗插入操作后，即可获得执行路径上各段路段的长度及应在该路段上的行驶时间，进一步计算出智能体在执行路径中各段路段上应行驶的速度，将此速度下发给智能体底盘执行机构即可实现智能体循迹行驶。After the agent completes the time window insertion operation according to the current time window structure, it can obtain the length of each section on the execution path and the travel time on the section, and further calculate that the agent should travel on each section in the execution path. The speed of the intelligent body can be sent to the chassis actuator of the intelligent body to realize the tracking driving of the intelligent body.

步骤5自主绕障部分——在智能体遇到障碍物时，智能体利用局部路径规划算法进行自主绕障。Step 5: The part of autonomous obstacle avoidance - when the agent encounters an obstacle, the agent uses the local path planning algorithm to autonomously avoid the obstacle.

当智能体在行驶过程中前方遇到障碍物时，不同于传统的中心式架构系统中仅依赖于控制中心进行重规划的策略，本发明方法中的智能体将采用局部路径规划算法进行自主绕障，自主绕障的目标点为智能体当前所在节点在运行路径上的下一个节点，特别地，当运行路径上的下一个节点为路径的最后一个节点时，自主绕障的目标点为任务终点。在实施时，本发明方法具体采用了现有的动态时间窗口算法(Dynamic Window Approach，DWA)，这是一种基于有限观测范围的动态决策类算法，该算法共包含三个步骤：When the agent encounters an obstacle in front of the driving process, different from the traditional central architecture system that only relies on the control center for re-planning, the agent in the method of the present invention will use the local path planning algorithm to autonomously go around. The target point of autonomous obstacle avoidance is the next node on the running path of the node where the agent is currently located. In particular, when the next node on the running path is the last node of the path, the target point of autonomous obstacle avoidance is the task end. During implementation, the method of the present invention specifically adopts an existing dynamic time window algorithm (Dynamic Window Approach, DWA), which is a dynamic decision-making algorithm based on a limited observation range, and the algorithm comprises three steps:

1)速度采样1) Speed sampling

智能体的速度采样空间主要受以下几个因素限制：智能体本身存在的最大速度及最小速度限制；智能体存在的最大加速度、最大减速度的限制，参考图7所示，这导致智能体在下一个前向模拟周期内所能达到的速度被限制在一个动态时间窗口内。The speed sampling space of the agent is mainly limited by the following factors: the maximum speed and the minimum speed limit of the agent itself; the limit of the maximum acceleration and maximum deceleration of the agent, as shown in Figure 7, which leads to the agent in the following The speed that can be achieved in one forward simulation cycle is limited to a dynamic time window.

2)轨迹评分2) Trajectory scoring

在采样的速度组中，有若干组轨迹均是可行的，因此采用评价函数为每条轨迹进行评价，采用的评价函数如下：In the sampled velocity group, several groups of trajectories are feasible, so the evaluation function is used to evaluate each trajectory. The evaluation function used is as follows:

H(u，ω)＝α·goal(v，ω)+β·dist(v，ω)+γ·path(v，ω) (4.5)H(u,ω)=α·goal(v,ω)+β·dist(v,ω)+γ·path(v,ω) (4.5)

其中，goal(v，ω)是目标评价函数，用来评价机器人在当前设定的采样速度下，达到模拟轨迹末端时的位置与目标点的位置及到达模拟轨迹末端时的朝向角与目标方位角之间的差距，差距越大，评分越低；dist(v，ω)是距离评价函数，用来评价轨迹上与最近障碍物之间的距离，与最近障碍物之间的距离越小，此值越大；path(v，ω)为路径评价函数，用来评价采样轨迹上的点距离全局路径的最近距离，此评价函数主要是用来引导智能体尽可能地按照全局路径搜索的结果行进，以更早地到达目标点。Among them, goal(v, ω) is the target evaluation function, which is used to evaluate the position of the robot when it reaches the end of the simulated trajectory and the position of the target point, and the orientation angle and target orientation when it reaches the end of the simulated trajectory under the currently set sampling speed. The gap between the angles, the larger the gap, the lower the score; dist(v, ω) is the distance evaluation function, which is used to evaluate the distance between the trajectory and the nearest obstacle, and the smaller the distance to the nearest obstacle, The larger this value is; path(v, ω) is the path evaluation function, which is used to evaluate the closest distance between the points on the sampling trajectory and the global path. This evaluation function is mainly used to guide the agent to search according to the global path as much as possible. Proceed to reach the target point earlier.

需要注意的是，由于goal(v，ω)、dist(v，ω)、path(v，ω)等3个评价函数的参量不同，在计算最终的评价值H(v，ω)前，需要将3部分的评价函数值进行归一化处理，而后分别与对应的系数α、β、γ相乘后求和，即可得到该组速度在下一段前向模拟周期内轨迹的评分，从所有采样轨迹中选取评分最高的一组速度作为下一周期内智能体的执行速度。It should be noted that since the parameters of the three evaluation functions such as goal(v, ω), dist(v, ω), and path(v, ω) are different, before calculating the final evaluation value H(v, ω), it is necessary to The evaluation function values of the three parts are normalized, and then multiplied by the corresponding coefficients α, β, and γ, respectively, and then summed to obtain the score of the trajectory of this group of speeds in the next forward simulation period. In the trajectory, a set of speeds with the highest score is selected as the execution speed of the agent in the next cycle.

3)速度发布3) Speed release

在速度空间中采样多组速度并模拟出多组速度在下一前向模拟周期的轨迹后，对所有轨迹进行评分并选取出评分最高的轨迹，而后将评分最高的轨迹对应的速度下发给智能体，智能体在下一周期内按照此速度进行行驶，重复上述过程直至智能体到达目标点。After sampling multiple sets of velocities in the velocity space and simulating the trajectories of multiple sets of velocities in the next forward simulation cycle, all trajectories are scored and the trajectory with the highest score is selected, and then the velocity corresponding to the trajectory with the highest score is sent to the intelligent The agent will drive at this speed in the next cycle, and the above process will be repeated until the agent reaches the target point.

步骤6时间窗调整部分-—在智能体绕障结束后，采用本发明方法设计的时间窗检查与调整策略以保障后续运行过程中场景中智能体之间的协同关系。。Step 6: Time window adjustment part - after the agent has completed the obstacle avoidance, the time window checking and adjustment strategy designed by the method of the present invention is adopted to ensure the cooperative relationship between the agents in the scene in the subsequent running process. .

由于智能体自主绕障是一项较为耗时的操作且所需时间无法提前预估，若智能体自主绕障结束后还未到达任务终点，则在绕障结束后需要检查绕障所消耗的时间，若所消耗的时间超过了时间窗保护时间的一半，则需要进行时间窗的调整。设自主绕障所耗费的时间为δ，时间窗保护时间为w(时间窗保护时间是指若两台智能体均需要经过一条路段，它们的时间窗之间应最少相差出的时间)，具体策略如下：Since the autonomous obstacle avoidance of the agent is a time-consuming operation and the required time cannot be estimated in advance, if the agent has not reached the end of the task after the autonomous obstacle avoidance, it is necessary to check the time consumed by the obstacle after the obstacle avoidance. If the time consumed exceeds half of the time window protection time, the time window needs to be adjusted. Let the time spent on autonomously circumventing obstacles be δ, and the time window protection time is w (the time window protection time refers to the minimum time difference between their time windows if two agents both need to pass through a road section). The strategy is as follows:

1)

1)

在此种情况下，自主绕障所消耗的时间小于或等于时间窗保护时间的一半，在后续的行驶中智能体可通过调整行驶速度来弥补时间的消耗，故无需进行时间窗的调整。In this case, the time consumed by autonomous obstacle avoidance is less than or equal to half of the time window protection time. In the subsequent driving, the agent can make up for the time consumption by adjusting the driving speed, so there is no need to adjust the time window.

2)

2)

在此种情况下，自主绕障所消耗的时间超过了时间窗保护时间的一半，若仅依靠智能体在后续的行驶中通过调整速度来弥补时间误差将存在较大的冲突的风险，所以此时需要进行时间窗的调整并重新进行运动规划。In this case, the time consumed by autonomous obstacle avoidance exceeds half of the time window protection time. If only relying on the agent to compensate for the time error by adjusting the speed in the subsequent driving, there will be a greater risk of conflict, so this It is necessary to adjust the time window and re-plan the motion.

时间窗调整并重新进行运动规划的具体做法为：The specific method of adjusting the time window and re-planning the motion is as follows:

1)访问数据共享端，清除当前智能体的时间窗数据。1) Access the data sharing terminal and clear the time window data of the current agent.

2)获得清除当前智能体时间窗数据后的时间窗结构。2) Obtain the time window structure after clearing the current agent time window data.

3)根据智能体当前位置，转到步骤3确定智能体后续的行驶路径并重新进行运动规划。3) According to the current position of the agent, go to step 3 to determine the subsequent driving path of the agent and re-plan the motion.

仍以图3所示场景为例，当AGENT₃行驶至节点24所在位置时，检测到前方行驶路径上存在障碍物，智能体随即采用步骤5中的局部路径规划算法进行自主绕障，图8为AGENT₃可能规划出的一种绕障路径；AGENT₃当绕障结束到达节点3所在位置时，若绕障额外消耗了5.2s，即δ＝5.2，设时间窗保护时间为8s，即w＝8，由于

所以接下来AGENT₃应清除当前时间窗数据结构中自身的时间窗，而后对后续行驶路径(3→2→9→16→21)重新进行运动规划。Still taking the scene shown in Figure 3 as an example, when AGENT ₃ travels to the location of node 24, it detects that there is an obstacle on the driving path ahead, and the agent uses the local path planning algorithm in step 5 to autonomously circumvent the obstacle, as shown in Figure 8 It is a kind of obstacle bypass path that AGENT ₃ may plan; when AGENT ₃ reaches the location of node 3 after the obstacle bypass, if the obstacle bypass consumes an additional 5.2s, that is, δ=5.2, set the time window protection time to 8s, that is, w = 8, since

Therefore, AGENT ₃ should clear its own time window in the current time window data structure, and then perform motion planning again for the subsequent travel path (3→2→9→16→21).

需要注意的是，公布实施例的目的在于帮助进一步理解本发明，但是本领域的技术人员可以理解：在不脱离本发明及所附权利要求的精神和范围内，各种替换和修改都是可能的。因此，本发明不应局限于实施例所公开的内容，本发明要求保护的范围以权利要求书界定的范围为准。It should be noted that the purpose of publishing the embodiments is to help further understanding of the present invention, but those skilled in the art can understand that various replacements and modifications are possible without departing from the spirit and scope of the present invention and the appended claims of. Therefore, the present invention should not be limited to the contents disclosed in the embodiments, and the scope of protection of the present invention shall be subject to the scope defined by the claims.

Claims

1. A heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance is characterized by comprising the following steps:

step 1: carrying out system configuration and transportation task management of multi-agent networking cooperative scheduling planning;

the system comprises a control center, an intelligent agent and a data sharing end; the intelligent agent is an intelligent robot for executing a transportation task in a scene; the control center is used for performing task allocation and collaborative path search; the data sharing end is used for storing a time window data structure required by collision avoidance between the intelligent agents; each agent can independently access or modify the data stored in the data sharing end;

the system configuration of the multi-agent networking cooperative scheduling planning comprises the following steps: starting a control center and each intelligent agent, setting a scene map in the control center and each intelligent agent and initializing a time window data structure of a data sharing end;

the scene map comprises nodes and road sections; setting the key points of the scene as nodes; the key points of the scene comprise a goods picking point and a goods throwing point; in addition, the positions of the nodes are set at fixed intervals according to the length and the width of the scene; if the existing driving route exists in the scene, setting the positions of the nodes at fixed intervals on the existing driving route; the road sections of the scene are line sections formed by two adjacent nodes, and a plurality of road sections are connected to form a path;

the transportation task management is to point to a system to add a transportation task and distribute the task; the transportation task takes any node in the scene map as a task end point and takes the position of a corresponding agent executing the task as a task starting point;

step 2: each intelligent agent which is about to execute the transportation task in the system scene carries out collaborative path search to obtain a corresponding path list; the method comprises the following steps:

21) and (3) solving the shortest path between a task starting point and a task end point, and recording as: s → k ₁ →k ₂ →…→k _n → e; wherein s is a task starting point; e is a task end point; k is a radical of ₁ ～k _n All nodes are nodes on the path;

22) let k ₀ ＝s，k _n+1 E, for each point k on the path except the task end point e _m Where m is 0,1,2 …, n, k _m Total r neighboring nodes, traverse k _m Each adjacent node v _q Q is 1,2, …, r; if v is _q ≠k _m+1 In v with _q As a task starting point, and e as a task end point, v is obtained _q Shortest path to e: v. of _q →n ₁ →n ₂ →…→n _p →e，n ₁ ～n _p If all the nodes are nodes on the path, obtaining another path between s and e: k is a radical of ₀ →…→k _m →v _q → … → e, add it to the path list;

23) step 22), after the execution is finished, all loop-free paths from the task starting point s to the task ending point e are stored in the path list; for each path, calculating and recording the maximum value of the ratio of repeated road sections and opposite driving road sections of the path and the paths being executed by other agents to the total number of the road sections of the path;

the repeated section means that: if the paths of the current agent and other agents contain the same section of road, the road is the repeated section of the current agent;

the repeated section and the opposite direction driving section refer to: if the paths of the current agent and other agents comprise the same section of road and the direction of the road is opposite, the road is the repeated road and the opposite driving road of the current agent;

the method comprises the following steps:

231) let the PATH list PATH have S PATHs in total, and for each PATH _z 1,2, …, S, respectively, beta _z And gamma _z To record the PATH _z PATH for overlapping road section and opposite driving road section of PATH being executed by other agent _z The maximum value of the ratio of the total number of stages;

232) path setting PATH _z Has a length of l, i.e. the PATH _z The number of the nodes is l; for h-1, 2, …, l-1, a-1 agent paths N in the examination system in addition to the current agent _i I 1,2, … a-1, then w 1,2, …, l _i -1，l _i Is path N _i Respectively finding the route nodes from the map

And node

The link formed by the connection is denoted arc _h And a routing node

And node

The road section formed by the connection is denoted as arc _w ；

233) If arc _h ＝arc _w Then PATH _z And path N _i 1 is accumulated in the number of the repeated sections; if arc _h ＝arc _w And is provided with

Then PATH _z And path N _i The number of opposite driving sections is also accumulated to be 1;

234) solving PATH for the set N of current execution PATHs of other agents _z Calculating beta according to the maximum value of the repeated road section of the A-1 paths and the maximum value of the opposite driving road section _z And gamma _z (ii) a For beta is _z And gamma _z Respectively setting threshold values; if beta is _z Exceeding the corresponding threshold value or gamma _z If the corresponding threshold value is exceeded, the PATH PATH is set _z Removing from the path list;

and step 3: the intelligent agent carries out motion planning to obtain an execution path;

according to a path list result obtained by collaborative path search, performing motion planning by adopting a time window algorithm, performing time window insertion operation on each path, and selecting a path capable of completing a task earliest as an execution path;

and 4, step 4: calculating to obtain a driving speed according to the execution path;

calculating the running speed on each road section according to the length of each road section on the execution path obtained in the step 3 and the time window insertion result, and sending the running speed to a chassis of the intelligent agent, wherein the intelligent agent runs according to the speed;

and 5: the intelligent agent carries out autonomous obstacle avoidance in the operation process: if the intelligent agent meets the obstacle, the intelligent agent bypasses the obstacle according to a local path planning algorithm and reaches an autonomous obstacle-bypassing target point; the target point of the autonomous obstacle detouring is the next node of the node where the intelligent agent is located on the operation path;

step 6, time window adjustment: after the autonomous obstacle avoidance of the intelligent agent is finished, if the intelligent agent does not reach a task end point, a time window checking and adjusting strategy is designed and adopted to ensure the cooperative scheduling among the intelligent agents in a scene in the subsequent operation process of the intelligent agent; the method comprises the following steps:

6A) when the time consumed by the intelligent agent autonomous obstacle avoidance is less than or equal to half of the time window protection time, the time window does not need to be adjusted; in the subsequent running process, the intelligent agent can compensate the consumption of time by adjusting the running speed;

the time window protection time refers to the time which is required to be separated from the time windows of two intelligent agents at least if the two intelligent agents need to pass through a road section;

6B) when the time consumed by the autonomous obstacle avoidance exceeds half of the time window protection time, the time window is adjusted according to the following method:

accessing a data sharing end, and clearing time window data of the current agent;

obtaining a time window structure after the current agent time window data is cleared;

turning to the step 3 to determine a subsequent driving path of the intelligent agent and carrying out motion planning again according to the current position of the intelligent agent;

through the steps, the heterogeneous multi-agent networking cooperative scheduling planning based on autonomous obstacle avoidance can be realized.

2. The heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance as claimed in claim 1, wherein in step 234), β is set _z 0.4 of the threshold value; setting up gamma _z The threshold value of (2) is 0.3.

3. The heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance as claimed in claim 1, wherein step 21) specifically uses dijkstra algorithm to find a shortest path between two points.

4. The heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance as claimed in claim 1, wherein step 3 is to perform motion planning by using a time window algorithm and calculate to obtain an execution path; the method comprises the following steps:

firstly, the intelligent agent accesses a data sharing end to obtain a current time window data structure; the time window data structure maintains a time window vector for each road section in the map, and each element in the time window vector corresponds to a period of time for which an agent occupies the road section; then, for each path received by the agent, the following operations are executed:

for each segment of road a on each path _m ，m＝1,2,…,n：

31) Inserting with a time window of shortest length; the shortest length of the time window is the road section a _m The length of the intelligent agent and the maximum driving speed of the intelligent agent; finding an insertion position satisfying the following two conditions:

(ii) on a road section a _m If the difference between the starting time of the time period after a certain position and the ending time of the time period before the position is enough to accommodate the length of the time window with the shortest length, the position can be used as an insertion position;

if m is not equal to 1, the time for entering the section of road section is after the time for releasing the section of road section on the path of the intelligent agent;

32) if m is not equal to 1, judging whether the time for releasing the last section of road section on the path of the intelligent agent is equal to the time for entering the section of road section; if not, prolonging the release time of the previous section of road section to be equal to the entry time of the section of road section;

33) checking whether there is an overlap between the time windows; if the sections are not overlapped, the insertion is successful, and the next section of road section is continuously operated; if the road sections are overlapped, returning to the previous section of the road section on the path and returning to the step 31);

and after selecting the final execution path by performing time window insertion operation on each path received by the intelligent agent, the intelligent agent sends a message to the data sharing end, and the time window insertion result of the selected execution path is synchronously updated to the data sharing end.

5. The heterogeneous multi-agent networking cooperative scheduling planning method based on autonomous obstacle avoidance of claim 1, wherein in step 5, the agent performs autonomous obstacle avoidance in the operation process according to a local path planning algorithm; specifically, a dynamic time window algorithm DWA is adopted, and the method comprises the following steps:

51) sampling speed;

52) and (3) track scoring:

scoring each trajectory by using an evaluation function; the evaluation indexes include: the distance between the position of the tail end of the track and the position of the target point, the difference between the orientation angle when the tail end of the track is reached and the target azimuth angle, the distance between the track and the nearest obstacle, and the nearest distance between a point on the sampling track and the global path;

53) and (3) speed release:

scoring all tracks and selecting the track with the highest score;

then sending the speed corresponding to the track with the highest score to the intelligent agent;

the intelligent agent runs at the speed in the next period;

and repeating the process until the intelligent agent reaches the target point of autonomous obstacle avoidance.