CN116858253A

CN116858253A - A lightweight predictive navigation method and system suitable for indoor environments

Info

Publication number: CN116858253A
Application number: CN202310965884.3A
Authority: CN
Inventors: 郑南宁; 张晓彤; 符嘉玮; 史佳敏; 陈仕韬
Original assignee: Ningbo Shun'an Artificial Intelligence Research Institute; Xian Jiaotong University
Current assignee: Ningbo Shun'an Artificial Intelligence Research Institute; Xian Jiaotong University
Priority date: 2023-08-02
Filing date: 2023-08-02
Publication date: 2023-10-10

Abstract

The invention discloses a lightweight predictive navigation method and system suitable for indoor environments. The environment prediction combines the obtained sensory information and motion information to generate future environment prediction and mapping, which is expressed as an occupied grid sequence; Motion planning uses a grid sequence with time attributes as a search space, formulates the trajectory solution as a nonlinear optimization problem, and solves the nonlinear optimization problem to obtain the trajectory; in a unified environment representation form, obtains sensing data and vehicle's own motion data (Speed, coordinate) partially decouples the dynamic and static environment. The static environment obtains the future position through coordinate transformation and mapping. The dynamic environment predicts the raster map of several steps in the future by extracting scene motion feature information, and then directly uses the future raster. At the same time Process dynamic and static obstacles to improve the efficiency and effectiveness of trajectory optimization; the method proposed by the present invention can improve real-time performance and safety while reducing computing requirements.

Description

A lightweight predictive navigation method and system suitable for indoor environments

技术领域Technical field

本发明属于智能体导航技术领域，具体涉及一种适用室内环境的轻量性预测式导航方法及系统。The invention belongs to the field of intelligent navigation technology, and specifically relates to a lightweight predictive navigation method and system suitable for indoor environments.

背景技术Background technique

为了完成特定的任务，机器人需要精确而安全的导航才能在不发生碰撞的情况下到达目的地。这种导航是通过实现导航算法来实现的。在动态环境中，该算法包括两个主要组成部分：预测和规划。预测组件提供关于局部障碍物分布的信息，而规划组件基于预测信息生成安全且无缝的轨迹以适应动态变化。尽管研究人员研究了这两种功能的单独实现，但发现它们在轻量级、非结构化的室内动态环境中的性能不足。由于空间拥挤，障碍物密度高，体积小，缺乏统一的表示形式，导致障碍物信息丢失。此外，有限的空间限制了智能体，其较差的实时性能使其无法及时进行规避机动，从而导致碰撞。To complete specific tasks, robots require precise and safe navigation to reach their destination without collisions. This navigation is achieved by implementing navigation algorithms. In a dynamic environment, the algorithm consists of two main components: prediction and planning. The prediction component provides information about local obstacle distribution, while the planning component generates safe and seamless trajectories based on the prediction information to adapt to dynamic changes. Although the researchers studied separate implementations of both functions, their performance was found to be insufficient in lightweight, unstructured indoor dynamic environments. Due to space congestion, high obstacle density, small size, and lack of unified representation, obstacle information is lost. In addition, the limited space constrains the agent, and its poor real-time performance prevents it from performing evasive maneuvers in time, leading to collisions.

多数现存的方法对传感器系统要求高，耗时长，难以满足智能体规划实时性的要求。另外，这些方法采用物体跟踪来获取动态障碍物的未来轨迹，然后将其用作规划模块的输入。然而，许多方法依赖于大规模模型，导致实时性能较差。这可归因于机器人在成本和尺寸方面的限制，使得实现需要过多资源的预测方法具有挑战性。此外，预测和规划之间缺乏统一的表示，导致在多种输入和输出格式的转换过程中丢失了基本信息。此外，规划算法往往对障碍物的未来行为缺乏远见，这可能导致规划算法陷入困境，导致轨迹偏差或不连续。大多数现有的预测方法在很大程度上依赖于多线激光雷达，这是昂贵的，并且会生成大规模的点云，给室内代理导航增加了额外的计算负担。传统的规划算法需要高质量的预测信息，但鲁棒性较低，导致实际适用性有限。Most existing methods have high requirements on sensor systems and are time-consuming, making it difficult to meet the real-time requirements of agent planning. Additionally, these methods employ object tracking to obtain future trajectories of dynamic obstacles, which are then used as inputs to the planning module. However, many methods rely on large-scale models, resulting in poor real-time performance. This can be attributed to limitations in cost and size of robots, making it challenging to implement predictive methods that require excessive resources. Furthermore, the lack of a unified representation between prediction and planning results in the loss of essential information during the conversion process of multiple input and output formats. Additionally, planning algorithms often lack foresight into the future behavior of obstacles, which can cause the planning algorithm to get bogged down, resulting in trajectory deviations or discontinuities. Most existing prediction methods rely heavily on multi-line lidar, which is expensive and generates large-scale point clouds, adding additional computational burden to indoor agent navigation. Traditional planning algorithms require high-quality prediction information but have low robustness, resulting in limited practical applicability.

发明内容Contents of the invention

为了解决现有技术中存在的问题，本发明提供一种适用室内环境的轻量性预测式导航方法采用分层的预测-规划方法，在统一的环境表征形式下，获取感知数据和车辆自身运动数据(速度，坐标)部分解耦动静态环境，静态环境通过坐标变换与映射得到未来位置，动态环境通过提取场景运动特征信息，预测未来若干步长的栅格地图，然后直接利用未来栅格，同时处理动静态障碍物，提升轨迹优化的效率和有效性。In order to solve the problems existing in the prior art, the present invention provides a lightweight predictive navigation method suitable for indoor environments. It adopts a hierarchical prediction-planning method to obtain sensing data and vehicle own motion in a unified environment representation form. The data (velocity, coordinates) part decouples the dynamic and static environment. The static environment obtains the future position through coordinate transformation and mapping. The dynamic environment predicts the raster map of several steps in the future by extracting scene motion feature information, and then directly uses the future raster. It also handles dynamic and static obstacles to improve the efficiency and effectiveness of trajectory optimization.

为了实现上述目的，本发明采用的技术方案是：一种适用室内环境的轻量性预测式导航方法，包括以下步骤：In order to achieve the above objectives, the technical solution adopted by the present invention is: a lightweight predictive navigation method suitable for indoor environments, which includes the following steps:

通过对多传感器系统所获得智能体位姿信息、速度信息与环境表征进行传感测量值的坐标转换，将环境变量投影到未来智能体所处的坐标系下；By performing coordinate conversion of the sensor measurement values on the pose information, speed information and environmental representation of the intelligent body obtained by the multi-sensor system, the environmental variables are projected into the coordinate system of the future intelligent body;

考虑环境变量在未来智能体所处的坐标系下，动态障碍物的自行和静态障碍物在环境中的占用，捕捉两者之间的相互作用，并且将动态分量和静态分量解耦，提取未来概率占据栅格图的潜在变量特征；Consider the environmental variables in the coordinate system where the future agent is located, the propulsion of dynamic obstacles and the occupation of static obstacles in the environment, capture the interaction between the two, and decouple the dynamic components and static components to extract the future Probabilistic occupancy raster plot of latent variable characteristics;

采用变分自动编码器构建OGM占据栅格图预测生成模块，将未来概率占据栅格图的潜在变量特征作为输入，解码器输出预测的占用栅格图序列；A variational autoencoder is used to build an OGM occupancy raster map prediction generation module. The latent variable characteristics of the future probability occupancy raster map are used as input, and the decoder outputs the predicted occupancy raster map sequence;

将每个栅格图顺序地用作采样候选轨迹的可行空间；Each raster map is used sequentially as a feasible space for sampling candidate trajectories;

在所述可行空间中生成候选轨迹，获取满足运动学约束和障碍物约束的初始轨迹；Generate candidate trajectories in the feasible space and obtain initial trajectories that satisfy kinematic constraints and obstacle constraints;

采用图优化方法基于所述初始轨迹生成更平滑、更安全的轨迹。Graph optimization methods are used to generate smoother and safer trajectories based on the initial trajectory.

进一步的，通过对多传感器系统所获得智能体位姿信息、速度信息与环境表征进行传感测量值的坐标转换，将环境变量投影到未来智能体所处的坐标系下包括：Furthermore, by performing coordinate conversion of the sensor measurement values on the pose information, speed information and environmental representation of the intelligent body obtained by the multi-sensor system, projecting the environmental variables into the coordinate system of the future intelligent body includes:

首先根据激光雷达的测量值计算出当前的障碍物占据位置，获取在当前智能体所处的坐标系下的障碍物占据位置；First, calculate the current obstacle occupation position based on the measurement value of the lidar, and obtain the obstacle occupation position in the coordinate system where the current agent is located;

根据智能体当前和下一刻位姿之间的转化关系以及智能体当前位姿信息推断出下一时刻的智能体位姿；将当前障碍物分布投影到经历设定时长后的智能体所处位姿的坐标系，得到环境变量在未来智能体所处的坐标系下的投影。According to the conversion relationship between the current and next poses of the agent and the current pose information of the agent, the pose of the agent at the next moment is inferred; the current obstacle distribution is projected to the pose of the agent after a set period of time. coordinate system to obtain the projection of environmental variables in the coordinate system where the future agent is located.

进一步的，考虑动态障碍物的自行和静态障碍物在环境中的占用，捕捉两者之间的相互作用，并且将动态分量和静态分量解耦，提取未来概率占据栅格图的潜在变量特征具体包括以下步骤：Furthermore, consider the propulsion of dynamic obstacles and the occupation of static obstacles in the environment, capture the interaction between the two, decouple the dynamic components and static components, and extract the specific latent variable characteristics of the future probability occupation raster map. Includes the following steps:

将整个网格地图信息馈送到卷积长短时间记忆网络模块中，卷积长短时间记忆网络模块将一维占据栅格图作为输入，并使用大小为3的卷积核生成动态网格特征；The entire grid map information is fed into the convolutional long and short-term memory network module. The convolutional long and short-term memory network module takes the one-dimensional occupied grid map as input and uses a convolution kernel of size 3 to generate dynamic grid features;

采用贝叶斯生成方法预测未来静态障碍物的局部地图，同时过滤掉动态信息，得到静态障碍物的局部地图信息；The Bayesian generation method is used to predict the local map of future static obstacles, and at the same time filter out the dynamic information to obtain the local map information of the static obstacles;

将提取的动态网格特征和静态障碍物的局部地图信息作为OGM占据栅格图预测生成模块的输入。The extracted dynamic grid features and local map information of static obstacles are used as inputs to the OGM occupancy grid map prediction generation module.

进一步的，所述OGM占据栅格图预测生成模块包括编码器和解码器，编码器由两层卷积层和两层残差层构成，两层残差层参数一致；解码器由两个残差层和两个反卷积层组成，残差层结构与编码器中的结构一致。Further, the OGM occupied raster map prediction generation module includes an encoder and a decoder. The encoder is composed of two convolutional layers and two residual layers. The parameters of the two residual layers are consistent; the decoder is composed of two residual layers. It consists of a difference layer and two deconvolution layers, and the residual layer structure is consistent with the structure in the encoder.

进一步的，将每个栅格图顺序地用作采样候选轨迹的可行空间时，具体的，基于占用栅格图的扩展序列，通过位置变换关系导出预测序列中的占用空间，在设定时间段内，智能体均匀运动，质心是圆心，通过智能体能到达的区域得到每帧的可行空间表示，可行区间内的状态遵守严格速度区间约束和碰撞约束。Further, when each raster map is used sequentially as a feasible space for sampling candidate trajectories, specifically, based on the extended sequence of occupied raster maps, the occupied space in the prediction sequence is derived through the position transformation relationship, and in the set time period Within, the agent moves uniformly, and the center of mass is the center of the circle. The feasible space representation of each frame is obtained through the area that the agent can reach. The state within the feasible interval obeys strict speed interval constraints and collision constraints.

进一步的，在所述可行空间中生成候选轨迹，获取满足运动学约束和障碍物约束的初始轨迹时，采用随机抽样方法在可行空间内选择帧，单独逐个采样栅格图序列，采样的空间表示每个图中未占用的区域，采样时在满足智能体的运动学约束和确保可达性的基础上，得到搜索空间；在采样时，如果相邻帧之间的采样点能在没有任何碰撞的情况下连接，并判断连接后的边是否满足智能体的动力学约束与运动学约束，若满足则将采样点连接在一起以形成边；遍历节点列表以识别成本最低的节点，并启动回溯过程以获得满足运动学约束和障碍物约束的初始轨迹。Further, when generating candidate trajectories in the feasible space and obtaining the initial trajectory that satisfies kinematic constraints and obstacle constraints, a random sampling method is used to select frames in the feasible space, and the raster image sequence is sampled one by one, and the spatial representation of the sampling is For the unoccupied area in each graph, the search space is obtained on the basis of satisfying the kinematic constraints of the agent and ensuring reachability during sampling; during sampling, if the sampling points between adjacent frames can be moved without any collision , and determine whether the connected edge satisfies the dynamics and kinematic constraints of the agent. If so, connect the sampling points together to form an edge; traverse the node list to identify the node with the lowest cost, and start backtracking The process obtains an initial trajectory that satisfies kinematic constraints and obstacle constraints.

进一步的，采用图优化方法基于所述初始轨迹生成更平滑、更安全的轨迹包括：Further, using graph optimization methods to generate smoother and safer trajectories based on the initial trajectory includes:

将使用图优化方法生成更平滑和更安全的轨迹公式化为一个非线性优化问题：The generation of smoother and safer trajectories using graph optimization methods is formulated as a nonlinear optimization problem:

其中，L({P_t}|{m_t:t+β})表示优化目标，{m_t:t+β}表示栅格图序列，C_ego表示自车外轮廓，C_obs表示障碍物外轮廓，表示所能达到的最大速度和最小速度，ω_min,ω_{max}表示所能达到的最大角速度和最小角速度，|a_v|≤a_{max},表示线加速度小于最大线加速度，表示角加速度小于最大角加速度；Among them, L({P _t }|{m _t:t+β }) represents the optimization goal, {m _t:t+β } represents the grid map sequence, C _ego represents the outer contour of the self-car, and C _obs represents the outer contour of the obstacle. contour, represents the maximum speed and minimum speed that can be achieved, ω _min , ω _{max} represents the maximum angular velocity and minimum angular velocity that can be achieved, |a _v |≤a _{max} , which means that the linear acceleration is less than the maximum linear acceleration, Indicates that the angular acceleration is less than the maximum angular acceleration;

将硬约束转换为目标函数内的惩罚，将硬约束视为软约束；循环迭代优化直至优化的轨迹满足运动学约束和障碍物约束或达到迭代的上限，解算得到最终规划路径。The hard constraints are converted into penalties within the objective function, and the hard constraints are regarded as soft constraints; the loop is iteratively optimized until the optimized trajectory satisfies the kinematic constraints and obstacle constraints or reaches the upper limit of the iteration, and the final planned path is obtained by solution.

基于所述方法的技术构思，本发明还提供一种适用室内环境的轻量性预测式导航系统，包括位姿预测模块、动静态特征提取模块、占用栅格图序列预测模块、可行空间获取模块、初始轨迹获取模块以及轨迹优化模块；Based on the technical concept of the method, the present invention also provides a lightweight predictive navigation system suitable for indoor environments, including a pose prediction module, a dynamic and static feature extraction module, an occupancy grid sequence prediction module, and a feasible space acquisition module. , initial trajectory acquisition module and trajectory optimization module;

位姿预测模块通过对多传感器系统所获得智能体位姿信息、速度信息与环境表征进行传感测量值的坐标转换，将环境变量投影到未来智能体所处的坐标系下；The pose prediction module performs coordinate conversion of sensory measurement values on the pose information, speed information and environmental representation of the intelligent body obtained by the multi-sensor system, and projects the environmental variables into the coordinate system of the future intelligent body;

动静态特征提取模块用于考虑环境变量在未来智能体所处的坐标系下，动态障碍物的自行和静态障碍物在环境中的占用，捕捉两者之间的相互作用，并且将动态分量和静态分量解耦，提取未来概率占据栅格图的潜在变量特征；The dynamic and static feature extraction module is used to consider the environmental variables in the coordinate system where the future agent is located, the propulsion of dynamic obstacles and the occupation of static obstacles in the environment, capture the interaction between the two, and combine the dynamic components with Decouple static components to extract latent variable features of future probability occupation raster maps;

占用栅格图序列预测模块采用变分自动编码器构建OGM占据栅格图预测生成模块，将未来概率占据栅格图的潜在变量特征作为输入，解码器输出预测的占用栅格图序列；The occupancy raster map sequence prediction module uses a variational autoencoder to build an OGM occupancy raster map prediction generation module, taking the latent variable characteristics of the future probability occupancy raster map as input, and the decoder outputs the predicted occupancy raster map sequence;

可行空间获取模块用于将每个栅格图顺序地用作采样候选轨迹的可行空间；The feasible space acquisition module is used to sequentially use each raster map as a feasible space for sampling candidate trajectories;

初始轨迹获取模块用于在所述可行空间中生成候选轨迹，获取满足运动学约束和障碍物约束的初始轨迹；The initial trajectory acquisition module is used to generate candidate trajectories in the feasible space and obtain initial trajectories that satisfy kinematic constraints and obstacle constraints;

轨迹优化模块采用图优化方法基于所述初始轨迹生成更平滑、更安全的轨迹。The trajectory optimization module uses graph optimization methods to generate smoother and safer trajectories based on the initial trajectory.

另外提供一种计算机设备，包括处理器以及存储器，存储器用于存储计算机可执行程序，处理器从存储器中读取所述计算机可执行程序并执行，所述处理器执行程序时能实现本发明所述的适用室内环境的轻量性预测式导航方法。In addition, a computer device is provided, including a processor and a memory. The memory is used to store a computer executable program. The processor reads the computer executable program from the memory and executes it. When the processor executes the program, the invention can be realized. The described lightweight predictive navigation method suitable for indoor environments.

本发明同时提供一种计算机可读存储介质，计算机可读存储介质中存储有计算机程序，所述计算机程序被处理器执行时，能实现本发明所述的适用室内环境的轻量性预测式导航方法。The present invention also provides a computer-readable storage medium. A computer program is stored in the computer-readable storage medium. When the computer program is executed by a processor, the lightweight predictive navigation suitable for indoor environments according to the present invention can be realized. method.

与现有技术相比，本发明至少具有以下有益效果：Compared with the prior art, the present invention at least has the following beneficial effects:

本发明基于深度学习进行环境预测，利于自制数据集，适用于不同种类的室内环境。随着所处环境的收集数据增加，可以获得更好的预测效果；提出了一个轻量快速通用的框架，在室内移动机器人和无人驾驶车辆上都能有良好的应用，资源消耗小，实时性高，解决了单线激光的预测问题；预测工作中考虑到了智能体的运动，这种运动信息是必要的，有助于区分动静态障碍物，提供先验信息；动态障碍物由时序栅格的占用状态表示，规划可以同时处理动态和静态障碍物，保证轨迹在多障碍物的情况下安全通行。The present invention performs environmental prediction based on deep learning, is conducive to self-made data sets, and is suitable for different types of indoor environments. As the collected data of the environment increases, better prediction results can be obtained; a lightweight, fast and universal framework is proposed, which can have good applications in indoor mobile robots and driverless vehicles, with low resource consumption and real-time performance. It has high performance and solves the prediction problem of single-line laser; the movement of the agent is taken into account in the prediction work. This kind of movement information is necessary and helps to distinguish dynamic and static obstacles and provides prior information; dynamic obstacles are determined by the time series grid. The occupancy status indicates that the planning can handle dynamic and static obstacles at the same time to ensure safe passage of the trajectory in the presence of multiple obstacles.

附图说明Description of the drawings

图1为本发明的技术框架图。Figure 1 is a technical framework diagram of the present invention.

图2为预测部分框架图。Figure 2 is a framework diagram of the prediction part.

图3为用于预测的变分自动编码器网络结构图。Figure 3 is a structural diagram of the variational autoencoder network used for prediction.

图4为规划部分框架图。Figure 4 is a partial framework diagram of the planning.

具体实施方式Detailed ways

下面结合附图和具体实施对本申请的示范性实例进行详细阐明，其中包括本申请实施例的各种细节以助于理解。应理解这些实施例仅用于说明本发明而不用于限制本发明的范围，在阅读了本发明之后，本领域技术人员对本发明的各种等价形式的修改均落于本申请所附权利要求所限定的范围。Exemplary examples of the present application are explained in detail below with reference to the accompanying drawings and specific implementations, which include various details of the embodiments of the present application to facilitate understanding. It should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention. After reading the present invention, those skilled in the art will make modifications to various equivalent forms of the present invention and all fall within the scope of the appended claims of this application. limited scope.

实时导航是智能体在复杂动态环境中安全驾驶能力的重要组成部分，包括预测和运动规划。然而，现有的大多数方法都需要高质量的传感器系统和巨大的计算消耗，这与实时性要求背道而驰。此外，这些方法缺乏用于预测和规划的统一表示框架，导致表示转换过程中的准确性损失。本发明提出了一个基于占用网格图的预测规划框架。预测模块将感知信息和运动信息相结合，以处理动态和静态障碍物，并生成占用网格图序列。规划模块对候选轨迹进行采样、选择和优化。在多个场景中的实验结果表明，与常规方法相比，所提出的框架可以提高实时性能和安全性，同时降低计算需求。Real-time navigation is an important component of an agent's ability to drive safely in complex dynamic environments, including prediction and motion planning. However, most existing methods require high-quality sensor systems and huge computational consumption, which is contrary to real-time requirements. Furthermore, these methods lack a unified representation framework for prediction and planning, resulting in accuracy loss during representation conversion. The present invention proposes a predictive planning framework based on occupancy grid diagrams. The prediction module combines sensory and motion information to handle dynamic and static obstacles and generates a sequence of occupancy grid maps. The planning module samples, selects and optimizes candidate trajectories. Experimental results in multiple scenarios show that the proposed framework can improve real-time performance and security while reducing computational requirements compared to conventional methods.

图1是本发明基于深度学习的预测式导航方法流程示意图，本发明适用于复杂动态的环境，对于室内室外环境具有通用性。本发明的目的是从传感器获得的测量值为智能代理生成一系列未来包含位姿与速度信息的状态。本发明将传感器对于当前环境的表征和下一时间步长的预期姿态作为输入，包括初始环境信息和潜在的运动变量，期望的输出是可行的运动轨迹。本发明分为两个相互关联的部分：环境预测和运动规划。环境预测将所获得的感知信息与运动信息相结合，能够生成未来的环境预测和映射，表示为占据栅格图序列；运动规划将具有时间属性的网格序列作为搜索空间，并将轨迹解公式化为非线性优化问题，解算非线性优化问题得到轨迹。本发明的细节如下：Figure 1 is a schematic flowchart of the predictive navigation method based on deep learning of the present invention. The present invention is suitable for complex and dynamic environments and has universality for indoor and outdoor environments. The purpose of the invention is to generate a series of future states containing pose and velocity information for an intelligent agent from measurements obtained from sensors. This invention takes the sensor's representation of the current environment and the expected posture of the next time step as input, including initial environment information and potential motion variables, and the expected output is a feasible motion trajectory. The invention is divided into two interrelated parts: environment prediction and motion planning. Environment prediction combines the obtained sensory information with motion information to generate future environment predictions and mappings, expressed as occupied grid sequence; motion planning uses the grid sequence with time attributes as the search space and formulates the trajectory solution It is a nonlinear optimization problem, and the trajectory is obtained by solving the nonlinear optimization problem. The details of the invention are as follows:

步骤1：在实际的智能体运行过程中，采集的激光雷达数据所在的局部坐标系会随着智能体的运动而不断变化，这会对占据栅格图的预测造成影响。运动补偿分为两步，分别是智能体位姿预测以及坐标转换。位姿预测的过程中，通过对多传感器系统所获得智能体位姿信息、速度信息与环境表征进行传感测量值的坐标转换，将环境变量投影到未来智能体所处的坐标系下，具体细节如下：Step 1: During the actual operation of the agent, the local coordinate system where the collected lidar data is located will continue to change with the movement of the agent, which will affect the prediction of the occupied raster map. Motion compensation is divided into two steps, namely body pose prediction and coordinate conversion. In the process of pose prediction, the environmental variables are projected into the coordinate system of the future agent by performing coordinate transformation on the pose information, speed information and environmental representation of the intelligent body obtained by the multi-sensor system. The specific details are as follows:

本发明可以使用任何可以构建概率栅格图的传感器，作为示例，以单线激光这种低成本的传感器进行说明。假定激光雷达的测量值为M＝[d_t,k_t*δ]，其中d_t是t时刻激光雷达测量出的距离，k_t是激光雷达的测距顺序，δ是激光雷达的分辨率，用角度表示。The present invention can use any sensor that can construct a probability grid map. As an example, a low-cost sensor such as a single-line laser is used for illustration. Assume that the measurement value of lidar is M = [d _t , k _t * δ], where d _t is the distance measured by lidar at time t, k _t is the ranging sequence of lidar, and δ is the resolution of lidar, Expressed in terms of angles.

(1)首先根据激光雷达的测量值计算出当前的障碍物占据位置，在当前智能体所处的坐标系下，障碍物占据位置为：(1) First, calculate the current occupied position of the obstacle based on the measurement value of the lidar. In the coordinate system where the current agent is located, the occupied position of the obstacle is:

[x,y]＝d_t[cos(k_t*δ),sin(k_t*δ)]^T [x,y]＝d _t [cos(k _t *δ),sin(k _t *δ)] ^T

而根据惯性测量单元可以获得智能体t时刻的线速度V_t、角速度ω_t，线加速度a_vt，角加速度a_ωt。According to the inertial measurement unit, the linear velocity V _t , angular velocity ω _t , linear acceleration a _vt and angular acceleration a _ωt of the agent at time t can be obtained.

(2)根据定位模块所给出的智能体当前位姿信息[x_ego,y_ego,θ₁]，本发明假定在0.1s的时间间隔Δt内智能体进行匀速直线运动，可以推断出下一时刻的智能体位姿[x′_ego,y′_ego,θ₂]。其中x，y，θ分别表示智能体所处位置的横纵坐标与朝向。(2) According to the current pose information of the agent [x _ego , y _ego , θ ₁ ] given by the positioning module, the present invention assumes that the agent performs uniform linear motion within the time interval Δt of 0.1s, and can infer the next The pose of the agent at the moment [x′ _ego ,y′ _ego ,θ ₂ ]. Among them, x, y, and θ respectively represent the horizontal and vertical coordinates and orientation of the location of the agent.

智能体当前和下一刻位姿之间的转化关系可以表示如下：The transformation relationship between the current and next moment poses of the agent can be expressed as follows:

将当前障碍物分布投影到经历Δt时长后的智能体所处位姿的坐标系中，可以获得障碍物占据点的对应坐标[x^′,y^′]，其计算方法如下：By projecting the current obstacle distribution into the coordinate system of the agent's pose after Δt, the corresponding coordinates [x ^′ , y ^′ ] of the obstacle occupation point can be obtained. The calculation method is as follows:

其中，[x^′,y^′]是经历Δt时长后的智能体所处位姿的坐标系中，可以获得障碍物占据点的对应坐标，[x_ego,y_ego]是当前全局坐标系下智能体所处位置，θ₁为朝向，[x^′ _ego,y^′ _ego]是经历Δt时长后全局坐标系下智能体所处位置，θ₂为朝向；进而得到环境变量在未来智能体所处的坐标系下的投影。Among them, [x ^′ , y ^′ ] is the coordinate system of the pose of the agent after Δt, and the corresponding coordinates of the obstacle occupation point can be obtained. [x _ego , y _ego ] is the coordinate system of the intelligent agent in the current global coordinate system. The position of the body, θ ₁ is the orientation, [x ^′ _ego , y ^′ _ego ] is the position of the agent in the global coordinate system after Δt time, θ ₂ is the orientation; and then we get the position of the environmental variables in the future where the agent is Projection in coordinate system.

步骤2：提取动静态特征信息。提取动静态特征信息过程同时考虑动态障碍物的自行和静态障碍物在环境中的占用，隐含地捕捉两者之间的相互作用，并且将动态和静态分量部分解耦，以提取未来概率占据栅格图的潜在变量特征。Step 2: Extract dynamic and static feature information. The process of extracting dynamic and static feature information simultaneously considers the propulsion of dynamic obstacles and the occupancy of static obstacles in the environment, implicitly captures the interaction between the two, and partially decouples the dynamic and static components to extract future probability occupancy Latent variable characteristics of raster plots.

(1)为了预测动态网格信息，将整个网格地图信息馈送到卷积长短时间记忆网络模块中。卷积长短时间记忆网络模块将一维占据栅格图作为输入，并使用大小为3的卷积核生成32维动态网格特征。(1) In order to predict dynamic grid information, the entire grid map information is fed into the convolutional long short-term memory network module. The convolutional long short-term memory network module takes a one-dimensional occupancy grid map as input and uses a convolution kernel of size 3 to generate 32-dimensional dynamic grid features.

(2)关于静态障碍物信息，采用贝叶斯生成方法预测未来静态障碍物的局部地图，同时过滤掉动态信息，得到静态障碍物的局部地图信息。这种方法允许动态和静态信息的成功解耦和特征提取。(2) Regarding the static obstacle information, the Bayesian generation method is used to predict the local map of the future static obstacles, and at the same time filter out the dynamic information to obtain the local map information of the static obstacles. This approach allows successful decoupling and feature extraction of dynamic and static information.

(3)将提取的动态网格特征和静态障碍物的局部地图信息输入到占据栅格图预测生成模块。(3) Input the extracted dynamic grid features and local map information of static obstacles into the occupancy grid map prediction and generation module.

步骤3：本发明采用变分自动编码器(VAE)构建OGM占据栅格图预测生成模块，模块通过提取到的动态栅格特征和局部地图，通过模型训练得到预训练好的网络参数，预测未来时刻的栅格图序列，表征环境分布。参考图2和图3。Step 3: The present invention uses a variational autoencoder (VAE) to build an OGM occupied grid map prediction generation module. The module obtains pre-trained network parameters through model training through extracted dynamic grid features and local maps, and predicts the future. A sequence of raster images at a moment in time, representing the distribution of the environment. Refer to Figures 2 and 3.

(1)编码器：编码器部分由两层卷积层和两层残差层构成。第一层卷积输入维度为32的特征M，输出维度为64的特征，卷积核大小为4，步长为2，padding值为1。第二层卷积输入维度为64的特征，输出维度为128的特征，卷积核大小为4，步长为2，padding值为1。两层残差层参数一致，每个残差层输入维度为128维的特征X，特征X经过两层卷积处理得到X2，残差层的输出为X+X2；第一层卷积输入128维，输出64维，卷积核大小为3，步长为1。第二层卷积输入64维，输出128维，卷积核大小为1，步长为1。然后将残差层的输出经过两个结构相同卷积块(输入维度128，输出维度2)编码得到μ和σ。(1) Encoder: The encoder part consists of two convolutional layers and two residual layers. The first layer of convolution inputs features M with dimension 32 and outputs features with dimension 64. The convolution kernel size is 4, the step size is 2, and the padding value is 1. The second layer of convolution inputs features with a dimension of 64 and outputs features with a dimension of 128. The convolution kernel size is 4, the stride is 2, and the padding value is 1. The parameters of the two residual layers are consistent. Each residual layer inputs a 128-dimensional feature X. The feature X is processed by two layers of convolution to obtain X2. The output of the residual layer is Dimension, the output is 64 dimensions, the convolution kernel size is 3, and the stride is 1. The second layer of convolution inputs 64 dimensions and outputs 128 dimensions. The convolution kernel size is 1 and the stride is 1. Then the output of the residual layer is encoded through two convolution blocks with the same structure (input dimension 128, output dimension 2) to obtain μ and σ.

(3)解码器：解码器部分将正态分布采样得到的Z通过反卷积模块(输入维度为2，输出维度为128)生成维度为128维的特征，然后输入到VAE解码器中。解码器由两个残差层和两个反卷积层组成，残差层结构与编码器中的结构一致，输入128维特征，输出128维特征。然后经过第一个卷积层，输出维度64的特征，卷积核大小为4，步长为2。之后输入第二个卷积层，输出维度为64的特征Q，卷积核大小为4，步长为2。(3) Decoder: The decoder part uses the Z sampled from the normal distribution through the deconvolution module (input dimension is 2, output dimension is 128) to generate 128-dimensional features, and then inputs it into the VAE decoder. The decoder consists of two residual layers and two deconvolution layers. The residual layer structure is consistent with the structure in the encoder. It inputs 128-dimensional features and outputs 128-dimensional features. Then after the first convolution layer, the features of dimension 64 are output, the convolution kernel size is 4, and the stride is 2. Then the second convolution layer is input, and the output dimension is feature Q of 64, the convolution kernel size is 4, and the stride is 2.

(4)Q输入到输出卷积层(卷积核大小为3，步长为1)中，输出维度为1的预测的占用栅格图序列，交给运动规划模块进行处理。(4) Q is input into the output convolution layer (convolution kernel size is 3, step size is 1), and the predicted occupancy grid sequence with dimension 1 is output and handed over to the motion planning module for processing.

基于上述网络结构，本发明在应用整个训练集进行模型的训练，基于KL散度和交叉熵混合损失函数：Based on the above network structure, the present invention uses the entire training set to train the model, based on the KL divergence and cross-entropy hybrid loss function:

N是样本的数量，C是分类数量，y_ij指示第i个样本的真实标签是否是第j个类(如果是1，如果不是0)，p_ij表示第i个样本属于第j类的预测概率。本发明进行了60轮的训练，每一轮均对整个训练集进行遍历。梯度下降方法选用Adam，batch_size设为64，学习率设置为随训练轮数的增加不断减小。经过60轮的训练，损失函数逐渐下降并趋于稳定。N is the number of samples, C is the number of categories, y _ij indicates whether the true label of the i-th sample is the j-th class (if it is 1, if not 0), p _ij indicates the prediction that the i-th sample belongs to the j-th class Probability. The present invention conducted 60 rounds of training, and each round traversed the entire training set. The gradient descent method is Adam, the batch_size is set to 64, and the learning rate is set to decrease with the increase of the number of training rounds. After 60 rounds of training, the loss function gradually decreases and stabilizes.

步骤4：本发明在获得预测的占用栅格图序列后，将每个栅格图顺序地用作采样候选轨迹的可行空间。建立一个可行的空间序列，该所述空间序列将时间信息合并在智能体的局部坐标系内。本发明将规划问题定义为优化问题，可以通过获得初始解并迭代优化来解决，可以参考图4。Step 4: After obtaining the predicted occupancy grid map sequence, the present invention uses each grid map sequentially as a feasible space for sampling candidate trajectories. A feasible spatial sequence is established that incorporates temporal information within the agent's local coordinate system. The present invention defines the planning problem as an optimization problem, which can be solved by obtaining the initial solution and iterative optimization. Please refer to Figure 4.

为了简化计算，以智能体半径为膨胀半径对障碍物进行膨胀，将智能代理转换为质点，目的是计算碰撞和确定代理的轨迹。In order to simplify the calculation, the obstacle is expanded using the radius of the agent as the expansion radius, and the intelligent agent is converted into a particle. The purpose is to calculate the collision and determine the trajectory of the agent.

通过获得占用栅格图的扩展序列，可以通过位置变换关系导出预测序列中的占用空间，所述位置变换关系如下式所示：By obtaining the extended sequence of the occupied raster map, the occupied space in the predicted sequence can be derived through the position transformation relationship, which is shown in the following formula:

在短时间间隔内，智能体可以被认为是均匀运动的，质心是圆心，[V_maxΔt,V_minΔt]作为半径是智能体可以到达的区域。因此，每帧的可行空间表示可以表示如下：可行区间内的状态必须遵守严格速度区间约束和碰撞约束，最终采样的轨迹也必须保持在可行空间内。V_max和V_min分别表示智能体的最大最小速度，表征以最大速度行驶Δt时间所能达到的空间范围，/>表征以最小速度行驶Δt时间所能达到的空间范围，C_obs表示未被障碍物占据的空间。In a short time interval, the agent can be considered to be moving uniformly, the center of mass is the center of the circle, and [V _max Δt, V _min Δt] as the radius is the area that the agent can reach. Therefore, the feasible space representation of each frame can be expressed as follows: The states within the feasible interval must comply with strict speed interval constraints and collision constraints, and the final sampled trajectory must also remain within the feasible space. V _max and V _min represent the maximum and minimum speed of the agent respectively, Characterizes the spatial range that can be reached by driving at the maximum speed Δt time,/> It represents the spatial range that can be reached by driving at the minimum speed Δt time, and C _obs represents the space not occupied by obstacles.

步骤5：获得可行空间后，就在其中进行候选轨迹生成。本发明采用随机抽样方法在可行空间内选择帧。本发明单独逐个采样步骤4中膨胀后的栅格图序列，采样的空间表示每个图中未占用的区域。在对后续地图进行采样时，评估未占用栅格的可达性，以确保可达性，为了满足智能体的运动学约束，搜索空间描述如下：Step 5: After obtaining the feasible space, generate candidate trajectories in it. The present invention uses a random sampling method to select frames within the feasible space. The present invention individually samples the expanded raster image sequence in step 4 one by one, and the sampling space represents the unoccupied area in each image. When sampling subsequent maps, the reachability of unoccupied grids is evaluated to ensure reachability. In order to satisfy the kinematic constraints of the agent, the search space is described as follows:

如果相邻帧之间的采样点可以在没有任何碰撞的情况下连接，并且满足构造条件：连接的边没有落在可行空间外、两点间满足动力学约束与运动学约束，则将符合要求的相邻帧之间的采样点连接在一起以形成边。如果采样点不符合这些构造条件，则会将其丢弃，然后遍历采样点列表以识别成本最低的节点，并启动回溯过程以获得初始轨迹作为初始解。If the sampling points between adjacent frames can be connected without any collision, and the construction conditions are met: the connected edges do not fall outside the feasible space, and the dynamic constraints and kinematic constraints between the two points are met, then the requirements will be met Sample points between adjacent frames are connected together to form edges. If a sampling point does not meet these construction conditions, it is discarded, and then the list of sampling points is traversed to identify the node with the lowest cost and a backtracking process is initiated to obtain the initial trajectory as the initial solution.

步骤6：获得同时满足运动学约束和障碍物约束的初始轨迹后，就通过使用图优化方法生成更平滑、更安全的轨迹。智能体在任何给定时间的状态转换满足以下条件：Step 6: After obtaining the initial trajectory that satisfies both kinematic constraints and obstacle constraints, a smoother and safer trajectory is generated by using graph optimization methods. The agent's state transition at any given time satisfies the following conditions:

要优化的评估函数包括与参考轨迹间的横向和纵向距离偏差、方向偏差、线速度偏差与角速度偏差。优化评价指标L({P_t})由两部分组成：第一部分表示与全局路径所得到的参考轨迹{P_ref}的偏差，而第二部分反映当前路径的平滑度：L({P_t})＝L^offset({P_t}；{P_ref})+L^smooth({P_t})。其中L({P_t})表示当前路径的代价，L^offset({P_t}；{P_ref})表示与参考轨迹{P_ref}的偏移量，而L^smooth({P_t})表示轨迹的平滑程度。The evaluation functions to be optimized include lateral and longitudinal distance deviations, directional deviations, linear velocity deviations and angular velocity deviations from the reference trajectory. The optimization evaluation index L({P _t }) consists of two parts: the first part represents the deviation from the reference trajectory {P _ref } obtained by the global path, while the second part reflects the smoothness of the current path: L({P _t } )=L ^offset ({P _t }; {P _ref })+L ^smooth ({P _t }). where L({P _t }) represents the cost of the current path, L ^offset ({P _t }; {P _ref }) represents the offset from the reference trajectory {P _ref }, and L ^smooth ({P _t }) represents The smoothness of the trajectory.

动态约束可以表示为确保智能体的加速度和速度遵守特定的限制，最终，该路径规划问题被公式化为一个非线性优化问题：Dynamic constraints can be expressed as ensuring that the agent's acceleration and velocity adhere to specific limits. Ultimately, the path planning problem is formulated as a nonlinear optimization problem:

其中，L({P_t}|{m_t:t+β})表示优化目标，{m_t:t+β}表示栅格图序列，C_ego表示自车外轮廓，C_obs表示障碍物外轮廓，表示所能达到的最大速度和最小速度，ω_min,ω_{max}表示所能达到的最大角速度和最小角速度，|a_v|≤a_{max},表示线加速度小于最大线加速度，表示角加速度小于最大角加速度。Among them, L({P _t }|{m _t:t+β }) represents the optimization goal, {m _t:t+β } represents the grid map sequence, C _ego represents the outer contour of the self-car, and C _obs represents the outer contour of the obstacle. contour, represents the maximum speed and minimum speed that can be achieved, ω _min , ω _{max} represents the maximum angular velocity and minimum angular velocity that can be achieved, |a _v |≤a _{max} , which means that the linear acceleration is less than the maximum linear acceleration, Indicates that the angular acceleration is less than the maximum angular acceleration.

考虑到与解决非凸约束问题相关的挑战，通过将硬约束转换为目标函数内的惩罚，而将硬约束视为软约束。如果优化的轨迹不能满足运动学约束和障碍物约束，则重复执行本步骤优化过程进行进一步的优化，直到达到迭代的上限，则结束该求解过程，输出最终的规划路径。Taking into account the challenges associated with solving non-convex constrained problems, hard constraints are treated as soft constraints by converting them into penalties within the objective function. If the optimized trajectory cannot satisfy the kinematic constraints and obstacle constraints, the optimization process of this step is repeated for further optimization until the upper limit of the iteration is reached, then the solution process is ended and the final planned path is output.

本发明在基于Gazebo软件的仿真平台上进行验证，为与多类室内环境进行对应，本发明使用了Gazebo软件进行仿真环境的搭建，构建包含走廊、桌子、椅子的室内环境仿真平台。每一轮测试采用随机的起终点，和运动速度，起始位置随机分布在智能体周围0.5m外的障碍物，障碍物的数量、最大最小速度可以进行设置，当障碍物与智能体的距离小于0.05m，则视为发生碰撞，若发生碰撞，当前智能体测试轮次视为结束。若正常到达终点，当前智能体测试轮次视为结束。The present invention is verified on a simulation platform based on Gazebo software. In order to correspond to multiple types of indoor environments, the present invention uses Gazebo software to build a simulation environment and build an indoor environment simulation platform including corridors, tables, and chairs. Each round of testing uses random starting and ending points, and movement speed. The starting position is randomly distributed with obstacles 0.5m away around the agent. The number of obstacles, maximum and minimum speeds can be set. When the distance between the obstacles and the agent If it is less than 0.05m, it will be regarded as a collision. If a collision occurs, the current agent test round will be deemed to be over. If the end point is reached normally, the current agent test round is deemed to be over.

基于以上测试要求，本发明采用的测试参数如下表所示：Based on the above test requirements, the test parameters used in this invention are as shown in the following table:

最终的测试结果如下表所示：The final test results are shown in the table below:

基于本发明所述方法的构思，还提供一种适用室内环境的轻量性预测式导航系统，包括位姿预测模块、动静态特征提取模块、占用栅格图序列预测模块、可行空间获取模块、初始轨迹获取模块以及轨迹优化模块；Based on the concept of the method of the present invention, a lightweight predictive navigation system suitable for indoor environments is also provided, including a pose prediction module, a dynamic and static feature extraction module, an occupancy grid sequence prediction module, and a feasible space acquisition module. Initial trajectory acquisition module and trajectory optimization module;

另一方面，本发明还提供一种计算机可读存储介质，计算机可读存储介质中存储有计算机程序，所述计算机程序被处理器执行时，能实现本发明所述的适用室内环境的轻量性预测式导航方法。On the other hand, the present invention also provides a computer-readable storage medium. A computer program is stored in the computer-readable storage medium. When the computer program is executed by a processor, the lightweight indoor environment suitable for indoor environments described in the present invention can be realized. Predictive navigation methods.

所述计算机设备可以采用笔记本电脑、桌面型计算机、工作站或车载计算机。The computer equipment may be a laptop computer, a desktop computer, a workstation or a vehicle-mounted computer.

对于本发明所述处理器，可以是中央处理器(CPU)、图形处理器(GPU)、数字信号处理器(DSP)、专用集成电路(ASIC)或现成可编程门阵列(FPGA)。The processor of the present invention may be a central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP), an application specific integrated circuit (ASIC) or an off-the-shelf programmable gate array (FPGA).

对于本发明所述存储器，可以是笔记本电脑、桌面型计算机、工作站或车载计算机的内部存储单元，如内存、硬盘；也可以采用外部存储单元，如移动硬盘、闪存卡。The memory of the present invention can be an internal storage unit of a notebook computer, a desktop computer, a workstation or a vehicle-mounted computer, such as a memory or a hard disk; it can also be an external storage unit, such as a mobile hard disk or a flash memory card.

本发明还可以提供一种计算机设备，包括处理器以及存储器，存储器用于存储计算机可执行程序，处理器从存储器中读取所述计算机可执行程序并执行，处理器执行计算机可执行程序时能实现本发明所述的适用室内环境的轻量性预测式导航方法。The present invention can also provide a computer device, including a processor and a memory. The memory is used to store computer executable programs. The processor reads the computer executable programs from the memory and executes them. When the processor executes the computer executable programs, it can Implement the lightweight predictive navigation method suitable for indoor environments according to the present invention.

计算机可读存储介质可以包括计算机存储介质和通信介质。计算机存储介质包括以用于存储诸如计算机可读指令、数据结构、程序模块或其他数据等信息的任何方法或技术实现的易失性和非易失性、可移动和不可移动介质。计算机可读存储介质可以包括：只读存储器(ROM，Read Only Memory)、随机存取记忆体(RAM，Random Access Memory)、固态硬盘(SSD，Solid State Drives)或光盘等。其中，随机存取记忆体可以包括电阻式随机存取记忆体(ReRAM，Resistance Random Access Memory)和动态随机存取存储器(DRAM，Dynamic Random Access Memory)。Computer-readable storage media may include computer storage media and communication media. Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Computer-readable storage media may include: Read Only Memory (ROM), Random Access Memory (RAM), Solid State Drives (SSD), optical disks, etc. Among them, random access memory may include resistive random access memory (ReRAM, Resistance Random Access Memory) and dynamic random access memory (DRAM, Dynamic Random Access Memory).

综上所述，本发明提供一种适用室内环境的轻量性预测式导航方法采用分层的预测-规划方法，在统一的环境表征形式下，获取感知数据和车辆自身运动数据(速度，坐标)部分解耦动、静态环境，静态环境通过坐标变换与映射得到未来位置，动态环境通过提取场景运动特征信息，预测未来若干步长的栅格地图，然后直接利用未来栅格，同时处理动静态障碍物，提升轨迹优化的效率和有效性。To sum up, the present invention provides a lightweight predictive navigation method suitable for indoor environments. It adopts a hierarchical prediction-planning method to obtain sensing data and vehicle's own motion data (speed, coordinates) in a unified environment representation form. ) partially decouples the dynamic and static environments. The static environment obtains the future position through coordinate transformation and mapping. The dynamic environment predicts the raster map of several steps in the future by extracting scene motion feature information, and then directly uses the future raster to process the dynamic and static environments at the same time. Obstacles improve the efficiency and effectiveness of trajectory optimization.

本发明所述方法能够在一个统一的表征形式下进行智能体的长时运动规划是一项有难度的任务。栅格地图是一种精简的，常见的环境表征方法，其具有与俯视图下类似的表征，适用于非结构化环境。栅格地图可以表征规划所需的可行空间，并具有抽象性、高复用性，可以直接为规划所利用。占用网格图的好处是能够表示任意形状的物体，并且能够估计物体的运动而不需要明确的数据关联。本发明所述方法用概率占据栅格图作为一种全面的表示，以防止转换过程中的信息丢失。本发明按时间顺序组织预测的占据栅格图序列以生成轨迹，从而减轻了规划过程对先前预测信息的依赖。关于预测方面，通过利用单线激光雷达进行感知，并结合运动信息，解决了实时性较差和预测结果不准确的挑战，此外，还采用了轻型结构来加快计算速度。关于规划问题，基于占据栅格图序列生成可行轨迹，同时考虑障碍物的未来分布，从而解决短期规划问题。此外，对所获得的轨迹进行优化，以确保平滑并减少轨迹的不连续性。The method of the present invention can perform long-term motion planning of an agent in a unified representation form, which is a difficult task. Raster map is a streamlined and common environment representation method, which has a representation similar to that in a top view and is suitable for unstructured environments. Raster maps can represent the feasible space required for planning, are abstract and highly reusable, and can be directly utilized for planning. The benefits of occupancy grid graphs are the ability to represent objects of arbitrary shapes and the ability to estimate the motion of objects without the need for explicit data associations. The method of the present invention uses a probabilistic occupancy raster diagram as a comprehensive representation to prevent information loss during the conversion process. The present invention chronologically organizes a sequence of predicted occupancy grid maps to generate trajectories, thereby alleviating the planning process' dependence on previously predicted information. Regarding prediction, by utilizing single-line lidar for sensing and combining motion information, the challenges of poor real-time performance and inaccurate prediction results are solved. In addition, a lightweight structure is used to speed up calculations. Regarding the planning problem, short-term planning problems are solved by generating feasible trajectories based on the occupancy grid sequence while taking into account the future distribution of obstacles. Furthermore, the obtained trajectories are optimized to ensure smoothness and reduce trajectory discontinuities.

最后说明的是，以上所述，仅为说明本发明的具体实施方式，但本发明创造的保护范围并不局限于此，熟悉本技术领域的技术人员应该明白，在本发明技术方案的基础上，根据本发明创造的技术方案及其发明构思做出的修改或变形，都应涵盖在本发明的保护范围之内。Finally, it should be noted that the above is only to illustrate the specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Those familiar with the technical field should understand that on the basis of the technical solution of the present invention, , modifications or deformations made based on the technical solution created by the present invention and its inventive concept shall be covered by the protection scope of the present invention.

Claims

1. The lightweight predictive navigation method suitable for the indoor environment is characterized by comprising the following steps of:

the method comprises the steps of performing coordinate conversion of sensing measurement values on pose information, speed information and environmental characterization of an intelligent agent obtained by a multi-sensor system, and projecting environmental variables to a coordinate system where the intelligent agent is located in the future;

taking into consideration the self-walking of the dynamic barrier and the occupation of the static barrier in the environment of the environment variable under the coordinate system of the future intelligent agent, capturing the interaction between the dynamic barrier and the static barrier, decoupling the dynamic component and the static component, and extracting the potential variable characteristics of the future probability occupation grid map;

constructing an OGM occupation raster pattern prediction generation module by adopting a variation automatic encoder, taking potential variable characteristics of a future probability occupation raster pattern as input, and outputting a predicted occupation raster pattern sequence by a decoder;

sequentially using each raster pattern as a feasible space for sampling candidate trajectories;

generating candidate tracks in the feasible space, and acquiring initial tracks meeting kinematic constraint and barrier constraint;

a smoother and safer trajectory is generated based on the initial trajectory using a graph optimization method.

2. The lightweight predictive navigation method for an applicable indoor environment according to claim 1, wherein projecting the environmental variable into a coordinate system in which the future agent is located by performing coordinate transformation of the sensed measurement values of the agent pose information, the velocity information, and the environmental characterization obtained by the multi-sensor system comprises:

firstly, calculating the current obstacle occupation position according to the measured value of the laser radar, and acquiring the obstacle occupation position under the coordinate system of the current intelligent agent;

deducing the pose of the intelligent agent at the next moment according to the conversion relation between the current pose and the next moment of the intelligent agent and the current pose information of the intelligent agent; and projecting the current obstacle distribution to a coordinate system of the pose of the intelligent agent after the set time length, and obtaining the projection of the environment variable under the coordinate system of the intelligent agent in the future.

3. The lightweight predictive navigation method for an applicable indoor environment according to claim 1, wherein taking into account the occupancy of the environment by the self-moving and static obstacles of the dynamic obstacle, capturing the interaction between the two, and decoupling the dynamic and static components, extracting the potential variable features of the future probability occupancy trellis diagram comprises the steps of:

feeding the whole grid map information into a convolution long-short time memory network module, wherein the convolution long-short time memory network module takes a one-dimensional occupied grid graph as input, and generates dynamic grid features by using a convolution kernel with the size of 3;

predicting a local map of the static obstacle in the future by adopting a Bayesian generation method, and filtering out dynamic information to obtain local map information of the static obstacle;

and taking the extracted dynamic grid characteristics and the local map information of the static obstacle as the input of the OWM occupation grid map prediction generation module.

4. The lightweight predictive navigation method applicable to indoor environments according to claim 1, wherein the OGM occupying raster image prediction generation module comprises an encoder and a decoder, the encoder is composed of two convolution layers and two residual layers, and parameters of the two residual layers are consistent; the decoder consists of two residual layers and two deconvolution layers, the residual layer structure is identical to that in the encoder.

5. The lightweight predictive navigation method for an applicable indoor environment according to claim 1, wherein when each raster pattern is sequentially used as a feasible space for sampling candidate trajectories, specifically, based on an extended sequence of occupied raster patterns, the occupied space in the predicted sequence is derived through a position transformation relationship, in a set period of time, an agent uniformly moves, a centroid is a center of a circle, a feasible space representation of each frame is obtained through an area that the agent can reach, and states in the feasible interval obey a strict speed interval constraint and a collision constraint.

6. The lightweight predictive navigation method applicable to indoor environments according to claim 1, wherein candidate tracks are generated in the feasible space, frames are selected in the feasible space by adopting a random sampling method when initial tracks meeting kinematic constraint and barrier constraint are obtained, grid graph sequences are sampled individually and one by one, the sampled space represents an unoccupied area in each graph, and a search space is obtained on the basis of meeting the kinematic constraint of an intelligent agent and ensuring accessibility during sampling; when sampling is carried out, if sampling points between adjacent frames can be connected under the condition of no collision, judging whether the connected edges meet the dynamic constraint and the kinematic constraint of an intelligent agent, and if so, connecting the sampling points together to form the edges; traversing the node list to identify the lowest cost node and initiating a backtracking process to obtain an initial trajectory that satisfies the kinematic and obstacle constraints.

7. The lightweight predictive navigation method for an applicable indoor environment of claim 1, wherein generating a smoother, safer trajectory based on the initial trajectory using a graph optimization method comprises:

creating smoother and safer trajectories using the graph optimization method is formulated as a nonlinear optimization problem:

wherein L ({ P) _t }|{m _t:t+β }) represents optimization objective, { m _t:t+β "represents a raster pattern sequence, C _ego Represents the outline of the bicycle, C _obs Represents the outline of the obstacle and,represents the maximum and minimum speeds achievable, ω _min ,ω _{max} Represents the maximum and minimum angular velocities achievable, |a _v |≤a _{max} Indicating that the linear acceleration is less than the maximum linear acceleration, indicating that the angular acceleration is less than the maximum angular acceleration;

converting the hard constraint into punishment in the objective function, and treating the hard constraint as a soft constraint; and (3) performing iterative optimization until the optimized track meets kinematic constraint and barrier constraint or reaches the iterative upper limit, and calculating to obtain a final planning path.

8. The lightweight predictive navigation system suitable for the indoor environment is characterized by comprising a pose prediction module, a dynamic and static feature extraction module, an occupied grid pattern sequence prediction module, a feasible space acquisition module, an initial track acquisition module and a track optimization module;

the pose prediction module is used for carrying out coordinate conversion on sensing measurement values on pose information, speed information and environment characterization of the intelligent agent obtained by the multi-sensor system, and projecting the environment variables to a coordinate system where the intelligent agent is located in the future;

the dynamic and static feature extraction module is used for taking into account the occupation of the dynamic barrier and the static barrier in the environment by the self-walking of the environment variable under the coordinate system of the future intelligent agent, capturing the interaction between the dynamic component and the static component, decoupling the dynamic component and the static component, and extracting the potential variable feature of the future probability occupation grid map;

the occupied raster pattern sequence prediction module adopts a variation automatic encoder to construct an OWM occupied raster pattern prediction generation module, potential variable characteristics of future probability occupied raster patterns are used as input, and a decoder outputs a predicted occupied raster pattern sequence;

the feasible space acquisition module is used for sequentially using each grid graph as a feasible space of the sampling candidate track;

the initial track acquisition module is used for generating candidate tracks in the feasible space and acquiring initial tracks meeting kinematic constraint and obstacle constraint;

the track optimization module adopts a graph optimization method to generate smoother and safer tracks based on the initial tracks.

9. A computer device comprising a processor and a memory, the memory storing a computer executable program, the processor reading the computer executable program from the memory and executing the program, the processor executing the program to implement the lightweight predictive navigation method of any one of claims 1-7 for use in an indoor environment.

10. A computer readable storage medium, wherein a computer program is stored in the computer readable storage medium, the computer program, when executed by a processor, is capable of implementing the lightweight predictive navigation method of any one of claims 1-7, which is applicable to indoor environments.