CN112967516B

CN112967516B - Global dynamic path planning method for matching of key parameters of quick parking lot end with whole vehicle

Info

Publication number: CN112967516B
Application number: CN202110151518.5A
Authority: CN
Inventors: 张永华; 朱金波
Original assignee: Wuhu Bolatu Information Technology Co ltd
Current assignee: Wuhu Ruite Microelectronics Co ltd
Priority date: 2021-02-03
Filing date: 2021-02-03
Publication date: 2022-07-26
Anticipated expiration: 2041-02-03
Also published as: CN112967516A

Abstract

The invention belongs to the field of automatic driving of vehicles, and discloses a global dynamic path planning method for matching key parameters of a fast parking lot end with a whole vehicle, comprising five steps: entering parameters and parking space status into a cloud platform of the parking lot; The vehicle matches the target parking space and the corresponding global path; the latest positioning information of the vehicle is sensed through the radar device; according to the latest monitoring parking space status and the position of the latest vehicle positioning point, the field cloud platform is used to plan the optimal global path corresponding to the latest positioning point and the target parking space; the optimal parking route can be selected according to the planned path, and the steps are repeated until the parking task is completed. The present invention obtains the vehicle parameters and motion information in the parking lot in real time through the information interaction between the yard cloud platform and the vehicle-mounted communication device, and plans a globally optimal parking for each vehicle with a parking task in real time through the yard cloud platform. trajectories, so that different types of vehicles can always complete the parking task as soon as possible.

Description

Global dynamic path planning method for matching key parameters of fast parking lot with vehicle

技术领域technical field

本发明涉及快速停车场端关键参数与整车匹配全局动态路径规划方法，属于车辆自动驾驶技术领域。The invention relates to a global dynamic path planning method for matching key parameters of a fast parking lot terminal with a whole vehicle, and belongs to the technical field of vehicle automatic driving.

背景技术Background technique

近些年，随着车辆智能化水平的逐步提高，自动驾驶技术在很多场景下得以初步实现，如自适应巡航ACC，自动紧急刹车AEB，自动泊车等。其中ACC和AEB由于在复杂的城市交通道路下，依然有很多工况难以应付，给驾驶员带来的实际体验往往不够理想,甚至在极限工况下会带来一些碰撞危险。而停车场内的自动泊车技术由于是在封闭的场景下，很少有行人及其他交通参与者。此外，场内车辆速度较低,基本保持在10km/h以下，并且有场内云平台给每辆车发布指令，进行协同控制，为停车场内的自动泊车系统带来了相当多的便利，使得停车场内自动泊车技术变得更有前景，并且更加迫切。In recent years, with the gradual improvement of the level of vehicle intelligence, autonomous driving technology has been initially realized in many scenarios, such as adaptive cruise ACC, automatic emergency braking AEB, and automatic parking. Among them, ACC and AEB still have many working conditions that are difficult to cope with under complex urban traffic roads, and the actual experience brought to the driver is often not ideal, and even brings some collision dangers under extreme working conditions. As the automatic parking technology in the parking lot is in a closed scene, there are few pedestrians and other traffic participants. In addition, the speed of vehicles in the field is relatively low, which is basically kept below 10km/h, and there is a cloud platform in the field to issue instructions to each vehicle for coordinated control, which brings considerable convenience to the automatic parking system in the parking lot. , making the automatic parking technology in the parking lot more promising and more urgent.

发明内容SUMMARY OF THE INVENTION

针对现有技术的不足，本发明提供了快速停车场端关键参数与整车匹配全局动态路径规划方法，解决了不同型号车辆无法尽快完成停车任务的问题。In view of the deficiencies of the prior art, the present invention provides a global dynamic path planning method for matching the key parameters of the fast parking lot end with the whole vehicle, and solves the problem that vehicles of different types cannot complete the parking task as soon as possible.

本发明为解决其技术问题采用如下技术方案：The present invention adopts following technical scheme for solving its technical problem:

本发明所述的快速停车场端关键参数与整车匹配全局动态路径规划方法，包括以下步骤：The global dynamic path planning method for matching the key parameters of the fast parking lot terminal with the whole vehicle according to the present invention includes the following steps:

步骤一：将停车场端关键参数及当前车位状态实时录入场端云平台；Step 1: Enter the key parameters of the parking lot and the current parking space status into the cloud platform of the parking lot in real time;

步骤二：当每辆车通过停车场入口时，场端云平台能通过车牌识别出车辆型号，并结合停车场当前车位状态利用强化学习的方法给每辆车匹配出目标车位以及相应的全局路径；Step 2: When each vehicle passes through the entrance of the parking lot, the on-site cloud platform can identify the vehicle model through the license plate, and use the reinforcement learning method to match each vehicle with the target parking space and the corresponding global path in combination with the current parking space status of the parking lot. ;

步骤三：当车辆具体进入停车场内时，通过车载毫米波雷达，摄像头，并结合停车场内地图，感知到车辆的位置及所处的最新定位点信息；Step 3: When the vehicle enters the parking lot, the vehicle's location and the latest positioning point information are sensed through the on-board millimeter-wave radar and camera combined with the map in the parking lot;

步骤四：场端云平台根据停车场内监测的最新车位状态，以及车辆所处的最新定位点位置，重新利用强化学习的方法实时规划出最新定位点对应的最优全局路径以及目标车位；Step 4: According to the latest parking space status monitored in the parking lot and the latest positioning point position of the vehicle, the field cloud platform reuses the reinforcement learning method to plan the optimal global path corresponding to the latest positioning point and the target parking space in real time;

步骤五：根据规划出的路径，将跟踪控制指令发送到底层控制器和停车场云台，从而进入下一时刻，并重复步骤三到步骤五，直至完成泊车任务。Step 5: According to the planned path, send the tracking control command to the bottom controller and the parking lot PTZ, so as to enter the next moment, and repeat steps 3 to 5 until the parking task is completed.

进一步，在所述步骤一中，将停车场端关键参数及当前车位状态实时录入场端云平台时，具体包括以下内容：Further, in the first step, when the key parameters of the parking lot and the current parking space status are entered into the cloud platform of the parking lot in real time, the specific content includes the following:

1)停车场端的关键参数包括车位分区A,B,C,D,E等，其中不同分区可以停靠不同类型的车辆，如A区可以停靠轴距超过2m的大车，B区则停靠略小一些的车辆，具体如下，1) The key parameters of the parking lot include parking space zones A, B, C, D, E, etc., in which different zones can park different types of vehicles. For example, zone A can park large vehicles with a wheelbase of more than 2m, and zone B can park slightly smaller Some of the vehicles are as follows,

A＝{a₁,a₂,…,a_p}B＝{b₁,b₂,…,b_q}A={a ₁ ,a ₂ ,...,a _p }B={b ₁ ,b ₂ ,...,b _q }

其中a_i，b_i为具体车位情况，如果当前车位为空则为1，已经停车则为0；Among them, a _i , b _i are the specific parking space conditions, if the current parking space is empty, it is 1, and it is 0 if it has been parked;

2)停车场端的关键参数包括场内预先标记的定位点，代表场内道路特征信息，每个定位点作为一个状态s,将所有定位点标记后可得到n个状态:S＝{s₁,s₂,…,s_n}；2) The key parameters of the parking lot include the pre-marked positioning points in the field, which represent the road feature information in the field. Each positioning point is used as a state s. After marking all the positioning points, n states can be obtained: S={s ₁ , s ₂ ,…,s _n };

3)关键定位点下车位信息，要使得车辆停入某个车位，需要经过某个定位点才能停过去，具体表示如下，3) The parking space information of the key positioning point, to make the vehicle park in a certain parking space, it needs to pass a certain positioning point to stop, the specific expression is as follows,

其中s_m为关键定位点，a_i,a_i+1,b_j,b_j+1,c_j,c_j+1,…为经过该定位点可直接到达的车位，如想进入a₁-a₅这五个车位，则必须经过s₃或者s₆才能到达，从而表示如Among them, s _m is the key positioning point, a _i , a _i+1 , b _j , b _j+1 , c _j , c _j+1 , ... are the parking spaces that can be directly reached through the positioning point, if you want to enter a ₁ - The five parking spaces of a ₅ must be reached through s ₃ or s ₆ , which means that if

4)定义定位点与定位点之间的动作及转移关系，具体车辆到达每个定位点可以进行4个基本动作A1～A4,其中A1代表向前行驶，A2代表向左转弯，A3代表向右转弯，A4代表静止，对应的状态转移关系可由停车场内结构参数确定，如在S6处采取动作A1则到达S5；采取动作A3则到达S4；采取其它动作还到达本身状态S6，将所有状态对应的转移关系都表征出来可以得到一个状态转移矩阵T。4) Define the action and transfer relationship between the positioning point and the positioning point. The specific vehicle can perform 4 basic actions A1 to A4 when it reaches each positioning point, where A1 means driving forward, A2 means turning left, and A3 means right Turning, A4 stands for static, and the corresponding state transition relationship can be determined by the structural parameters in the parking lot. For example, if action A1 is taken at S6, it will reach S5; if action A3 is taken, it will reach S4; if other actions are taken, it will also reach its own state S6, and all states correspond to The transition relationships of , are characterized, and a state transition matrix T can be obtained.

进一步，在所述步骤二中，利用强化学习的方法给每辆车匹配出目标车位以及相应的全局路径，具体步骤如下：Further, in the second step, the method of reinforcement learning is used to match the target parking space and the corresponding global path for each vehicle, and the specific steps are as follows:

1)当识别出车型后，即可根据车型安排出对应的分区，此时该分区内所有的空位都可能成为最终的目标车位，将这些空车位所属的定位点提取出来，这些定位点也就成为全局路径的最终定位点，可表示如下：1) After the vehicle type is identified, the corresponding partition can be arranged according to the vehicle type. At this time, all the vacant spaces in the partition may become the final target parking spaces, and the positioning points to which these empty parking spaces belong are extracted, and these positioning points are also becomes the final anchor point of the global path, which can be expressed as follows:

其中S_f为最终定位点集合，

为全局路径可能经过的最终定位点；where S _f is the final set of positioning points,

is the final positioning point that the global path may pass through;

2)建立全局路径的强化学习过程，其中状态集：S＝{s₁,s₂,…,s_n}，动作集Action＝{A₁,A₂,A₃,A₄}，及状态转移矩阵T都已经在2.2，2.4中建立，对应的回报函数可以表示如下：2) The reinforcement learning process of establishing a global path, in which the state set: S={s ₁ , s ₂ ,...,s _n }, the action set Action={A ₁ , A ₂ , A ₃ , A ₄ }, and the state transition The matrix T has been established in 2.2 and 2.4, and the corresponding reward function can be expressed as follows:

其中s_t+1为在状态S_t处采取动作A_t达到的下一状态，由状态转移矩阵得到：s_t+1＝T(s_t,A_t)，该回报代表当下一状态到达最终定位点时，回报会特别大；如果没有到达，则付出一定的距离成本；Among them, s _t+1 is the next state reached by taking action A _t at state S _t , which is obtained from the state transition matrix: s _t+1 =T(s _t , A _t ), the return represents when the next state reaches the final positioning When the point is reached, the reward will be particularly large; if it is not reached, a certain distance cost will be paid;

3)建立Q学习迭代方程：3) Establish the Q-learning iterative equation:

其中α为学习率，γ为遗忘因子；where α is the learning rate and γ is the forgetting factor;

4)随机初始化状态S_t，通过迭代方程不断学习，当状态s_t+1∈S_f时即进入下一循环，循环多次收敛以后，即可得到从当前定位点s₀到目标车位最终定位点S_f对应的全局路径：4) Randomly initialize the state S _t , and continuously learn through the iterative equation. When the state s _t+1 ∈ S _f , the next cycle is entered. After the cycle converges for many times, the final positioning from the current positioning point s ₀ to the target parking space can be obtained. The global path corresponding to point S _f :

s₀ a₀ s₁ a₁ … s_f s ₀ a ₀ s ₁ a ₁ … s _f

5)到达最终定位点S_f后，选择距离该定位点最近的空车位作为目标车位即可。5) After reaching the final positioning point S _f , select the empty parking space closest to the positioning point as the target parking space.

进一步，所述步骤四中重新利用强化学习的方法实时规划出最优全局路径以及目标车位时，具体步骤如下：Further, when reusing the reinforcement learning method in the step 4 to plan the optimal global path and the target parking space in real time, the specific steps are as follows:

1)根据停车场内监测的最新车位状态，需要对3.1)中的最终定位点集合进行更新，得到当前t时刻对应的新的

1) According to the latest parking space status monitored in the parking lot, it is necessary to update the final set of positioning points in 3.1) to obtain a new corresponding to the current time t.

2)重复3.2)-3.3)的强化学习过程，重新进行迭代训练，当状态

时进入下一循环，循环多次收敛后，即可得到车辆所在的最新定位点到更新后的最终定位点的全局路径：

2) Repeat the reinforcement learning process of 3.2)-3.3), and re-train iteratively, when the state

When entering the next loop, after the loop converges for many times, the global path from the latest positioning point where the vehicle is located to the updated final positioning point can be obtained:

与现有技术相比，本发明具有的有益效果是：Compared with the prior art, the present invention has the following beneficial effects:

通过场端云平台与车载通讯设备的信息交互，实时获取停车场内的车型参数及运动信息，并通过场端云平台给每辆有停车任务的车辆实时规划出一条全局最优的停车轨迹，使得不同型号车辆总能高效快速尽快完成停车任务。Through the information interaction between the field cloud platform and the vehicle communication equipment, the vehicle parameters and motion information in the parking lot are obtained in real time, and a globally optimal parking trajectory is planned in real time for each vehicle with a parking task through the field cloud platform. So that different types of vehicles can always complete the parking task efficiently and quickly as soon as possible.

附图说明Description of drawings

图1是本发明停车场关键参数。Fig. 1 is the key parameter of the parking lot of the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白，以下结合附图及实施案例对本发明进行深入地详细说明。应当理解，此处所描述的具体实施案例仅仅用以解释本发明，并不用于限定发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and implementation cases. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the invention.

如图1所示，本发明所述的快速停车场端关键参数与整车匹配全局动态路径规划方法，包括以下步骤：As shown in Figure 1, the global dynamic path planning method for matching the key parameters of the fast parking lot terminal with the whole vehicle according to the present invention includes the following steps:

其中s_m为关键定位点，a_i,a_i+1,b_j,b_j+1,c_j,c_j+1,…为经过该定位点可直接到达的车位，如要想进入a₁-a₅这五个车位，则必须经过s₃或者s₆才能到达，从而表示如下，Among them, s _m is the key positioning point, a _i , a _i+1 , b _j , b _j+1 , c _j , c _j+1 , ... are the parking spaces that can be directly reached through the positioning point, if you want to enter a ₁ -a ₅ these five parking spaces, you must go through s ₃ or s ₆ to arrive, which is expressed as follows,

其中S_f为最终定位点集合，

is the final positioning point that the global path may pass through;

3)建立Q学习迭代方程：3) Establish the Q-learning iterative equation:

s₀ a₀ s₁ a₁ … s_f s ₀ a ₀ s ₁ a ₁ … s _f

2)重复3.2)-3.3)的强化学习过程，重新进行迭代训练，当状态

本专利主要研究自动泊车中的关键技术，主要是考虑停车场端关键参数时各种整车进行动态路径规划的技术，从而实现每辆有停车任务的车辆实时规划出一条全局最优的停车轨迹，使得不同型号车辆总能高效快速尽快完成停车任务。This patent mainly studies the key technologies in automatic parking, mainly the technology of dynamic path planning for various vehicles when the key parameters of the parking lot are considered, so that each vehicle with a parking task can plan a globally optimal parking in real time. trajectories, so that different types of vehicles can always complete the parking task efficiently and quickly as soon as possible.

Claims

1. The global dynamic path planning method for matching the key parameters of the fast parking lot end with the whole vehicle is characterized by comprising the following steps of:

the method comprises the following steps: recording parking lot end key parameters and the current parking stall state into a lot end cloud platform in real time;

step two: when each vehicle passes through the entrance of the parking lot, the field end cloud platform can identify the vehicle model through the license plate, and matches each vehicle with a target parking space and a corresponding global path by using a reinforcement learning method in combination with the current parking space state of the parking lot;

step three: when a vehicle enters a parking lot, the position of the vehicle and the latest positioning point information of the vehicle are sensed through a vehicle-mounted millimeter wave radar and a camera in combination with a map in the parking lot;

step four: the field end cloud platform re-utilizes a reinforcement learning method to plan an optimal global path and a target parking space corresponding to a latest positioning point in real time according to the latest parking space state monitored in the parking field and the latest positioning point position of the vehicle;

step five: according to the planned path, sending a tracking control instruction to the bottom controller and the parking lot holder so as to enter the next moment, and repeating the third step to the fifth step until the parking task is completed;

when the parking lot end key parameters and the current parking space state are recorded into the lot end cloud platform in real time, the method specifically comprises the following contents:

2.1) the key parameters of the parking lot end comprise parking space subareas A, B, C, D and E, wherein different parking spaces stop different types of vehicles, for example, a large vehicle with the wheelbase exceeding 2m stops in the area A, a slightly smaller vehicle stops in the area B, and concretely,

A＝{a ₁ ,a ₂ ,…,a _p }B＝{b ₁ ,b ₂ ,…,b _q }

wherein a is _i ，b _i For the specific parking space condition, if the current parking space is empty, the current parking space is 1, and if the current parking space is already parked, the current parking space is 0;

2.2) key parameters of the parking lot end comprise anchor points marked in advance in the parking lot, representing road characteristic information in the parking lot, taking each anchor point as a state s, and marking all the anchor points to obtain n states:

S＝{s ₁ ,s ₂ ,…,s _n }；

2.3) the information of the vehicle position under the key positioning point, the vehicle can stop after passing through a certain positioning point when the vehicle is stopped in a certain position, which is specifically expressed as follows,

wherein s is _m Is a key anchor point, a _i ,a _i+1 ,b _j ,b _j+1 ,c _j ,c _j+1 … indicates a parking space directly accessible via the location point, for example, to enter a ₁ -a ₅ The five parking spaces must pass through s ₃ Or s ₆ Can be reached, and thus is represented as follows,

2.4) defining the action and transfer relation between the positioning points, specifically, the vehicle arrives at each positioning point to perform 4 basic actions A1-A4, wherein A1 represents forward driving, A2 represents leftward turning, A3 represents rightward turning, A4 represents static, the corresponding state transfer relation can be determined by the structural parameters in the parking lot, and if the action A1 is taken at S6, the vehicle arrives at S5; take action A3 to S4; taking other actions and reaching the self state S6, and representing the transfer relations corresponding to all the states to obtain a state transfer matrix T;

a target parking space and a corresponding global path are matched for each vehicle by using a reinforcement learning method, and the method comprises the following specific steps:

3.1) after the vehicle type is identified, arranging a corresponding partition according to the vehicle type, wherein all vacant positions in the partition can become final target parking spaces, extracting positioning points belonging to the vacant parking spaces, and the positioning points become final positioning points of the global path, and can be represented as follows:

wherein S _f To be the finalA set of anchor points is set up,

final anchor points which can be passed by the global path;

3.2) a reinforcement learning process to establish a global path, wherein the state set: s ═ S ₁ ,s ₂ ,…,s _n }, Action set Action ═ a ₁ ,A ₂ ,A ₃ ,A ₄ The, and state transition matrix T are already established in 2.2), 2.4), the corresponding return function is expressed as follows:

wherein s is _t+1 Is in a state S _t Take action A _t The next state reached, derived from the state transition matrix:

s _t+1 ＝T(s _t ,A _t ) The return represents that the return is particularly large when the next state reaches the final anchor point; if not, paying a certain distance cost;

3.3) establishing a Q learning iterative equation:

Q(s _t ,a _t )＝Q(s _t ,a _t )+α[r _t +γmax _ai Q(s _t+1 ,a _i )-Q(s _t ,a _t )]

wherein alpha is the learning rate and gamma is the forgetting factor;

3.4) random initialization State S _t Continuously learning through an iterative equation, when the state s _t+1 ∈S _f Entering the next cycle, and obtaining the current positioning point s after the cycle is converged for multiple times ₀ Final positioning point S to target parking space _f The corresponding global path:

s ₀ →a ₀ →s ₁ →a ₁ →…→s _f

3.5) reach the final location S _f And then, selecting the empty parking space closest to the positioning point as the target parking space.

2. The method for planning the global dynamic path for matching the key parameters of the fast parking lot end with the whole vehicle according to claim 1, wherein when the optimal global path and the target parking space are planned in real time by the reinforcement learning method in the fourth step, the specific steps are as follows:

4.1) according to the latest parking space state monitored in the parking lot, updating the final positioning point set in 3.1) to obtain a new positioning point corresponding to the current time t

4.2) repeating the reinforcement learning process of 3.2) -3.3), and carrying out iterative training again, when the state is the same

And then entering the next cycle, and after the cycle is converged for multiple times, obtaining a global path from the latest positioning point where the vehicle is located to the updated final positioning point: