WO2022062349A1 - 车辆控制方法、设备、存储介质和电子装置 - Google Patents

车辆控制方法、设备、存储介质和电子装置 Download PDF

Info

Publication number
WO2022062349A1
WO2022062349A1 PCT/CN2021/083039 CN2021083039W WO2022062349A1 WO 2022062349 A1 WO2022062349 A1 WO 2022062349A1 CN 2021083039 W CN2021083039 W CN 2021083039W WO 2022062349 A1 WO2022062349 A1 WO 2022062349A1
Authority
WO
WIPO (PCT)
Prior art keywords
vehicle
target
node
state information
current
Prior art date
Application number
PCT/CN2021/083039
Other languages
English (en)
French (fr)
Inventor
林霄
张九才
尚进
王薏
Original Assignee
广州汽车集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州汽车集团股份有限公司 filed Critical 广州汽车集团股份有限公司
Priority to CN202180005858.5A priority Critical patent/CN114829226A/zh
Publication of WO2022062349A1 publication Critical patent/WO2022062349A1/zh

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
    • B60W30/18Propelling the vehicle
    • B60W30/18009Propelling the vehicle related to particular drive situations
    • B60W30/18159Traversing an intersection
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
    • B60W30/08Active safety systems predicting or avoiding probable or impending collision or attempting to minimise its consequences
    • B60W30/095Predicting travel path or likelihood of collision
    • B60W30/0953Predicting travel path or likelihood of collision the prediction being responsive to vehicle dynamic parameters
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
    • B60W30/08Active safety systems predicting or avoiding probable or impending collision or attempting to minimise its consequences
    • B60W30/095Predicting travel path or likelihood of collision
    • B60W30/0956Predicting travel path or likelihood of collision the prediction being responsive to traffic or environmental parameters
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
    • B60W30/18Propelling the vehicle
    • B60W30/18009Propelling the vehicle related to particular drive situations
    • B60W30/18145Cornering
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/04Traffic conditions
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles
    • B60W60/001Planning or execution of driving tasks
    • B60W60/0011Planning or execution of driving tasks involving control alternatives for a single driving scenario, e.g. planning several paths to avoid obstacles
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles
    • B60W60/001Planning or execution of driving tasks
    • B60W60/0015Planning or execution of driving tasks specially adapted for safety
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles
    • B60W60/001Planning or execution of driving tasks
    • B60W60/0027Planning or execution of driving tasks using trajectory prediction for other traffic participants
    • B60W60/00274Planning or execution of driving tasks using trajectory prediction for other traffic participants considering possible movement changes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2554/00Input parameters relating to objects
    • B60W2554/40Dynamic objects, e.g. animals, windblown objects
    • B60W2554/404Characteristics
    • B60W2554/4048Field of view, e.g. obstructed view or direction of gaze
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2556/00Input parameters relating to data
    • B60W2556/45External transmission of data to or from the vehicle
    • B60W2556/50External transmission of data to or from the vehicle for navigation systems

Definitions

  • the present invention relates to the field of vehicle control, and in particular, to a vehicle control method, device, storage medium and electronic device.
  • Embodiments of the present invention provide a vehicle control method and device, a storage medium and an electronic device to solve the problem of relatively low control security of the vehicle control method in the related art.
  • a vehicle control method comprising: acquiring real-time road condition information of a target vehicle; when the real-time road condition information indicates that the target vehicle has reached a target intersection, acquiring a first vehicle of the target vehicle in a current driving state state information, and obtain the second vehicle state information of the target vehicle in the area where the target vehicle is located under the current driving state, wherein the target vehicle includes surrounding vehicles passing through the target intersection and a virtual vehicle configured for the current observation field of view of the target vehicle;
  • the vehicle state information and the second vehicle state information are input into the vehicle motion prediction model to obtain the prediction result, wherein the vehicle motion prediction model is used to predict the driving action of the vehicle in the next control cycle based on the vehicle state information of the current control cycle; according to the prediction As a result, the target vehicle is controlled to perform the target travel action.
  • a vehicle control device comprising: a first acquisition unit for acquiring real-time road condition information of a target vehicle; a second acquisition unit, when the real-time road condition information indicates that the target vehicle has reached the target At the intersection, obtain the first vehicle state information of the target vehicle in the current driving state, and obtain the second vehicle state information of the target vehicle in the area where the target intersection is located under the current driving state, wherein the target vehicle includes surrounding vehicles passing through the target intersection and A virtual vehicle configured with the current observation field of view of the target vehicle; the input prediction unit is used to input the first vehicle state information and the second vehicle state information into the vehicle motion prediction model to obtain the prediction result, wherein the vehicle motion prediction model is used for Based on the vehicle state information of the current control cycle, the vehicle's running action in the next control cycle is predicted; and a control unit is configured to control the target vehicle to execute the target running action according to the prediction result.
  • a computer-readable storage medium stores a computer program, wherein the computer program executes the vehicle control method.
  • the corresponding virtual vehicle when the target vehicle is about to pass through the target intersection, the corresponding virtual vehicle is configured in the obstructed observation field of the target intersection to simulate potential moving obstacles in the occluded area, combined with surrounding vehicles and the actual location of the target intersection.
  • the vehicle state information of the target vehicle in the area is used to predict the target driving action of the target vehicle when it passes through the target intersection through the vehicle motion prediction model.
  • the invention enables the target vehicle to fully understand the road condition information of the target intersection when it automatically drives through the target intersection, thereby avoiding the collision risk during driving, improving the safety of vehicle control, and thus overcoming the relative relative safety of vehicle control in the prior art. lower problem.
  • FIG. 1 is a flowchart of an alternative vehicle control method according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of an application environment of an alternative vehicle control method according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of an alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 11 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 12 is a schematic diagram of another alternative vehicle control method according to an embodiment of the present invention.
  • FIG. 13 is a block diagram of an alternative vehicle control device according to an embodiment of the present invention.
  • a vehicle control method is provided.
  • the vehicle control method includes:
  • the target vehicle when the real-time road condition information indicates that the target vehicle has reached the target intersection, obtain the first vehicle state information of the target vehicle in the current driving state, and obtain the second vehicle state information of the target vehicle in the area where the target vehicle is located under the current driving state, wherein,
  • the target vehicle includes surrounding vehicles passing through the target intersection and virtual vehicles configured for the current observation field of view of the target vehicle;
  • S106 Input the first vehicle state information and the second vehicle state information into a vehicle motion prediction model to obtain a prediction result, wherein the vehicle motion prediction model is used to predict the vehicle's state in the next control cycle based on the vehicle state information in the current control cycle driving action;
  • the above-mentioned vehicle control method may be applied to, but not limited to, a vehicle configured with an advanced driver assistance system (ADAS) to help the vehicle perform efficient and safe driving control operations during automatic driving.
  • ADAS advanced driver assistance system
  • the ADAS system uses various sensors installed on the vehicle to perceive the surrounding environment at any time during the driving process of the vehicle, collect data, identify, detect and track static and dynamic objects, collect the map data of the navigator, and calculate and analyze the system. Analysis makes the driver aware of possible dangers in advance, thereby effectively improving the driving safety of the vehicle.
  • the real-time road condition information of the target vehicle after obtaining the real-time road condition information of the target vehicle, if the real-time road condition information indicates that the target vehicle has reached the target intersection, then obtain the first vehicle state information of the target vehicle in the current driving state and the current driving state of the target vehicle.
  • the second vehicle state information in the area where the target intersection is located.
  • the vehicles here include surrounding vehicles passing through the target intersection and virtual vehicles configured for the current viewing field of view of the target vehicle.
  • the first vehicle state information and the second vehicle state information are input into the vehicle motion prediction model, which is used to predict the driving action of the vehicle based on the vehicle state information to obtain the prediction result, and control the target vehicle according to the prediction result to execute the target driving action.
  • the corresponding virtual vehicle is configured in the obstructed observation field of the target intersection to simulate potential moving obstacles in the occluded area, combined with surrounding vehicles and vehicles that actually pass the target vehicle in the area where the target intersection is located.
  • the state information is used to predict the target driving action of the target vehicle when it passes through the target intersection through the vehicle motion prediction model.
  • the invention enables the target vehicle to fully understand the road condition information of the target intersection when automatically driving through the target intersection, thereby avoiding the collision risk during driving, improving the safety of vehicle control, and thus overcoming the relative safety of vehicle control in the prior art. lower problem.
  • Fig. 2 Assume that what is shown in Fig. 2 is the real-time road condition information about the target intersection 20 currently reached by the target vehicle 202; In the direction indicated by the dashed arrow in 2, the target vehicle 202 will turn left into the lateral lane.
  • the lateral lane includes two lanes, lane_1 and lane_2, wherein the passing direction of lane_1 is opposite to the driving direction of the target vehicle 202 , and the passing direction of lane_2 is the same as the driving direction of the target vehicle 202 .
  • the diagonally filled rectangle 202 represents the target vehicle 202
  • the grid line filled rectangle 204 - 1 and the grid line filled rectangle 204 - 2 represent the target vehicle 202 disposed at the borders of both sides of the current viewing field of view.
  • the rectangles filled with horizontal lines represent surrounding vehicles 206
  • the blank rectangles 200 represent static obstacles 200 at the target intersection 20 .
  • the current viewing field of view of the target vehicle 202 is the triangle area filled with dots in FIG. 2 .
  • a virtual vehicle 204-1 and a virtual vehicle 204-2 are set on the boundary of the current observation field, wherein the virtual vehicle 204-1 is set on the left side of the lane_1, and the virtual vehicle 204-1 drives to the right to enter the target intersection 20 , the virtual vehicle 204-2 is set on the right side of the lane_2, and the virtual vehicle 204-2 drives to the left to enter the target intersection 20.
  • the virtual vehicle 204-1 and the virtual vehicle 204-2 are each configured with a maximum travel speed allowed in the corresponding lane.
  • a potential moving obstacle in the blind spot of the target vehicle 202 is simulated.
  • the vehicle motion prediction model inputs the vehicle state information of the virtual vehicle, the vehicle state information of the target vehicle itself, and the vehicle state information of the surrounding vehicles currently passing through the target intersection, and predicts the target vehicle to be executed in the next control cycle for the target vehicle. actions to ensure the safety of the target vehicle when passing through the target intersection.
  • the corresponding virtual vehicle when the target vehicle is about to pass through the target intersection, the corresponding virtual vehicle is configured in the obstructed observation field of the target intersection to simulate potential moving obstacles in the occluded area, combined with surrounding vehicles and the actual location of the target intersection.
  • the vehicle state information of the target vehicle in the area is used to predict the target driving action of the target vehicle when it passes through the target intersection through the vehicle motion prediction model.
  • the invention enables the target vehicle to fully understand the road condition information of the target intersection when it automatically drives through the target intersection, thereby avoiding the collision risk during driving, improving the safety of vehicle control, and thus overcoming the relative relative safety of vehicle control in the prior art. lower problem.
  • obtaining the second vehicle state information of the target vehicle in the area where the target vehicle is located under the current driving state includes:
  • S1 Determine the arrival time of the target vehicle to the target intersection
  • S1 Acquire surrounding vehicle status information of surrounding vehicles entering the area where the target intersection is located at the arrival time, wherein the second vehicle status information includes surrounding vehicle status information;
  • S3 Determine the field of view boundary of the current observation field of view of the target vehicle that matches the arrival time
  • S4 Acquire virtual vehicle state information of the virtual vehicle configured at the boundary of the field of view, where the second vehicle state information includes virtual vehicle state information.
  • the method before acquiring the virtual vehicle state information of the virtual vehicle configured in the visual field boundary, the method further includes:
  • the driver when facing the intersection shown in Fig. 3(a), in general, the driver will adopt the strategy of moving slowly forward.
  • the driver moves forward slowly, watching for oncoming traffic in the lane ahead if planning a right turn, and watching for oncoming traffic in both lanes if planning a left turn. In this way, the driver carefully expands the field of view until there is enough safe distance to turn.
  • a virtual vehicle in order to simulate potential obstacles, can be configured at the boundary of the observation field, and the driving direction and driving speed can be assigned to the vehicle to simulate the real vehicle, so as to predict whether the target vehicle will collide with the virtual vehicle when passing through the target intersection, and then Prevents the target vehicle from colliding with potentially moving obstacles in the occluded area.
  • Collision scenario of target vehicle 202 ie, vehicle ego
  • vehicle 202 collides with another vehicle 402 .
  • Their positions, velocities and angles are known or observable.
  • the first is the movement of other vehicles (ie surrounding vehicles).
  • the route of a potential vehicle ie a virtual vehicle
  • the virtual vehicle needs to be processed.
  • the vehicle status information of the vehicle ego includes the position, speed, angle of the vehicle, and the length and width of the vehicle.
  • Dynamic obstacles can cause similar visual disturbances.
  • the virtual vehicle 602-3 is placed at the boundary in the same manner as static obstacles.
  • Figure 6(b) For the case where both static occlusion and dynamic occlusion exist, the field of view boundary can be calculated in the same way, so the virtual vehicles (ie, virtual vehicle 602-5 to virtual vehicle 602-7) can be as shown in Fig. 6(d)(e) placed in a similar manner as shown.
  • the virtual vehicles ie, virtual vehicle 602-5 to virtual vehicle 602--7
  • a large truck 604 is just in front of the vehicle ego 202 . In this case, the area behind the truck 604 and within the boundaries will be occluded. To handle this situation, place a virtual vehicle in the lane behind the truck.
  • Figure 6(c)(f) the positions of the virtual vehicles are symmetrical with the vehicle ego 202 relative to the truck 604, ie, virtual vehicle 602-4
  • the virtual vehicle For the lane opposite the vehicle ego direction, the virtual vehicle is always placed on the left view boundary. For the same lane as the vehicle ego direction, the virtual vehicle is placed on the right view boundary. Also, if the area the vehicle ego is facing is occluded, the virtual vehicle is also placed there. Channels 7(a)-7(d) as shown.
  • Each channel has occluded parts (empty areas) and observable parts (diagonally filled areas). Occlusions are formed by static and/or dynamic obstacles. Based on the orientation of the lane and the orientation of the vehicle ego, the virtual vehicle is placed on the field of view boundary and on the occluded area that the vehicle ego is facing.
  • a virtual vehicle is configured at the boundary of the observation field to simulate potential moving obstacles outside the observation field of view, so that the target vehicle can avoid potential risks during automatic driving, thereby ensuring that the target vehicle passes through the target intersection driving safety.
  • the first vehicle state information and the second vehicle state information are input into the vehicle motion prediction model, to obtain the prediction result including:
  • the target vehicle independently determines the driving action in the next control cycle with respect to the current real-time road condition information and the vehicle state information of the relevant vehicle in each control cycle.
  • the target vehicle is controlled to move gradually, so as to smoothly pass the target intersection and avoid the risk of collision.
  • controlling the target vehicle to perform the target driving action according to the prediction result includes: determining the target driving speed of the target vehicle according to the target acceleration; and controlling the target vehicle to travel according to the target driving speed to pass the target intersection.
  • the model established by the POMDP framework can integrate different vehicle state information, and with the help of the solution of the POMDP solver, accurately predict the driving action of the target vehicle in the next control cycle, thereby ensuring the automatic driving of the target vehicle. Safety through the intersection.
  • predicting and obtaining the target acceleration of the driving action to be performed by the target vehicle in the next control cycle after the current control cycle includes:
  • S1 in the partially observable Markov decision process model, based on the second vehicle state information, calculate the driving probability that each target vehicle chooses to drive on any lane when passing through the target intersection, and obtain a driving probability set, where the second vehicle
  • the status information includes: the position, driving speed, driving direction and driving route of the target vehicle;
  • the target acceleration based on the travel probability set and the first vehicle state information, wherein the first vehicle state information includes: the position, the travel speed, the travel direction and the predicted travel route of the target vehicle.
  • the time evolution of the traffic situation is required.
  • a reward is assigned to the environmental state.
  • a strategy is obtained by solving an optimization problem that maximizes the expected reward over a period of time. Since other road vehicle movements are unknown, the probability distribution of the movements needs to be maintained and updated every time a new observation is obtained. All of this can be done within the POMDP framework.
  • POMDP is a mathematical framework for continuous decision-making under uncertainty. It is characterized by the tuple (S,A,O,T,Z,R).
  • S, A, O represent the state space, action space and observation space, respectively. Since POMDP is conventionally applied to applications with discrete spaces (such as games), S, A, O are all finite sets of values.
  • the belief state b(s) represents the probability of being in state s,b:S ⁇ [0,1].
  • the belief b(s) can be updated by Bayes' rule.
  • POMDP aims to find an optimal policy ⁇ * that maps belief states to actions that maximize the expected value of cumulative reward. as shown in Equation 3.
  • the state of POMDP includes the vehicle ego and the information of all surrounding vehicles except the vehicles in the occluded area. To address the occlusion issue, virtual vehicles are configured as described above. The status also includes information about the virtual vehicle.
  • the state of the vehicle ego is defined as follows:
  • l ego (x ego ; y ego ) is the current position.
  • is the direction.
  • r ego represents a predefined route in that operation.
  • v ego means longitudinal velocity.
  • the state of the surrounding vehicle vehi is defined in the same manner as above. However, for vehicles with different routing options, is unknown. For example, in Figure 5, the vehicle on the left side of lane_1 may go straight or turn right, which cannot be directly determined. The state of the vehicle ego is completely known. In addition to vehicles with multiple routing options, the status of other vehicles can be observed. For simplicity, observation noise is not introduced, which can easily be added in practical settings depending on the sensor fusion accuracy.
  • Action space The actions of the vehicle ego are generated by the POMDP solver. In this application, this action is longitudinal acceleration.
  • Transition function is used to predict the future state of the surrounding environment, given the actions performed at the current time step. In this way, a temporal evolution of the traffic situation can be achieved during planning.
  • the transition model is shown in Equation 5. a is set to 0 for all surrounding vehicles.
  • y i represents the reference position y for a given x on the route ri.
  • Reward function In this operation, the reward function R(s, a)) includes three parts, as follows:
  • R collision is usually set to a higher value to ensure driving safety. Also, if the vehicle ego needs to travel at high speed and its speed is lower than the speed limit of the current lane, a negative reward is assigned.
  • the magnitude of R v should be much smaller than the other rewards and adjusted according to the current speed.
  • the driving probability of the target vehicle is obtained when each lane is selected, and the driving action of the target vehicle is determined in combination with the driving trend, so as to obtain the driving trend of the target vehicle, so that the target vehicle can drive when passing through the target intersection. Trajectories do not intersect, ie, collisions are avoided. Thereby, the safety of vehicle control is improved.
  • the target acceleration calculated from the travel probability set and the first vehicle state information includes:
  • each node on the search tree is configured with a corresponding The node reward value, which is associated with the predicted collision result based on the current node path, each node is connected to the following child nodes: an acceleration node for controlling vehicle acceleration, a deceleration node for controlling vehicle deceleration, and a vehicle maintaining For the current speed holding node, the selection priority of the acceleration node is higher than the selection priority of the hold node, and the selection priority of the hold node is higher than the selection priority of the deceleration node.
  • searching for the target path in the pre-built search tree includes:
  • the current node is used as the node of the target path, and the node is obtained from the child node connected with the current node as the current node;
  • the POMDP is solved in an online manner. This is necessary because the visible range changes as the vehicle ego progresses.
  • the online solver plans a series of actions up to a certain horizon. But the vehicle ego only performs the first action and will be recalculated at the next point in time.
  • POMDPs are usually solved by traversing a belief search tree, choosing the optimal action along the path with the largest expected reward.
  • the computational complexity of building a "complete tree" is too high to be practical. Therefore, various POMDP approximate solvers have been developed. Most are designed for discrete state spaces without taking into account the characteristics of certain applications. In this application, we develop a customized POMDP solver that can solve problems caused by virtual vehicles while reducing computational complexity. Solver details are as follows.
  • Belief update Usually, belief search trees take into account possible observations, so belief updates are performed during the construction of the tree. This is unnecessary for applications with continuous state spaces, since the observation space is not a finite set of values. These virtual observations are generated by probabilistic observation models and may differ significantly from real observations. In addition, planning is done online and can adapt to changing circumstances. Therefore, in this operation, the belief of each node remains unchanged. As shown in Figure 9. Each node represents a belief state. The three edges branching from the node represent acceleration ("+"), maintain current speed (“0"), and deceleration ("-"). The arrow dashed line indicates the successful path from root to leaf. All slash-filled nodes have been visited, blank nodes will not be visited.
  • Action priority The vehicle ego is expected to travel at high speeds with less deceleration. This is usually achieved by assigning a negative reward when the speed of the vehicle ego is below a certain threshold (such as the speed limit of the current lane). It has been successfully applied to the intersection scene.
  • a certain threshold such as the speed limit of the current lane. It has been successfully applied to the intersection scene.
  • different actions are assigned different priorities. “Acceleration” has the highest priority, followed by "Maintain current speed”, and "Deceleration” has the lowest priority. In this way, high speed driving is encouraged, which can push the vehicle ego out of a standstill.
  • a search strategy is developed.
  • a depth-first search is performed, as shown in FIG. 9 .
  • the solution of the present application is to try to find a path that starts from the root and ends at the leaves without any collisions. When such a path is found, tree traversal stops. In this way, acceleration is always accessed first, and deceleration is only accessed if acceleration and maintaining constant speed do not work. For example, if the dashed line in Figure 9 is the complete path without collisions, then the optimal action at the current point in time is to accelerate. Nodes filled with slashes have already been visited, while blank nodes will never be visited because a successful path has been found. This strategy also increases the average computational cost. Assuming that each leaf has the same probability p to be the end point of a successful path, the expectation of how many leaf nodes need to be visited forms a geometric progression, as shown in Equation 9.
  • searching for the target path in the pre-built search tree according to the travel probability set and the first vehicle state information includes:
  • S1 traverse the search tree, and sequentially use each node in the search tree as the current node to perform the following operations:
  • the reference cumulative value set is obtained by performing weighted summation on the node reward value of the starting node of the search tree and the node reward value of the current node, wherein the reference
  • the cumulative value set includes a plurality of second cumulative values, which are cumulative values obtained by weighted summing the node reward value of the starting node and the node reward value of any other node at the same level as the current node;
  • the target path when the target path is found in the pre-built search tree, it also includes: configuring a corresponding node reward value for each node in the search tree, wherein the node reward value is when passing through the target intersection. is the result of the weighted sum of the first reward value based on the current node's target vehicle colliding with the search path, the second reward value based on the current node's target vehicle not colliding with the search path when passing through the target intersection, and the reference reward value.
  • SUMO is an open source package for traffic simulation. It can simulate road networks as well as other traffic elements. API and TraCI are also provided for online interaction.
  • the behavior of each vehicle can be controlled at each simulation step.
  • the acceleration output by the POMDP cannot be directly controlled, and only the maximum acceleration can be set. Therefore, we adjust the speed at each simulation time step given the current action according to Equation 5, and the time interval between planning and simulation is 0.1 s.
  • SUMO provides an Intelligent Driver Model (IDM) to control the behavior of each vehicle, it is not used in this operation. There are two reasons. First, vehicles with IDMs slow down to avoid collisions, sometimes freezing traffic if a collision is predicted. Second, the intelligent behavior of surrounding vehicles makes the method presented in this application difficult to evaluate. Therefore, all other vehicles are driven in a pre-set mode without collision avoidance.
  • IDM Intelligent Driver Model
  • FIG. 10(a) shows the behavior of the vehicle ego when only static obstacles are present.
  • the vehicle ego plans to turn left, so it is assumed that there are two virtual vehicles (as shown in Fig. 6). In this case, there are no real surrounding vehicles.
  • Figure 10(b) shows the speed profile of the vehicle ego along with the position y.
  • the vehicle ego accelerates first because the hypothetical virtual vehicle is far away from it.
  • the vehicle ego stops accelerating because continued acceleration may produce a collision, but deceleration is not required at the same time.
  • the vehicle ego continues to approach the horizontal lane ahead, it begins to decelerate. After it has determined that it is safe to turn left, it accelerates until it reaches the maximum speed allowed.
  • the method proposed in this application is effective in avoiding collisions. Furthermore, the method can be easily extended to other intersections with different lane layouts.
  • a vehicle control device for implementing the above-mentioned vehicle control method is also provided. As shown in Figure 13, the device includes:
  • a first obtaining unit 1302 configured to obtain real-time road condition information of the target vehicle
  • the second obtaining unit 1304 when the real-time road condition information indicates that the target vehicle has reached the target intersection, obtains the first vehicle state information of the target vehicle in the current driving state, and obtains the second information of the target vehicle in the area where the target intersection is located under the current driving state.
  • vehicle status information wherein the target vehicle includes surrounding vehicles passing through the target intersection and virtual vehicles configured for the current viewing field of view of the target vehicle;
  • the input prediction unit 1306 is used to input the first vehicle state information and the second vehicle state information into the vehicle motion prediction model, and obtain the prediction result, wherein the vehicle motion prediction model is used to obtain the prediction result based on the vehicle state information of the current control cycle. Predict the driving action of the vehicle in the next control cycle;
  • the control unit 1308 is used to control the target vehicle according to the prediction result to execute the target driving action.
  • a computer-readable storage medium stores a computer program, and the computer program executes the steps in any of the method embodiments when running.
  • the computer-readable storage medium can be configured to store a computer program, and the computer program is used to perform the following steps:
  • the target vehicle includes surrounding vehicles passing through the target intersection and virtual vehicles configured for the current observation field of view of the target vehicle;
  • S3 Input the first vehicle state information and the second vehicle state information into the vehicle motion prediction model to obtain a prediction result, wherein the vehicle motion prediction model is used to predict the vehicle's state in the next control cycle based on the vehicle state information of the current control cycle driving action;
  • a program instructing the relevant hardware of the terminal device.
  • the program may be stored in a computer-readable storage medium, which may include a flash disk, read only memory (ROM), random access memory (RAM), magnetic disk, or optical disk.
  • the integrated unit in the above-described embodiment is implemented in the form of a software functional unit, and is sold or used as an independent product, the integrated unit may be stored in the above-described computer-readable storage medium.
  • the technical solution of the present invention essentially constitutes part of the prior art, or all or part of the technical solution can be implemented in the form of a software product, the computer software product is stored in a storage medium, and the software product includes instructions for indicating one or more A plurality of computer devices (which may be personal computers, servers, network devices, etc.) to execute several instructions of all or part of the steps of the methods described in the embodiments of the present invention.
  • the disclosed client may be implemented in other ways.
  • the embodiments described above with respect to the apparatus are merely exemplary.
  • the division of units is only logical function division, and other division methods are also possible in actual operation.
  • multiple units or components may be combined or integrated into another system, or certain features may be omitted or not implemented.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be achieved through some interfaces. Indirect couplings or communication connections between units or modules may be accomplished electronically or in other forms.
  • Units that appear as separate units may or may not be physically separate, and the illustrated portions of units may or may not be physical units, which may be located at one location, or may be distributed over multiple network elements. Some or all of the units may be selected according to actual requirements to achieve the effects of the solutions of the embodiments.
  • the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may physically exist independently, or two or more units may be integrated into one unit.
  • the integrated unit can be implemented in the form of hardware, and can also be implemented in the form of software functional units.

Abstract

一种车辆控制方法、设备、存储介质和电子装置。该方法包括:获取目标车辆(202)的实时路况信息;当实时路况信息表明目标车辆(202)到达目标路口(20)时,获取当前行驶状态下目标车辆(202)的第一车辆状态信息,以及获取当前行驶状态下目标车辆(202)在目标路口(20)所在区域的第二车辆状态信息,其中,目标车辆(202)包括通过目标路口(20)的周围车辆(206)和为目标车辆(202)的当前观察视野配置的虚拟车辆(204-1、204-2;602-1~602-9);将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;根据预测结果控制目标车辆(202)以执行目标行驶动作。

Description

车辆控制方法、设备、存储介质和电子装置 技术领域
本发明涉及车辆控制领域,具体地,涉及车辆控制方法、设备、存储介质和电子装置。
背景技术
如今,为了简化驾驶操作,越来越多的智能驾驶技术被应用于车辆,帮助驾驶员更安全可靠地进行驾驶。显然,自动驾驶已然成为重要的发展趋势。
然而,没有信号灯的路口存在各种不确定的交通情况,现有的自动驾驶技术还不能处理这种情况。例如,行驶的车辆突然出现在路口,遮挡自动驾驶的目标车辆的当前驾驶视野,会导致目标车辆容易与遮挡区域中的潜在障碍物发生碰撞。即,现有技术中车辆在通过路口时,由于不能完全了解路况,在车辆控制期间往往难以有效地保证驾驶安全。
这个问题目前还没有有效的解决办法。
发明内容
本发明的实施例提供了一种车辆控制方法和设备、存储介质和电子装置,以解决相关技术中车辆控制方法的控制安全性相对较低的问题。
根据本发明实施例的一个方面,提供了一种车辆控制方法,包括:获取目标车辆的实时路况信息;当实时路况信息表明目标车辆到达目标路口时,获取当前行驶状态下目标车辆的第一车辆状态信息,以及获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息,其中,目标车辆包括通过目标路口的周围车辆和为目标车辆的当前观察视野配置的虚拟车辆;将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;根据预测结果控制目标车辆以执行目标行驶动作。
根据本发明实施例的另一方面,进一步提供了一种车辆控制设备,包括:第一获取单元,用于获取目标车辆的实时路况信息;第二获取单元,当实时路况信息表明目标车辆到达目标路口时,获取当前行驶状态下目标车辆的第一车辆状态信息,以及获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息,其中,目标车辆包括通过目标路口的周围车辆和为目标车辆的当前观察视野而配置的虚拟车辆;输入预测单元,用于将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;以及控制单元,用于根据预测结果控制目标车辆以执行目标行驶动作。
根据本发明实施例的另一方面,进一步提供了一种计算机可读存储介质。所述计算机可读存储介质存储有计算机程序,其中所述计算机程序运行是执行所述车辆控制方法。
本发明的实施例中,当目标车辆将要通过目标路口时,在目标路口被遮挡的观察视野配置对应的虚拟车辆,以模拟遮挡区域内的潜在移动障碍物,结合周围车辆和实际通过目标路口所在区域的 目标车辆的车辆状态信息,通过车辆运动预测模型预测目标车辆通过目标路口时的目标行驶动作。本发明使得目标车辆在自动驾驶通过目标路口时,能够全面了解目标路口的路况信息,从而避免行驶中的碰撞风险,提高了车辆控制的安全性,进而克服了现有技术中车辆控制安全性相对较低的问题。
附图说明
这里描述的附图为本申请提供了进一步理解,并且形成本申请的一部分。本申请的示意性实施例及其描述仅用于解释本申请,而非旨在限定本申请。In the drawings:
图1是根据本发明一实施例的一种可替换性车辆控制方法的流程图;
图2是根据本发明一实施例的一种可替换性车辆控制方法的应用环境的示意图;
图3是根据本发明一实施例的一种可替换性车辆控制方法的示意图;
图4是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图5是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图6是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图7是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图8是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图9是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图10是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图11是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图12是根据本发明一实施例的另一种可替换性车辆控制方法的示意图;
图13是根据本发明一实施例的一种可替换性车辆控制设备的结构图;
具体实施方式
为了使本领域技术人员更好地理解本发明的解决方案,以下结合本发明实施例中的附图,清楚完整地描述了本发明实施例中的技术方案。显然,所描述的实施例仅仅是本发明实施例的一部分,而不是全部。本领域普通技术人员基于本发明的实施例在没有创造性努力的情况下获得的其他实施例均属于本发明的保护范围。
应注意,本发明的说明书、权利要求书和附图中,“第一”和“第二”等用语用于区分相似的描述对象,不一定表示顺序或次序。应理解,本申请使用的数据可以在适用的情况互换,从而本申请的实施例可以以不同于这里示出或描述的顺序来实现。此外,用语“包括”和“具有”等旨在涵盖非排他性的包含,例如,一种过程、方法、系统、产品或装置,包括一系列步骤或单元,不一定限于明确列出的那些步骤或单元,而是可包括未明确列出的,或这种过程、方法、产品或装置固有的其他步骤或单元。
根据本发明实施例的一个方面,提供了一种车辆控制方法。可选地,作为可选的实施方式,如图1所示,该车辆控制方法包括:
S102,获取目标车辆的实时路况信息;
S104,当实时路况信息表明目标车辆到达目标路口时,获取当前行驶状态下目标车辆的第一车辆状态信息,以及获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息,其中,目标车辆包括通过目标路口的周围车辆和为目标车辆的当前观察视野配置的虚拟车辆;
S106,将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;
S108,根据预测结果控制目标车辆,以执行目标行驶动作。
可选地,在该实施例中,上述车辆控制方法可以应用于但不限于配置有高级驾驶辅助系统(ADAS)的车辆,以帮助车辆在自动驾驶时进行高效、安全的驾驶控制操作。ADAS系统利用安装在车辆上的各种传感器,在车辆行驶过程中随时感知周围环境,采集数据,对静态和动态物体进行识别、检测和跟踪,收集导航仪的地图数据,并对系统进行计算和分析,使驾驶员提前意识到可能存在的危险,从而有效提高车辆的驾驶安全性。
以及,本实施例中,在获取目标车辆的实时路况信息之后,如果实时路况信息指示目标车辆到达目标路口,则获取目标车辆在当前行驶状态下的第一车辆状态信息和目标车辆当前行驶状态下在目标路口所处区域的第二车辆状态信息。这里的车辆包括正在通过目标路口的周围车辆和为目标车辆的当前观察视野配置的虚拟车辆。将第一车辆状态信息和所述第二车辆状态信息输入到车辆运动预测模型中,用于基于车辆状态信息预测车辆的行驶动作,以获得预测结果,并根据预测结果控制目标车辆,以执行目标行驶动作。即,目标车辆将要通过目标路口时,在目标路口被遮挡的观察视野配置对应的虚拟车辆,以模拟遮挡区域内的潜在移动障碍物,结合周围车辆和实际通过目标路口所在区域的目标车辆的车辆状态信息,通过车辆运动预测模型预测目标车辆通过目标路口时的目标行驶动作。本发明使得目标车辆在自动驾驶通过目标路口时,能够全面了解目标路口的路况信息,从而避免行驶中的碰撞风险,提高了车辆控制的安全性,进而克服了现有技术中车辆控制安全性相对较低的问题。
以下结合图2所示进行具体描述:假设图2中所示的是关于目标车辆202当前到达的目标十字路口20的实时路况信息;并且,假设目标车辆202的当前行驶目的是左转,如图2中箭头虚线所示的方向,目标车辆202将左转进入横向车道。横向车道包括车道_1和车道_2两个车道,其中车道_1的通过方向与目标车辆202的行驶方向相反,车道_2的通过方向与目标车辆202的行驶方向相同。进一步,本申请中,对角线填充的矩形202表示目标车辆202,网格线填充的矩形204-1和网格线填充的矩形204-2表示设置在当前观察视野的两侧的边界处的虚拟车辆204-1和虚拟车辆204-2,横线填充的矩形表示周围车辆206,空白矩形200表示目标路口20的静态障碍物200。
如图2所示,基于右侧设置有静态障碍物200的目标路口20的实时路况信息,可以看出目标车辆202的当前观察视野范围是图2中用点填充的三角形区域。然后,在当前观察视野的边界上设置虚拟车辆204-1和虚拟车辆204-2,其中虚拟车辆204-1设置在车道_1的左侧,虚拟车辆204-1向 右行驶以进入目标路口20,虚拟车辆204-2设置在车道_2的右侧,虚拟车辆204-2向左行驶以进入目标交叉路口20。本申请中,虚拟车辆204-1和虚拟车辆204-2各自配置有相应车道中允许的最大行驶速度。通过设置虚拟车辆以避免目标车辆202在进入目标路口20后与潜在移动障碍物碰撞,模拟目标车辆202在视野盲区中的潜在移动障碍物。
进一步,车辆运动预测模型输入有虚拟车辆的车辆状态信息、目标车辆本身的车辆状态信息以及当前正通过目标路口的周围车辆的车辆状态信息,并为目标车辆预测下一控制周期要执行的目标行驶动作,以确保目标车辆通过目标路口时的安全性。
本发明的实施例中,当目标车辆将要通过目标路口时,在目标路口被遮挡的观察视野配置对应的虚拟车辆,以模拟遮挡区域内的潜在移动障碍物,结合周围车辆和实际通过目标路口所在区域的目标车辆的车辆状态信息,通过车辆运动预测模型预测目标车辆通过目标路口时的目标行驶动作。本发明使得目标车辆在自动驾驶通过目标路口时,能够全面了解目标路口的路况信息,从而避免行驶中的碰撞风险,提高了车辆控制的安全性,进而克服了现有技术中车辆控制安全性相对较低的问题。
作为可选的方案,获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息包括:
S1:确定目标车辆到达目标路口的到达时间;
S1:获取在到达时间进入目标路口所在区域的周围车辆的周围车辆状态信息,其中第二车辆状态信息包括周围车辆状态信息;
S3:确定与到达时间匹配的目标车辆的当前观察视野的视野边界;
S4:获取配置在视野边界的虚拟车辆的虚拟车辆状态信息,其中,第二车辆状态信息包括虚拟车辆状态信息。
可选地,该实施例中,在获取配置在视野边界的虚拟车辆的虚拟车辆状态信息之前,该方法还包括:
1.在目标路口的第一车道与视野边界相交的第一交叉线上设置第一虚拟车辆,其中,第一交叉线是第一虚拟车辆进入当前观察视野的起始线,第一虚拟车辆在第一车道上的行驶方向与目标车辆的行驶方向相同,且第一虚拟车辆配置有第一车道允许的第一最大行驶速度;
2.在目标路口的第二车道与视野边界相交的第二交叉线上设置第二虚拟车辆,其中,第二交叉线是第二虚拟车辆进入当前观察视野的起始线,第二虚拟车辆在第二车道上的行驶方向与目标车辆的行驶方向相反,第二虚拟车辆配置有第二车道允许的第二最大行驶速度。
应注意,自动驾驶车辆通过无信号灯的路口时,不仅需要考虑视野中可见区域的静态障碍物(如交通安全岛)和动态障碍物(如行驶车辆);还需考虑观察视野之外的遮挡区域中的潜在障碍物(例如被遮挡的行驶车辆),从而允许目标车辆模仿人的自主行为。当遇到图3(a)-3(d)所示的路口时,通过一步改善视野,可以避开动态障碍物,通过目标路口。
例如,当面对图3(a)中所示的路口时,一般来说,驾驶员会采用缓慢挪动前进的策略。驾驶员慢慢向前移动,如果计划右转,就观察前方车道上迎面而来的车辆,如果计划左转,就观察两条车道上迎面而来的车辆。这样,驾驶员小心地扩大可视范围,直到有足够的安全距离转弯。
图3(b)中所示的动态遮挡物以及图3(c)和3(d)中所示的更为复杂的情况都以类似的方式处理。基于规则的方法看起来可以模仿给定特定交通场景下的人类决策。然而,它难以应对不断增加的复杂性和不确定性,缺乏一般化的能力。
因此,为了模拟潜在障碍物,可以在观察视野的边界处配置虚拟车辆,为车辆分配行驶方向和行驶速度,以模拟真实车辆,从而预测目标车辆在通过目标路口时是否会与虚拟车辆碰撞,进而防止目标车辆与遮挡区域内的潜在移动障碍物碰撞。.目标车辆202(即,车辆ego)的碰撞场景如图4所示,车辆202与另一车辆402碰撞。它们的位置、速度和角度是已知的或可观察的。
为了实现无碰撞轨线,需要考虑两个不确定性来源。第一个是其他车辆(即周围车辆)的动向。对于具有多条路线选择的车辆,如果其中一条路线与车辆ego的路线相交,那么这种不确定性应在规划时考虑到。另一种不确定性来自遮挡区域。如果被遮挡区域中的潜在车辆(即虚拟车辆)的路线与车辆ego的路线相交,则需要对虚拟车辆进行处理。车辆ego的车辆状态信息包括车辆的位置、速度、角度以及车辆的长度和宽度。
这些信息可以从不同类型车辆的先验信息的感知结果估计出来。基于这些观察结果,沿着预设路线生成一系列加速度a=[a 0;a 1;a 2;..,],可使车辆ego能够安全高效地通过拥堵的交叉路口。
结合以下例子进行详细描述;假设目标车辆的实时路况信息如图5所示。车辆ego 202的视野被遮挡。每条车道的视野边界用竖线表示。
此处,静态障碍物和动态障碍物造成的遮挡均考虑在内。我们提出了一种通用的方法来处理各种遮挡场景,不区分遮挡类型。用图5所示的简单场景来解释该方法。车辆ego计划向左转,有两条水平车道与车辆ego的路线相交。如图5所示,各个车道上有左视野边界和右视野边界。边界以外的车辆是看不见的。为了规划安全的轨线,假设虚拟车辆处在被遮挡的区域。这里我们采用一种较差的方法。虚拟车辆被放置在视野边界上,作为用于规划的真实车辆。对于车道_1,左侧迎面而来的车辆可能会干扰车辆ego 202,因此虚拟车辆602-1被放置在左边界1处。同样,另一虚拟车辆602-2被放置在右边界2。如图6(a)所示。
动态障碍物可能会导致类似的视野障碍。虚拟车辆602-3以与静态障碍物相同的方式放置在边界处。如图6(b)所示。对于静态遮挡和动态遮挡都存在的情况,可以以相同的方式计算出视野边界,因此虚拟车辆(即,虚拟车辆602-5~虚拟车辆602-7)可如图6(d)(e)所示的类似方式放置。当一辆大卡车604正好在车辆ego 202的前面时。这种情况下,卡车604后面和边界内的区域将被遮挡。为了处理这种情况,在卡车后面的车道上放置虚拟车辆。如图6(c)(f)所示,虚拟车辆的位置与车辆ego 202相对于卡车604对称,即,虚拟车辆602-4和虚拟车辆602-8、虚拟车辆602-9。
对于与车辆ego方向相反的车道,虚拟车辆总是放置在左视野边界上。对于与车辆ego方向相同的车道,虚拟车辆放置在右视野边界上。此外,如果车辆ego所面向的区域被遮挡,则虚拟车辆也放置在那里。如图所示的7(a)-7(d)通道。
每个通道都有遮挡部分(空白区域)和可观察部分(对角线填充区域)。遮挡是由静态障碍物和/或动态障碍物形成的。根据车道的方向和车辆ego的方向,虚拟车辆被放置在视野边界以及车辆ego正对的遮挡区域上。
本申请提供的实施例,通过在观察视野的边界处配置虚拟车辆,来模拟观察视野之外的潜在移动障碍物,使得目标车辆能够在自动驾驶时避免潜在的风险,从而确保目标车辆通过目标路口时的驾驶安全。
作为可选的方案,将所述第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,以获取预测结果包括:
S1,将第一车辆状态信息和第二车辆状态信息输入到部分可观察马尔可夫决策过程模型,其中车辆运动预测模型包括部分可观察马尔可夫决策过程模型;
S2,通过部分可观察马尔可夫决策过程模型,预测并获取目标车辆在当前控制周期之后的下一个控制周期内要执行的行驶动作的目标加速度,其中,预测结果包括目标加速度。
应注意,该实施例中,为了在不区分遮挡类型的情况下充分考虑各种遮挡情境,可以通过使用但不限于基于部分可观察马尔可夫POMDP框架建立的模型来实现精确的预测。
可选地,该实施例中,在自动驾驶期间,目标车辆针对当前实时路况信息和每个控制周期中相关车辆的车辆状态信息独立地决定下一个控制周期的行驶动作。从而实现了控制目标车辆逐步移动,以便平稳地通过目标路口,避免碰撞风险。
可选地,本实施例中,根据预测结果控制目标车辆以执行目标行驶动作包括:根据目标加速度确定目标车辆的目标行驶速度;控制目标车辆根据目标行驶速度行驶以通过目标路口。
通过本申请提供的实施例,利用POMDP框架建立的模型,可以融合不同的车辆状态信息,并借助POMDP求解器的解,准确预测目标车辆在下一个控制周期中的行驶动作,从而确保目标车辆自动驾驶通过路口的安全性。
作为可选的方案,通过部分可观察马尔可夫决策过程模型,预测并获取目标车辆在当前控制周期之后的下一个控制周期内要执行的行驶动作的目标加速度包括:
S1,在部分可观察马尔可夫决策过程模型中,基于第二车辆状态信息,计算每个目标车辆通过目标路口时选择在任意车道上行驶的行驶概率,获得行驶概率集,其中,第二车辆状态信息包括:目标车辆的位置、行驶速度、行驶方向和行驶路线;
S2,基于行驶概率集和第一车辆状态信息计算目标加速度,其中第一车辆状态信息包括:目标车辆的位置、行驶速度、行驶方向和预测行驶路线。
对于POMDP规划,所有配置的虚拟车辆都视为真实车辆。这包括具有不同路线选择的虚拟车辆的可选路线。各虚拟车辆配置了当前车道允许的最大速度,用于预测。与真实车辆不同,虚拟车辆 的放置将在每个时间步阶根据当前的可见性重置。由于虚拟车辆并不像预测的那样移动,由一般POMDP求解器生成的“最优”动作可能不是好的决策,有时甚至会导致系统崩溃。
为了规划安全的轨线,需要交通情况的时间演化。一般地,在每个时间步阶,给环境状态分配奖励。然后通过解决一优化问题得到策略,使得一段时间内的期望奖励最大化。由于其他道路车辆动向是未知的,因此每次获得新的观察结果后,需要保持和更新动向的概率分布。所有这些都可以在POMDP框架内进行。
POMDP是一种不确定情况下连续决策的数学框架。其特征在于元组(S,A,O,T,Z,R)。此处S,A,O分别表示状态空间、动作空间和观察空间。由于常规上,POMDP应用于具有离散空间的应用程序(如游戏),S,A,O都是有限的值集。T表示状态转移函数,具有概率表示T(s,a,s')=p(s'|a,s),其遵循马尔可夫假设,其状态s'只取决于最近的状态s,与之前的状态无关。Z表示观察模型Z(o,s',a)=p(o|s',a)。R表示奖励函数
Figure PCTCN2021083039-appb-000001
该函数为在状态s中执行动作a生成奖励值.未来的奖励按因子γ指数计算。因此,从t=0开始的累计奖励值表示如下:
Figure PCTCN2021083039-appb-000002
由于一些状态信息(如周围车辆的路线),不能直接测量,所以在状态上配置了概率分布,称为信念状态。信念状态b(s)表示处于状态s,b:S→[0,1]的概率。当执行动作a并获得新的观察值o时,信念b(s)可通过贝叶斯规则更新。
Figure PCTCN2021083039-appb-000003
POMDP旨在找到一个最优策略π *,该策略将信念状态映射到动作,从而最大化累计奖励的期望值。如等式3所示。
Figure PCTCN2021083039-appb-000004
POMDP建模的细节如下:
1)状态和观察:POMDP的状态包括车辆ego和所有周围车辆的信息,除了遮挡区域内的车辆。为了解决遮挡问题,配置了虚拟车辆,如上文所述。该状态也包括虚拟车辆的信息。车辆ego的状态定义如下:
s ego=[l ego,v egoego,r ego,]    (4)
其中l ego=(x ego;y ego)为当前位置。θ为方向。r ego表示该操作中预先定义的路线。v ego表示纵向速度。周围车辆veh i的状态的定义方式同上。然而,对于具有不同路线选择的车辆,
Figure PCTCN2021083039-appb-000005
是未知的。例如,图5中车道_1左侧路段的车辆可能直行,也可能右转,无法直接判断。而车辆ego的 状态是完全已知的。除具有多种路线选择的车辆之外,可以观察到其他车辆的状态。简便起见,不引入观察噪声,在实际设置中,根据传感器融合精度可轻易添加观察噪声。
2)动作空间:车辆ego的动作由POMDP求解器生成。本申请中,该动作是纵向加速度。动作空间为A={+3m/s 2,0m/s 2,-3m/s 2},其值分别表示加速、维持当前速度和减速。
3)过渡功能:过渡函数用于预测周围环境的未来状态,给定在当前时间步阶执行的动作。这样,可在规划期间实现交通状况的时间演变。这里我们假设周围车辆的动向和速度保持不变,车辆ego的速度将根据每个时间步阶的动作进行调整。过渡模型如等式5所示。对于所有周围车辆,a设置为0。
Figure PCTCN2021083039-appb-000006
4)信念更新:由于一些车辆有多条路线选择,每条路线的概率用新的观察值进行调整。这是按照贝叶斯法则进行的。例如,图8中,veh i有两个路线选项,r 1和r 2,每条路线都有自己的参考速度,分别由v 1和v 2表示。给定当前位置p和速度v,路线i的概率;i=1;2通过等式6更新。
Figure PCTCN2021083039-appb-000007
假设l和v是有条件地独立的,给定r i,从而得到
p(l,v|r i)=p(l|r i)p(v|r i)=Ν(y;y iy,i)Ν(v;v iv,i)    (7)
这里使用高斯模型。y i代表路线r i上给定x的参考位置y.从数据中可得到参数σ y,i和σ v,i。该操作中,使用σ y,i=10和_σ v,i=10,因为小的方差可能会由于连续倍增导致迅速下降到零。可以用位置p和速度v作为两个属性来创建更为复杂的预测器,并根据连续的历史信息进行训练。不过那并非本申请的重点。
5)奖励函数:该操作中,奖励函数R(s,a))包括三个部分,如下所示:
R(s,a)=R end(s)+R collision(s)+R v(s)    (8)
如果车辆ego通过十字路口,将获得正奖励R end(s)。如果车辆ego与其他道路车辆发生碰撞,将得到负奖励R collision(s)。R collision(s)通常设置较高的数值,以保证驾驶安全。此外,如果车辆ego需以高速行驶,而其速度低于当前车道的速度限制,则分配负奖励。
该操作中,设置R end(s)=1000,R collision(s)=-10000。R v的量级应比其他奖励小很多,且根据当前的速度调整。
6)关于虚拟车辆的问题:通常,通过最大化等式3可以获得最优动作。然而,它在本申请的技术问题中并不好用。这是因为引入了虚拟车辆。在POMDP规划过程中,虚拟车辆的处理方式与真实 车辆相同。不过虚拟车辆并未真的前进。事实上,虚拟车辆的位置在每个时间步阶都会重置,给定车辆ego的最新可见性,这可能会导致状态冻结。例如,根据等式3,车辆ego可能会静止不动,等待虚拟车辆先走。然而,假设虚拟车辆总是处在同一视野边界,这样车辆ego总是静止不动。为了避免出现冻结状态,车辆ego需要一个推力,即使静止不动也有最大的预期奖励。为了使车辆ego解除冻结状态,同时鼓励高驾驶速度,不同的优先级可分配给不同的动作。
通过本申请提供的实施例,在选择每个车道时获取目标车辆的行驶概率,并结合行驶动向确定目标车辆的行驶动作,来获取目标车辆的行驶动向,使得目标车辆在通过目标路口时的行驶轨线不会相交,即,避免碰撞。从而提高车辆控制的安全性。
作为可选的方案,行驶概率集和第一车辆状态信息计算目标加速度包括:
S1,根据行驶概率集和第一车辆状态信息,在预先构建的搜索树中搜索目标路径,并将目标路径中目标节点对应的加速度作为目标加速度,其中,搜索树上的各个节点配置有对应的节点奖励值,节点奖励值与基于当前节点路径的预测碰撞结果相关联,各个节点连接到以下子节点:用于控制车辆加速的加速节点、用于控制车辆减速的减速节点、用于控制车辆保持当前速度的保持节点,加速节点的选择优先级高于保持节点的选择优先级,保持节点的选择优先级高于减速节点的选择优先级。
作为可选的方案,根据所述行驶概率集和第一车辆状态信息,在预先构建的搜索树中搜索目标路径包括:
S1,遍历搜索树,直到找到目标路径;
S11,获取当前节点对应的当前加速度;
S12,当目标车辆以当前加速度对应的行驶速度行驶时,使用当前加速度、行驶概率集和第一车辆状态信息来预测目标车辆与目标车辆的碰撞预测结果;
S13,当碰撞预测结果表明没有碰撞时,将当前节点作为目标路径的节点,并从与当前节点连接的子节点获取节点作为当前节点;
S14,当所找到的搜索路径中的所有节点的碰撞预测结果表明没有碰撞时,将搜索路径确定为目标路径。
在该操作中,为了应对动态环境,POMDP是以联机在线的方式求解的。这是必要的,因为可视范围会随着车辆ego的前进而变化。在线求解器规划好一系列动作,直到特定视野。但是车辆ego只执行第一个动作,并且会在下一个时间点重新计算。
POMDP通常通过遍历信念搜索树来求解,沿着具有最大期望奖励的路径选择最优动作。然而,构建“完全树”的计算复杂度太高,不实用。因此,人们开发了各种POMDP近似求解器。大部分是为离散状态空间设计的,没有考虑到某些应用的特性。本申请中,我们开发了一种定制化的POMDP求解器,它可以解决由虚拟车辆引起的问题,同时降低计算复杂度。求解器详情如下。
1)信念更新。通常,信念搜索树会考虑可能存在的观察情况,因此信念更新是在树的构建期间执行的。这对于具有连续状态空间的应用来说是不必要的,因为观察空间不是一组有限的值。这些虚拟观察是由概率观察模型生成的,可能与真实观察有很大差异。此外,规划是以在线方式进行的, 可以适应环境变化。因此,该操作中,每个节点的信念保持不变。如图9所示。每个节点代表一个信念状态。从节点分支出来的三条边分别代表加速(“+”),保持当前速度(“0”)和减速(“-”)。箭头虚线表示从根到叶的成功路径。所有斜线填充的节点均已被访问过,空白节点将不会被访问。
3)动作优先级。预计车辆ego以较少的降速高速行驶。通常是通过当车辆ego的速度低于某个阈值(例如当前车道的速度限制)时分配负奖励来实现的。已成功应用于交叉路口场景。此处,不同的行动分配有不同的优先级。“加速”优先级最高,其次是“保持当前速度”,“减速”优先级最低。这样,鼓励高速行驶,可将车辆ego推离停滞状态。为了保证具有较高优先级的动作总是最先被访问,开发了搜索策略。
4)搜索策略为了处理动作优先级,执行深度优先搜索,如图9所示。此处,本申请的方案是尝试找到一条从根开始并在没有任何碰撞的情况下结束于叶的路径。找到这样的路径时,树遍历停止。这样,总是先访问加速,只有在加速和保持恒速不起作用的情况下,才访问减速。例如,如果图9中的虚线是没有碰撞的完整路径,那么当前时间点的最优动作是加速。斜线填充的节点已经被访问过,而空白节点将永远不会被访问,因为已经找到了一条成功的路径。这种策略同时提高了平均计算成本。假设每片叶子都有相同的概率p成为一条成功路径的终点,那么需要访问多少叶子节点的期望就形成了一个几何级数,如等式9所示。
Figure PCTCN2021083039-appb-000008
与叶子的数量|a| H相比,其中|a|表示行动空间大小,H表示规划的视野,
Figure PCTCN2021083039-appb-000009
通常要小得多,最小可达到|a| H
除了上述搜索策略,其他技巧也有助于降低计算复杂度。由于其他车辆的轨线在预测期间是固定的,所以可以预先计算每个未来时间步阶的各个车辆的状态,从而消除每个节点处的重复计算。高计算成本的另一个原因是包含不同状态组合的信念状态。假设每辆车有2个路线选项,场景中有5辆车,那么组合总数为25。检查每个组合的碰撞情况,并在各个节点分配奖励。如果同时考虑速度变化,组合的总数会很大,无法实时计算。这可以通过分解效用函数来实现,该函数对每辆车进行规划,并合并结果以选择最优行动。
作为可选的方案,根据行驶概率集和第一车辆状态信息,在预先构建的搜索树中搜索目标路径包括:
S1,遍历搜索树,依次将搜索树中的各个节点作为当前节点,以执行以下操作:
S11,获取第一累积值和参考累积值集,所述参考累积值集是通过对搜索树的起始节点的节点奖励值和当前节点的节点奖励值进行加权求和而获得的,其中,参考累积值集包括多个第二累积值,第二累积值是通过对起始节点的节点奖励值和与当前节点处于同一级别的任何其他节点的节点奖励值进行加权求和而获得的累积值;
S12,当第一累积值大于参考累积值集中的各个第二累积值时,将当前节点作为目标路径中的节点,并从与当前节点连接的子节点获取节点作为当前节点;
S13,当当前节点位于叶节点层时,将从起始节点到当前节点的路径作为目标路径。
可选地,该实施例中,当在预先构建的搜索树中找到目标路径时,还包括:为搜索树中的各个节点配置相应的节点奖励值,其中,节点奖励值是当穿过目标路口时基于当前节点的目标车辆与搜索路径碰撞的第一奖励值、当穿过目标路口时基于当前节点的目标车辆不与搜索路径碰撞的第二奖励值、以及参考奖励值的加权和的结果。
参考图10-12中所示的示例,充分描述了本申请的实施例中的解决方案。
该操作中的实验是在SUMO仿真器中进行的。SUMO是一个用于交通仿真的开源包。它可以模拟道路网络以及其他交通要素。还提供了API和TraCI,用于在线交互。可以在每个仿真步骤中控制每辆车的动作。但是不能直接控制POMDP输出的加速度,只能设置最大加速度。因此,我们根据等式5给定当前动作在每个模拟时间步阶调整速度,规划和仿真的时间间隔为0.1s。虽然SUMO提供了智能驾驶员模型(IDM)来控制每辆车的行为,但本操作中并没有用到它。原因有两个。首先,装有IDM的车辆会减速以避免碰撞,如果预测到碰撞,有时会冻结交通。其次,周围车辆的智能行为使本申请提供的方法难以评估。因此,所有其他车辆都按照预先设定的没有避免碰撞的模式行驶。
首先分析当只有静态障碍物存在时车辆ego的行为。如图10(a)所示,车辆ego计划左转,因此假定有两个虚拟车辆(如图6所示)。这种情况下,不存在真正的周围车辆。图10(b)示出了车辆ego的速度曲线以及位置y。正如我们所看到的,车辆ego首先加速,因为假定的虚拟车辆离它很远。然后,车辆ego停止加速,因为持续加速可能会产生碰撞,但同时不需要减速。随着车辆ego持续接近前方的水平车道,它开始减速。当它确定左转是安全的之后,它开始加速,直到达到允许的最大速度。
为了掌握对交通的全面观察,车辆ego需要到达y>=-4m的位置。如图10(b)所示,在获得全面之前,车辆ego在t=2:3s开始加速。因为在t=2:3s时,车辆ego和虚拟车辆之间有足够的距离,因此左转不会产生任何碰撞。这也是人类驾驶员的反应。很多情况下,我们无法对交通充分观察,但只要确保有足够的距离,就可以穿过十字路口。
在图11中,我们将本申请提出的方法与另外两种策略进行了比较。一种是V2V通信,它允许车辆ego访问遮挡区域的信息。另一种并非解决遮挡问题,因此没有假定虚拟车辆。可见,本申请提出的方法具有与V2V通信相似的曲线。如果没有采取措施来处理遮挡,车辆ego将加速,直到它观察到另一辆车辆。这种情况下,两辆车在t=3:7s时碰撞。
此处我们假设一个场景,一辆移动的卡车挡住了车辆ego的视线。为排除其他因素影响,除去了静态遮挡物。如图12所示,车辆ego在路口左转,一辆卡车正在靠近并阻挡车辆ego的视线,同时,另一辆车辆在卡车后面,并且无法被车辆ego观察到。由于卡车挡住了视线,按照图6(b)和6(c)依次放置了虚拟车辆。
将本申请提供的方法与其他两种策略,V2V通信和无虚拟车辆的对照组(baseline)进行比较。图12示出了上述三种方式的速度曲线。可以看出,V2V和本申请提供的方法都可以在没有任何碰撞的情况下完成左转,而对照组在t=5:7s时发生了碰撞。我们还注意到,V2V和本申请提供的方法的速度曲线非常接近,只是本申请提供的方法更保守些。本申请讨论了自动驾驶车辆通过拥堵路口的策略。覆盖了静态遮挡物和动态遮挡物。提供了一种处理各种遮挡场景的方法。采用POMDP框架处理所提出的问题。由于引入了虚拟车辆,它们的行为与真实车辆不同,一般的POMDP求解器给出的动作已然不是最优的,有时会冻结整个系统。为了配合所提出的方法,本申请开发了一种特制的POMDP求解器。同时,还可以降低了平均计算复杂度。
在静态和动态遮挡场景下对该方法的性能进行了评估。实验结果表明,车辆ego表现出与人类驾驶员相似的驾驶行为。我们还将其与其他策略进行了比较。结果表明
本申请提出的方法在避免碰撞方面是有效的。此外,本方法可轻松地扩展到其他不同车道布局的交叉路口。
应注意,为了简洁性,上述方法实施例被描述为一系列动作组合。然而,本领域技术人员应理解,本发明不受所描述的动作顺序的限制,本发明中的某些步骤可以其他顺序执行或同时执行。其次,本领域技术人员还应知道,说明书中描述的实施例都是优选实施例,并且所涉及的动作和模块在本发明中并非必要的。
根据本发明实施例的另一方面,还提供了一种用于实现上述车辆控制方法的车辆控制设备。如图13所示,该设备包括:
1)第一获取单元1302,用于获取目标车辆的实时路况信息;
2)第二获取单元1304,当实时路况信息表明目标车辆到达目标路口时,获取当前行驶状态下目标车辆的第一车辆状态信息,以及获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息,其中,目标车辆包括通过目标路口的周围车辆和为目标车辆的当前观察视野配置的虚拟车辆;
3)输入预测单元1306,用于将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;
4)控制单元1308,用于根据预测结果控制目标车辆以执行目标行驶动作。
关于具体实施例,可参考上述车辆控制方法的描述,本实施例中不再赘述。
根据本发明实施例的另一方面,还提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,且该计算机程序在运行时执行任一方法实施例中的步骤。
可选地,本实施例中,该计算机可读存储介质可配置为存储计算机程序,该计算机程序用于执行以下步骤:
S1,获取目标车辆的实时路况信息;
S2,当实时路况信息表明目标车辆到达目标路口时,获取当前行驶状态下目标车辆的第一车辆状态信息,以及获取当前行驶状态下目标车辆在目标路口所在区域的第二车辆状态信息,其中,目标车辆包括通过目标路口的周围车辆和为目标车辆的当前观察视野配置的虚拟车辆;
S3,将第一车辆状态信息和第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;
S4,根据预测结果控制目标车辆,以执行目标行驶动作。
可选地,该实施例中,本领域普通技术人员可以理解,上述实施例的方法的全部或部分步骤可以由指示终端设备的相关硬件的程序来实现。该程序可以存储在计算机可读存储介质中,该存储介质可以包括闪存盘、只读存储器(ROM)、随机存取存储器(RAM)、磁盘或光盘。
本发明实施例的序号仅用于描述,并非表示优选实施例。
如果上述实施例中的集成单元以软件功能单元的形式实现,并且作为独立产品出售或使用,则该集成单元可存储在上述计算机可读存储介质中。基于此,本发明的技术方案实质上构成现有技术的部分,或全部或部分技术方案可以通过软件产品的形式实现,该计算机软件产品存储于存储介质中,该软件产品包括用于指示一个或多个计算机设备(可以是个人计算机、服务器、网络设备等)以执行本发明实施例中描述的方法的全部或部分步骤的若干指令。
本发明的上述实施例中,实施例的描述主要突出重点,对于在特定实施例中没有详细描述的部分,可以参考其他实施例的相关描述。
应理解,本申请提供的几个实施例中,所公开的客户端可以以其他方式实现。上述关于设备的实施例仅仅是示例性的。例如,单元的划分只是逻辑功能划分,在实际操作中也可以有其他划分方式。再如,多个单元或组件可以被组合或集成到另一个系统中,或者可忽略或不执行某些特征。此外,所示出或讨论的相互之间的耦合或直接耦合或通信连接可以通过一些接口来实现。单元或模块之间的间接耦合或通信连接可以通过电子或其他形式实现。
以单独形式出现的单元可以是或不是物理上分开的,并示出的单元部分可以是或不是物理单元,其可以位于某一位置,或者可以分布在多个网络元件上。可根据实际需求选择部分或全部单元,以实现实施例的方案的效果。
此外,本发明实施例中的功能单元可集成到一个处理单元中,或每个单元可以物理上单独存在,或者两个或更多个单元集成到一个单元中。该集成单元可以以硬件的形式实现,也可以以软件功能单元的形式实现。
以上仅仅是本发明的优选实施例。应注意,本领域普通技术人员可以在不脱离本发明原理的情况下进行进一步的改进和修改,并且这些改进和修改也应当属于本发明的保护范围。

Claims (12)

  1. 一种车辆控制方法,其特征在于,包括:
    获取目标车辆的实时路况信息;
    当所述实时路况信息表明所述目标车辆到达目标路口时,获取当前行驶状态下所述目标车辆的第一车辆状态信息,以及获取当前行驶状态下所述目标车辆在所述目标路口所在区域的第二车辆状态信息,其中,所述目标车辆包括通过所述目标路口的周围车辆和为所述目标车辆的当前观察视野配置的虚拟车辆;
    将所述第一车辆状态信息和所述第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,所述车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;
    根据所述预测结果控制所述目标车辆,以执行目标行驶动作。
  2. 根据权利要求1所述的方法,其特征在于,所述获取当前行驶状态下所述目标车辆在所述目标路口所在区域的第二车辆状态信息包括:
    确定所述目标车辆到达所述目标路口的到达时间;
    获取在所述到达时间进入所述目标路口所在区域的所述周围车辆的周围车辆状态信息,其中所述第二车辆状态信息包括所述周围车辆状态信息;
    确定与所述到达时间匹配的所述目标车辆的所述当前观察视野的视野边界;
    获取配置在所述视野边界的所述虚拟车辆的虚拟车辆状态信息,其中,所述第二车辆状态信息包括所述虚拟车辆状态信息。
  3. 根据权利要求2所述的方法,其中,在获取配置在所述视野边界的所述虚拟车辆的虚拟车辆状态信息之前,所述方法还包括:
    在所述目标路口的第一车道与所述视野边界相交的第一交叉线上设置第一虚拟车辆,其中,所述第一交叉线是所述第一虚拟车辆进入所述当前观察视野的起始线,所述第一虚拟车辆在所述第一车道上的行驶方向与所述目标车辆的行驶方向相同,所述第一虚拟车辆配置有所述第一车道允许的第一最大行驶速度;
    在所述目标路口的第二车道与所述视野边界相交的第二交叉线上设置第二虚拟车辆,其中,所述第二交叉线是所述第二虚拟车辆进入所述当前观察视野的起始线,所述第二虚拟车辆在所述第二车道上的行驶方向与所述目标车辆的行驶方向相反,所述第二虚拟车辆配置有所述第二车道允许的第二最大行驶速度。
  4. 根据权利要求1所述的方法,其中,所述将所述第一车辆状态信息和所述第二车辆状态信息输入到车辆运动预测模型中,获取预测结果包括:
    将所述第一车辆状态信息和所述第二车辆状态信息输入到部分可观察马尔可夫决策过程模型,其中所述车辆运动预测模型包括所述部分可观察马尔可夫决策过程模型;
    通过所述部分可观察马尔可夫决策过程模型,预测并获取所述目标车辆在当前控制周期之后的下一个控制周期内要执行的行驶动作的目标加速度,其中,所述预测结果包括所述目标加速度。
  5. 根据权利要求4所述的方法,其中,所述根据所述预测结果控制所述目标车辆以执行目标行驶动作包括:
    根据所述目标加速度确定所述目标车辆的目标行驶速度;
    控制所述目标车辆根据所述目标行驶速度行驶以通过所述目标路口。
  6. 根据权利要求4所述的方法,其中,所述通过所述部分可观察马尔可夫决策过程模型,预测并获取所述目标车辆在当前控制周期之后的下一个控制周期内要执行的行驶动作的目标加速度包括:
    在所述部分可观察马尔可夫决策过程模型中,基于所述第二车辆状态信息,计算每个所述目标车辆通过所述目标路口时选择在任意车道上行驶的行驶概率,获得行驶概率集,其中,所述第二车辆状态信息包括:所述目标车辆的位置、行驶速度、行驶方向和行驶路线;
    基于所述行驶概率集和所述第一车辆状态信息计算目标加速度,其中所述第一车辆状态信息包括:所述目标车辆的位置、行驶速度、行驶方向和预测行驶路线。
  7. 根据权利要求6所述的方法,其中,所述基于所述行驶概率集和所述第一车辆状态信息计算目标加速度包括:
    根据所述行驶概率集和所述第一车辆状态信息,在预先构建的搜索树中搜索目标路径,并将所述目标路径中目标节点对应的加速度作为目标加速度,其中,所述搜索树上的各个节点配置有对应的节点奖励值,所述节点奖励值与基于当前节点路径的预测碰撞结果相关联,各个节点连接到以下子节点:用于控制车辆加速的加速节点、用于控制车辆减速的减速节点、用于控制车辆保持当前速度的保持节点,所述加速节点的选择优先级高于所述保持节点的选择优先级,所述保持节点的选择优先级高于所述减速节点的选择优先级。
  8. 根据权利要求7所述的方法,其中,所述根据所述行驶概率集和所述第一车辆状态信息,在预先构建的搜索树中搜索目标路径包括:
    遍历所述搜索树,直到找到目标路径;
    获取当前节点对应的当前加速度;
    用所述当前加速度、所述行驶概率集和所述第一车辆状态信息预测碰撞预测结果,所述碰撞预测结果表明当目标车辆以所述当前加速度对应的行驶速度行驶时目标车辆与目标车辆是否碰撞;
    当所述碰撞预测结果表明没有碰撞时,将当前节点作为目标路径中的节点,并从与当前节点连接的子节点获取节点作为当前节点;
    当所找到的搜索路径中的所有节点的碰撞预测结果表明没有碰撞时,将搜索路径确定为目标路径。
  9. 根据权利要求7所述的方法,其中,所述根据所述行驶概率集和所述第一车辆状态信息,在预先构建的搜索树中搜索目标路径包括:
    遍历所述搜索树,依次将所述搜索树中的各个节点作为当前节点,以执行以下操作:
    获取第一累积值和参考累积值集,所述参考累积值集是通过对搜索树的起始节点的节点奖励值和当前节点的节点奖励值进行加权求和而获得的,其中,所述参考累积值集包括多个第二累积值,所述第二累积值是通过对起始节点的节点奖励值和与当前节点位于同一级别的任何其他节点的节点奖励值进行加权求和而获得的累积值;
    当所述第一累积值大于参考累积值集中的各个第二累积值时,将当前节点作为所述目标路径中的节点,并从与当前节点连接的子节点获取节点作为当前节点;
    在当前节点位于叶节点层时,将从起始节点到当前节点的路径作为目标路径。
  10. 根据权利要求7所述的方法,其中,当在预先构建的搜索树中搜索目标路径时,所述方法还包括:
    为搜索树中的各个节点配置相应的节点奖励值,其中,所述节点奖励值是当穿过目标路口时基于当前节点的所述目标车辆与搜索路径碰撞的第一奖励值、当穿过目标路口时基于当前节点的所述目标车辆不与搜索路径碰撞的第二奖励值、以及参考奖励值的加权和的结果。
  11. 一种车辆控制设备,其特征在于,包括:
    第一获取单元,用于获取目标车辆的实时路况信息;
    第二获取单元,当所述实时路况信息表明所述目标车辆到达目标路口时,获取当前行驶状态下所述目标车辆的第一车辆状态信息,以及获取当前行驶状态下所述目标车辆在所述目标路口所在区域的第二车辆状态信息,其中,所述目标车辆包括通过所述目标路口的周围车辆和为所述目标车辆的当前观察视野而配置的虚拟车辆;
    输入预测单元,用于将所述第一车辆状态信息和所述第二车辆状态信息输入到车辆运动预测模型中,获取预测结果,其中,所述车辆运动预测模型用于基于当前控制周期的车辆状态信息来预测车辆在下一控制周期的行驶动作;
    控制单元,用于根据所述预测结果控制所述目标车辆,以执行目标行驶动作。
  12. 一种计算机可读存储介质,所述存储介质存储有程序,其特征在于,所述程序运行时执行权利要求1所述的方法。
PCT/CN2021/083039 2020-09-28 2021-03-25 车辆控制方法、设备、存储介质和电子装置 WO2022062349A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202180005858.5A CN114829226A (zh) 2020-09-28 2021-03-25 车辆控制方法、设备、存储介质和电子装置

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/033,982 2020-09-28
US17/033,982 US11866070B2 (en) 2020-09-28 2020-09-28 Vehicle control method and apparatus, storage medium, and electronic device

Publications (1)

Publication Number Publication Date
WO2022062349A1 true WO2022062349A1 (zh) 2022-03-31

Family

ID=80823230

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/083039 WO2022062349A1 (zh) 2020-09-28 2021-03-25 车辆控制方法、设备、存储介质和电子装置

Country Status (3)

Country Link
US (1) US11866070B2 (zh)
CN (1) CN114829226A (zh)
WO (1) WO2022062349A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2613037A (en) * 2021-11-19 2023-05-24 Motional Ad Llc Predicting motion of hypothetical agents

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230030815A1 (en) * 2021-07-29 2023-02-02 Argo AI, LLC Complementary control system for an autonomous vehicle
CN115123131B (zh) * 2022-07-08 2024-01-09 浙江吉利控股集团有限公司 车辆安全电子控制方法、装置、设备及存储介质
CN114872735B (zh) * 2022-07-10 2022-10-04 成都工业职业技术学院 基于神经网络算法的自动驾驶物流车辆决策方法和装置
CN115610435B (zh) * 2022-12-02 2023-04-11 福思(杭州)智能科技有限公司 对象行驶意图的预测方法和装置、存储介质及电子装置
CN117523874B (zh) * 2024-01-05 2024-04-12 每日互动股份有限公司 一种车辆通行状态的获取方法、装置、介质及电子设备

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106114503A (zh) * 2015-05-05 2016-11-16 沃尔沃汽车公司 用于确定安全车辆轨迹的方法和装置
US20180231974A1 (en) * 2017-02-14 2018-08-16 Honda Research Institute Europe Gmbh Risk based driver assistance for approaching intersections of limited visibility
CN110427021A (zh) * 2018-05-01 2019-11-08 本田技研工业株式会社 用于生成自动驾驶车辆交叉路口导航指令的系统和方法
CN110603497A (zh) * 2017-02-10 2019-12-20 日产北美公司 自主车辆操作管理控制
CN111226268A (zh) * 2017-05-02 2020-06-02 密歇根大学董事会 用于自动驾驶车辆的模拟车辆交通

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9037404B2 (en) * 2013-06-01 2015-05-19 Savari, Inc. System and method for map matching
WO2019083513A1 (en) * 2017-10-24 2019-05-02 Nissan North America, Inc. LOCATION DETERMINATION FOR VEHICLE OPERATION
CN109933086B (zh) 2019-03-14 2022-08-30 天津大学 基于深度q学习的无人机环境感知与自主避障方法
KR20210134317A (ko) * 2019-03-29 2021-11-09 인텔 코포레이션 자율 차량 시스템
US20210101625A1 (en) * 2019-10-08 2021-04-08 Motional Ad Llc Navigating multi-way stop intersections with an autonomous vehicle
US11884294B2 (en) * 2019-12-30 2024-01-30 Nvidia Corporation Lane change planning and control in autonomous machine applications
EP4039557A1 (en) * 2021-02-09 2022-08-10 Aptiv Technologies Limited Trajectory validation for autonomous driving

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106114503A (zh) * 2015-05-05 2016-11-16 沃尔沃汽车公司 用于确定安全车辆轨迹的方法和装置
CN110603497A (zh) * 2017-02-10 2019-12-20 日产北美公司 自主车辆操作管理控制
US20180231974A1 (en) * 2017-02-14 2018-08-16 Honda Research Institute Europe Gmbh Risk based driver assistance for approaching intersections of limited visibility
CN111226268A (zh) * 2017-05-02 2020-06-02 密歇根大学董事会 用于自动驾驶车辆的模拟车辆交通
CN110427021A (zh) * 2018-05-01 2019-11-08 本田技研工业株式会社 用于生成自动驾驶车辆交叉路口导航指令的系统和方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2613037A (en) * 2021-11-19 2023-05-24 Motional Ad Llc Predicting motion of hypothetical agents

Also Published As

Publication number Publication date
US20220097736A1 (en) 2022-03-31
US11866070B2 (en) 2024-01-09
CN114829226A (zh) 2022-07-29

Similar Documents

Publication Publication Date Title
WO2022062349A1 (zh) 车辆控制方法、设备、存储介质和电子装置
US11900797B2 (en) Autonomous vehicle planning
CN110333714B (zh) 一种无人驾驶汽车路径规划方法和装置
Lin et al. Decision making through occluded intersections for autonomous driving
JP6837558B2 (ja) 自車両の制御方法及び自車両の制御システム
US10678248B2 (en) Fast trajectory planning via maneuver pattern selection
Bouton et al. Scalable decision making with sensor occlusions for autonomous driving
WO2022052406A1 (zh) 一种自动驾驶训练方法、装置、设备及介质
US11480967B2 (en) Pass route planning method and apparatus, device and readable storage medium
CN111301419A (zh) 用于sae 4级自动化车道变更的基于强化学习的方法
Schörner et al. Predictive trajectory planning in situations with hidden road users using partially observable markov decision processes
KR102588008B1 (ko) 진입하거나 진출하는 차량을 검출하기 위한 방법 및 제어 유닛
WO2018220418A1 (en) Driving assistance method and system
CN113844451B (zh) 行驶设备控制方法、装置、电子设备及存储介质
Zhang et al. Improved occlusion scenario coverage with a POMDP-based behavior planner for autonomous urban driving
Neel et al. Improving bounds on occluded vehicle states for use in safe motion planning
Zhang et al. Traffic mirror-aware pomdp behavior planning for autonomous urban driving
CN112116226B (zh) 仿真车辆的控制方法、装置、计算机设备和存储介质
CN116499487B (zh) 一种车辆的路径规划方法、装置、设备及介质
US20230162539A1 (en) Driving decision-making method and apparatus and chip
CN115743183A (zh) 自动驾驶控制方法、装置、设备、介质及车辆
CN115782876A (zh) 一种变道轨迹生成方法、装置、设备及自动驾驶车辆
CN114633765A (zh) 基于概率栅格图的速度决策方法及其装置、相关产品
CN117622227B (zh) 一种车辆换道避障控制方法及装置
Nichting et al. Case study on gap selection for automated vehicles based on deep q-learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21870738

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21870738

Country of ref document: EP

Kind code of ref document: A1