WO2021237768A1 - Data-driven-based system for implementing automatic iteration of prediction model - Google Patents

Data-driven-based system for implementing automatic iteration of prediction model Download PDF

Info

Publication number
WO2021237768A1
WO2021237768A1 PCT/CN2020/094133 CN2020094133W WO2021237768A1 WO 2021237768 A1 WO2021237768 A1 WO 2021237768A1 CN 2020094133 W CN2020094133 W CN 2020094133W WO 2021237768 A1 WO2021237768 A1 WO 2021237768A1
Authority
WO
WIPO (PCT)
Prior art keywords
behavior data
network model
road
target network
model
Prior art date
Application number
PCT/CN2020/094133
Other languages
French (fr)
Chinese (zh)
Inventor
董维山
张驰
杨文娟
蒋竺希
Original Assignee
初速度(苏州)科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 初速度(苏州)科技有限公司 filed Critical 初速度(苏州)科技有限公司
Priority to DE112020003091.1T priority Critical patent/DE112020003091T5/en
Publication of WO2021237768A1 publication Critical patent/WO2021237768A1/en

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles
    • B60W60/001Planning or execution of driving tasks
    • B60W60/0027Planning or execution of driving tasks using trajectory prediction for other traffic participants
    • B60W60/00272Planning or execution of driving tasks using trajectory prediction for other traffic participants relying on extrapolation of current movement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/04Generating or distributing clock signals or signals derived directly therefrom
    • G06F1/14Time supervision arrangements, e.g. real time clock
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0108Measuring and analyzing of parameters relative to traffic conditions based on the source of data
    • G08G1/0112Measuring and analyzing of parameters relative to traffic conditions based on the source of data from the vehicle, e.g. floating car data [FCD]
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • G08G1/0145Measuring and analyzing of parameters relative to traffic conditions for specific applications for active traffic flow control
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0019Control system elements or transfer functions
    • B60W2050/0028Mathematical models, e.g. for simulation
    • B60W2050/0031Mathematical model of the vehicle
    • B60W2050/0034Multiple-track, 2D vehicle model, e.g. four-wheel model
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2554/00Input parameters relating to objects
    • B60W2554/40Dynamic objects, e.g. animals, windblown objects
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2556/00Input parameters relating to data
    • B60W2556/45External transmission of data to or from the vehicle

Definitions

  • the present invention relates to the technical field of intelligent driving, in particular to a data-driven system for realizing automatic iteration of prediction models.
  • knowing the trajectory of road participants in advance is conducive to their own safe driving.
  • the future trajectory of road participants is determined by the predictive model set on the unmanned vehicle
  • the prediction model needs to be updated from time to time.
  • the current method of updating the prediction model is: the developer performs offline annotation on the behavior data of a large number of road participants to obtain the annotation information, and then the developer conducts model training and evaluation based on the behavior data of the road participants and the corresponding annotation information to obtain new predictions Model, and then the developer will update the new predictive model to the unmanned vehicle through network transmission or hard disk connection.
  • the invention provides a data-driven system for realizing automatic iteration of prediction models without manual participation, greatly reducing labor costs, and having a high degree of automation.
  • the specific technical solution is as follows.
  • the present invention provides a data-driven system for automatic iteration of predictive models.
  • the system includes a processor and a cloud.
  • the processor is provided with a predictive model, and the predictive model is used to predict road participants. Trajectory of future movement;
  • the processor obtains the behavior data of each road participant, where the behavior data includes the environmental static map information at the current moment and the historical movement trajectory of the road participant before the current moment collected by the collection device installed in itself, based on The motion trajectory of each road participant observed by its own sensor is labeled with the behavior data of each road participant to obtain the corresponding labeling information, and the behavior data of each road participant is selected from the behavior data of each road participant that meets the preset screening requirements First behavior data, sending the first behavior data and corresponding annotation information to the cloud;
  • the cloud stores the received first behavior data and corresponding annotation information in the original database, and performs feature extraction on the first behavior data according to a preset feature extraction method to obtain a feature extraction amount, and the feature extraction amount is And the corresponding labeling information are stored as training samples in the training sample library, and when the preset automatic trigger conditions are met, the training samples stored in the training sample library after the last training sample is extracted are extracted to train the initial network model to obtain the target A network model, wherein the target network model is used to correlate the behavior data of road participants with the corresponding future motion trajectory, and evaluate the target network model according to a preset evaluation method to obtain the evaluation result, when the evaluation result When the model update requirement is met, sending the target network model to the processor;
  • the processor receives the target network model, and updates the prediction model to the target network model.
  • the processor uses the motion trajectory of each road participant observed by its own sensor as label information corresponding to the behavior data of each road participant.
  • the processor predicts and obtains the future motion trajectory corresponding to the behavior data of each road participant based on the prediction model, and for each road participant, calculates the future motion trajectory corresponding to the behavior data of the road participant and The difference between the movement trajectories of the road participants observed by its own sensor, and the behavior data of the road participants whose difference is greater than the preset difference is used as the first behavior data;
  • the processor determines the behavior category corresponding to the behavior data of each road participant according to the label information corresponding to the behavior data of each road participant, and uses the behavior data of the road participants whose behavior category is a preset category as the first One behavioral data;
  • the processor determines the type of each road participant, and uses the behavior data of the road participant whose type is a preset type as the first behavior data.
  • the preset category is lane changing behavior or overtaking behavior.
  • the preset type is a large vehicle, a pedestrian, or a two-wheeled vehicle.
  • the processor stores the first behavior data and corresponding annotation information before sending the first behavior data and corresponding annotation information to the cloud.
  • the cloud extracts the training samples stored in the training sample library after the last training sample is extracted
  • the training samples train the initial network model to obtain the target network model
  • the cloud extracts the training samples stored in the training sample library after the last time the training samples were extracted to train the initial network model to obtain the target Network model.
  • the cloud predicts the behavior data of each road participant to be predicted in the data set to be predicted based on the target network model to obtain the corresponding future motion trajectory;
  • the target network model is sent to the processor.
  • the road participants include vehicles and/or pedestrians.
  • the processor marks the behavior data of each road participant based on the motion trajectory of each road participant observed by its own sensor to obtain the corresponding label information, and realizes automatic labeling.
  • Annotate information instead of manual offline annotation, and then send the filtered first behavior data and corresponding annotation information to the cloud.
  • the cloud determines that the preset automatic trigger condition is met, it will extract the training sample library after the last training sample was extracted
  • the stored training samples train the initial network model to obtain the target network model, which can automatically trigger the model training when the preset automatic trigger conditions are met, and then automatically trigger the evaluation after the target network model is obtained, and the evaluation result meets the model
  • the target network model is sent to the processor to achieve the automatic deployment of the target network model.
  • the processor receives the target network model, it can automatically update the prediction model to the target network model.
  • the processor labels the behavior data of each road participant to obtain corresponding labeling information based on the motion trajectory of each road participant observed by its own sensor, and realizes automatic labeling to obtain the labeling information instead of manual offline labeling.
  • the training samples stored in the training sample library after the last training sample was extracted can be used to train the initial network model to obtain the target network model. Therefore, the embodiment of the present invention satisfies the preset
  • the model training can be automatically triggered without manual participation, which greatly reduces labor costs and has a high degree of automation.
  • the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the prediction model to be updated
  • the target network model is sent to the processor, the automatic deployment of the target network model is achieved without manual participation, which greatly reduces labor costs and has a high degree of automation.
  • the processor After the processor receives the target network model, it can automatically update the prediction model to the target network model without manual intervention, which greatly reduces labor costs and has a high degree of automation.
  • a data-driven system for automatic iteration of prediction models is provided. Only one developer can complete the automatic update of the prediction model, which greatly improves the efficiency of research and development and reduces the cost of research and development. .
  • the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the prediction model to be updated
  • the target network model is sent to the processor, the automatic deployment of the target network model is achieved without manual participation, which greatly reduces labor costs and has a high degree of automation.
  • FIG. 1 is a schematic structural diagram of a data-driven system for realizing automatic iteration of predictive models according to an embodiment of the present invention.
  • the embodiment of the present invention discloses a data-driven system for realizing automatic iteration of a prediction model, which can automatically update the prediction model without manual participation, greatly reduces labor costs, and has a high degree of automation.
  • the embodiments of the present invention will be described in detail below.
  • FIG. 1 is a schematic structural diagram of a data-driven system for realizing automatic iteration of predictive models according to an embodiment of the present invention.
  • a data-driven system for realizing automatic iteration of prediction models provided by an embodiment of the present invention includes a processor 10 and a cloud 20, the processor 10 and the cloud 20 are in communication connection, and the processor 10 is provided with a prediction model and a prediction model. It is used to predict the future trajectory of road participants, where road participants include vehicles and/or pedestrians.
  • the processor 10 automatically acquires the behavior data of each road participant and makes online annotations.
  • the behavior data includes the environmental static map information at the current moment and the data collected by the collection equipment installed in itself. The historical movement trajectory of road participants before the current moment.
  • the processor 10 can obtain the behavior data of each road participant as follows: it is equipped with collection equipment and positioning system, such as a camera and a global positioning system, so that it can perform real-time monitoring during driving.
  • the collection range is collected and its own location is located.
  • the current environmental static map information and the movement trajectory of each road participant can be obtained.
  • the environmental static map information can include Road marking information and road sign information, such as: zebra crossing and red street light information.
  • the way that the processor 10 obtains the behavior data of each road participant may be as follows: it is installed with a collection device, a positioning system, and a high-precision map, such as a camera, a global positioning system, and a high-precision map. , It can collect the collection range and locate its position in real time while driving. Based on the real-time collected image, its real-time location information and high-precision map, it can get the environment static map information at the current moment and the participation of each road The trajectory of the person.
  • the way to label the behavior data of road participants is to use the observed real movement trajectory for labeling, and with the passage of time, its own sensor can naturally observe the real movement trajectory of each road participant, that is, In other words, the prediction result given by the prediction model at the current moment can directly observe the true value in the future, that is, it can be observed at a very low cost whether the prediction result of its prediction actually occurs, whether it is correct, and to what extent it is correct.
  • it is an unmanned vehicle and its own sensor is a vehicle sensor.
  • the prediction model uses the known static map information of the environment at the current time t0 and the historical movement trajectories of road participants accumulated and observed before t0, a certain adjacent lane is made.
  • the vehicle sensor can directly observe the true trajectory of a vehicle in the adjacent lane.
  • the behavior data of each road participant can be marked based on the motion trajectory of each road participant observed by its own sensor to obtain the corresponding marking information.
  • the observed motion trajectory of each road participant is used as the label information corresponding to the behavior data of each road participant.
  • a data-driven system for realizing automatic iteration of predictive models can be applied to the field of unmanned vehicles, robotics, and other fields that can realize automatic driving.
  • the aforementioned processor 10 may be a vehicle-mounted processor
  • the collection device installed in the own vehicle may be a collection device of the own vehicle
  • the own sensor may be a vehicle sensor of the own vehicle.
  • the behavior data and the corresponding labeling information need to be sent to the cloud for processing, and because not every road participant’s behavior data is It is valuable for model training, so the data that is valuable for model training can be selected from the behavior data of road participants for processing, that is, the first behavior that meets the preset screening requirements is selected from the behavior data of each road participant Then, the first behavior data and the corresponding annotation information are sent to the cloud 20.
  • the first behavior data can be filtered through a data filter.
  • the processor 10 may also store the first behavior data and corresponding annotation information before sending the first behavior data and corresponding annotation information to the cloud 20.
  • the processor 10 predicts and obtains the future motion trajectory corresponding to the behavior data of each road participant based on the predictive model, and calculates the future motion trajectory corresponding to the behavior data of the road participant and the one observed by its own sensor for each road participant. For the difference between the movement trajectories of the road participants, the behavior data of the road participants whose difference is greater than the preset difference is used as the first behavior data.
  • the prediction model Since there is a big difference between the future motion trajectory of a certain road participant predicted by the prediction model and the observed motion trajectory of the road participant, it means that the prediction model is not yet capable of the future motion of the road participant. The motion trajectory is predicted more accurately. Therefore, the behavior data of these road participants with large differences is valuable data for model training. The behavior data of these road participants with large differences can be used to train a new prediction model. When the training is completed, the new prediction model It is possible to accurately predict the future trajectories of these road participants with large differences.
  • the processor 10 screens the first behavior data, it can predict the future motion trajectory corresponding to the behavior data of each road participant based on the prediction model, and then compare it with the observed motion trajectory, and take the larger difference as The first behavior data, that is, for each road participant, the difference between the future motion trajectory corresponding to the behavior data of the road participant and the motion trajectory of the road participant observed by its own sensor is calculated, and the difference is greater than expected
  • the first behavior data that is, for each road participant, the difference between the future motion trajectory corresponding to the behavior data of the road participant and the motion trajectory of the road participant observed by its own sensor is calculated, and the difference is greater than expected
  • the difference between the future motion trajectory corresponding to the behavior data of the road participant and the motion trajectory of the road participant observed by its own sensor is calculated, and the difference is greater than the preset
  • the behavior data of different road participants is used as the first behavior data to achieve the purpose of screening data that is valuable for model training from the behavior data of road participants.
  • the second type is the first type:
  • the processor 10 determines the behavior category corresponding to the behavior data of each road participant according to the label information corresponding to the behavior data of each road participant, and uses the behavior data of the road participant whose behavior category is a preset category as the first behavior data .
  • the category is an important behavior category, it may cause a traffic accident, that is, the behavior data of road participants corresponding to the important behavior category is valuable data for model training.
  • the behavior data of road participants in important behavior categories can be used as the first behavior data, that is, according to the label information corresponding to the behavior data of each road participant, each road participant is determined
  • the behavior data of the road participant whose behavior category is the preset category is used as the first behavior data.
  • the preset category is lane change behavior or Overtaking behavior.
  • the behavior category corresponding to the behavior data of each road participant is determined according to the label information corresponding to the behavior data of each road participant, and the behavior data of the road participant whose behavior category is the preset category is taken as the first behavior
  • the method of data achieves the purpose of screening data that is valuable for model training from the behavior data of road participants.
  • the third type is the third type.
  • the processor 10 determines the type of each road participant, and uses the behavior data of the road participant whose type is a preset type as the first behavior data.
  • the trajectory of certain types of road participants may have an impact on the trajectory of other road participants. For example, when driving, most vehicles will be far away from large vehicles. The trajectory of other vehicles may affect the trajectory of other vehicles; or, as pedestrians and two-wheelers are disadvantaged groups, most vehicles will avoid pedestrians and two-wheelers to change the trajectory, thus affecting the trajectory of other road participants
  • the behavioral data of certain types of road participants is valuable data for model training.
  • the processor 10 when the processor 10 filters the first behavior data, it can determine the type of each road participant, and use the behavior data of the road participants whose type is the preset type as the first behavior data.
  • the preset type is Large vehicles, pedestrians or two-wheelers.
  • the cloud 20 receives the first behavior data and corresponding annotation information sent by the processor 10, and stores the received first behavior data and corresponding annotation information in the original database.
  • a new prediction model needs to be generated.
  • the training samples used by the training model are required. Therefore, the first behavior data can be extracted according to the preset feature extraction method.
  • the amount of feature extraction, the amount of feature extraction and the corresponding label information are stored as training samples in the training sample library. Among them, developers can change the feature extraction method at any time according to their needs.
  • the training stored in the training sample library after the last training sample is extracted
  • the sample trains the initial network model to obtain the target network model, where the target network model is used to correlate the feature extraction amount used as the training sample with the corresponding annotation information, and because the feature extraction amount is the representative amount of the behavior data of the road participants
  • the annotation information is the annotation amount of the future motion trajectory. Therefore, the target network model is used to correlate the behavior data of road participants with the corresponding future motion trajectory.
  • the training samples stored in the training sample library after the last training sample was extracted can be used to train the initial network model to obtain the target network model. Therefore, the embodiment of the present invention meets the requirements
  • the model training can be automatically triggered without manual participation, which greatly reduces labor costs and has a high degree of automation.
  • the preset automatic trigger condition can be: the number of training samples stored in the training sample library after the last training sample is extracted reaches the preset number threshold, or the time between the time when the last training sample is extracted and the current time reaches The preset duration.
  • the cloud 20 extracts the training sample library The training samples stored after the training samples were extracted last time train the initial network model to obtain the target network model.
  • the cloud 20 Extract the training samples stored after the last training sample was extracted in the training sample library to train the initial network model to obtain the target network model.
  • the target network model needs to be evaluated according to the preset evaluation method to obtain the evaluation result.
  • the target network model is sent to the processor 10.
  • the cloud 20 evaluates the target network model according to the preset evaluation method to obtain the evaluation result.
  • the method of sending the target network model to the processor 10 may be:
  • the cloud 20 predicts the behavior data of each road participant to be predicted in the data set to be predicted based on the target network model to obtain the corresponding future motion trajectory;
  • the size of the evaluation index of the target network model is calculated
  • the target network model is sent to the processor 10.
  • the performance of the target network model is better than the prediction model to be updated by the increase of the evaluation index. Therefore, it is necessary to calculate the size of the evaluation index of the target network model.
  • the behavior data of each road participant to be predicted is predicted to obtain the corresponding future motion trajectory, and then the future motion trajectory corresponding to the behavior data of each road participant to be predicted and each road participant to be predicted observed by its own sensor Calculate the size of the evaluation index of the target network model.
  • the data set to be predicted may be a set of all training samples in the training sample library, or may be a set of other training samples specifically used for evaluation, which is not limited in the embodiment of the present invention.
  • the evaluation index may include the model prediction accuracy rate and/or the absolute error of the model prediction.
  • the increase amount of the evaluation index of the target network model relative to the evaluation index of the prediction model can be calculated.
  • the increase amount meets the preset increase requirement, the performance of the target network model is excellent.
  • the target network model can be sent to the processor 10 at this time.
  • the target network model is obtained, the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the forecast to be updated
  • the target network model is sent to the processor 10 to achieve automatic deployment of the target network model without manual participation, which greatly reduces labor costs and has a high degree of automation.
  • the cloud 20 will send the target network model to the processor 10. Therefore, the processor 10 can update the prediction model as long as it receives the target network model.
  • the prediction model is updated to the target network model. Therefore, after the processor 10 receives the target network model, it can automatically update the prediction model to the target network model, without manual involvement, greatly reducing labor costs, and having a high degree of automation.
  • the processor 10 marks the behavior data of each road participant based on the movement trajectory of each road participant observed by its own sensor to obtain the corresponding labeling information, which realizes automatic labeling. Obtain the annotation information instead of manual offline annotation, and then send the filtered first behavior data and the corresponding annotation information to the cloud 20.
  • the cloud 20 determines that the preset automatic trigger condition is met, it extracts the training sample library from the previous extraction
  • the training samples stored after the training samples train the initial network model to obtain the target network model, which realizes that the model training can be automatically triggered when the preset automatic trigger conditions are met, and then after the target network model is obtained, the evaluation is automatically triggered, and the evaluation is performed
  • the target network model is sent to the processor 10 to achieve the automatic deployment of the target network model.
  • the processor 10 receives the target network model, it can automatically update the prediction model to the target network model. It can be seen from the above that no manual participation is required for data labeling, model training, model evaluation, or model update, which greatly reduces labor costs and has a high degree of automation.
  • the embodiment of the present invention provides a data-driven system for automatic iteration of predictive models. Only one developer can complete the automatic update of the predictive model, which greatly improves the efficiency of research and development and reduces the cost of research and development. .
  • modules in the device in the embodiment may be distributed in the device in the embodiment according to the description of the embodiment, or may be located in one or more devices different from this embodiment with corresponding changes.
  • the modules of the above-mentioned embodiments can be combined into one module or further divided into multiple sub-modules.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Transportation (AREA)
  • Automation & Control Theory (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Mechanical Engineering (AREA)
  • Computer Hardware Design (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • Traffic Control Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Filters That Use Time-Delay Elements (AREA)

Abstract

A data-driven-based system for implementing the automatic iteration of a prediction model, comprising a processor and a cloud. The processor labels behavior data of each road participant on the basis of an observed movement trajectory of each road participant to obtain corresponding labeling information, thus achieving automatic labeling, and then sends filtered first behavior data and the corresponding labeling information to the cloud. When determining that preset automatic trigger conditions are met, the cloud extracts training samples to train an initial network model and obtain a target network model, so that model training is automatically triggered, and then evaluation is automatically triggered. Moreover, when evaluation results meet model update requirements, the cloud sends the target network model to the processor to achieve the automatic deployment of the target network model. The processor automatically updates the prediction model to the target network model after receiving the target network model. Hence, no manual participation is required for data labeling, model training, model evaluation or model updating, and the degree of automation is high.

Description

一种基于数据驱动的实现预测模型自动化迭代的系统A data-driven system for automatic iteration of predictive models 技术领域Technical field
本发明涉及智能驾驶技术领域,具体而言,涉及一种基于数据驱动的实现预测模型自动化迭代的系统。The present invention relates to the technical field of intelligent driving, in particular to a data-driven system for realizing automatic iteration of prediction models.
背景技术Background technique
在自动驾驶场景中,提前预知道路参与者的运动轨迹,有利于自身的安全驾驶,例如在无人车自动驾驶场景中,通过在无人车上设置的预测模型对道路参与者的未来运动轨迹进行预测,随着道路场景的复杂度不断升高,需要时常对预测模型进行更新。In the autonomous driving scenario, knowing the trajectory of road participants in advance is conducive to their own safe driving. For example, in the autonomous driving scenario of unmanned vehicles, the future trajectory of road participants is determined by the predictive model set on the unmanned vehicle To make predictions, as the complexity of the road scene continues to increase, the prediction model needs to be updated from time to time.
目前预测模型的更新方式为:开发人员对大量的道路参与者的行为数据进行离线标注得到标注信息,然后开发人员基于道路参与者的行为数据以及对应的标注信息进行模型训练以及评测得到新的预测模型,然后开发人员将新的预测模型通过网络传输或硬盘连接的方式更新至无人车。The current method of updating the prediction model is: the developer performs offline annotation on the behavior data of a large number of road participants to obtain the annotation information, and then the developer conducts model training and evaluation based on the behavior data of the road participants and the corresponding annotation information to obtain new predictions Model, and then the developer will update the new predictive model to the unmanned vehicle through network transmission or hard disk connection.
可见上述预测模型的更新方式,需要人工进行数据标注以及人工触发模型训练和模型更新,对人的依赖性较高,使得人工成本较大,自动化程度低。It can be seen that the above-mentioned update method of the prediction model requires manual data annotation and manual triggering of model training and model update, which is highly dependent on humans, resulting in high labor costs and low automation.
发明内容Summary of the invention
本发明提供了一种基于数据驱动的实现预测模型自动化迭代的系统,无需人工参与,大大减少人工成本,自动化程度高。具体的技术方案如下。The invention provides a data-driven system for realizing automatic iteration of prediction models without manual participation, greatly reducing labor costs, and having a high degree of automation. The specific technical solution is as follows.
第一方面,本发明提供了一种基于数据驱动的实现预测模型自动化迭代的系统,所述系统包括处理器和云端,所述处理器设置有预测模型,所述预测模型用于预测道路参与者的未来运动轨迹;In the first aspect, the present invention provides a data-driven system for automatic iteration of predictive models. The system includes a processor and a cloud. The processor is provided with a predictive model, and the predictive model is used to predict road participants. Trajectory of future movement;
所述处理器获取每个道路参与者的行为数据,其中,所述行为数据包括当前时刻的环境静态地图信息以及安装于自身的采集设备采集的当前时刻之前的道路参与者的历史运动轨迹,基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息, 从每个道路参与者的行为数据中筛选出符合预设筛选要求的第一行为数据,将所述第一行为数据和对应的标注信息发送至所述云端;The processor obtains the behavior data of each road participant, where the behavior data includes the environmental static map information at the current moment and the historical movement trajectory of the road participant before the current moment collected by the collection device installed in itself, based on The motion trajectory of each road participant observed by its own sensor is labeled with the behavior data of each road participant to obtain the corresponding labeling information, and the behavior data of each road participant is selected from the behavior data of each road participant that meets the preset screening requirements First behavior data, sending the first behavior data and corresponding annotation information to the cloud;
所述云端将接收到的第一行为数据和对应的标注信息存储至原始数据库中,根据预设的特征提取方法对所述第一行为数据进行特征提取得到特征提取量,将所述特征提取量和对应的标注信息作为训练样本存储至训练样本库中,当满足预设自动触发条件时,提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,其中,所述目标网络模型用于使得道路参与者的行为数据与对应的未来运动轨迹相关联,根据预设评测方式对所述目标网络模型进行评测得到评测结果,当所述评测结果满足模型更新要求时,将所述目标网络模型发送至所述处理器;The cloud stores the received first behavior data and corresponding annotation information in the original database, and performs feature extraction on the first behavior data according to a preset feature extraction method to obtain a feature extraction amount, and the feature extraction amount is And the corresponding labeling information are stored as training samples in the training sample library, and when the preset automatic trigger conditions are met, the training samples stored in the training sample library after the last training sample is extracted are extracted to train the initial network model to obtain the target A network model, wherein the target network model is used to correlate the behavior data of road participants with the corresponding future motion trajectory, and evaluate the target network model according to a preset evaluation method to obtain the evaluation result, when the evaluation result When the model update requirement is met, sending the target network model to the processor;
所述处理器接收所述目标网络模型,并将所述预测模型更新为所述目标网络模型。The processor receives the target network model, and updates the prediction model to the target network model.
可选的,所述处理器将自身的传感器所观测到的每个道路参与者的运动轨迹作为每个道路参与者的行为数据对应的标注信息。Optionally, the processor uses the motion trajectory of each road participant observed by its own sensor as label information corresponding to the behavior data of each road participant.
可选的,所述处理器基于所述预测模型预测得到每个道路参与者的行为数据对应的未来运动轨迹,针对每个道路参与者,计算该道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的该道路参与者的运动轨迹之间的差异,将所述差异大于预设差异的道路参与者的行为数据作为第一行为数据;Optionally, the processor predicts and obtains the future motion trajectory corresponding to the behavior data of each road participant based on the prediction model, and for each road participant, calculates the future motion trajectory corresponding to the behavior data of the road participant and The difference between the movement trajectories of the road participants observed by its own sensor, and the behavior data of the road participants whose difference is greater than the preset difference is used as the first behavior data;
或者,or,
所述处理器根据每个道路参与者的行为数据对应的标注信息,确定每个道路参与者的行为数据对应的行为类别,将所述行为类别为预设类别的道路参与者的行为数据作为第一行为数据;The processor determines the behavior category corresponding to the behavior data of each road participant according to the label information corresponding to the behavior data of each road participant, and uses the behavior data of the road participants whose behavior category is a preset category as the first One behavioral data;
或者,or,
所述处理器判断每个道路参与者的类型,将所述类型为预设类型的道路参与者的行为数据作为第一行为数据。The processor determines the type of each road participant, and uses the behavior data of the road participant whose type is a preset type as the first behavior data.
可选的,所述预设类别为变道行为或超车行为。Optionally, the preset category is lane changing behavior or overtaking behavior.
可选的,所述预设类型为大型车辆、行人或两轮车。Optionally, the preset type is a large vehicle, a pedestrian, or a two-wheeled vehicle.
可选的,所述处理器在将所述第一行为数据和对应的标注信息发送至所 述云端之前,存储所述第一行为数据和对应的标注信息。Optionally, the processor stores the first behavior data and corresponding annotation information before sending the first behavior data and corresponding annotation information to the cloud.
可选的,当所述训练样本库中在上一次提取训练样本之后存储的训练样本的数量达到预设数量阈值时,所述云端提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型;Optionally, when the number of training samples stored in the training sample library after the last training sample is extracted reaches a preset number threshold, the cloud extracts the training samples stored in the training sample library after the last training sample is extracted The training samples train the initial network model to obtain the target network model;
或者,or,
当上一次提取训练样本的时刻与当前时刻之间的时长达到预设时长时,所述云端提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型。When the time between the last time the training sample was extracted and the current time reaches the preset time length, the cloud extracts the training samples stored in the training sample library after the last time the training samples were extracted to train the initial network model to obtain the target Network model.
可选的,所述云端基于所述目标网络模型对待预测数据集中的每个待预测道路参与者的行为数据进行预测得到对应的未来运动轨迹;Optionally, the cloud predicts the behavior data of each road participant to be predicted in the data set to be predicted based on the target network model to obtain the corresponding future motion trajectory;
根据每个待预测道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的每个待预测道路参与者的运动轨迹,计算得到所述目标网络模型的评测指标的大小;Calculate the size of the evaluation index of the target network model according to the future motion trajectory corresponding to the behavior data of each road participant to be predicted and the motion trajectory of each road participant to be predicted observed by its own sensor;
计算所述目标网络模型的评测指标相对于所述预测模型的评测指标的涨幅量;Calculating an increase amount of the evaluation index of the target network model relative to the evaluation index of the prediction model;
当所述涨幅量满足预设涨幅要求时,将所述目标网络模型发送至所述处理器。When the amount of increase meets a preset increase requirement, the target network model is sent to the processor.
可选的,所述道路参与者包括车辆和/或行人。Optionally, the road participants include vehicles and/or pedestrians.
由上述内容可知,本发明实施例中处理器基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息,实现了自动标注得到标注信息而不是人工离线标注,然后将筛选出的第一行为数据和对应的标注信息发送至云端,云端在确定满足预设自动触发条件时,就提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,实现了在满足预设自动触发条件时就可以自动触发模型训练,然后在得到目标网络模型后,自动触发评测,并在评测结果满足模型更新要求时,将目标网络模型发送至处理器,达到目标网络模型的自动化部署,最后处理器在接收到目标网络模型后,即可自动将预测模型更新为目标网络模型,综上可见,无论是数据标注、模型训练、模型测评还是模型更新都无需人工参与,大大减少人工成本,自动化 程度高。当然,实施本发明的任一产品或方法并不一定需要同时达到以上所述的所有优点。It can be seen from the foregoing that, in the embodiment of the present invention, the processor marks the behavior data of each road participant based on the motion trajectory of each road participant observed by its own sensor to obtain the corresponding label information, and realizes automatic labeling. Annotate information instead of manual offline annotation, and then send the filtered first behavior data and corresponding annotation information to the cloud. When the cloud determines that the preset automatic trigger condition is met, it will extract the training sample library after the last training sample was extracted The stored training samples train the initial network model to obtain the target network model, which can automatically trigger the model training when the preset automatic trigger conditions are met, and then automatically trigger the evaluation after the target network model is obtained, and the evaluation result meets the model When updating the request, the target network model is sent to the processor to achieve the automatic deployment of the target network model. Finally, after the processor receives the target network model, it can automatically update the prediction model to the target network model. In summary, it can be seen that whether it is Data labeling, model training, model evaluation, or model update do not require manual participation, which greatly reduces labor costs and has a high degree of automation. Of course, implementing any product or method of the present invention does not necessarily need to achieve all the advantages described above at the same time.
本发明实施例的创新点包括:The innovative points of the embodiments of the present invention include:
1、处理器基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息,实现了自动标注得到标注信息而不是人工离线标注。1. The processor labels the behavior data of each road participant to obtain corresponding labeling information based on the motion trajectory of each road participant observed by its own sensor, and realizes automatic labeling to obtain the labeling information instead of manual offline labeling.
2、当满足预设自动触发条件时,就可以提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,因此,本发明实施例在满足预设自动触发条件时,就可以自动触发模型训练,无需人工参与,大大减少人工成本,自动化程度高。2. When the preset automatic trigger conditions are met, the training samples stored in the training sample library after the last training sample was extracted can be used to train the initial network model to obtain the target network model. Therefore, the embodiment of the present invention satisfies the preset When the condition is automatically triggered, the model training can be automatically triggered without manual participation, which greatly reduces labor costs and has a high degree of automation.
3、在得到目标网络模型后,自动触发评测,并在目标网络模型的评测指标相对于预测模型的评测指标的涨幅量满足预设涨幅要求,即目标网络模型的性能优于待更新的预测模型时,将目标网络模型发送至处理器,达到目标网络模型的自动化部署,无需人工参与,大大减少人工成本,自动化程度高。3. After the target network model is obtained, the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the prediction model to be updated When the target network model is sent to the processor, the automatic deployment of the target network model is achieved without manual participation, which greatly reduces labor costs and has a high degree of automation.
4、处理器在接收到目标网络模型后,即可自动将预测模型更新为目标网络模型,无需人工参与,大大减少人工成本,自动化程度高。4. After the processor receives the target network model, it can automatically update the prediction model to the target network model without manual intervention, which greatly reduces labor costs and has a high degree of automation.
5、本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统,仅需一名开发人员即可完成对预测模型的自动更新,大大提高了研发人效,减小了研发成本。5. According to an embodiment of the present invention, a data-driven system for automatic iteration of prediction models is provided. Only one developer can complete the automatic update of the prediction model, which greatly improves the efficiency of research and development and reduces the cost of research and development. .
6、在得到目标网络模型后,自动触发评测,并在目标网络模型的评测指标相对于预测模型的评测指标的涨幅量满足预设涨幅要求,即目标网络模型的性能优于待更新的预测模型时,将目标网络模型发送至处理器,达到目标网络模型的自动化部署,无需人工参与,大大减少人工成本,自动化程度高。6. After the target network model is obtained, the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the prediction model to be updated When the target network model is sent to the processor, the automatic deployment of the target network model is achieved without manual participation, which greatly reduces labor costs and has a high degree of automation.
附图说明Description of the drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单介绍。显而易见地,下面描述中的附图仅仅是本发明的一些实施例。对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1为本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统的结构示意图。FIG. 1 is a schematic structural diagram of a data-driven system for realizing automatic iteration of predictive models according to an embodiment of the present invention.
图1中,10处理器,20云端。In Figure 1, 10 processors, 20 clouds.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整的描述。显然,所描述的实施例仅仅是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有付出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
需要说明的是,本发明实施例及附图中的术语“包括”和“具有”以及它们的任何变形,意图在于覆盖不排他的包含。例如包含的一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其他步骤或单元。It should be noted that the terms "including" and "having" in the embodiments of the present invention and the drawings and any variations thereof are intended to cover non-exclusive inclusions. For example, the process, method, system, product, or device that contains a series of steps or units is not limited to the listed steps or units, but optionally also includes unlisted steps or units, or optionally also includes Other steps or units inherent in these processes, methods, products or equipment.
本发明实施例公开了一种基于数据驱动的实现预测模型自动化迭代的系统,能够自动化进行预测模型的更新,无需人工参与,大大减少人工成本,自动化程度高。下面对本发明实施例进行详细说明。The embodiment of the present invention discloses a data-driven system for realizing automatic iteration of a prediction model, which can automatically update the prediction model without manual participation, greatly reduces labor costs, and has a high degree of automation. The embodiments of the present invention will be described in detail below.
图1为本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统的结构示意图。参见图1,本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统包括处理器10和云端20,处理器10和云端20通信连接,处理器10设置有预测模型,预测模型用于预测道路参与者的未来运动轨迹,其中,道路参与者包括车辆和/或行人。FIG. 1 is a schematic structural diagram of a data-driven system for realizing automatic iteration of predictive models according to an embodiment of the present invention. 1, a data-driven system for realizing automatic iteration of prediction models provided by an embodiment of the present invention includes a processor 10 and a cloud 20, the processor 10 and the cloud 20 are in communication connection, and the processor 10 is provided with a prediction model and a prediction model. It is used to predict the future trajectory of road participants, where road participants include vehicles and/or pedestrians.
本发明实施例中为了提高自动化程度,通过处理器10自动获取每个道路参与者的行为数据并在线进行标注,其中,行为数据包括当前时刻的环境静态地图信息以及安装于自身的采集设备采集的当前时刻之前的道路参与者的历史运动轨迹。In the embodiment of the present invention, in order to improve the degree of automation, the processor 10 automatically acquires the behavior data of each road participant and makes online annotations. The behavior data includes the environmental static map information at the current moment and the data collected by the collection equipment installed in itself. The historical movement trajectory of road participants before the current moment.
在一种实现方式中,处理器10获取每个道路参与者的行为数据的方式可以为:自身安装有采集设备和定位系统,例如摄像机和全球定位系统,因此,自身在行驶过程中可以实时对采集范围进行采集并对自身位置进行定位,基于实时采集得到的图像以及自身的实时位置信息可以得到当前时刻的环境静 态地图信息以及每个道路参与者的运动轨迹,其中,环境静态地图信息可以包括道路标线信息和道路指示牌信息,例如:斑马线和红路灯信息。In an implementation manner, the processor 10 can obtain the behavior data of each road participant as follows: it is equipped with collection equipment and positioning system, such as a camera and a global positioning system, so that it can perform real-time monitoring during driving. The collection range is collected and its own location is located. Based on the real-time collected images and its own real-time location information, the current environmental static map information and the movement trajectory of each road participant can be obtained. The environmental static map information can include Road marking information and road sign information, such as: zebra crossing and red street light information.
在另一种实现方式中,处理器10获取每个道路参与者的行为数据的方式可以为:自身安装有采集设备、定位系统和高精度地图,例如摄像机、全球定位系统和高精度地图,因此,自身在行驶过程中可以实时对采集范围进行采集并对自身位置进行定位,基于实时采集得到的图像、自身的实时位置信息以及高精度地图可以得到当前时刻的环境静态地图信息以及每个道路参与者的运动轨迹。In another implementation manner, the way that the processor 10 obtains the behavior data of each road participant may be as follows: it is installed with a collection device, a positioning system, and a high-precision map, such as a camera, a global positioning system, and a high-precision map. , It can collect the collection range and locate its position in real time while driving. Based on the real-time collected image, its real-time location information and high-precision map, it can get the environment static map information at the current moment and the participation of each road The trajectory of the person.
由于对道路参与者的行为数据进行标注的方式就是使用观测到的真实的运动轨迹进行标注,而随着时间的推移,自身的传感器自然可以观测到各个道路参与者的真实的运动轨迹,也就是说,预测模型在当前时刻给出的预测结果,可在未来时刻直接观测到真值,即可以极低成本观测到其预测的预测结果是否真实发生、是否正确以及在多大程度上是正确的。例如:自身为无人车,自身的传感器为车辆传感器,当预测模型根据当前时刻t0已知的环境静态地图信息与t0前累积观测到的道路参与者的历史运动轨迹,做出相邻车道某车辆将会在未来时刻t1移动至自车正前方5米的预测,随着时间推移,在真实t1时刻,车辆传感器可直接观测到相邻车道某辆车的真实的运动轨迹。此时,就可以基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息,示例性的,处理器10将自身的传感器所观测到的每个道路参与者的运动轨迹作为每个道路参与者的行为数据对应的标注信息。Because the way to label the behavior data of road participants is to use the observed real movement trajectory for labeling, and with the passage of time, its own sensor can naturally observe the real movement trajectory of each road participant, that is, In other words, the prediction result given by the prediction model at the current moment can directly observe the true value in the future, that is, it can be observed at a very low cost whether the prediction result of its prediction actually occurs, whether it is correct, and to what extent it is correct. For example: it is an unmanned vehicle and its own sensor is a vehicle sensor. When the prediction model uses the known static map information of the environment at the current time t0 and the historical movement trajectories of road participants accumulated and observed before t0, a certain adjacent lane is made. It is predicted that the vehicle will move to 5 meters directly in front of the vehicle at time t1 in the future. As time goes by, at time t1, the vehicle sensor can directly observe the true trajectory of a vehicle in the adjacent lane. At this time, the behavior data of each road participant can be marked based on the motion trajectory of each road participant observed by its own sensor to obtain the corresponding marking information. The observed motion trajectory of each road participant is used as the label information corresponding to the behavior data of each road participant.
需要说明的是,本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统可以应用于无人车领域、机器人领域以及其他可实现自动驾驶的领域,当应用于无人车领域时,上述处理器10可以为车载处理器,安装于自身的采集设备可以是自车的采集设备,自身的传感器可以为自车的车辆传感器。It should be noted that a data-driven system for realizing automatic iteration of predictive models provided by the embodiments of the present invention can be applied to the field of unmanned vehicles, robotics, and other fields that can realize automatic driving. When applied to the field of unmanned vehicles, At this time, the aforementioned processor 10 may be a vehicle-mounted processor, the collection device installed in the own vehicle may be a collection device of the own vehicle, and the own sensor may be a vehicle sensor of the own vehicle.
由于处理器10的计算能力有限,在对道路参与者的行为数据进行标注后,需要将行为数据和对应的标注信息发送至云端进行处理,又由于并不是每个道路参与者的行为数据都是对模型训练有价值的,因此可以从道路参与者的行为数据中选取对模型训练有价值的数据进行处理,即从每个道路参与者的行为数据中筛选出符合预设筛选要求的第一行为数据,然后将第一行为数据 和对应的标注信息发送至云端20,示例性的,可以通过数据筛选器筛选出第一行为数据。Due to the limited computing power of the processor 10, after labeling the behavior data of road participants, the behavior data and the corresponding labeling information need to be sent to the cloud for processing, and because not every road participant’s behavior data is It is valuable for model training, so the data that is valuable for model training can be selected from the behavior data of road participants for processing, that is, the first behavior that meets the preset screening requirements is selected from the behavior data of each road participant Then, the first behavior data and the corresponding annotation information are sent to the cloud 20. Exemplarily, the first behavior data can be filtered through a data filter.
为了避免数据丢失,处理器10在将第一行为数据和对应的标注信息发送至云端20之前,还可以存储第一行为数据和对应的标注信息。In order to avoid data loss, the processor 10 may also store the first behavior data and corresponding annotation information before sending the first behavior data and corresponding annotation information to the cloud 20.
其中,筛选出第一行为数据的方式有多种,包括但不限于以下几种:Among them, there are many ways to filter out the first-line data, including but not limited to the following:
第一种:The first:
处理器10基于预测模型预测得到每个道路参与者的行为数据对应的未来运动轨迹,针对每个道路参与者,计算该道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的该道路参与者的运动轨迹之间的差异,将差异大于预设差异的道路参与者的行为数据作为第一行为数据。The processor 10 predicts and obtains the future motion trajectory corresponding to the behavior data of each road participant based on the predictive model, and calculates the future motion trajectory corresponding to the behavior data of the road participant and the one observed by its own sensor for each road participant. For the difference between the movement trajectories of the road participants, the behavior data of the road participants whose difference is greater than the preset difference is used as the first behavior data.
由于如果通过预测模型预测得到的某一道路参与者的未来运动轨迹与所观测到的该道路参与者的运动轨迹之间的差异较大,就说明预测模型还不能够对该道路参与者的未来运动轨迹进行较为准确的预测。因此,这些差异较大的道路参与者的行为数据就是对模型训练有价值的数据,可以通过这些差异较大的道路参与者的行为数据训练新的预测模型,当训练完成后,新的预测模型就可以对这些差异较大的道路参与者的未来运动轨迹进行准确的预测。Since there is a big difference between the future motion trajectory of a certain road participant predicted by the prediction model and the observed motion trajectory of the road participant, it means that the prediction model is not yet capable of the future motion of the road participant. The motion trajectory is predicted more accurately. Therefore, the behavior data of these road participants with large differences is valuable data for model training. The behavior data of these road participants with large differences can be used to train a new prediction model. When the training is completed, the new prediction model It is possible to accurately predict the future trajectories of these road participants with large differences.
因此,在处理器10筛选第一行为数据时,可以基于预测模型预测得到每个道路参与者的行为数据对应的未来运动轨迹,然后再与观测到的运动轨迹进行对比,将差异较大的作为第一行为数据,即针对每个道路参与者,计算该道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的该道路参与者的运动轨迹之间的差异,将差异大于预设差异的道路参与者的行为数据作为第一行为数据。Therefore, when the processor 10 screens the first behavior data, it can predict the future motion trajectory corresponding to the behavior data of each road participant based on the prediction model, and then compare it with the observed motion trajectory, and take the larger difference as The first behavior data, that is, for each road participant, the difference between the future motion trajectory corresponding to the behavior data of the road participant and the motion trajectory of the road participant observed by its own sensor is calculated, and the difference is greater than expected Let the behavior data of different road participants be the first behavior data.
由此,针对每个道路参与者,通过计算该道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的该道路参与者的运动轨迹之间的差异,并将差异大于预设差异的道路参与者的行为数据作为第一行为数据的方式达到从道路参与者的行为数据中筛选出对模型训练有价值的数据的目的。Therefore, for each road participant, the difference between the future motion trajectory corresponding to the behavior data of the road participant and the motion trajectory of the road participant observed by its own sensor is calculated, and the difference is greater than the preset The behavior data of different road participants is used as the first behavior data to achieve the purpose of screening data that is valuable for model training from the behavior data of road participants.
第二种:The second type:
处理器10根据每个道路参与者的行为数据对应的标注信息,确定每个道路参与者的行为数据对应的行为类别,将行为类别为预设类别的道路参与者 的行为数据作为第一行为数据。The processor 10 determines the behavior category corresponding to the behavior data of each road participant according to the label information corresponding to the behavior data of each road participant, and uses the behavior data of the road participant whose behavior category is a preset category as the first behavior data .
由于道路参与者的行为有多类,其中某些行为类别是较为重要的,如果通过预测模型预测得到的某一道路参与者的未来运动轨迹不准确,进一步导致无法准确得到该道路参与者的行为类别是否为重要的行为类别,则有可能造成交通事故,也就是说,重要的行为类别对应的道路参与者的行为数据就是对模型训练有价值的数据。Since there are many types of behaviors of road participants, some of which are more important. If the future trajectory of a road participant predicted by the predictive model is inaccurate, it will further lead to the inaccuracy of the road participant’s behavior. Whether the category is an important behavior category, it may cause a traffic accident, that is, the behavior data of road participants corresponding to the important behavior category is valuable data for model training.
因此,在处理器10筛选第一行为数据时,可以将重要的行为类别的道路参与者的行为数据作为第一行为数据,即根据每个道路参与者的行为数据对应的标注信息,确定每个道路参与者的行为数据对应的行为类别,在确定了行为类别后,将行为类别为预设类别的道路参与者的行为数据作为第一行为数据,示例性的,预设类别为变道行为或超车行为。Therefore, when the processor 10 screens the first behavior data, the behavior data of road participants in important behavior categories can be used as the first behavior data, that is, according to the label information corresponding to the behavior data of each road participant, each road participant is determined The behavior category corresponding to the behavior data of the road participant. After the behavior category is determined, the behavior data of the road participant whose behavior category is the preset category is used as the first behavior data. Illustratively, the preset category is lane change behavior or Overtaking behavior.
由此,通过根据每个道路参与者的行为数据对应的标注信息,确定每个道路参与者的行为数据对应的行为类别,将行为类别为预设类别的道路参与者的行为数据作为第一行为数据的方式,达到从道路参与者的行为数据中筛选出对模型训练有价值的数据的目的。Therefore, the behavior category corresponding to the behavior data of each road participant is determined according to the label information corresponding to the behavior data of each road participant, and the behavior data of the road participant whose behavior category is the preset category is taken as the first behavior The method of data achieves the purpose of screening data that is valuable for model training from the behavior data of road participants.
第三种:The third type:
处理器10判断每个道路参与者的类型,将类型为预设类型的道路参与者的行为数据作为第一行为数据。The processor 10 determines the type of each road participant, and uses the behavior data of the road participant whose type is a preset type as the first behavior data.
由于道路参与者的类型有多种,某些类型的道路参与者的运动轨迹可能对其他道路参与者的运动轨迹产生影响,例如:在行驶时,大部分车辆会远离大型车辆,因此,大型车辆的运动轨迹可能会影响其他车辆的运动轨迹;或者,行人和两轮车作为弱势群体,大部分车辆会避让行人和两轮车从而改变运动轨迹,因此,对其他道路参与者的运动轨迹产生影响的某些类型的道路参与者的行为数据就是对模型训练有价值的数据。Since there are many types of road participants, the trajectory of certain types of road participants may have an impact on the trajectory of other road participants. For example, when driving, most vehicles will be far away from large vehicles. The trajectory of other vehicles may affect the trajectory of other vehicles; or, as pedestrians and two-wheelers are disadvantaged groups, most vehicles will avoid pedestrians and two-wheelers to change the trajectory, thus affecting the trajectory of other road participants The behavioral data of certain types of road participants is valuable data for model training.
因此,在处理器10筛选第一行为数据时,可以判断每个道路参与者的类型,将类型为预设类型的道路参与者的行为数据作为第一行为数据,示例性的,预设类型为大型车辆、行人或两轮车。Therefore, when the processor 10 filters the first behavior data, it can determine the type of each road participant, and use the behavior data of the road participants whose type is the preset type as the first behavior data. For example, the preset type is Large vehicles, pedestrians or two-wheelers.
由此,通过判断每个道路参与者的类型,将类型为预设类型的道路参与者的行为数据作为第一行为数据的方式,达到从道路参与者的行为数据中筛选出对模型训练有价值的数据的目的。Therefore, by judging the type of each road participant, and taking the behavior data of the road participants of the preset type as the first behavior data, it is possible to filter out the behavior data of road participants that is valuable for model training. The purpose of the data.
云端20接收处理器10发送的第一行为数据和对应的标注信息,并将接收到的第一行为数据和对应的标注信息存储至原始数据库中。为了对预测模型进行更新,需要生成新的预测模型,为了生成新的预测模型,就需要训练模型所使用的训练样本,因此,可以根据预设的特征提取方法对第一行为数据进行特征提取得到特征提取量,将特征提取量和对应的标注信息作为训练样本存储至训练样本库中。其中,开发人员可以根据需求随时更换特征提取方法。The cloud 20 receives the first behavior data and corresponding annotation information sent by the processor 10, and stores the received first behavior data and corresponding annotation information in the original database. In order to update the prediction model, a new prediction model needs to be generated. In order to generate a new prediction model, the training samples used by the training model are required. Therefore, the first behavior data can be extracted according to the preset feature extraction method. The amount of feature extraction, the amount of feature extraction and the corresponding label information are stored as training samples in the training sample library. Among them, developers can change the feature extraction method at any time according to their needs.
由于每次从处理器10发送的第一行为数据和对应的标注信息的数量是有限的,众所周知,模型训练是需要大量的训练样本的,如果仅通过一次或几次存到训练样本库中的训练样本进行模型训练是无法得到较好的训练结果的,因此,可以设定预设自动触发条件,当满足预设自动触发条件时,提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,其中,目标网络模型用于使得作为训练样本的特征提取量和对应的标注信息相关联,又由于特征提取量是道路参与者的行为数据的表征量,标注信息是未来运动轨迹的标注量,因此,目标网络模型用于使得道路参与者的行为数据与对应的未来运动轨迹相关联。Since the number of first behavior data and corresponding annotation information sent from the processor 10 each time is limited, it is well known that model training requires a large number of training samples. If only one or several passes are stored in the training sample library Model training with training samples cannot get good training results. Therefore, preset automatic trigger conditions can be set. When the preset automatic trigger conditions are met, the training stored in the training sample library after the last training sample is extracted The sample trains the initial network model to obtain the target network model, where the target network model is used to correlate the feature extraction amount used as the training sample with the corresponding annotation information, and because the feature extraction amount is the representative amount of the behavior data of the road participants The annotation information is the annotation amount of the future motion trajectory. Therefore, the target network model is used to correlate the behavior data of road participants with the corresponding future motion trajectory.
由此,当满足预设自动触发条件时,就可以提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,因此,本发明实施例在满足预设自动触发条件时,就可以自动触发模型训练,无需人工参与,大大减少人工成本,自动化程度高。Thus, when the preset automatic trigger conditions are met, the training samples stored in the training sample library after the last training sample was extracted can be used to train the initial network model to obtain the target network model. Therefore, the embodiment of the present invention meets the requirements When the automatic trigger condition is set, the model training can be automatically triggered without manual participation, which greatly reduces labor costs and has a high degree of automation.
其中,预设自动触发条件可以为:训练样本库中在上一次提取训练样本之后存储的训练样本的数量达到预设数量阈值,或者,上一次提取训练样本的时刻与当前时刻之间的时长达到预设时长。Among them, the preset automatic trigger condition can be: the number of training samples stored in the training sample library after the last training sample is extracted reaches the preset number threshold, or the time between the time when the last training sample is extracted and the current time reaches The preset duration.
当训练样本库中在上一次提取训练样本之后存储的训练样本的数量达到预设数量阈值时,说明训练样本的数量已经达到可以进行模型训练的数据量,此时,云端20提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型。When the number of training samples stored in the training sample library after the last extraction of the training samples reaches the preset number threshold, it means that the number of training samples has reached the amount of data that can be used for model training. At this time, the cloud 20 extracts the training sample library The training samples stored after the training samples were extracted last time train the initial network model to obtain the target network model.
当上一次提取训练样本的时刻与当前时刻之间的时长达到预设时长时,说明训练样本的数量随着时间增长越来越多,已经达到可以进行模型训练的数据量,此时,云端20提取训练样本库中在上一次提取训练样本之后存储的 训练样本对初始网络模型进行训练得到目标网络模型。When the time between the last time the training sample was extracted and the current time reaches the preset time, it means that the number of training samples has grown more and more over time, and it has reached the amount of data that can be used for model training. At this time, the cloud 20 Extract the training samples stored after the last training sample was extracted in the training sample library to train the initial network model to obtain the target network model.
如果要更新预测模型,则需要新的预测模型相比待更新的预测模型有优势,因此,在得到目标网络模型后,需要根据预设评测方式对目标网络模型进行评测得到评测结果,当评测结果满足模型更新要求时,将目标网络模型发送至处理器10。If you want to update the prediction model, you need the new prediction model to have advantages over the prediction model to be updated. Therefore, after the target network model is obtained, the target network model needs to be evaluated according to the preset evaluation method to obtain the evaluation result. When the model update requirement is met, the target network model is sent to the processor 10.
其中,云端20根据预设评测方式对目标网络模型进行评测得到评测结果,当评测结果满足模型更新要求时,将目标网络模型发送至处理器10的方式可以为:Among them, the cloud 20 evaluates the target network model according to the preset evaluation method to obtain the evaluation result. When the evaluation result meets the model update requirement, the method of sending the target network model to the processor 10 may be:
云端20基于目标网络模型对待预测数据集中的每个待预测道路参与者的行为数据进行预测得到对应的未来运动轨迹;The cloud 20 predicts the behavior data of each road participant to be predicted in the data set to be predicted based on the target network model to obtain the corresponding future motion trajectory;
根据每个待预测道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的每个待预测道路参与者的运动轨迹,计算得到目标网络模型的评测指标的大小;According to the future motion trajectory corresponding to the behavior data of each road participant to be predicted and the motion trajectory of each road participant to be predicted observed by its own sensor, the size of the evaluation index of the target network model is calculated;
计算目标网络模型的评测指标相对于预测模型的评测指标的涨幅量;Calculate the increase amount of the evaluation index of the target network model relative to the evaluation index of the prediction model;
当涨幅量满足预设涨幅要求时,将目标网络模型发送至处理器10。When the amount of increase meets the preset increase requirement, the target network model is sent to the processor 10.
通常通过评测指标的涨幅量来评测目标网络模型的性能是否优于待更新的预测模型,因此,需要计算得到目标网络模型的评测指标的大小,即云端20基于目标网络模型对待预测数据集中的每个待预测道路参与者的行为数据进行预测得到对应的未来运动轨迹,然后根据每个待预测道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的每个待预测道路参与者的运动轨迹,计算得到目标网络模型的评测指标的大小。Usually, the performance of the target network model is better than the prediction model to be updated by the increase of the evaluation index. Therefore, it is necessary to calculate the size of the evaluation index of the target network model. The behavior data of each road participant to be predicted is predicted to obtain the corresponding future motion trajectory, and then the future motion trajectory corresponding to the behavior data of each road participant to be predicted and each road participant to be predicted observed by its own sensor Calculate the size of the evaluation index of the target network model.
示例性的,待预测数据集可以为训练样本库中的所有训练样本的集合,也可以为其他专门用于评测的训练样本的集合,本发明实施例对此并不做任何限制。评测指标可以包括模型预测准确率和/或模型预测绝对误差。Exemplarily, the data set to be predicted may be a set of all training samples in the training sample library, or may be a set of other training samples specifically used for evaluation, which is not limited in the embodiment of the present invention. The evaluation index may include the model prediction accuracy rate and/or the absolute error of the model prediction.
在计算得到目标网络模型的评测指标的大小后,即可计算目标网络模型的评测指标相对于预测模型的评测指标的涨幅量,当涨幅量满足预设涨幅要求时,说明目标网络模型的性能优于待更新的预测模型,此时,可以将目标网络模型发送至处理器10。After calculating the size of the evaluation index of the target network model, the increase amount of the evaluation index of the target network model relative to the evaluation index of the prediction model can be calculated. When the increase amount meets the preset increase requirement, the performance of the target network model is excellent. For the prediction model to be updated, the target network model can be sent to the processor 10 at this time.
由此,在得到目标网络模型后,自动触发评测,并在目标网络模型的评 测指标相对于预测模型的评测指标的涨幅量满足预设涨幅要求,即目标网络模型的性能优于待更新的预测模型时,将目标网络模型发送至处理器10,达到目标网络模型的自动化部署,无需人工参与,大大减少人工成本,自动化程度高。Therefore, after the target network model is obtained, the evaluation is automatically triggered, and the increase in the evaluation index of the target network model relative to the evaluation index of the prediction model meets the preset increase requirement, that is, the performance of the target network model is better than the forecast to be updated When modeling, the target network model is sent to the processor 10 to achieve automatic deployment of the target network model without manual participation, which greatly reduces labor costs and has a high degree of automation.
由于目标网络模型的性能优于待更新的预测模型时,云端20才会将目标网络模型发送至处理器10,因此,处理器10只要接收到目标网络模型,就可对预测模型进行更新,即将预测模型更新为目标网络模型。由此,处理器10在接收到目标网络模型后,即可自动将预测模型更新为目标网络模型,无需人工参与,大大减少人工成本,自动化程度高。Since the performance of the target network model is better than the prediction model to be updated, the cloud 20 will send the target network model to the processor 10. Therefore, the processor 10 can update the prediction model as long as it receives the target network model. The prediction model is updated to the target network model. Therefore, after the processor 10 receives the target network model, it can automatically update the prediction model to the target network model, without manual involvement, greatly reducing labor costs, and having a high degree of automation.
由上述内容可知,本发明实施例中处理器10基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息,实现了自动标注得到标注信息而不是人工离线标注,然后将筛选出的第一行为数据和对应的标注信息发送至云端20,云端20在确定满足预设自动触发条件时,就提取训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,实现了在满足预设自动触发条件时就可以自动触发模型训练,然后在得到目标网络模型后,自动触发评测,并在评测结果满足模型更新要求时,将目标网络模型发送至处理器10,达到目标网络模型的自动化部署,最后处理器10在接收到目标网络模型后,即可自动将预测模型更新为目标网络模型,综上可见,无论是数据标注、模型训练、模型测评还是模型更新都无需人工参与,大大减少人工成本,自动化程度高。It can be seen from the above content that in the embodiment of the present invention, the processor 10 marks the behavior data of each road participant based on the movement trajectory of each road participant observed by its own sensor to obtain the corresponding labeling information, which realizes automatic labeling. Obtain the annotation information instead of manual offline annotation, and then send the filtered first behavior data and the corresponding annotation information to the cloud 20. When the cloud 20 determines that the preset automatic trigger condition is met, it extracts the training sample library from the previous extraction The training samples stored after the training samples train the initial network model to obtain the target network model, which realizes that the model training can be automatically triggered when the preset automatic trigger conditions are met, and then after the target network model is obtained, the evaluation is automatically triggered, and the evaluation is performed When the result meets the model update requirements, the target network model is sent to the processor 10 to achieve the automatic deployment of the target network model. Finally, after the processor 10 receives the target network model, it can automatically update the prediction model to the target network model. It can be seen from the above that no manual participation is required for data labeling, model training, model evaluation, or model update, which greatly reduces labor costs and has a high degree of automation.
并且,本发明实施例提供的一种基于数据驱动的实现预测模型自动化迭代的系统,仅需一名开发人员即可完成对预测模型的自动更新,大大提高了研发人效,减小了研发成本。In addition, the embodiment of the present invention provides a data-driven system for automatic iteration of predictive models. Only one developer can complete the automatic update of the predictive model, which greatly improves the efficiency of research and development and reduces the cost of research and development. .
本领域普通技术人员可以理解:附图只是一个实施例的示意图,附图中的模块或流程并不一定是实施本发明所必须的。Those of ordinary skill in the art can understand that the drawings are only schematic diagrams of an embodiment, and the modules or processes in the drawings are not necessarily necessary for implementing the present invention.
本领域普通技术人员可以理解:实施例中的装置中的模块可以按照实施例描述分布于实施例的装置中,也可以进行相应变化位于不同于本实施例的一个或多个装置中。上述实施例的模块可以合并为一个模块,也可以进一步拆分成多个子模块。A person of ordinary skill in the art can understand that the modules in the device in the embodiment may be distributed in the device in the embodiment according to the description of the embodiment, or may be located in one or more devices different from this embodiment with corresponding changes. The modules of the above-mentioned embodiments can be combined into one module or further divided into multiple sub-modules.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions recorded in the foregoing embodiments are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (9)

  1. 一种基于数据驱动的实现预测模型自动化迭代的系统,其特征在于,所述系统包括处理器和云端,所述处理器设置有预测模型,所述预测模型用于预测道路参与者的未来运动轨迹;A data-driven system for realizing automatic iteration of prediction models, characterized in that the system includes a processor and a cloud, the processor is provided with a prediction model, and the prediction model is used to predict the future trajectory of road participants ;
    所述处理器获取每个道路参与者的行为数据,其中,所述行为数据包括当前时刻的环境静态地图信息以及安装于自身的采集设备采集的当前时刻之前的道路参与者的历史运动轨迹,基于自身的传感器所观测到的每个道路参与者的运动轨迹对每个道路参与者的行为数据进行标注得到对应的标注信息,从每个道路参与者的行为数据中筛选出符合预设筛选要求的第一行为数据,将所述第一行为数据和对应的标注信息发送至所述云端;The processor obtains the behavior data of each road participant, where the behavior data includes the environmental static map information at the current moment and the historical movement trajectory of the road participant before the current moment collected by the collection device installed in itself, based on The motion trajectory of each road participant observed by its own sensor is labeled with the behavior data of each road participant to obtain the corresponding label information, and the behavior data of each road participant is selected from the behavior data of each road participant that meets the preset screening requirements First behavior data, sending the first behavior data and corresponding annotation information to the cloud;
    所述云端将接收到的第一行为数据和对应的标注信息存储至原始数据库中,根据预设的特征提取方法对所述第一行为数据进行特征提取得到特征提取量,将所述特征提取量和对应的标注信息作为训练样本存储至训练样本库中,当满足预设自动触发条件时,提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型,其中,所述目标网络模型用于使得道路参与者的行为数据与对应的未来运动轨迹相关联,根据预设评测方式对所述目标网络模型进行评测得到评测结果,当所述评测结果满足模型更新要求时,将所述目标网络模型发送至所述处理器;The cloud stores the received first behavior data and corresponding annotation information in the original database, and performs feature extraction on the first behavior data according to a preset feature extraction method to obtain a feature extraction amount, and the feature extraction amount is And the corresponding labeling information are stored as training samples in the training sample library, and when the preset automatic trigger conditions are met, the training samples stored in the training sample library after the last training sample is extracted are extracted to train the initial network model to obtain the target A network model, wherein the target network model is used to correlate the behavior data of road participants with the corresponding future motion trajectory, and evaluate the target network model according to a preset evaluation method to obtain the evaluation result, when the evaluation result When the model update requirement is met, sending the target network model to the processor;
    所述处理器接收所述目标网络模型,并将所述预测模型更新为所述目标网络模型。The processor receives the target network model, and updates the prediction model to the target network model.
  2. 如权利要求1所述的系统,其特征在于,所述处理器将自身的传感器所观测到的每个道路参与者的运动轨迹作为每个道路参与者的行为数据对应的标注信息。The system according to claim 1, wherein the processor uses the movement trajectory of each road participant observed by its own sensor as the label information corresponding to the behavior data of each road participant.
  3. 如权利要求1所述的系统,其特征在于,所述处理器基于所述预测模型预测得到每个道路参与者的行为数据对应的未来运动轨迹,针对每个道路参与者,计算该道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的该道路参与者的运动轨迹之间的差异,将所述差异大于预设差异的道路参与者的行为数据作为第一行为数据;The system of claim 1, wherein the processor predicts and obtains the future motion trajectory corresponding to the behavior data of each road participant based on the prediction model, and calculates the road participant for each road participant The difference between the future motion trajectory corresponding to the behavior data of and the motion trajectory of the road participant observed by its own sensor, and the behavior data of the road participant whose difference is greater than the preset difference is used as the first behavior data;
    或者,or,
    所述处理器根据每个道路参与者的行为数据对应的标注信息,确定每个道路参与者的行为数据对应的行为类别,将所述行为类别为预设类别的道路参与者的行为数据作为第一行为数据;The processor determines the behavior category corresponding to the behavior data of each road participant according to the label information corresponding to the behavior data of each road participant, and uses the behavior data of the road participants whose behavior category is a preset category as the first One behavioral data;
    或者,or,
    所述处理器判断每个道路参与者的类型,将所述类型为预设类型的道路参与者的行为数据作为第一行为数据。The processor determines the type of each road participant, and uses the behavior data of the road participant whose type is a preset type as the first behavior data.
  4. 如权利要求3所述的系统,其特征在于,所述预设类别为变道行为或超车行为。The system according to claim 3, wherein the preset category is lane changing behavior or overtaking behavior.
  5. 如权利要求3所述的系统,其特征在于,所述预设类型为大型车辆、行人或两轮车。The system of claim 3, wherein the preset type is a large vehicle, a pedestrian, or a two-wheeled vehicle.
  6. 如权利要求1所述的系统,其特征在于,所述处理器在将所述第一行为数据和对应的标注信息发送至所述云端之前,存储所述第一行为数据和对应的标注信息。The system according to claim 1, wherein the processor stores the first behavior data and corresponding annotation information before sending the first behavior data and corresponding annotation information to the cloud.
  7. 如权利要求1所述的系统,其特征在于,当所述训练样本库中在上一次提取训练样本之后存储的训练样本的数量达到预设数量阈值时,所述云端提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型;The system according to claim 1, wherein when the number of training samples stored in the training sample library after the last training sample is extracted reaches a preset number threshold, the cloud extracts the training sample from the training sample library. Training the initial network model with the training samples stored after the training samples were extracted last time to obtain the target network model;
    或者,or,
    当上一次提取训练样本的时刻与当前时刻之间的时长达到预设时长时,所述云端提取所述训练样本库中在上一次提取训练样本之后存储的训练样本对初始网络模型进行训练得到目标网络模型。When the time between the last time the training sample was extracted and the current time reaches the preset time length, the cloud extracts the training samples stored in the training sample library after the last time the training samples were extracted to train the initial network model to obtain the target Network model.
  8. 如权利要求1所述的系统,其特征在于,所述云端基于所述目标网络模型对待预测数据集中的每个待预测道路参与者的行为数据进行预测得到对应的未来运动轨迹;The system according to claim 1, wherein the cloud predicts the behavior data of each road participant to be predicted in the data set to be predicted based on the target network model to obtain the corresponding future motion trajectory;
    根据每个待预测道路参与者的行为数据对应的未来运动轨迹与自身的传感器所观测到的每个待预测道路参与者的运动轨迹,计算得到所述目标网络模型的评测指标的大小;Calculate the size of the evaluation index of the target network model according to the future motion trajectory corresponding to the behavior data of each road participant to be predicted and the motion trajectory of each road participant to be predicted observed by its own sensor;
    计算所述目标网络模型的评测指标相对于所述预测模型的评测指标的涨幅量;Calculating an increase amount of the evaluation index of the target network model relative to the evaluation index of the prediction model;
    当所述涨幅量满足预设涨幅要求时,将所述目标网络模型发送至所述处理器。When the amount of increase meets a preset increase requirement, the target network model is sent to the processor.
  9. 如权利要求1所述的系统,其特征在于,所述道路参与者包括车辆和/或行人。The system of claim 1, wherein the road participants include vehicles and/or pedestrians.
PCT/CN2020/094133 2020-05-29 2020-06-03 Data-driven-based system for implementing automatic iteration of prediction model WO2021237768A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
DE112020003091.1T DE112020003091T5 (en) 2020-05-29 2020-06-03 System for realizing automatic iteration of predictive model based on data operation

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010471824.2 2020-05-29
CN202010471824.2A CN113799793B (en) 2020-05-29 2020-05-29 System for realizing automatic iteration of prediction model based on data driving

Publications (1)

Publication Number Publication Date
WO2021237768A1 true WO2021237768A1 (en) 2021-12-02

Family

ID=78745307

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/094133 WO2021237768A1 (en) 2020-05-29 2020-06-03 Data-driven-based system for implementing automatic iteration of prediction model

Country Status (3)

Country Link
CN (1) CN113799793B (en)
DE (1) DE112020003091T5 (en)
WO (1) WO2021237768A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114742236A (en) * 2022-04-24 2022-07-12 重庆长安汽车股份有限公司 Environmental vehicle behavior prediction model training method and system
CN116721399B (en) * 2023-07-26 2023-11-14 之江实验室 Point cloud target detection method and device for quantitative perception training
CN116665025B (en) * 2023-07-31 2023-11-14 福思(杭州)智能科技有限公司 Data closed-loop method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160176397A1 (en) * 2014-12-23 2016-06-23 Toyota Motor Engineering & Manufacturing North America, Inc. Risk mitigation for autonomous vehicles relative to oncoming objects
CN109878512A (en) * 2019-01-15 2019-06-14 北京百度网讯科技有限公司 Automatic Pilot control method, device, equipment and computer readable storage medium
CN109878515A (en) * 2019-03-12 2019-06-14 百度在线网络技术(北京)有限公司 Predict method, apparatus, storage medium and the terminal device of track of vehicle
CN109969172A (en) * 2017-12-26 2019-07-05 华为技术有限公司 Control method for vehicle, equipment and computer storage medium
CN110858454A (en) * 2018-08-22 2020-03-03 福特全球技术公司 Predicting an intent of movement of an object
CN110997387A (en) * 2017-06-20 2020-04-10 优特诺股份有限公司 Risk handling for vehicles with autonomous driving capability
CN111137282A (en) * 2019-12-04 2020-05-12 宝能汽车有限公司 Vehicle collision prediction method and device, vehicle and electronic equipment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11545033B2 (en) * 2017-06-22 2023-01-03 Apollo Intelligent Driving Technology (Beijing) Co., Ltd. Evaluation framework for predicted trajectories in autonomous driving vehicle traffic prediction
US11017550B2 (en) * 2017-11-15 2021-05-25 Uatc, Llc End-to-end tracking of objects
CN108182695B (en) * 2017-12-29 2021-10-29 纳恩博(北京)科技有限公司 Target tracking model training method and device, electronic equipment and storage medium
US11370423B2 (en) * 2018-06-15 2022-06-28 Uatc, Llc Multi-task machine-learned models for object intention determination in autonomous driving
DE102019134048A1 (en) * 2019-12-11 2020-03-26 FEV Group GmbH Procedure for predicting pedestrian behavior

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160176397A1 (en) * 2014-12-23 2016-06-23 Toyota Motor Engineering & Manufacturing North America, Inc. Risk mitigation for autonomous vehicles relative to oncoming objects
CN110997387A (en) * 2017-06-20 2020-04-10 优特诺股份有限公司 Risk handling for vehicles with autonomous driving capability
CN109969172A (en) * 2017-12-26 2019-07-05 华为技术有限公司 Control method for vehicle, equipment and computer storage medium
CN110858454A (en) * 2018-08-22 2020-03-03 福特全球技术公司 Predicting an intent of movement of an object
CN109878512A (en) * 2019-01-15 2019-06-14 北京百度网讯科技有限公司 Automatic Pilot control method, device, equipment and computer readable storage medium
CN109878515A (en) * 2019-03-12 2019-06-14 百度在线网络技术(北京)有限公司 Predict method, apparatus, storage medium and the terminal device of track of vehicle
CN111137282A (en) * 2019-12-04 2020-05-12 宝能汽车有限公司 Vehicle collision prediction method and device, vehicle and electronic equipment

Also Published As

Publication number Publication date
DE112020003091T5 (en) 2022-03-31
CN113799793B (en) 2023-05-12
CN113799793A (en) 2021-12-17

Similar Documents

Publication Publication Date Title
CN111179585B (en) Site testing method and device for automatic driving vehicle
WO2021237768A1 (en) Data-driven-based system for implementing automatic iteration of prediction model
CN113642633B (en) Method, device, equipment and medium for classifying driving scene data
CN112700470B (en) Target detection and track extraction method based on traffic video stream
Moers et al. The exid dataset: A real-world trajectory dataset of highly interactive highway scenarios in germany
Essa et al. Simulated traffic conflicts: do they accurately represent field-measured conflicts?
WO2020034903A1 (en) Smart navigation method and system based on topological map
CN114970321A (en) Scene flow digital twinning method and system based on dynamic trajectory flow
CN114023062B (en) Traffic flow information monitoring method based on deep learning and edge calculation
CN102024330A (en) Intelligent traffic signal control system, method and equipment based on high-definition video technology
CN110782120A (en) Method, system, equipment and medium for evaluating traffic flow model
Wang et al. Realtime wide-area vehicle trajectory tracking using millimeter-wave radar sensors and the open TJRD TS dataset
CN110889444A (en) Driving track feature classification method based on convolutional neural network
CN114783188A (en) Inspection method and device
CN102024331A (en) Intelligent traffic signal control system
CN110021161A (en) A kind of prediction technique and system of traffic direction
CN114694078A (en) Traffic behavior judgment method based on multi-target tracking
CN111325811B (en) Lane line data processing method and processing device
US11682298B2 (en) Practical method to collect and measure real-time traffic data with high accuracy through the 5G network and accessing these data by cloud computing
CN116524718A (en) Remote visual processing method and system for intersection data
US20220172606A1 (en) Systems and Methods for Extracting Data From Autonomous Vehicles
CN114241373A (en) End-to-end vehicle behavior detection method, system, equipment and storage medium
CN112950960B (en) Method for judging reverse running of automatic driving vehicle
Špaňhel et al. Detection of traffic violations of road users based on convolutional neural networks
CN114973704A (en) Method, device, equipment and storage medium for generating signal control strategy

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20937891

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 20937891

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 04.07.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20937891

Country of ref document: EP

Kind code of ref document: A1