Summary of the invention
The invention solves one of technical problem be how comprehensively to reflect the driving environment of vehicle, realize quasi-
Peopleization automatic Pilot.
To achieve the above object, the present invention provides a kind of automatic driving method, comprising: according to the environment sensing of acquisition
Information and driver's operation information establish Vehicular automatic driving database, and the Vehicular automatic driving database is divided into instruction
Practice collection and test set;The running environment risk field that training is established according to the environment sensing information in the training set, according to institute
The driver's operation information stated in the running environment risk field and the training set of training carries out the deep learning model
Training;The running environment risk field that test is established according to the environment sensing information in the test set, by the test
Running environment risk field inputs the deep learning model, exports the vehicle control variable of prediction, by comparing the vehicle of prediction
Driver's operation information in control variable and the test set tests the deep learning model.
Wherein, training use or test running environment risk field using following methods foundation: according to the resting bodily form
At potential energy field information, moving object formed kinetic energy field information and driver formed behavior field information establish running environment
Risk field;Wherein,
The potential energy field information formed for the running environment risk field of training, stationary object and moving object are formed dynamic
Energy field information determines that the behavior field information that driver is formed is according to the training according to the environment sensing information in the training set
Driver's operation information of concentration determines;
The potential energy field information formed for the running environment risk field of test, stationary object and moving object are formed dynamic
Energy field information determines that the behavior field information that driver is formed is according to the test according to the environment sensing information in the test set
Driver's operation information of concentration determines.
In one embodiment, the potential energy field information that stationary object is formed is true according to the attribute and road conditions of stationary object
It is fixed;The kinetic energy field information that moving object is formed is determined according to the attribute of moving object, motion state and road conditions.
In the case where environment sensing information is collected by multiple sensors, this method further include:
The coordinate system of multiple sensors is converted, to form unified coordinate system;
And/or
Using the same target of the mahalanobis distance association different sensors observation of target, to the same of different sensors observation
Target is weighted and averaged by probability of happening, the probability of happening as the same target.
In one embodiment, it is grasped according to the driver in the running environment risk field and the training set of the training
It is used as information to be trained the deep learning model to include: by the running environment risk field of the training and the training set
In driver's operation information input deep learning model, export the vehicle control variable of prediction and the damage of driver's operation information
It breaks one's promise breath;According to the ginseng of vehicle control variable in deep learning model described in the loss Information revision of driver's operation information
Number.
To achieve the above object, the present invention provides a kind of automatic driving device, comprising: sample forms module, is used for
Vehicular automatic driving database is established according to the environment sensing information of acquisition and driver's operation information, and the vehicle is automatic
Driving data library is divided into training set and test set;Model training module, for being believed according to the environment sensing in the training set
Breath establishes the running environment risk field of training, according to driving in the running environment risk field and the training set of the training
The person's of sailing operation information is trained the deep learning model;Model measurement module, for according to the ring in the test set
Border perception information establishes the running environment risk field of test, and the running environment risk field of the test is inputted the depth
Learning model exports the vehicle control variable of prediction, by comparing driving in the vehicle control variable and the test set of prediction
The person's of sailing operation information tests the deep learning model.
The model training module includes that unit is established in the first risk field, the potential energy field letter for being formed according to stationary object
The behavior field information that the kinetic energy field information and driver that breath, moving object are formed are formed establishes the running environment risk of training
?;Wherein, the kinetic energy field information that the potential energy field information and moving object that stationary object is formed are formed is according in the training set
Environment sensing information determines that the behavior field information that driver is formed is determined according to driver's operation information in the training set;
The model measurement module includes that unit is established in the second risk field, the potential energy field letter for being formed according to stationary object
The behavior field information that the kinetic energy field information and driver that breath, moving object are formed are formed establishes the running environment risk of test
?;Wherein, the kinetic energy field information that the potential energy field information and moving object that stationary object is formed are formed is according in the test set
Environment sensing information determines that the behavior field information that driver is formed is determined according to driver's operation information in the test set.
Wherein, the potential energy field information that stationary object is formed is determined according to the attribute and road conditions of stationary object;Moving object
The kinetic energy field information that body is formed is determined according to the attribute of moving object, motion state and road conditions.
In the case where environment sensing information is collected by multiple sensors, it includes: data that the sample, which forms module,
Processing unit and sample form unit;
The data processing unit, is used for
The coordinate system of multiple sensors is converted, to form unified coordinate system;
And/or
Using the same target of the mahalanobis distance association different sensors observation of target, to the same of different sensors observation
Target is weighted and averaged by probability of happening, the probability of happening as the same target;
The sample forms unit, establishes vehicle certainly for the environment sensing information and driver's operation information according to acquisition
Dynamic driving data library, and the Vehicular automatic driving database is divided into training set and test set.
The model training module includes model training unit, for by the running environment risk field of the training and institute
Driver's operation information input deep learning model in training set is stated, vehicle control variable and the driver's operation of prediction are exported
The loss information of information;According to vehicle control in deep learning model described in the loss Information revision of driver's operation information
The parameter of variable.
The present invention is based on the risk models that the theory of field establishes vehicle running environment, thus complete using running environment risk field
The driving environment of the reflection vehicle in face, the automatic Pilot being advantageously implemented under different road environments.And according to running environment wind
Dangerous field and driver's operation are trained Vehicular automatic driving model, to learn the experience of outstanding human driver, realize quasi-
Peopleization automatic Pilot.
Specific embodiment
The invention proposes a kind of automatic driving method, this method establishes traveling using the environment sensing information of acquisition
Environmental risk field, according to running environment risk field and driver's operation training deep learning model, it can be achieved that vehicle is driven automatically
It sails, reduces the training difficulty of Vehicular automatic driving model (abbreviation model).
Fig. 1 is the flow diagram of automatic driving method one embodiment of the present invention.As shown in Figure 1, this method packet
Include following steps:
Step S102 establishes Vehicular automatic driving data according to the environment sensing information of acquisition and driver's operation information
Library, and Vehicular automatic driving database is divided into training set and test set, to form sample.Training set is used to training pattern,
It is used in model training stage;Test set is used to verify the availability of model, uses in the model measurement stage.
Wherein, environment sensing information is the environmental data acquired by least one sensor.For example, vehicle-mounted vidicon acquires
Image, point cloud information and the target information of millimetre-wave radar of laser radar etc., but be not limited to examples cited.
Wherein, driver's operation information includes the information such as vehicle steering angle, vehicle plus/minus speed.It is abundant more in order to obtain
The driving data of sample can select the driving data of different drivers.For example, the database sampling frequency used is 10 hertz,
It selects different two hours driving datas of driver as training set, amounts to 72000 frames, select driving for different driver's half an hour
Data are sailed as test set, amount to 18000 frames.By learning the driving behavior of different drivers, the personification of vehicle may be implemented
Change automatic Pilot.
Step S104 establishes the running environment risk field of training according to the environment sensing information in training set, according to instruction
The driver's operation information in running environment risk field and training set practiced is trained deep learning model.
In one embodiment, deep learning model is trained include: will training running environment risk field with
Driver's operation information in training set inputs deep learning model, exports the vehicle control variable of prediction, such as Vehicular turn
Angle, vehicle plus/minus speed etc., (operate according to the vehicle control variable of prediction and desired vehicle control variable according to driver
Information determines desired vehicle control variable) the loss information that determines driver's operation information, according to driver's operation information
Lose the parameter of vehicle control variable in Information revision deep learning model.By the iteration of certain number, met the requirements
Deep learning model.Wherein, deep learning model for example can be depth convolutional neural networks model.
Running environment risk field can comprehensively describe running environment, be advantageously implemented driving under different road environments automatically
It sails.
Step S106 establishes the running environment risk field of test according to the environment sensing information in test set, will test
Running environment risk field inputs deep learning model, exports the vehicle control variable of prediction, by comparing the vehicle of prediction
Driver's operation information in control variable and test set tests deep learning model.
A kind of illustrative test method is described as follows, if the driver in the vehicle control variable and test set of prediction
Gap between operation information, which is less than preset value, can then determine that deep learning model can that is, the consistency of the two is preferable
With.Wherein, vehicle control variable is for example including vehicle steering angle, vehicle plus/minus speed etc..
However, it will be understood by those skilled in the art that above-mentioned test method is not unique.For example, by the vehicle of prediction
Control variable for controlling vehicle, observation vehicle can normally travel, if can be with normally travel, it is determined that depth
It is available to practise model.
The present invention is based on the risk models that the theory of field establishes vehicle running environment for above-described embodiment, to utilize traveling ring
The driving environment of vehicle, the automatic Pilot being advantageously implemented under different road environments are comprehensively reflected in border risk field.And according to
Running environment risk field and driver's operation are trained Vehicular automatic driving model, to learn the warp of outstanding human driver
It tests, realizes the automatic Pilot that personalizes.In addition, relative to directly being instructed using environment sensing information to Vehicular automatic driving model
Practice, reduces the training difficulty of Vehicular automatic driving model.
The present invention also provides the methods that a kind of pair of deep learning model is trained.It is shown in Figure 2 to depth
The flow chart that model is trained is practised, for the data in training set, training process is as follows:
Step S202 identifies the environment sensing information acquired in training set by least one sensor, identifies
Such as the driving-environment informations such as stationary object, moving object, road.
Identification process is described by taking video camera, laser radar, millimetre-wave radar etc. as an example below.
According to the image that video camera acquires, the targets such as lane line and vehicle are identified.Wherein it is possible to use image processing method
Method identifies the lane line in image.This method passes through image adaptive Threshold segmentation, the extraction of lane markings line feature point, characteristic point
Cluster with fitting, lane lines matching and tracking and etc., realize lane line accurately identify and tenacious tracking.Wherein it is possible to make
With the vehicle target in the method identification image of machine learning.This method uses HOG (Histogram of Oriented
Gradient, histograms of oriented gradients) feature and a kind of AdaBoost (iterative algorithm) cascade classifier training vehicle detection mould
Type, and then use the accurate detection of vehicle detection model realization vehicle target.It will be understood by those skilled in the art that for row
The targets such as people, cyclist, road, road sign can be no longer superfluous here with reference to the recognition methods of lane line above-mentioned and vehicle target
It states.
In addition, laser radar can get the point cloud information (i.e. spatial coordinated information) on barrier and road surface on road.Millimeter
Wave radar can get the information such as the Position And Velocity of barrier (such as vehicle, fence target).
Step S204 optionally in the case where multiple sensors acquire environment sensing information, can also carry out coordinate and turn
It changes and/or data fusion.
Wherein, coordinate conversion refers to that the coordinate system to multiple sensors is converted, to form unified coordinate system, after making
Continuous data fusion is easier.A kind of method of coordinate conversion for example can be, and image coordinate system is converted to camera coordinates,
The coordinate system of camera coordinates and other sensors is transformed into unified vehicle axis system again and (such as is fixed on the coordinate from vehicle
System, coordinate origin is in vehicle centroid), realize the coordinate conversion of different sensors perception information.
Wherein, since the attribute of different sensors perception information is different, as millimetre-wave radar lateral resolution is low, vision passes
Sensor range accuracy difference etc., the same target that the present invention is observed using the mahalanobis distance association different sensors of target, further
In order to merge different sensors observation, the same target of different sensors observation is weighted and averaged by probability of happening, is made
For the probability of happening of the same target, to realize the fusion of multi-sensor information and being effectively estimated for observation time of day.
Wherein, joint probability data association (JPDA, Joint Probability Data Association) side can be used for example
Method is weighted and averaged the same target that different sensors are observed by probability of happening.
It is converted by above-mentioned coordinate or data fusion, can more accurately identify road environment information.
Step S206 establishes the running environment risk field of training according to the environment sensing information in training set.
The present invention provides the risk field method for building up that one kind can comprehensively reflect the degree of risk of vehicle running environment.
That is, according to the potential energy field information of stationary object (vehicle such as stopped) formation, moving object (such as the vehicle and pedestrian of movement)
The behavior field information that the kinetic energy field information of formation and driver are formed establishes running environment risk field, and formula is expressed as follows:
Es=Er+Ev+Ed (1)
Wherein, Es indicates running environment risk field, and Er indicates the potential energy field information that stationary object is formed, and Ev indicates moving object
The kinetic energy field information that body is formed, Ed indicate the behavior field information that driver is formed.
The potential energy field information formed for the running environment risk field of training, stationary object and moving object are formed dynamic
Energy field information determines that the behavior field information that driver is formed is according to driving in training set according to the environment sensing information in training set
The person's of sailing operation information determines.Specifically, the physical field that stationary object influences traffic safety on potential energy field characterization road, potential energy
The size and Orientation of field field strength is mainly determined by stationary object attribute and road conditions.Kinetic energy field is moving object on characterization road
On traffic safety influence physical field, the size and Orientation of kinetic energy field field strength mainly by the attribute of moving object, motion state and
Road conditions determine.Behavior field is to characterize the physical field that influences on traffic safety of driver behavior pattern, behavior field field strength it is big
It is small mainly to be determined by the behavioral trait of driver.Under the same terms, the driver of radical type is usually made than conservative driver
At driving risk it is big, behavior field field strength is with regard to big;The low driver of driving efficiency driver's row usually higher than driving efficiency
It is big for field field strength.
Fig. 3 shows the schematic diagram of the running environment risk field under a typical road environment.For convenience of the training of deep learning
Process by risk field discretization and can project on two-dimensional image.Wherein, the abscissa of risk field picture indicates the cross of vehicle
To direction, ordinate indicates that the longitudinal direction of vehicle, image pixel value indicate degree of risk (such as can quantify to 0 to 255).
In the present embodiment for example it is contemplated that 20 meters of left and right vehicle wheel, 100 meters first, rear 50 meters of range, each pixel represent 0.5 meter
Length, therefore the risk field gray level image size generated is 300x80.
Step S208, by driver's operation information in the running environment risk field and training set of training, (i.e. supervision is believed
Breath) input deep learning model, export the vehicle control variable of prediction.
Wherein, driver's operation information includes the information such as vehicle steering angle, vehicle plus/minus speed.It is abundant more in order to obtain
The driving data of sample can select the driving data of different drivers.
Wherein, deep learning model for example can be depth convolutional neural networks model, which includes five layers of convolutional layer
With two layers of full articulamentum, the last layer exports two-dimensional vehicle control amount.
Step S210 (is operated according to driver and is believed according to the vehicle control variable of prediction and desired vehicle control variable
Breath determines) determine the loss information of driver's operation information, such as with L2 loss function, according to the loss of driver's operation information
The parameter of vehicle control variable in Information revision deep learning model.
By the iteration of certain number (such as 100,000 times), available satisfactory deep learning model, thus complete
The training process of pairs of deep learning model.
Above-described embodiment, the theory based on field establish the risk evaluation model of vehicle running environment, merge multiple sensors
Information input establishes comprehensive running environment description system, the automatic Pilot being advantageously implemented under different road environments.In conjunction with vehicle
Running environment and the corresponding vehicle operating output of driver, based on the automatic Pilot model of deep learning method study vehicle,
The automatic Pilot of vehicle can be achieved.By learn different drivers driving behavior, it can be achieved that vehicle the automatic Pilot that personalizes.
The present invention also provides the methods that a kind of pair of deep learning model is tested.It is shown in Figure 4 to depth
The flow chart that model is tested is practised, for the data in test set, test process is as follows:
Step S402 identifies the environment sensing information acquired in test set by least one sensor, identifies
Such as the driving-environment informations such as stationary object, moving object, road.
Wherein, the recognition methods of the environment sensing information in test set can be believed with the environment sensing in reference pair training set
The recognition methods (referring to step S202) of breath, which is not described herein again.
Step S404 optionally in the case where multiple sensors acquire environment sensing information, can also carry out coordinate and turn
It changes and/or data fusion.
Wherein, the coordinate conversion of the environment sensing information in test set and/or data fusion method can be instructed with reference pair
Practice the coordinate conversion and/or data fusion method (referring to step S204) of the environment sensing information concentrated, which is not described herein again.
Step S406 establishes the running environment risk field of test according to the environment sensing information in test set.
Wherein, the method for building up of the running environment risk field of test can be with reference to the running environment risk field of training
Method for building up (refers to step S206), and which is not described herein again.
Step S408, the deep learning model that the running environment risk field input training of test is obtained, i.e., using instruction
The running environment risk field of experienced deep learning model treatment input, exports the vehicle control variable of prediction.
For example, the risk field gray level image that the size of generation is 300x80 is input to the depth convolutional Neural that training obtains
Network model obtains the information such as two-dimensional vehicle control amount, such as vehicle steering angle, plus/minus speed by returning.
Step S410, by comparing driver's operation information in the vehicle control variable and test set of prediction to depth
Model is practised to be tested.
A kind of illustrative test method is described as follows, if the driver in the vehicle control variable and test set of prediction
Gap between operation information, which is less than preset value, can then determine that deep learning model can that is, the consistency of the two is preferable
With.Wherein, vehicle control variable is for example including vehicle steering angle, vehicle plus/minus speed etc..
It, can be according to vehicle control amount (such as vehicle turn that deep learning model exports if deep learning model is available
To angle, plus/minus speed etc.), effective control to vehicle is realized using PID (proportional integral differential) control.
The present invention also provides a kind of automatic driving devices, and with reference to Fig. 5, which includes:
Sample forms module 502, establishes vehicle certainly for the environment sensing information and driver's operation information according to acquisition
Dynamic driving data library, and Vehicular automatic driving database is divided into training set and test set;
Model training module 504, for establishing the running environment wind of training according to the environment sensing information in training set
Deep learning model is instructed according to driver's operation information in the running environment risk field and training set of training in dangerous field
Practice;
Model measurement module 506, for establishing the running environment wind of test according to the environment sensing information in test set
The running environment risk field of test is inputted deep learning model, exports the vehicle control variable of prediction, by comparing by dangerous field
Driver's operation information in the vehicle control variable and test set of prediction tests deep learning model.
With reference to Fig. 6, in the case where environment sensing information is collected by multiple sensors, sample forms module 502 and wraps
Include: data processing unit 5022 and sample form unit 5024;
Data processing unit 5022 is converted for the coordinate system to multiple sensors, to form unified coordinate system;
And/or the same target of the mahalanobis distance association different sensors observation using target, to the same mesh of different sensors observation
Mark is weighted and averaged by probability of happening, the probability of happening as the same target.
Sample forms unit 5024, establishes vehicle certainly for the environment sensing information and driver's operation information according to acquisition
Dynamic driving data library, and Vehicular automatic driving database is divided into training set and test set.
Wherein, model training module 504 includes that unit 5042 is established in the first risk field, for what is formed according to stationary object
The behavior field information that the kinetic energy field information and driver that potential energy field information, moving object are formed are formed establishes the traveling of training
Environmental risk field;Wherein, the kinetic energy field information that the potential energy field information and moving object that stationary object is formed are formed is according to training set
In environment sensing information determine, driver formed behavior field information according in training set driver's operation information determine.
Wherein, model training module 504 includes model training unit 5044, the running environment risk field for that will train
Deep learning model is inputted with driver's operation information in training set, exports vehicle control variable and the driver's operation of prediction
The loss information of information;According to the ginseng of vehicle control variable in the loss Information revision deep learning model of driver's operation information
Number.
Wherein, model measurement module 506 includes that unit 5062 is established in the second risk field, for what is formed according to stationary object
The behavior field information that the kinetic energy field information and driver that potential energy field information, moving object are formed are formed establishes the traveling of test
Environmental risk field;Wherein, the kinetic energy field information that the potential energy field information and moving object that stationary object is formed are formed is according to test set
In environment sensing information determine, driver formed behavior field information according in test set driver's operation information determine.
Wherein, the potential energy field information that stationary object is formed is determined according to the attribute and road conditions of stationary object;Moving object
The kinetic energy field information that body is formed is determined according to the attribute of moving object, motion state and road conditions.
Wherein, model measurement module 506 includes model measurement unit 5064, for by the running environment risk field of test
Deep learning model is inputted, the vehicle control variable of prediction is exported, in the vehicle control variable and test set by comparing prediction
Driver's operation information deep learning model is tested.
The present invention is based on the risk models that the theory of field establishes vehicle running environment, thus complete using running environment risk field
The driving environment of the reflection vehicle in face, the automatic Pilot being advantageously implemented under different road environments.And according to running environment wind
Dangerous field and driver's operation are trained Vehicular automatic driving model, to learn the experience of outstanding human driver, realize quasi-
Peopleization automatic Pilot.In addition, being reduced relative to being directly trained using environment sensing information to Vehicular automatic driving model
The training difficulty of Vehicular automatic driving model.
Finally it is noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations.This
The those of ordinary skill in field is it is understood that be possible to modify the technical solutions described in the foregoing embodiments or right
Part of technical characteristic is equivalently replaced;These are modified or replaceed, and it does not separate the essence of the corresponding technical solution originally
Invent the spirit and scope of each embodiment technical solution.