CN110450771A - A kind of intelligent automobile stability control method based on deeply study - Google Patents
A kind of intelligent automobile stability control method based on deeply study Download PDFInfo
- Publication number
- CN110450771A CN110450771A CN201910809910.7A CN201910809910A CN110450771A CN 110450771 A CN110450771 A CN 110450771A CN 201910809910 A CN201910809910 A CN 201910809910A CN 110450771 A CN110450771 A CN 110450771A
- Authority
- CN
- China
- Prior art keywords
- network model
- formula
- vehicle
- current
- parameter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W10/00—Conjoint control of vehicle sub-units of different type or different function
- B60W10/18—Conjoint control of vehicle sub-units of different type or different function including control of braking systems
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W10/00—Conjoint control of vehicle sub-units of different type or different function
- B60W10/20—Conjoint control of vehicle sub-units of different type or different function including control of steering systems
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W30/00—Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
- B60W30/02—Control of vehicle driving stability
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/02—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
- B60W40/06—Road conditions
- B60W40/068—Road friction coefficient
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/10—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to vehicle motion
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/10—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to vehicle motion
- B60W40/105—Speed
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/12—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to parameters of the vehicle itself, e.g. tyre models
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/12—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to parameters of the vehicle itself, e.g. tyre models
- B60W40/13—Load or weight
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W50/0098—Details of control systems ensuring comfort, safety or stability not otherwise provided for
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0002—Automatic control, details of type of controller or control system architecture
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0043—Signal treatments, identification of variables or parameters, parameter estimation or state estimation
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2520/00—Input parameters relating to overall vehicle dynamics
- B60W2520/10—Longitudinal speed
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2530/00—Input parameters relating to vehicle conditions or values, not covered by groups B60W2510/00 or B60W2520/00
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2530/00—Input parameters relating to vehicle conditions or values, not covered by groups B60W2510/00 or B60W2520/00
- B60W2530/10—Weight
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2710/00—Output or target parameters relating to a particular sub-units
- B60W2710/18—Braking system
- B60W2710/182—Brake pressure, e.g. of fluid or between pad and disc
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2710/00—Output or target parameters relating to a particular sub-units
- B60W2710/20—Steering systems
Abstract
The invention discloses a kind of intelligent automobile stability control method based on deeply study, step includes: the decision output and vehicle structure parameter, driving parameters of 1 acquisition automobile Lateral Controller;2 define state parameter, action parameter and the reward function of deeply learning method;3 construct and train the network model of deeply learning method, obtain optimal movement network model;4 obtain automobile current state parameter st, to export currently additional yaw moment ▽ M using the optimal movement network modeltWith amendment corner ▽ δt;5 judge vehicle steadily state;6 determine current amendment corner ▽ δ according to motor turning property and steering wheel angle directiontDirection and current additional yaw moment ▽ MtMovement wheel.The present invention is able to achieve the rule of the optimistic coordinated control under steady working condition and limiting condition between direct yaw moment control and course changing control, to realize vehicle stabilization control, guarantees safety and the comfort of driver and passenger.
Description
Technical field
The present invention relates to automobile dynamics control field, more specifically a kind of intelligent vapour based on deeply study
Vehicle stability control method.
Background technique
When turning to, slip angle of tire increases automobile, and lateral force increases, and enables the vehicle to travel according to driver intention,
But under some low attachments and anxious steering situation, the lateral force of vehicle is easy to reach limit of adhesion, and vehicle can occur to break away, is anxious
Turn, rollover etc. dangerous working conditions.Currently, the major way that above-mentioned dangerous working condition can be intervened be active steering control and directly
Yaw moment control.Active steering control is by changing the yaw moment of vehicle to steering wheel Introduced Malaria corner;It is directly horizontal
It is poor that pendulum Torque Control mainly passes through adjusting wheel braking force formation brake force, to generate additional yaw moment to adjust vehicle
Understeer or ovdersteering.
The influence of active steering and direct yaw moment control to automotive performance respectively has advantage and disadvantage, independent active steering control
It is smaller on speed influence, it ensure that the comfort of driver and passenger, but ineffective under limiting condition, it is steady to be unable to control vehicle
It is fixed, it can not meet the security requirement of driver and passenger;Individual direct yaw moment control system, it is ensured that driver and passenger exist
Safety under limiting condition, but be affected to longitudinal acceleration of the vehicle, it can not meet the comfort requirement of driver and passenger.And vehicle
As complicated nonlinear system, there are many couplings between each system, and it is steady to control vehicle in each state of vehicle
Relatively optimal control output is established a capital, is not simple linear relationship between these optimal control outputs, passes through design
Linear tuning controller can not guarantee safety and the comfort of driver and passenger well.
Summary of the invention
The present invention is to solve above-mentioned the shortcomings of the prior art, proposes a kind of intelligent automobile based on deeply study
Stability control method, to be able to achieve under steady working condition and limiting condition between direct yaw moment control and course changing control
Optimistic coordinated control rule, to realize vehicle stabilization control, guarantees safety and the comfort of driver and passenger.
The present invention adopts the following technical scheme that in order to solve the technical problem
A kind of the characteristics of intelligent automobile stability control method based on deeply study of the invention is as follows
It carries out:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: vehicle
Wheelspan L, mass center are to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wd,βd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
In formula (7),Corner is corrected for steering wheel,To add yaw moment;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
In formula (8), reFor error reward function, and have:
In formula (9),For yaw-rate error,For side slip angle error, and have:
In formula (8), rpsFor fixed prize value function, and have:
In formula (8), rvFor speed difference reward function, and have:
In formula (8), rmTo add yaw moment reward function, and have:
In formula (8), rswFor correction angle reward function, and have:
In formula (8), rstFor stable region reward function, and have:
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1It is a
The m of neuron1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ;
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2
The m of a neuron2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Just
It is θ that beginningization, which evaluates network parameter,Q;
Step 7.3: building target action network model identical with the movement network architecture, and enable target action
Network parameter θμ′=θμ, objective appraisal network model identical with the evaluation network architecture is constructed, and enable objective appraisal
Network parameter θQ′=θQ;
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the movement network model
Input, by the movement network model export μ (si|θμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (si|θμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To
It obtains obtaining i-th sample, is denoted as (si,ai,ri,s′i), and then obtain N sample;
Step 9: being trained with network model of the N sample to the deeply learning method, to obtain
Obtain optimal movement network model and optimal evaluation network model;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state,
Otherwise, it indicates that automobile plays pendulum, and executes step 11:
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle
Speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus described in utilizing
Optimal movement network model output currently adds yaw momentWith amendment corner
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables
Movement wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then enables movement wheel
For outer front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment cornerDirection to the left, if δf< 0 then enables amendment cornerSide
To the right;
Step 14: if δf> 0 then enables amendment cornerDirection to the right, if δf< 0 then enables amendment cornerSide
To the left.
The characteristics of intelligent automobile stability control method of the present invention, lies also in, and the step 9 is according to the following procedure
It carries out:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, by institute
It states current i-th of movement network model and exports i-th of output valve μ (si|θμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output of the movement network
Value μ (si|θμ) as it is described it is current i-th evaluation network model input, by i-th of vehicle status parameters siWith i-th
A vehicle action parameter aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By the movement
I-th of output valve μ (s of network modeli|θμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ
(si|θμ));
With updated i-th of vehicle status parameters s 'iAs current i-th of target action network model
Input exports i-th of output valve μ (s ' by current i-th of target action network modeli|θμ′);
With updated i-th of vehicle status parameters s 'iWith i-th of output valve μ (s ' of target action network modeli
|θμ′) input as current i-th of objective appraisal network model, it is defeated by current i-th of objective appraisal network model
I-th of output valve Q ' outi(a′i);
According to i-th of output valve Q of current i-th of evaluation network modeli(μ(si|θμ)) Utilization strategies gradient method pair
Current i-th of movement network model is updated, thus obtain the updated movement network model of i-th and as i-th+
1 movement network model;
The output Q of network model is evaluated according to current i-thi(ai) and current i-th of objective appraisal network model
Output Q 'i(a′i), current i-th of evaluation network model is updated using loss function is minimized, to obtain
The updated evaluation network model of i-th simultaneously evaluates network model as i+1;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network
Model and optimal evaluation network model, otherwise, return step 9.2 execute.
Compared with prior art, the invention has the advantages that:
1, model-free and extensive prediction advantage of the present invention using deeply learning algorithm, it is determined that it and vehicle stabilization
Property the relevant input state of control and output action, devise the reward function for being adapted to coordinated control, construct and train most
Excellent movement network model, thus using the model under steady working condition and limiting condition can decision go out optimal stable coordination
Control strategy ensure that safety and the comfort of driver and passenger to realize vehicle stabilization control;
2, the deeply learning algorithm that the present invention is based on is not needed upon auto model algorithm for design model, is used
Deep neural network there is very strong non-linear expression's ability, vehicle condition and active steering can be given expression to, difference is braked
Non-linear relation between control more meets true feelings compared to the linear controller designed based on simplified auto model
Condition;
3, control method of the invention comparison is without control, active steering control, direct yaw moment control and linear distribution
Coordinated control all has preferable control effect under different operating conditions, has better robustness, then has under limiting condition
Better comfort.
Detailed description of the invention
Fig. 1 is that the present invention is based on the intelligent automobile stabilitraks that deeply learns;
Fig. 2 is the training process figure of deeply learning method of the present invention.
Specific embodiment
In the present embodiment, it is a kind of based on deeply study intelligent automobile stability control method can be current according to automobile
State parameter, decision goes out current amendment corner and additional yaw moment, to realize stability of automobile coordinated control.Specifically
It says, as shown in Figure 1, being to carry out as follows:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: vehicle
Wheelspan L, mass center are to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wd,βd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
In formula (7),Corner is corrected for steering wheel, value range is (0,20), and unit takes °,To add sideway power
Square value range is (0,20), and unit takes Nm;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
Reward function is the core of entire depth nitrification enhancement, can guide the adjustment side of deep neural network parameter
To.Design principle should be provided first in design, and specific reward function is then redesigned according to design principle.
It is 4 priority that function setup is rewarded in this example, and priority is higher, then the principle is more important, design principle are as follows:
1 grade: object of the present invention is to realize Vehicle Stability Control, therefore guaranteeing that the stability of automobile is top priority;
2 grades: course changing control has advantage compared to control for brake, so to guarantee that course changing control will be controlled prior to braking
System;
3 grades: using lesser active steering angle or lesser braking pressure control vehicle steadily as far as possible;
4 grades: for automobile in stability region, making movement output as far as possible is 0;
In formula (8), reFor error reward function, corresponding to 1 grade of design principle, the smaller then reward value of error is bigger, in order to
The change rate of the importance of prominent 1 grade of design principle, error reward function should be maximum, therefore designs quadratic function as 1 grade
Reward function, and have:
In formula (9),For yaw-rate error,For side slip angle error, and have:
In formula (8), rpsFor fixed prize value function, correspond to 2 grades of design principles, preferentially using course changing control can obtain compared with
Big reward value, and have:
In formula (8), rvFor speed difference reward function, correspond to 2 grades of Controlling principles, turning to influences speed compared to braking
It is smaller, biggish reward value can be obtained, and have:
In formula (8), rmTo add yaw moment reward function, corresponding to 3 grades of design principles, and have:
In formula (8), rswFor correction angle reward function, correspond to 3 grades of design principles, and have:
In formula (8), rstFor stable region reward function, correspond to 4 grades of design principles, in stable region, movement is smaller, encourages
It encourages bigger, and has:
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1It is a
The m of neuron1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ;
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2
The m of a neuron2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Just
It is θ that beginningization, which evaluates network parameter,Q;
Step 7.3: building target action network model identical with movement network architecture, and enable target action network
Parameter θμ′=θμ, building objective appraisal network model identical with evaluation network architecture, and enable objective appraisal network parameter
θQ′=θQ;
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the defeated of movement network model
Enter, μ (s is exported by movement network modeli|θμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (si|θμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To
It obtains obtaining i-th sample, is denoted as (si,ai,ri,s′i), and then obtain N sample;
Step 9: as shown in Fig. 2, being trained with network model of the N sample to deeply learning method:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, by current the
I movement network model exports i-th of output valve μ (si|θμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output valve μ (s of movement networki|
θμ) as current i-th evaluate network model input, by i-th of vehicle status parameters siWith i-th of vehicle action parameter
aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By i-th of output valve μ of movement network model
(si|θμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ(si|θμ));
With updated i-th of vehicle status parameters s 'iAs the input of current i-th of target action network model, by
Current i-th of target action network model exports i-th of output valve μ (s 'i|θμ′);
With updated i-th of vehicle status parameters si' and target action network model i-th of output valve μ (s 'i|
θμ′) input as current i-th of objective appraisal network model, by current i-th of objective appraisal network model export i-th it is defeated
Value Q ' outi(a′i);
I-th of output valve Q of network model is evaluated according to current i-thi(μ(si|θμ)) Utilization strategies gradient method is to current
I-th of movement network model is updated, to obtain the updated movement network model of i-th and act as i+1
Network model;
The output Q of network model is evaluated with current i-thi(ai) and current i-th of objective appraisal network model output
Q′i(a′i), current i-th of evaluation network model is updated using loss function is minimized, thus after obtaining i-th update
Evaluation network model and as i+1 evaluate network model;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network
Model and optimal evaluation network model, otherwise, return step 9.2 execute;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state,
Otherwise, it indicates that automobile plays pendulum, and executes step 11:
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle
Speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus using optimal
Act network model output currently additional yaw momentWith amendment corner
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables
Movement wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then enables movement wheel
For outer front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment cornerDirection to the left, if δf< 0 then enables amendment cornerSide
To the right;
Step 14: if δf> 0 then enables amendment cornerDirection to the right, if δf< 0 then enables amendment cornerSide
To the left.
Claims (2)
1. a kind of intelligent automobile stability control method based on deeply study, it is characterized in that carrying out as follows:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: car gage
L, mass center is to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wd,βd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
A={ ▽ δ, ▽ M } (7)
In formula (7), ▽ δ is that steering wheel corrects corner, and ▽ M is additional yaw moment;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
In formula (8), reFor error reward function, and have:
re=-▽ w2-▽β2+50 (9)
In formula (9), ▽ w is yaw-rate error, and ▽ β is side slip angle error, and is had:
▽ w=w-wd (10)
▽ β=β-βd (11)
In formula (8), rpsFor fixed prize value function, and have:
In formula (8), rvFor speed difference reward function, and have:
In formula (8), rmTo add yaw moment reward function, and have:
In formula (8), rswFor correction angle reward function, and have:
rsw=-| ▽ δ |+10 (15)
In formula (8), rstFor stable region reward function, and have:
rst=-(| ▽ δ |+| ▽ M |)/10 (16)
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1A nerve
The m of member1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ;
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2A nerve
The m of member2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Initialization is commented
Valence network parameter is θQ;
Step 7.3: building target action network model identical with the movement network architecture, and enable target action network
Parameter θμ′=θμ, objective appraisal network model identical with the evaluation network architecture is constructed, and enable objective appraisal network
Parameter θQ′=θQ;
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the defeated of the movement network model
Enter, μ (s is exported by the movement network modeli|θμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (si|θμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To obtain
I-th sample is obtained, (s is denoted asi,ai,ri,s′i), and then obtain N sample;
Step 9: being trained with network model of the N sample to the deeply learning method, to be obtained
Optimal movement network model and optimal evaluation network model;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state, otherwise,
It indicates that automobile plays pendulum, and executes step 11:
|▽w|≤|ε·wd| (19)
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus using described optimal dynamic
Make network model output currently additional yaw moment ▽ MtWith amendment corner ▽ δt;
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables movement
Wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then it is outer for enabling movement wheel
Front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment corner ▽ δtDirection to the left, if δf< 0 then enables amendment corner ▽ δtDirection to
It is right;
Step 14: if δf> 0 then enables amendment corner ▽ δtDirection to the right, if δf< 0 then enables amendment corner ▽ δtDirection to
It is left.
2. intelligent automobile stability control method according to claim 1, characterized in that the step 9 is by following mistake
Cheng Jinhang:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, is worked as by described
Preceding i-th of movement network model exports i-th of output valve μ (si|θμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output valve μ of the movement network
(si|θμ) as it is described it is current i-th evaluation network model input, by i-th of vehicle status parameters siWith i-th
Vehicle action parameter aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By the movement net
I-th of output valve μ (s of network modeli|θμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ(si
|θμ));
With updated i-th of vehicle status parameters s 'iAs the input of current i-th of target action network model,
I-th of output valve μ (s ' is exported by current i-th of target action network modeli|θμ′);
With updated i-th of vehicle status parameters s 'iWith i-th of output valve μ (s ' of target action network modeli|θμ′)
As the input of current i-th of objective appraisal network model, by current i-th of objective appraisal network model output the
I output valve Q 'i(a′i);
According to i-th of output valve Q of current i-th of evaluation network modeli(μ(si|θμ)) Utilization strategies gradient method is to described
Current i-th of movement network model is updated, to obtain the updated movement network model of i-th and as i+1
Act network model;
The output Q of network model is evaluated according to current i-thi(ai) and current i-th of objective appraisal network model is defeated
Q ' outi(a′i), current i-th of evaluation network model is updated using loss function is minimized, to obtain i-th
Updated evaluation network model simultaneously evaluates network model as i+1;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network model
With optimal evaluation network model, otherwise, return step 9.2 is executed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910809910.7A CN110450771B (en) | 2019-08-29 | 2019-08-29 | Intelligent automobile stability control method based on deep reinforcement learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910809910.7A CN110450771B (en) | 2019-08-29 | 2019-08-29 | Intelligent automobile stability control method based on deep reinforcement learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110450771A true CN110450771A (en) | 2019-11-15 |
CN110450771B CN110450771B (en) | 2021-03-09 |
Family
ID=68489893
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910809910.7A Active CN110450771B (en) | 2019-08-29 | 2019-08-29 | Intelligent automobile stability control method based on deep reinforcement learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110450771B (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110843746A (en) * | 2019-11-28 | 2020-02-28 | 的卢技术有限公司 | Anti-lock brake control method and system based on reinforcement learning |
CN111605542A (en) * | 2020-05-06 | 2020-09-01 | 南京航空航天大学 | Vehicle stability system based on safety boundary and control method |
CN111746633A (en) * | 2020-07-02 | 2020-10-09 | 南京航空航天大学 | Vehicle distributed steering driving system control method based on reinforcement learning |
CN111873991A (en) * | 2020-07-22 | 2020-11-03 | 中国第一汽车股份有限公司 | Vehicle steering control method, device, terminal and storage medium |
CN112861269A (en) * | 2021-03-11 | 2021-05-28 | 合肥工业大学 | Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction |
CN113386790A (en) * | 2021-06-09 | 2021-09-14 | 扬州大学 | Automatic driving decision-making method for cross-sea bridge road condition |
CN115123159A (en) * | 2022-06-27 | 2022-09-30 | 重庆邮电大学 | AEB control method and system based on DDPG deep reinforcement learning |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4183667A1 (en) * | 2021-11-18 | 2023-05-24 | Volvo Truck Corporation | Method for closed loop control of a position of a fifth wheel of a vehicle |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1439556A (en) * | 2002-02-23 | 2003-09-03 | 株式会社万都 | Control method for vehicle running stability |
US20040060765A1 (en) * | 2002-09-27 | 2004-04-01 | Ford Global Technologies, Inc. | Yaw control for an automotive vehicle using steering actuators |
JP4021185B2 (en) * | 2001-12-07 | 2007-12-12 | 本田技研工業株式会社 | Yaw moment feedback control method |
DE102011010491A1 (en) * | 2011-02-07 | 2012-08-09 | Audi Ag | Method for activating electronics stability control device of e.g. all-wheel driven motor car, involves determining function such that function value increases with increasing lateral force of front and rear wheels |
CN105253141A (en) * | 2015-09-09 | 2016-01-20 | 北京理工大学 | Wheel longitudinal force regulation-based vehicle handling stability control method |
CN106828464A (en) * | 2017-01-06 | 2017-06-13 | 合肥工业大学 | A kind of vehicle body stable control method and system based on coefficient of road adhesion estimation |
-
2019
- 2019-08-29 CN CN201910809910.7A patent/CN110450771B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4021185B2 (en) * | 2001-12-07 | 2007-12-12 | 本田技研工業株式会社 | Yaw moment feedback control method |
CN1439556A (en) * | 2002-02-23 | 2003-09-03 | 株式会社万都 | Control method for vehicle running stability |
US20040060765A1 (en) * | 2002-09-27 | 2004-04-01 | Ford Global Technologies, Inc. | Yaw control for an automotive vehicle using steering actuators |
DE102011010491A1 (en) * | 2011-02-07 | 2012-08-09 | Audi Ag | Method for activating electronics stability control device of e.g. all-wheel driven motor car, involves determining function such that function value increases with increasing lateral force of front and rear wheels |
CN105253141A (en) * | 2015-09-09 | 2016-01-20 | 北京理工大学 | Wheel longitudinal force regulation-based vehicle handling stability control method |
CN106828464A (en) * | 2017-01-06 | 2017-06-13 | 合肥工业大学 | A kind of vehicle body stable control method and system based on coefficient of road adhesion estimation |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110843746A (en) * | 2019-11-28 | 2020-02-28 | 的卢技术有限公司 | Anti-lock brake control method and system based on reinforcement learning |
CN111605542A (en) * | 2020-05-06 | 2020-09-01 | 南京航空航天大学 | Vehicle stability system based on safety boundary and control method |
CN111746633A (en) * | 2020-07-02 | 2020-10-09 | 南京航空航天大学 | Vehicle distributed steering driving system control method based on reinforcement learning |
CN111873991A (en) * | 2020-07-22 | 2020-11-03 | 中国第一汽车股份有限公司 | Vehicle steering control method, device, terminal and storage medium |
CN111873991B (en) * | 2020-07-22 | 2022-04-08 | 中国第一汽车股份有限公司 | Vehicle steering control method, device, terminal and storage medium |
CN112861269A (en) * | 2021-03-11 | 2021-05-28 | 合肥工业大学 | Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction |
CN112861269B (en) * | 2021-03-11 | 2022-08-30 | 合肥工业大学 | Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction |
CN113386790A (en) * | 2021-06-09 | 2021-09-14 | 扬州大学 | Automatic driving decision-making method for cross-sea bridge road condition |
CN113386790B (en) * | 2021-06-09 | 2022-07-12 | 扬州大学 | Automatic driving decision-making method for cross-sea bridge road condition |
CN115123159A (en) * | 2022-06-27 | 2022-09-30 | 重庆邮电大学 | AEB control method and system based on DDPG deep reinforcement learning |
Also Published As
Publication number | Publication date |
---|---|
CN110450771B (en) | 2021-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110450771A (en) | A kind of intelligent automobile stability control method based on deeply study | |
US8718872B2 (en) | Vehicle attitude controller | |
CN111267835B (en) | Four-wheel independent drive automobile stability control method based on model prediction algorithm | |
CN109976159A (en) | Intelligent vehicle crosswise joint method based on safely controllable domain | |
US10059344B2 (en) | Vehicle behavior control apparatus | |
Goodarzi et al. | Automatic path control based on integrated steering and external yaw-moment control | |
CN106461052A (en) | Method and system for controlling stability and yaw response of a vehicle by locking a differential at elevated speeds | |
CN101189149A (en) | Vehicle braking/driving force control apparatus | |
CN108773376A (en) | A kind of the automobile objective layered Collaborative Control and optimization method of fusion driving intention | |
CN105936273A (en) | Vehicle active torque inter-wheel and inter-axis distribution method | |
CN102421655A (en) | Specification information estimating device and vehicle | |
Jalali | Stability control of electric vehicles with in-wheel motors | |
JP2014144681A (en) | Vehicular driving force control unit | |
CN114802177A (en) | Torque distribution system for redistributing torque between axles of vehicle | |
Lu et al. | From vehicle stability control to intelligent personal minder: Real-time vehicle handling limit warning and driver style characterization | |
Maeda et al. | Performance of driver-vehicle system in emergency avoidance | |
CN110239499B (en) | Vehicle control device and vehicle control method | |
Zheng et al. | Comparison of active front wheel steering and differential braking for yaw/roll stability enhancement of a coach | |
Tamaddoni et al. | Optimal vehicle stability control design based on preview game theory concept | |
US8442736B2 (en) | System for enhancing cornering performance of a vehicle controlled by a safety system | |
CN106672072A (en) | Control method for steer-by-wire automobile active front-wheel steering control system | |
Peng et al. | Integrated vehicle control based on tire force reserve optimization concept | |
CN109747710B (en) | Method for controlling a rear axle steering system of a motor vehicle | |
Xiao | The research of fuzzy variable transmission ratio for steer-by-wire system of electric forklift | |
JP5018051B2 (en) | Vehicle driving force control device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |