CN110450771A - A kind of intelligent automobile stability control method based on deeply study - Google Patents

A kind of intelligent automobile stability control method based on deeply study Download PDF

Info

Publication number
CN110450771A
CN110450771A CN201910809910.7A CN201910809910A CN110450771A CN 110450771 A CN110450771 A CN 110450771A CN 201910809910 A CN201910809910 A CN 201910809910A CN 110450771 A CN110450771 A CN 110450771A
Authority
CN
China
Prior art keywords
network model
formula
vehicle
current
parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910809910.7A
Other languages
Chinese (zh)
Other versions
CN110450771B (en
Inventor
黄鹤
郭伟锋
张炳力
张润
王博文
吴润晨
程进
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei University of Technology
Hefei Polytechnic University
Original Assignee
Hefei Polytechnic University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei Polytechnic University filed Critical Hefei Polytechnic University
Priority to CN201910809910.7A priority Critical patent/CN110450771B/en
Publication of CN110450771A publication Critical patent/CN110450771A/en
Application granted granted Critical
Publication of CN110450771B publication Critical patent/CN110450771B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W10/00Conjoint control of vehicle sub-units of different type or different function
    • B60W10/18Conjoint control of vehicle sub-units of different type or different function including control of braking systems
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W10/00Conjoint control of vehicle sub-units of different type or different function
    • B60W10/20Conjoint control of vehicle sub-units of different type or different function including control of steering systems
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units, or advanced driver assistance systems for ensuring comfort, stability and safety or drive control systems for propelling or retarding the vehicle
    • B60W30/02Control of vehicle driving stability
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/06Road conditions
    • B60W40/068Road friction coefficient
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/10Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to vehicle motion
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/10Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to vehicle motion
    • B60W40/105Speed
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/12Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to parameters of the vehicle itself, e.g. tyre models
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/12Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to parameters of the vehicle itself, e.g. tyre models
    • B60W40/13Load or weight
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W50/0098Details of control systems ensuring comfort, safety or stability not otherwise provided for
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0002Automatic control, details of type of controller or control system architecture
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0043Signal treatments, identification of variables or parameters, parameter estimation or state estimation
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2520/00Input parameters relating to overall vehicle dynamics
    • B60W2520/10Longitudinal speed
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2530/00Input parameters relating to vehicle conditions or values, not covered by groups B60W2510/00 or B60W2520/00
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2530/00Input parameters relating to vehicle conditions or values, not covered by groups B60W2510/00 or B60W2520/00
    • B60W2530/10Weight
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2710/00Output or target parameters relating to a particular sub-units
    • B60W2710/18Braking system
    • B60W2710/182Brake pressure, e.g. of fluid or between pad and disc
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2710/00Output or target parameters relating to a particular sub-units
    • B60W2710/20Steering systems

Abstract

The invention discloses a kind of intelligent automobile stability control method based on deeply study, step includes: the decision output and vehicle structure parameter, driving parameters of 1 acquisition automobile Lateral Controller;2 define state parameter, action parameter and the reward function of deeply learning method;3 construct and train the network model of deeply learning method, obtain optimal movement network model;4 obtain automobile current state parameter st, to export currently additional yaw moment ▽ M using the optimal movement network modeltWith amendment corner ▽ δt;5 judge vehicle steadily state;6 determine current amendment corner ▽ δ according to motor turning property and steering wheel angle directiontDirection and current additional yaw moment ▽ MtMovement wheel.The present invention is able to achieve the rule of the optimistic coordinated control under steady working condition and limiting condition between direct yaw moment control and course changing control, to realize vehicle stabilization control, guarantees safety and the comfort of driver and passenger.

Description

A kind of intelligent automobile stability control method based on deeply study
Technical field
The present invention relates to automobile dynamics control field, more specifically a kind of intelligent vapour based on deeply study Vehicle stability control method.
Background technique
When turning to, slip angle of tire increases automobile, and lateral force increases, and enables the vehicle to travel according to driver intention, But under some low attachments and anxious steering situation, the lateral force of vehicle is easy to reach limit of adhesion, and vehicle can occur to break away, is anxious Turn, rollover etc. dangerous working conditions.Currently, the major way that above-mentioned dangerous working condition can be intervened be active steering control and directly Yaw moment control.Active steering control is by changing the yaw moment of vehicle to steering wheel Introduced Malaria corner;It is directly horizontal It is poor that pendulum Torque Control mainly passes through adjusting wheel braking force formation brake force, to generate additional yaw moment to adjust vehicle Understeer or ovdersteering.
The influence of active steering and direct yaw moment control to automotive performance respectively has advantage and disadvantage, independent active steering control It is smaller on speed influence, it ensure that the comfort of driver and passenger, but ineffective under limiting condition, it is steady to be unable to control vehicle It is fixed, it can not meet the security requirement of driver and passenger;Individual direct yaw moment control system, it is ensured that driver and passenger exist Safety under limiting condition, but be affected to longitudinal acceleration of the vehicle, it can not meet the comfort requirement of driver and passenger.And vehicle As complicated nonlinear system, there are many couplings between each system, and it is steady to control vehicle in each state of vehicle Relatively optimal control output is established a capital, is not simple linear relationship between these optimal control outputs, passes through design Linear tuning controller can not guarantee safety and the comfort of driver and passenger well.
Summary of the invention
The present invention is to solve above-mentioned the shortcomings of the prior art, proposes a kind of intelligent automobile based on deeply study Stability control method, to be able to achieve under steady working condition and limiting condition between direct yaw moment control and course changing control Optimistic coordinated control rule, to realize vehicle stabilization control, guarantees safety and the comfort of driver and passenger.
The present invention adopts the following technical scheme that in order to solve the technical problem
A kind of the characteristics of intelligent automobile stability control method based on deeply study of the invention is as follows It carries out:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: vehicle Wheelspan L, mass center are to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wdd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
In formula (7),Corner is corrected for steering wheel,To add yaw moment;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
In formula (8), reFor error reward function, and have:
In formula (9),For yaw-rate error,For side slip angle error, and have:
In formula (8), rpsFor fixed prize value function, and have:
In formula (8), rvFor speed difference reward function, and have:
In formula (8), rmTo add yaw moment reward function, and have:
In formula (8), rswFor correction angle reward function, and have:
In formula (8), rstFor stable region reward function, and have:
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1It is a The m of neuron1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2 The m of a neuron2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Just It is θ that beginningization, which evaluates network parameter,Q
Step 7.3: building target action network model identical with the movement network architecture, and enable target action Network parameter θμ′μ, objective appraisal network model identical with the evaluation network architecture is constructed, and enable objective appraisal Network parameter θQ′Q
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the movement network model Input, by the movement network model export μ (siμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (siμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To It obtains obtaining i-th sample, is denoted as (si,ai,ri,s′i), and then obtain N sample;
Step 9: being trained with network model of the N sample to the deeply learning method, to obtain Obtain optimal movement network model and optimal evaluation network model;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state, Otherwise, it indicates that automobile plays pendulum, and executes step 11:
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle Speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus described in utilizing Optimal movement network model output currently adds yaw momentWith amendment corner
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables Movement wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then enables movement wheel For outer front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment cornerDirection to the left, if δf< 0 then enables amendment cornerSide To the right;
Step 14: if δf> 0 then enables amendment cornerDirection to the right, if δf< 0 then enables amendment cornerSide To the left.
The characteristics of intelligent automobile stability control method of the present invention, lies also in, and the step 9 is according to the following procedure It carries out:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, by institute It states current i-th of movement network model and exports i-th of output valve μ (siμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output of the movement network Value μ (siμ) as it is described it is current i-th evaluation network model input, by i-th of vehicle status parameters siWith i-th A vehicle action parameter aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By the movement I-th of output valve μ (s of network modeliμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ (siμ));
With updated i-th of vehicle status parameters s 'iAs current i-th of target action network model Input exports i-th of output valve μ (s ' by current i-th of target action network modeliμ′);
With updated i-th of vehicle status parameters s 'iWith i-th of output valve μ (s ' of target action network modeliμ′) input as current i-th of objective appraisal network model, it is defeated by current i-th of objective appraisal network model I-th of output valve Q ' outi(a′i);
According to i-th of output valve Q of current i-th of evaluation network modeli(μ(siμ)) Utilization strategies gradient method pair Current i-th of movement network model is updated, thus obtain the updated movement network model of i-th and as i-th+ 1 movement network model;
The output Q of network model is evaluated according to current i-thi(ai) and current i-th of objective appraisal network model Output Q 'i(a′i), current i-th of evaluation network model is updated using loss function is minimized, to obtain The updated evaluation network model of i-th simultaneously evaluates network model as i+1;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network Model and optimal evaluation network model, otherwise, return step 9.2 execute.
Compared with prior art, the invention has the advantages that:
1, model-free and extensive prediction advantage of the present invention using deeply learning algorithm, it is determined that it and vehicle stabilization Property the relevant input state of control and output action, devise the reward function for being adapted to coordinated control, construct and train most Excellent movement network model, thus using the model under steady working condition and limiting condition can decision go out optimal stable coordination Control strategy ensure that safety and the comfort of driver and passenger to realize vehicle stabilization control;
2, the deeply learning algorithm that the present invention is based on is not needed upon auto model algorithm for design model, is used Deep neural network there is very strong non-linear expression's ability, vehicle condition and active steering can be given expression to, difference is braked Non-linear relation between control more meets true feelings compared to the linear controller designed based on simplified auto model Condition;
3, control method of the invention comparison is without control, active steering control, direct yaw moment control and linear distribution Coordinated control all has preferable control effect under different operating conditions, has better robustness, then has under limiting condition Better comfort.
Detailed description of the invention
Fig. 1 is that the present invention is based on the intelligent automobile stabilitraks that deeply learns;
Fig. 2 is the training process figure of deeply learning method of the present invention.
Specific embodiment
In the present embodiment, it is a kind of based on deeply study intelligent automobile stability control method can be current according to automobile State parameter, decision goes out current amendment corner and additional yaw moment, to realize stability of automobile coordinated control.Specifically It says, as shown in Figure 1, being to carry out as follows:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: vehicle Wheelspan L, mass center are to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wdd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
In formula (7),Corner is corrected for steering wheel, value range is (0,20), and unit takes °,To add sideway power Square value range is (0,20), and unit takes Nm;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
Reward function is the core of entire depth nitrification enhancement, can guide the adjustment side of deep neural network parameter To.Design principle should be provided first in design, and specific reward function is then redesigned according to design principle.
It is 4 priority that function setup is rewarded in this example, and priority is higher, then the principle is more important, design principle are as follows:
1 grade: object of the present invention is to realize Vehicle Stability Control, therefore guaranteeing that the stability of automobile is top priority;
2 grades: course changing control has advantage compared to control for brake, so to guarantee that course changing control will be controlled prior to braking System;
3 grades: using lesser active steering angle or lesser braking pressure control vehicle steadily as far as possible;
4 grades: for automobile in stability region, making movement output as far as possible is 0;
In formula (8), reFor error reward function, corresponding to 1 grade of design principle, the smaller then reward value of error is bigger, in order to The change rate of the importance of prominent 1 grade of design principle, error reward function should be maximum, therefore designs quadratic function as 1 grade Reward function, and have:
In formula (9),For yaw-rate error,For side slip angle error, and have:
In formula (8), rpsFor fixed prize value function, correspond to 2 grades of design principles, preferentially using course changing control can obtain compared with Big reward value, and have:
In formula (8), rvFor speed difference reward function, correspond to 2 grades of Controlling principles, turning to influences speed compared to braking It is smaller, biggish reward value can be obtained, and have:
In formula (8), rmTo add yaw moment reward function, corresponding to 3 grades of design principles, and have:
In formula (8), rswFor correction angle reward function, correspond to 3 grades of design principles, and have:
In formula (8), rstFor stable region reward function, correspond to 4 grades of design principles, in stable region, movement is smaller, encourages It encourages bigger, and has:
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1It is a The m of neuron1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2 The m of a neuron2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Just It is θ that beginningization, which evaluates network parameter,Q
Step 7.3: building target action network model identical with movement network architecture, and enable target action network Parameter θμ′μ, building objective appraisal network model identical with evaluation network architecture, and enable objective appraisal network parameter θQ′Q
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the defeated of movement network model Enter, μ (s is exported by movement network modeliμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (siμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To It obtains obtaining i-th sample, is denoted as (si,ai,ri,s′i), and then obtain N sample;
Step 9: as shown in Fig. 2, being trained with network model of the N sample to deeply learning method:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, by current the I movement network model exports i-th of output valve μ (siμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output valve μ (s of movement networki| θμ) as current i-th evaluate network model input, by i-th of vehicle status parameters siWith i-th of vehicle action parameter aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By i-th of output valve μ of movement network model (siμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ(siμ));
With updated i-th of vehicle status parameters s 'iAs the input of current i-th of target action network model, by Current i-th of target action network model exports i-th of output valve μ (s 'iμ′);
With updated i-th of vehicle status parameters si' and target action network model i-th of output valve μ (s 'i| θμ′) input as current i-th of objective appraisal network model, by current i-th of objective appraisal network model export i-th it is defeated Value Q ' outi(a′i);
I-th of output valve Q of network model is evaluated according to current i-thi(μ(siμ)) Utilization strategies gradient method is to current I-th of movement network model is updated, to obtain the updated movement network model of i-th and act as i+1 Network model;
The output Q of network model is evaluated with current i-thi(ai) and current i-th of objective appraisal network model output Q′i(a′i), current i-th of evaluation network model is updated using loss function is minimized, thus after obtaining i-th update Evaluation network model and as i+1 evaluate network model;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network Model and optimal evaluation network model, otherwise, return step 9.2 execute;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state, Otherwise, it indicates that automobile plays pendulum, and executes step 11:
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle Speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus using optimal Act network model output currently additional yaw momentWith amendment corner
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables Movement wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then enables movement wheel For outer front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment cornerDirection to the left, if δf< 0 then enables amendment cornerSide To the right;
Step 14: if δf> 0 then enables amendment cornerDirection to the right, if δf< 0 then enables amendment cornerSide To the left.

Claims (2)

1. a kind of intelligent automobile stability control method based on deeply study, it is characterized in that carrying out as follows:
Step 1: obtaining the front wheel angle δ of vehicle lateral control device decision outputfAnd vehicle structure parameter, comprising: car gage L, mass center is to wheel base from LfAnd Lr, front and back wheel cornering stiffness C1And C2, car mass m;
Obtain vehicle driving parameter, comprising: steering wheel angle sw, speed u and surface friction coefficient μ;
Step 2: calculating ideal yaw velocity w using formula (1)d:
In formula (1), g is acceleration of gravity, and w is yaw velocity, and is had:
Step 3: calculating ideal side slip angle β using formula (3)d:
βd=-min | β |, | βmax|}·sign(δf) (3)
In formula (3), β is vehicle centroid side drift angle, βmaxFor vehicle maximum side slip angle, and have:
Step 4: the vehicle status parameters s of deeply learning method is defined using formula (6):
S={ w, β, sw, wdd} (6)
Step 5: the action parameter a of deeply learning method is defined using formula (7):
A={ ▽ δ, ▽ M } (7)
In formula (7), ▽ δ is that steering wheel corrects corner, and ▽ M is additional yaw moment;
Step 6: the reward function r of deeply learning method is established using formula (8):
R=re+rps+rv+rm+rsw+rst (8)
In formula (8), reFor error reward function, and have:
re=-▽ w2-▽β2+50 (9)
In formula (9), ▽ w is yaw-rate error, and ▽ β is side slip angle error, and is had:
▽ w=w-wd (10)
▽ β=β-βd (11)
In formula (8), rpsFor fixed prize value function, and have:
In formula (8), rvFor speed difference reward function, and have:
In formula (8), rmTo add yaw moment reward function, and have:
In formula (8), rswFor correction angle reward function, and have:
rsw=-| ▽ δ |+10 (15)
In formula (8), rstFor stable region reward function, and have:
rst=-(| ▽ δ |+| ▽ M |)/10 (16)
Step 7: the network model of building deeply learning method:
Step 7.1: building acts network model, comprising: one layer of input layer comprising a neuron respectively contains n1A nerve The m of member1Layer hidden layer, one layer of output layer comprising 2 neurons;Initialization action network parameter is θμ
Step 7.2: building evaluation network model, comprising: respectively include two layers of input layer of 1 neuron, respectively contain n2A nerve The m of member2Layer hidden layer, wherein m2Layer hidden layer is full articulamentum, one layer of output layer comprising 1 neuron;Initialization is commented Valence network parameter is θQ
Step 7.3: building target action network model identical with the movement network architecture, and enable target action network Parameter θμ′μ, objective appraisal network model identical with the evaluation network architecture is constructed, and enable objective appraisal network Parameter θQ′Q
Step 8: N sample is formed by i-th sample:
Initialize i-th of vehicle status parameters si, and with i-th of vehicle status parameters siAs the defeated of the movement network model Enter, μ (s is exported by the movement network modeliμ);
I-th of vehicle action parameter a is obtained using formula (17)i:
ai=μ (siμ)+Ni (17)
In formula (17), NiIndicate i-th of random noise;
I-th of vehicle reward value r is obtained according to formula (8)i, and obtain updated i-th of vehicle status parameters s 'i;To obtain I-th sample is obtained, (s is denoted asi,ai,ri,s′i), and then obtain N sample;
Step 9: being trained with network model of the N sample to the deeply learning method, to be obtained Optimal movement network model and optimal evaluation network model;
Step 10: judging whether formula (18) and formula (19) are set up, if setting up, then it represents that automobile is in stable state, otherwise, It indicates that automobile plays pendulum, and executes step 11:
|▽w|≤|ε·wd| (19)
In formula (18), k1For the first border coefficient of stable region, k2For stable region the second boundary coefficient;For side slip angle speed;
In formula (19), ε is adjustable parameter;
Step 11: obtaining vehicle's current condition parameter stAs the input of optimal movement network model, thus using described optimal dynamic Make network model output currently additional yaw moment ▽ MtWith amendment corner ▽ δt
Step 12: judge whether formula (20) is true, if so, the steering property for then indicating automobile is understeer, then enables movement Wheel is inner rear wheel, and executes step 13, otherwise, indicates that the steering property of automobile is negative understeer, then it is outer for enabling movement wheel Front-wheel, and execute step 14;
wd×(w-wd) > 0 (20)
Step 13: if δf> 0 then enables amendment corner ▽ δtDirection to the left, if δf< 0 then enables amendment corner ▽ δtDirection to It is right;
Step 14: if δf> 0 then enables amendment corner ▽ δtDirection to the right, if δf< 0 then enables amendment corner ▽ δtDirection to It is left.
2. intelligent automobile stability control method according to claim 1, characterized in that the step 9 is by following mistake Cheng Jinhang:
Step 9.1: initialization Study rate parameter is α, and return rate parameter is γ;Initialize i=1;
Step 9.2: with i-th of vehicle status parameters siThe input that network model is acted as current i-th, is worked as by described Preceding i-th of movement network model exports i-th of output valve μ (siμ);
With i-th of vehicle status parameters si, i-th of vehicle action parameter aiWith i-th of output valve μ of the movement network (siμ) as it is described it is current i-th evaluation network model input, by i-th of vehicle status parameters siWith i-th Vehicle action parameter aiI-th of output valve Q is exported by current i-th of evaluation network modeli(ai);By the movement net I-th of output valve μ (s of network modeliμ) by i-th of output valve Q of current i-th of evaluation network model outputi(μ(siμ));
With updated i-th of vehicle status parameters s 'iAs the input of current i-th of target action network model, I-th of output valve μ (s ' is exported by current i-th of target action network modeliμ′);
With updated i-th of vehicle status parameters s 'iWith i-th of output valve μ (s ' of target action network modeliμ′) As the input of current i-th of objective appraisal network model, by current i-th of objective appraisal network model output the I output valve Q 'i(a′i);
According to i-th of output valve Q of current i-th of evaluation network modeli(μ(siμ)) Utilization strategies gradient method is to described Current i-th of movement network model is updated, to obtain the updated movement network model of i-th and as i+1 Act network model;
The output Q of network model is evaluated according to current i-thi(ai) and current i-th of objective appraisal network model is defeated Q ' outi(a′i), current i-th of evaluation network model is updated using loss function is minimized, to obtain i-th Updated evaluation network model simultaneously evaluates network model as i+1;
Step 9.3: after i+1 is assigned to i, judge whether i > N is true, if so, it then indicates to obtain optimal movement network model With optimal evaluation network model, otherwise, return step 9.2 is executed.
CN201910809910.7A 2019-08-29 2019-08-29 Intelligent automobile stability control method based on deep reinforcement learning Active CN110450771B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910809910.7A CN110450771B (en) 2019-08-29 2019-08-29 Intelligent automobile stability control method based on deep reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910809910.7A CN110450771B (en) 2019-08-29 2019-08-29 Intelligent automobile stability control method based on deep reinforcement learning

Publications (2)

Publication Number Publication Date
CN110450771A true CN110450771A (en) 2019-11-15
CN110450771B CN110450771B (en) 2021-03-09

Family

ID=68489893

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910809910.7A Active CN110450771B (en) 2019-08-29 2019-08-29 Intelligent automobile stability control method based on deep reinforcement learning

Country Status (1)

Country Link
CN (1) CN110450771B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110843746A (en) * 2019-11-28 2020-02-28 的卢技术有限公司 Anti-lock brake control method and system based on reinforcement learning
CN111605542A (en) * 2020-05-06 2020-09-01 南京航空航天大学 Vehicle stability system based on safety boundary and control method
CN111746633A (en) * 2020-07-02 2020-10-09 南京航空航天大学 Vehicle distributed steering driving system control method based on reinforcement learning
CN111873991A (en) * 2020-07-22 2020-11-03 中国第一汽车股份有限公司 Vehicle steering control method, device, terminal and storage medium
CN112861269A (en) * 2021-03-11 2021-05-28 合肥工业大学 Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction
CN113386790A (en) * 2021-06-09 2021-09-14 扬州大学 Automatic driving decision-making method for cross-sea bridge road condition
CN115123159A (en) * 2022-06-27 2022-09-30 重庆邮电大学 AEB control method and system based on DDPG deep reinforcement learning

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4183667A1 (en) * 2021-11-18 2023-05-24 Volvo Truck Corporation Method for closed loop control of a position of a fifth wheel of a vehicle

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1439556A (en) * 2002-02-23 2003-09-03 株式会社万都 Control method for vehicle running stability
US20040060765A1 (en) * 2002-09-27 2004-04-01 Ford Global Technologies, Inc. Yaw control for an automotive vehicle using steering actuators
JP4021185B2 (en) * 2001-12-07 2007-12-12 本田技研工業株式会社 Yaw moment feedback control method
DE102011010491A1 (en) * 2011-02-07 2012-08-09 Audi Ag Method for activating electronics stability control device of e.g. all-wheel driven motor car, involves determining function such that function value increases with increasing lateral force of front and rear wheels
CN105253141A (en) * 2015-09-09 2016-01-20 北京理工大学 Wheel longitudinal force regulation-based vehicle handling stability control method
CN106828464A (en) * 2017-01-06 2017-06-13 合肥工业大学 A kind of vehicle body stable control method and system based on coefficient of road adhesion estimation

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4021185B2 (en) * 2001-12-07 2007-12-12 本田技研工業株式会社 Yaw moment feedback control method
CN1439556A (en) * 2002-02-23 2003-09-03 株式会社万都 Control method for vehicle running stability
US20040060765A1 (en) * 2002-09-27 2004-04-01 Ford Global Technologies, Inc. Yaw control for an automotive vehicle using steering actuators
DE102011010491A1 (en) * 2011-02-07 2012-08-09 Audi Ag Method for activating electronics stability control device of e.g. all-wheel driven motor car, involves determining function such that function value increases with increasing lateral force of front and rear wheels
CN105253141A (en) * 2015-09-09 2016-01-20 北京理工大学 Wheel longitudinal force regulation-based vehicle handling stability control method
CN106828464A (en) * 2017-01-06 2017-06-13 合肥工业大学 A kind of vehicle body stable control method and system based on coefficient of road adhesion estimation

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110843746A (en) * 2019-11-28 2020-02-28 的卢技术有限公司 Anti-lock brake control method and system based on reinforcement learning
CN111605542A (en) * 2020-05-06 2020-09-01 南京航空航天大学 Vehicle stability system based on safety boundary and control method
CN111746633A (en) * 2020-07-02 2020-10-09 南京航空航天大学 Vehicle distributed steering driving system control method based on reinforcement learning
CN111873991A (en) * 2020-07-22 2020-11-03 中国第一汽车股份有限公司 Vehicle steering control method, device, terminal and storage medium
CN111873991B (en) * 2020-07-22 2022-04-08 中国第一汽车股份有限公司 Vehicle steering control method, device, terminal and storage medium
CN112861269A (en) * 2021-03-11 2021-05-28 合肥工业大学 Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction
CN112861269B (en) * 2021-03-11 2022-08-30 合肥工业大学 Automobile longitudinal multi-state control method based on deep reinforcement learning preferential extraction
CN113386790A (en) * 2021-06-09 2021-09-14 扬州大学 Automatic driving decision-making method for cross-sea bridge road condition
CN113386790B (en) * 2021-06-09 2022-07-12 扬州大学 Automatic driving decision-making method for cross-sea bridge road condition
CN115123159A (en) * 2022-06-27 2022-09-30 重庆邮电大学 AEB control method and system based on DDPG deep reinforcement learning

Also Published As

Publication number Publication date
CN110450771B (en) 2021-03-09

Similar Documents

Publication Publication Date Title
CN110450771A (en) A kind of intelligent automobile stability control method based on deeply study
US8718872B2 (en) Vehicle attitude controller
CN111267835B (en) Four-wheel independent drive automobile stability control method based on model prediction algorithm
CN109976159A (en) Intelligent vehicle crosswise joint method based on safely controllable domain
US10059344B2 (en) Vehicle behavior control apparatus
Goodarzi et al. Automatic path control based on integrated steering and external yaw-moment control
CN106461052A (en) Method and system for controlling stability and yaw response of a vehicle by locking a differential at elevated speeds
CN101189149A (en) Vehicle braking/driving force control apparatus
CN108773376A (en) A kind of the automobile objective layered Collaborative Control and optimization method of fusion driving intention
CN105936273A (en) Vehicle active torque inter-wheel and inter-axis distribution method
CN102421655A (en) Specification information estimating device and vehicle
Jalali Stability control of electric vehicles with in-wheel motors
JP2014144681A (en) Vehicular driving force control unit
CN114802177A (en) Torque distribution system for redistributing torque between axles of vehicle
Lu et al. From vehicle stability control to intelligent personal minder: Real-time vehicle handling limit warning and driver style characterization
Maeda et al. Performance of driver-vehicle system in emergency avoidance
CN110239499B (en) Vehicle control device and vehicle control method
Zheng et al. Comparison of active front wheel steering and differential braking for yaw/roll stability enhancement of a coach
Tamaddoni et al. Optimal vehicle stability control design based on preview game theory concept
US8442736B2 (en) System for enhancing cornering performance of a vehicle controlled by a safety system
CN106672072A (en) Control method for steer-by-wire automobile active front-wheel steering control system
Peng et al. Integrated vehicle control based on tire force reserve optimization concept
CN109747710B (en) Method for controlling a rear axle steering system of a motor vehicle
Xiao The research of fuzzy variable transmission ratio for steer-by-wire system of electric forklift
JP5018051B2 (en) Vehicle driving force control device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant