CN112590774A - Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning - Google Patents
Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning Download PDFInfo
- Publication number
- CN112590774A CN112590774A CN202011530836.4A CN202011530836A CN112590774A CN 112590774 A CN112590774 A CN 112590774A CN 202011530836 A CN202011530836 A CN 202011530836A CN 112590774 A CN112590774 A CN 112590774A
- Authority
- CN
- China
- Prior art keywords
- wheel
- vehicle
- drifting
- reinforcement learning
- rear wheel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000002787 reinforcement Effects 0.000 title claims abstract description 32
- 230000008569 process Effects 0.000 claims abstract description 28
- 230000009471 action Effects 0.000 claims abstract description 17
- 230000008859 change Effects 0.000 claims abstract description 10
- 238000012549 training Methods 0.000 claims description 18
- 238000012546 transfer Methods 0.000 claims description 13
- 230000001133 acceleration Effects 0.000 claims description 12
- 229920006395 saturated elastomer Polymers 0.000 claims description 12
- 230000006870 function Effects 0.000 claims description 7
- 238000013528 artificial neural network Methods 0.000 claims description 5
- 238000004364 calculation method Methods 0.000 claims description 4
- 238000005096 rolling process Methods 0.000 claims description 3
- 230000000694 effects Effects 0.000 claims description 2
- 230000005484 gravity Effects 0.000 claims description 2
- 239000000126 substance Substances 0.000 claims description 2
- 238000013461 design Methods 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010924 continuous production Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000003137 locomotive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W30/00—Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units
- B60W30/06—Automatic manoeuvring for parking
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0019—Control system elements or transfer functions
- B60W2050/0028—Mathematical models, e.g. for simulation
- B60W2050/0031—Mathematical model of the vehicle
- B60W2050/0034—Multiple-track, 2D vehicle model, e.g. four-wheel model
Landscapes
- Engineering & Computer Science (AREA)
- Automation & Control Theory (AREA)
- Transportation (AREA)
- Mechanical Engineering (AREA)
- Human Computer Interaction (AREA)
- Control Of Driving Devices And Active Controlling Of Vehicle (AREA)
Abstract
The invention relates to an intelligent electric automobile drifting storage control method based on deep reinforcement learning, which comprises the following steps of: 1) constructing a vehicle dynamics model for deep reinforcement learning and a tire model under a tire force saturation working condition; 2) and a TD3 algorithm facing the drifting storage control is adopted to realize the drifting storage of the intelligent electric automobile. Compared with the prior art, the invention has high control precision and good robustness, can ensure that the vehicle can accurately finish the drifting and warehousing actions, can ensure that the vehicle can accurately reach the warehouse location by continuously adjusting the steering wheel angle in the drifting process, and can actively change the central position of the warehouse location in the drifting process of the vehicle so as to ensure that the vehicle drifts towards the updated warehouse location.
Description
Technical Field
The invention relates to the field of automobile warehousing control, in particular to an intelligent electric automobile drifting warehousing control method based on deep reinforcement learning.
Background
The vehicle continues to run with the rear tires saturated and the rear axle slipping, called drift, in two different drift states:
(1) the rear axle drives and the rear wheel rotates in a sliding mode, the mass center slip angle of the vehicle and the vehicle speed can be kept at a constant value by controlling the driving force of the rear axle and the steering angle of the front wheel, so that the vehicle is in a stable state, and most of vehicles in the market are driven by the front axle, so that the research value of the drifting action in the state is relatively small.
(2) The method comprises the following steps that a drifting action is reproduced according to an open-loop control law and possibly interfered by an external environment and a self-vehicle state, so that a vehicle cannot drift to stop at a storage position, for example, a lateral displacement error and a course angle error exist in the process of approaching the storage position, the vehicle does not completely meet a preset drifting trigger pose state when the drifting action is triggered, a certain deviation exists, and the deviation is reserved until the drifting is finished according to the completion of the drifting action of an open-loop controller; in addition, because the response of the bottom-layer actuator is limited, the open-loop control cannot ensure that the response of the actuator is consistent every time, and the vehicle can deviate from a preset drift track when the response deviates; non-uniformity in the road surface causes sudden changes in tire force during the drift process, which changes the drift path.
Disclosure of Invention
The invention aims to overcome the defects in the prior art and provide an intelligent electric vehicle drifting and warehousing control method based on deep reinforcement learning.
The purpose of the invention can be realized by the following technical scheme:
1. an intelligent electric vehicle drifting and warehousing control method based on deep reinforcement learning is characterized by comprising the following steps:
1) constructing a vehicle dynamics model for deep reinforcement learning and a tire model under a tire force saturation working condition;
2) and a TD3 algorithm facing the drifting storage control is adopted to realize the drifting storage of the intelligent electric automobile.
In the step 1), the vehicle dynamics model is specifically to consider front-back and left-right load transferThe four-wheel three-degree-of-freedom vehicle dynamics model comprises the speed v at the position of the mass center of the vehiclemA centroid slip angle β and a yaw rate ω.
In the four-wheel three-degree-of-freedom vehicle dynamics model, the expression of four-wheel vertical force considering longitudinal and lateral acceleration is as follows:
in the formula, hmIs the height of the center of mass, bf、brA front and rear track width of ax、ayLongitudinal and lateral accelerations at the centre of mass, without taking account of the effect of the rotation of the body, FzFL、FzFR、FzRL、FzRRThe vertical forces of the left front wheel, the right front wheel, the left rear wheel and the right rear wheel are respectively, m is the mass of the electric automobile, g is the gravity acceleration, l is the wheelbase and l is the vertical forcef、lrDistance of front and rear axes to centre of mass, FxFL、FxFR、FxRL、FxRRRespectively a left front vehicle, a right front vehicle, a left rear vehicle and a right rear vehicleLongitudinal force of the wheel, FyFL、FyFR、FyRL、FyRRThe lateral forces of the left front wheel, the right front wheel, the left rear wheel and the right rear wheel are respectively, and delta is the corner of the front wheel.
In the drifting process, considering that the load transfer is too large to cause a certain wheel to be lifted off the ground, the situation that the vertical load of the wheel is reduced to 0 and the load transfer reaches the upper limit occurs, when the steering wheel drifts to the left, the load transfers to the right, and when the left rear wheel is lifted off the ground, the vertical force of the left rear wheel is 0, at the moment, the load which is transferred too much is redistributed to the left front wheel and the right rear wheel according to the longitudinal lateral acceleration, the wheelbase and the wheelbase, and then the following steps are carried out:
ΔFtrans=|FzRL|
F′zRL=0
wherein, Δ FtransIs an excessively transferred load, F'zRLIs assigned a vertical force, F ', of the rear left rear wheel'zRRIs distributed as a vertical force, F'zFLTo distribute the vertical force of the rear left front wheel.
The four-wheel three-degree-of-freedom vehicle dynamics model considering front-back and left-right load transfer is subjected to stress analysis, and the obtained vehicle dynamics balance equation is as follows:
φ=β+ψ
the longitudinal speed v of the vehicle is obtained by calculation according to the abovemxAnd lateral vehicle speed vmyThen, there are:
vmx=vm·cosβ
vmy=vm·sinβ
wherein the content of the first and second substances,is the rate of change of speed at the vehicle's center of mass,is the centroid side slip angular velocity, phi is the global azimuth angle of the vehicle speed at the centroid,is the global azimuth velocity of the vehicle speed at the centroid,is the rate of change of yaw angular velocity, psi is the global heading angle of the vehicle, IzIs a yaw moment of inertia, vxFor the longitudinal speed of the vehicle, vyIs the vehicle lateral speed.
In the step 1), the tire model for deep reinforcement learning training comprises a front wheel tire force model and a rear wheel tire force model.
For the rear wheel tire force model, in the drifting process, the rear wheel is braked and locked and is purely rubbed on the road surface, the tire force direction of the rear wheel is opposite to the direction of the wheel center instantaneous speed, and the expression of the longitudinal and lateral tire force components of the rear wheel obtained by carrying out stress analysis on the rear wheel is as follows:
for the left rear wheel:
for the right rear wheel:
Fr_sat=μ1Fz
wherein v isxRL、vyRLRespectively the longitudinal and lateral speeds at the wheel center of the left and the rear wheels vxRR、vyRRRespectively the longitudinal and lateral speeds, lambda, at the wheel center of the right rear wheelL、λRAre respectively the wheel center slip angles of the left and right rear wheels FxRL、FyRLLongitudinal and lateral forces of the left rear wheel, FxRR、FyRRLongitudinal and lateral forces of the right rear wheel, respectively, FrRL_sat、FrRR_satRespectively, the horizontal saturated tire force of the left and right rear wheels, Fr_satRepresents the corresponding wheel horizontal saturated tire force, mu1For road surface utilization of the coefficient of adhesion, F, when the wheels are lockedzIndicating the vertical force of the corresponding wheel.
For the front wheel tire force model, in the drifting process, the front wheel tire force is not saturated, and the improved Burckhardt tire model is adopted to fit the tire force to express the relation between the lateral force and the slip angle, and the following steps are provided:
wherein, theta1~θ5As a fitting parameter, α is a front wheel side slip angle;
left wheel slip angle alphaLAnd a right wheel side slip angle alphaRCan be obtained by the following formula:
the front wheel is in a free rolling state without applying braking force and driving force, and has FxFL=0,FxFRWhen determining the front wheel tire force direction, only the lateral forces are taken into account, which is then perpendicular to the tire plane and is determined by the front wheel steering angle.
The step 2) specifically comprises the following steps:
21) designing a TD3 algorithm for drift storage control, and constructing an Actor network and a Critic network, wherein the method specifically comprises the following steps:
the Critic network and the Actor network are both BP neural networks composed of full connection layers, the input of the Critic network is the vehicle state and action, the output is Q value, the input of the Actor network is the vehicle state, the output is action, the vehicle state is a parameter representing the vehicle state in the drifting process, and the parameter comprises the base coordinates (e) under a relative coordinate system with the vehicle mass center as the origin and the locomotive orientation as the positive direction of the y axisx、ey) And the direction of the storage locationVelocity v at the center of mass of the vehiclemThe mass center slip angle beta and the yaw angular velocity omega, wherein the action is the steering wheel rotation angle;
22) constructing a reward function r (k) having:
23) and training the Actor network and the Critic network, and finishing the drifting and warehousing of the intelligent electric automobile.
In step 23), before training the Actor network and Critic network, determining the boundary of the drifting warehousing controller, randomly taking a value of the target base position of each vehicle drifting according to the boundary, in iterative training, calculating the vehicle state by the vehicle according to the randomly selected target base position and orientation, training the Critic network and the Actor network according to the value, randomly updating the target base position in the training process, expanding the training data set and improving the capacity.
Compared with the prior art, the invention has the following advantages:
the intelligent electric automobile drifting and warehousing control method is designed based on the deep reinforcement learning TD3 algorithm, the control precision is improved, the problem that errors exist in drifting and warehousing due to the fact that the road surface is not uniform is solved, the center point of the warehouse position can be changed, vehicles can move to the updated warehouse position, and the robustness of a control system is improved.
And secondly, in the process of drifting and warehousing, the vehicle can adjust the pose by continuously adjusting the angle of the steering wheel, so that the vehicle can accurately drift and warehouse.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Fig. 2 is a diagram illustrating the state parameter definition of a part of the drift process.
Fig. 3 is a flow chart of a drift control algorithm based on deep reinforcement learning.
Detailed Description
The invention is described in detail below with reference to the figures and specific embodiments.
As shown in fig. 1, the invention provides an intelligent electric vehicle drifting and warehousing control method based on deep reinforcement learning, which comprises the following steps:
1) the method comprises the following steps of building a vehicle dynamics model and a tire model for deep reinforcement learning training, and specifically comprises the following steps:
11) building vehicle dynamics model for deep reinforcement learning
Four-wheel three-degree-of-freedom vehicle dynamics model considering front-back and left-right load transfer, wherein three degrees of freedom are respectively the speed v at the position of the vehicle mass centermThe size beta of the centroid slip angle and the yaw angular velocity omega;
since the longitudinal lateral acceleration of the vehicle is large during the drift, the influence of the front-rear and left-right load transfer of the vehicle on the vertical force of the tire must be considered. The four-wheel vertical force calculation formula considering the longitudinal and lateral acceleration is as follows (1):
in the formula, hmIs the height of the center of mass, bf、brA front and rear track width, ax、ayThe longitudinal lateral acceleration without considering the influence of the rotation of the vehicle body at the centroid is obtained by the following formula (2):
in the drifting process, the situation that one wheel is lifted off the ground due to overlarge load transfer, the vertical load of the corresponding wheel is reduced to 0, and the load transfer reaches the upper limit needs to be considered. Due to the tail-flick braking process, the load is transferred towards the front axle, so only the possibility of the rear wheels coming off the ground is considered. If the steering wheel drifts to the left and the load is transferred to the right, there is a possibility that the left rear wheel will lift off the ground. When F is obtained by calculation according to a formulazRLWhen the wheel vertical force is less than 0, the wheel vertical force is made to be 0, and the excessively transferred load is redistributed to the left front wheel and the right rear wheel according to the longitudinal lateral acceleration, the wheel base and the wheel base, and the formula is expressed as follows:
and (3) carrying out stress analysis on the vehicle model to obtain a vehicle dynamic balance equation as follows:
in the formula, delta is a front wheel corner; phi is the global azimuth angle of the vehicle speed at the centroid,is the rate of change of velocity direction; psi is the global azimuth angle of the vehicle head,the vehicle yaw angular velocity, namely the change rate of the direction of the vehicle head; according to phi ═ beta + psi and the integral of the above formula, the mass center speed v of the vehicle at each moment can be obtainedmThe mass center slip angle beta and the yaw angular velocity omega, and then the longitudinal and lateral vehicle speed of the vehicle is obtained according to the formula (8):
12) building tire model for deep reinforcement learning under tire force saturation working condition
Different from the driving conditions of the conventional working conditions, the rear wheel tire force is saturated during drifting, the lateral speed and the mass center lateral deviation angle of the vehicle body are large, and the longitudinal lateral vehicle speed is in a state of rapid change, so that the vehicle system is a strong nonlinear time-varying system with high longitudinal-lateral coupling, and the vehicle real-time saturated tire force is obtained by the formula (9):
Fr_sat=μ1Fz (9)
in the formula (I), the compound is shown in the specification,for horizontal tire force resultant, μ1For road surface utilization of the coefficient of adhesion, mu, when the wheels are locked1=0.9μmaxI.e. a peak adhesion coefficient of 0.9 times, the peak adhesion coefficient being 1.
121) Rear wheel tire force model
In the process of locking and braking of the rear axle, the tire force is saturated, no matter how the magnitude of the slip angle changes, the magnitude of the longitudinal and lateral force resultant force is unchanged, the change of the slip angle can be ignored when the horizontal tire force of the rear wheel in the drifting process is solved, and the tire force of the rear axle in the drifting state can be directly solved.
Since the rear wheel brakes are locked and the wheel is purely frictional on the road, the tire force direction is determined by the wheel center speed direction, i.e. the tire force direction is opposite to the direction of the wheel center instantaneous speed. And (3) carrying out stress analysis on the rear wheel in the drifting process to obtain an expression of the longitudinal and lateral tire force component of the rear wheel:
left rear wheel:
the right rear wheel:
in the formula, vxRL、vyRLRespectively the longitudinal and lateral speeds at the wheel center of the left and the rear wheels vxRR,vyRRRespectively the longitudinal and lateral speeds of the wheel center of the right rear wheel; lambda [ alpha ]L、λRRespectively are the wheel center slip angles of the left and right rear wheels; fxRL、FyRLAre respectively asLongitudinal and lateral forces of the left and rear wheels, FxRR、FyRRRespectively the longitudinal force and the lateral force of the right rear wheel; frRL_sat、FrRR_satThe horizontal saturated tire force of the left and right rear wheels is respectively.
122) Front wheel tyre force model
The tire force of the front wheel is not saturated, the longitudinal side of the front wheel is decoupled, and the tire lateral force is obtained by adopting a tire model suitable for the quasi-static working condition. Fitting the tire force by adopting an improved Burckhardt tire model, and expressing the relation between the lateral force and the slip angle, wherein the following steps are carried out:
in the formula, theta1~θ5As a fitting parameter, α is a front wheel slip angle, and left and right wheel slip angles can be obtained by equations (13) and (14).
Since no braking force or driving force is applied, the front wheel is considered to be in a free rolling state, and the wheel longitudinal force is approximately 0, i.e., FxFL=0,F xFR0. Only the lateral forces are considered in determining the front wheel tire force direction, so the front wheel tire force direction is perpendicular to the tire plane, determined by the front wheel steering angle.
2) TD3 algorithm design facing drift warehousing control.
In the drifting process, a deep reinforcement learning algorithm is adopted, an established drifting vehicle dynamics model is taken as a basis, and accurate drifting and warehousing of the vehicle are realized according to an end-to-end drifting controller, and the method specifically comprises the following steps:
in the TD3 algorithm, the input of a Critic network is the vehicle state and action, and the output is a Q value; the input of the Actor network is the vehicle state, and the output is the action, namely the steering wheel angle;
and selecting parameters representing the vehicle state in the drifting process as the input of the Critic network and the Actor network, wherein the parameters can uniquely represent the vehicle state at a certain moment in drifting and have dynamic correlation with the steering wheel angle input value. The 6 state parameters are: relative coordinate system lower position coordinate e taking vehicle mass center as origin and vehicle head orientation as positive y-axis directionx、eyAnd the direction of the storage locationResultant velocity v of longitudinal lateral vehicle velocity of vehiclemA centroid slip angle β and a yaw rate ω. e.g. of the typex、eyAndreflecting the difference between the current time position and the course angle of the vehicle and the expected state in the drift process, as shown in FIG. 2, vmBeta and omega characterize the rate of change of the first three.
After the deep neural network trained by the reinforcement learning algorithm is determined, the reward function is designed so as to calculate reward values corresponding to different states of the vehicle in the drifting process. The reward function is designed as follows:
in the formula, wx、wy、Are each ex、eyAndthe weight of (c). Because the displacement error from the center of the storage position and the heading angle error from the orientation of the storage position when the vehicle is stopped stably are concerned, the third power of the vehicle speed is put in the denominator term, so that the lower the vehicle speed is and the closer the vehicle is to the stop, the longitudinal side of the vehicle can be made to be verticalThe larger the absolute value of the reward value calculated by the displacement error and the course angle error is. According to the algorithm principle, when the vehicle finally stops at a position far away from the storage position, a very small reward value is calculated; when the vehicle stops near the center of the garage, the calculated reward value is close to 0, and the target Q value corresponding to the preamble state and the action is larger. The Actor network maximizes the Q value as much as possible when calculating the steering wheel angle according to the vehicle state, so that the vehicle is finally parked in a garage. When the reward function is designed, the controlled quantity is put into the reward function, but in the process of drifting and warehousing, the steering wheel angle is always adjusted, which is a continuous process, the influence of one-time steering on the result of drifting and warehousing cannot be defined, and therefore the weight coefficient is set to be 0.
Before network training, the "boundary" of the on-board drift-in-garage controller is first determined, and it is believed that the vehicle's final position and final heading angle will not exceed this boundary regardless of the applied steering wheel angle.
And randomly taking values of the target storage position of each vehicle drift according to the controller boundary. When a complete drift process is finished, a random target library position (X) is setaim,Yaim) And orientation psiaimAnd satisfies the constraints of the controller boundaries described above.
In the iterative training, the vehicle calculates the vehicle state e with the target garage position and orientationx、eyAndthe Critic network and the Actor network are trained accordingly, the position of the target library position is randomly updated in the training process, the training data set is expanded, and the generalization capability of the network can be improved.
Examples
In this embodiment, the method for controlling the drift storage implemented according to the above method specifically includes:
step one, building a four-wheel three-degree-of-freedom vehicle dynamics model based on drifting and warehousing of depth reinforcement learning and building a tire model under a tire force saturation working condition. Considering front and rear and left and right loadsA four-wheel three-degree-of-freedom vehicle dynamic model for load transfer. The three degrees of freedom are respectively: velocity v at the vehicle's center of massmThe size beta of the centroid slip angle and the yaw angular velocity omega.
And step two, performing Critic network and Actor network design and reward function design on the basis of the vehicle dynamics model of the deep reinforcement learning. The input of the Critic network is the vehicle state and action, and the output is the Q value; the input of the Actor network is the vehicle state, and the output is the action. The number of input quantity and output quantity is small, the corresponding relation is simple, a Critic network and an Actor network are built by adopting a BP neural network consisting of full connection layers, and a drift control algorithm flow based on deep reinforcement learning is shown in FIG. 3.
Claims (10)
1. An intelligent electric vehicle drifting and warehousing control method based on deep reinforcement learning is characterized by comprising the following steps:
1) constructing a vehicle dynamics model for deep reinforcement learning and a tire model under a tire force saturation working condition;
2) and a TD3 algorithm facing the drifting storage control is adopted to realize the drifting storage of the intelligent electric automobile.
2. The intelligent electric vehicle drifting storage control method based on deep reinforcement learning of claim 1, wherein in step 1), the vehicle dynamics model is a four-wheel three-degree-of-freedom vehicle dynamics model considering front-back and left-right load transfer, and the three degrees of freedom include speed v at vehicle mass centermA centroid slip angle β and a yaw rate ω.
3. The intelligent electric vehicle drifting warehousing control method based on deep reinforcement learning of claim 2 is characterized in that in the four-wheel three-degree-of-freedom vehicle dynamics model, an expression of four-wheel vertical force considering longitudinal and lateral acceleration is as follows:
in the formula, hmIs the height of the center of mass, bf、brA front and rear track width of ax、ayLongitudinal and lateral accelerations at the centre of mass, without taking account of the effect of the rotation of the body, FzFL、FzFR、FzRL、FzRRThe vertical forces of the left front wheel, the right front wheel, the left rear wheel and the right rear wheel are respectively, m is the mass of the electric automobile, g is the gravity acceleration, l is the wheelbase and l is the vertical forcef、lrDistance of front and rear axes to centre of mass, FxFL、FxFR、FxRL、FxRRThe longitudinal forces of the left front wheel, the right front wheel, the left rear wheel and the right rear wheel respectively, FyFL、FyFR、FyRL、FyRRThe lateral forces of the left front wheel, the right front wheel, the left rear wheel and the right rear wheel are respectively, and delta is the corner of the front wheel.
4. The intelligent electric vehicle drifting storage control method based on deep reinforcement learning of claim 3 is characterized in that in the drifting process, considering that a certain wheel is lifted off the ground due to excessive load transfer, the situation that the vertical load of the wheel is reduced to 0 and the load transfer reaches the upper limit occurs, when a steering wheel drifts to the left side, the load is transferred to the right side, and when a left rear wheel is lifted off the ground, the vertical force of the left rear wheel is 0, at this time, the excessively transferred load is redistributed to the left front wheel and the right rear wheel according to the longitudinal lateral acceleration, the wheelbase and the wheelbase, and the following steps are included:
ΔFtrans=|FzRL|
F′zRL=0
wherein, Δ FtransIs an excessively transferred load, F'zRLIs assigned a vertical force, F ', of the rear left rear wheel'zRRIs distributed as a vertical force, F'zFLTo distribute the vertical force of the rear left front wheel.
5. The intelligent electric vehicle drifting storage control method based on deep reinforcement learning of claim 4 is characterized in that a four-wheel three-degree-of-freedom vehicle dynamics model considering front-rear and left-right load transfer is subjected to stress analysis, and a vehicle dynamics balance equation is obtained by:
φ=β+ψ
the longitudinal speed v of the vehicle is obtained by calculation according to the abovemxAnd lateral vehicle speed vmyThen, there are:
vmx=vm·cosβ
vmy=vm·sinβ
wherein the content of the first and second substances,is the rate of change of speed at the vehicle's center of mass,is the centroid side slip angular velocity, phi is the global azimuth angle of the vehicle speed at the centroid,is the global azimuth velocity of the vehicle speed at the centroid,is the rate of change of yaw angular velocity, psi is the global heading angle of the vehicle, IzIs a yaw moment of inertia, vxFor the longitudinal speed of the vehicle, vyIs the vehicle lateral speed.
6. The intelligent electric vehicle drifting-in control method based on deep reinforcement learning of claim 5, characterized in that in step 1), the tire models used for deep reinforcement learning training include a front tire force model and a rear tire force model.
7. The intelligent electric vehicle drifting storage control method based on deep reinforcement learning of claim 6 is characterized in that, for the rear wheel tire force model, in the drifting process, the rear wheel is locked in a braking mode and is purely rubbed on the road surface, the direction of the tire force of the rear wheel is opposite to the direction of the instantaneous speed of the wheel center of the wheel, and the expression of the longitudinal lateral tire force component of the rear wheel obtained by performing stress analysis on the rear wheel is as follows:
for the left rear wheel:
for the right rear wheel:
Fr_sat=μ1Fz
wherein v isxRL、vyRLRespectively the longitudinal and lateral speeds at the wheel center of the left and the rear wheels vxRR、vyRRRespectively the longitudinal and lateral speeds, lambda, at the wheel center of the right rear wheelL、λRAre respectively the wheel center slip angles of the left and right rear wheels FxRL、FyRLLongitudinal and lateral forces of the left rear wheel, FxRR、FyRRLongitudinal and lateral forces of the right rear wheel, respectively, FrRL_sat、FrRR_satRespectively, the horizontal saturated tire force of the left and right rear wheels, Fr_satRepresents the corresponding wheel horizontal saturated tire force, mu1For road surface utilization of the coefficient of adhesion, F, when the wheels are lockedzIndicating the vertical force of the corresponding wheel.
8. The intelligent electric vehicle drifting warehousing control method based on deep reinforcement learning of claim 7 is characterized in that for a front wheel tire force model, in the drifting process, if the front wheel tire force is not saturated yet, the tire force is fitted by adopting an improved Burckhardt tire model to express the relation between the lateral force and the slip angle, and the following steps are provided:
wherein, theta1~θ5As a fitting parameter, α is a front wheel side slip angle;
left wheel slip angle alphaLAnd a right wheel side slip angle alphaRCan be obtained by the following formula:
the front wheel is in a free rolling state without applying braking force and driving force, and has FxFL=0,FxFRWhen determining the front wheel tire force direction, only the lateral forces are taken into account, which is then perpendicular to the tire plane and is determined by the front wheel steering angle.
9. The intelligent electric vehicle drifting-in control method based on deep reinforcement learning according to claim 8, wherein the step 2) specifically comprises the following steps:
21) designing a TD3 algorithm for drift storage control, and constructing an Actor network and a Critic network, wherein the method specifically comprises the following steps:
the Critic network and the Actor network are both BP neural networks composed of full connection layers, the input of the Critic network is the vehicle state and action, the output is the Q value, the input of the Actor network is the vehicle state, the output is action, the Critic network and the Actor network are the BP neural networks composed of full connection layers, the input of the Actor network is the vehicle state, the output of the Actor network is action, the CriticThe vehicle state is a parameter for representing the vehicle state in the drifting process, and comprises a lower library position coordinate (e) of a relative coordinate system with the vehicle mass center as an origin and the head orientation as the positive direction of the y axisx、ey) And the direction of the storage locationVelocity v at the center of mass of the vehiclemThe mass center slip angle beta and the yaw angular velocity omega, wherein the action is the steering wheel rotation angle;
22) constructing a reward function r (k) having:
23) and training the Actor network and the Critic network, and finishing the drifting and warehousing of the intelligent electric automobile.
10. The intelligent electric vehicle drifting and warehousing control method based on deep reinforcement learning of claim 9 is characterized in that in step 23), before training an Actor network and a Critic network, a boundary of a drifting and warehousing controller is determined, a random value is taken for a target position of a warehouse of vehicle drifting each time according to the boundary, in iterative training, a vehicle calculates a vehicle state according to a randomly selected target position and orientation, the Critic network and the Actor network are trained accordingly, and a training data set is expanded and the capacity is improved by randomly updating the target position in the training process.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011530836.4A CN112590774B (en) | 2020-12-22 | 2020-12-22 | Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011530836.4A CN112590774B (en) | 2020-12-22 | 2020-12-22 | Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112590774A true CN112590774A (en) | 2021-04-02 |
CN112590774B CN112590774B (en) | 2022-02-18 |
Family
ID=75200195
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011530836.4A Active CN112590774B (en) | 2020-12-22 | 2020-12-22 | Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112590774B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114852043A (en) * | 2022-03-23 | 2022-08-05 | 武汉理工大学 | HEV energy management method and system based on layered return TD3 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN205531568U (en) * | 2015-12-31 | 2016-08-31 | 肖全景 | Multi -functional device of parking |
CN206528518U (en) * | 2017-01-11 | 2017-09-29 | 陈生泰 | The system that wheel can be made to turn to any angle and the different steering angles of coordination with one another |
KR20180052983A (en) * | 2016-11-11 | 2018-05-21 | 현대자동차주식회사 | Control method for drift logic of vehicle equipped with epb |
CN108202726A (en) * | 2016-12-20 | 2018-06-26 | 通用汽车环球科技运作有限责任公司 | The vehicle braking mode driven for racing |
CN111890951A (en) * | 2020-08-07 | 2020-11-06 | 吉林大学 | Intelligent electric automobile trajectory tracking and motion control method |
-
2020
- 2020-12-22 CN CN202011530836.4A patent/CN112590774B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN205531568U (en) * | 2015-12-31 | 2016-08-31 | 肖全景 | Multi -functional device of parking |
KR20180052983A (en) * | 2016-11-11 | 2018-05-21 | 현대자동차주식회사 | Control method for drift logic of vehicle equipped with epb |
CN108202726A (en) * | 2016-12-20 | 2018-06-26 | 通用汽车环球科技运作有限责任公司 | The vehicle braking mode driven for racing |
CN206528518U (en) * | 2017-01-11 | 2017-09-29 | 陈生泰 | The system that wheel can be made to turn to any angle and the different steering angles of coordination with one another |
CN111890951A (en) * | 2020-08-07 | 2020-11-06 | 吉林大学 | Intelligent electric automobile trajectory tracking and motion control method |
Non-Patent Citations (2)
Title |
---|
E.JELAVIC等: "autonomous drift parking using a switched control strategy with onboard sensor", 《IFAC-PAPERSONLINE》 * |
张放: "极限工况下自动驾驶车辆的轨迹规划与运动控制", 《中国博士学位论文全文数据库(电子期刊)》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114852043A (en) * | 2022-03-23 | 2022-08-05 | 武汉理工大学 | HEV energy management method and system based on layered return TD3 |
CN114852043B (en) * | 2022-03-23 | 2024-06-18 | 武汉理工大学 | HEV energy management method and system based on layered return TD3 |
Also Published As
Publication number | Publication date |
---|---|
CN112590774B (en) | 2022-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
USRE40496E1 (en) | Attitude sensing system for an automotive vehicle relative to the road | |
US8649938B2 (en) | System, program product, and method for dynamic control of vehicle | |
US10384672B1 (en) | Vehicle stability control system | |
Hashemi et al. | Vehicle stability control: Model predictive approach and combined-slip effect | |
GB2434127A (en) | A method for dynamically determining axle loadings of a movingvehicle | |
JP6286091B1 (en) | Vehicle state estimation device, control device, suspension control device, and suspension device. | |
Tian et al. | Integrated control with DYC and DSS for 4WID electric vehicles | |
CN113460056B (en) | Vehicle road surface adhesion coefficient estimation method based on Kalman filtering and least square method | |
WO2022266824A1 (en) | Steering control method and apparatus | |
CN112668093A (en) | Optimal distribution control method for all-wheel longitudinal force of distributed driving automobile | |
JP2003146154A (en) | Vehicle state quantity estimation method | |
WO2018029914A1 (en) | Vehicle state quantity estimation device | |
CN109017805B (en) | Method for controlling stability of running system vehicle with uncertainty | |
CN111731267B (en) | Distributed electric vehicle stability control system and method equipped with non-inflatable elastic wheels | |
JP6360246B1 (en) | Vehicle state estimation device, control device, suspension control device, suspension device, steering control device, and steering device | |
CN113147420A (en) | Target optimization torque distribution method based on road adhesion coefficient identification | |
CN116061934A (en) | Architecture for managing chassis and driveline actuators and model-based predictive control method | |
CN112590774B (en) | Intelligent electric automobile drifting and warehousing control method based on deep reinforcement learning | |
CN113401112B (en) | Control method for re-stabilization of out-of-control vehicle | |
CN113264048B (en) | Vehicle, system and method for operating vehicle | |
JP2003231429A (en) | Action to route of vehicle by measured value of lateral force in consideration of load movement on both sides of vehicle symmetrical with respect to center thereof | |
JP6286092B1 (en) | Suspension control device and suspension device. | |
Leng et al. | Deep reinforcement learning-based drift parking control of automated vehicles | |
CN114148411B (en) | Drift control method of wheeled unmanned platform | |
Li et al. | Trajectory following control for automated drifting of 4WID vehicles |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |