CN116278992A - Fuel cell automobile energy management method integrating information physical system - Google Patents

Fuel cell automobile energy management method integrating information physical system Download PDF

Info

Publication number
CN116278992A
CN116278992A CN202310271286.6A CN202310271286A CN116278992A CN 116278992 A CN116278992 A CN 116278992A CN 202310271286 A CN202310271286 A CN 202310271286A CN 116278992 A CN116278992 A CN 116278992A
Authority
CN
China
Prior art keywords
network
power
vehicle
power battery
fuel cell
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310271286.6A
Other languages
Chinese (zh)
Inventor
何洪文
李昆昂
贾淳淳
周稼铭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN202310271286.6A priority Critical patent/CN116278992A/en
Publication of CN116278992A publication Critical patent/CN116278992A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60LPROPULSION OF ELECTRICALLY-PROPELLED VEHICLES; SUPPLYING ELECTRIC POWER FOR AUXILIARY EQUIPMENT OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRODYNAMIC BRAKE SYSTEMS FOR VEHICLES IN GENERAL; MAGNETIC SUSPENSION OR LEVITATION FOR VEHICLES; MONITORING OPERATING VARIABLES OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRIC SAFETY DEVICES FOR ELECTRICALLY-PROPELLED VEHICLES
    • B60L58/00Methods or circuit arrangements for monitoring or controlling batteries or fuel cells, specially adapted for electric vehicles
    • B60L58/40Methods or circuit arrangements for monitoring or controlling batteries or fuel cells, specially adapted for electric vehicles for controlling a combination of batteries and fuel cells
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60LPROPULSION OF ELECTRICALLY-PROPELLED VEHICLES; SUPPLYING ELECTRIC POWER FOR AUXILIARY EQUIPMENT OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRODYNAMIC BRAKE SYSTEMS FOR VEHICLES IN GENERAL; MAGNETIC SUSPENSION OR LEVITATION FOR VEHICLES; MONITORING OPERATING VARIABLES OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRIC SAFETY DEVICES FOR ELECTRICALLY-PROPELLED VEHICLES
    • B60L50/00Electric propulsion with power supplied within the vehicle
    • B60L50/50Electric propulsion with power supplied within the vehicle using propulsion power supplied by batteries or fuel cells
    • B60L50/75Electric propulsion with power supplied within the vehicle using propulsion power supplied by batteries or fuel cells using propulsion power supplied by both fuel cells and batteries
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/04Traffic conditions
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/06Road conditions
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/06Road conditions
    • B60W40/072Curvature of the road
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • B60W40/02Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
    • B60W40/06Road conditions
    • B60W40/076Slope angle of the road
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0019Control system elements or transfer functions
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0019Control system elements or transfer functions
    • B60W2050/0028Mathematical models, e.g. for simulation
    • B60W2050/0031Mathematical model of the vehicle
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W50/00Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
    • B60W2050/0001Details of the control system
    • B60W2050/0019Control system elements or transfer functions
    • B60W2050/0028Mathematical models, e.g. for simulation
    • B60W2050/0037Mathematical models of vehicle sub-units
    • B60W2050/0039Mathematical models of vehicle sub-units of the propulsion unit
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2510/00Input parameters relating to a particular sub-units
    • B60W2510/24Energy storage means
    • B60W2510/242Energy storage means for electrical energy
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2510/00Input parameters relating to a particular sub-units
    • B60W2510/28Fuel cells
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2710/00Output or target parameters relating to a particular sub-units
    • B60W2710/24Energy storage means
    • B60W2710/242Energy storage means for electrical energy
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2710/00Output or target parameters relating to a particular sub-units
    • B60W2710/28Fuel cells

Landscapes

  • Engineering & Computer Science (AREA)
  • Transportation (AREA)
  • Mechanical Engineering (AREA)
  • Automation & Control Theory (AREA)
  • Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Sustainable Development (AREA)
  • Sustainable Energy (AREA)
  • Power Engineering (AREA)
  • Human Computer Interaction (AREA)
  • Electric Propulsion And Braking For Vehicles (AREA)

Abstract

The invention provides a fuel cell automobile energy management method integrating an information physical system, which not only considers the energy flow and consumption in the automobile system, but also considers the influence of future road topography and traffic conditions on the automobile energy management, and explores the optimal control in a feasible domain by combining a depth deterministic strategy gradient algorithm, thereby effectively avoiding discrete errors and improving the reliability of strategies. The invention realizes the information interaction between the vehicle system and the network layer through the information physical system, and brings future topographic information, traffic information, battery aging, fuel cell durability constraint, hydrogen consumption and the like obtained through the information physical system into the control frame, thereby having important practical significance for achieving the optimal balance of the system durability and the hydrogen consumption of the real vehicle.

Description

Fuel cell automobile energy management method integrating information physical system
Technical Field
The invention belongs to the technical field of energy management of fuel cell hybrid power systems, and particularly relates to an energy management method of a fuel cell automobile integrating an information physical system.
Background
At present, proton membrane exchange fuel cells are increasingly used by new energy automobiles, especially hybrid electric vehicles, due to the advantages of cleanliness, high energy efficiency and the like. However, the dynamic response of the fuel cell is slow, and the speed and power change during the running process of the vehicle are rapid and drastic, which makes the energy management of the fuel cell hybrid power system formed by matching with the power cell more difficult. In the energy management strategy for the fuel cell hybrid electric vehicle, the problems that the actual performance transition depends on the set conditions and the modeling process exist; although the power distribution of the vehicle power system itself or other high-power components on the vehicle has been comprehensively considered in many prior arts such as chinese patent publication No. CN113085665a, environmental factors outside the vehicle such as road topography, traffic conditions, etc. are not considered, and these factors tend to have a stronger influence on the energy consumption during the running of the vehicle according to practical experience, so that the existing energy management strategies for the fuel cell hybrid power system are not perfect, and still have a great room for improvement.
Disclosure of Invention
In view of the above, the present invention provides a fuel cell vehicle energy management method integrating information physical systems, which specifically includes the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Torque T of driving motor motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC;the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a depth deterministic strategy gradient (Deep Deterministic Policy Gradient, DDPG) algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
Figure BDA0004134750110000021
wherein p is 1 For the unit price of hydrogen per kilogram,
Figure BDA0004134750110000022
for hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
Further, the specific form of the automobile longitudinal dynamics model established in the second step is as follows:
Figure BDA0004134750110000023
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
Figure BDA0004134750110000024
Figure BDA0004134750110000025
wherein,,
Figure BDA0004134750110000026
for the instantaneous hydrogen consumption of the fuel cell system, +.>
Figure BDA0004134750110000027
For the heating value of hydrogen, < >>
Figure BDA0004134750110000028
The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
Figure BDA0004134750110000029
Figure BDA00041347501100000210
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
Figure BDA0004134750110000031
Figure BDA0004134750110000032
Figure BDA0004134750110000033
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is an ideal gas constant, T is the absolute temperature of the power battery, A (c) is the ampere-hour throughput of the power battery, and N (c) is the equivalent charge and discharge quantity of the power battery;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor
Further, the DDPG algorithm specifically comprises a actor network mu, a commentator network Q and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
Figure BDA0004134750110000034
Figure BDA0004134750110000035
wherein J (θ) μ ) As an objective function, θ μ As a function of the network parameters of the actor,
Figure BDA0004134750110000036
representing the gradient, E (·) is the mathematical expectation, η is the learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
Figure BDA0004134750110000041
Figure BDA0004134750110000042
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
According to the fuel cell automobile energy management method integrating the information physical system, provided by the invention, not only are the energy flow and consumption in the automobile system considered, but also the influence of future road topography and traffic conditions on the automobile energy management is considered, and the optimal control in the feasible domain is explored by combining the depth deterministic strategy gradient algorithm, so that the discrete error is effectively avoided, and the reliability of the strategy is improved. According to the invention, the information interaction between the vehicle system and the network layer is realized through the information physical system, and future topographic information obtained through the information physical system, battery aging, fuel cell durability constraint, hydrogen consumption and the like are brought into the control frame, so that the method has important practical significance for achieving the optimal balance of the system durability and the hydrogen consumption of the real vehicle.
Drawings
FIG. 1 is a flow chart of a method provided by the present invention;
FIG. 2 is an alternative topology of a fuel cell hybrid power system to which the present invention is applicable;
fig. 3 is a schematic block diagram of the DDPG algorithm.
Detailed Description
The following description of the embodiments of the present invention will be made apparent and fully in view of the accompanying drawings, in which some, but not all embodiments of the invention are shown. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The fuel cell automobile energy management method integrating the information physical system provided by the invention, as shown in figure 1, specifically comprises the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Driving motorTorque T motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC; the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a depth deterministic strategy gradient (Deep Deterministic Policy Gradient, DDPG) algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
Figure BDA0004134750110000051
wherein p is 1 For the unit price of hydrogen per kilogram,
Figure BDA0004134750110000052
for hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
Fig. 2 shows an alternative topology of a fuel cell hybrid system to which the method provided by the invention can be applied.
In a preferred embodiment of the present invention, the longitudinal dynamics model of the automobile established in the second step is in the specific form of:
Figure BDA0004134750110000053
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
Figure BDA0004134750110000061
Figure BDA0004134750110000062
wherein,,
Figure BDA0004134750110000063
for the instantaneous hydrogen consumption of the fuel cell system, +.>
Figure BDA0004134750110000064
For the heating value of hydrogen, < >>
Figure BDA0004134750110000065
The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
Figure BDA00041347501100000610
Figure BDA0004134750110000066
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
Figure BDA0004134750110000067
Figure BDA0004134750110000068
Figure BDA0004134750110000069
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is an ideal gas constant, T is the absolute temperature of the power battery, A (c) is the ampere-hour throughput of the power battery, and N (c) is the equivalent charge and discharge quantity of the power battery;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor
In a preferred embodiment of the present invention, as shown in fig. 3, the DDPG algorithm specifically includes a actor network μ, a reviewer network Q, and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
Figure BDA0004134750110000071
Figure BDA0004134750110000072
wherein J (θ) μ ) As an objective function, θ μ As a function of the network parameters of the actor,
Figure BDA0004134750110000073
representing the gradient, E (·) is the mathematical expectation, η is the learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
Figure BDA0004134750110000074
Figure BDA0004134750110000075
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
It should be understood that, the sequence number of each step in the embodiment of the present invention does not mean that the execution sequence of each process should be determined by the function and the internal logic of each process, and should not limit the implementation process of the embodiment of the present invention.
Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (3)

1. A fuel cell automobile energy management method integrating an information physical system is characterized in that: the method specifically comprises the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Torque T of driving motor motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC; the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a DDPG algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
r=p 1 ·[m H2 (t)]+p 2 ·Q bat ·ΔSOH+α·[SOC(t)-SOC tar ] 2 +β·|ΔP FC /ΔP FCmax |
wherein p is 1 For unit price per kilogram of hydrogen, m H2 For hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
2. The method of claim 1, wherein: the specific form of the automobile longitudinal dynamics model established in the second step is as follows:
Figure FDA0004134750100000021
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
Figure FDA0004134750100000022
Figure FDA0004134750100000023
wherein,,
Figure FDA0004134750100000024
for the instantaneous hydrogen consumption of the fuel cell system, +.>
Figure FDA0004134750100000025
For the heating value of hydrogen, < >>
Figure FDA0004134750100000026
The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
Figure FDA0004134750100000027
Figure FDA0004134750100000028
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
Figure FDA0004134750100000029
Figure FDA00041347501000000210
Figure FDA00041347501000000211
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is ideal gas constant, T is absolute temperature of power battery, A (c) is ampere-hour throughput of power battery, and N (c) is power batteryEquivalent charge and discharge quantity;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor
3. The method of claim 1, wherein: the DDPG algorithm specifically comprises a actor network mu, a criticism network Q and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
Figure FDA0004134750100000031
Figure FDA0004134750100000032
wherein J (θ) μ ) As an objective function, θ μ The method is characterized in that the method is an actor network parameter, wherein, the actor network parameter is V represents a gradient, E (·) is a mathematical expectation, and eta is a learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
Figure FDA0004134750100000033
Figure FDA0004134750100000034
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
CN202310271286.6A 2023-03-20 2023-03-20 Fuel cell automobile energy management method integrating information physical system Pending CN116278992A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310271286.6A CN116278992A (en) 2023-03-20 2023-03-20 Fuel cell automobile energy management method integrating information physical system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310271286.6A CN116278992A (en) 2023-03-20 2023-03-20 Fuel cell automobile energy management method integrating information physical system

Publications (1)

Publication Number Publication Date
CN116278992A true CN116278992A (en) 2023-06-23

Family

ID=86812859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310271286.6A Pending CN116278992A (en) 2023-03-20 2023-03-20 Fuel cell automobile energy management method integrating information physical system

Country Status (1)

Country Link
CN (1) CN116278992A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117104084A (en) * 2023-10-24 2023-11-24 新研氢能源科技有限公司 Management method and device for hydrogen fuel cell system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117104084A (en) * 2023-10-24 2023-11-24 新研氢能源科技有限公司 Management method and device for hydrogen fuel cell system
CN117104084B (en) * 2023-10-24 2024-01-09 新研氢能源科技有限公司 Management method and device for hydrogen fuel cell system

Similar Documents

Publication Publication Date Title
CN110341690B (en) PHEV energy management method based on deterministic strategy gradient learning
Wu et al. Continuous reinforcement learning of energy management with deep Q network for a power split hybrid electric bus
CN113085665B (en) Fuel cell automobile energy management method based on TD3 algorithm
CN111267827B (en) Energy management method and system for hybrid electric vehicle
CN112810504A (en) Fuel cell automobile energy management method based on nonlinear prediction model control
CN113554337B (en) Plug-in hybrid electric vehicle energy management strategy construction method integrating traffic information
CN112498334B (en) Robust energy management method and system for intelligent network-connected hybrid electric vehicle
CN116278992A (en) Fuel cell automobile energy management method integrating information physical system
CN115476841A (en) Plug-in hybrid electric vehicle energy management method based on improved multi-target DDPG
Vignesh et al. Intelligent energy management through neuro-fuzzy based adaptive ECMS approach for an optimal battery utilization in plugin parallel hybrid electric vehicle
CN116461391A (en) Energy management method for fuel cell hybrid electric vehicle
CN116494839A (en) Whale optimization algorithm-based fuel cell automobile energy management method
Jia et al. Health-conscious deep reinforcement learning energy management for fuel cell buses integrating environmental and look-ahead road information
CN114291067B (en) Hybrid electric vehicle convex optimization energy control method and system based on prediction
CN114969982A (en) Fuel cell automobile deep reinforcement learning energy management method based on strategy migration
Sun et al. System-level energy management optimization based on external information for power-split hybrid electric buses
Janulin et al. Energy minimization in city electric vehicle using optimized multi-speed transmission
CN114801897A (en) DDPG algorithm-based fuel cell hybrid power system energy management method
CN114954541B (en) Cooperative method and device for power distribution and tracking speed of vehicle
CN112434374A (en) Capacity matching method and system for composite power supply system of pure electric vehicle
Ma et al. The optimized real-time energy management strategy for fuel-cell hybrid trucks through dynamic programming
CN117184095B (en) Hybrid electric vehicle system control method based on deep reinforcement learning
Niu et al. Energy Management Strategy Based on an Improved TD3 Reinforcement Algorithm with Novel Experience Replay
Abd-Elhaleem et al. Improved Power Management Under Uncertain Driving Conditions for Plug-In Hybrid Electric Vehicles via Intelligent Controller
CN115782700B (en) Fuel cell hybrid electric vehicle energy management method considering battery thermal health constraint

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination