CN116278992A - Fuel cell automobile energy management method integrating information physical system - Google Patents
Fuel cell automobile energy management method integrating information physical system Download PDFInfo
- Publication number
- CN116278992A CN116278992A CN202310271286.6A CN202310271286A CN116278992A CN 116278992 A CN116278992 A CN 116278992A CN 202310271286 A CN202310271286 A CN 202310271286A CN 116278992 A CN116278992 A CN 116278992A
- Authority
- CN
- China
- Prior art keywords
- network
- power
- vehicle
- power battery
- fuel cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000000446 fuel Substances 0.000 title claims abstract description 57
- 238000007726 management method Methods 0.000 title claims abstract description 14
- 239000001257 hydrogen Substances 0.000 claims abstract description 28
- 229910052739 hydrogen Inorganic materials 0.000 claims abstract description 28
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 claims abstract description 24
- 230000000875 corresponding effect Effects 0.000 claims description 21
- 238000000034 method Methods 0.000 claims description 16
- 230000001133 acceleration Effects 0.000 claims description 12
- 230000009471 action Effects 0.000 claims description 12
- 230000006870 function Effects 0.000 claims description 12
- 238000012423 maintenance Methods 0.000 claims description 9
- 230000008859 change Effects 0.000 claims description 7
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 238000005457 optimization Methods 0.000 claims description 6
- 150000002431 hydrogen Chemical class 0.000 claims description 4
- 230000004913 activation Effects 0.000 claims description 3
- 239000007789 gas Substances 0.000 claims description 3
- 238000011478 gradient descent method Methods 0.000 claims description 3
- 238000010438 heat treatment Methods 0.000 claims description 3
- 238000005096 rolling process Methods 0.000 claims description 3
- 238000012876 topography Methods 0.000 abstract description 3
- 230000032683 aging Effects 0.000 abstract description 2
- 230000003993 interaction Effects 0.000 abstract description 2
- 230000008569 process Effects 0.000 description 5
- 230000004075 alteration Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000003749 cleanliness Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60L—PROPULSION OF ELECTRICALLY-PROPELLED VEHICLES; SUPPLYING ELECTRIC POWER FOR AUXILIARY EQUIPMENT OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRODYNAMIC BRAKE SYSTEMS FOR VEHICLES IN GENERAL; MAGNETIC SUSPENSION OR LEVITATION FOR VEHICLES; MONITORING OPERATING VARIABLES OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRIC SAFETY DEVICES FOR ELECTRICALLY-PROPELLED VEHICLES
- B60L58/00—Methods or circuit arrangements for monitoring or controlling batteries or fuel cells, specially adapted for electric vehicles
- B60L58/40—Methods or circuit arrangements for monitoring or controlling batteries or fuel cells, specially adapted for electric vehicles for controlling a combination of batteries and fuel cells
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60L—PROPULSION OF ELECTRICALLY-PROPELLED VEHICLES; SUPPLYING ELECTRIC POWER FOR AUXILIARY EQUIPMENT OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRODYNAMIC BRAKE SYSTEMS FOR VEHICLES IN GENERAL; MAGNETIC SUSPENSION OR LEVITATION FOR VEHICLES; MONITORING OPERATING VARIABLES OF ELECTRICALLY-PROPELLED VEHICLES; ELECTRIC SAFETY DEVICES FOR ELECTRICALLY-PROPELLED VEHICLES
- B60L50/00—Electric propulsion with power supplied within the vehicle
- B60L50/50—Electric propulsion with power supplied within the vehicle using propulsion power supplied by batteries or fuel cells
- B60L50/75—Electric propulsion with power supplied within the vehicle using propulsion power supplied by batteries or fuel cells using propulsion power supplied by both fuel cells and batteries
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/02—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
- B60W40/04—Traffic conditions
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/02—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
- B60W40/06—Road conditions
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/02—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
- B60W40/06—Road conditions
- B60W40/072—Curvature of the road
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W40/00—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
- B60W40/02—Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models related to ambient conditions
- B60W40/06—Road conditions
- B60W40/076—Slope angle of the road
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0019—Control system elements or transfer functions
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0019—Control system elements or transfer functions
- B60W2050/0028—Mathematical models, e.g. for simulation
- B60W2050/0031—Mathematical model of the vehicle
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W50/00—Details of control systems for road vehicle drive control not related to the control of a particular sub-unit, e.g. process diagnostic or vehicle driver interfaces
- B60W2050/0001—Details of the control system
- B60W2050/0019—Control system elements or transfer functions
- B60W2050/0028—Mathematical models, e.g. for simulation
- B60W2050/0037—Mathematical models of vehicle sub-units
- B60W2050/0039—Mathematical models of vehicle sub-units of the propulsion unit
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2510/00—Input parameters relating to a particular sub-units
- B60W2510/24—Energy storage means
- B60W2510/242—Energy storage means for electrical energy
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2510/00—Input parameters relating to a particular sub-units
- B60W2510/28—Fuel cells
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2710/00—Output or target parameters relating to a particular sub-units
- B60W2710/24—Energy storage means
- B60W2710/242—Energy storage means for electrical energy
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2710/00—Output or target parameters relating to a particular sub-units
- B60W2710/28—Fuel cells
Landscapes
- Engineering & Computer Science (AREA)
- Transportation (AREA)
- Mechanical Engineering (AREA)
- Automation & Control Theory (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Sustainable Development (AREA)
- Sustainable Energy (AREA)
- Power Engineering (AREA)
- Human Computer Interaction (AREA)
- Electric Propulsion And Braking For Vehicles (AREA)
Abstract
The invention provides a fuel cell automobile energy management method integrating an information physical system, which not only considers the energy flow and consumption in the automobile system, but also considers the influence of future road topography and traffic conditions on the automobile energy management, and explores the optimal control in a feasible domain by combining a depth deterministic strategy gradient algorithm, thereby effectively avoiding discrete errors and improving the reliability of strategies. The invention realizes the information interaction between the vehicle system and the network layer through the information physical system, and brings future topographic information, traffic information, battery aging, fuel cell durability constraint, hydrogen consumption and the like obtained through the information physical system into the control frame, thereby having important practical significance for achieving the optimal balance of the system durability and the hydrogen consumption of the real vehicle.
Description
Technical Field
The invention belongs to the technical field of energy management of fuel cell hybrid power systems, and particularly relates to an energy management method of a fuel cell automobile integrating an information physical system.
Background
At present, proton membrane exchange fuel cells are increasingly used by new energy automobiles, especially hybrid electric vehicles, due to the advantages of cleanliness, high energy efficiency and the like. However, the dynamic response of the fuel cell is slow, and the speed and power change during the running process of the vehicle are rapid and drastic, which makes the energy management of the fuel cell hybrid power system formed by matching with the power cell more difficult. In the energy management strategy for the fuel cell hybrid electric vehicle, the problems that the actual performance transition depends on the set conditions and the modeling process exist; although the power distribution of the vehicle power system itself or other high-power components on the vehicle has been comprehensively considered in many prior arts such as chinese patent publication No. CN113085665a, environmental factors outside the vehicle such as road topography, traffic conditions, etc. are not considered, and these factors tend to have a stronger influence on the energy consumption during the running of the vehicle according to practical experience, so that the existing energy management strategies for the fuel cell hybrid power system are not perfect, and still have a great room for improvement.
Disclosure of Invention
In view of the above, the present invention provides a fuel cell vehicle energy management method integrating information physical systems, which specifically includes the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Torque T of driving motor motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC;the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC ;
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a depth deterministic strategy gradient (Deep Deterministic Policy Gradient, DDPG) algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
wherein p is 1 For the unit price of hydrogen per kilogram,for hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
Further, the specific form of the automobile longitudinal dynamics model established in the second step is as follows:
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
wherein,,for the instantaneous hydrogen consumption of the fuel cell system, +.>For the heating value of hydrogen, < >>The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is an ideal gas constant, T is the absolute temperature of the power battery, A (c) is the ampere-hour throughput of the power battery, and N (c) is the equivalent charge and discharge quantity of the power battery;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor 。
Further, the DDPG algorithm specifically comprises a actor network mu, a commentator network Q and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
wherein J (θ) μ ) As an objective function, θ μ As a function of the network parameters of the actor,representing the gradient, E (·) is the mathematical expectation, η is the learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
According to the fuel cell automobile energy management method integrating the information physical system, provided by the invention, not only are the energy flow and consumption in the automobile system considered, but also the influence of future road topography and traffic conditions on the automobile energy management is considered, and the optimal control in the feasible domain is explored by combining the depth deterministic strategy gradient algorithm, so that the discrete error is effectively avoided, and the reliability of the strategy is improved. According to the invention, the information interaction between the vehicle system and the network layer is realized through the information physical system, and future topographic information obtained through the information physical system, battery aging, fuel cell durability constraint, hydrogen consumption and the like are brought into the control frame, so that the method has important practical significance for achieving the optimal balance of the system durability and the hydrogen consumption of the real vehicle.
Drawings
FIG. 1 is a flow chart of a method provided by the present invention;
FIG. 2 is an alternative topology of a fuel cell hybrid power system to which the present invention is applicable;
fig. 3 is a schematic block diagram of the DDPG algorithm.
Detailed Description
The following description of the embodiments of the present invention will be made apparent and fully in view of the accompanying drawings, in which some, but not all embodiments of the invention are shown. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The fuel cell automobile energy management method integrating the information physical system provided by the invention, as shown in figure 1, specifically comprises the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Driving motorTorque T motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC; the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC ;
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a depth deterministic strategy gradient (Deep Deterministic Policy Gradient, DDPG) algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
wherein p is 1 For the unit price of hydrogen per kilogram,for hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
Fig. 2 shows an alternative topology of a fuel cell hybrid system to which the method provided by the invention can be applied.
In a preferred embodiment of the present invention, the longitudinal dynamics model of the automobile established in the second step is in the specific form of:
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
wherein,,for the instantaneous hydrogen consumption of the fuel cell system, +.>For the heating value of hydrogen, < >>The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is an ideal gas constant, T is the absolute temperature of the power battery, A (c) is the ampere-hour throughput of the power battery, and N (c) is the equivalent charge and discharge quantity of the power battery;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor 。
In a preferred embodiment of the present invention, as shown in fig. 3, the DDPG algorithm specifically includes a actor network μ, a reviewer network Q, and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
wherein J (θ) μ ) As an objective function, θ μ As a function of the network parameters of the actor,representing the gradient, E (·) is the mathematical expectation, η is the learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
It should be understood that, the sequence number of each step in the embodiment of the present invention does not mean that the execution sequence of each process should be determined by the function and the internal logic of each process, and should not limit the implementation process of the embodiment of the present invention.
Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.
Claims (3)
1. A fuel cell automobile energy management method integrating an information physical system is characterized in that: the method specifically comprises the following steps:
step one, acquiring vehicle state information, power battery state information and fuel battery state information of a fuel battery hybrid electric vehicle; wherein the vehicle state information includes: vehicle speed v, acceleration acc, driving motor rotation speed omega motor Torque T of driving motor motor Efficiency eta of driving motor motor The method comprises the steps of carrying out a first treatment on the surface of the The power battery state information includes: power battery voltage and current, internal resistance and SOC; the fuel cell state information includes: fuel cell output power P FC Efficiency eta FC Rate of change of power Δp FC ;
Step two, establishing a longitudinal dynamics model of the automobile according to the dynamics of the automobile; sequentially establishing a fuel cell hydrogen consumption model, a power cell equivalent circuit model, a power cell life attenuation model and a driving motor model aiming at a topological structure of a fuel cell hybrid power system;
step three, acquiring real-time driving state information comprising the speed v and the acceleration acc from CAN signals of the vehicle, and acquiring geographic position data of the vehicle through a GPS module; uploading the driving state information and the geographic position data to a cloud server by utilizing a vehicle-mounted network, wherein the cloud server acquires the gradient, curvature and traffic information of a future road which the vehicle is about to pass through based on the information and feeds back the gradient, curvature and traffic information to the vehicle;
step four, selecting a vehicle speed v, an acceleration acc, a power battery SOC, a power battery SOH and a future road gradient i according to a DDPG algorithm f Future road curvature c f Future road traffic information t f As state variables, and constitute a state space S:
S=[v,acc,SOC,SOH,i f ,c f ,t f ]
selecting a fuel cell power change rate DeltaP FC As an action variable, and constitutes an action space a:
a=[ΔP FC |ΔP FC ∈[-3,+3]kW]
four optimization targets including overall vehicle hydrogen consumption, power battery life, power battery SOC maintenance and fuel battery power limit are set, and corresponding reward functions r are constructed:
r=p 1 ·[m H2 (t)]+p 2 ·Q bat ·ΔSOH+α·[SOC(t)-SOC tar ] 2 +β·|ΔP FC /ΔP FCmax |
wherein p is 1 For unit price per kilogram of hydrogen, m H2 For hydrogen mass, p 2 For power battery replacement price, α and β are weighting coefficients for power battery SOC maintenance and fuel battery power variation limit, respectively, SOC tar Δp, target value for battery SOC maintenance FCmax Maximum value for fuel cell power conversion limit;
initializing the DDPG algorithm, constructing a training set by utilizing historical data or vehicle state information corresponding to standard working conditions, and training the algorithm, so that the trained algorithm can obtain optimal action variables according to real-time state variables.
2. The method of claim 1, wherein: the specific form of the automobile longitudinal dynamics model established in the second step is as follows:
η t =η DC/AC ·η EM ·η tra
P tol =P FC ·η DC/DC +P bat
wherein P is tol For the total power required for the vehicle to travel, eta t Is the efficiency of the vehicle, m is the weight of the vehicle, g is the gravitational acceleration, f is the rolling resistance coefficient, α is the road gradient, A is the frontal area, C D Is the air resistance coefficient, v is the vehicle speed, delta is the conversion coefficient of the vehicle rotating mass, eta DC/AC 、η EM 、η tra 、η DC/DC Efficiency of DC/AC converter, drive motor, drive train and DC/DC converter, respectively, P FC 、P bat The output power of the fuel cell and the power cell respectively;
the specific form of the fuel cell hydrogen consumption model is as follows:
wherein,,for the instantaneous hydrogen consumption of the fuel cell system, +.>For the heating value of hydrogen, < >>The theoretical power generated for the consumed hydrogen, t is a time variable;
the power battery equivalent circuit model is specifically formed by:
wherein V is ocv For the open-circuit voltage of the power battery, I bat Is the current of the power battery, R 0 Is the internal resistance of the power battery, Q bat Is the power battery capacity;
the specific form of the power battery life attenuation model is as follows:
wherein Q is loss For power battery capacity loss, c is power battery discharge multiplying power, B (c) is compensation factor, E a (c) For activation energy, R is ideal gas constant, T is absolute temperature of power battery, A (c) is ampere-hour throughput of power battery, and N (c) is power batteryEquivalent charge and discharge quantity;
the driving motor model is specifically formed by:
η motor =f(ω motor ,T motor )
when the rotating speed omega of the motor motor And torque T motor After the determination, the efficiency eta of the driving motor can be obtained motor 。
3. The method of claim 1, wherein: the DDPG algorithm specifically comprises a actor network mu, a criticism network Q and an experience pool; the said interview network outputs the comprehensive score Q (s, a) for action-rewards based on state variable s and action variable a; the actor network can maximize Q (s, a) output by the critics network through training;
the experience pool is used for forming and storing a state variable s, a motion variable a, a rewarding value r and a next state variable s' corresponding to a certain state into experience samples, and when the number of the experience samples in the experience pool exceeds the storable number of the experience pool, old data can be covered; the algorithm training is specifically carried out by using small batches of samples randomly extracted from an experience pool;
the actor network updates by performing gradient descent of the form of the objective function corresponding to the optimization objective:
J(θ μ )=E[Q(s,μ(s))]
wherein J (θ) μ ) As an objective function, θ μ The method is characterized in that the method is an actor network parameter, wherein, the actor network parameter is V represents a gradient, E (·) is a mathematical expectation, and eta is a learning rate of the actor network; symbol ≡ represents the item to the left of which the item to the right is determined;
the actor network and the commentator network respectively have corresponding parameters theta μ′ Target actor network mu' of (1) and with parameter theta Q′ Target critics network Q'; the target actor network outputs a corresponding action variable a ' based on a next state variable s ', and s ' and a ' are input to the target critic network together to output Q ' (s ', a '); the evaluator network is used for minimizing the TD error between the current Q value and the time sequence differential target thereof, and the specific form is as follows:
y target (t)=r(s,a)+γQ′(s',a'|θ Q′ )
δ(t)=y target (t)-Q(s,a|θ Q )
wherein y is target (t) is a time sequence differential target, and delta (t) is a TD error;
the update of the commentator network is likewise effected using the gradient descent method of the following form:
wherein, beta is the learning rate of the criticizer network;
the target actor network and the target criticizer network adopt a delay updating mode, and only after the actor network and the criticizer network are updated for a preset number of times, the target actor network and the target criticizer network are updated, and corresponding network parameters are updated in a soft mode by the following steps:
θ μ′ ←τθ μ +(1-τ)θ μ′
θ Q′ ←τθ Q +(1-τ)θ Q′
where τ is the soft update factor.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310271286.6A CN116278992A (en) | 2023-03-20 | 2023-03-20 | Fuel cell automobile energy management method integrating information physical system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310271286.6A CN116278992A (en) | 2023-03-20 | 2023-03-20 | Fuel cell automobile energy management method integrating information physical system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116278992A true CN116278992A (en) | 2023-06-23 |
Family
ID=86812859
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310271286.6A Pending CN116278992A (en) | 2023-03-20 | 2023-03-20 | Fuel cell automobile energy management method integrating information physical system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116278992A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117104084A (en) * | 2023-10-24 | 2023-11-24 | 新研氢能源科技有限公司 | Management method and device for hydrogen fuel cell system |
-
2023
- 2023-03-20 CN CN202310271286.6A patent/CN116278992A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117104084A (en) * | 2023-10-24 | 2023-11-24 | 新研氢能源科技有限公司 | Management method and device for hydrogen fuel cell system |
CN117104084B (en) * | 2023-10-24 | 2024-01-09 | 新研氢能源科技有限公司 | Management method and device for hydrogen fuel cell system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110341690B (en) | PHEV energy management method based on deterministic strategy gradient learning | |
Wu et al. | Continuous reinforcement learning of energy management with deep Q network for a power split hybrid electric bus | |
CN113085665B (en) | Fuel cell automobile energy management method based on TD3 algorithm | |
CN111267827B (en) | Energy management method and system for hybrid electric vehicle | |
CN112810504A (en) | Fuel cell automobile energy management method based on nonlinear prediction model control | |
CN113554337B (en) | Plug-in hybrid electric vehicle energy management strategy construction method integrating traffic information | |
CN112498334B (en) | Robust energy management method and system for intelligent network-connected hybrid electric vehicle | |
CN116278992A (en) | Fuel cell automobile energy management method integrating information physical system | |
CN115476841A (en) | Plug-in hybrid electric vehicle energy management method based on improved multi-target DDPG | |
Vignesh et al. | Intelligent energy management through neuro-fuzzy based adaptive ECMS approach for an optimal battery utilization in plugin parallel hybrid electric vehicle | |
CN116461391A (en) | Energy management method for fuel cell hybrid electric vehicle | |
CN116494839A (en) | Whale optimization algorithm-based fuel cell automobile energy management method | |
Jia et al. | Health-conscious deep reinforcement learning energy management for fuel cell buses integrating environmental and look-ahead road information | |
CN114291067B (en) | Hybrid electric vehicle convex optimization energy control method and system based on prediction | |
CN114969982A (en) | Fuel cell automobile deep reinforcement learning energy management method based on strategy migration | |
Sun et al. | System-level energy management optimization based on external information for power-split hybrid electric buses | |
Janulin et al. | Energy minimization in city electric vehicle using optimized multi-speed transmission | |
CN114801897A (en) | DDPG algorithm-based fuel cell hybrid power system energy management method | |
CN114954541B (en) | Cooperative method and device for power distribution and tracking speed of vehicle | |
CN112434374A (en) | Capacity matching method and system for composite power supply system of pure electric vehicle | |
Ma et al. | The optimized real-time energy management strategy for fuel-cell hybrid trucks through dynamic programming | |
CN117184095B (en) | Hybrid electric vehicle system control method based on deep reinforcement learning | |
Niu et al. | Energy Management Strategy Based on an Improved TD3 Reinforcement Algorithm with Novel Experience Replay | |
Abd-Elhaleem et al. | Improved Power Management Under Uncertain Driving Conditions for Plug-In Hybrid Electric Vehicles via Intelligent Controller | |
CN115782700B (en) | Fuel cell hybrid electric vehicle energy management method considering battery thermal health constraint |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |