WO2022111232A1 - Method for optimizing control model of water cooling system, electronic device, and storage medium - Google Patents

Method for optimizing control model of water cooling system, electronic device, and storage medium Download PDF

Info

Publication number
WO2022111232A1
WO2022111232A1 PCT/CN2021/127980 CN2021127980W WO2022111232A1 WO 2022111232 A1 WO2022111232 A1 WO 2022111232A1 CN 2021127980 W CN2021127980 W CN 2021127980W WO 2022111232 A1 WO2022111232 A1 WO 2022111232A1
Authority
WO
WIPO (PCT)
Prior art keywords
control
parameters
parameter
model
evaluation value
Prior art date
Application number
PCT/CN2021/127980
Other languages
French (fr)
Chinese (zh)
Inventor
弄庆鹏
周祥生
屠要峰
李忠良
王壮
高洪
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2022111232A1 publication Critical patent/WO2022111232A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/30Control or safety arrangements for purposes related to the operation of the system, e.g. for safety or monitoring
    • F24F11/46Improving electric energy efficiency or saving
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the embodiments of the present application relate to the technical field of data processing, and in particular, to a method for optimizing a control model of a water cooling system, an electronic device, and a storage medium.
  • the power consumption of data centers continues to increase with the expansion of the scale.
  • the water cooling system accounts for about half of the non-Internet technology (IT) energy consumption of data centers. Therefore, effectively reducing the energy consumption of the water cooling system is one of the keys to reducing non-IT energy consumption in data centers.
  • the data center will optimize the control strategy model of the water cooling system by directly interacting with the physical environment of the water cooling system or by interacting with the simulated environment to obtain the control strategy and its control effect data, so as to send control commands to the water cooling system that can reduce energy consumption.
  • the model when the model is used to directly interact with the physical environment of the water cooling system, the model is generally not optimized, and the control strategy generated by an unoptimized model is likely to affect the normal operation of the data center; on the other hand, the simulation environment is used to interact
  • the simulation environment When the control strategy and its control effect data are obtained to optimize the control strategy model of the water cooling system, the simulation environment usually deviates from the real data center environment and the real environment of the water cooling system, and the real environment parameters cannot be fitted.
  • An embodiment of the present application provides a method for optimizing a control model of a water cooling system, and the method includes the following steps: acquiring parameters of a data center; creating a state transition model and a control model; optimizing the state transition model according to the parameters of the data center; The optimized state transition model and the parameters of the data center are optimized for the control model, and the optimized control model is obtained.
  • An embodiment of the present application further provides an electronic device, the device includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores data that can be processed by the at least one processor.
  • the instructions are executed by the processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned control model optimization method for a water cooling system.
  • An embodiment of the present application further provides a computer-readable storage medium storing a computer program, and when the computer program is executed by a processor, the above-mentioned control model optimization method for a water cooling system is implemented.
  • FIG. 1 is a flowchart of a control model optimization method for a water cooling system provided by a first embodiment of the present application
  • FIG. 2 is a flowchart of step 103 in the control model optimization method of the water cooling system provided by the first embodiment of the present application shown in FIG. 1;
  • FIG. 3 is a flowchart of a control model optimization method for a water cooling system provided by a second embodiment of the present application.
  • FIG. 4 is a flowchart of step 305 in the control model optimization method of the water cooling system provided by the second embodiment of the present application shown in FIG. 3;
  • FIG. 5 is a flowchart of a control model optimization method for a water cooling system provided by a third embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of an electronic device provided by a fourth embodiment of the present application.
  • the main purpose of the embodiments of the present application is to propose a control model optimization method, electronic device and storage medium for a water cooling system, aiming to provide an optimized control model for the water cooling system of a data center through a real interactive environment interaction process, so that It can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
  • the first embodiment of the present application relates to a control model optimization method of a water cooling system, as shown in FIG. 1 , which specifically includes:
  • step 101 parameters of the data center are acquired.
  • the data center includes the data center water cooling system, the data center computer room and the data center environment.
  • the data center water cooling system can provide control parameters and output parameters in the data
  • the data center computer room can provide the data center operating parameters
  • the data center environment can provide the data center environment parameters.
  • control parameters also include the operating number of cooling towers, the operating number of cooling pumps, the operating number of refrigeration pumps, the operating number of plate replacements, the temperature of plate replacement water, the water temperature of cooling towers, the operating frequency of cooling tower fans, the operating frequency of cooling pumps, Parameters such as the operating frequency of the refrigeration pump, the setting value of the pressure difference of the refrigeration main pipe;
  • the output parameters also include parameters such as the energy consumption of the water cooling system, the air temperature at the end of the water cooling system, the return air temperature at the end of the water cooling system, and the cooling capacity of the water cooling system;
  • the operating parameters also include the IT energy parameters such as power consumption parameters and data center temperature settings; environmental parameters also include parameters such as outdoor dry bulb temperature, outdoor wet bulb temperature, and outdoor humidity.
  • parameters of the data center may also include other parameters in the actual use process, which will not be repeated here.
  • Step 102 creating a state transition model and a control model.
  • the state transition model is a model describing the state change relationship of the data center, which can obtain the current output parameters of the data center according to the input environmental parameters of the data center, the operating parameters of the data center and the control parameters of the data center, and predict The environmental parameters of the data center and the operation parameters of the data center in the next state after the output parameter is acted on are obtained.
  • the control model is the control model of the water cooling system. Inputting the environmental parameters of the data center and the operating parameters of the data center can obtain the control strategy of the data center, wherein the control strategy is expressed in the form of control parameters.
  • Step 103 optimize the state transition model according to the parameters of the data center.
  • the environmental parameters, operating parameters, control parameters, etc. of the parameters that can have an effect on the control effect in the data center are used as the input data of the model, and the output parameters that can reflect the control effect are used as the output of the model corresponding to the input data. data, and train the model to achieve model optimization.
  • step 103 includes:
  • Step 201 Input the environmental parameters, operating parameters and control parameters into the state transition model to obtain the predicted output parameters.
  • Step 202 Calculate the state transition loss according to the predicted output parameter and the output parameter.
  • the loss function may be any loss function that can reflect the deviation between the predicted output parameter and the actual output parameter.
  • Step 203 optimize the state transition model according to the state transition loss.
  • the state transition loss obtained in step 202 is used as training data to train the state transition model, so as to realize the optimization of the model.
  • Step 104 optimize the control model according to the optimized state transition model and parameters of the data center, and obtain the optimized control model.
  • the training data is directly or indirectly obtained from the parameters of the state transition model, the control model and the data center, and then the control model is trained according to the obtained training data to realize the optimization of the control model.
  • the control model optimization method for a water cooling system provides training data for the created state transition model and control model by acquiring the parameters of the data center, and then trains and optimizes the state transition model according to the parameters of the data center to provide the optimal control model.
  • the real, offline environment is used for interactive training, and then the control model is optimized according to the optimized state transition model and the parameters of the data center to realize the optimization of the control model of the water cooling system.
  • the present application provides a control model for the water cooling system of the data center, which can effectively reduce the energy consumption of the water cooling system.
  • the present application can also optimize the control model instead of using the initial control model, which is safer and more reliable. Provide an optimized control model for the water cooling system of the data center through the interaction process of the real interactive environment, so that it can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
  • the second embodiment of the present application relates to a control model optimization method for a water cooling system.
  • This embodiment is roughly the same as the first embodiment, except that step 104 uses reinforcement learning to optimize the control model.
  • the specific process is shown in Figure 3 shown:
  • step 301 parameters of the data center are acquired.
  • step 301 in this embodiment is substantially the same as step 101 in the first embodiment, and details are not repeated here.
  • Step 302 creating a state transition model and a control model.
  • step 302 in this embodiment is substantially the same as step 102 in the first embodiment, and details are not repeated here.
  • Step 303 Optimize the state transition model according to the parameters of the data center.
  • step 303 in this embodiment is substantially the same as step 103 in the first embodiment, and details are not repeated here.
  • Step 304 randomly select a set of sample data from the parameters of the data center.
  • various types of parameters in a certain state of the data center are randomly selected, and only data in the same state can constitute a set of sample data.
  • Step 305 using the control model and the state transition model to process the sample data to obtain training samples.
  • step 305 includes:
  • Step 401 generating state representation data according to the environmental parameters and the operating parameters.
  • the data is operated in the form of a matrix, which can effectively improve the running rate and the execution efficiency of the steps. Therefore, the data can be vectorized to further obtain the state representation vector of the data center.
  • step 402 the state representation data is input into the control model to obtain predictive control parameters.
  • the state representation vector is input into the control model, and the control model outputs a vector of control parameters predicted by the model.
  • Step 403 Input the predicted control parameters, the environmental parameters and the operating parameters into the state transition model, obtain the predicted output parameters, and update the environmental parameters and the operating parameters according to the predicted output parameters.
  • the predicted output parameters output by the model can be obtained, and the environmental parameters and operating parameters in the next state can also be obtained.
  • the environment parameters and running parameters are updated to the set environment parameters and running parameters for the next iteration cycle.
  • Step 404 Update the state representation data according to the updated environment parameters and operating parameters.
  • Step 405 Evaluate the predictive control parameters according to the predictive control parameters, the predictive output parameters and the control parameters, and obtain a control reward.
  • control action evaluation value of the control strategy first obtain the control action evaluation value of the control strategy according to the predicted control parameters and the control parameters, and then obtain the control effect evaluation value according to the predicted output parameter, wherein the control effect evaluation value includes the energy consumption evaluation value and the cooling capacity evaluation value, Finally, comprehensively analyze the control action evaluation value, energy consumption evaluation value and cooling capacity evaluation value to obtain control rewards.
  • the current predicted control parameters and the control parameters of the data center it is substituted into the preset function to obtain the control action evaluation value; according to the first predicted output parameters, the initial energy consumption parameter and the initial cooling capacity parameter are obtained, Obtain the current predicted energy consumption parameters and the current predicted cooling capacity parameters according to the current predicted output parameters, obtain the historical predicted energy consumption parameters according to the last predicted output parameters, and obtain the energy consumption according to the current predicted energy consumption parameters and the historical predicted energy consumption parameters Obtain the deviation degree of energy consumption according to the current predicted energy consumption parameters and initial energy consumption parameters, average the growth rate and deviation to obtain the energy consumption evaluation value, and analyze according to the initial cooling capacity parameters and the current predicted cooling capacity parameters, Obtain the cooling capacity evaluation value; perform the weighted average of the obtained control action evaluation value and the cooling capacity evaluation value to obtain the constraint evaluation value, and perform the weighted average of the energy consumption evaluation value and the constraint evaluation value to obtain the control reward.
  • the function used in this embodiment is not limited, and may be any function that can obtain an intuitive and accurate evaluation result according to the above data.
  • Step 406 Generate a training sample according to the state representation data before the update, the predicted control parameters, the control reward and the updated state representation data.
  • a training sample is a quadruple, and this quadruple consists of the state representation data before the update obtained in step 401 , the predicted control parameters obtained in step 402 , the control reward obtained in step 405 , and the update obtained in step 44
  • the post state characterizes the data composition.
  • Step 407 detecting whether the number of training samples reaches a second threshold.
  • step 408 if yes, go to step 408, if not, go to step 402.
  • Step 408 taking all the training samples as a group of training samples.
  • Step 306 optimize the control model according to the training samples.
  • a set of training samples obtained in step 408 is used as a training set to train the control model to complete an optimization.
  • Step 307 Detect whether the optimization times of the control model reach a first threshold.
  • step 308 if yes, go to step 308, if not, go to step 304.
  • first threshold and the second threshold are only to distinguish between the threshold of the number of optimizations and the threshold of the number of training samples. There is no connection between the first threshold and the second threshold, and they are two values set according to requirements.
  • Step 308 Obtain and save the optimized control model.
  • this embodiment can add the evaluation of cooling capacity, so as to ensure that the temperature control of the water cooling system meets the temperature control requirements of the data center, so as to ensure the normal operation of the water cooling system It can meet the temperature control requirements of the data center and reduce the energy consumption of the water cooling system.
  • the third embodiment of the present application applies the control model optimization method for the water cooling system Take the water cooling system of the data center described in the table below as an example.
  • control model optimization method of the water cooling system includes:
  • Step 501 Obtain and analyze the offline data packet of data center historical collection uploaded by the data center to obtain a data sample.
  • each data sample includes the output state parameters of the water cooling system, the environmental state parameters of the data center, the control parameters of the water cooling system, and the operating state parameters of the data center.
  • a data sample includes: the output state parameters of the water cooling system include the energy consumption parameter of the water cooling system (CoolingEnergy), the current cooling capacity parameter of the water cooling system (Cooling_cal), the air temperature parameter at the end of the water cooling system (AirOutAvgTemp), the end return of the water cooling system. Air temperature parameter (AirInAvgTemp);
  • Data center environmental status parameters include outdoor dry bulb temperature (OutsideDBTemp1), outdoor relative humidity (OutsideRHumidity1), and outdoor wet bulb temperature (OutsideWetTemp1);
  • Data center operating status parameters include IT energy consumption (ITEnergy) and data center temperature setting (DCRoomTempSet);
  • the control parameters of the water cooling system include the cooling tower running number (CTNum), the freezing pump running number (CHWPNum), the cooling pump running number (CWPNum), the plate replacement running number (HENum), the plate replacement water temperature setting value (HECHWSTempSet), the cooling tower water supply Temperature setting value (CTWSTempSet), refrigeration header 1 differential pressure setting value (CH1_CWSPress), refrigeration header 2 differential pressure setting value (CH2_CWSPress), cooling water pump 1 operating frequency setting value (CWP1_Frequency), cooling water pump 2 operating frequency setting value (CWP2_Frequency) , cooling tower 1 fan 1 operating frequency setting value (CT1_FAN1_Frequency), cooling tower 1 fan 2 operating frequency setting value (CT1_FAN2_Frequency), cooling tower 2 fan 1 operating frequency setting value (CT2_FAN1_Frequency), cooling tower 2 fan 2 operating frequency setting value ( CT2_FAN2_Frequency), chilled water pump 1 operating frequency setting value (CHWP1_Frequency), chilled
  • the model before executing the next step, it is also necessary to create a data center water cooling system output state transition model M1 and a water cooling system control parameter exploration optimization model M2, and initialize the model M1 and the model M2.
  • the model After the model is created, it also includes setting the maximum training times of the model M2 and the maximum exploration times of the water-cooling system control parameters of the randomly sampled samples.
  • the maximum training times of the model M2 is equivalent to the first threshold in the second embodiment.
  • the maximum number of searches for the water-cooling system control parameters of the sampled samples corresponds to the second threshold in the second embodiment.
  • step 502 the output state transition model M1 is trained by using the parameters of the data center, and the output state transition model M3 of the water cooling system of the data center after the training is optimized is obtained.
  • the data center environmental state parameters, data center operating state parameters, and water cooling system control parameters are vectorized to generate a data center hybrid state representation through fusion operations, and the data center hybrid state representation is used as the input feature of the model M1, while the water cooling
  • the system output state parameters are vectorized as the output features of the model M1, and the model M1 is trained. After the model optimization meets the setting requirements, the training is stopped, and the model M3 is obtained and saved.
  • Step 503 Randomly sample the parsed data samples to obtain a data sample.
  • Step 504 obtain the data center state representation vector S and the water cooling system reference control parameter vector A_base according to the sample data.
  • Step 505 Input the vector S and the vector A_base into the model M3, and obtain the corresponding reference parameter param_base of the output state of the water cooling system.
  • the vector S and the vector A_base are fused and calculated to generate a data center mixed state representation vector, and then the data center mixed state representation vector is input into the model M3 to obtain the parameter param_base output by the model.
  • the parameter param_base actually outputs the state parameter param_hist of the first and previous water cooling system of each sample. Specifically, the value of the parameter param_base is assigned to the parameter param_hist.
  • Step 506 Input the vector S into the model M2 to obtain the water-cooling system control exploratory parameter vector A_explore.
  • the vector A_explore is the control parameter corresponding to the next state obtained after one exploration, including at least the number of cooling tower operations (CTNum), the number of refrigeration pump operations (CHWPNum), the number of cooling pump operations (CWPNum), and the number of plate change operations.
  • HENum plate replacement water temperature setting value (HECHWSTempSet), cooling tower water supply temperature setting value (CTWSTempSet), refrigeration header 1 differential pressure setting value (CH1_CWSPress), refrigeration header 2 differential pressure setting value (CH2_CWSPress), cooling water pump 1 running Frequency setting value (CWP1_Frequency), cooling water pump 2 operating frequency setting value (CWP2_Frequency), cooling tower 1 fan 1 operating frequency setting value (CT1_FAN1_Frequency), cooling tower 1 fan 2 operating frequency setting value (CT1_FAN2_Frequency), cooling tower 2 fan 1 operating Frequency setting value (CT2_FAN1_Frequency), cooling tower 2 fan 2 operating frequency setting value (CT2_FAN2_Frequency), chilled water pump 1 operating frequency setting value (CHWP1_Frequency), chilled water pump 2 operating frequency setting value (CHWP2_Frequency).
  • Step 507 Obtain the current water cooling system output state parameter param_explore according to the vector S and the vector A_explore, further obtain the energy consumption evaluation value and the cooling capacity evaluation value, and obtain the control action evaluation value according to the parameter A_base and the parameter A_explore.
  • the vector S and vector A_explore are input into the water cooling system output state transition model M3 to obtain the current water cooling system output state parameter param_explore; according to the parameter param_base, parameter param_explore and parameter param_hist, respectively obtain the energy consumption evaluation value and cooling capacity evaluation value.
  • the vector S and A_explore are combined and calculated to generate the data center mixed state representation vector S_mix, and then the vector S_mix is input into the model M3 to obtain the parameter param_explore output by the model, and then obtained according to the parameter param_base, parameter param_explore and parameter param_hist
  • the energy consumption evaluation method is as follows:
  • reward_ce is the energy consumption evaluation value
  • the two 0.5 are the weights of the deviation degree of energy consumption and the growth rate of energy consumption.
  • the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
  • the cooling capacity evaluation value is obtained as follows:
  • the cooling capacity parameter Cooling_cal is obtained from the water cooling system output state reference parameter param_base as the reference cooling capacity cool_cal_ ⁇
  • the cooling capacity parameter Cooling_cal is obtained from the current water cooling system output state parameter param_explore as the cooling capacity parameter corresponding to the current water cooling system control exploratory parameter cool_cal_t, and calculate the cooling capacity constraint evaluation value g(cool_cal_t, cool_cal_ ⁇ ) through the water cooling system control parameter cooling capacity constraint evaluation function g(x).
  • the constraint evaluation function g(x) can be designed according to business requirements, and the specific form of the constraint evaluation function g(x) is not limited.
  • the current water-cooling system control exploratory parameter constraint evaluation value f(A_base, A_explore) is calculated by the water-cooling system control parameter constraint evaluation function f(x).
  • the water cooling system control parameter constraint evaluation function f(x) can be designed according to business requirements, and the specific form of the water cooling system control parameter constraint evaluation function f(x) is not limited.
  • Step 508 Obtain a control reward according to the action evaluation value, the cooling capacity evaluation value and the energy consumption evaluation value.
  • the action constraint evaluation value is first obtained according to the action evaluation value and the cooling capacity evaluation value.
  • safe_eval_action 0.5 ⁇ f(A_base,A_explore)+0.5 ⁇ g(cool_cal_t,cool_cal_ ⁇ )
  • safe_eval_action is the action constraint evaluation value
  • f(A_base, A_explore) and g(cool_cal_t, cool_cal_ ⁇ ) are the action evaluation value and the coolness evaluation value, respectively
  • the two 0.5 are the weights of the action evaluation value and the coolness evaluation value respectively.
  • the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
  • control reward is calculated through the energy consumption evaluation value and the water cooling system control parameter action constraint evaluation value.
  • R is the control reward
  • reward_ce is the energy consumption evaluation value
  • safe_eval_action is the action constraint evaluation value
  • the two 0.5 are the weights of the energy consumption evaluation value and the action constraint evaluation value respectively.
  • the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
  • R 0.5 ⁇ reward_ce+0.25 ⁇ f(A_base,A_explore)+0.25 ⁇ g(cool_cal_t,cool_cal_ ⁇ )
  • R is the control reward
  • reward_ce is the energy consumption evaluation value
  • f(A_base, A_explore) is the action evaluation value
  • g(cool_cal_t, cool_cal_ ⁇ ) is the cooling capacity evaluation value
  • 0.5, 0.25, and 0.25 are the weights.
  • Step 509 Store the tuple (S, A_explore, R, S) in the experience sample pool and update the parameter param_hist to the parameter param_explore.
  • Step 510 detecting whether the current number of explorations reaches the maximum number.
  • step 506 if yes, go to step 506, if not, go to step 511.
  • Step 511 using the data in the experience sample pool to optimize the model M2.
  • one sample can obtain the data required for one optimization of the model M2, and the optimization times are incremented by 1 after one time.
  • Step 512 detecting whether the current optimization times reaches the maximum value.
  • step 505 if yes, go to step 505, if not, go to step 513.
  • step 513 the model M2 is saved, and an on-line deployment field test is performed on the model M2.
  • the control model optimization method for a water cooling system provides training data for the created state transition model and control model by acquiring the parameters of the data center, and then trains and optimizes the state transition model according to the parameters of the data center to provide the optimal control model.
  • the real, offline environment is used for interactive training, and then the control model is optimized according to the optimized state transition model and the parameters of the data center to realize the optimization of the control model of the water cooling system.
  • the present application provides a control model for the water cooling system of the data center, which can effectively reduce the energy consumption of the water cooling system.
  • the present application can also optimize the control model instead of using the initial control model, which is safer and more reliable. Provide an optimized control model for the water cooling system of the data center through the interaction process of the real interactive environment, so that it can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
  • the fourth embodiment of the present application relates to an electronic device, as shown in FIG. 6 , comprising: at least one processor 601 ; and a memory 602 communicatively connected to the at least one processor 601 ; wherein the memory 602 stores data that can be accessed by at least one processor 601 .
  • Instructions executed by one processor 601, the instructions are executed by at least one processor 601, so that at least one processor 601 can execute the control model optimization method for a water cooling system described in any of the above method embodiments.
  • the memory 602 and the processor 601 are connected by a bus, and the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors 601 and various circuits of the memory 602 together.
  • the bus may also connect together various other circuits, such as peripherals, voltage regulators, and power management circuits, which are well known in the art and therefore will not be described further herein.
  • the bus interface provides the interface between the bus and the transceiver.
  • a transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing a means for communicating with various other devices over a transmission medium.
  • the data processed by the processor 601 is transmitted on the wireless medium through the antenna, and further, the antenna also receives the data and transmits the data to the processor 601 .
  • Processor 601 is responsible for managing the bus and general processing, and may also provide various functions including timing, peripheral interface, voltage regulation, power management, and other control functions.
  • the memory 602 may be used to store data used by the processor 601 when performing operations.
  • the fifth embodiment of the present application relates to a computer-readable storage medium storing a computer program.
  • the above method embodiments are implemented when the computer program is executed by the processor.
  • the aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Development Economics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Chemical & Material Sciences (AREA)
  • Combustion & Propulsion (AREA)
  • Mechanical Engineering (AREA)
  • Feedback Control In General (AREA)

Abstract

Embodiments of the present application relate to the technical field of data processing, and provide a method for optimizing a control model of a water cooling system, an electronic device, and a storage medium. The method for optimizing a control model of a water cooling system comprises: obtaining parameters of a data center; creating a state transition model and a control model; optimizing the state transition model according to the parameters of the data center; and optimizing the control model according to the optimized state transition model and the parameters of the data center, and obtaining the optimized control model.

Description

水冷系统的控制模型优化方法、电子设备和存储介质Control model optimization method, electronic device and storage medium for water cooling system
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请基于申请号为“202011377439.8”、申请日为2020年11月30日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此以引入方式并入本申请。This application is based on the Chinese patent application with the application number "202011377439.8" and the filing date is November 30, 2020, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby incorporated by reference Application.
技术领域technical field
本申请的实施例涉及数据处理技术领域,特别涉及一种水冷系统的控制模型优化方法、电子设备和存储介质。The embodiments of the present application relate to the technical field of data processing, and in particular, to a method for optimizing a control model of a water cooling system, an electronic device, and a storage medium.
背景技术Background technique
数据中心的耗电量随着规模的扩大不断增加,其中,水冷系统在数据中心的非互联网技术(Internet Technology,IT)能耗中大约占一半。因此,有效降低水冷系统的能耗是数据中心降低非IT能耗的关键之一。通常数据中心会通过模型与水冷系统的物理环境直接交互或者仿真环境交互获取控制策略及其控制效果数据来对水冷系统控制策略模型进行优化,从而对水冷系统发送能够降低能耗的控制命令。The power consumption of data centers continues to increase with the expansion of the scale. Among them, the water cooling system accounts for about half of the non-Internet technology (IT) energy consumption of data centers. Therefore, effectively reducing the energy consumption of the water cooling system is one of the keys to reducing non-IT energy consumption in data centers. Usually, the data center will optimize the control strategy model of the water cooling system by directly interacting with the physical environment of the water cooling system or by interacting with the simulated environment to obtain the control strategy and its control effect data, so as to send control commands to the water cooling system that can reduce energy consumption.
然而,一方面采用模型与水冷系统的物理环境直接交互时,模型一般未经优化,一个未经优化的模型所生成的控制策略很可能会影响数据中心的正常运行;另一方面采用仿真环境交互获取控制策略及其控制效果数据来对水冷系统控制策略模型进行优化时,仿真环境通常与真实的数据中心环境及水冷系统真实环境存在偏差,无法对真实环境参数进行拟合。However, on the one hand, when the model is used to directly interact with the physical environment of the water cooling system, the model is generally not optimized, and the control strategy generated by an unoptimized model is likely to affect the normal operation of the data center; on the other hand, the simulation environment is used to interact When the control strategy and its control effect data are obtained to optimize the control strategy model of the water cooling system, the simulation environment usually deviates from the real data center environment and the real environment of the water cooling system, and the real environment parameters cannot be fitted.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了水冷系统的控制模型优化方法,所述方法包括以下步骤:获取数据中心的参数;创建状态转移模型和控制模型;根据所述数据中心的参数优化所述状态转移模型;根据优化后的所述状态转移模型和所述数据中心的参数对所述控制模型进行优化,获取优化后的所述控制模型。An embodiment of the present application provides a method for optimizing a control model of a water cooling system, and the method includes the following steps: acquiring parameters of a data center; creating a state transition model and a control model; optimizing the state transition model according to the parameters of the data center; The optimized state transition model and the parameters of the data center are optimized for the control model, and the optimized control model is obtained.
本申请实施例还提出了一种电子设备,所述设备包括:至少一个处理器;以及,与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行上述的水冷系统的控制模型优化方法。An embodiment of the present application further provides an electronic device, the device includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores data that can be processed by the at least one processor. The instructions are executed by the processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned control model optimization method for a water cooling system.
本申请实施例还提出了一种计算机可读存储介质,存储有计算机程序,所述计算机程序被处理器执行时实现上述的水冷系统的控制模型优化方法。An embodiment of the present application further provides a computer-readable storage medium storing a computer program, and when the computer program is executed by a processor, the above-mentioned control model optimization method for a water cooling system is implemented.
附图说明Description of drawings
一个或多个实施例通过与之对应的附图中的图片进行示例性说明,这些示例性说明并不构成对实施例的限定。One or more embodiments are exemplified by the pictures in the corresponding drawings, and these exemplified descriptions do not constitute limitations on the embodiments.
图1是本申请的第一实施例提供的水冷系统的控制模型优化方法的流程图;1 is a flowchart of a control model optimization method for a water cooling system provided by a first embodiment of the present application;
图2是图1所示的本申请的第一实施例提供的水冷系统的控制模型优化方法中步骤103的流程图;FIG. 2 is a flowchart of step 103 in the control model optimization method of the water cooling system provided by the first embodiment of the present application shown in FIG. 1;
图3是本申请的第二实施例提供的水冷系统的控制模型优化方法的流程图;3 is a flowchart of a control model optimization method for a water cooling system provided by a second embodiment of the present application;
图4是图3所示的本申请的第二实施例提供的水冷系统的控制模型优化方法中步骤305的流程图;FIG. 4 is a flowchart of step 305 in the control model optimization method of the water cooling system provided by the second embodiment of the present application shown in FIG. 3;
图5是本申请的第三实施例提供的水冷系统的控制模型优化方法的流程图;5 is a flowchart of a control model optimization method for a water cooling system provided by a third embodiment of the present application;
图6是本申请的第四实施例提供的电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device provided by a fourth embodiment of the present application.
具体实施方式Detailed ways
本申请实施例的主要目的在于提出一种水冷系统的控制模型优化方法、电子设备和存储介质,旨在为数据中心的水冷系统提供一种经过真实交互环境交互过程的优化后的控制模型,使得能够拟合真实环境,有效降低水冷系统的能耗的同时不会影响数据中心的正常运行。The main purpose of the embodiments of the present application is to propose a control model optimization method, electronic device and storage medium for a water cooling system, aiming to provide an optimized control model for the water cooling system of a data center through a real interactive environment interaction process, so that It can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合附图对本申请的各实施例进行详细的阐述。然而,本领域的普通技术人员可以理解,在本申请各实施例中,为了使读者更好地理解本申请而提出了许多技术细节。但是,即使没有这些技术细节和基于以下各实施例的种种变化和修改,也可以实现本申请所要求保护的技术方案。以下各个实施例的划分是为了描述方便,不应对本申请的具体实现方式构成任何限定,各个实施例在不矛盾的前提下可以相互结合相互引用。In order to make the objectives, technical solutions and advantages of the embodiments of the present application more clear, each embodiment of the present application will be described in detail below with reference to the accompanying drawings. However, those of ordinary skill in the art can understand that, in each embodiment of the present application, many technical details are provided for the reader to better understand the present application. However, even without these technical details and various changes and modifications based on the following embodiments, the technical solutions claimed in the present application can be realized. The following divisions of the various embodiments are for the convenience of description, and should not constitute any limitation on the specific implementation of the present application, and the various embodiments may be combined with each other and referred to each other on the premise of not contradicting each other.
本申请的第一实施例涉及一种水冷系统的控制模型优化方法,如图1所示,具体包括:The first embodiment of the present application relates to a control model optimization method of a water cooling system, as shown in FIG. 1 , which specifically includes:
步骤101,获取数据中心的参数。In step 101, parameters of the data center are acquired.
具体地说,接收数据中心发送的数据包,解析数据包,获取数据中心的参数集合。数据中心包括数据中心水冷系统、数据中心机房和数据中心环境等,数据中心水冷系统能够提供数据中的控制参数和输出参数,数据中心机房能够提供数据中心的运行参数,数据中心环境能够提供数据中的环境参数。Specifically, the data packet sent by the data center is received, the data packet is parsed, and the parameter set of the data center is obtained. The data center includes the data center water cooling system, the data center computer room and the data center environment. The data center water cooling system can provide control parameters and output parameters in the data, the data center computer room can provide the data center operating parameters, and the data center environment can provide the data center environment parameters.
更具体地说,控制参数还包括冷却塔运行数量、冷却泵运行数量、冷冻泵运行数量、板换运行数量、板换出水温度、冷却塔供水温度、冷却塔风机运行频率、冷却泵运行频率、冷冻泵运行频率、冷冻总管压差设置值等参数;输出参数还包括水冷系统能耗、水冷系统末端出风温度、水冷系统末端回风温度、水冷系统冷量等参数;运行参数还包括IT能耗参数、数据中心设置温度等参数;环境参数还包括室外干球温度、室外湿球温度、室外湿度等参数。More specifically, the control parameters also include the operating number of cooling towers, the operating number of cooling pumps, the operating number of refrigeration pumps, the operating number of plate replacements, the temperature of plate replacement water, the water temperature of cooling towers, the operating frequency of cooling tower fans, the operating frequency of cooling pumps, Parameters such as the operating frequency of the refrigeration pump, the setting value of the pressure difference of the refrigeration main pipe; the output parameters also include parameters such as the energy consumption of the water cooling system, the air temperature at the end of the water cooling system, the return air temperature at the end of the water cooling system, and the cooling capacity of the water cooling system; the operating parameters also include the IT energy parameters such as power consumption parameters and data center temperature settings; environmental parameters also include parameters such as outdoor dry bulb temperature, outdoor wet bulb temperature, and outdoor humidity.
当然,以上仅为具体的举例说明,在实际的使用过程中数据中心的参数还可以包括其他参数,此处不做一一赘述。Of course, the above is only a specific example, and the parameters of the data center may also include other parameters in the actual use process, which will not be repeated here.
步骤102,创建状态转移模型和控制模型。 Step 102, creating a state transition model and a control model.
具体地说,状态转移模型是描述数据中心的状态变化关系的模型,能够将根据输入的数据中心的环境参数、数据中心的运行参数和数据中心的控制参数得到数据中心的当前输出参数,以及预测出该输出参数作用后的下一状态的数据中心的环境参数、数据中心的运行参数。控制模型是水冷系统的控制模型,输入数据中心的环境参数和数据中心的运行参数能够得到数据中心的控制策略,其中,控制策略以控制参数的形式表现。Specifically, the state transition model is a model describing the state change relationship of the data center, which can obtain the current output parameters of the data center according to the input environmental parameters of the data center, the operating parameters of the data center and the control parameters of the data center, and predict The environmental parameters of the data center and the operation parameters of the data center in the next state after the output parameter is acted on are obtained. The control model is the control model of the water cooling system. Inputting the environmental parameters of the data center and the operating parameters of the data center can obtain the control strategy of the data center, wherein the control strategy is expressed in the form of control parameters.
步骤103,根据数据中心的参数优化状态转移模型。 Step 103 , optimize the state transition model according to the parameters of the data center.
具体地说,将数据中心中能够对控制效果产生作用的参数的环境参数、运行参数、控制参数等作为模型的输入数据,将能够体现控制效果的输出参数等作为模型与输入数据相对应的输出数据,对模型进行训练以实现模型优化。Specifically, the environmental parameters, operating parameters, control parameters, etc. of the parameters that can have an effect on the control effect in the data center are used as the input data of the model, and the output parameters that can reflect the control effect are used as the output of the model corresponding to the input data. data, and train the model to achieve model optimization.
更具体地说,如图2,步骤103包括:More specifically, as shown in Figure 2, step 103 includes:
步骤201,将环境参数、运行参数和控制参数输入状态转移模型,获取预测输出参数。Step 201: Input the environmental parameters, operating parameters and control parameters into the state transition model to obtain the predicted output parameters.
步骤202,根据预测输出参数和输出参数计算状态转移损失。Step 202: Calculate the state transition loss according to the predicted output parameter and the output parameter.
本实施例不对用于计算状态转移的损失函数进行限定,在实际的使用过程中,损失函数可以为任意一种能够反映预测输出参数和实际输出参数之间的偏差的损失函数。This embodiment does not limit the loss function used for calculating the state transition. In actual use, the loss function may be any loss function that can reflect the deviation between the predicted output parameter and the actual output parameter.
步骤203,根据状态转移损失优化状态转移模型。 Step 203, optimize the state transition model according to the state transition loss.
具体地说,利用步骤202得到的状态转移损失作为训练数据来训练状态转移模型,从而实现模型的优化。Specifically, the state transition loss obtained in step 202 is used as training data to train the state transition model, so as to realize the optimization of the model.
步骤104,根据优化后的状态转移模型和数据中心的参数对控制模型进行优化,获取优化后的控制模型。 Step 104 , optimize the control model according to the optimized state transition model and parameters of the data center, and obtain the optimized control model.
具体地说,本实施例通过对状态转移模型、控制模型和数据中心的参数直接或者间接地获取训练数据,然后根据得到的训练数据对控制模型进行训练以实现控制模型的优化。Specifically, in this embodiment, the training data is directly or indirectly obtained from the parameters of the state transition model, the control model and the data center, and then the control model is trained according to the obtained training data to realize the optimization of the control model.
本实施例提出的水冷系统的控制模型优化方法,通过获取数据中心的参数,为创建的状态转移模型和控制模型提供训练数据,然后根据数据中心的参数训练优化状态转移模型,为控制模型优化提供真实、离线的环境用以交互训练,接着根据优化后的状态转移模型和数据中心的参数对控制模型进行优化,实现对水冷系统的控制模型的优化。首先本申请为数据中心的水冷系统提供一种了控制模型,因而能够有效降低水冷系统的能耗,其次,本申请还能够对控制模型进行优化,而不是使用初始的控制模型,更加安全可靠,为数据中心的水冷系统提供一种经过真实交互环境交互过程的优化后的控制模型,使得能够拟合真实环境,有效降低水冷系统的能耗的同时不会影响数据中心的正常运行。The control model optimization method for a water cooling system proposed in this embodiment provides training data for the created state transition model and control model by acquiring the parameters of the data center, and then trains and optimizes the state transition model according to the parameters of the data center to provide the optimal control model. The real, offline environment is used for interactive training, and then the control model is optimized according to the optimized state transition model and the parameters of the data center to realize the optimization of the control model of the water cooling system. First, the present application provides a control model for the water cooling system of the data center, which can effectively reduce the energy consumption of the water cooling system. Second, the present application can also optimize the control model instead of using the initial control model, which is safer and more reliable. Provide an optimized control model for the water cooling system of the data center through the interaction process of the real interactive environment, so that it can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
本申请的第二实施例涉及一种水冷系统的控制模型优化方法,本实施例与第一实施例大致相同,区别在于,步骤104采用强化学习的方式对控制模型进行优化,具体流程如图3所示:The second embodiment of the present application relates to a control model optimization method for a water cooling system. This embodiment is roughly the same as the first embodiment, except that step 104 uses reinforcement learning to optimize the control model. The specific process is shown in Figure 3 shown:
步骤301,获取数据中心的参数。In step 301, parameters of the data center are acquired.
具体地说,本实施例中的步骤301与第一实施例中的步骤101大致相同,此处不一一赘述。Specifically, step 301 in this embodiment is substantially the same as step 101 in the first embodiment, and details are not repeated here.
步骤302,创建状态转移模型和控制模型。 Step 302, creating a state transition model and a control model.
具体地说,本实施例中的步骤302与第一实施例中的步骤102大致相同,此处不一一赘述。Specifically, step 302 in this embodiment is substantially the same as step 102 in the first embodiment, and details are not repeated here.
步骤303,根据数据中心的参数优化状态转移模型。Step 303: Optimize the state transition model according to the parameters of the data center.
具体地说,本实施例中的步骤303与第一实施例中的步骤103大致相同,此处不一一赘述。Specifically, step 303 in this embodiment is substantially the same as step 103 in the first embodiment, and details are not repeated here.
步骤304,从数据中心的参数中随机选取一组样本数据。 Step 304, randomly select a set of sample data from the parameters of the data center.
具体地说,随机选取数据中心某一状态下的各类型参数,只有同一状态下的数据才能构成一组样本数据。Specifically, various types of parameters in a certain state of the data center are randomly selected, and only data in the same state can constitute a set of sample data.
步骤305,利用控制模型和状态转移模型处理样本数据,获取训练样本。 Step 305, using the control model and the state transition model to process the sample data to obtain training samples.
具体地说,如图4所示,步骤305包括:Specifically, as shown in Figure 4, step 305 includes:
步骤401,根据环境参数和运行参数生成状态表征数据。 Step 401, generating state representation data according to the environmental parameters and the operating parameters.
具体地说,数据以矩阵的形式进行运算,能够有效提高运行速率,提高步骤的执行效率,因此可以将数据进行向量化,进一步得到数据中心的状态表征向量。Specifically, the data is operated in the form of a matrix, which can effectively improve the running rate and the execution efficiency of the steps. Therefore, the data can be vectorized to further obtain the state representation vector of the data center.
步骤402,将状态表征数据输入控制模型,获取预测控制参数。In step 402, the state representation data is input into the control model to obtain predictive control parameters.
具体地说,将状态表征向量输入控制模型,控制模型输出一个模型预测的控制参数向量。Specifically, the state representation vector is input into the control model, and the control model outputs a vector of control parameters predicted by the model.
步骤403,将预测控制参数、环境参数和运行参数输入状态转移模型,获取预测输出参数并根据预测输出参数更新环境参数和运行参数。Step 403: Input the predicted control parameters, the environmental parameters and the operating parameters into the state transition model, obtain the predicted output parameters, and update the environmental parameters and the operating parameters according to the predicted output parameters.
具体地说,将预测控制参数、环境参数和运行参数输入状态转移模型后,能够得到模型输出的预测输出参数,同时还能够得到下一状态下的环境参数和运行参数,将下一状态下的环境参数和运行参数更新到设置的环境参数和运行参数,以进行下一轮迭代循环。Specifically, after the predictive control parameters, environmental parameters and operating parameters are input into the state transition model, the predicted output parameters output by the model can be obtained, and the environmental parameters and operating parameters in the next state can also be obtained. The environment parameters and running parameters are updated to the set environment parameters and running parameters for the next iteration cycle.
步骤404,根据更新后的环境参数和运行参数更新状态表征数据。Step 404: Update the state representation data according to the updated environment parameters and operating parameters.
步骤405,根据预测控制参数、预测输出参数和控制参数对预测控制参数进行评估,获取控制奖励。Step 405: Evaluate the predictive control parameters according to the predictive control parameters, the predictive output parameters and the control parameters, and obtain a control reward.
具体地说,首先根据预测控制参数和控制参数获取对控制策略的控制动作评估值,然后根据预测输出参数获取控制效果评估值,其中,控制效果评估值包括能耗评估值和冷量评估值,最后对控制动作评估值、能耗评估值和冷量评估值进行综合分析,获取控制奖励。Specifically, first obtain the control action evaluation value of the control strategy according to the predicted control parameters and the control parameters, and then obtain the control effect evaluation value according to the predicted output parameter, wherein the control effect evaluation value includes the energy consumption evaluation value and the cooling capacity evaluation value, Finally, comprehensively analyze the control action evaluation value, energy consumption evaluation value and cooling capacity evaluation value to obtain control rewards.
更具体地说,根据当前的预测控制参数和数据中心的控制参数代入预先设置的函数中,获取控制动作评估值;根据第一次的预测输出参数中获取初始能耗参数和初始冷量参数,根据当前的预测输出参数中获取当前预测能耗参数和当前预测冷量参数,根据上一次的预测输出参数中获取历史预测能耗参数,根据当前预测能耗参数和历史预测能耗参数获取能耗的增长率,根据当前预测能耗参数和初始能耗参数获取能耗的偏离度,对增长率和偏离度进行平均获取能耗评估值,根据初始冷量参数和当前预测冷量参数进行分析,获取冷量评估值;对得到的控制动作评估值和冷量评估值进行加权平均,获取约束评估值,并对能耗评估值和约束评估值进行加权平均,得到控制奖励。More specifically, according to the current predicted control parameters and the control parameters of the data center, it is substituted into the preset function to obtain the control action evaluation value; according to the first predicted output parameters, the initial energy consumption parameter and the initial cooling capacity parameter are obtained, Obtain the current predicted energy consumption parameters and the current predicted cooling capacity parameters according to the current predicted output parameters, obtain the historical predicted energy consumption parameters according to the last predicted output parameters, and obtain the energy consumption according to the current predicted energy consumption parameters and the historical predicted energy consumption parameters Obtain the deviation degree of energy consumption according to the current predicted energy consumption parameters and initial energy consumption parameters, average the growth rate and deviation to obtain the energy consumption evaluation value, and analyze according to the initial cooling capacity parameters and the current predicted cooling capacity parameters, Obtain the cooling capacity evaluation value; perform the weighted average of the obtained control action evaluation value and the cooling capacity evaluation value to obtain the constraint evaluation value, and perform the weighted average of the energy consumption evaluation value and the constraint evaluation value to obtain the control reward.
需要说明的是,不对本实施例中使用的函数进行限定,可以是任意一种能够根据上述数据得到直观、准确的评价结果的函数。It should be noted that the function used in this embodiment is not limited, and may be any function that can obtain an intuitive and accurate evaluation result according to the above data.
步骤406,根据更新前的状态表征数据、预测控制参数、控制奖励和更新后的状态表征数据生成一个训练样本。Step 406: Generate a training sample according to the state representation data before the update, the predicted control parameters, the control reward and the updated state representation data.
具体地说,一个训练样本是一个四元组,且这个四元组由步骤401获取的更新前的状态表征数据、步骤402获取的预测控制参数、步骤405获取的控制奖励和步骤44获取的更新后的状态表征数据组成。Specifically, a training sample is a quadruple, and this quadruple consists of the state representation data before the update obtained in step 401 , the predicted control parameters obtained in step 402 , the control reward obtained in step 405 , and the update obtained in step 44 The post state characterizes the data composition.
步骤407,检测训练样本的数量是否达到第二阈值。 Step 407, detecting whether the number of training samples reaches a second threshold.
具体地说,若是,执行步骤408,若否,执行步骤402。Specifically, if yes, go to step 408, if not, go to step 402.
步骤408,将所有训练样本作为一组训练样本。 Step 408, taking all the training samples as a group of training samples.
步骤306,根据训练样本优化控制模型。 Step 306, optimize the control model according to the training samples.
具体地说,将步骤408得到的一组训练样本作为一个训练集来训练控制模型,完成一次优化。Specifically, a set of training samples obtained in step 408 is used as a training set to train the control model to complete an optimization.
步骤307,检测控制模型的优化次数是否达到第一阈值。Step 307: Detect whether the optimization times of the control model reach a first threshold.
具体地说,若是,执行步骤308,若否,执行步骤304。Specifically, if yes, go to step 308, if not, go to step 304.
需要说明的是第一阈值和第二阈值只是为了区分是优化次数的阈值还是训练样本数量的阈值,第一阈值和第二阈值之间不存在联系,是根据需求设置的两个数值。It should be noted that the first threshold and the second threshold are only to distinguish between the threshold of the number of optimizations and the threshold of the number of training samples. There is no connection between the first threshold and the second threshold, and they are two values set according to requirements.
步骤308,获取并保存优化后的控制模型。Step 308: Obtain and save the optimized control model.
本实施例相对于现有技术而言,在第一实施例的基础上,能够加入冷量的评价,从而保证水冷系统的温控满足数据中心的温控要求,使得既能保证水冷系统正常运行满足数据中心温控要求,又能够降低水冷系统的能耗。Compared with the prior art, on the basis of the first embodiment, this embodiment can add the evaluation of cooling capacity, so as to ensure that the temperature control of the water cooling system meets the temperature control requirements of the data center, so as to ensure the normal operation of the water cooling system It can meet the temperature control requirements of the data center and reduce the energy consumption of the water cooling system.
为了使本领域技术人员能够更清楚地理解以上本申请第一实施例和第二实施例公开的水冷系统的控制模型优化方法整体流程,本申请第三实施例以水冷系统的控制模型优化方法应用在下表描述的数据中心的水冷系统上为例进行说明。In order to enable those skilled in the art to more clearly understand the overall flow of the control model optimization method for the water cooling system disclosed in the first and second embodiments of the present application, the third embodiment of the present application applies the control model optimization method for the water cooling system Take the water cooling system of the data center described in the table below as an example.
Figure PCTCN2021127980-appb-000001
Figure PCTCN2021127980-appb-000001
如图5所示,本申请的第三实施例提供的水冷系统的控制模型优化方法,包括:As shown in FIG. 5 , the control model optimization method of the water cooling system provided by the third embodiment of the present application includes:
步骤501,获取数据中心上传的数据中心历史采集离线数据包并解析,获取数据样本。Step 501: Obtain and analyze the offline data packet of data center historical collection uploaded by the data center to obtain a data sample.
具体地说,解析得到多个数据样本,每个数据样本包括水冷系统输出状态参数、数据中心环境状态参数、水冷系统控制参数和数据中心运行状态参数。Specifically, a plurality of data samples are obtained through analysis, and each data sample includes the output state parameters of the water cooling system, the environmental state parameters of the data center, the control parameters of the water cooling system, and the operating state parameters of the data center.
更具体地说,一个数据样本包括:水冷系统输出状态参数包括水冷系统能耗参数(CoolingEnergy)、水冷系统当前冷量参数(Cooling_cal)、水冷系统末端出风温度参数(AirOutAvgTemp)、水冷系统末端回风温度参数(AirInAvgTemp);More specifically, a data sample includes: the output state parameters of the water cooling system include the energy consumption parameter of the water cooling system (CoolingEnergy), the current cooling capacity parameter of the water cooling system (Cooling_cal), the air temperature parameter at the end of the water cooling system (AirOutAvgTemp), the end return of the water cooling system. Air temperature parameter (AirInAvgTemp);
数据中心环境状态参数包括室外干球温度(OutsideDBTemp1)、室外相对湿度(OutsideRHumidity1)、室外湿球温度(OutsideWetTemp1);Data center environmental status parameters include outdoor dry bulb temperature (OutsideDBTemp1), outdoor relative humidity (OutsideRHumidity1), and outdoor wet bulb temperature (OutsideWetTemp1);
数据中心运行状态参数包括IT能耗(ITEnergy)、数据中心设置温度(DCRoomTempSet);Data center operating status parameters include IT energy consumption (ITEnergy) and data center temperature setting (DCRoomTempSet);
水冷系统控制参数包括冷却塔运行数量(CTNum)、冷冻泵运行数量(CHWPNum)、冷却泵运行数量(CWPNum)、板换运行数量(HENum)、板换出水温度设置值(HECHWSTempSet)、冷却塔供水温度设置值(CTWSTempSet)、冷冻总管1压差设置值(CH1_CWSPress)、冷冻总管2压差设置值(CH2_CWSPress)、冷却水泵1运行频率设置值(CWP1_Frequency)、冷却水泵2运行频率设置值(CWP2_Frequency)、冷却塔1风机1运行频率设置值(CT1_FAN1_Frequency)、冷却塔1风机2运行频率设置值(CT1_FAN2_Frequency)、冷却塔2风机1运行频率设置值(CT2_FAN1_Frequency)、冷却塔2风机2运行频率设置值(CT2_FAN2_Frequency)、冷冻水泵1运行频率设置值(CHWP1_Frequency)、冷冻水泵2运行频率设置值(CHWP2_Frequency)。The control parameters of the water cooling system include the cooling tower running number (CTNum), the freezing pump running number (CHWPNum), the cooling pump running number (CWPNum), the plate replacement running number (HENum), the plate replacement water temperature setting value (HECHWSTempSet), the cooling tower water supply Temperature setting value (CTWSTempSet), refrigeration header 1 differential pressure setting value (CH1_CWSPress), refrigeration header 2 differential pressure setting value (CH2_CWSPress), cooling water pump 1 operating frequency setting value (CWP1_Frequency), cooling water pump 2 operating frequency setting value (CWP2_Frequency) , cooling tower 1 fan 1 operating frequency setting value (CT1_FAN1_Frequency), cooling tower 1 fan 2 operating frequency setting value (CT1_FAN2_Frequency), cooling tower 2 fan 1 operating frequency setting value (CT2_FAN1_Frequency), cooling tower 2 fan 2 operating frequency setting value ( CT2_FAN2_Frequency), chilled water pump 1 operating frequency setting value (CHWP1_Frequency), chilled water pump 2 operating frequency setting value (CHWP2_Frequency).
需要说明的是,在执行下一步骤之前还需要创建数据中心水冷系统输出状态转移模型M1和水冷系统控制参数探索优化模型M2,并初始化模型M1和模型M2。在创建模型之后还包括设置模型M2最大的训练次数和随机采样样本的水冷系统控制参数最大的探索次数,具体地说,模型M2最大的训练次数相当于第二实施例中的第一阈值,随机采样样本的水冷系统控制参数最大的探索次数相当于第二实施例中的第二阈值。It should be noted that, before executing the next step, it is also necessary to create a data center water cooling system output state transition model M1 and a water cooling system control parameter exploration optimization model M2, and initialize the model M1 and the model M2. After the model is created, it also includes setting the maximum training times of the model M2 and the maximum exploration times of the water-cooling system control parameters of the randomly sampled samples. Specifically, the maximum training times of the model M2 is equivalent to the first threshold in the second embodiment. The maximum number of searches for the water-cooling system control parameters of the sampled samples corresponds to the second threshold in the second embodiment.
步骤502,利用数据中心的参数对输出状态转移模型M1进行训练,获取训练优化后的数据中心水冷系统输出状态转移模型M3。In step 502, the output state transition model M1 is trained by using the parameters of the data center, and the output state transition model M3 of the water cooling system of the data center after the training is optimized is obtained.
具体地说,将数据中心环境状态参数、数据中心运行状态参数、水冷系统控制参数向量化后经融合运算生成数据中心混合状态表征,并将数据中心混合状态表征作为模型M1的输入特征,而水冷系统输出状态参数向量化后作为模型M1的输出特征,对模型M1进行训练, 模型优化达到设置要求后停止训练,获取模型M3并保存模型M3。Specifically, the data center environmental state parameters, data center operating state parameters, and water cooling system control parameters are vectorized to generate a data center hybrid state representation through fusion operations, and the data center hybrid state representation is used as the input feature of the model M1, while the water cooling The system output state parameters are vectorized as the output features of the model M1, and the model M1 is trained. After the model optimization meets the setting requirements, the training is stopped, and the model M3 is obtained and saved.
步骤503,对解析出来的数据样本进行随机采样,获取一个数据样本。Step 503: Randomly sample the parsed data samples to obtain a data sample.
步骤504,根据样本数据获取数据中心状态表征向量S和水冷系统基准控制参数向量A_base。 Step 504 , obtain the data center state representation vector S and the water cooling system reference control parameter vector A_base according to the sample data.
具体地说,对数据样本中的数据中心环境参数和数据中心运行状态参数进行向量化,获取数据中心状态表征向量S,对数据样本中的水冷系统控制参数进行向量化,得到数据样本的水冷系统基准控制参数向量A_base。Specifically, vectorize the data center environmental parameters and data center operating state parameters in the data sample, obtain the data center state representation vector S, vectorize the water cooling system control parameters in the data sample, and obtain the water cooling system of the data sample. Baseline control parameter vector A_base.
步骤505,将向量S和向量A_base输入到模型M3中,获取相应的水冷系统输出状态基准参数param_base。Step 505: Input the vector S and the vector A_base into the model M3, and obtain the corresponding reference parameter param_base of the output state of the water cooling system.
具体的说,将向量S和向量A_base进行融合计算,生成数据中心混合状态表征向量,然后将数据中心混合状态表征向量输入模型M3中,得到模型输出的参数param_base。Specifically, the vector S and the vector A_base are fused and calculated to generate a data center mixed state representation vector, and then the data center mixed state representation vector is input into the model M3 to obtain the parameter param_base output by the model.
需要说明的是,参数param_base除了作为一个参考值进行使用,实际上还在每个样本的第一个前一次水冷系统输出状态参数param_hist,具体地说,将参数param_base的值赋给参数param_hist。It should be noted that, in addition to being used as a reference value, the parameter param_base actually outputs the state parameter param_hist of the first and previous water cooling system of each sample. Specifically, the value of the parameter param_base is assigned to the parameter param_hist.
需要说明的是,此时还需要对数据样本的水冷系统控制参数当前的探索次数清零explore_count=0。It should be noted that, at this time, it is also necessary to clear explore_count=0 for the current exploration times of the water cooling system control parameters of the data sample.
步骤506,将向量S输入到模型M2中,获取水冷系统控制探索性参数向量A_explore。Step 506: Input the vector S into the model M2 to obtain the water-cooling system control exploratory parameter vector A_explore.
具体地说,向量A_explore是一次探索后得到的下一个状态对应的控制参数,至少包括冷却塔运行数量(CTNum)、冷冻泵运行数量(CHWPNum)、冷却泵运行数量(CWPNum)、板换运行数量(HENum)、板换出水温度设置值(HECHWSTempSet)、冷却塔供水温度设置值(CTWSTempSet)、冷冻总管1压差设置值(CH1_CWSPress)、冷冻总管2压差设置值(CH2_CWSPress)、冷却水泵1运行频率设置值(CWP1_Frequency)、冷却水泵2运行频率设置值(CWP2_Frequency)、冷却塔1风机1运行频率设置值(CT1_FAN1_Frequency)、冷却塔1风机2运行频率设置值(CT1_FAN2_Frequency)、冷却塔2风机1运行频率设置值(CT2_FAN1_Frequency)、冷却塔2风机2运行频率设置值(CT2_FAN2_Frequency)、冷冻水泵1运行频率设置值(CHWP1_Frequency)、冷冻水泵2运行频率设置值(CHWP2_Frequency)。Specifically, the vector A_explore is the control parameter corresponding to the next state obtained after one exploration, including at least the number of cooling tower operations (CTNum), the number of refrigeration pump operations (CHWPNum), the number of cooling pump operations (CWPNum), and the number of plate change operations. (HENum), plate replacement water temperature setting value (HECHWSTempSet), cooling tower water supply temperature setting value (CTWSTempSet), refrigeration header 1 differential pressure setting value (CH1_CWSPress), refrigeration header 2 differential pressure setting value (CH2_CWSPress), cooling water pump 1 running Frequency setting value (CWP1_Frequency), cooling water pump 2 operating frequency setting value (CWP2_Frequency), cooling tower 1 fan 1 operating frequency setting value (CT1_FAN1_Frequency), cooling tower 1 fan 2 operating frequency setting value (CT1_FAN2_Frequency), cooling tower 2 fan 1 operating Frequency setting value (CT2_FAN1_Frequency), cooling tower 2 fan 2 operating frequency setting value (CT2_FAN2_Frequency), chilled water pump 1 operating frequency setting value (CHWP1_Frequency), chilled water pump 2 operating frequency setting value (CHWP2_Frequency).
步骤507,根据向量S、向量A_explore获取当前水冷系统输出状态参数param_explore并进一步获取能耗评估值和冷量评估值,并根据参数A_base和参数A_explore获取控制动作评估值。Step 507: Obtain the current water cooling system output state parameter param_explore according to the vector S and the vector A_explore, further obtain the energy consumption evaluation value and the cooling capacity evaluation value, and obtain the control action evaluation value according to the parameter A_base and the parameter A_explore.
具体地说,对向量S和向量A_explore输入到水冷系统输出状态转移模型M3中,获取当前水冷系统输出状态参数param_explore;根据参数param_base、参数param_explore和参数param_hist,分别获取能耗评估值和冷量评估值。Specifically, the vector S and vector A_explore are input into the water cooling system output state transition model M3 to obtain the current water cooling system output state parameter param_explore; according to the parameter param_base, parameter param_explore and parameter param_hist, respectively obtain the energy consumption evaluation value and cooling capacity evaluation value.
更具体地说,将向量S和A_explore进行融合计算,生成数据中心混合状态表征向量S_mix,然后将向量S_mix输入模型M3中,得到模型输出的参数param_explore,然后根据参数param_base、参数param_explore和参数param_hist获取能耗评估值方法如下:More specifically, the vector S and A_explore are combined and calculated to generate the data center mixed state representation vector S_mix, and then the vector S_mix is input into the model M3 to obtain the parameter param_explore output by the model, and then obtained according to the parameter param_base, parameter param_explore and parameter param_hist The energy consumption evaluation method is as follows:
从水冷系统输出状态基准参数param_base中获取水冷系统能耗参数CE Φ,从当前水冷系统输出状态参数param_explore获取水冷系统能耗参数CE t,从前一次水冷系统输出状态参数param_hist获取水冷系统能耗参数CE t-1,并按照下面公式计算当前水冷系统控制探索性 参数A_explore的能耗奖励值: Obtain the water cooling system energy consumption parameter CE Φ from the water cooling system output state reference parameter param_base, obtain the water cooling system energy consumption parameter CE t from the current water cooling system output state parameter param_explore, and obtain the water cooling system energy consumption parameter CE from the previous water cooling system output state parameter param_hist t-1 , and calculate the energy consumption reward value of the current water cooling system control exploratory parameter A_explore according to the following formula:
Figure PCTCN2021127980-appb-000002
Figure PCTCN2021127980-appb-000002
其中,reward_ce是能耗评估值,
Figure PCTCN2021127980-appb-000003
是能耗的偏离度,
Figure PCTCN2021127980-appb-000004
是能耗的增长率,两个0.5分别是能耗的偏离度和能耗的增长率的权值。
Among them, reward_ce is the energy consumption evaluation value,
Figure PCTCN2021127980-appb-000003
is the deviation of energy consumption,
Figure PCTCN2021127980-appb-000004
is the growth rate of energy consumption, and the two 0.5 are the weights of the deviation degree of energy consumption and the growth rate of energy consumption.
需要说明的是,此处采用算数平均的方法综合计算,将权值设为(0.5,0.5),但是还可以根据实际需求将权值设定为其他值。在其他例子中,还可以采用平均之外的其他统计分析方法计算,此处不一一赘述。It should be noted that the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
接着根据参数param_base和参数param_explore获取冷量评估值如下:Then, according to the parameter param_base and parameter param_explore, the cooling capacity evaluation value is obtained as follows:
具体地说,从水冷系统输出状态基准参数param_base中获取冷量参数Cooling_cal作为基准冷量cool_cal_Φ,从当前水冷系统输出状态参数param_explore获取冷量参数Cooling_cal作为当前水冷系统控制探索性参数所对应冷量参数cool_cal_t,并通过水冷系统控制参数冷量约束评估函数g(x)计算冷量约束评估值g(cool_cal_t,cool_cal_Φ)。需要说明的是,约束评估函数g(x)可以根据业务需求进行设计,不对约束评估函数g(x)的具体形式进行限定。Specifically, the cooling capacity parameter Cooling_cal is obtained from the water cooling system output state reference parameter param_base as the reference cooling capacity cool_cal_Φ, and the cooling capacity parameter Cooling_cal is obtained from the current water cooling system output state parameter param_explore as the cooling capacity parameter corresponding to the current water cooling system control exploratory parameter cool_cal_t, and calculate the cooling capacity constraint evaluation value g(cool_cal_t, cool_cal_Φ) through the water cooling system control parameter cooling capacity constraint evaluation function g(x). It should be noted that the constraint evaluation function g(x) can be designed according to business requirements, and the specific form of the constraint evaluation function g(x) is not limited.
最后,根据参数A_base和参数A_explore获取控制动作评估值。Finally, obtain the evaluation value of the control action according to the parameter A_base and the parameter A_explore.
具体地说,通过水冷系统控制参数约束评估函数f(x)计算当前水冷系统控制探索性参数约束评估值f(A_base,A_explore)。需要说明的是,水冷系统控制参数约束评估函数f(x)可以根据业务需求进行设计,不对水冷系统控制参数约束评估函数f(x)的具体形式进行限定。Specifically, the current water-cooling system control exploratory parameter constraint evaluation value f(A_base, A_explore) is calculated by the water-cooling system control parameter constraint evaluation function f(x). It should be noted that the water cooling system control parameter constraint evaluation function f(x) can be designed according to business requirements, and the specific form of the water cooling system control parameter constraint evaluation function f(x) is not limited.
步骤508,根据动作评估值、冷量评估值和能耗评估值获取控制奖励。Step 508: Obtain a control reward according to the action evaluation value, the cooling capacity evaluation value and the energy consumption evaluation value.
具体地说,首先根据动作评估值和冷量评估值获取动作约束评估值。Specifically, the action constraint evaluation value is first obtained according to the action evaluation value and the cooling capacity evaluation value.
按照下面公式计算当前水冷系统控制探索性参数A_explore的动作约束评估值:Calculate the action constraint evaluation value of the current water cooling system control exploratory parameter A_explore according to the following formula:
safe_eval_action=0.5·f(A_base,A_explore)+0.5·g(cool_cal_t,cool_cal_Φ)safe_eval_action=0.5·f(A_base,A_explore)+0.5·g(cool_cal_t,cool_cal_Φ)
其中,safe_eval_action是动作约束评估值,f(A_base,A_explore)和g(cool_cal_t,cool_cal_Φ)分别是动作评估值和冷量评估值,两个0.5分别是动作评估值和冷量评估值的权值。Among them, safe_eval_action is the action constraint evaluation value, f(A_base, A_explore) and g(cool_cal_t, cool_cal_Φ) are the action evaluation value and the coolness evaluation value, respectively, and the two 0.5 are the weights of the action evaluation value and the coolness evaluation value respectively.
需要说明的是,此处采用算数平均的方法综合计算,将权值设为(0.5,0.5),但是还可以根据实际需求将权值设定为其他值。在其他例子中,还可以采用平均之外的其他统计分析方法计算,此处不一一赘述。It should be noted that the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
然后通过能耗评估值和水冷系统控制参数动作约束评估值计算控制奖励。Then, the control reward is calculated through the energy consumption evaluation value and the water cooling system control parameter action constraint evaluation value.
采用如下公式计算:Calculated using the following formula:
R=0.5·reward_ce+0.5·safe_eval_actionR=0.5·reward_ce+0.5·safe_eval_action
其中,R是控制奖励,reward_ce是能耗评估值,safe_eval_action是动作约束评估值,两个0.5分别是能耗评估值和动作约束评估值的权值。Among them, R is the control reward, reward_ce is the energy consumption evaluation value, safe_eval_action is the action constraint evaluation value, and the two 0.5 are the weights of the energy consumption evaluation value and the action constraint evaluation value respectively.
需要说明的是,此处采用算数平均的方法综合计算,将权值设为(0.5,0.5),但是还可以根据实际需求将权值设定为其他值。在其他例子中,还可以采用平均之外的其他统计分析方法计算,此处不一一赘述。It should be noted that the arithmetic average method is used for comprehensive calculation here, and the weight is set to (0.5, 0.5), but the weight can also be set to other values according to actual needs. In other examples, other statistical analysis methods other than average may also be used for calculation, which will not be repeated here.
需要说明的是,上述步骤实际上是将动作评估值、冷量评估值先综合分析再加入能耗评估值继续分析,直接三者分析的情况下,上述过程等价为采用如下公式计算控制奖励:It should be noted that the above steps are actually a comprehensive analysis of the action evaluation value and the cooling capacity evaluation value before adding the energy consumption evaluation value to continue the analysis. In the case of direct analysis of the three, the above process is equivalent to using the following formula to calculate the control reward. :
R=0.5·reward_ce+0.25·f(A_base,A_explore)+0.25·g(cool_cal_t,cool_cal_Φ)R=0.5·reward_ce+0.25·f(A_base,A_explore)+0.25·g(cool_cal_t,cool_cal_Φ)
其中,R是控制奖励,reward_ce是能耗评估值,f(A_base,A_explore)是动作评估值,g(cool_cal_t,cool_cal_Φ)是冷量评估值,0.5、0.25、0.25是权值。Among them, R is the control reward, reward_ce is the energy consumption evaluation value, f(A_base, A_explore) is the action evaluation value, g(cool_cal_t, cool_cal_Φ) is the cooling capacity evaluation value, and 0.5, 0.25, and 0.25 are the weights.
步骤509,将元组(S,A_explore,R,S)存储到经验样本池中并更新参数param_hist为参数param_explore。Step 509: Store the tuple (S, A_explore, R, S) in the experience sample pool and update the parameter param_hist to the parameter param_explore.
具体地说,得到一个元组(S,A_explore,R,S)就是完成一次探索,探索次数就会加1还需要将参数param_explore的值赋给参数param_hist。Specifically, to get a tuple (S, A_explore, R, S) is to complete an exploration, and the number of explorations will be increased by 1, and the value of the parameter param_explore needs to be assigned to the parameter param_hist.
步骤510,检测当前探索次数是否达到最大次数。 Step 510, detecting whether the current number of explorations reaches the maximum number.
具体地说,若是,执行步骤506,若否,执行步骤511。Specifically, if yes, go to step 506, if not, go to step 511.
步骤511,利用经验样本池中的数据对模型M2进行优化。 Step 511, using the data in the experience sample pool to optimize the model M2.
具体地说,一个样本可以得到模型M2优化一次所需的数据,一次结束后优化次数加1。Specifically, one sample can obtain the data required for one optimization of the model M2, and the optimization times are incremented by 1 after one time.
步骤512,检测当前优化次数是否达到最大值。 Step 512, detecting whether the current optimization times reaches the maximum value.
具体地说,若是,执行步骤505,若否,执行步骤513。Specifically, if yes, go to step 505, if not, go to step 513.
步骤513,保存模型M2,对模型M2进行上线部署现场测试。In step 513, the model M2 is saved, and an on-line deployment field test is performed on the model M2.
本实施例提出的水冷系统的控制模型优化方法,通过获取数据中心的参数,为创建的状态转移模型和控制模型提供训练数据,然后根据数据中心的参数训练优化状态转移模型,为控制模型优化提供真实、离线的环境用以交互训练,接着根据优化后的状态转移模型和数据中心的参数对控制模型进行优化,实现对水冷系统的控制模型的优化。首先本申请为数据中心的水冷系统提供一种了控制模型,因而能够有效降低水冷系统的能耗,其次,本申请还能够对控制模型进行优化,而不是使用初始的控制模型,更加安全可靠,为数据中心的水冷系统提供一种经过真实交互环境交互过程的优化后的控制模型,使得能够拟合真实环境,有效降低水冷系统的能耗的同时不会影响数据中心的正常运行。The control model optimization method for a water cooling system proposed in this embodiment provides training data for the created state transition model and control model by acquiring the parameters of the data center, and then trains and optimizes the state transition model according to the parameters of the data center to provide the optimal control model. The real, offline environment is used for interactive training, and then the control model is optimized according to the optimized state transition model and the parameters of the data center to realize the optimization of the control model of the water cooling system. First, the present application provides a control model for the water cooling system of the data center, which can effectively reduce the energy consumption of the water cooling system. Second, the present application can also optimize the control model instead of using the initial control model, which is safer and more reliable. Provide an optimized control model for the water cooling system of the data center through the interaction process of the real interactive environment, so that it can fit the real environment and effectively reduce the energy consumption of the water cooling system without affecting the normal operation of the data center.
此外,应当理解的是,上面各种方法的步骤划分,只是为了描述清楚,实现时可以合并为一个步骤或者对某些步骤进行拆分,分解为多个步骤,只要包括相同的逻辑关系,都在本专利的保护范围内;对算法中或者流程中添加无关紧要的修改或者引入无关紧要的设计,但不改变其算法和流程的核心设计都在该专利的保护范围内。In addition, it should be understood that the division of steps of the various methods above is only for the purpose of describing clearly, and can be combined into one step or split into some steps during implementation, and decomposed into multiple steps, as long as the same logical relationship is included, all Within the protection scope of this patent; adding insignificant modifications to the algorithm or process or introducing insignificant designs, but not changing the core design of the algorithm and process are all within the protection scope of this patent.
本申请的第四实施例涉及一种电子设备,如图6所示,包括:至少一个处理器601;以及,与至少一个处理器601通信连接的存储器602;其中,存储器602存储有可被至少一个处理器601执行的指令,指令被至少一个处理器601执行,以使至少一个处理器601能够执行上述任一方法实施例所描述的水冷系统的控制模型优化方法。The fourth embodiment of the present application relates to an electronic device, as shown in FIG. 6 , comprising: at least one processor 601 ; and a memory 602 communicatively connected to the at least one processor 601 ; wherein the memory 602 stores data that can be accessed by at least one processor 601 . Instructions executed by one processor 601, the instructions are executed by at least one processor 601, so that at least one processor 601 can execute the control model optimization method for a water cooling system described in any of the above method embodiments.
其中,存储器602和处理器601采用总线方式连接,总线可以包括任意数量的互联的总线和桥,总线将一个或多个处理器601和存储器602的各种电路连接在一起。总线还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路连接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口在总线和收发机之间提供接口。收发机可以是一个元件,也可以是多个元件,比如多个接收器和发送器,提供用于在传输介质上与各种其他装置通信的单元。经处理器601处理的数据通过天线在无线介质上进行传输,进一步,天线还接收数据并将数据传输给处理器601。The memory 602 and the processor 601 are connected by a bus, and the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors 601 and various circuits of the memory 602 together. The bus may also connect together various other circuits, such as peripherals, voltage regulators, and power management circuits, which are well known in the art and therefore will not be described further herein. The bus interface provides the interface between the bus and the transceiver. A transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing a means for communicating with various other devices over a transmission medium. The data processed by the processor 601 is transmitted on the wireless medium through the antenna, and further, the antenna also receives the data and transmits the data to the processor 601 .
处理器601负责管理总线和通常的处理,还可以提供各种功能,包括定时,外围接口,电压调节、电源管理以及其他控制功能。而存储器602可以被用于存储处理器601在执行操作时所使用的数据。 Processor 601 is responsible for managing the bus and general processing, and may also provide various functions including timing, peripheral interface, voltage regulation, power management, and other control functions. The memory 602 may be used to store data used by the processor 601 when performing operations.
本申请第五实施例涉及一种计算机可读存储介质,存储有计算机程序。计算机程序被处理器执行时实现上述方法实施例。The fifth embodiment of the present application relates to a computer-readable storage medium storing a computer program. The above method embodiments are implemented when the computer program is executed by the processor.
即,本领域技术人员可以理解,实现上述实施例方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序存储在一个存储介质中,包括若干指令用以使得一个设备(可以是单片机,芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。That is, those skilled in the art can understand that all or part of the steps in the method of implementing the above embodiments can be completed by instructing the relevant hardware through a program, and the program is stored in a storage medium and includes several instructions to make a device ( It may be a single chip microcomputer, a chip, etc.) or a processor (processor) to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .
本领域的普通技术人员可以理解,上述各实施例是实现本申请的具体实施例,而在实际应用中,可以在形式上和细节上对其作各种改变,而不偏离本申请的精神和范围。Those of ordinary skill in the art can understand that the above-mentioned embodiments are specific embodiments for realizing the present application, and in practical applications, various changes in form and details can be made without departing from the spirit and the spirit of the present application. scope.

Claims (10)

  1. 一种水冷系统的控制模型优化方法,包括:A control model optimization method for a water cooling system, comprising:
    获取数据中心的参数;Get the parameters of the data center;
    创建状态转移模型和控制模型;Create state transition models and control models;
    根据所述数据中心的参数优化所述状态转移模型;Optimizing the state transition model according to the parameters of the data center;
    根据优化后的所述状态转移模型和所述数据中心的参数对所述控制模型进行优化,获取优化后的所述控制模型。The control model is optimized according to the optimized state transition model and parameters of the data center, and the optimized control model is obtained.
  2. 根据权利要求1所述的方法,其中,所述数据中心的参数包括环境参数、运行参数、控制参数和输出参数,所述根据所述数据中心的参数优化所述状态转移模型,包括:The method according to claim 1, wherein the parameters of the data center include environmental parameters, operating parameters, control parameters and output parameters, and the optimizing the state transition model according to the parameters of the data center includes:
    将所述环境参数、所述运行参数和所述控制参数输入所述状态转移模型,获取预测输出参数;Inputting the environmental parameters, the operating parameters and the control parameters into the state transition model to obtain predicted output parameters;
    根据所述预测输出参数和所述输出参数计算状态转移损失;calculating a state transition loss according to the predicted output parameter and the output parameter;
    根据所述状态转移损失优化所述状态转移模型。The state transition model is optimized according to the state transition loss.
  3. 根据权利要求1或2所述的方法,其中,所述根据优化后的所述状态转移模型和所述数据中心的参数对预先创建的控制模型进行优化,获取优化后的所述控制模型,包括:The method according to claim 1 or 2, wherein, optimizing a pre-created control model according to the optimized state transition model and parameters of the data center, and obtaining the optimized control model, comprising: :
    采样步骤,从所述数据中心的参数中随机选取一组样本数据;Sampling step, randomly select a group of sample data from the parameters of the data center;
    利用所述控制模型和所述状态转移模型处理所述样本数据,获取训练样本;Process the sample data by using the control model and the state transition model to obtain training samples;
    根据所述训练样本优化所述控制模型;Optimizing the control model according to the training samples;
    检测所述控制模型的优化次数是否达到第一阈值;Detecting whether the optimization times of the control model reaches a first threshold;
    若是,获取并保存优化后的所述控制模型;If so, obtain and save the optimized control model;
    若否,返回所述采样步骤。If not, return to the sampling step.
  4. 根据权利要求3所述的方法,其中,所述样本数据包括所述环境参数、所述运行参数和所述控制参数,所述利用所述控制模型和所述状态转移模型处理所述样本数据,获取训练样本包括:4. The method of claim 3, wherein the sample data includes the environmental parameters, the operating parameters, and the control parameters, and wherein the sample data is processed using the control model and the state transition model, Obtaining training samples includes:
    根据所述环境参数和所述运行参数生成状态表征数据;generating state characterization data according to the environmental parameters and the operating parameters;
    将所述状态表征数据输入所述控制模型,获取预测控制参数;Inputting the state representation data into the control model to obtain predictive control parameters;
    将所述预测控制参数、所述环境参数和所述运行参数输入所述状态转移模型,获取所述预测输出参数并根据所述预测输出参数更新所述环境参数和所述运行参数;inputting the predicted control parameters, the environmental parameters and the operating parameters into the state transition model, obtaining the predicted output parameters and updating the environmental parameters and the operating parameters according to the predicted output parameters;
    根据更新后的所述环境参数和所述运行参数更新所述状态表征数据;Update the state representation data according to the updated environmental parameters and the operating parameters;
    根据所述预测控制参数、所述预测输出参数和所述控制参数对所述预测控制参数进行评估,获取控制奖励;Evaluate the predicted control parameter according to the predicted control parameter, the predicted output parameter and the control parameter, and obtain a control reward;
    根据更新前的所述状态表征数据、所述预测控制参数、所述控制奖励和更新后的所述状态表征数据生成一个训练样本。A training sample is generated according to the state characterization data before updating, the predictive control parameter, the control reward, and the updated state characterization data.
  5. 根据权利要求4所述的方法,其中,还包括:The method of claim 4, further comprising:
    若检测到所述训练样本的数量未达到第二阈值,利用所述控制模型和所述状态转移模型处理更新后的所述环境参数、更新后的所述运行参数和所述预测控制参数,获取训练样本,直到所述训练样本的数量达到所述第二阈值。If it is detected that the number of training samples does not reach the second threshold, use the control model and the state transition model to process the updated environmental parameters, the updated operating parameters and the predictive control parameters, and obtain training samples until the number of training samples reaches the second threshold.
  6. 根据权利要求4或5所述的方法,其中,所述根据所述预测控制参数、所述预测输出 参数和所述控制参数对所述预测控制参数进行评估,获取控制奖励,包括:The method according to claim 4 or 5, wherein, evaluating the predictive control parameter according to the predictive control parameter, the predictive output parameter and the control parameter to obtain a control reward, comprising:
    根据所述预测控制参数和所述控制参数获取对控制策略的控制动作评估值;Obtaining the control action evaluation value for the control strategy according to the predicted control parameter and the control parameter;
    根据所述预测输出参数获取控制效果评估值,其中,所述控制效果评估值包括能耗评估值和冷量评估值;Obtaining a control effect evaluation value according to the predicted output parameter, wherein the control effect evaluation value includes an energy consumption evaluation value and a cooling capacity evaluation value;
    对所述控制动作评估值、所述能耗评估值和所述冷量评估值进行综合分析,获取所述控制奖励。The control reward is obtained by comprehensively analyzing the control action evaluation value, the energy consumption evaluation value and the cooling capacity evaluation value.
  7. 根据权利要求6所述的方法,其中,所述根据所述预测输出参数获取控制效果评估值,包括:The method according to claim 6, wherein the obtaining the control effect evaluation value according to the predicted output parameter comprises:
    根据第一次的所述预测输出参数中获取初始能耗参数和初始冷量参数;Obtain the initial energy consumption parameter and the initial cooling capacity parameter according to the first predicted output parameter;
    根据当前的所述预测输出参数中获取当前预测能耗参数和当前预测冷量参数;Obtain the current predicted energy consumption parameter and the current predicted cooling capacity parameter according to the current predicted output parameter;
    根据上一次的所预测输出参数中获取历史预测能耗参数;Obtain historical predicted energy consumption parameters from the last predicted output parameters;
    根据所述当前预测能耗参数和所述历史预测能耗参数获取能耗的增长率;Obtain the growth rate of energy consumption according to the current predicted energy consumption parameter and the historical predicted energy consumption parameter;
    根据所述当前预测能耗参数和所述初始能耗参数获取能耗的偏离度;Obtaining the deviation degree of energy consumption according to the current predicted energy consumption parameter and the initial energy consumption parameter;
    对所述增长率和所述偏离度进行平均获取所述能耗评估值;averaging the growth rate and the deviation to obtain the energy consumption evaluation value;
    根据所述初始冷量参数和所述当前预测冷量参数进行分析,获取所述冷量评估值。Analysis is performed according to the initial cooling capacity parameter and the current predicted cooling capacity parameter to obtain the cooling capacity evaluation value.
  8. 根据权利要求6或7所述的方法,其中,所述对所述控制动作评估值、所述能耗评估值和所述冷量评估值进行综合分析,获取所述控制奖励,包括:The method according to claim 6 or 7, wherein the comprehensive analysis of the control action evaluation value, the energy consumption evaluation value and the cooling capacity evaluation value to obtain the control reward comprises:
    对所述控制动作评估值和所述冷量评估值进行加权平均,获取约束评估值;Carry out a weighted average to the control action evaluation value and the cooling capacity evaluation value to obtain a constraint evaluation value;
    对所述能耗评估值和所述约束评估值进行加权平均,获取所述控制奖励。A weighted average is performed on the energy consumption evaluation value and the constraint evaluation value to obtain the control reward.
  9. 一种电子设备,其特征在于,包括:An electronic device, comprising:
    至少一个处理器;以及,at least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如权利要求1至8中任意一项所述水冷系统的控制模型优化方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the execution of any one of claims 1 to 8 The control model optimization method of the water cooling system is described.
  10. 一种计算机可读存储介质,存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至8中任一项所述的水冷系统的控制模型优化方法。A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the control model optimization method for a water cooling system according to any one of claims 1 to 8 is implemented.
PCT/CN2021/127980 2020-11-30 2021-11-01 Method for optimizing control model of water cooling system, electronic device, and storage medium WO2022111232A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011377439.8A CN114580688A (en) 2020-11-30 2020-11-30 Control model optimization method of water cooling system, electronic equipment and storage medium
CN202011377439.8 2020-11-30

Publications (1)

Publication Number Publication Date
WO2022111232A1 true WO2022111232A1 (en) 2022-06-02

Family

ID=81755330

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/127980 WO2022111232A1 (en) 2020-11-30 2021-11-01 Method for optimizing control model of water cooling system, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN114580688A (en)
WO (1) WO2022111232A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116307946A (en) * 2023-05-19 2023-06-23 中科航迈数控软件(深圳)有限公司 Processing environment-based adaptive energy consumption assessment method, device, equipment and medium
WO2023246717A1 (en) * 2022-06-23 2023-12-28 华为技术有限公司 Liquid cooling system control method and apparatus, device, and system
CN117408170A (en) * 2023-12-15 2024-01-16 南京群顶科技股份有限公司 Energy-saving predictive control method suitable for water cooling system of data center
CN117529070A (en) * 2024-01-08 2024-02-06 四川建诚智造科技有限公司 Heat dissipation control method and system for stable operation of photovoltaic inverter

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114727570B (en) * 2022-06-09 2022-09-02 南京群顶科技有限公司 Refrigerating unit water temperature adjusting method based on temperature change relation
CN116466672B (en) * 2023-06-13 2023-11-10 深圳市宝腾互联科技有限公司 Data center machine room parameter regulation and control method based on artificial intelligence and related device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104698843A (en) * 2015-02-06 2015-06-10 同济大学 Model prediction control based energy saving control method of data center
CN109882996A (en) * 2019-01-25 2019-06-14 珠海格力电器股份有限公司 A kind of method and apparatus of control
CN111505944A (en) * 2019-01-30 2020-08-07 珠海格力电器股份有限公司 Energy-saving control strategy learning method, and method and device for realizing air conditioning energy control
CN111609534A (en) * 2020-05-25 2020-09-01 珠海拓芯科技有限公司 Temperature control method and device and central temperature control system
WO2020199648A1 (en) * 2019-04-01 2020-10-08 珠海格力电器股份有限公司 Control method and device for air conditioner

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104698843A (en) * 2015-02-06 2015-06-10 同济大学 Model prediction control based energy saving control method of data center
CN109882996A (en) * 2019-01-25 2019-06-14 珠海格力电器股份有限公司 A kind of method and apparatus of control
CN111505944A (en) * 2019-01-30 2020-08-07 珠海格力电器股份有限公司 Energy-saving control strategy learning method, and method and device for realizing air conditioning energy control
WO2020199648A1 (en) * 2019-04-01 2020-10-08 珠海格力电器股份有限公司 Control method and device for air conditioner
CN111609534A (en) * 2020-05-25 2020-09-01 珠海拓芯科技有限公司 Temperature control method and device and central temperature control system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JIAMIN DU, SHUHONG LI, XINMEI LI: "Energy-saving optimization of cooling system based on ResNet-GWO", JOURNAL OF SOUTHEAST UNIVERSITY (NATURAL SCIENCE EDITION), vol. 50, no. 5, 1 September 2020 (2020-09-01), pages 866 - 874, XP055932774 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023246717A1 (en) * 2022-06-23 2023-12-28 华为技术有限公司 Liquid cooling system control method and apparatus, device, and system
CN116307946A (en) * 2023-05-19 2023-06-23 中科航迈数控软件(深圳)有限公司 Processing environment-based adaptive energy consumption assessment method, device, equipment and medium
CN116307946B (en) * 2023-05-19 2023-08-18 中科航迈数控软件(深圳)有限公司 Processing environment-based adaptive energy consumption assessment method, device, equipment and medium
CN117408170A (en) * 2023-12-15 2024-01-16 南京群顶科技股份有限公司 Energy-saving predictive control method suitable for water cooling system of data center
CN117408170B (en) * 2023-12-15 2024-03-08 南京群顶科技股份有限公司 Energy-saving predictive control method suitable for water cooling system of data center
CN117529070A (en) * 2024-01-08 2024-02-06 四川建诚智造科技有限公司 Heat dissipation control method and system for stable operation of photovoltaic inverter
CN117529070B (en) * 2024-01-08 2024-03-26 四川建诚智造科技有限公司 Heat dissipation control method and system for stable operation of photovoltaic inverter

Also Published As

Publication number Publication date
CN114580688A (en) 2022-06-03

Similar Documents

Publication Publication Date Title
WO2022111232A1 (en) Method for optimizing control model of water cooling system, electronic device, and storage medium
WO2023093820A1 (en) Device control optimization method, display platform, cloud server, and storage medium
US11513480B2 (en) Method and device for automatically diagnosing and controlling apparatus in intelligent building
CN104463381B (en) Building energy consumption Forecasting Methodology based on KPCA and WLSSVM
JP2012149839A (en) Air conditioner linkage control system, air conditioner linkage control method, and air conditioner linkage control program
Zhu et al. Uncertainty and sensitivity analysis of cooling and heating loads for building energy planning
CN112214689A (en) Method and system for maximizing influence of group in social network
CN114330852B (en) Energy-saving optimization method and device for tail end air conditioning system of integrated data center cabinet
US20230040886A1 (en) Building equipment energy management control system and control therefor
CN111582604A (en) Data processing method and device, electronic device, and medium
CN115238599B (en) Energy-saving method and model reinforcement learning training method and device for refrigerating system
CN115983438A (en) Method and device for determining operation strategy of data center terminal air conditioning system
WO2022213494A1 (en) Energy-saving control method, system and device for water chilling unit of central air conditioner
Shi et al. Building energy model reduction using model-cluster-reduce pipeline
CN117641542B (en) Method for online optimization of power consumption strategy of terminal of Internet of things
CN113821903B (en) Temperature control method and equipment, modularized data center and storage medium
WO2024016586A1 (en) Machine room temperature control method and apparatus, and electronic device and storage medium
CN116795198A (en) Energy consumption optimization method and device for data center and storage medium
Wang et al. A data mining approach to discover critical events for event-driven optimization in building air conditioning systems
CN115388586A (en) Refrigeration control model, device, electronic equipment and storage medium
CN111103799B (en) Energy efficiency optimization modeling method based on teacher-student network online exploration learning
Man et al. TEPAWSN: A formal analysis tool for wireless sensor networks
CN117763955A (en) Modeling method and device for refrigerating system equipment
Chen et al. Improve the stability and robustness of power management through model-free deep reinforcement learning
CN114650190B (en) Energy-saving method, system, terminal equipment and storage medium for data center network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21896733

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24/10/2023)