CN114755919A - Air conditioner room control method and device, equipment and storage medium - Google Patents

Air conditioner room control method and device, equipment and storage medium Download PDF

Info

Publication number
CN114755919A
CN114755919A CN202210353204.8A CN202210353204A CN114755919A CN 114755919 A CN114755919 A CN 114755919A CN 202210353204 A CN202210353204 A CN 202210353204A CN 114755919 A CN114755919 A CN 114755919A
Authority
CN
China
Prior art keywords
action
executed
actions
value
air conditioner
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210353204.8A
Other languages
Chinese (zh)
Inventor
林治强
江瑞
何晓钰
鲁永浩
胡磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jixin Taifu Electromechanical Technology Co ltd
Original Assignee
Beijing Jixin Taifu Electromechanical Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jixin Taifu Electromechanical Technology Co ltd filed Critical Beijing Jixin Taifu Electromechanical Technology Co ltd
Priority to CN202210353204.8A priority Critical patent/CN114755919A/en
Publication of CN114755919A publication Critical patent/CN114755919A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Automation & Control Theory (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Air Conditioning Control Device (AREA)

Abstract

The disclosure relates to a method and a device for controlling an air conditioner room, equipment and a storage medium, wherein the method comprises the following steps: selecting one of a plurality of preset groups of actions by taking the received running state parameters of the air conditioner room as current running state parameters; acquiring running state parameters after executing the action, and calculating the action value after executing the action; judging whether the cycle stop condition is met according to the action value or the executed action after the action is executed: if not, taking the operation state parameter after the action is executed as the current operation state parameter, and executing the step of selecting one group of actions in the preset groups of actions until the circulation stop condition is met; if yes, determining the action with the maximum action value under the same operation state parameter from all the action values, using the action to control the operation of the air conditioner room, and obtaining the optimal action which enables the power consumption to be minimum and the temperature to be appropriate under different operation states, thereby improving the energy efficiency of the air conditioner room through the optimal action.

Description

Air conditioner room control method and device, equipment and storage medium
Technical Field
The present disclosure relates to the field of air conditioner rooms, and in particular, to an air conditioner room control method, apparatus, device, and storage medium.
Background
The energy consumption of the air conditioner room accounts for more than 70% of the energy consumption of the air conditioning system, so that the air conditioner room is an important part for energy conservation due to high energy consumption ratio, and how to optimize the energy efficiency of the air conditioner room becomes a technical problem to be solved urgently.
In the prior art, the energy efficiency of the air conditioner room is generally optimized by constructing a neural network model taking the operation parameters of the air conditioner room as input and the power consumption of the air conditioner room as output. However, the constructed neural network model is complex, and accurate power consumption of the air conditioner room is not easy to obtain, so that the energy efficiency of the air conditioner room cannot be accurately optimized.
Disclosure of Invention
In order to solve the technical problems or at least partially solve the technical problems, embodiments of the present disclosure provide an air conditioner room control method and apparatus, a device, and a storage medium.
In a first aspect, an embodiment of the present disclosure provides an air conditioner room control method, where the method includes:
receiving operation state parameters of an air conditioner room, and selecting one action of multiple preset actions under the current operation state parameters to execute the selected action;
acquiring running state parameters after executing actions, calculating action values after executing the actions, and constructing a corresponding relation among the current running state parameters, the executed actions and the action values after executing the actions;
Judging whether the cycle stop condition is met according to the action value or the executed action after the action is executed:
when the circulation stop condition is not met, the operation state parameter after the action is executed is taken as the current operation state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stop condition is met;
and determining the action with the maximum action value under the same operation state parameter from all the action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action values, and controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range.
In a possible implementation, before the obtaining the operating state parameter after the performing of the action, the method further includes:
and starting timing from the execution of the selected action, and executing the step of acquiring the running state parameters after the execution of the action when the accumulated duration reaches the preset duration.
In one possible embodiment, one of the preset plurality of actions is selected from the preset plurality of actions based on a greedy algorithm.
In one possible implementation, the post-action value is calculated by the expression:
Figure BDA0003581484480000021
wherein, Q' (S)t,At) Value of post-action taken for time t, Q (S)t,At) Taking the pre-action value for t, alpha and gamma are penalty factor for updating Q and discount factor for return, Rt+1In return after taking action, π (a | S)t+1) For operating the state parameter S after taking actiont+1Probability of taking action a, Q (S)t+1And a) is the value of the pre-action taken at the time t + 1.
In one possible implementation, the reward after action is calculated by the following expression:
Figure BDA0003581484480000022
wherein R ist+1For the return after taking action, β 1 and β 2 are proportionality coefficients, TieIndoor temperature, T, corresponding to the ith air conditionerilFor the ith air conditionerPreset temperature, WitActual power corresponding to the ith air conditioner, WipAnd the preset power is corresponding to the ith air conditioner.
In a possible implementation, the loop stop condition includes that a difference between the action value after the action is performed and the action value before the action is performed is less than or equal to a preset threshold, or the number of times of performing the action is greater than or equal to a preset number.
In one possible implementation manner, the operation state parameters of the air conditioner room include any of chilled water inlet temperature, chilled water outlet temperature, cooling water inlet temperature, cooling water outlet temperature, chilled water flow, cooling water pump rotation speed, chilled water pump rotation speed, cooling tower fan rotation speed, indoor environment temperature, outdoor environment temperature, environment people flow, weather conditions, total system operation power and total system operation efficiency; and each group of preset actions comprises any of increasing a set value of the outlet water temperature of the chilled water, reducing the set value of the outlet water temperature of the chilled water, setting the number of different water chilling unit running units, increasing a set value of the rotating speed of a water pump, reducing the set value of the rotating speed of the water pump, setting the number of different water pumping unit running units, increasing a set value of the rotating speed of a fan of the cooling tower, reducing a set value of the rotating speed of a fan of the cooling tower, increasing a set value of the cooling water temperature, and reducing the set value of the cooling water temperature.
In a second aspect, an embodiment of the present disclosure provides an air conditioning room control apparatus including:
the selection module is used for receiving the running state parameters of the air conditioner room and selecting one group of actions in multiple preset groups of actions under the current running state parameters so as to execute the selected actions;
the computing module is used for acquiring the running state parameters after the action is executed, computing the action value after the action is executed, and constructing the corresponding relation among the current running state parameters, the executed action and the action value after the action is executed;
the judging module is used for judging whether the circulation stopping condition is met or not according to the action value after the action is executed or the executed action, and when the circulation stopping condition is not met, the running state parameter after the action is executed is used as the current running state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stopping condition is met;
and the determining module is used for determining the action with the maximum action value under the same operation state parameter from all the action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action value, and is used for controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range.
In a third aspect, an embodiment of the present disclosure provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus;
a memory for storing a computer program;
and the processor is used for realizing the control method of the air conditioner room when executing the program stored in the memory.
In a fourth aspect, an embodiment of the present disclosure provides a computer-readable storage medium, on which a computer program is stored, where the computer program is executed by a processor to implement the above-mentioned air conditioner room control method.
Compared with the prior art, the technical scheme provided by the embodiment of the disclosure at least has part or all of the following advantages:
the air conditioner room control method of the embodiment of the disclosure receives an operation state parameter of an air conditioner room, and selects one of a plurality of preset groups of actions to execute the selected action under the current operation state parameter; acquiring running state parameters after executing actions, calculating action values after executing the actions, and constructing a corresponding relation among the current running state parameters, the executed actions and the action values after executing the actions; judging whether the cycle stop condition is met according to the action value or the executed action after the action is executed: when the circulation stop condition is not met, the operation state parameter after the action is executed is taken as the current operation state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stop condition is met; based on the constructed corresponding relation among the operation state parameters, the executed actions and the action values, the action with the maximum action value under the same operation state parameter is determined from all the action values calculated after the action is executed and is used for controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range, the optimal action which enables the power consumption to be minimum and the temperature to be proper under different operation states can be obtained under the condition of not depending on a neural network model, and therefore the energy efficiency of the air conditioner room is improved through the optimal action.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure.
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the related art will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.
Fig. 1 schematically shows a flow chart of an air conditioner room control method according to an embodiment of the present disclosure;
fig. 2 schematically shows a block diagram of the structure of an air conditioning room control apparatus according to an embodiment of the present disclosure; and
fig. 3 schematically shows a block diagram of an electronic device according to an embodiment of the present disclosure.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present disclosure more clear, the technical solutions of the embodiments of the present disclosure will be described clearly and completely with reference to the drawings in the embodiments of the present disclosure, and it is obvious that the described embodiments are some embodiments of the present disclosure, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments disclosed herein without making any creative effort, shall fall within the protection scope of the present disclosure.
Referring to fig. 1, an embodiment of the present disclosure provides an air conditioner room control method, including:
s1, receiving the operation state parameters of the air conditioner room, and selecting one group of preset actions under the current operation state parameters to execute the selected actions;
in practical applications, one of the preset groups of actions is selected from the preset groups of actions based on a greedy algorithm.
S2, acquiring the operation state parameters after executing the action, calculating the action value after executing the action, and constructing the corresponding relation among the current operation state parameters, the executed action and the action value after executing the action;
in practical application, the operating state parameters of the air conditioner room comprise any of chilled water inlet temperature, chilled water outlet temperature, cooling water inlet temperature, cooling water outlet temperature, chilled water flow, cooling water pump rotating speed, chilled water pump rotating speed, cooling tower fan rotating speed, indoor environment temperature, outdoor environment temperature, environment people flow, weather conditions, total system operating power and total system operating efficiency; each group of preset actions comprises any of increasing a set value of the outlet water temperature of the chilled water, decreasing the set value of the outlet water temperature of the chilled water, setting the number of different water chilling unit running units, increasing a set value of the rotating speed of a water pump, decreasing a set value of the rotating speed of the water pump, setting the number of different water pumping unit running units, increasing a set value of the rotating speed of a fan of a cooling tower, decreasing a set value of the rotating speed of the fan of the cooling tower, increasing a set value of the cooling water temperature, and decreasing the set value of the cooling water temperature.
And S3, judging whether the circulation stop condition is met according to the action value after the action is executed or the executed action:
if yes, go to step S4;
if not, go to step S5;
in practical applications, the cycle stop condition includes that a difference between the action value after the action is executed and the action value before the action is executed is less than or equal to a preset threshold, or the number of times of the executed action is greater than or equal to a preset number of times.
S4, determining the action with the maximum action value under the same operation state parameter from all action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action value, and controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range;
in practical applications, the same operating state parameter represents: each of the plurality of operating condition parameters has a predetermined range, for example, when the operating condition parameter includes an ambient temperature TeIndoor temperature TinWhen 4 state quantities of chilled water inlet water temperature and current system operating power are adopted, the same operating state parameter is represented, TeIs any one of (30 ℃, 35 ℃), T inIs any value in (27 ℃, 30 ℃), the inlet temperature of the chilled water is any value in (25 ℃, 30 ℃), and the current system operating power is any value in (300kw, 400 kw).
And S5, taking the operation state parameter after the action is executed as the current operation state parameter, and executing the step of selecting one group of actions in the preset groups of actions until the circulation stop condition is met.
In this embodiment, in step S2, before the obtaining the operation state parameter after the executing action, the method further includes:
and starting timing from the execution of the selected action, and executing the step of acquiring the running state parameters after the execution of the action when the accumulated time reaches the preset time.
In the present embodiment, in step S2, the value of the post-action is calculated by the following expression:
Figure BDA0003581484480000051
wherein, Q' (S)t,At) Value of post-action taken for time t, Q (S)t,At) The value of the action before the action is taken for the moment t, alpha and gamma are respectively a penalty coefficient for updating Q and a discount factor for returning, Rt+1In return after taking action, π (a | S)t+1) For operating a state parameter S after taking an actiont+1Probability of taking action a, Q (S)t+1And a) is the value of the action before the action is taken at the moment t + 1.
In the present embodiment, in step S2, the reward after taking action is calculated by the following expression:
Figure BDA0003581484480000052
wherein R ist+1For return after action, β 1 and β 2 are proportionality coefficients, TieIndoor temperature, T, corresponding to the ith air conditionerilPreset temperature, W, for the ith air conditioneritActual power, W, for the ith air conditioneripAnd the preset power corresponding to the ith air conditioner.
The operating state parameters of the air conditioner room comprise the ambient temperature TeIndoor temperature TinThe application method of the air conditioner room control method of the embodiment is explained by taking four actions of setting chilled water outlet temperature to be up-regulated by 2 degrees and down-regulated by 2 degrees, and setting cooling water return temperature to be up-regulated by 2 degrees and down-regulated by 2 degrees as examples, wherein after the chilled water outlet temperature and the cooling water return temperature are regulated, relevant running parameters of a water pump unit and a cooling tower of the system can correspondingly change, so that the power is changed.
The method comprises the following specific steps:
step one, establishing a time sequence difference reinforcement learning model according to the state quantities and the multiple groups of actions, and initializing an action value matrix Q, wherein all values are 0, for example;
Secondly, collecting the model state S at the current momenttThe model state StIncluding the ambient temperature TeIndoor temperature TinThe method comprises the steps of firstly, judging the temperature of inlet water of chilled water, judging which action is to be taken at the current moment according to action values of different actions taken at the moment, wherein an epsilon greedy algorithm strategy is adopted for taking the action, and the strategy shows that an action is randomly selected to be taken by having the probability of epsilon, and the action with the maximum action value is selected to be taken by having the probability of 1-epsilon. For example:
the state at the present time is StWhen the action values of different actions taken in the state are checked, (30 ℃, 27 ℃, 25 ℃, 300Kw), as shown above, the chilled water outlet water temperature is set to be adjusted up by 2 degrees and adjusted down by 2 degrees, the cooling water return water temperature is set to be adjusted up by 2 degrees and adjusted down by 2 degrees, and four actions are totally adopted, and since the initialized action values are all 0, one action A is randomly takent
Third step, according to the action A taken previouslytAnd the state at the time t, we obtain the state S at the time t +1t+1And real-time reporting Rt+1At this time, A is selected according to the action expectationt+1Then using St+1,At+1To update the operation value Q (S)t,At):
Figure BDA0003581484480000061
Wherein, alpha and gamma are respectively a penalty factor for updating Q and a discount factor for returning, R t+1I.e. the instant reward at time t +1 described above:
Figure BDA0003581484480000062
the difference between the current indoor temperature and the suitable temperature and the difference between the current power and the better power are considered in the instant return, and the return is restricted by a coefficient.
Wherein, pi (a | S)t+1) Is shown at St+1Probability of taking action a, hereThe probability of the system taking each action is considered to be the same, where there are 4 actions, then there can be π (a | S)t+1) 1/4. For example:
in the second step, the random selection of chilled water is adjusted up by 2 degrees, which is obviously a bad action, resulting in a higher room temperature at time t +1, e.g. 24 ℃ as the optimum temperature, and a slight decrease in power when the temperature at the present time is 29 ℃, but the immediate return will get a negative value, StWhen the action value of other actions is not updated, S is encountered again next timetIn the state of (2), the system does not select the action of adjusting the temperature of the outlet water of the chilled water to 2 degrees again.
And fourthly, repeating the second step to the fourth step until the model learns a better action value matrix, and automatically adjusting the operation parameters of the system through the learned action value matrix, thereby reducing the power consumption and ensuring the more proper room temperature.
Referring to fig. 2, an embodiment of the present disclosure provides an air conditioning room control apparatus including:
a selection module 11, configured to receive an operation state parameter of an air conditioner room, and select one of multiple preset actions under the current operation state parameter, so as to execute the selected action;
the calculation module 12 is configured to obtain an operation state parameter after the action is executed, calculate an action value after the action is executed, and construct a corresponding relationship between the current operation state parameter, the executed action, and the action value after the action is executed;
the judging module 13 is configured to judge whether the cycle stop condition is met according to the action value after the action is executed or the executed action, and when the cycle stop condition is not met, use the running state parameter after the action is executed as the current running state parameter, and execute a step of selecting one of the preset groups of actions until the cycle stop condition is met;
and the determining module 14 is configured to determine, based on the constructed corresponding relationship between the operating state parameter and the executed action and the action value, an action with the largest action value in the same operating state parameter from all the action values calculated after the action is executed, and is used to control the operation of the air conditioner room, where each operating state parameter represents a preset value range.
The specific details of the implementation process of the functions and actions of each unit in the above device are the implementation processes of the corresponding steps in the above method, and are not described herein again.
For the device embodiment, since it basically corresponds to the method embodiment, reference may be made to the partial description of the method embodiment for relevant points. The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of the present invention. One of ordinary skill in the art can understand and implement it without inventive effort.
In the second embodiment, any multiple of the selecting module 11, the calculating module 12, the judging module 13 and the determining module 14 may be combined and implemented in one module, or any one of the modules may be split into multiple modules. Alternatively, at least part of the functionality of one or more of these modules may be combined with at least part of the functionality of the other modules and implemented in one module. At least one of the selecting module 11, the calculating module 12, the judging module 13 and the determining module 14 may be at least partially implemented as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or may be implemented by any other reasonable manner of integrating or packaging a circuit, such as hardware or firmware, or implemented by any one of three implementations of software, hardware and firmware, or any suitable combination of any of them. Alternatively, at least one of the selecting module 11, the calculating module 12, the judging module 13 and the determining module 14 may be at least partially implemented as a computer program module, which, when executed, may perform a corresponding function.
Referring to fig. 3, an electronic device according to a third exemplary embodiment of the present disclosure includes a processor 1110, a communication interface 1120, a memory 1130, and a communication bus 1140, where the processor 1110, the communication interface 1120, and the memory 1130 complete communication with each other through the communication bus 1140;
a memory 1130 for storing computer programs;
the processor 1110 is configured to implement the following air conditioner room control method when executing the program stored in the memory 1130:
receiving operation state parameters of an air conditioner room, and selecting one action of multiple preset actions under the current operation state parameters to execute the selected action;
acquiring running state parameters after executing actions, calculating action values after executing the actions, and constructing a corresponding relation among the current running state parameters, the executed actions and the action values after executing the actions;
judging whether the cycle stop condition is met according to the action value or the executed action after the action is executed:
when the circulation stopping condition is not met, the operation state parameter after the action is executed is taken as the current operation state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stopping condition is met;
And determining the action with the maximum action value under the same operation state parameter from all the action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action values, and controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range.
The communication bus 1140 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus 1140 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one thick line is shown, but this is not intended to represent only one bus or type of bus.
The communication interface 1120 is used for communication between the electronic device and other devices.
The Memory 1130 may include a Random Access Memory (RAM) and a non-volatile Memory (non-volatile Memory), such as at least one disk Memory. Optionally, the memory 1130 may also be at least one memory device located remotely from the processor 1110.
The Processor 1110 may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, or discrete hardware components.
A fourth exemplary embodiment of the present disclosure also provides a computer-readable storage medium. The computer readable storage medium has stored thereon a computer program which, when executed by a processor, implements the air conditioner room control method as described above.
The computer-readable storage medium may be contained in the apparatus/device described in the above embodiments; or may be separate and not incorporated into the device/apparatus. The computer-readable storage medium carries one or more programs which, when executed, implement an air conditioner room control method according to an embodiment of the present disclosure.
According to embodiments of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, which may include, for example but is not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
It is noted that, in this document, relational terms such as "first" and "second," and the like, are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
The previous description is only for the purpose of describing particular embodiments of the present disclosure, so as to enable those skilled in the art to understand or implement the present disclosure. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A control method for an air conditioner room is characterized by comprising the following steps:
receiving operation state parameters of an air conditioner room, and selecting one action of multiple preset actions under the current operation state parameters to execute the selected action;
acquiring running state parameters after executing actions, calculating action values after executing the actions, and constructing a corresponding relation among the current running state parameters, the executed actions and the action values after executing the actions;
judging whether the cycle stop condition is met according to the action value or the executed action after the action is executed:
when the circulation stop condition is not met, the operation state parameter after the action is executed is taken as the current operation state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stop condition is met;
And determining the action with the maximum action value under the same operation state parameter from all action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action value, and controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range.
2. The method of claim 1, wherein prior to said obtaining the operating state parameters after performing the action, the method further comprises:
and starting timing from the execution of the selected action, and executing the step of acquiring the running state parameters after the execution of the action when the accumulated time reaches the preset time.
3. The method of claim 1, wherein one of the predetermined plurality of actions is selected from a predetermined plurality of actions based on a greedy algorithm.
4. The method of claim 1, wherein the value of the action after the action is taken is calculated by the expression:
Figure FDA0003581484470000011
wherein, Q' (S)t,At) Value of post-action taken for time t, Q (S)t,At) The value of the action before the action is taken for the moment t, alpha and gamma are respectively a penalty coefficient for updating Q and a discount factor for returning, R t+1In return after taking action, π (a | S)t+1) For operating the state parameter S after taking actiont+1Probability of taking action a, Q (S)t+1And a) is the value of the pre-action taken at the time t + 1.
5. The method of claim 4, wherein the reward after taking action is calculated by the expression:
Figure FDA0003581484470000021
wherein R ist+1For return after action, β 1 and β 2 are proportionality coefficients, TieIndoor temperature, T, corresponding to the ith air conditionerilA preset temperature, W, corresponding to the ith air conditioneritActual power corresponding to the ith air conditioner, WipAnd the preset power is corresponding to the ith air conditioner.
6. The method according to claim 1, wherein the loop stop condition comprises that a difference between a value of the action after the action is performed and a value of the action before the action is performed is less than or equal to a preset threshold, or that a number of the actions performed is greater than or equal to a preset number.
7. The method of claim 1, wherein the operating state parameters of the air-conditioning room comprise any of chilled water inlet temperature, chilled water outlet temperature, cooling water inlet temperature, cooling water outlet temperature, chilled water flow, cooling water pump speed, chilled water pump speed, cooling tower fan speed, indoor ambient temperature, outdoor ambient temperature, ambient people flow, weather conditions, total system operating power, and total system operating efficiency; and each group of preset actions comprises any of increasing a set value of the outlet water temperature of the chilled water, reducing the set value of the outlet water temperature of the chilled water, setting the number of different water chilling unit running units, increasing a set value of the rotating speed of a water pump, reducing the set value of the rotating speed of the water pump, setting the number of different water pumping unit running units, increasing a set value of the rotating speed of a fan of the cooling tower, reducing a set value of the rotating speed of a fan of the cooling tower, increasing a set value of the cooling water temperature, and reducing the set value of the cooling water temperature.
8. An air conditioning room control apparatus, comprising:
the selection module is used for receiving the running state parameters of the air conditioner room and selecting one group of actions in multiple preset groups of actions under the current running state parameters so as to execute the selected actions;
the computing module is used for acquiring the running state parameters after the action is executed, computing the action value after the action is executed, and constructing the corresponding relation among the current running state parameters, the executed action and the action value after the action is executed;
the judging module is used for judging whether the circulation stopping condition is met or not according to the action value after the action is executed or the executed action, and when the circulation stopping condition is not met, the running state parameter after the action is executed is used as the current running state parameter, and the step of selecting one group of actions in the preset groups of actions is executed until the circulation stopping condition is met;
and the determining module is used for determining the action with the maximum action value under the same operation state parameter from all the action values calculated after the action is executed based on the constructed corresponding relation among the operation state parameters, the executed action and the action value, and is used for controlling the operation of the air conditioner room, wherein each operation state parameter represents a preset value range.
9. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory are communicated with each other through the communication bus;
a memory for storing a computer program;
a processor for implementing the air conditioner room control method according to any one of claims 1 to 7 when executing the program stored in the memory.
10. A computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, implements the air conditioner room control method of any one of claims 1 to 7.
CN202210353204.8A 2022-04-06 2022-04-06 Air conditioner room control method and device, equipment and storage medium Pending CN114755919A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210353204.8A CN114755919A (en) 2022-04-06 2022-04-06 Air conditioner room control method and device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210353204.8A CN114755919A (en) 2022-04-06 2022-04-06 Air conditioner room control method and device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114755919A true CN114755919A (en) 2022-07-15

Family

ID=82329872

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210353204.8A Pending CN114755919A (en) 2022-04-06 2022-04-06 Air conditioner room control method and device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114755919A (en)

Similar Documents

Publication Publication Date Title
CN111765604B (en) Control method and device of air conditioner
CN112577161B (en) Air conditioner energy consumption model training method and air conditioner system control method
CN111536671A (en) Air conditioning system operation control method and device, electronic equipment and storage medium
US11243503B2 (en) Building management system with online configurable system identification
CN108844190B (en) Air conditioner self-learning optimal control system
CN113739360B (en) Energy-saving control method, device, equipment and medium based on multiple intelligent agents of cold station
CN111928429A (en) Energy-saving control method and device for data center refrigerating system
CN110631221A (en) Control method and device of air conditioner, terminal and storage medium
CN112283834A (en) Chilled water outlet temperature control method and device for water chilling unit
CN114282727A (en) Computer room temperature prediction method and system and electronic equipment
CN114755919A (en) Air conditioner room control method and device, equipment and storage medium
JPH06147598A (en) Air conditioning load prediction method
CN113791538A (en) Control method, control device and control system of machine room equipment
CN116017936A (en) Control method and device for air conditioner room, electronic equipment and storage medium
CN113449766A (en) Short-time air-conditioning load prediction method, device and equipment
CN115585541A (en) Control method and device of air conditioner room system, electronic equipment and storage medium
CN112781177A (en) Method and device for predicting air conditioner operation power, electronic equipment and storage medium
CN111023267A (en) Self-cleaning control method and device and multi-split air conditioner
CN109323367B (en) Air conditioner and defrosting method and device thereof
CN116826782A (en) Power grid frequency modulation method, device, equipment and storage medium based on temperature control load
KR102427604B1 (en) Method for controling an electronic device including ai model trained to predict future power generation
CN116261300A (en) Combined optimization method and device for refrigerating equipment and airflow organization of data center
CN112413862B (en) Method and device for determining energy consumption coefficient, storage medium and electronic device
CN113434034B (en) Large-scale cluster energy-saving method for adjusting CPU frequency of calculation task by utilizing deep learning
CN109614688B (en) Load management model optimization method, device, medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination