CN116661567A - Method, device and equipment for controlling temperature of server - Google Patents

Method, device and equipment for controlling temperature of server Download PDF

Info

Publication number
CN116661567A
CN116661567A CN202310526409.6A CN202310526409A CN116661567A CN 116661567 A CN116661567 A CN 116661567A CN 202310526409 A CN202310526409 A CN 202310526409A CN 116661567 A CN116661567 A CN 116661567A
Authority
CN
China
Prior art keywords
bmc
temperature
determining
control state
radiator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310526409.6A
Other languages
Chinese (zh)
Inventor
王魁英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Alibaba China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba China Co Ltd filed Critical Alibaba China Co Ltd
Priority to CN202310526409.6A priority Critical patent/CN116661567A/en
Publication of CN116661567A publication Critical patent/CN116661567A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/20Cooling means
    • G06F1/206Cooling means comprising thermal management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Control Of Temperature (AREA)
  • Power Sources (AREA)

Abstract

The application provides a temperature control method, a device and equipment of a server, wherein the server comprises a Baseboard Management Controller (BMC), a temperature sensor and a radiator, and the method comprises the following steps: determining a control state of the BMC on the radiator, wherein the control state is a controllable state or an out-of-control state; when the control state is a runaway state, acquiring the current temperature acquired by the temperature sensor; and adjusting the operation parameters of the radiator according to the current temperature, and switching the control state of the BMC into a controllable state. The temperature control reliability of the server is improved.

Description

Method, device and equipment for controlling temperature of server
Technical Field
The present application relates to the field of computers, and in particular, to a method, an apparatus, and a device for controlling server temperature.
Background
The server can comprise a plurality of components such as a processor, an image processor, a network card and the like, and the current temperatures of the components can be acquired through a plurality of temperature sensors.
In the related art, a plurality of current temperatures are generally obtained through a baseboard management controller (Baseboard Manager Controller, BMC), and an operation parameter of a radiator in a server is adjusted according to the plurality of current temperatures, so as to reduce an operation temperature of the server. However, in the above manner, the BMC may malfunction, and the operation parameters of the radiator may not be adjusted, resulting in low reliability of temperature control of the server.
Disclosure of Invention
The application provides a method, a device and equipment for controlling temperature of a server, which are used for improving the reliability of temperature control of the server.
In a first aspect, an embodiment of the present application provides a method for controlling temperature of a server, where the server includes a baseboard management controller BMC, a temperature sensor, and a radiator, and the method includes:
determining a control state of the BMC on the radiator, wherein the control state is a controllable state or an out-of-control state;
when the control state is the out-of-control state, acquiring the current temperature acquired by the temperature sensor;
and adjusting the operation parameters of the radiator according to the current temperature, and switching the control state of the BMC into the controllable state.
In one possible implementation, determining a control state of the BMC for the heatsink includes:
determining a first time length, wherein the first time length is a time difference between a moment when the current temperature is received from the BMC and a current moment;
and determining the control state according to the first time length.
In one possible embodiment, determining the control state according to the first time period includes:
if the first time length is greater than or equal to a preset time length, determining that the control state is the out-of-control state;
And if the first time period is smaller than the preset time period, determining the control state according to a plurality of historical temperatures received from the BMC in a latest time period, wherein the latest time period is a time period in a preset time period before the current time.
In one possible embodiment, determining the control state from a plurality of historical temperatures received from the BMC over a recent period of time includes:
if the historical temperatures are respectively greater than or equal to a temperature threshold value, determining that the control state is the out-of-control state;
and if the historical temperatures of the plurality of historical temperatures are smaller than the temperature threshold, determining the control state to be the controllable state.
In one possible embodiment, determining the control state from a plurality of historical temperatures received from the BMC over a recent period of time includes:
determining a temperature change trend according to the plurality of historical temperatures, wherein the temperature change trend is an increasing trend, a decreasing trend or an increasing and decreasing fluctuation trend;
if the temperature change trend is the incremental trend, determining that the control state is the out-of-control state;
and if the temperature change trend is the decreasing trend or the increasing and decreasing fluctuation trend, determining the control state as the controllable state.
In one possible embodiment, adjusting the operating parameters of the radiator according to the current temperature includes:
determining whether the current temperature is greater than or equal to the temperature threshold;
if yes, determining a target operation parameter, and adjusting the operation parameter of the radiator to the target operation parameter.
In one possible implementation, determining the target operating parameter includes:
determining a maximum operating parameter of the radiator;
and determining the maximum operation parameter to the target operation parameter.
In one possible implementation, determining the target operating parameter includes:
determining an operation parameter to be adjusted according to the current temperature and the current operation parameter;
determining a maximum operating parameter of the radiator;
if the operation parameter to be adjusted is smaller than or equal to the maximum operation parameter, determining the operation parameter to be adjusted as the target operation parameter;
and if the operation parameter to be adjusted is larger than the maximum operation parameter, determining the maximum operation parameter as the target operation parameter.
In one possible embodiment, the number of heat sinks is a plurality; adjusting the operating parameters of the radiator to the target operating parameters, including:
Determining a first radiator at which the temperature sensor is located from a plurality of radiators of the server;
and adjusting the operation parameters of the first radiator to the target operation parameters.
In one possible implementation, switching the control state of the BMC to the controllable state includes:
and sending a restarting instruction to the BMC, wherein the restarting instruction is used for controlling the BMC to restart so as to switch the control state of the BMC into the controllable state.
In one possible embodiment, the method further comprises:
and when the control state is a controllable state, the BMC adjusts the operation parameters of the radiator according to the current temperature.
In one possible embodiment, the radiator is a fan, and the operating parameter is a rotational speed of the fan.
In a possible embodiment, the server further comprises a control unit for performing the server temperature control method.
In a second aspect, an embodiment of the present application provides a server temperature control device, including: the device comprises a determining module, an acquiring module and an adjusting module, wherein,
the determining module is used for determining a control state of the BMC on the radiator, wherein the control state is a controllable state or an out-of-control state;
The acquisition module is used for acquiring the current temperature acquired by the temperature sensor when the control state is the out-of-control state;
the adjusting module is used for adjusting the operation parameters of the radiator according to the current temperature and switching the control state of the BMC into the controllable state.
In one possible implementation manner, the determining module is specifically configured to:
determining a first time length, wherein the first time length is a time difference between a moment when the current temperature is received from the BMC and a current moment;
and determining the control state according to the first time length.
In one possible implementation manner, the determining module is specifically configured to:
if the first time length is greater than or equal to a preset time length, determining that the control state is the out-of-control state;
and if the first time period is smaller than the preset time period, determining the control state according to a plurality of historical temperatures received from the BMC in a latest time period, wherein the latest time period is a time period in a preset time period before the current time.
In one possible implementation manner, the determining module is specifically configured to:
if the historical temperatures are respectively greater than or equal to a temperature threshold value, determining that the control state is the out-of-control state;
And if the historical temperatures of the plurality of historical temperatures are smaller than the temperature threshold, determining the control state to be the controllable state.
In one possible implementation manner, the determining module is specifically configured to:
determining a temperature change trend according to the plurality of historical temperatures, wherein the temperature change trend is an increasing trend, a decreasing trend or an increasing and decreasing fluctuation trend;
if the temperature change trend is the incremental trend, determining that the control state is the out-of-control state;
and if the temperature change trend is the decreasing trend or the increasing and decreasing fluctuation trend, determining the control state as the controllable state.
In one possible embodiment, the adjustment module is specifically configured to:
determining whether the current temperature is greater than or equal to the temperature threshold;
if yes, determining a target operation parameter, and adjusting the operation parameter of the radiator to the target operation parameter.
In one possible embodiment, the adjustment module is specifically configured to:
determining a maximum operating parameter of the radiator;
and determining the maximum operation parameter to the target operation parameter.
In one possible embodiment, the adjustment module is specifically configured to:
Determining an operation parameter to be adjusted according to the current temperature and the current operation parameter;
determining a maximum operating parameter of the radiator;
if the operation parameter to be adjusted is smaller than or equal to the maximum operation parameter, determining the operation parameter to be adjusted as the target operation parameter;
and if the operation parameter to be adjusted is larger than the maximum operation parameter, determining the maximum operation parameter as the target operation parameter.
In one possible embodiment, the number of heat sinks is a plurality; the adjusting module is specifically used for:
determining a first radiator at which the temperature sensor is located from a plurality of radiators of the server;
and adjusting the operation parameters of the first radiator to the target operation parameters.
In one possible embodiment, the adjustment module is specifically configured to:
and sending a restarting instruction to the BMC, wherein the restarting instruction is used for controlling the BMC to restart so as to switch the control state of the BMC into the controllable state.
In one possible embodiment, the method further comprises:
and when the control state is a controllable state, the BMC adjusts the operation parameters of the radiator according to the current temperature.
In one possible embodiment, the radiator is a fan, and the operating parameter is a rotational speed of the fan.
In a possible embodiment, the server further comprises a control unit for performing the server temperature control method.
In a third aspect, an embodiment of the present application provides a server, including: a baseboard management controller BMC, a control unit, a temperature sensor and a heat sink, wherein,
the temperature sensor is connected with the BMC through a bus, the BMC is connected with the control unit through a bus, and the BMC and the control unit are respectively connected with the radiator through a bus;
the control unit and the BMC are configured to perform the server temperature control method according to any one of the first aspect.
In a fourth aspect, embodiments of the present application provide a computer-readable storage medium having stored therein computer-executable instructions for implementing the server temperature control method of any one of the first aspects when the computer-executable instructions are executed by a processor.
In a fifth aspect, an embodiment of the present application provides a computer program product, comprising a computer program, which when executed by a processor implements the server temperature control method according to any one of the first aspects.
The embodiment of the application provides a method, a device and equipment for controlling the temperature of a server, wherein a control unit can determine the control state of a BMC on a radiator. When the control state is in a runaway state, the control unit can acquire the current temperature acquired by the temperature sensor, adjust the operation parameters of the radiator according to the current temperature, and switch the control state of the BMC into a controllable state. When the BMC fails, the control unit can adjust the operation parameters of the radiator according to the current temperature so as to reduce the operation temperature of the server, and the last defense line for preventing and controlling the over-temperature of the server is established through the control unit, so that the temperature control reliability of the server is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute a limitation on the application. In the drawings:
FIG. 1 is a schematic view of a scenario provided by an exemplary embodiment of the present application;
fig. 2 is a schematic flow chart of a server temperature control method according to an exemplary embodiment of the present application;
FIG. 3 is a schematic illustration of a recent period of time provided by an exemplary embodiment of the present application;
Fig. 4 is a schematic flow chart of another temperature control method of a server according to an exemplary embodiment of the present application;
FIG. 5A is a schematic diagram showing a trend of temperature variation according to an exemplary embodiment of the present application;
FIG. 5B is a second schematic diagram of a trend of temperature variation according to an exemplary embodiment of the present application;
FIG. 5C is a third schematic diagram of a trend of temperature variation according to an exemplary embodiment of the present application;
fig. 6 is a schematic process diagram of a server temperature control method according to an exemplary embodiment of the present application;
FIG. 7 is a schematic diagram of a temperature control device of a server according to an exemplary embodiment of the present application;
fig. 8 is a schematic structural diagram of a server according to an exemplary embodiment of the present application.
Detailed Description
It should be noted that, the user information (including but not limited to user equipment information, user personal information, etc.) and the data (including but not limited to data for analysis, stored data, presented data, etc.) related to the present application are information and data authorized by the user or fully authorized by each party, and the collection, use and processing of the related data need to comply with related laws and regulations and standards, and provide corresponding operation entries for the user to select authorization or rejection.
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be clearly and completely described below with reference to specific embodiments of the present application and corresponding drawings. It will be apparent that the described embodiments are only some, but not all, embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Fig. 1 is a schematic view of a scenario provided in an exemplary embodiment of the present application. Referring to fig. 1, the server includes a baseboard management controller BMC, a control unit, a temperature sensor, and a heat sink.
The temperature sensor may include an air outlet temperature sensor, a temperature sensor corresponding to a central processing unit (Central Processing Unit, CPU) (hereinafter, abbreviated as CPU temperature sensor), a temperature sensor corresponding to a graphic processor (Graphics Processing Unit, GPU) (hereinafter, abbreviated as GPU temperature sensor), a temperature sensor corresponding to a hard disk (hereinafter, abbreviated as hard disk temperature sensor), and the like.
The heat sink may be a fan in a server.
The temperature sensor may collect the current temperature at different locations in the server. For example, an air outlet temperature sensor may collect the current temperature of the server air outlet and a GPU temperature sensor may collect the current temperature at the GPU.
For any one of the temperature sensors, the temperature sensor may send the collected current temperature to the BMC, so that the BMC may obtain the current temperature and may send the current temperature to the control unit. For example, if the temperature sensor 1 is an air outlet temperature sensor, the temperature sensor may collect the temperature of the air outlet of the server. Assuming that the current temperature 1 is 42 degrees, the temperature sensor 1 may send the current temperature 1 to the BMC, and after the BMC acquires the current temperature 1, the current temperature 1 may be sent to the control unit, i.e. 42 degrees.
When the BMC fails, namely the control state of the BMC on the radiator is in a runaway state, the control unit can control the operation parameters of the radiator according to the current temperature. For example, if the current temperature 1 is 42 degrees, the control unit may control the operation parameters of the radiator according to the current temperature 1.
In the related art, a plurality of current temperatures are generally obtained through a BMC, and an operation parameter of a radiator in a server is adjusted according to the plurality of current temperatures, so as to reduce an operation temperature of the server. However, in the above manner, the BMC may malfunction, and the operation parameters of the radiator may not be adjusted, resulting in low reliability of temperature control of the server.
In the embodiment of the application, when the control state of the BMC on the radiator is in an out-of-control state, the control unit can acquire the current temperature acquired by the temperature sensor and adjust the operation parameters of the radiator according to the current temperature. When the BMC fails, the control unit can adjust the operation parameters of the radiator according to the current temperature so as to reduce the operation temperature of the server, so that the temperature control reliability of the server is improved.
The technical scheme shown in the application is described in detail by specific examples. It should be noted that the following embodiments may exist alone or in combination with each other, and for the same or similar content, the description will not be repeated in different embodiments.
Fig. 2 is a schematic flow chart of a server temperature control method according to an exemplary embodiment of the present application. Referring to fig. 2, the method may include:
s201, determining the control state of the BMC on the radiator.
The execution body of the embodiment of the application can be a server or a server temperature control device arranged in the server. The server temperature control device can be realized by software or a combination of software and hardware. Optionally, a control unit may be further included in the server. For ease of understanding, the following description will be given taking the execution body as an example of the control unit.
Alternatively, the control unit may be a programmable logic device (Programmable Logic Device, PLD) or a complex programmable logic device (Complex Programmable Logic Device, CPLD).
Alternatively, the BMC and the heatsink may be connected via an I2C bus. The BMC may control the heatsink through the I2C bus.
The control state of the BMC on the heatsink may include a controllable state or an out of control state. When the BMC normally operates, the BMC can control the radiator, and the control state of the BMC on the radiator is a controllable state; when the BMC fails, the BMC cannot control the radiator, and the control state of the BMC on the radiator is in an out-of-control state.
Alternatively, for any one of the temperature sensors, the temperature sensor may be connected to the BMC through an I2C bus.
In an alternative embodiment, the temperature sensor may send a plurality of current temperatures to the BMC according to a sending period, so that the BMC may obtain the plurality of current temperatures.
Alternatively, the transmission period may be 1 minute (min).
For example, if the transmission period is 1min, if the temperature sensor 1 is an air outlet temperature sensor, the temperature sensor 1 may collect the current temperature of the air outlet of the server, and if the current temperature 1 of the air outlet of the server is 42 degrees, the temperature sensor 1 may send the current temperature 1, that is, 42 degrees, to the BMC, so that the BMC obtains the current temperature 1.
Alternatively, the BMC and the control unit may be connected via an I2C bus. The BMC can send the current temperature to the control unit through the I2C bus according to the sending period so as to enable the control unit to acquire the current temperature. For example, if the current temperature obtained by the BMC is 42 degrees, the BMC may send the current temperature to the control unit, so that the control unit obtains the current temperature, that is, 42 degrees.
In an alternative embodiment, the control state of the BMC on the heatsink may be determined by: determining a first time length; the control state is determined based on the first time period.
The first duration may be a time difference between a time when the last control unit received the current temperature from the BMC and the current time.
For example, if the current time is 10:00, the last time the control unit received the current temperature from the BMC is 9:59, and the first time period is 1min.
Alternatively, determining the control state according to the first time length may include the following 2 cases:
case 1: the first time period is greater than or equal to a preset time period.
Alternatively, the preset duration may be manually preset. The preset duration may be a duration between the current time and the historical time. The preset duration may be greater than at least one transmission period.
For example, if the transmission period is 1min and the current time is 10:00, the preset duration may be between 9:50 and 10:00 for 10min.
In this case, since the first time period is greater than or equal to the preset time period, it is indicated that the control unit has not received the current temperature sent by the BMC within the preset time period, it may determine that the BMC is faulty, and it may determine that the control state of the BMC on the radiator is in an out-of-control state.
For example, if the current time is 10:00, the time when the last control unit receives the current temperature from the BMC is 9:50, the first time is 10min, if the preset time is 10min, the transmission period is 1min, and since the first time is equal to the preset time, it indicates that the control unit does not receive the current temperature sent by the BMC within 10min, the control unit can determine that the BMC is faulty, that is, it can determine that the control state of the BMC on the radiator is in a runaway state.
Case 2: the first time period is less than a preset time period.
In this case, it is explained that the control unit may receive a plurality of current temperatures from the BMC within a preset period of time, and the control unit may determine the control state according to a plurality of historical temperatures received from the BMC within a recent period of time.
The latest time period may be a time period within a preset time period before the current time.
Next, the latest period will be described with reference to fig. 3.
Fig. 3 is a schematic diagram of a recent period provided by an exemplary embodiment of the present application. Referring to fig. 3, the preset time length is a time length between a current time and a historical time, the first time length is a time difference between a time when the control unit last receives the current temperature and the current time, and the last time period is a time period within the preset time length before the current time, that is, a time length between a time when the control unit last receives the current temperature and the historical time. During this last period, the control unit has received a plurality of historical temperatures from the BMC.
For example, if the current time is 10:00, the historical time is 9:50, the preset time period is 10min between the historical time 9:50 and the current time 10:00, and the time when the control unit last receives the current temperature from the BMC is 9:59, the last time period may be a period between 9:50 and 9:59. During this last period, the control unit has received a plurality of historical temperatures from the BMC.
Optionally, the control unit may determine the control state of the BMC on the heatsink based on a plurality of historical temperatures received from the BMC during the recent period. For example, if the plurality of historical temperatures received by the control unit include the current temperature of the air outlet being 42 degrees, the control unit may determine that the control state of the BMC on the radiator is an out-of-control state according to the current temperature of the air outlet being 42 degrees.
S202, when the control state is an out-of-control state, acquiring the current temperature acquired by the temperature sensor.
Alternatively, since the BMC may receive a plurality of current temperatures sent by the temperature sensor, the BMC may save the plurality of current temperatures.
When the BMC fails while the BMC is out of control, the BMC may fail to different degrees, and the current temperature acquired by the temperature sensor may be acquired by the control unit in the following 2 cases:
case 1: the BMC may send a plurality of current temperatures to the control unit.
In this case, the control unit may receive a plurality of current temperatures transmitted by the BMC.
For example, if the BMC obtains 5 current temperatures, which are 50 degrees, 35 degrees, 40 degrees, 43 degrees, and 49 degrees, respectively, the BMC may send the 5 current temperatures to the control unit, so that the control unit may obtain the 5 current temperatures.
Case 2: the BMC cannot send multiple current temperatures to the control unit.
In this case, the control unit may acquire a plurality of current temperatures in the BMC.
For example, if the BMC obtains 5 current temperatures, which are 50 degrees, 35 degrees, 40 degrees, 43 degrees, and 49 degrees, respectively, the BMC may store the 5 current temperatures, and the control unit may obtain the 5 current temperatures in the BMC.
S203, adjusting the operation parameters of the radiator according to the current temperature, and switching the control state of the BMC to a controllable state.
Alternatively, the operating parameter of the radiator may be the rotational speed in revolutions per minute (r/min). For example, the speed of the heat sink may be 1500r/min.
Alternatively, the rotational speed of the radiator may be adjusted by pulse width modulation (Pulse width modulation, PWM) frequency. For example, if the PWM frequency is 150 hertz (Hz), and the rotational speed of the radiator is 1500r/min, the control unit may adjust the PWM frequency of the radiator to 150Hz to adjust the rotational speed of the radiator to 1500r/min.
In an alternative embodiment, it may be determined whether the current temperature is greater than or equal to a temperature threshold; if not, the current temperature is the normal temperature, and the operation parameters of the radiator do not need to be adjusted; if so, determining the target operation parameter and adjusting the operation parameter of the radiator to the target operation parameter.
Alternatively, a temperature threshold may be provided in the control unit. The temperature threshold may be manually preset. For example, the control unit may be provided with a temperature threshold, assuming that the temperature threshold is 40 degrees.
For example, if the temperature sensor 1 is an air outlet temperature sensor, and the temperature threshold corresponding to the air outlet temperature is 40 degrees, if the control unit obtains 5 current temperatures collected by the temperature sensor 1, which are 29 degrees, 35 degrees, 36 degrees, 37 degrees and 30 degrees respectively, and since the 5 current temperatures do not exceed the temperature threshold, the control unit can determine that the current temperature is a normal temperature, and the control unit does not need to adjust the operation parameters of the radiator located at the air outlet of the server; if the 5 current temperatures are 39 degrees, 41 degrees, 42 degrees, 39 degrees and 37 degrees, if some of the 5 current temperatures exceed the temperature threshold, the control unit can determine that an abnormal temperature exists, the control unit can determine a target operation parameter, and the operation parameter of the radiator at the air outlet of the server is adjusted to the target operation parameter. Assuming that the control unit determines that the target PWM frequency is 120Hz and the target operating parameter is 1200r/min, the control unit may adjust the PWM frequency of the radiator to 120Hz to adjust the target operating parameter of the radiator, i.e., the target rotational speed, to 1200r/min.
In an alternative embodiment, a restart instruction may be sent to the BMC to switch the control state of the BMC to a controllable state. The restart instruction may be used to control a BMC restart.
The control unit may send a restart instruction to the BMC to control the BMC to restart. The BMC can realize automatic fault repair by restarting, so that the BMC has self-recovery capability and can normally operate, and the control state of the BMC on the radiator can be switched from a runaway state to a controllable state.
Optionally, when the control state of the BMC on the radiator is a controllable state, the BMC may adjust the operation parameters of the radiator according to the current temperature.
In the embodiment of the application, the control unit can determine the control state of the BMC on the radiator. When the control state is in a runaway state, the control unit can acquire the current temperature acquired by the temperature sensor, adjust the operation parameters of the radiator according to the current temperature, and switch the control state of the BMC into a controllable state. When the BMC fails, the control unit can adjust the operation parameters of the radiator according to the current temperature so as to reduce the operation temperature of the server, and the last defense line for preventing and controlling the over-temperature of the server is established through the control unit, so that the temperature control reliability of the server is improved.
Next, the server temperature control method will be described in detail with reference to fig. 4 on the basis of the embodiment shown in fig. 2.
Fig. 4 is a flow chart of another temperature control method for a server according to an exemplary embodiment of the present application. Referring to fig. 4, the method may include:
s401, determining a first time length.
The first duration may be a time difference between a time when the last control unit received the current temperature from the BMC and the current time.
For example, if the current time is 10:00 and the last time the control unit received the current temperature from the BMC is 9:59, the first duration may be 1min.
For example, if the current time is 10:00 and the last time the control unit received the current temperature from the BMC is 9:50, the first duration may be 10 minutes.
S402, determining a control state according to the first time length.
Optionally, if the first time period is greater than or equal to the preset time period, the control unit may determine that the control state of the BMC on the radiator is an out-of-control state; if the first time length is less than the preset time length, the control unit may further determine a control state of the BMC on the radiator according to a plurality of historical temperatures received from the BMC in the latest time period.
For example, if the first time length is 11min, the preset time length is 10min, and if the transmission period is 1min, because the first time length is longer than the preset time length, which indicates that the control unit has not received a plurality of current temperatures sent by the BMC within the preset 10min, the control unit may determine that the control state of the BMC on the radiator is an out-of-control state.
For example, if the first time period is 1min, since the first time period is less than the preset time period, the control unit may determine the control state of the BMC on the radiator according to a plurality of historical temperatures received from the BMC in the latest time period. Assuming that the current time is 10:00, the last period may be a period between 9:50-9:59, then during this last period the control unit may receive multiple historical temperatures from the BMC.
In an alternative embodiment, determining the control state based on a plurality of historical temperatures received from the BMC over a recent period of time may include 2 cases:
case 1: the plurality of historical temperatures are each greater than or equal to a temperature threshold.
In this case, since the plurality of historical temperatures are obtained according to the transmission period, the plurality of historical temperatures are respectively greater than or equal to the temperature threshold, which indicates that the current temperature collected by the temperature sensor is always high, that is, the BMC does not adjust the first radiator where the temperature sensor is located, so as to perform cooling treatment, and then it can be determined that the control state of the BMC on the radiator is in an out-of-control state.
For example, if the air outlet temperature threshold corresponding to the air outlet temperature is 40 degrees, and the plurality of historical temperatures are the air outlet temperatures acquired by the air outlet temperature sensors and are 40 degrees, 41 degrees, 43 degrees, 42 degrees, 44 degrees, 45 degrees and 46 degrees respectively, the control unit can determine that the control state of the BMC on the radiator is in a runaway state because the plurality of historical temperatures are all greater than the air outlet temperature threshold of 40 degrees.
Case 2: the plurality of historical temperatures has a historical temperature less than a temperature threshold.
In this case, since the plurality of historical temperatures are obtained according to the transmission period, the historical temperatures of the plurality of historical temperatures are smaller than the temperature threshold, which indicates that the current temperature collected by the temperature sensor may be decreasing, that is, that the BMC may adjust the operation parameter of the first radiator where the temperature sensor is located, so as to perform cooling processing on the temperature sensor, it may be determined that the control state of the BMC on the radiator is a controllable state.
For example, if the temperature threshold corresponding to the air outlet temperature is 40 degrees, and the plurality of historical temperatures are the air outlet temperatures acquired by the air outlet temperature sensor and are 39 degrees, 40 degrees, 41 degrees, 39 degrees, 38 degrees, 39 degrees, 40 degrees, 39 degrees and 38 degrees respectively, then since the historical temperatures are less than the air outlet temperature threshold, it is indicated that the BMC may be already adjusting the operation parameters of the first radiator located at the air outlet of the server, the control unit may determine that the control state of the BMC on the radiator is a controllable state.
In another alternative embodiment, the control state may be determined from a plurality of historical temperatures received from the BMC over a recent period of time by: determining a temperature change trend according to the plurality of historical temperatures; and determining a control state according to the temperature change trend.
Alternatively, the temperature change trend may be an increasing trend, a decreasing trend, or an increasing and decreasing fluctuation trend.
Next, the temperature change tendency will be described with reference to fig. 5A to 5C.
Fig. 5A is a schematic diagram illustrating a trend of temperature change according to an exemplary embodiment of the present application. Referring to fig. 5A, for example, if the temperature sensor 1 is an air outlet temperature sensor, the plurality of historical temperatures collected by the temperature sensor 1 are 38 degrees, 39 degrees, 40 degrees, 41 degrees, and 42 degrees, respectively, and then the temperature change trend is determined to be an increasing trend according to the plurality of historical temperatures.
Fig. 5B is a schematic diagram ii of a temperature variation trend according to an exemplary embodiment of the present application. Referring to figure 5B of the drawings,
for example, if the temperature sensor 1 is an air outlet temperature sensor, and the plurality of historical temperatures collected by the temperature sensor 1 are 41 degrees, 40 degrees, 39 degrees, 38 degrees, and 37 degrees, respectively, the temperature change trend is determined to be a decreasing trend according to the plurality of historical temperatures.
Fig. 5C is a schematic diagram III of a trend of temperature variation according to an exemplary embodiment of the present application. Referring to figure 5C of the drawings,
for example, if the temperature sensor 1 is an outlet temperature sensor, and the plurality of historical temperatures collected by the temperature sensor 1 are 39 degrees, 40 degrees, 39 degrees, 38 degrees, 39 degrees, 40 degrees, and 39 degrees, respectively, the temperature change trend is determined to be an increasing/decreasing fluctuation trend according to the plurality of historical temperatures.
Alternatively, determining the control state according to the temperature variation trend may include the following 2 cases:
case 1: the trend of temperature change is an increasing trend.
In this case, since the plurality of historical temperatures are obtained according to the transmission period, the temperature change trend of the plurality of historical temperatures is an increasing trend, which indicates that the current temperature of the temperature sensor is increasing, that is, that the BMC does not adjust the first radiator where the temperature sensor is located according to the plurality of historical temperatures, so as to perform cooling processing, it can be determined that the control state of the BMC on the radiator is in an out-of-control state.
For example, if the temperature threshold corresponding to the air outlet temperature is 40 degrees, and the plurality of historical temperatures are the air outlet temperatures acquired by the air outlet temperature sensor and are 38 degrees, 39 degrees, 40 degrees, 41 degrees and 42 degrees respectively, the temperature change trend of the plurality of historical temperatures can be determined to be an increasing trend, and the control unit can determine that the control state of the BMC on the radiator is a runaway state.
Case 2: the temperature change trend is a decreasing trend or an increasing and decreasing fluctuation trend.
In this case, since the plurality of historical temperatures are obtained according to the transmission period, the temperature change trend of the plurality of historical temperatures is a decreasing trend or an increasing and decreasing trend, which indicates that the current temperature of the temperature sensor may be decreasing, that is, that the BMC may be adjusting the first radiator where the temperature sensor is located, so as to perform cooling processing on the temperature sensor, it may be determined that the control state of the BMC on the radiator is a controllable state.
For example, if the temperature threshold corresponding to the air outlet temperature is 40 degrees, and the plurality of historical temperatures are the air outlet temperatures acquired by the air outlet temperature sensor, and are 41 degrees, 40 degrees, 39 degrees, 38 degrees and 37 degrees respectively, the temperature change trend of the plurality of historical temperatures can be determined to be a decreasing trend, and the control unit can determine that the control state of the BMC on the radiator is a controllable state.
S403, when the control state is an out-of-control state, acquiring the current temperature acquired by the temperature sensor.
It should be noted that, the specific execution process of step S403 may refer to the specific execution process of step S202, and will not be described herein.
S404, determining whether the current temperature is greater than or equal to a temperature threshold.
Because the server can be provided with a plurality of temperature sensors, a plurality of temperature thresholds can be arranged in the control unit, different temperature sensors can acquire temperatures of different positions, and the temperatures of different positions can correspond to different temperature thresholds.
Alternatively, for any one of the current temperatures, the control unit may determine a temperature threshold corresponding to the current temperature.
The control unit may determine whether the current temperature is greater than or equal to a temperature threshold. If not, the current temperature is the normal temperature, and the operation parameters of the radiator do not need to be adjusted; if so, it is indicated that the current temperature is an abnormal temperature, S405 may be executed.
For example, if there are 3 temperature sensors, namely, an air outlet temperature sensor, a GPU temperature sensor, and a hard disk temperature sensor, the current temperatures and temperature thresholds of the 3 components may be as shown in table 1:
TABLE 1
Temperature sensor Current temperature Temperature threshold
Air outlet temperature sensor 41 degrees 40 degrees
GPU temperature sensor 45 degrees 50 degrees
Hard disk temperature sensor 40 degrees 45 degrees
Then, since the current temperature 41 degrees acquired by the air outlet temperature sensor is greater than the temperature threshold value 40 degrees, it is indicated that the current temperature is an abnormal temperature, and S405 may be executed.
S405, determining target operation parameters.
Alternatively, determining the target operating parameter may include 3 ways:
mode 1: a maximum operating parameter of the radiator may be determined and the maximum operating parameter may be determined as the target operating parameter.
Alternatively, the operating parameter may be a rotational speed, and the maximum operating parameter may be a maximum value of the rotational speed of the radiator. Assume that the maximum operating parameter is 3000r/min.
For example, if the maximum operating parameter is 3000r/min, the control unit may determine the maximum operating parameter of 3000r/min as the target operating parameter.
Mode 2: the maximum operating parameter of the radiator may be determined, the maximum safe operating parameter may be determined based on the maximum operating parameter, and the target operating parameter may be determined based on the maximum safe operating parameter.
Alternatively, the maximum safe operating parameter may be manually preset. The maximum safe operating parameter may be 80% of the maximum operating parameter.
For example, if the maximum operating parameter is 3000r/min, the maximum safe operating parameter may be 80% of the maximum operating parameter, and the maximum safe parameter may be 2400r/min.
Mode 3: according to the current temperature and the current operation parameters, the operation parameters to be adjusted are determined, the maximum operation parameters of the radiator are determined, and then the target operation parameters can be determined according to the operation parameters to be adjusted and the maximum operation parameters.
The current operating parameter may be a current rotational speed of the radiator. Assume that the current operating parameter of the radiator is 1500r/min.
Alternatively, the control unit may determine the operating parameter to be adjusted based on the current temperature and the current operating parameter. For example, if the current operation parameter of the radiator is 1500r/min, the current temperature is 42 degrees, and the current temperature exceeds the temperature threshold by 2 degrees, it is assumed that the control unit is preset with an adjustment rule as follows: every time the current temperature exceeds 1 degree, the current operation parameter of the fan is increased by 100r/min, and the control unit can determine that the operation parameter to be adjusted is 1700r/min according to the current temperature of 42 degrees and the current operation parameter of 1500r/min.
Alternatively, the control unit may determine a maximum operating parameter of the radiator. For example, the control unit may determine that the maximum operating parameter of the radiator is 3000r/min.
Optionally, determining the target operating parameter according to the operating parameter to be adjusted and the maximum operating parameter may include the following 2 cases:
case 1: the operating parameter to be adjusted is less than or equal to the maximum operating parameter.
In this case, the operation parameter to be adjusted may be determined as the target operation parameter.
For example, if the control unit determines that the operation parameter to be adjusted is 1700r/min and the maximum operation parameter is 3000r/min, since the operation parameter to be adjusted is less than or equal to the maximum operation parameter, the operation parameter to be adjusted 1700r/min may be determined as the target operation parameter.
Case 2: the operating parameter to be adjusted is greater than the maximum operating parameter.
In this case, the maximum operating parameter may be determined as the target operating parameter.
For example, if the control unit determines that the operation parameter to be adjusted is 3500r/min and the maximum operation parameter is 3000r/min, since the operation parameter to be adjusted is greater than the maximum operation parameter, the maximum operation parameter 3000r/min may be determined as the target operation parameter.
S406, adjusting the operation parameters of the radiator to target operation parameters.
Alternatively, the control unit and the heat sink may be connected via an I2C bus. The control unit may adjust the operating parameters of the radiator to target operating parameters through the I2C.
Optionally, since the number of heat sinks in the server is plural; adjusting the operating parameters of the radiator to the target operating parameters may include 2 ways:
mode 1: and determining a first radiator at which the temperature sensor is positioned in a plurality of radiators of the server, and adjusting the operation parameters of the first radiator to target operation parameters.
For example, if the temperature sensor 1 is an air outlet temperature sensor, the air outlet is provided with a first radiator which is the radiator 1, the current operation parameter of the radiator 1 is 1500r/min, and the target operation parameter is 2400r/min, the control unit may adjust the operation parameter 1500r/min of the radiator 1 to the target operation parameter 2400r/min through I2C.
In mode 1, the operation parameter of the first radiator at the position where the temperature sensor is located is adjusted to be the target operation parameter, so that the radiator can be accurately adjusted according to the current temperature, and accurate cooling for the temperature sensor can be realized.
Mode 2: and adjusting the operation parameters of a plurality of radiators of the server to target operation parameters.
For example, if there are 8 radiators in the server, and the target operation parameter is 2400r/min, the control unit may adjust the operation parameters of the 8 radiators to the target operation parameter 2400r/min through I2C.
In mode 2, the operating parameters of the plurality of radiators of the server are adjusted to the target operating parameters, so that the temperature of the whole server can be quickly reduced.
S407, switching the control state of the BMC to a controllable state.
In an alternative embodiment, a restart instruction may be sent to the BMC to switch the control state of the BMC to a controllable state. The restart instruction may be used to control a BMC restart.
The control unit may send a restart instruction to the BMC to control the BMC to restart. The BMC can realize automatic fault repair through restarting, so that the BMC has self-recovery capability and can normally operate, and the control state of the BMC on the radiator can be switched from a runaway state to a controllable state.
And S408, when the control state is a controllable state, the BMC adjusts the operation parameters of the radiator according to the current temperature.
Optionally, a smart algorithm may be provided in the BMC. When the control state of the BMC on the radiator is a controllable state, the BMC can adjust the operation parameters of the radiator according to an intelligent algorithm and the current temperature, so that the radiator can be accurately adjusted.
Optionally, the BMC may increase the operating parameters of the radiator according to the current temperature to reduce the temperature; the operation parameters of the radiator can be reduced according to the current temperature so as to realize energy saving.
For example, if the temperature sensor 1 is an air outlet temperature sensor, if the current temperature of the air outlet obtained by the BMC is 39 degrees, the BMC may increase the operation parameters of the radiator located at the air outlet according to the intelligent algorithm and the current temperature of the air outlet by 39 degrees, so as to reduce the temperature of the air outlet; if the current temperature of the air outlet obtained by the BMC is 20 degrees, the BMC can adjust the operation parameters of the radiator at the air outlet to be smaller according to the intelligent algorithm and the current temperature of the air outlet by 20 degrees so as to realize energy saving.
In the embodiment of the application, the control unit may determine the first time length and determine the control state according to the first time length. When the control state is a runaway state, the control unit may acquire the current temperature acquired by the temperature sensor, and may determine whether the current temperature is greater than or equal to a temperature threshold. If yes, the control unit may determine the target operating parameter and adjust the operating parameter of the radiator to the target operating parameter. The control unit may switch the control state of the BMC to a controllable state. When the control state is a controllable state, the BMC can adjust the operation parameters of the radiator according to the current temperature. When the BMC fails, the control unit can adjust the operation parameters of the radiator according to the current temperature so as to reduce the operation temperature of the server, and the last defense line for preventing and controlling the over-temperature of the server is established through the control unit, so that the temperature control reliability of the server is improved.
Next, the server temperature control method is described in further detail by way of a specific example with reference to fig. 6 on the basis of any of the above embodiments.
Fig. 6 is a schematic process diagram of a server temperature control method according to an exemplary embodiment of the present application. Referring to fig. 6, a server may include a plurality of temperature sensors, a BMC, a control unit, and a plurality of heat sinks. The control unit may be a programmable logic device; the plurality of temperature sensors may be an air outlet temperature sensor, a GPU temperature sensor, … …, a hard disk temperature sensor, respectively.
For any one of the temperature sensors, the temperature sensor may acquire the temperature of the corresponding component. For example, the air outlet temperature sensor may collect the temperature of the air outlet, the GPU temperature sensor may collect the temperature of the GPU, … …, the hard disk temperature sensor may collect the temperature of the hard disk.
The plurality of heat sinks may be the heat sink 1, the heat sinks 2, … …, and the heat sinks n, n may be integers greater than or equal to 1, respectively. The plurality of heat sinks may be located at different locations in the server. For example, the heat sink 1 may be located at an air outlet, the heat sink 2 may be located near the GPU, … …, and the heat sink n may be located near the hard disk.
In step (1), the temperature sensor may acquire a plurality of current temperatures. For example, the air outlet temperature sensor can collect the temperature of the air outlet to obtain a plurality of air outlet temperatures; the GPU temperature sensor can acquire the temperatures of the GPUs so as to acquire a plurality of GPU temperatures; … …; the hard disk temperature sensor can collect the temperature of the hard disk to obtain a plurality of hard disk temperatures.
In step (2), each temperature sensor may send the collected multiple current temperatures to the BMC according to the sending period. For example, if the transmission period is 1min, the air outlet temperature sensor may send the air outlet temperatures to the BMC every 1min to send a plurality of air outlet temperatures; the GPU temperature sensor may send GPU temperatures to the BMC every 1min to send a plurality of GPU temperatures; … …; the hard disk temperature sensor may send a plurality of hard disk temperatures to the BMC every 1min to send the plurality of hard disk temperatures.
In step (3), the BMC may transmit a plurality of current temperatures to the programmable logic device according to the transmission period.
In step (4), the programming logic device may determine a control state of the BMC for the radiator, and when the control state is a runaway state, may determine a target operating parameter according to the current temperature, and adjust the operating parameter of the radiator according to the target operating parameter.
Specifically, the programmable logic device may determine a first time period between a time when the current temperature was last received from the BMC and the current time. If the first time length is greater than or equal to the preset time length, the programmable logic device can determine that the control state is an out-of-control state; if the first time period is less than the preset time period, the programmable logic device may determine the control state according to a plurality of historical temperatures received from the BMC during the last time period. For example, if the first time period is 10min and the preset time period is 10min, the programmable logic device may determine the control state according to a plurality of historical temperatures received from the BMC in the latest time period because the first time period is less than the preset time period.
Alternatively, the programmable logic device may be preset with a plurality of temperature thresholds. For example, the programmable logic device may be preset with an air outlet temperature threshold of 40 degrees, a GPU temperature threshold of 50 degrees, … …, and a hard disk temperature threshold of 45 degrees.
Alternatively, the programmable logic device may determine a plurality of historical temperatures collected by the same temperature sensor among a plurality of historical temperatures.
For any one temperature sensor, if the historical temperatures acquired by the temperature sensor are respectively greater than or equal to a temperature threshold, the programmable logic device can determine that the control state of the BMC on the radiator is in an out-of-control state; or if the temperature change trend of the plurality of historical temperatures is an increasing trend, the programmable logic device can determine that the control state of the BMC on the radiator is in a runaway state.
For example, if the temperature sensor 1 is an air outlet temperature sensor, if the programmable logic device determines that the plurality of historical temperatures collected by the temperature sensor 1 are 40 degrees, 41 degrees, 42 degrees, 43 degrees, 44 degrees, 45 degrees, and 46 degrees, respectively, the programmable logic device may determine that the control state of the BMC on the radiator is an out-of-control state because the plurality of historical temperatures are all greater than or equal to the air outlet temperature threshold value of 40 degrees; or if the programmable logic device determines that the temperature change trend of the plurality of historical temperatures corresponding to the tuyere temperature is an increasing trend, the programmable logic device can determine that the control state of the BMC on the radiator is an out-of-control state.
When the BMC is determined to be in an out-of-control state with respect to the radiator, the programmable logic device may obtain the current temperature and determine whether the current temperature is greater than or equal to a temperature threshold. If not, the current temperature is the normal temperature, and the operation parameters of the radiator do not need to be adjusted; if yes, the current temperature is the abnormal temperature, the programmable logic device determines the target operation parameter, and the operation parameter of the radiator is adjusted according to the target operation parameter.
Alternatively, the programmable logic device may determine a maximum operating parameter, or a maximum safe operating parameter, as the target operating parameter; the operation parameters to be adjusted can be determined according to the current temperature and the current operation parameters, the maximum operation parameters of the radiator can be determined, and then the target operation parameters can be determined according to the operation parameters to be adjusted and the maximum operation parameters.
Optionally, the programmable logic device may determine a first radiator where the temperature sensor is located, and adjust an operation parameter of the first radiator to a target operation parameter; alternatively, the programmable logic device may adjust the operating parameters of all the heat sinks to target operating parameters.
For example, if the temperature sensor 1 is an air outlet temperature sensor, the first radiator is set at the air outlet of the server as the radiator 1, and if the programmable logic device determines that the target operation parameter is 2400r/min, the programmable logic device may adjust the rotation speed of the radiator 1 to 2400r/min; alternatively, the programming logic device may adjust the rotational speeds of the radiator 1, the radiator 2, … …, and the radiator n to 2400r/min.
In step (5), the programmable logic device may send a restart instruction to the BMC to control the BMC to restart. The BMC can realize automatic fault repair through restarting, so that the BMC has self-recovery capability and can normally operate, and the control state of the BMC on the radiator can be switched from a runaway state to a controllable state.
In step (6), when the control state of the BMC on the radiator is a controllable state, the BMC can adjust the operation parameters of the radiator according to the intelligent algorithm and the current temperature.
Optionally, the BMC may increase the operating parameters of the radiator according to the current temperature to reduce the temperature; the operation parameters of the radiator can be reduced according to the current temperature so as to realize energy saving.
According to the technical scheme, the final defense line for preventing and controlling the over-temperature of the server can be established through the programmable logic device without modifying the existing hardware design, and the reliability of temperature control of the server is improved.
In the embodiment of the application, the temperature sensor can acquire a plurality of current temperatures and send the plurality of current temperatures to the BMC. After the BMC receives the plurality of current temperatures, the plurality of current temperatures may be transmitted to the programmable logic device according to the transmission period. The programmable logic device can determine the control state of the BMC on the radiator, and when the control state is in a runaway state, the programmable logic device can determine a target operation parameter according to the current temperature and adjust the operation parameter of the radiator according to the target operation parameter. The programmable logic device may also switch the control state of the BMC to a controllable state. When the control state is a controllable state, the BMC can adjust the operation parameters of the radiator according to the current temperature. When the BMC fails, the programmable logic device can adjust the operation parameters of the radiator according to the current temperature so as to reduce the operation temperature of the server, and the last defense line for preventing and controlling the over-temperature of the server is established through the programmable logic device, so that the temperature control reliability of the server is improved.
Fig. 7 is a schematic structural diagram of a temperature control device of a server according to an exemplary embodiment of the present application. Referring to fig. 7, the apparatus includes: a determination module 11, an acquisition module 12 and an adjustment module 13, wherein,
the determining module 11 is configured to determine a control state of the BMC on the radiator, where the control state is a controllable state or an uncontrolled state;
the obtaining module 12 is configured to obtain a current temperature collected by the temperature sensor when the control state is the out-of-control state;
the adjusting module 13 is configured to adjust an operation parameter of the radiator according to the current temperature, and switch a control state of the BMC to the controllable state.
The server temperature control device provided by the embodiment of the application can execute the technical scheme shown in the embodiment of the method, and the implementation principle and the beneficial effects are similar, and are not repeated here.
In a possible embodiment, the determining module 11 is specifically configured to:
determining a first time length, wherein the first time length is a time difference between a moment when the current temperature is received from the BMC and a current moment;
and determining the control state according to the first time length.
In a possible embodiment, the determining module 11 is specifically configured to:
if the first time length is greater than or equal to a preset time length, determining that the control state is the out-of-control state;
and if the first time period is smaller than the preset time period, determining the control state according to a plurality of historical temperatures received from the BMC in a latest time period, wherein the latest time period is a time period in a preset time period before the current time.
In a possible embodiment, the determining module 11 is specifically configured to:
if the historical temperatures are respectively greater than or equal to a temperature threshold value, determining that the control state is the out-of-control state;
and if the historical temperatures of the plurality of historical temperatures are smaller than the temperature threshold, determining the control state to be the controllable state.
In a possible embodiment, the determining module 11 is specifically configured to:
determining a temperature change trend according to the plurality of historical temperatures, wherein the temperature change trend is an increasing trend, a decreasing trend or an increasing and decreasing fluctuation trend;
if the temperature change trend is the incremental trend, determining that the control state is the out-of-control state;
and if the temperature change trend is the decreasing trend or the increasing and decreasing fluctuation trend, determining the control state as the controllable state.
In one possible embodiment, the adjustment module 13 is specifically configured to:
determining whether the current temperature is greater than or equal to the temperature threshold;
if yes, determining a target operation parameter, and adjusting the operation parameter of the radiator to the target operation parameter.
In one possible embodiment, the adjustment module 13 is specifically configured to:
determining a maximum operating parameter of the radiator;
and determining the maximum operation parameter to the target operation parameter.
In one possible embodiment, the adjustment module 13 is specifically configured to:
determining an operation parameter to be adjusted according to the current temperature and the current operation parameter;
determining a maximum operating parameter of the radiator;
if the operation parameter to be adjusted is smaller than or equal to the maximum operation parameter, determining the operation parameter to be adjusted as the target operation parameter;
and if the operation parameter to be adjusted is larger than the maximum operation parameter, determining the maximum operation parameter as the target operation parameter.
In one possible embodiment, the number of heat sinks is a plurality; the adjusting module 13 is specifically configured to:
determining a first radiator at which the temperature sensor is located from a plurality of radiators of the server;
And adjusting the operation parameters of the first radiator to the target operation parameters.
In one possible embodiment, the adjustment module 13 is specifically configured to:
and sending a restarting instruction to the BMC, wherein the restarting instruction is used for controlling the BMC to restart so as to switch the control state of the BMC into the controllable state.
In one possible embodiment, when the control state is a controllable state, the BMC adjusts an operation parameter of the radiator according to the current temperature.
In one possible embodiment, the radiator is a fan, and the operating parameter is a rotational speed of the fan.
The server temperature control device shown in the embodiment of fig. 7 may be a control unit in the server.
The server temperature control device provided by the embodiment of the application can execute the technical scheme shown in the embodiment of the method, and the implementation principle and the beneficial effects are similar, and are not repeated here.
Fig. 8 is a schematic structural diagram of a server according to an exemplary embodiment of the present application. Referring to fig. 8, the server may include a BMC, a control unit, a plurality of temperature sensors, and a plurality of heat sinks.
The plurality of temperature sensors and the BMC can be connected through an I2C bus. For example, if the plurality of temperature sensors are the air outlet temperature sensor, the GPU temperature sensor, … …, and the hard disk temperature sensor, the air outlet temperature sensor, the GPU temperature sensor, … …, and the hard disk temperature sensor may be connected to the BMC through the I2C bus, respectively.
The BMC and the control unit can be connected through an I2C bus.
The plurality of heat sinks may be the heat sink 1, the heat sinks 2, … …, and the heat sink n, respectively. The BMC and the control unit may be connected to the plurality of heat sinks via an I2C bus, respectively.
The BMC and the control unit may perform the server temperature control method as shown in any of the method embodiments described above.
Accordingly, an embodiment of the present application provides a computer readable storage medium, where computer executable instructions are stored, and when the computer executable instructions are executed by a processor, the computer readable storage medium is configured to implement the server temperature control method according to any one of the above method embodiments.
Accordingly, embodiments of the present application may also provide a computer program product, including a computer program, which when executed by a processor may implement a server temperature control method as shown in any of the above method embodiments.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In one typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, random Access Memory (RAM) and/or nonvolatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of computer-readable media.
Computer readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of storage media for a computer include, but are not limited to, phase change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium, which can be used to store information that can be accessed by a computing device. Computer-readable media, as defined herein, does not include transitory computer-readable media (transmission media), such as modulated data signals and carrier waves.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises the element.
The foregoing is merely exemplary of the present application and is not intended to limit the present application. Various modifications and variations of the present application will be apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. which come within the spirit and principles of the application are to be included in the scope of the claims of the present application.

Claims (14)

1. A server temperature control method, applied to a server, wherein the server comprises a baseboard management controller BMC, a temperature sensor and a radiator, the method comprises:
Determining a control state of the BMC on the radiator, wherein the control state is a controllable state or an out-of-control state;
when the control state is the out-of-control state, acquiring the current temperature acquired by the temperature sensor;
and adjusting the operation parameters of the radiator according to the current temperature, and switching the control state of the BMC into the controllable state.
2. The method of claim 1, wherein determining the control state of the BMC for the heatsink comprises:
determining a first time length, wherein the first time length is a time difference between a moment when the current temperature is received from the BMC and a current moment;
and determining the control state according to the first time length.
3. The method of claim 2, wherein determining the control state based on the first time period comprises:
if the first time length is greater than or equal to a preset time length, determining that the control state is the out-of-control state;
and if the first time period is smaller than the preset time period, determining the control state according to a plurality of historical temperatures received from the BMC in a latest time period, wherein the latest time period is a time period in a preset time period before the current time.
4. The method of claim 3, wherein determining the control state based on a plurality of historical temperatures received from the BMC over a recent period of time comprises:
if the historical temperatures are respectively greater than or equal to a temperature threshold value, determining that the control state is the out-of-control state;
and if the historical temperatures of the plurality of historical temperatures are smaller than the temperature threshold, determining the control state to be the controllable state.
5. The method of claim 3, wherein determining the control state based on a plurality of historical temperatures received from the BMC over a recent period of time comprises:
determining a temperature change trend according to the plurality of historical temperatures, wherein the temperature change trend is an increasing trend, a decreasing trend or an increasing and decreasing fluctuation trend;
if the temperature change trend is the incremental trend, determining that the control state is the out-of-control state;
and if the temperature change trend is the decreasing trend or the increasing and decreasing fluctuation trend, determining the control state as the controllable state.
6. The method of any of claims 1-5, wherein adjusting the operating parameters of the heat sink based on the current temperature comprises:
Determining whether the current temperature is greater than or equal to the temperature threshold;
if yes, determining a target operation parameter, and adjusting the operation parameter of the radiator to the target operation parameter.
7. The method of claim 6, wherein determining the target operating parameter comprises:
determining a maximum operating parameter of the radiator;
and determining the maximum operation parameter to the target operation parameter.
8. The method of claim 6, wherein determining the target operating parameter comprises:
determining an operation parameter to be adjusted according to the current temperature and the current operation parameter;
determining a maximum operating parameter of the radiator;
if the operation parameter to be adjusted is smaller than or equal to the maximum operation parameter, determining the operation parameter to be adjusted as the target operation parameter;
and if the operation parameter to be adjusted is larger than the maximum operation parameter, determining the maximum operation parameter as the target operation parameter.
9. The method of any one of claims 1-8, wherein the number of heat sinks is a plurality; adjusting the operating parameters of the radiator to the target operating parameters, including:
Determining a first radiator at which the temperature sensor is located from a plurality of radiators of the server;
and adjusting the operation parameters of the first radiator to the target operation parameters.
10. The method according to any of claims 1-9, wherein switching the control state of the BMC to the controllable state comprises:
and sending a restarting instruction to the BMC, wherein the restarting instruction is used for controlling the BMC to restart so as to switch the control state of the BMC into the controllable state.
11. The method according to any one of claims 1-10, further comprising:
and when the control state is a controllable state, the BMC adjusts the operation parameters of the radiator according to the current temperature.
12. The method according to any one of claims 1-11, wherein the server further comprises a control unit for performing the method according to any one of claims 1-11.
13. A server, comprising: a baseboard management controller BMC, a control unit, a temperature sensor and a heat sink, wherein,
the temperature sensor is connected with the BMC through a bus, the BMC is connected with the control unit through a bus, and the BMC and the control unit are respectively connected with the radiator through a bus;
The control unit and the BMC are adapted to perform the method of any of the preceding claims 1-12.
14. A computer readable storage medium having stored therein computer executable instructions which when executed by a processor implement the method of any of claims 1-12.
CN202310526409.6A 2023-05-10 2023-05-10 Method, device and equipment for controlling temperature of server Pending CN116661567A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310526409.6A CN116661567A (en) 2023-05-10 2023-05-10 Method, device and equipment for controlling temperature of server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310526409.6A CN116661567A (en) 2023-05-10 2023-05-10 Method, device and equipment for controlling temperature of server

Publications (1)

Publication Number Publication Date
CN116661567A true CN116661567A (en) 2023-08-29

Family

ID=87712767

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310526409.6A Pending CN116661567A (en) 2023-05-10 2023-05-10 Method, device and equipment for controlling temperature of server

Country Status (1)

Country Link
CN (1) CN116661567A (en)

Similar Documents

Publication Publication Date Title
RU2695546C2 (en) Method and device for controlling fan rotation speed in electronic device
CN110799757B (en) Fan control method, fan control device and electronic equipment
CN113826082B (en) Method and equipment for controlling heat dissipation device
CN107608865B (en) Data storage method and device
CN105371420A (en) Refrigeration control method, device and system
CN110595015B (en) Adjusting method and device of frequency conversion equipment
CN111158447A (en) Server heat dissipation parameter regulation and control method, device, equipment and readable storage medium
CN111255554B (en) Engine fan rotation control method and apparatus
EP3477214A1 (en) Active control method and device for inverter air conditioner
CN112987893B (en) Fan speed regulation method and device, storage medium and electronic equipment
CN114110919B (en) External fan control method, device, computer readable storage medium and air conditioner
CN113568805B (en) Real-time adjustment method and device for power consumption of server and electronic equipment
CN116661567A (en) Method, device and equipment for controlling temperature of server
CN111361458B (en) Power calculation method and device and electronic equipment
CN117570049A (en) Fan control method and device, baseboard management controller and server
CN110736242B (en) Control method and device of air conditioner, storage medium and processor
CN116914325A (en) Temperature control method, electronic device, and computer-readable storage medium
CN116954334A (en) Heat dissipation control method, device, server, computer equipment and storage medium
CN115013346B (en) Fan speed regulation method, device and medium based on device
CN111412633A (en) Air conditioner control method and device, storage medium and air conditioner
CN114564093B (en) Equipment heat dissipation method and device, electronic equipment and storage medium
CN106567846B (en) A kind of blower regulation method and apparatus of Advanced telecom computing architecture ATCA subrack
CN116826103B (en) Control method, control device and storage medium for heat dissipation system of fuel cell
CN111324156A (en) Passive device overheating protection method and device and power electronic equipment
CN116610477A (en) Method, device, equipment and storage medium for adjusting power consumption of graphic processor

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination