US11385985B2 - Server power consumption management method and device - Google Patents

Server power consumption management method and device Download PDF

Info

Publication number
US11385985B2
US11385985B2 US16/818,765 US202016818765A US11385985B2 US 11385985 B2 US11385985 B2 US 11385985B2 US 202016818765 A US202016818765 A US 202016818765A US 11385985 B2 US11385985 B2 US 11385985B2
Authority
US
United States
Prior art keywords
power consumption
server
value
power
node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US16/818,765
Other versions
US20200210304A1 (en
Inventor
Jiangtao Wang
Zhibing Li
Lang Tao
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of US20200210304A1 publication Critical patent/US20200210304A1/en
Assigned to HUAWEI TECHNOLOGIES CO., LTD. reassignment HUAWEI TECHNOLOGIES CO., LTD. EMPLOYMENT AGREEMENT Assignors: TAO, LANG, LI, ZHIBING
Assigned to HUAWEI TECHNOLOGIES CO., LTD. reassignment HUAWEI TECHNOLOGIES CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WANG, JIANGTAO
Application granted granted Critical
Publication of US11385985B2 publication Critical patent/US11385985B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/28Supervision thereof, e.g. detecting power-supply failure by out of limits supervision
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/30Means for acting in the event of power-supply failure or interruption, e.g. power-supply fluctuations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3206Monitoring of events, devices or parameters that trigger a change in power modality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3234Power saving characterised by the action undertaken
    • G06F1/3237Power saving characterised by the action undertaken by disabling clock generation or distribution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3234Power saving characterised by the action undertaken
    • G06F1/3287Power saving characterised by the action undertaken by switching off individual functional units in the computer system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • G06F11/3062Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations where the monitored property is the power consumption
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • Embodiments of the present application relates to the field of information technologies, and in particular, to a server power consumption management method and a device.
  • a server is usually equipped with a power supply including a plurality of power modules.
  • a power consumption capping technology can ensure that power consumption of the server is maintained in a stable level when the server is running, to improve power utilization. Users set a capping value of power consumption of the entire server, and the power consumption of the entire server is periodically checked when the server is running. If the power consumption reaches the capping value, measures, such as reducing a frequency of a central processing unit (CPU) of the server, are used to limit the power consumption of the server within an error range of 5% of target power consumption.
  • CPU central processing unit
  • an embodiment provides a server power consumption management method.
  • a power supply supplies power to a server, the power supply includes a power module, and a power consumption management device communicates with the power supply and the server.
  • the method includes: receiving, by the power consumption management device, fault information of the power module, and reducing first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server; and adjusting, by the power consumption management device, the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
  • a specific implementation of reducing the power consumption of the server by the first value includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like.
  • the power consumption management device reduces the power consumption of the server by the first value within a holding time to below maximum power consumption that can be provided by the power supply after the power module is faulty. This ensures that the server does not break down.
  • a power consumption capping technology can precisely adjust the power consumption of the server.
  • the power consumption management device periodically detects the power consumption of the server, and calculates a difference between the power consumption of the server and the power consumption capping value of the server. When the difference is greater than a preset error value, a power control device adjusts the power consumption of the server, continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range.
  • a specific implementation of the power consumption adjustment is mainly adjustment of a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which precise control on the power consumption of the server can be implemented.
  • PCIe peripheral component interconnect express
  • the server includes a plurality of nodes.
  • the reducing, by the power consumption management device, first power consumption of the server by a first value to obtain second power consumption of the server specifically includes: obtaining, by the power consumption management device, a reduced value of power consumption of each node; and reducing, by the power consumption management device, the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value.
  • the adjusting, by the power consumption management device, the second power consumption of the server based on a power consumption capping value of the server specifically includes: obtaining, by the power consumption management device, a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting, by the power consumption management device, the power consumption of each node based on the power consumption capping value of each node.
  • an embodiment provides a power consumption management device.
  • the power consumption management device communicates with a power supply and a server, the power supply supplies power to the server, and the power supply includes a power module; and the power consumption management device includes a power consumption reduction unit and a power consumption capping unit.
  • the power consumption reduction unit is configured to perform the following operations: receiving fault information of the power module, and reducing first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server.
  • the power consumption capping unit is configured to perform the following operation: adjusting the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
  • the power consumption reduction unit is further configured to send the fault information to the power consumption capping unit; and that the power consumption capping unit is configured to receive the fault information of the power module specifically includes: receiving the fault information from the power consumption reduction unit.
  • the server includes a plurality of nodes
  • the power consumption reduction unit includes a plurality of power consumption reduction subunits
  • the power consumption capping unit includes a plurality of power consumption capping subunits
  • each power consumption reduction subunit and a power consumption capping subunit communicate with one of the nodes.
  • Each power consumption reduction subunit is configured to perform the following operations: receiving the fault information, obtaining a reduced value of power consumption of each node, and reducing the power consumption of each node by a second value, where the sum of the second values of the plurality of nodes is equal to the first value.
  • Each power consumption capping subunit is configured to perform the following operations: obtaining a power consumption capping value of each node based on the fault information, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
  • the power consumption reduction unit further includes a power consumption reduction management unit
  • the power consumption capping unit further includes a power consumption capping management unit.
  • the power consumption reduction management unit is configured to perform the following operations: receiving the fault information, and forwarding the fault information to each power consumption reduction subunit and the power consumption capping management unit; and that each power consumption reduction subunit receives the fault information of the power module specifically includes: receiving the fault information forwarded by the power consumption reduction management unit.
  • the power consumption capping management unit is configured to perform the following operations: receiving the fault information, and forwarding the fault information to each power consumption capping subunit. That each power consumption capping subunit receives the fault information of the power module specifically includes: receiving the fault information forwarded by the power consumption reduction management unit.
  • an embodiment provides a power consumption management device.
  • the power consumption management device communicates with a power supply and a server, the power supply supplies power to the server, and the power supply includes a power module; and the power consumption management device includes an interface and a processor, where the interface communicates with the processor, and the interface is configured to receive fault information of the power module.
  • the processor is configured to perform the following operations: reducing, based on the fault information, first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server; and adjusting the second power consumption of the server based on the fault information and a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
  • the server includes a plurality of nodes. That the processor is configured to reduce first power consumption of a server by a first value to obtain second power consumption of the server specifically includes: obtaining a reduced value of power consumption of each node; and reducing the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value.
  • That the power consumption management device adjusts the second power consumption of the server based on the power consumption capping value of the server specifically includes: obtaining a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
  • an embodiment provides a power consumption management device.
  • a non-volatile readable storage medium includes a first computer instruction used to receive fault information of a power module, and reduce first power consumption of a server by a first value to obtain second power consumption of the server, where the server is powered by a power supply, the power supply includes the power module, and the power consumption management device communicates with the power supply and the server; and the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server.
  • the non-volatile readable storage medium further includes a second instruction used to adjust the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
  • the server includes a plurality of nodes. That a first instruction is used to reduce first power consumption of a server by a first value to obtain second power consumption of the server specifically includes: obtaining a reduced value of power consumption of each node; and reducing the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value.
  • That a second instruction is used to adjust the second power consumption of the server based on a power consumption capping value of the server specifically includes: obtaining a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
  • FIG. 1 a is a schematic diagram of a single-node server equipped with a power supply
  • FIG. 1 b is a schematic diagram of a multi-node server equipped with a power supply
  • FIG. 2 is a schematic diagram of a power supply management module and a power supply in a server
  • FIG. 3 is a schematic diagram of performing steps by a power consumption management device in a single-node server
  • FIG. 4 is a schematic structural diagram of a power consumption management device in a single-node server
  • FIG. 5 is a schematic diagram of an architecture of a power consumption management device, a server, and a power supply in a multi-node server;
  • FIG. 6 is a schematic diagram of performing steps by a power consumption management device in a multi-node server.
  • FIG. 7 is a schematic diagram of a power consumption management device.
  • a server 100 is connected to a power supply 110 , and the power supply 110 includes a power module 111 a , a power module 111 b , . . . , and a power module 111 n .
  • a server 100 is a server including a plurality of nodes, and the server 100 includes a node 101 a , a node 101 b , . . . , and a node 101 n that are connected to each other, and a power supply 110 includes a power module 111 a , a power module 111 b , . . . , and a power module 111 n .
  • a connection 102 represents an example connection relationship between the nodes 101 .
  • FIG. 2 is a schematic diagram of a relationship between the power consumption management device 200 and a power supply 110 of the server 100 .
  • the power consumption management device 200 in the server 100 communicates with each power module 111 , and receives fault information of each power module 111 .
  • the fault information may be an interrupt signal or a signal in another form, and this is not limited in this embodiment.
  • the power consumption management device 200 stores a reduced value, calculated when a power module 111 is faulty, of power consumption of the server 100 . As shown in FIG. 3 , specific steps performed by the power consumption management device 200 are as follows.
  • a specific implementation of reducing the power consumption of the server 100 by the first value includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like.
  • the power consumption management device reduces the power consumption of the server 100 by the first value within a holding time to below maximum power consumption that can be provided by the power supply 110 after the power module 111 is faulty. This ensures that the server 100 does not break down.
  • a specific implementation of adjusting the power consumption of the server 100 is mainly adjusting a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which the power consumption of the server 100 can be controlled precisely.
  • PCIe peripheral component interconnect express
  • a power consumption capping technology is a specific implementation of step 303 .
  • the power consumption management device 200 periodically detects the power consumption of the server 100 , and calculates a difference between the power consumption of the server 100 and the power consumption capping value of the server 100 .
  • a power control device 200 adjusts the power consumption of the server 100 , continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range.
  • the power consumption capping technology can precisely adjust the power consumption of the server 100 , so that the power consumption of the server 100 approximates to, based on a preset error, the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
  • step 302 methods used for reducing the power consumption of the server 100 cannot precisely reduce the power consumption of the server 100 to the maximum power consumption that can be provided by the power supply 110 after the power module is faulty. Therefore, the power consumption management device 200 needs to perform step 303 , to adjust the power consumption of the server 200 to approximate to the maximum power consumption that can be provided by the power supply 110 after the power module is faulty. This improves utilization of the power module 111 that normally works. Step 302 and step 303 are combined. This avoids a breakdown of the server 100 , and further improves the utilization of the power supply 110 after the power module 111 is faulty.
  • a structure of the power consumption management device 200 of the single-node server in FIG. 2 includes a power consumption reduction unit 210 and a power consumption capping unit 220 .
  • the power consumption reduction unit 210 is configured to: receive the fault information of the power module 111 , perform step 301 , and forward the fault information to the power consumption capping unit 220 .
  • the power consumption capping unit 220 is configured to receive the fault information forwarded by the power consumption reduction unit 210 , and perform step 302 to implement power consumption and power supply management of the server 100 .
  • the power consumption reduction unit 210 is implemented by a complex programmable logic device (CPLD), a baseboard management controller (BMC), and another power supply control unit that can reduce the power consumption of the server within a holding time after the power module 111 is faulty. This is not limited in this embodiment.
  • CPLD complex programmable logic device
  • BMC baseboard management controller
  • another power supply control unit that can reduce the power consumption of the server within a holding time after the power module 111 is faulty. This is not limited in this embodiment.
  • the power consumption capping unit 220 is implemented by using the BMC, Intel Node Manager, and a combination of the BMC and a basic input/output system (BIOS).
  • BIOS basic input/output system
  • the power consumption reduction unit 210 and the power consumption capping unit 220 may be integrated into a chip or another hardware device, or may be independent electronic components coupled in an electrical, mechanical, or other form, or two or more units are integrated into one unit. This is not limited in this embodiment.
  • the server 100 in this embodiment may be a multi-node server.
  • FIG. 5 shows a connection relationship between a power consumption management device 500 , a multi-node server 100 , and a power supply 110 .
  • the server 100 includes the node 101 a , the node 101 b , . . . , and the node 101 n .
  • the power supply 110 supplies power to the server 100 , and the connection 102 is omitted in FIG. 5 .
  • the power consumption management device 500 includes a power consumption reduction unit 510 and a power consumption capping unit 520 .
  • the power consumption reduction unit 510 includes a power consumption reduction management unit 511 , a power consumption reduction subunit 512 a , a power consumption reduction subunit 512 b , . . . , and a power consumption reduction subunit 512 n .
  • the power consumption capping unit 520 includes a power consumption capping management unit 521 , a power consumption capping subunit 522 a , a power consumption capping subunit 522 b , . . . , and a power consumption capping subunit 522 n .
  • the power consumption reduction subunit 512 b , . . . , and the power consumption reduction subunit 512 n respectively communicate with the node 101 a , the node 101 b , .
  • the power consumption capping subunit 522 a , the power consumption capping subunit 522 b , . . . , and the power consumption capping subunit 522 n also respectively communicate with the node 101 a , the node 101 b , . . . , and the node 101 .
  • the power consumption capping management unit 521 manages all power consumption capping subunits 522
  • the power consumption reduction management unit 511 manages all power consumption reduction subunits 522 .
  • the power consumption reduction management unit 511 communicates with the power supply 110 .
  • the power consumption reduction management unit 511 and the power consumption capping management unit 521 store a reduced value of the power consumption of the server 100 when a power module 111 is faulty.
  • the fault information may be an interrupt signal or a signal in another form, and this is not limited in this embodiment.
  • a specific implementation of reducing the power consumption of the server 100 includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like.
  • the implementation of rapidly reducing the power consumption cannot precisely control the power consumption of each node. As a result, low utilization of the power supply 110 is caused. In this case, the power consumption management device 500 performs step 604 .
  • the power consumption management device 500 obtains power consumption capping values of each node based on the fault information, where the sum of the power consumption capping values of the plurality of nodes is a power consumption capping value of the server 100 .
  • An initial capping value for a power consumption capping operation on each node is a difference between a power consumption value of the node when the power module 111 works normally, and a reduced value of the power consumption of the node when the power module 111 is faulty.
  • a specific implementation of step 605 is mainly adjusting a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which the power consumption of the server 100 can be controlled precisely.
  • PCIe peripheral component interconnect express
  • the power consumption capping technology in step 605 in a specific implementation includes the following steps: A power control device 200 periodically detects the power consumption of the server 100 , and calculates a difference between the power consumption of the server 100 and the power consumption capping value of the server 100 . When the difference is greater than a preset error value, the power control device 200 adjusts the power consumption of the server 100 , continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range.
  • the power consumption capping technology can precisely adjust the power consumption of the server 100 , so that the power consumption of the server 100 approximates to, based on a preset error, the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
  • step 604 and step 605 avoids a breakdown of the server 100 when power is off, and further improves utilization of the power supply 110 after the power module 111 is faulty.
  • FIG. 6 shows an internal structure of the power consumption management device 500 in this method, and a specific structure is described above.
  • the power consumption reduction management unit 511 is configured to receive the fault information of the power module 111 , and forward the fault information to each power consumption reduction subunit 512 .
  • Each power consumption reduction subunit 512 performs step 602 and step 603 to rapidly reduce the power consumption of the server 100 within the holding time of the faulty power module 111 . This ensures running of the nodes 512 , namely, the server 100 .
  • the power consumption reduction management unit 511 is further configured to forward the fault information to the power consumption capping management unit 521 .
  • the power consumption capping management unit 521 is configured to receive fault information of the power consumption reduction management unit 511 , and forward the fault information to each power consumption capping subunit 522 .
  • Each power consumption capping subunit 522 performs step 604 and step 605 to adjust, within the preset error range, the power consumption of each node 101 to the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
  • constituent parts of the power consumption management device 500 may be integrated into a chip or another hardware device, or may be independent electronic components coupled in an electrical, mechanical, or other form, or two or more units may be integrated into one unit. This is not limited in this embodiment.
  • the power consumption management device 700 includes an interface device 710 and a processor 720 .
  • the processor 720 may include a CPU and a memory. There may be one or more CPUs.
  • the processor 720 may further be a field programmable gate array (FPGA), a combination of an FPGA and the CPU, or a combination of the CPU and a BIOS. This is not limited in this embodiment of the present application.
  • the interface device 710 is configured to receive the fault information of the power module 111 .
  • the processor 720 is configured to perform step 302 and step 303 .
  • the processor 720 is configured to perform step 602 to step 605 .
  • An embodiment further provides a non-volatile readable storage medium.
  • the readable storage medium includes a first instruction used to perform step 301 and step 302 and a second instruction used to perform step 303 .
  • the server 100 is a multi-node server, the readable storage medium includes a first instruction used to perform step 603 to step 603 and a second instruction used to perform step 604 and step 605 .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Computer Hardware Design (AREA)
  • Power Sources (AREA)

Abstract

A power consumption management method and a power consumption management device are provided. When a power module of a power supply for a server is faulty, a power consumption management device receives fault information sent by the power supply, and reduces first power consumption, calculated when the power module works normally, of the server by a first value to second power consumption of the server based on the fault information. The first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server. In addition, the power consumption management device adjusts the second power consumption of the server based on a power consumption capping value of the server. According to the application, a breakdown of the server is avoided, and the power utilization after the power module is faulty is further improved.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation of International Application No. PCT/CN2018/105194, filed on Sep. 12, 2018, which claims priority to Chinese Patent Application No. 201710826652.4, filed on Sep. 14, 2017. The disclosures of the aforementioned applications are hereby incorporated by reference in their entireties.
TECHNICAL FIELD
Embodiments of the present application relates to the field of information technologies, and in particular, to a server power consumption management method and a device.
BACKGROUND
A server is usually equipped with a power supply including a plurality of power modules. A power consumption capping technology can ensure that power consumption of the server is maintained in a stable level when the server is running, to improve power utilization. Users set a capping value of power consumption of the entire server, and the power consumption of the entire server is periodically checked when the server is running. If the power consumption reaches the capping value, measures, such as reducing a frequency of a central processing unit (CPU) of the server, are used to limit the power consumption of the server within an error range of 5% of target power consumption.
However, when a power module of the server is faulty, because a time for completing a power consumption capping operation is far longer than a holding time that can be used for maintaining normal work of the server when the power module is faulty, total power consumption that can be provided by the power supply is rapidly reduced to below current running power consumption of the server, leading to a breakdown of the server.
SUMMARY
According to an aspect, an embodiment provides a server power consumption management method. A power supply supplies power to a server, the power supply includes a power module, and a power consumption management device communicates with the power supply and the server. The method includes: receiving, by the power consumption management device, fault information of the power module, and reducing first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server; and adjusting, by the power consumption management device, the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
A specific implementation of reducing the power consumption of the server by the first value includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like. After the power module is faulty, the power consumption management device reduces the power consumption of the server by the first value within a holding time to below maximum power consumption that can be provided by the power supply after the power module is faulty. This ensures that the server does not break down. A power consumption capping technology can precisely adjust the power consumption of the server. The power consumption management device periodically detects the power consumption of the server, and calculates a difference between the power consumption of the server and the power consumption capping value of the server. When the difference is greater than a preset error value, a power control device adjusts the power consumption of the server, continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range. A specific implementation of the power consumption adjustment is mainly adjustment of a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which precise control on the power consumption of the server can be implemented. This method avoids a breakdown of the server, and further improves power utilization after the power module is faulty.
With reference to a first aspect, in a first possible implementation of the first aspect, the server includes a plurality of nodes. The reducing, by the power consumption management device, first power consumption of the server by a first value to obtain second power consumption of the server specifically includes: obtaining, by the power consumption management device, a reduced value of power consumption of each node; and reducing, by the power consumption management device, the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value. The adjusting, by the power consumption management device, the second power consumption of the server based on a power consumption capping value of the server specifically includes: obtaining, by the power consumption management device, a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting, by the power consumption management device, the power consumption of each node based on the power consumption capping value of each node.
According to a second aspect, an embodiment provides a power consumption management device. The power consumption management device communicates with a power supply and a server, the power supply supplies power to the server, and the power supply includes a power module; and the power consumption management device includes a power consumption reduction unit and a power consumption capping unit. The power consumption reduction unit is configured to perform the following operations: receiving fault information of the power module, and reducing first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server. The power consumption capping unit is configured to perform the following operation: adjusting the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
With reference to the second aspect, in a first possible implementation of the second aspect, the power consumption reduction unit is further configured to send the fault information to the power consumption capping unit; and that the power consumption capping unit is configured to receive the fault information of the power module specifically includes: receiving the fault information from the power consumption reduction unit.
With reference to the second aspect or the first implementation of the second aspect, in a second possible implementation of the second aspect, the server includes a plurality of nodes, the power consumption reduction unit includes a plurality of power consumption reduction subunits, the power consumption capping unit includes a plurality of power consumption capping subunits, and each power consumption reduction subunit and a power consumption capping subunit communicate with one of the nodes. Each power consumption reduction subunit is configured to perform the following operations: receiving the fault information, obtaining a reduced value of power consumption of each node, and reducing the power consumption of each node by a second value, where the sum of the second values of the plurality of nodes is equal to the first value. Each power consumption capping subunit is configured to perform the following operations: obtaining a power consumption capping value of each node based on the fault information, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
With reference to the second aspect, in a third possible implementation of the second aspect, the power consumption reduction unit further includes a power consumption reduction management unit, and the power consumption capping unit further includes a power consumption capping management unit. The power consumption reduction management unit is configured to perform the following operations: receiving the fault information, and forwarding the fault information to each power consumption reduction subunit and the power consumption capping management unit; and that each power consumption reduction subunit receives the fault information of the power module specifically includes: receiving the fault information forwarded by the power consumption reduction management unit. The power consumption capping management unit is configured to perform the following operations: receiving the fault information, and forwarding the fault information to each power consumption capping subunit. That each power consumption capping subunit receives the fault information of the power module specifically includes: receiving the fault information forwarded by the power consumption reduction management unit.
According to a third aspect, an embodiment provides a power consumption management device. The power consumption management device communicates with a power supply and a server, the power supply supplies power to the server, and the power supply includes a power module; and the power consumption management device includes an interface and a processor, where the interface communicates with the processor, and the interface is configured to receive fault information of the power module. The processor is configured to perform the following operations: reducing, based on the fault information, first power consumption of the server by a first value to obtain second power consumption of the server, where the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server; and adjusting the second power consumption of the server based on the fault information and a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
With reference to the third aspect, in a first possible implementation of the third aspect, the server includes a plurality of nodes. That the processor is configured to reduce first power consumption of a server by a first value to obtain second power consumption of the server specifically includes: obtaining a reduced value of power consumption of each node; and reducing the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value. That the power consumption management device adjusts the second power consumption of the server based on the power consumption capping value of the server specifically includes: obtaining a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
According to a fourth aspect, an embodiment provides a power consumption management device. A non-volatile readable storage medium includes a first computer instruction used to receive fault information of a power module, and reduce first power consumption of a server by a first value to obtain second power consumption of the server, where the server is powered by a power supply, the power supply includes the power module, and the power consumption management device communicates with the power supply and the server; and the first power consumption is a power consumption value of the server calculated when the power module works normally, and the first value is not less than a reduced value, calculated when the power module is faulty, of power consumption of the server. The non-volatile readable storage medium further includes a second instruction used to adjust the second power consumption of the server based on a power consumption capping value of the server, where the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server.
With reference to the fourth aspect, in a first possible implementation of the fourth aspect, the server includes a plurality of nodes. That a first instruction is used to reduce first power consumption of a server by a first value to obtain second power consumption of the server specifically includes: obtaining a reduced value of power consumption of each node; and reducing the power consumption of each node by a second value based on the reduced value of the power consumption of each node, where the sum of the second values of the plurality of nodes is equal to the first value. That a second instruction is used to adjust the second power consumption of the server based on a power consumption capping value of the server specifically includes: obtaining a power consumption capping value of each node, where the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and adjusting the power consumption of each node based on the power consumption capping value of each node.
BRIEF DESCRIPTION OF DRAWINGS
FIG. 1a is a schematic diagram of a single-node server equipped with a power supply;
FIG. 1b is a schematic diagram of a multi-node server equipped with a power supply;
FIG. 2 is a schematic diagram of a power supply management module and a power supply in a server;
FIG. 3 is a schematic diagram of performing steps by a power consumption management device in a single-node server;
FIG. 4 is a schematic structural diagram of a power consumption management device in a single-node server;
FIG. 5 is a schematic diagram of an architecture of a power consumption management device, a server, and a power supply in a multi-node server;
FIG. 6 is a schematic diagram of performing steps by a power consumption management device in a multi-node server; and
FIG. 7 is a schematic diagram of a power consumption management device.
DESCRIPTION OF EMBODIMENTS
In FIG. 1a , a server 100 is connected to a power supply 110, and the power supply 110 includes a power module 111 a, a power module 111 b, . . . , and a power module 111 n. In FIG. 1b , a server 100 is a server including a plurality of nodes, and the server 100 includes a node 101 a, a node 101 b, . . . , and a node 101 n that are connected to each other, and a power supply 110 includes a power module 111 a, a power module 111 b, . . . , and a power module 111 n. A connection 102 represents an example connection relationship between the nodes 101.
An embodiment provides a power consumption management device 200 of a server 100. FIG. 2 is a schematic diagram of a relationship between the power consumption management device 200 and a power supply 110 of the server 100. When the server 100 is a single-node server, the power consumption management device 200 in the server 100 communicates with each power module 111, and receives fault information of each power module 111. In this embodiment, the fault information may be an interrupt signal or a signal in another form, and this is not limited in this embodiment. The power consumption management device 200 stores a reduced value, calculated when a power module 111 is faulty, of power consumption of the server 100. As shown in FIG. 3, specific steps performed by the power consumption management device 200 are as follows.
301. Receive the fault information of the power module 111.
302. Reduce first power consumption of the server 100 by a first value to obtain second power consumption of the server 100, where the first power consumption is a power consumption value of the server 100 when the power module 111 works normally, and the first value is not less than the reduced value of the power consumption of the server 100 when the power module 111 is faulty.
A specific implementation of reducing the power consumption of the server 100 by the first value includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like. In step 301, after the power module 111 is faulty, the power consumption management device reduces the power consumption of the server 100 by the first value within a holding time to below maximum power consumption that can be provided by the power supply 110 after the power module 111 is faulty. This ensures that the server 100 does not break down.
303. Adjust the second power consumption of the server 100 based on a power consumption capping value of the server 100, where the power consumption capping value of the server 100 is a difference between the first power consumption and the reduced value of the power consumption of the server 100 when the power module 111 is faulty.
In step 303, a specific implementation of adjusting the power consumption of the server 100 is mainly adjusting a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which the power consumption of the server 100 can be controlled precisely.
A power consumption capping technology is a specific implementation of step 303. The power consumption management device 200 periodically detects the power consumption of the server 100, and calculates a difference between the power consumption of the server 100 and the power consumption capping value of the server 100. When the difference is greater than a preset error value, a power control device 200 adjusts the power consumption of the server 100, continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range. The power consumption capping technology can precisely adjust the power consumption of the server 100, so that the power consumption of the server 100 approximates to, based on a preset error, the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
Generally, in step 302, methods used for reducing the power consumption of the server 100 cannot precisely reduce the power consumption of the server 100 to the maximum power consumption that can be provided by the power supply 110 after the power module is faulty. Therefore, the power consumption management device 200 needs to perform step 303, to adjust the power consumption of the server 200 to approximate to the maximum power consumption that can be provided by the power supply 110 after the power module is faulty. This improves utilization of the power module 111 that normally works. Step 302 and step 303 are combined. This avoids a breakdown of the server 100, and further improves the utilization of the power supply 110 after the power module 111 is faulty.
As shown in FIG. 4, a structure of the power consumption management device 200 of the single-node server in FIG. 2 includes a power consumption reduction unit 210 and a power consumption capping unit 220. The power consumption reduction unit 210 is configured to: receive the fault information of the power module 111, perform step 301, and forward the fault information to the power consumption capping unit 220. The power consumption capping unit 220 is configured to receive the fault information forwarded by the power consumption reduction unit 210, and perform step 302 to implement power consumption and power supply management of the server 100.
Specifically, the power consumption reduction unit 210 is implemented by a complex programmable logic device (CPLD), a baseboard management controller (BMC), and another power supply control unit that can reduce the power consumption of the server within a holding time after the power module 111 is faulty. This is not limited in this embodiment.
The power consumption capping unit 220 is implemented by using the BMC, Intel Node Manager, and a combination of the BMC and a basic input/output system (BIOS).
In addition, the power consumption reduction unit 210 and the power consumption capping unit 220 may be integrated into a chip or another hardware device, or may be independent electronic components coupled in an electrical, mechanical, or other form, or two or more units are integrated into one unit. This is not limited in this embodiment.
The server 100 in this embodiment may be a multi-node server. FIG. 5 shows a connection relationship between a power consumption management device 500, a multi-node server 100, and a power supply 110. Same as that in FIG. 1b , the server 100 includes the node 101 a, the node 101 b, . . . , and the node 101 n. The power supply 110 supplies power to the server 100, and the connection 102 is omitted in FIG. 5. The power consumption management device 500 includes a power consumption reduction unit 510 and a power consumption capping unit 520. The power consumption reduction unit 510 includes a power consumption reduction management unit 511, a power consumption reduction subunit 512 a, a power consumption reduction subunit 512 b, . . . , and a power consumption reduction subunit 512 n. The power consumption capping unit 520 includes a power consumption capping management unit 521, a power consumption capping subunit 522 a, a power consumption capping subunit 522 b, . . . , and a power consumption capping subunit 522 n. The power consumption reduction subunit 512 b, . . . , and the power consumption reduction subunit 512 n respectively communicate with the node 101 a, the node 101 b, . . . , and the node 101 n. The power consumption capping subunit 522 a, the power consumption capping subunit 522 b, . . . , and the power consumption capping subunit 522 n also respectively communicate with the node 101 a, the node 101 b, . . . , and the node 101. The power consumption capping management unit 521 manages all power consumption capping subunits 522, and the power consumption reduction management unit 511 manages all power consumption reduction subunits 522. In addition, the power consumption reduction management unit 511 communicates with the power supply 110. The power consumption reduction management unit 511 and the power consumption capping management unit 521 store a reduced value of the power consumption of the server 100 when a power module 111 is faulty.
In the architecture shown in FIG. 5, specific steps performed by the power consumption management device 500 are as follows.
601. Receive the fault information of the power module 111.
The fault information may be an interrupt signal or a signal in another form, and this is not limited in this embodiment.
602. Obtain reduced values of power consumption of the node 101 a, the node 101 b, . . . , and the node 101 n.
603. Reduce power consumption of each node by a second value based on reduced values of the power consumption of the node 101 a, the node 101 b, . . . , and the node 101 n, so that within a holding time of the faulty power module 111, total power consumption of all nodes, namely, power consumption of the server 100, is reduced to below maximum power consumption that can be provided by the power supply 110 after the power module 111 is faulty. This ensures that the server 100 does not break down.
A specific implementation of reducing the power consumption of the server 100 includes but is not limited to pulling down Prochot and Memhot pins of a CPU, turning off a component such as a clock, temporarily powering off a fan, triggering a low load or a hibernate mode of a component, or the like. As described above, the implementation of rapidly reducing the power consumption cannot precisely control the power consumption of each node. As a result, low utilization of the power supply 110 is caused. In this case, the power consumption management device 500 performs step 604.
604. The power consumption management device 500 obtains power consumption capping values of each node based on the fault information, where the sum of the power consumption capping values of the plurality of nodes is a power consumption capping value of the server 100.
605. Adjust the power consumption of each node based on the power consumption capping values of each node, so that the power consumption of the server 100 approximates to, based on a preset error, the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
An initial capping value for a power consumption capping operation on each node is a difference between a power consumption value of the node when the power module 111 works normally, and a reduced value of the power consumption of the node when the power module 111 is faulty.
A specific implementation of step 605 is mainly adjusting a running state of a high-power component, including but not limited to CPU frequency and voltage adjustment, CPU core enabling and disabling, a CPU P/T-state, a memory frequency, a T state of a memory, reading, writing, and hibernation states of a hard disk, an L0/L1 pin state of a high-speed peripheral component interconnect express (PCIe) network adapter, a working status of a graphics processing unit (GPU), a fan speed, and other manners in which the power consumption of the server 100 can be controlled precisely.
The power consumption capping technology in step 605 in a specific implementation includes the following steps: A power control device 200 periodically detects the power consumption of the server 100, and calculates a difference between the power consumption of the server 100 and the power consumption capping value of the server 100. When the difference is greater than a preset error value, the power control device 200 adjusts the power consumption of the server 100, continues to detect the power consumption, and calculates the difference until the difference falls within a preset error range. The power consumption capping technology can precisely adjust the power consumption of the server 100, so that the power consumption of the server 100 approximates to, based on a preset error, the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
Performing step 604 and step 605 avoids a breakdown of the server 100 when power is off, and further improves utilization of the power supply 110 after the power module 111 is faulty.
FIG. 6 shows an internal structure of the power consumption management device 500 in this method, and a specific structure is described above. The power consumption reduction management unit 511 is configured to receive the fault information of the power module 111, and forward the fault information to each power consumption reduction subunit 512. Each power consumption reduction subunit 512 performs step 602 and step 603 to rapidly reduce the power consumption of the server 100 within the holding time of the faulty power module 111. This ensures running of the nodes 512, namely, the server 100.
The power consumption reduction management unit 511 is further configured to forward the fault information to the power consumption capping management unit 521. The power consumption capping management unit 521 is configured to receive fault information of the power consumption reduction management unit 511, and forward the fault information to each power consumption capping subunit 522. Each power consumption capping subunit 522 performs step 604 and step 605 to adjust, within the preset error range, the power consumption of each node 101 to the maximum power consumption that can be provided by the power supply 110 when the power module 111 is faulty.
Similarly, constituent parts of the power consumption management device 500 may be integrated into a chip or another hardware device, or may be independent electronic components coupled in an electrical, mechanical, or other form, or two or more units may be integrated into one unit. This is not limited in this embodiment.
An embodiment further provides a power consumption management device 700, as shown in FIG. 7. The power consumption management device 700 includes an interface device 710 and a processor 720. The processor 720 may include a CPU and a memory. There may be one or more CPUs. The processor 720 may further be a field programmable gate array (FPGA), a combination of an FPGA and the CPU, or a combination of the CPU and a BIOS. This is not limited in this embodiment of the present application. The interface device 710 is configured to receive the fault information of the power module 111. When a server 100 is a single-node server, the processor 720 is configured to perform step 302 and step 303. When the server 100 is a multi-node server, the processor 720 is configured to perform step 602 to step 605.
An embodiment further provides a non-volatile readable storage medium. When a server 100 is a single-node server, the readable storage medium includes a first instruction used to perform step 301 and step 302 and a second instruction used to perform step 303. When the server 100 is a multi-node server, the readable storage medium includes a first instruction used to perform step 603 to step 603 and a second instruction used to perform step 604 and step 605.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatuses and methods may be implemented in other manners. For example, division of the unit in the described apparatus embodiment is merely logical function division, or may be other division during actual implementation. For example, a plurality of units or components may be combined or integrated into another system, or some features may be ignored or may not be performed. In addition, the displayed or discussed mutual couplings or direct couplings or communication connections may be implemented by using some interfaces. The indirect couplings or communication connections between the apparatuses or units may be implemented in an electrical manner, a mechanical manner, or another manner.

Claims (9)

What is claimed is:
1. A power consumption management method, comprising:
receiving, by a power consumption management device, fault information of a power module, wherein the power consumption management device communicates with a server, and a power supply that supplies power to the server comprises the power module;
reducing, by the power consumption management device, first power consumption of the server by a first value to obtain second power consumption of the server, wherein the first power consumption is a power consumption value of the server determined during a period in which the power module works normally, and the first value is not less than a reduced value of power consumption of the server, and the reduced value is determined during a period in which the power module is faulty;
adjusting, by the power consumption management device, the second power consumption of the server based on a power consumption capping value of the server, wherein the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server,
wherein reducing the power consumption of the server includes pulling down Prochot and Memhot pins of a CPU; and
in response to a difference between the power consumption capping value of the server and the power consumption of the server being greater than a preset error value, continuously adjust the power consumption of the server, until the difference falls within a preset error range, so as to adjust the power consumption of the server to approximate to a maximum power consumption provided by the power supply upon the power module being faulty.
2. The method according to claim 1, wherein the server comprises a plurality of nodes, wherein the reducing the first power consumption of the server comprises:
obtaining, by the power consumption management device, a reduced value of power consumption of each node; and
reducing, by the power consumption management device, the power consumption of each node by a second value based on the reduced value of the power consumption of each node, wherein a sum of second values of the plurality of nodes is equal to the first value.
3. The method according to claim 2, wherein the adjusting the second power consumption of the server comprises:
obtaining, by the power consumption management device, a power consumption capping value of each node, wherein the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and
adjusting, by the power consumption management device, the power consumption of each node based on the power consumption capping value of each node.
4. A power consumption management device, comprising:
a processor, and
a memory, storing processor-executable instructions which are executed by the processor and cause the device to perform operations including:
receiving fault information of a power module, wherein the power consumption management device communicates with a server, and a power supply that supplies power to the server comprises the power module;
reducing, based on the fault information, first power consumption of the server by a first value to obtain second power consumption of the server, wherein the first power consumption is a power consumption value of the server determined during a period in which the power module works normally, and the first value is not less than a reduced value of power consumption of the server, and the reduced value is determined during a period in which the power module is faulty;
adjusting the second power consumption of the server based on the fault information and a power consumption capping value of the server, wherein the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server,
wherein reducing the power consumption of the server includes pulling down Prochot and Memhot pins of a CPU; and
in response to a difference between the power consumption capping value of the server and the power consumption of the server being greater than a preset error value, continuously adjusting the power consumption of the server, until the difference falls within a preset error range, so as to adjust the power consumption of the server to approximate to a maximum power consumption provided by the power supply upon the power module being faulty.
5. The power consumption management device according to claim 4, wherein the server comprises a plurality of nodes, wherein the processor is configured to:
obtain a reduced value of power consumption of each node; and
reduce the power consumption of each node by a second value based on the reduced value of the power consumption of each node, wherein a sum of the second values of the plurality of nodes is equal to the first value.
6. The power consumption management device according to claim 5, wherein the processor is configured to:
obtain a power consumption capping value of each node, wherein the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and
adjust the power consumption of each node based on the power consumption capping value of each node.
7. A non-transitory computer-readable storage medium, storing processor-executable instructions to be executed by a processor of a power consumption management device, wherein the instructions comprise:
instruction for receiving fault information of a power module;
instruction for reducing first power consumption of a server by a first value to obtain second power consumption of the server, wherein the server is powered by a power supply, the power supply comprises the power module, and the power consumption management device communicates with the server; and the first power consumption is a power consumption value of the server determined during a period in which the power module works normally, and the first value is not less than a reduced value of power consumption of the server, and the reduced value is determined during a period in which the power module is faulty;
instruction for adjusting the second power consumption of the server based on a power consumption capping value of the server, wherein the power consumption capping value of the server is a difference between the first power consumption and the reduced value of the power consumption of the server,
wherein reducing the power consumption of the server includes pulling down Prochot and Memhot pins of a CPU; and
in response to a difference between the power consumption capping value of the server and the power consumption of the server being greater than a preset error value, instruction for continuously adjusting the power consumption of the server, until the difference falls within a preset error range, so as to adjust the power consumption of the server to approximate to a maximum power consumption provided by the power supply upon the power module being faulty.
8. The non-transitory computer-readable storage medium according to claim 7, wherein the server comprises a plurality of nodes, wherein the instructions comprises:
instruction for obtaining a reduced value of power consumption of each node; and
instruction for reducing the power consumption of each node by a second value based on the reduced value of the power consumption of each node, wherein a sum of the second values of the plurality of nodes is equal to the first value.
9. The non-transitory computer-readable storage medium according to claim 8, wherein the instructions comprise:
instruction for obtaining a power consumption capping value of each node, wherein the sum of the power consumption capping values of the plurality of nodes is the power consumption capping value of the server; and
instruction for adjusting the power consumption of each node based on the power consumption capping value of each node.
US16/818,765 2017-09-14 2020-03-13 Server power consumption management method and device Active US11385985B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201710826652.4A CN107783882B (en) 2017-09-14 2017-09-14 Server power consumption management method and equipment
CN201710826652.4 2017-09-14
PCT/CN2018/105194 WO2019052461A1 (en) 2017-09-14 2018-09-12 Method and device for managing power consumption of server

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/105194 Continuation WO2019052461A1 (en) 2017-09-14 2018-09-12 Method and device for managing power consumption of server

Publications (2)

Publication Number Publication Date
US20200210304A1 US20200210304A1 (en) 2020-07-02
US11385985B2 true US11385985B2 (en) 2022-07-12

Family

ID=61437913

Family Applications (1)

Application Number Title Priority Date Filing Date
US16/818,765 Active US11385985B2 (en) 2017-09-14 2020-03-13 Server power consumption management method and device

Country Status (3)

Country Link
US (1) US11385985B2 (en)
CN (1) CN107783882B (en)
WO (1) WO2019052461A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107783882B (en) * 2017-09-14 2021-01-01 华为技术有限公司 Server power consumption management method and equipment
CN108983946B (en) * 2018-06-13 2021-04-27 烽火通信科技股份有限公司 Server power consumption control method, system and equipment
CN109062618B (en) * 2018-06-29 2022-01-11 深圳市同泰怡信息技术有限公司 Development method, system and medium for server single-node power consumption capping firmware
CN110362175A (en) * 2019-06-29 2019-10-22 苏州浪潮智能科技有限公司 A kind of control method for fan and device
US11416353B2 (en) * 2019-09-13 2022-08-16 Dell Products L.P. DIMM voltage regulator soft start-up for power fault detection
US11461161B2 (en) * 2019-09-13 2022-10-04 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Using server power to predict failures
CN111309132B (en) * 2020-02-21 2021-10-29 苏州浪潮智能科技有限公司 Method for multi-gear power supply redundancy of server
CN111478824B (en) * 2020-03-20 2022-04-26 苏州浪潮智能科技有限公司 Network card power consumption testing method, device and system
CN112558748B (en) * 2020-11-26 2023-05-12 山东云海国创云计算装备产业创新中心有限公司 Method, device and equipment for limiting power consumption of server and readable storage medium
TWI777320B (en) * 2020-12-04 2022-09-11 神雲科技股份有限公司 Power consumption adjustment method and server

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050172157A1 (en) * 2004-01-30 2005-08-04 Dell Products L.P. System and method for managing power consumption in a computer system having a redundant power supply
US20070250218A1 (en) * 2006-04-20 2007-10-25 Paul Culley Power management logic that reconfigures a load when a power supply fails
US20080320322A1 (en) * 2007-06-25 2008-12-25 Green Alan M Dynamic Converter Control for Efficient Operation
CN101937264A (en) 2010-08-27 2011-01-05 北京星网锐捷网络技术有限公司 Power supply power management method and device as well as modularizing equipment
US20120030493A1 (en) * 2009-04-17 2012-02-02 Cepulis Darren J Power Capping System And Method
CN102541239A (en) 2010-12-16 2012-07-04 鸿富锦精密工业(深圳)有限公司 Network equipment and power consumption control method thereof
CN102916835A (en) 2012-10-18 2013-02-06 华为技术有限公司 Method and device for regulating power consumption of equipments
US20150355699A1 (en) * 2014-06-04 2015-12-10 Enrique Castro-Leon Data center management
US20160094426A1 (en) 2014-09-30 2016-03-31 Microsoft Corporation Monitoring of shared server set power supply units
CN107783882A (en) 2017-09-14 2018-03-09 华为技术有限公司 A kind of server energy consumption management method and equipment

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050172157A1 (en) * 2004-01-30 2005-08-04 Dell Products L.P. System and method for managing power consumption in a computer system having a redundant power supply
US20070250218A1 (en) * 2006-04-20 2007-10-25 Paul Culley Power management logic that reconfigures a load when a power supply fails
US20080320322A1 (en) * 2007-06-25 2008-12-25 Green Alan M Dynamic Converter Control for Efficient Operation
US20120030493A1 (en) * 2009-04-17 2012-02-02 Cepulis Darren J Power Capping System And Method
CN101937264A (en) 2010-08-27 2011-01-05 北京星网锐捷网络技术有限公司 Power supply power management method and device as well as modularizing equipment
CN102541239A (en) 2010-12-16 2012-07-04 鸿富锦精密工业(深圳)有限公司 Network equipment and power consumption control method thereof
CN102916835A (en) 2012-10-18 2013-02-06 华为技术有限公司 Method and device for regulating power consumption of equipments
EP2747344B1 (en) 2012-10-18 2016-01-20 Huawei Technologies Co., Ltd. Method and device for adjusting power consumption of apparatuses
US20150355699A1 (en) * 2014-06-04 2015-12-10 Enrique Castro-Leon Data center management
US20160094426A1 (en) 2014-09-30 2016-03-31 Microsoft Corporation Monitoring of shared server set power supply units
CN107783882A (en) 2017-09-14 2018-03-09 华为技术有限公司 A kind of server energy consumption management method and equipment

Also Published As

Publication number Publication date
US20200210304A1 (en) 2020-07-02
WO2019052461A1 (en) 2019-03-21
CN107783882A (en) 2018-03-09
CN107783882B (en) 2021-01-01

Similar Documents

Publication Publication Date Title
US11385985B2 (en) Server power consumption management method and device
US9563257B2 (en) Dynamic energy-saving method and apparatus for PCIE device, and communication system thereof
US10817043B2 (en) System and method for entering and exiting sleep mode in a graphics subsystem
JP5707321B2 (en) Sleep processor
US9158359B2 (en) Adaptive voltage scaling using a serial interface
WO2019104947A1 (en) System-on-chip, universal serial bus master device, system and awaking method
US20220229479A1 (en) Power supply unit (psu)-based power supply system
US20150121104A1 (en) Information processing method, information processing apparatus, and non-transitory computer-readable storage medium
US11644884B2 (en) Controlling a processor clock
CN104571442A (en) Power platform-based memory board POWER-on time sequence control method
US10394309B2 (en) Power gated communication controller
JP2021163480A (en) Power management system and power management method
TWI243983B (en) System and method of power management
CN104885032A (en) Computing system and processor with fast power surge detection and instruction throttle down to provide for low cost power supply unit
TWI726550B (en) Method of providing power in standby phase
US9201822B2 (en) Host controller apparatus, information processing apparatus, and event information output method
WO2023029375A1 (en) Power source consumption management apparatus for four-way server
US10254821B2 (en) Managing surprise hot plug in low power state
US20210360118A1 (en) Communication apparatus, imaging apparatus, control method of the same, and storage medium
US20170185128A1 (en) Method and apparatus to control number of cores to transition operational states
TWI744581B (en) Electronic device and powering method thereof
CN113741280A (en) Intelligent management control device of homemade VPX framework
TWI459189B (en) Motherboard and power management method thereof
CN102221870B (en) Power redundancy supply method and device
TW201407359A (en) Daisy-chained apparatus and system thereof

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

AS Assignment

Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA

Free format text: EMPLOYMENT AGREEMENT;ASSIGNORS:LI, ZHIBING;TAO, LANG;SIGNING DATES FROM 20150119 TO 20190131;REEL/FRAME:060389/0252

Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:WANG, JIANGTAO;REEL/FRAME:060005/0707

Effective date: 20200525

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

CC Certificate of correction