CN115237719A - Early warning method and system for reliability of server power supply - Google Patents

Early warning method and system for reliability of server power supply Download PDF

Info

Publication number
CN115237719A
CN115237719A CN202210902093.1A CN202210902093A CN115237719A CN 115237719 A CN115237719 A CN 115237719A CN 202210902093 A CN202210902093 A CN 202210902093A CN 115237719 A CN115237719 A CN 115237719A
Authority
CN
China
Prior art keywords
power supply
server
early warning
information
reliability
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210902093.1A
Other languages
Chinese (zh)
Inventor
刘坤
张鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202210902093.1A priority Critical patent/CN115237719A/en
Publication of CN115237719A publication Critical patent/CN115237719A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data

Abstract

The invention provides a method and a system for early warning of server power supply reliability, wherein the method comprises the steps of monitoring state information of a power supply, characteristic parameter information of each component of the power supply and index data of power supply operation respectively; comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level; and presetting a coping strategy based on the risk level correspondence, wherein the coping strategy comprises field inspection, and early warning prompt of a server power supply is carried out by combining a field inspection result and the risk level, and the field inspection is used for acquiring external environment information of the server. The invention sets multi-directional monitoring on the server power supply, which comprises state information, index data and characteristic parameters, and carries out polling monitoring on the server power supply, and the accurate power supply state information is obtained by verifying the state information monitored by the BMC through the CPLD, thereby avoiding the condition that the error alarm occurs in the single-mode monitoring of the existing BMC and ensuring the accuracy of early warning.

Description

Early warning method and system for reliability of server power supply
Technical Field
The invention relates to the technical field of server power management, in particular to a method and a system for early warning of server power reliability.
Background
With the rapid popularization and development of the internet, the cloud computing technology makes great progress, cloud data centers are built successively, and cloud service products gradually enter the daily life of people. People's daily life relies on network communication more, and the server that acts as the network backbone becomes more and more important, and along with the use of server in a large number, data center computer lab is established in succession, and the quantity and the scale of computer lab server increase day by day, for guaranteeing data security of data center computer lab server, data center computer lab server power supply stability is very important, and the server power is the most important power module of server.
At present, a rack type storage mode is generally adopted for servers of a data center machine room, a plurality of servers can be placed in each group of racks, the rack loading density is high, and each server is independently powered and works by at least 2 redundant power supplies. In order to meet the long-time and uninterrupted operation requirement of the server and the complex front-end data processing working condition, the power supply of the server needs to have higher reliability. If the power supply of the server is powered off due to the internal software and hardware faults of the power supply or the external complex working conditions, the server is shut down due to the loss of redundancy and the interruption of power supply, and potential safety hazards are brought to client data.
In actual work of the machine room server, a server BMC (Baseboard Management controller) is mainly used for monitoring a power supply working state, for example, a server power supply working alarm, the BMC records alarm content and transmits the alarm content to a front-end monitoring interface through a BMC communication port through network communication, and a machine room maintenance worker judges a fault reason by analyzing a BMC feedback log. However, this kind of monitoring method is only suitable for power replacement and maintenance after a fault, and cannot avoid the risk of power failure of the server by predicting the power failure in advance, which greatly affects the operation continuity and safety of the server.
Disclosure of Invention
The invention provides a server power supply reliability early warning method and system, which are used for solving the problems that the stability of a server power supply is influenced due to the lack of accurate fault early warning in the conventional server power supply monitoring strategy.
In order to achieve the purpose, the invention adopts the following technical scheme:
the invention provides a server power supply reliability early warning method in a first aspect, which comprises the following steps:
monitoring state information of the power supply, characteristic parameter information of each component of the power supply and index data of power supply operation respectively;
comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level;
and based on the preset coping strategy corresponding to the risk level, the coping strategy comprises field inspection, and the early warning prompt of the power supply of the server is carried out by combining a field inspection result with the risk level, wherein the field inspection is used for acquiring external environment information of the server.
Further, the method comprises the steps of:
and for the server power supply which sends the early warning prompt, a fault point is positioned and maintained through the troubleshooting robot.
Further, the state information comprises temperature information of the power supply, an output overcurrent signal of the power supply and an overvoltage signal.
Further, the monitoring of the state information specifically includes:
and responding to the state information abnormal alarm reported by the substrate management controller, calling the complex programmable logic device to acquire the state information of the alarm item corresponding to the current abnormal alarm, comparing the state information with the state information acquired by the substrate management controller, and if the comparison result is consistent, forming abnormal information.
Furthermore, the power supply component for monitoring the characteristic parameter information comprises a power factor correction feedback circuit, a diode circuit, a communication optocoupler, each drive chip and a standby control circuit.
Further, the monitoring of the characteristic parameter information specifically includes:
the complex programmable logic device polls the characteristic parameter information in the server power register, and the characteristic parameter information is collected in real time through a sensor;
and comparing the characteristic parameter information with a preset value, recording the occurrence times of abnormal information, marking the times according to a preset rule, and taking the marked value as the abnormal information.
Further, the index data includes output power consumption of the power supply, output current, and a voltage value of the output signal.
Further, the monitoring of the index data specifically includes:
the method comprises the steps that a complex programmable logic device polls index data in a server power register, and the index data are obtained and/or calculated in real time through a power chip;
and comparing the index data with a preset value, recording the occurrence times of abnormal information, marking the times according to a preset rule, and taking the marked value as the abnormal information.
The invention provides a system for early warning of reliability of a server power supply, which comprises:
the power supply on-line monitoring module is used for respectively monitoring the state information of the power supply, the characteristic parameter information of each component of the power supply and the index data of the power supply operation;
the reliability early warning module is used for comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level;
and the data center machine room control module is used for presetting a coping strategy based on the risk level correspondence, the coping strategy comprises field inspection, and early warning prompt of a server power supply is carried out by combining a field inspection result and the risk level, and the field inspection is used for acquiring external environment information of the server.
Furthermore, the system further comprises a server power supply maintenance module, and the server power supply maintenance module is used for positioning a fault point and maintaining the server power supply which sends the early warning prompt according to a preset strategy through the fault maintenance robot.
The server power reliability early warning system of the second aspect of the present invention can implement the methods of the first aspect and the implementation manners of the first aspect, and achieve the same effects.
The effect provided in the summary of the invention is only the effect of the embodiment, not all the effects of the invention, and one of the above technical solutions has the following advantages or beneficial effects:
the invention sets multi-directional monitoring on the server power supply, which comprises state information, index data and characteristic parameters, polling monitoring is carried out on the server power supply, accurate power supply state information is obtained through CPLD verification on the state information monitored by the BMC, the condition that the error alarm occurs in the single-mode monitoring of the existing BMC is avoided, and the accuracy of early warning is ensured. The early warning power supply is positioned and overhauled through the machine room robot, so that the influence of personnel entering a machine room and the machine room environment is avoided, and the labor cost is saved.
Drawings
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.
FIG. 1 is a schematic flow diagram of an embodiment of the method of the present invention;
fig. 2 is a schematic structural diagram of an embodiment of the system of the present invention.
Detailed Description
In order to clearly explain the technical features of the present invention, the following detailed description of the present invention is provided with reference to the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. To simplify the disclosure of the present invention, the components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and procedures are omitted so as to not unnecessarily limit the invention.
The embodiment of the invention provides a server power supply reliability early warning method, which comprises the following steps:
s1, monitoring state information of a power supply, characteristic parameter information of each part of the power supply and index data of power supply operation respectively;
s2, comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level;
and S3, corresponding to preset coping strategies based on the risk levels, wherein the coping strategies comprise field inspection, and early warning prompt of a server power supply is carried out by combining a field inspection result and the risk levels, and the field inspection is used for obtaining external environment information of the server.
In one implementation manner of the embodiment of the present invention, the method further includes the steps of:
and for the server power supply which sends the early warning prompt, a fault point is positioned and maintained through the troubleshooting robot.
In step S1, the state information includes temperature information of the power supply, an overcurrent signal output by the power supply, and an overvoltage signal.
The monitoring of the state information specifically comprises: the server BMC polls the power state information of the server and compares the power state information with the specification value, if the power state information does not meet the requirement of the specification value, the BMC displays the power alarm state, reads the parameter information of the alarm power supply of the server through the CPLD based on the power alarm state, and compares the parameter information with the alarm information fed back by the BMC. If the comparison results are different, the instruction is issued to command the BMC to read again and feed back until the comparison results are the same. And if the fault alarm information is matched with the fault alarm information fed back by the BMC, determining the fault alarm information to be abnormal information.
The CPLD real-time polling server power supply register is characterized in that characteristic parameter (real-time voltage values and working states of a PFC feedback circuit, a PFC OVP detection loop and the like, working temperatures and states of a key circuit diode, a communication optical coupler, an isolation drive IC, a standby control chip and a standby integrated chip) index data of each power supply component are collected by a power supply sensor in real time, the CPLD records parameter data of each key component of the power supply, analyzes the key parameter data of each power supply component and compares the key parameter data with a specification interval. If the polling data is within the range of the standard specification interval, the power state output is 00; if the polling data exceeds the range of the standard specification interval, the power state output of each power supply abnormal component is 01; if the polling data of the same component or the same characteristic parameter continuously exceeds the range of the standard specification interval for 3 times, the power state output of each power supply abnormal component is 10.
The CPLD monitors index data such as output power consumption, output current, voltage (12V, vingiod, alert and PG) of a server power supply in real time, the index data are obtained through a power supply register, and a power supply chip obtains and/or calculates the index data in real time and stores the index data in the power supply register; and the CPLD polls and records the data of each parameter and compares the data with the specification value. In order to ensure the power supply redundancy of the whole machine, the output power consumption of the power supply of the machine room server is less than 50 percent (within the standard specification range) of the rated power of the power supply; in order to ensure the stable power supply of the whole machine, the output current of 2 power supplies of the power supply of the machine room server meets the current sharing requirement (the standard specification range is less than 20% of load, the current sharing degree is less than 10%, and the current sharing degree is less than 5% of the load of 20% or more); in order to ensure the power supply and communication reliability of the server power supply, the output signal quality of 12V (standard specification range: 12.0V-12.8V), vingiod (standard specification range: 2.4V-3.46V), alert (standard specification range: 2.4V-3.46V) and PG (standard specification range: 2.4V-3.46V) of the server power supply is within the specification range. If the polling data is within the range of the standard specification interval, the power state output is 00; if the polling data exceeds the range of the standard specification interval, the output of each abnormal state of each power supply is 01; if the polling data exceeds the range of the standard specification interval for 3 times continuously, the power state output is 10. The server power supply state online monitoring module collects the power supply alarm information and the power supply state output value, and transmits the collected information to the server power supply reliability early warning module.
In step S2, alarm information and power state output values of each power supply of the server transmitted by the server power state online monitoring module are received and summarized, a power reliability state general table of the server in the machine room is generated, and the server power supplies are divided into a low risk area (power state output value 00), a medium risk area (power state output value less than 10), and a high risk area (power state output value greater than or equal to 10) based on the power reliability state general table information.
For the low-risk area power supply, generating a low-risk power supply list; and for the power supply in the middle risk area, generating a risk power supply list and corresponding alarm information to a data center machine room control module to perform power supply reliability identification analysis. For the power supply in the high risk area, the server power supply reliability early warning module transmits a high risk power supply list and corresponding warning information to the data center machine room control module to perform power supply overhaul flow analysis.
In step S3, for the medium-risk server power supply, the machine room manager needs to determine whether the risk warning information of each power supply is an effective warning that needs to be managed and controlled, and finally, evaluate the early warning level of the server power supply. If the final early warning level of the server power supply is in danger and the site inspection needs exist, the accurate position of a fault server part is located, a server power supply index acquisition command is issued to the automatic machine room maintenance robot through the internet of things wireless transmission technology, the automatic machine room maintenance robot moves to the fault server position according to fault location to shoot, acquire indexes such as video and smell of the fault power supply working state, transmit acquired data to a data center machine room control module, and machine room management personnel check and process feedback information through a visual interface of the control module.
For the high-risk server power supply, the machine room manager judges whether the risk alarm information of each power supply is an effective alarm needing to be controlled, and finally evaluates the early warning level of the server power supply. If the final early warning level of the server power supply is high risk and the need of field power supply replacement exists, a machine room manager issues a power supply replacement field confirmation instruction to the automatic server power supply maintenance module, the automatic server power supply maintenance module receives the need and positions the accurate position of a fault server part, a server power supply index acquisition command is issued to the automatic machine room maintenance robot, the automatic machine room maintenance robot moves to the position of the fault server according to fault positioning to shoot, acquire indexes such as video and smell of the fault power supply working state, and transmits acquired data to the data center machine room control module. And the machine room management personnel check the feedback information and finally confirm the maintenance requirement, issue a formal maintenance instruction, and position-move the machine room automatic maintenance robot to the fault power position to complete the automatic replacement and power-on operation of the fault power through the actions of power line and power plug-pull of the mechanical arm.
As shown in fig. 2, an embodiment of the present invention further provides an early warning system for server power reliability, where the system includes a power online monitoring module 1, a reliability early warning module 2, a data center room control module 3, and a server power overhaul module 4.
The power supply online monitoring module 1 is used for respectively monitoring state information of a power supply, characteristic parameter information of each component of the power supply and index data of power supply operation; the reliability early warning module 2 is used for comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level; the data center machine room control module 3 is used for presetting coping strategies based on the risk level correspondence, the coping strategies comprise field inspection, early warning prompt of a server power supply is carried out by combining a field inspection result and the risk level, and the field inspection is used for obtaining external environment information of the server. And the server power supply maintenance module 4 is used for positioning a fault point and maintaining the server power supply which sends the early warning prompt according to a preset strategy through the fault maintenance robot.
The power supply online monitoring module: on one hand, the server BMC polls the server power supply state information and compares the server power supply state information with the specification value, if the power supply state information does not meet the requirement of the specification value, the BMC displays the power supply alarm state and transmits 10 the power supply alarm state to the power supply online monitoring module. And the power supply online monitoring module receives the abnormal alarm information fed back by the BMC, immediately responds to the abnormal alarm information, reads the parameter information of the alarm power supply of the server through the CPLD, and compares the parameter information with the alarm information fed back by the BMC. If the comparison results are different, an instruction is issued to instruct the BMC to read again and feed back until the comparison results are the same. If the fault alarm information fed back by the BMC is matched with the fault alarm information, the power supply online monitoring module determines that the fault alarm information is correct, and transmits the fault alarm information and the power supply state output value to the reliability early warning module.
On one hand, the CPLD polls characteristic parameter index data (real-time voltage values and working states of a PFC feedback circuit, a PFC OVP detection loop and the like, and working temperatures and states of a key circuit diode, a communication optical coupler, an isolation drive IC, a standby control chip and a standby integrated chip) of each component of the power supply collected by a power supply sensor in a register of the power supply of the server in real time, records parameter data of each key component of the power supply and transmits the parameter data to the power supply online monitoring module, and the server power supply state online monitoring module analyzes the key parameter data of each component of the power supply and compares the key parameter data with a specification interval. If the polling data is within the range of the standard specification interval, the power state output is 00; if the polling data exceeds the range of the standard specification interval, the power state output of each power supply abnormal component is 01; if the polling data of the same component or the same characteristic parameter continuously exceeds the range of the standard specification interval for 3 times, the power state output of each power supply abnormal component is 10. The server power supply state online monitoring module collects the power supply warning information and the power supply state output value and transmits the collected information to the reliability early warning module.
On one hand, the CPLD monitors index data (what the data come) such as output power consumption, output current, output signal voltage (12V, vingiod, alert, PG) and the like of the power supply of the server in real time, polls and records parameter data, and compares the parameter data with a specification value. In order to ensure the power supply redundancy of the whole machine, the output power consumption of the power supply of the machine room server is less than 50 percent (within the range of standard specification) of the rated power of the power supply; in order to ensure the stable power supply of the whole machine, the output current of 2 power supplies of the machine room server power supply meets the current-sharing requirement (the standard specification range is less than 20% of load, the non-uniformity is less than 10%, and the non-uniformity is less than 5% at the load of 20% or above); in order to ensure the power supply and communication reliability of the server power supply, the output signal quality of 12V (standard specification range: 12.0V-12.8V), vingiod (standard specification range: 2.4V-3.46V), alert (standard specification range: 2.4V-3.46V) and PG (standard specification range: 2.4V-3.46V) of the server power supply is within the specification range. If the polling data is within the range of the standard specification interval, the power state output is 00; if the polling data exceeds the range of the standard specification interval, the output of each abnormal state of each power supply is 01; if the polling data exceeds the range of the standard specification interval for 3 times continuously, the power state output is 10. The server power supply state online monitoring module collects the power supply warning information and the power supply state output value and transmits the collected information to the reliability early warning module.
The reliability early warning module receives and summarizes the warning information and the power state output value of each power supply of the server transmitted by the power supply online monitoring module to generate a power supply reliability state general table of the server of the machine room, and the power supplies of the server are divided into a low risk area (the power supply state output value is 00), a medium risk area (the power supply state output value is less than 10) and a high risk area (the power supply state output value is more than or equal to 10) based on the power supply reliability state general table information.
For the low-risk area power supply, the reliability early warning module transmits a low-risk power supply list to the data center machine room control module for display. For the power supply in the medium risk area, the reliability early warning module transmits a medium risk power supply list and corresponding warning information to the data center machine room control module to perform power supply reliability identification analysis. For the high-risk area power supply, the reliability early warning module transmits a high-risk power supply list and corresponding warning information to the data center machine room control module to perform power supply overhaul flow analysis.
And the data center machine room control module receives the server power supply risk list and the corresponding warning information fed back by the reliability early warning module, and a machine room manager checks the machine room server power supply risk list and the corresponding warning information through a visual interface of the data center machine room control module.
For the medium-risk server power supply, the machine room management personnel need to judge whether the risk alarm information of each power supply is an effective alarm needing to be managed and controlled, and finally evaluate the early warning level of the server power supply. If the final early warning level of the server power supply is in danger and the on-site inspection is needed, then the machine room management personnel issues an inspection instruction to the maintenance module, the maintenance module receives the requirement and positions the accurate position of a fault server part, a server power supply index acquisition command is issued to the machine room automatic maintenance robot through the Internet of things wireless transmission technology, the machine room automatic maintenance robot moves to the fault server position according to fault positioning to shoot the working state of the fault power supply, video, odor and other index acquisition are carried out, acquired data are transmitted to the data center machine room control module, and the machine room management personnel check and process the feedback information through a visual interface of the control module.
For high-risk server power supplies, machine room management personnel need to judge whether risk alarm information of each power supply is effective alarm needing management and control, and finally evaluate the early warning level of the server power supply. If the final early warning level of the server power supply is high risk and the need of replacing the power supply on site exists, machine room management personnel issue a power supply replacement site confirmation instruction to the maintenance module, the maintenance module receives the need and positions the accurate position of the fault server part, issue a server power supply index acquisition command to the machine room automatic maintenance robot, the machine room automatic maintenance robot moves to the fault server position according to fault positioning to carry out index acquisition such as photographing, video and smell on the working state of the fault power supply, and transmits the acquired data to the data center machine room control module. And the machine room management personnel check the feedback information and finally confirm the maintenance requirement, issue a formal maintenance instruction, and position-move the machine room automatic maintenance robot to the fault power position to complete the automatic replacement and power-on operation of the fault power through the actions of power line and power plug-pull of the mechanical arm.
The scheme can also realize automatic monitoring, early warning and overhauling of the reliability of the power supply of the machine room server.
Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive changes in the technical solutions of the present invention.

Claims (10)

1. A server power supply reliability early warning method is characterized by comprising the following steps:
monitoring state information of the power supply, characteristic parameter information of each component of the power supply and index data of power supply operation respectively;
comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level;
and presetting a coping strategy based on the risk level correspondence, wherein the coping strategy comprises field inspection, and early warning prompt of a server power supply is carried out by combining a field inspection result and the risk level, and the field inspection is used for acquiring external environment information of the server.
2. The early warning method for the reliability of the power supply of the server as claimed in claim 1, wherein the method further comprises the steps of:
and for the server power supply which sends the early warning prompt, a fault point is positioned and maintained through the troubleshooting robot.
3. The server power supply reliability early warning method according to claim 1 or 2, wherein the state information comprises temperature information of a power supply, a power supply output overcurrent signal and an overvoltage signal.
4. The early warning method for the reliability of the server power supply according to claim 3, wherein the monitoring of the state information specifically comprises:
and responding to the state information abnormal alarm reported by the substrate management controller, calling the complex programmable logic device to acquire the state information of the alarm item corresponding to the current abnormal alarm, comparing the state information with the state information acquired by the substrate management controller, and if the comparison result is consistent, forming abnormal information.
5. The early warning method for the reliability of the server power supply according to claim 1 or 2, wherein the power supply component for monitoring the characteristic parameter information comprises a power factor correction feedback circuit, a diode circuit, a communication optocoupler, each drive chip and a standby control circuit.
6. The early warning method for the reliability of the server power supply according to claim 5, wherein the monitoring of the characteristic parameter information specifically comprises:
the complex programmable logic device polls the characteristic parameter information in the server power register, and the characteristic parameter information is collected in real time through a sensor;
and comparing the characteristic parameter information with a preset value, recording the occurrence times of abnormal information, marking the times according to a preset rule, and taking the marked value as the abnormal information.
7. The server power supply reliability early warning method according to claim 1 or 2, wherein the index data includes output power consumption, output current and voltage value of the output signal of the power supply.
8. The early warning method for the reliability of the server power supply according to claim 7, wherein the monitoring of the index data specifically comprises:
the method comprises the steps that a complex programmable logic device polls index data in a server power register, and the index data are obtained and/or calculated in real time through a power chip;
and comparing the index data with a preset value, recording the occurrence times of abnormal information, marking the times according to a preset rule, and taking the marked value as the abnormal information.
9. The early warning system of server power reliability is characterized in that the system comprises:
the power supply on-line monitoring module is used for respectively monitoring the state information of the power supply, the characteristic parameter information of each part of the power supply and the index data of the power supply operation;
the reliability early warning module is used for comparing the monitored abnormal information with a preset abnormal value to obtain a corresponding risk level;
and the data center machine room control module is used for presetting a coping strategy based on the risk level correspondence, the coping strategy comprises field inspection, and early warning prompt of a server power supply is carried out by combining a field inspection result and the risk level, and the field inspection is used for acquiring external environment information of the server.
10. The early warning system for the reliability of the server power supply according to claim 9, further comprising a server power supply maintenance module, wherein the server power supply maintenance module locates a fault point for the server power supply which sends the early warning prompt through a fault maintenance robot and maintains the fault point according to a preset strategy.
CN202210902093.1A 2022-07-28 2022-07-28 Early warning method and system for reliability of server power supply Pending CN115237719A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210902093.1A CN115237719A (en) 2022-07-28 2022-07-28 Early warning method and system for reliability of server power supply

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210902093.1A CN115237719A (en) 2022-07-28 2022-07-28 Early warning method and system for reliability of server power supply

Publications (1)

Publication Number Publication Date
CN115237719A true CN115237719A (en) 2022-10-25

Family

ID=83677338

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210902093.1A Pending CN115237719A (en) 2022-07-28 2022-07-28 Early warning method and system for reliability of server power supply

Country Status (1)

Country Link
CN (1) CN115237719A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116578461A (en) * 2023-04-21 2023-08-11 广东云下汇金科技有限公司 Warning device and method for abnormal state of data center server

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116578461A (en) * 2023-04-21 2023-08-11 广东云下汇金科技有限公司 Warning device and method for abnormal state of data center server
CN116578461B (en) * 2023-04-21 2024-04-05 广东云下汇金科技有限公司 Warning device and method for abnormal state of data center server

Similar Documents

Publication Publication Date Title
CN110119128B (en) Monitoring management system for laboratory electrical equipment
CN103425116B (en) A kind of electric power scheduling automatization system troubleshooting methodology
CN115102290A (en) Real-time safety early warning system of smart power grids
CN112987696A (en) Regional power distribution network equipment management platform and operation method thereof
CN115237719A (en) Early warning method and system for reliability of server power supply
CN114594349B (en) Direct current insulation monitoring method and terminal in energy storage system
CN116755964A (en) Fault prediction and health management system for reinforcement server
CN113110378A (en) Terminal control machine and intelligent detection and diagnosis method
CN117221145A (en) Equipment fault predictive maintenance system based on Internet of things platform
CN112541011A (en) Power grid terminal fault processing method and device based on RPA and AI
CN114265324B (en) Method and device for monitoring running state of equipment and terminal equipment
CN116820820A (en) Server fault monitoring method and system
CN111338891A (en) Fan stability testing method and device
CN113344739B (en) Data safety supervision method and system for automatically checking power failure
CN116088438A (en) Intelligent electric control system and method of slitting machine
CN109633507A (en) Voltage monitoring instrument with self-diagnostic function
CN112445680A (en) Remote monitoring and overhauling system and method for server of data center machine room
CN112990503A (en) Power equipment maintenance management system
CN116736027B (en) Equipment fault early warning system for medical self-service terminal
CN115277353B (en) Remote fault active and passive early warning method for intelligent cabinet
CN113469453B (en) Elevator evaluation method and elevator evaluation device based on information physical system
CN113640618B (en) Power distribution station house monitoring method and terminal
CN116708560B (en) Data exchange method of intelligent building and energy efficiency monitoring system based on Internet of things
Guan et al. Fault Handling of Digital System in Nuclear Power Plants
CN116185158A (en) Method and device for determining power failure of server, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination