CN103593276A - Fault diagnosis method for server in power-down state - Google Patents
Fault diagnosis method for server in power-down state Download PDFInfo
- Publication number
- CN103593276A CN103593276A CN201310576783.3A CN201310576783A CN103593276A CN 103593276 A CN103593276 A CN 103593276A CN 201310576783 A CN201310576783 A CN 201310576783A CN 103593276 A CN103593276 A CN 103593276A
- Authority
- CN
- China
- Prior art keywords
- server
- bmc
- power
- led light
- fault diagnosis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Test And Diagnosis Of Digital Computers (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention discloses a fault diagnosis method for a server in the power-down state. A server management controller BMC is responsible for monitoring the operation state of the server, and uses LED indicating lamps corresponding to components for carrying out fault diagnosis; when a fault occurs in the components of the server, the BMC is responsible for lightening the LED indicating lamps corresponding to the components, wherein a USB interface is designed to be arranged on a front panel of the server, the USB interface of a BMC monitoring chip is guided to the front panel of a case and is used for enabling an external USB mobile power source to provide power for the BMC and provide the fault diagnosis capability when the BMC is in the power-down state. When the fault occurs in the server, the BMC records the corresponding fault state into EEPROM and stores the fault state permanently until the fault state changes, and the BMC restores the fault state; the USB mobile power source is used for providing power for the BMC monitoring chip when the server is in the power-down state, the BMC detects that power is supplied by the USB mobile power source, lightens the LED indicating lamps of the components according to the server fault state stored in the EEPROM correspondingly, so that the fault diagnosis function is continued to be finished on the condition that the server is in the power-down state.
Description
Technical field
The present invention relates to server failure diagnostic field, be specifically related to a kind of method of power-down state server failure diagnosis.
Technical background
Development along with high-performance calculation machine technology, the parts of server are on the increase, also more and more urgent to the failure monitoring of server component, diagnosis, the monitoring management unit B MC of server (Baseboard Management Controller) is responsible for each parts of server to carry out condition monitoring.But along with increasing of server component, LED light for unit failure diagnosis reaches 30-40, for high-performance server, LED light for fault diagnosis reaches more than 60, the LED light of front panel cannot meet the requirement of server failure diagnosis, so the LED light of fault diagnosis can only be placed on mainboard.There is so again another one problem, when fault diagnosis is carried out to server in fault diagnosis personnel scene, server must be taken out from rack, open the chassis lid of server, these operations must be pulled out the power supply of server, and after traditional server power down, malfunction will be cleared, cannot playback server malfunction during operation, also just cannot reach the object of fault diagnosis.
Along with being on the increase of server component, complexity is more and more higher, and the time that after breaking down, tracing trouble spends is more and more longer, and traditional server cannot meet the requirement of unit failure diagnosis, and just to server failure, diagnosis has proposed new requirement for this.Therefore, the method for a kind of power-down state server failure diagnosis proposed just necessary.
Summary of the invention
The technical problem to be solved in the present invention is: a kind of method that power-down state server failure diagnosis is provided.
The technical solution adopted in the present invention is: the fault diagnosis that the present invention mentions is to take the fault diagnosis that server B MC (Baseboard Management Controller) is core.
A kind of method of power-down state server failure diagnosis, the LED light that each parts of server by utilizing are corresponding is carried out fault diagnosis, server management controller BMC is responsible for monitoring server running status, when server component breaks down, BMC is responsible for lighting the LED light that parts are corresponding, wherein, on server front panel, design a USB interface, the USB interface of BMC monitoring chip is guided on the front panel of cabinet, for external USB portable power source, to BMC, power, the ability of fault diagnosis under BMC power-down state is provided; In server normal course of operation, BMC is responsible for the running status of monitoring server, and when server breaks down, BMC is recorded to corresponding malfunction in EEPROM, and persistence, until malfunction changes, is preserved malfunction again; Under server power-down state, use USB portable power source to power to BMC monitoring chip, BMC detects the power supply of USB portable power source, can be according to the server failure state being kept in EEPROM, the LED light of corresponding point highlights part, reach the object of power-down state server failure diagnosis, made up the defect of traditional server trouble diagnosibility deficiency.
Described method flow is as follows:
First, in server normal course of operation, the running status of each parts of BMC monitoring server, when server component breaks down, the LED light of BMC corresponding point highlights part;
Secondly, during fault diagnosis, server power supply is broken, the LED light of parts is all extinguished, and then, takes out the battery interface position that USB portable power source is inserted in server front panel, portable power source is only given the peripheral circuit power supply of BMC chip and BMC chip, in BMC start-up course, powered battery detected, the malfunction while moving powering on shows fault diagnosis personnel by LED light again, is labeled as diagnostic process simultaneously;
Finally, after having diagnosed, insert power supply, it is Power supply that BMC detects, and no longer according to the malfunction of preserving, lights corresponding LED light, avoids that LED mistake is bright brings puzzlement to client, simultaneously, BMC detects the mark of diagnostic process, and the malfunction being kept in EEPROM is emptied, and removes the mark of diagnostic process simultaneously;
After server failure diagnosis, normally operation.
Beneficial effect of the present invention is:
Application based on high-performance server, parts are used more and more, complexity is more and more higher, after breaking down the time of tracing trouble cost more and more longer, so essential to the fault automatic monitoring of parts, diagnostic function.The invention provides and a kind ofly can under server power-down conditions, continue the fault diagnosis functions of server, made up the deficiency of traditional server method for diagnosing faults, make it be more suitable for high-performance computer application, thereby there is development space very widely.
Embodiment
In conjunction with the embodiments to the detailed description of the invention.
Embodiment 1:
A kind of method of power-down state server failure diagnosis, server management controller BMC is responsible for monitoring server running status, utilize the LED light that each parts are corresponding to carry out fault diagnosis, when server component breaks down, BMC is responsible for lighting the LED light that parts are corresponding, wherein, on server front panel, design a USB interface, the USB interface of BMC monitoring chip is guided on the front panel of cabinet, for external USB portable power source, to BMC, power, the ability of fault diagnosis under BMC power-down state is provided; In server normal course of operation, BMC is responsible for the running status of monitoring server, and when server breaks down, BMC is recorded to corresponding malfunction in EEPROM, and persistence, until malfunction changes, is preserved malfunction again; Under server power-down state, use USB portable power source to the power supply of BMC monitoring chip, BMC detects the power supply of USB portable power source, can be according to the server failure state being kept in EEPROM, the LED light of corresponding point highlights part.
Embodiment 2:
On the basis of embodiment 1, method flow is as follows described in the present embodiment:
First, in server normal course of operation, the running status of each parts of BMC monitoring server, when server component breaks down, the LED light of BMC corresponding point highlights part;
Secondly, during fault diagnosis, server power supply is broken, the LED light of parts is all extinguished, and then, takes out the battery interface position that USB portable power source is inserted in server front panel, portable power source is only given the peripheral circuit power supply of BMC chip and BMC chip, in BMC start-up course, powered battery detected, the malfunction while moving powering on shows fault diagnosis personnel by LED light again, is labeled as diagnostic process simultaneously;
Finally, after having diagnosed, insert power supply, it is Power supply that BMC detects, and no longer according to the malfunction of preserving, lights corresponding LED light, avoids that LED mistake is bright brings puzzlement to client, simultaneously, BMC detects the mark of diagnostic process, and the malfunction being kept in EEPROM is emptied, and removes the mark of diagnostic process simultaneously;
After server failure diagnosis, normally operation.
Claims (2)
1. the method for power-down state server failure diagnosis, wherein, server management controller BMC is responsible for monitoring server running status, utilize the LED light that each parts are corresponding to carry out fault diagnosis, when server component breaks down, BMC is responsible for lighting the LED light that parts are corresponding, it is characterized in that: on server front panel, design a USB interface, the USB interface of BMC monitoring chip is guided on the front panel of cabinet, for external USB portable power source, to BMC, power, the ability of fault diagnosis under BMC power-down state is provided; In server normal course of operation, BMC is responsible for the running status of monitoring server, and when server breaks down, BMC is recorded to corresponding malfunction in EEPROM, and persistence, until malfunction changes, is preserved malfunction again; Under server power-down state, use USB portable power source to the power supply of BMC monitoring chip, BMC detects the power supply of USB portable power source, can be according to the server failure state being kept in EEPROM, the LED light of corresponding point highlights part.
2. the method for a kind of power-down state server failure diagnosis according to claim 1, is characterized in that, described method flow is as follows:
First, in server normal course of operation, the running status of each parts of BMC monitoring server, when server component breaks down, the LED light of BMC corresponding point highlights part;
Secondly, during fault diagnosis, server power supply is broken, the LED light of parts is all extinguished, and then, takes out the battery interface position that USB portable power source is inserted in server front panel, portable power source is only given the peripheral circuit power supply of BMC chip and BMC chip, in BMC start-up course, powered battery detected, the malfunction while moving powering on shows fault diagnosis personnel by LED light again, is labeled as diagnostic process simultaneously;
Finally, after having diagnosed, insert power supply, it is Power supply that BMC detects, and no longer according to the malfunction of preserving, lights corresponding LED light, avoids that LED mistake is bright brings puzzlement to client, simultaneously, BMC detects the mark of diagnostic process, and the malfunction being kept in EEPROM is emptied, and removes the mark of diagnostic process simultaneously;
After server failure diagnosis, normally operation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310576783.3A CN103593276A (en) | 2013-11-19 | 2013-11-19 | Fault diagnosis method for server in power-down state |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310576783.3A CN103593276A (en) | 2013-11-19 | 2013-11-19 | Fault diagnosis method for server in power-down state |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103593276A true CN103593276A (en) | 2014-02-19 |
Family
ID=50083428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310576783.3A Pending CN103593276A (en) | 2013-11-19 | 2013-11-19 | Fault diagnosis method for server in power-down state |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103593276A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104199757A (en) * | 2014-09-05 | 2014-12-10 | 浪潮电子信息产业股份有限公司 | Off-line alarming method for fault messages of server system |
CN105095058A (en) * | 2015-08-27 | 2015-11-25 | 浪潮电子信息产业股份有限公司 | Design method applied to server offline diagnosis |
CN106407090A (en) * | 2016-09-23 | 2017-02-15 | 郑州云海信息技术有限公司 | An optical path diagnosis server state display panel |
CN107193701A (en) * | 2017-06-06 | 2017-09-22 | 郑州云海信息技术有限公司 | Server master board and method for diagnosing faults with fault diagnosis functions |
CN107688524A (en) * | 2017-09-05 | 2018-02-13 | 郑州云海信息技术有限公司 | A kind of the indicating fault design method and instruction device of being easy to server heat to safeguard |
CN108108291A (en) * | 2017-12-25 | 2018-06-01 | 曙光信息产业(北京)有限公司 | A kind of trouble-shooter of server |
CN108874598A (en) * | 2018-05-24 | 2018-11-23 | 郑州云海信息技术有限公司 | A kind of memory failure information diagnosis system |
CN110994618A (en) * | 2020-01-03 | 2020-04-10 | 清华大学 | Module power supply method of multi-port electric energy router based on high-frequency collection bus |
CN111625389A (en) * | 2020-05-28 | 2020-09-04 | 山东海量信息技术研究院 | VR fault data acquisition method and device and related components |
CN112463547A (en) * | 2020-11-06 | 2021-03-09 | 苏州浪潮智能科技有限公司 | High-density server system state indicating device and indicating method |
CN112596742A (en) * | 2020-11-30 | 2021-04-02 | 新华三云计算技术有限公司 | BMC software upgrading method, device, equipment and machine readable storage medium |
CN117792863A (en) * | 2024-02-27 | 2024-03-29 | 深圳供电局有限公司 | Industrial switch field visual fault detection method, system and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080244296A1 (en) * | 2007-03-26 | 2008-10-02 | International Business Machines Corporation | Computer system fault detection |
CN201515381U (en) * | 2009-10-28 | 2010-06-23 | 浪潮电子信息产业股份有限公司 | Novel server management monitoring system |
CN103077103A (en) * | 2013-01-18 | 2013-05-01 | 浪潮电子信息产业股份有限公司 | Off-line diagnosing method for server faults |
-
2013
- 2013-11-19 CN CN201310576783.3A patent/CN103593276A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080244296A1 (en) * | 2007-03-26 | 2008-10-02 | International Business Machines Corporation | Computer system fault detection |
CN201515381U (en) * | 2009-10-28 | 2010-06-23 | 浪潮电子信息产业股份有限公司 | Novel server management monitoring system |
CN103077103A (en) * | 2013-01-18 | 2013-05-01 | 浪潮电子信息产业股份有限公司 | Off-line diagnosing method for server faults |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104199757A (en) * | 2014-09-05 | 2014-12-10 | 浪潮电子信息产业股份有限公司 | Off-line alarming method for fault messages of server system |
CN105095058A (en) * | 2015-08-27 | 2015-11-25 | 浪潮电子信息产业股份有限公司 | Design method applied to server offline diagnosis |
CN106407090A (en) * | 2016-09-23 | 2017-02-15 | 郑州云海信息技术有限公司 | An optical path diagnosis server state display panel |
CN107193701A (en) * | 2017-06-06 | 2017-09-22 | 郑州云海信息技术有限公司 | Server master board and method for diagnosing faults with fault diagnosis functions |
CN107688524A (en) * | 2017-09-05 | 2018-02-13 | 郑州云海信息技术有限公司 | A kind of the indicating fault design method and instruction device of being easy to server heat to safeguard |
CN108108291A (en) * | 2017-12-25 | 2018-06-01 | 曙光信息产业(北京)有限公司 | A kind of trouble-shooter of server |
CN108874598A (en) * | 2018-05-24 | 2018-11-23 | 郑州云海信息技术有限公司 | A kind of memory failure information diagnosis system |
CN110994618A (en) * | 2020-01-03 | 2020-04-10 | 清华大学 | Module power supply method of multi-port electric energy router based on high-frequency collection bus |
CN110994618B (en) * | 2020-01-03 | 2021-12-07 | 清华大学 | Module power supply method of multi-port electric energy router based on high-frequency collection bus |
CN111625389A (en) * | 2020-05-28 | 2020-09-04 | 山东海量信息技术研究院 | VR fault data acquisition method and device and related components |
CN111625389B (en) * | 2020-05-28 | 2024-01-19 | 山东海量信息技术研究院 | VR fault data acquisition method and device and related components |
CN112463547A (en) * | 2020-11-06 | 2021-03-09 | 苏州浪潮智能科技有限公司 | High-density server system state indicating device and indicating method |
CN112596742A (en) * | 2020-11-30 | 2021-04-02 | 新华三云计算技术有限公司 | BMC software upgrading method, device, equipment and machine readable storage medium |
CN117792863A (en) * | 2024-02-27 | 2024-03-29 | 深圳供电局有限公司 | Industrial switch field visual fault detection method, system and storage medium |
CN117792863B (en) * | 2024-02-27 | 2024-06-18 | 深圳供电局有限公司 | Industrial switch field visual fault detection method, system and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103593276A (en) | Fault diagnosis method for server in power-down state | |
CN204330370U (en) | The trouble-shooter of air conditioner | |
CN103077103A (en) | Off-line diagnosing method for server faults | |
WO2017084426A1 (en) | Multiple unit (mpu) offline variable monitoring system and method | |
CN106326061A (en) | High-speed cache data processing method and equipment | |
CN109086192B (en) | IPMI-based onboard SATA hard disk lighting system and method | |
CN102013273A (en) | Off-line flash burning device and burning method thereof | |
EP2464041B1 (en) | Detection device and method thereof | |
CN104260677A (en) | Vehicle power supply control circuit and car | |
CN103530265A (en) | Device and method for realizing safe hot plugging of CF card of electronic equipment | |
CN103309791A (en) | Display device with fault diagnosis function | |
CN203561985U (en) | FPGA (field programmable gate array) chip and BMC (baseboard management controller) chip coordinated power management system for ATCA (advanced telecom computing architecture) blade | |
CN104598283A (en) | Method for realizing single-architecture multi-structure BMC firmware program | |
CN111726563A (en) | Video storage device for train video monitoring system | |
CN201345558Y (en) | Off-line UPS | |
CN105511980A (en) | Power failure recording method of high-end fault-tolerant server | |
US8566623B2 (en) | Start-up control apparatus and method | |
CN103995758A (en) | Method for displaying main board fault information in delayed mode | |
CN104699588A (en) | Hard disk state display device | |
CN203786229U (en) | Capacitor-storage battery mixed automobile starting system assembly comprehensive property detection apparatus | |
CN105095058A (en) | Design method applied to server offline diagnosis | |
CN102591441B (en) | Power system | |
CN204629708U (en) | A kind of fuel gas heating apparatus with sound prompt function | |
CN105277749B (en) | A kind of rack assets U positions and fault detection system | |
CN219846618U (en) | Ultrasonic equipment host |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140219 |