CN103593276A - Fault diagnosis method for server in power-down state - Google Patents

Fault diagnosis method for server in power-down state

Info

Publication number
CN103593276A
Authority
CN
China
Prior art keywords
server
bmc
power
led light
fault diagnosis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310576783.3A
Other languages
Chinese (zh)
Inventor
刘宝阳
平原
张锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd
Priority to CN201310576783.3A
Publication of CN103593276A

Landscapes

  • Test And Diagnosis Of Digital Computers (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a fault diagnosis method for a server in the power-down state. The server's baseboard management controller (BMC) monitors the operating state of the server and uses the LED indicators assigned to the individual components for fault diagnosis: when a server component fails, the BMC lights the LED indicator of that component. A USB interface is provided on the front panel of the server by routing the USB interface of the BMC monitoring chip to the front panel of the chassis, so that an external USB mobile power source can power the BMC and fault diagnosis remains available in the power-down state. When a fault occurs in the server, the BMC records the corresponding fault state in EEPROM, where it is stored persistently until the fault state changes, at which point the BMC saves the new fault state. When the server is powered down, the USB mobile power source powers the BMC monitoring chip; the BMC detects that it is powered by the USB mobile power source and lights the component LED indicators according to the server fault state stored in EEPROM, so that fault diagnosis can still be completed while the server is powered down.

Description

A method for fault diagnosis of a server in the power-down state
Technical field
The present invention relates to the field of server fault diagnosis, and in particular to a method for fault diagnosis of a server in the power-down state.
Technical background
With the development of high-performance computing technology, the number of components in a server keeps growing, and monitoring and diagnosing faults in those components is increasingly urgent. The server's monitoring and management unit, the BMC (Baseboard Management Controller), is responsible for monitoring the state of each server component. As the number of components grows, however, the LED indicators used for component fault diagnosis number 30 to 40, and more than 60 on high-performance servers. The front panel cannot accommodate the LED indicators that server fault diagnosis requires, so the fault diagnosis LEDs can only be placed on the mainboard. This raises another problem: when a technician diagnoses a fault on site, the server must be taken out of the rack and its chassis cover opened, and these operations require disconnecting the server's power. On a traditional server the fault state is cleared once power is lost, so the fault state from when the server was running cannot be reproduced and the fault cannot be diagnosed.
As server components multiply and complexity rises, diagnosing a fault after it occurs takes longer and longer. Traditional servers cannot meet the requirements of component fault diagnosis, which places new demands on server fault diagnosis. A method for fault diagnosis of a server in the power-down state is therefore needed.
Summary of the invention
The technical problem to be solved by the present invention is to provide a method for fault diagnosis of a server in the power-down state.
The technical solution adopted by the present invention is as follows. The fault diagnosis described here is centered on the server's BMC (Baseboard Management Controller).
In this method for fault diagnosis of a server in the power-down state, the server uses the LED indicators corresponding to its components for fault diagnosis. The server management controller (BMC) monitors the operating state of the server, and when a server component fails, the BMC lights the LED indicator corresponding to that component. A USB interface is provided on the server's front panel: the USB interface of the BMC monitoring chip is routed to the front panel of the chassis so that an external USB mobile power source can power the BMC, providing fault diagnosis capability in the power-down state. During normal operation, the BMC monitors the operating state of the server; when the server fails, the BMC records the corresponding fault state in EEPROM, where it is stored persistently until the fault state changes, whereupon the fault state is saved again. In the server power-down state, the USB mobile power source powers the BMC monitoring chip; the BMC detects that it is powered by the USB mobile power source and lights the LED indicators of the corresponding components according to the server fault state stored in EEPROM. This achieves fault diagnosis of a powered-down server and makes up for the insufficient fault diagnosis capability of traditional servers.
The method flow is as follows:
First, during normal operation of the server, the BMC monitors the operating state of each server component; when a component fails, the BMC lights the LED indicator of that component.
Second, at diagnosis time, the server's power is disconnected and all component LED indicators go out. The technician then plugs a USB mobile power source into the battery interface on the server's front panel. The mobile power source powers only the BMC chip and its peripheral circuits. During startup the BMC detects that it is battery powered, uses the LED indicators to show the technician the fault state recorded while the server was running on mains power, and sets a flag marking the fault state as diagnosed.
Finally, after diagnosis is complete, mains power is reconnected. The BMC detects that it is powered by the server's power supply and no longer lights LED indicators according to the saved fault state, which avoids confusing the customer with wrongly lit LEDs. At the same time, the BMC detects the diagnosed flag, clears the fault state stored in EEPROM, and clears the flag.
After fault diagnosis, the server runs normally.
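The boot-time decision in the flow above can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the firmware described by the applicant: the names `PowerSource`, `Eeprom`, `Bmc`, `boot`, and `report_fault` are hypothetical, the EEPROM is modeled as a plain Python object, and how the BMC senses its power source is not specified in the text, so the source is simply passed in as an argument.

```python
from dataclasses import dataclass, field
from enum import Enum


class PowerSource(Enum):
    MAINS = "mains"      # normal server power supply
    USB_BATTERY = "usb"  # external USB mobile power source on the front panel


@dataclass
class Eeprom:
    """Hypothetical stand-in for the non-volatile store; survives power loss."""
    fault_state: frozenset = frozenset()
    diagnosed: bool = False  # the "diagnosed" flag from the method flow


@dataclass
class Bmc:
    eeprom: Eeprom
    lit_leds: set = field(default_factory=set)  # volatile, lost on power-down

    def boot(self, source: PowerSource) -> None:
        self.lit_leds = set()  # every LED is dark after a power loss
        if source is PowerSource.USB_BATTERY:
            # Battery power: replay the saved fault state and mark it diagnosed.
            self.lit_leds = set(self.eeprom.fault_state)
            self.eeprom.diagnosed = True
        elif self.eeprom.diagnosed:
            # Mains power after a diagnosis: do not replay, clear state and flag.
            self.eeprom.fault_state = frozenset()
            self.eeprom.diagnosed = False

    def report_fault(self, component: str) -> None:
        # Normal operation: light the LED and persist the new fault state.
        self.lit_leds.add(component)
        self.eeprom.fault_state = self.eeprom.fault_state | {component}


if __name__ == "__main__":
    eeprom = Eeprom()
    running = Bmc(eeprom)
    running.boot(PowerSource.MAINS)
    running.report_fault("DIMM3")        # hypothetical component name

    on_battery = Bmc(eeprom)             # power pulled, BMC restarts on battery
    on_battery.boot(PowerSource.USB_BATTERY)
    print(sorted(on_battery.lit_leds))

    back_on_mains = Bmc(eeprom)          # mains power reconnected
    back_on_mains.boot(PowerSource.MAINS)
    print(sorted(back_on_mains.lit_leds), eeprom.diagnosed)
```

Creating a fresh `Bmc` object for each boot mimics the loss of volatile state, while the shared `Eeprom` object carries the fault state and the diagnosed flag across power cycles.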
Beneficial effect of the present invention is:
As high-performance servers are deployed more widely, they use more and more components, their complexity rises, and diagnosing a fault after it occurs takes longer and longer, so automatic fault monitoring and diagnosis of components is essential. The present invention allows the server's fault diagnosis function to continue while the server is powered down, making up for the shortcomings of traditional server fault diagnosis methods and making it better suited to high-performance computer applications; it therefore has broad room for development.
Embodiment
The invention is described in detail below with reference to embodiments.
Embodiment 1:
In a method for fault diagnosis of a server in the power-down state, the server management controller (BMC) monitors the operating state of the server and uses the LED indicators corresponding to the components for fault diagnosis; when a server component fails, the BMC lights the LED indicator corresponding to that component. A USB interface is provided on the server's front panel: the USB interface of the BMC monitoring chip is routed to the front panel of the chassis so that an external USB mobile power source can power the BMC, providing fault diagnosis capability in the power-down state. During normal operation, the BMC monitors the operating state of the server; when the server fails, the BMC records the corresponding fault state in EEPROM, where it is stored persistently until the fault state changes, whereupon the fault state is saved again. In the server power-down state, the USB mobile power source powers the BMC monitoring chip; the BMC detects that it is powered by the USB mobile power source and lights the LED indicators of the corresponding components according to the server fault state stored in EEPROM.
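One way the fault state in this embodiment might be stored is as a bitmap with one bit per LED indicator, rewritten only when the state changes. The sketch below is an assumption for illustration only: the text does not describe the EEPROM layout, the figure of 64 LEDs is chosen to cover the "more than 60" case, and `encode_faults`, `decode_faults`, and `FaultStore` are hypothetical names, with a `bytearray` standing in for the EEPROM cells.

```python
NUM_LEDS = 64  # assumption: enough bits for the "more than 60" LED case


def encode_faults(faulty: set) -> bytes:
    """Pack a set of LED indices into a fixed-size little-endian bitmap."""
    bits = 0
    for index in faulty:
        if not 0 <= index < NUM_LEDS:
            raise ValueError(f"LED index out of range: {index}")
        bits |= 1 << index
    return bits.to_bytes(NUM_LEDS // 8, "little")


def decode_faults(raw: bytes) -> set:
    """Recover the set of LED indices from the bitmap."""
    bits = int.from_bytes(raw, "little")
    return {i for i in range(NUM_LEDS) if (bits >> i) & 1}


class FaultStore:
    """Hypothetical EEPROM wrapper that writes only when the state changes."""

    def __init__(self) -> None:
        self.cells = bytearray(NUM_LEDS // 8)  # simulated EEPROM contents
        self.write_count = 0

    def save(self, faulty: set) -> bool:
        raw = encode_faults(faulty)
        if raw == bytes(self.cells):
            return False  # unchanged fault state, skip the write
        self.cells[:] = raw
        self.write_count += 1
        return True

    def load(self) -> set:
        return decode_faults(bytes(self.cells))


if __name__ == "__main__":
    store = FaultStore()
    store.save({3, 61})   # fault appears: written
    store.save({3, 61})   # same state polled again: not written
    print(store.write_count, sorted(store.load()))
```

Skipping writes for an unchanged state matches the "saved again only when the fault state changes" behavior in the text, and it also limits wear, since EEPROM cells tolerate a finite number of write cycles.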
Embodiment 2:
On the basis of Embodiment 1, the method flow of this embodiment is as follows:
First, during normal operation of the server, the BMC monitors the operating state of each server component; when a component fails, the BMC lights the LED indicator of that component.
Second, at diagnosis time, the server's power is disconnected and all component LED indicators go out. The technician then plugs a USB mobile power source into the battery interface on the server's front panel. The mobile power source powers only the BMC chip and its peripheral circuits. During startup the BMC detects that it is battery powered, uses the LED indicators to show the technician the fault state recorded while the server was running on mains power, and sets a flag marking the fault state as diagnosed.
Finally, after diagnosis is complete, mains power is reconnected. The BMC detects that it is powered by the server's power supply and no longer lights LED indicators according to the saved fault state, which avoids confusing the customer with wrongly lit LEDs. At the same time, the BMC detects the diagnosed flag, clears the fault state stored in EEPROM, and clears the flag.
After fault diagnosis, the server runs normally.

Claims (2)

1. the method for power-down state server failure diagnosis, wherein, server management controller BMC is responsible for monitoring server running status, utilize the LED light that each parts are corresponding to carry out fault diagnosis, when server component breaks down, BMC is responsible for lighting the LED light that parts are corresponding, it is characterized in that: on server front panel, design a USB interface, the USB interface of BMC monitoring chip is guided on the front panel of cabinet, for external USB portable power source, to BMC, power, the ability of fault diagnosis under BMC power-down state is provided; In server normal course of operation, BMC is responsible for the running status of monitoring server, and when server breaks down, BMC is recorded to corresponding malfunction in EEPROM, and persistence, until malfunction changes, is preserved malfunction again; Under server power-down state, use USB portable power source to the power supply of BMC monitoring chip, BMC detects the power supply of USB portable power source, can be according to the server failure state being kept in EEPROM, the LED light of corresponding point highlights part.
2. the method for a kind of power-down state server failure diagnosis according to claim 1, is characterized in that, described method flow is as follows:
First, in server normal course of operation, the running status of each parts of BMC monitoring server, when server component breaks down, the LED light of BMC corresponding point highlights part;
Secondly, during fault diagnosis, server power supply is broken, the LED light of parts is all extinguished, and then, takes out the battery interface position that USB portable power source is inserted in server front panel, portable power source is only given the peripheral circuit power supply of BMC chip and BMC chip, in BMC start-up course, powered battery detected, the malfunction while moving powering on shows fault diagnosis personnel by LED light again, is labeled as diagnostic process simultaneously;
Finally, after having diagnosed, insert power supply, it is Power supply that BMC detects, and no longer according to the malfunction of preserving, lights corresponding LED light, avoids that LED mistake is bright brings puzzlement to client, simultaneously, BMC detects the mark of diagnostic process, and the malfunction being kept in EEPROM is emptied, and removes the mark of diagnostic process simultaneously;
After server failure diagnosis, normally operation.
CN201310576783.3A 2013-11-19 2013-11-19 Fault diagnosis method for server in power-down state Pending CN103593276A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310576783.3A CN103593276A (en) 2013-11-19 2013-11-19 Fault diagnosis method for server in power-down state

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310576783.3A CN103593276A (en) 2013-11-19 2013-11-19 Fault diagnosis method for server in power-down state

Publications (1)

Publication Number Publication Date
CN103593276A (en) 2014-02-19

Family

ID=50083428

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310576783.3A Pending CN103593276A (en) 2013-11-19 2013-11-19 Fault diagnosis method for server in power-down state

Country Status (1)

Country Link
CN (1) CN103593276A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080244296A1 (en) * 2007-03-26 2008-10-02 International Business Machines Corporation Computer system fault detection
CN201515381U (en) * 2009-10-28 2010-06-23 浪潮电子信息产业股份有限公司 Novel server management monitoring system
CN103077103A (en) * 2013-01-18 2013-05-01 浪潮电子信息产业股份有限公司 Off-line diagnosing method for server faults

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199757A (en) * 2014-09-05 2014-12-10 浪潮电子信息产业股份有限公司 Off-line alarming method for fault messages of server system
CN105095058A (en) * 2015-08-27 2015-11-25 浪潮电子信息产业股份有限公司 Design method applied to server offline diagnosis
CN106407090A (en) * 2016-09-23 2017-02-15 郑州云海信息技术有限公司 An optical path diagnosis server state display panel
CN107193701A (en) * 2017-06-06 2017-09-22 郑州云海信息技术有限公司 Server master board and method for diagnosing faults with fault diagnosis functions
CN107688524A (en) * 2017-09-05 2018-02-13 郑州云海信息技术有限公司 A kind of the indicating fault design method and instruction device of being easy to server heat to safeguard
CN108108291A (en) * 2017-12-25 2018-06-01 曙光信息产业(北京)有限公司 A kind of trouble-shooter of server
CN108874598A (en) * 2018-05-24 2018-11-23 郑州云海信息技术有限公司 A kind of memory failure information diagnosis system
CN110994618A (en) * 2020-01-03 2020-04-10 清华大学 Module power supply method of multi-port electric energy router based on high-frequency collection bus
CN110994618B (en) * 2020-01-03 2021-12-07 清华大学 Module power supply method of multi-port electric energy router based on high-frequency collection bus
CN111625389A (en) * 2020-05-28 2020-09-04 山东海量信息技术研究院 VR fault data acquisition method and device and related components
CN111625389B (en) * 2020-05-28 2024-01-19 山东海量信息技术研究院 VR fault data acquisition method and device and related components
CN112463547A (en) * 2020-11-06 2021-03-09 苏州浪潮智能科技有限公司 High-density server system state indicating device and indicating method
CN112596742A (en) * 2020-11-30 2021-04-02 新华三云计算技术有限公司 BMC software upgrading method, device, equipment and machine readable storage medium
CN117792863A (en) * 2024-02-27 2024-03-29 深圳供电局有限公司 Industrial switch field visual fault detection method, system and storage medium
CN117792863B (en) * 2024-02-27 2024-06-18 深圳供电局有限公司 Industrial switch field visual fault detection method, system and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140219