CN106557392A - Server failure detection means and method - Google Patents

Server failure detection means and method Download PDF

Info

Publication number
CN106557392A
CN106557392A CN201510634554.1A CN201510634554A CN106557392A CN 106557392 A CN106557392 A CN 106557392A CN 201510634554 A CN201510634554 A CN 201510634554A CN 106557392 A CN106557392 A CN 106557392A
Authority
CN
China
Prior art keywords
server
management controller
baseboard management
basic input
output system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510634554.1A
Other languages
Chinese (zh)
Inventor
黄育成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hongfujin Precision Electronics Tianjin Co Ltd
Original Assignee
Hongfujin Precision Industry Shenzhen Co Ltd
Hon Hai Precision Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hongfujin Precision Industry Shenzhen Co Ltd, Hon Hai Precision Industry Co Ltd filed Critical Hongfujin Precision Industry Shenzhen Co Ltd
Priority to CN201510634554.1A priority Critical patent/CN106557392A/en
Publication of CN106557392A publication Critical patent/CN106557392A/en
Pending legal-status Critical Current

Links

Abstract

A kind of server failure detection means, including a baseboard management controller.The baseboard management controller is used for the execution state of the basic input output system of the reception server, the baseboard management controller is also preset with some presupposed solutions of some fault types of server basic input output system and correspondence some fault types, when the server basic input output system exports Fisrt fault signal, the baseboard management controller identification Fisrt fault signal determines failure for Fisrt fault type and performs corresponding first presupposed solution.The server failure detection means can find fault type and attempt repairing automatically failure.The present invention also provides a kind of server failure detection method.

Description

Server failure detection means and method
Technical field
The present invention relates to a kind of server failure detection means, further relates to a kind of server failure detection method.
Background technology
During startup of server, the central processing unit of server from Serial Peripheral Interface (SPI) chip can be downloaded BIOS and its solution is depressed into Installed System Memory, execution system initialization and self-inspection again afterwards, screen can just be lighted after VBIOS initialization display chips are read during self-inspection, if certain flow process breaks down in this process, such as user cpu frequency setting mistake or VGA display chips setting mistake, as screen not yet power system will be unable to show failure code that user needs the long period solve failure after determining failure cause.
The content of the invention
In consideration of it, being necessary to provide a kind of server failure detection means and method that can feed back failure cause self-healing failure.
A kind of server failure detection means, including:
One baseboard management controller, the baseboard management controller is used for the execution state of the basic input output system of the reception server, the baseboard management controller is also preset with some presupposed solutions of some fault types of server basic input output system and correspondence some fault types, when the server basic input output system exports Fisrt fault signal, the baseboard management controller identification Fisrt fault signal determines failure for Fisrt fault type and performs corresponding first presupposed solution.
A kind of server failure detection method, comprises the following steps:
Server is started shooting;
The signal of the execution state of the basic input output system of server basic input output system output representative server is to baseboard management controller;
The baseboard management controller judges whether the basic input output system of the server performs state abnormal;
If the basic input output system of the server performs abnormal state, the baseboard management controller is called and performs solution corresponding with abnormality.
The server failure detection means can meet with assisting user when failure cannot light screen in BIOS and understand failure cause and automatically attempt to repair failure.
Description of the drawings
Fig. 1 is the block diagram of the better embodiment of server failure detection means of the present invention.
Fig. 2 is the flow chart of the better embodiment of server failure detection method of the present invention.
Main element symbol description
Server 100
Server failure detection means 10
Baseboard management controller 11
Basic input output system 101
Server failure detection method 200
Following specific embodiment will further illustrate the present invention with reference to above-mentioned accompanying drawing.
Specific embodiment
Fig. 1 is refer to, server failure detection means of the present invention 10 is applied in a server 100.The server failure detection means 10 includes baseboard management controller 11.The baseboard management controller 11 is connected with the basic input output system 101 of the server 100.The baseboard management controller 11 is used for the execution state of the basic input output system 101 of the reception server 100.The baseboard management controller 11 is also preset with some presupposed solutions of some fault types of the basic input output system 101 of server 100 and correspondence some fault types.When the basic input output system 101 exports Fisrt fault signal, the identification Fisrt fault signal of the baseboard management controller 11 determines failure for Fisrt fault type and performs corresponding first presupposed solution.
In present embodiment, the baseboard management controller 11 is preset with the fault type of the several basic input output system 101.The fault type includes but is not limited to cabinet and invades failure, CPU initialization failures, cpu frequency setting failure, cpu cache initialization failure, VBIOS initialization failures, internal memory initialization failure, memory size failure, hard disk initialization failure, PCI external equipment failures, USB external equipment failures, VBIOS collapse failures, platform controller initialization failure, node administration controller failure etc..
In present embodiment, when the identification failure of the baseboard management controller 11 is that cabinet invades failure, the baseboard management controller 11 calls the first presupposed solution, the baseboard management controller 11 confirms whether the cabinet is installed correctly, if cabinet installs correct, the baseboard management controller 11 is removed the cabinet and invades failure logging to continue starting procedure.
When the identification failure of the baseboard management controller 11 is CPU initialization failures, depositor related setting value is inserted depositor according to 100 depositor setting table of server by the baseboard management controller 11, and is outputed signal to CPU and restarted pin control CPU and restart.
When the identification failure of the baseboard management controller 11 is that cpu frequency sets failure, the baseboard management controller 11 inquires about now cpu frequency setting, if setting is abnormal reads cpu frequency from the mainboard CMOS ROM of server 100.
When the identification failure of the baseboard management controller 11 is cpu cache initialization failure, the baseboard management controller 11 inquires about the cpu cache setting of the server 100, if setting is abnormal resets caching.
When the identification failure of the baseboard management controller 11 is VBIOS initialization failures, the baseboard management controller 11 determines whether display card, if there is no display card, then video frequency output set of options is by CPU output images by the baseboard management controller 11, if there is display card, display card is restarted.
When the identification failure of the baseboard management controller 11 is internal memory initialization failure, the ROM read on internal memory obtains memory standards information, and the specification information is compared with 100 setting value of server, the setting value of server 100 is revised as if having differences the memory standards information for obtaining.
When the identification failure of the baseboard management controller 11 is memory size failure, the ROM that the baseboard management controller 11 is read on internal memory obtains memory standards information, and the specification information is compared with 100 setting value of server, the setting value of server 100 is revised as if having differences the memory standards information for obtaining.
When the identification failure of the baseboard management controller 11 is hard disk initialization failure, the baseboard management controller 11 checks the controller setting of hard disk.
When the identification failure of the baseboard management controller 11 is PCI external equipment failures, device PCI is scanned and by abnormal device PCI information output to the baseboard management controller 11.
When the identification failure of the baseboard management controller 11 is USB external equipment failures, scans USB device and abnormal USB device information is exported to the baseboard management controller 11, this USB device is disabled and reinitialized.
When the identification failure of the baseboard management controller 11 is that VBIOS collapses failure, the basic input output system 101 reads VBIOS from standby ROM.
When the identification failure of the baseboard management controller 11 is platform controller failure, basic input output system 101 detects whether the platform controller has feedback to judge whether hardware is normal.
When the identification failure of the baseboard management controller 11 is Node Controller failure, the baseboard management controller 11 is tested to Node Controller using IPMB interfaces, and if abnormal, 11 pairs of Node Controllers of the baseboard management controller carry out software upgrading.
Fig. 2 is refer to, the better embodiment of server failure detection method of the present invention 200 includes step S10-S16:
S10:Server is started shooting;
S12:The signal of the execution state of the basic input output system of server basic input output system output representative server is to baseboard management controller;
S14:The baseboard management controller judges whether the basic input output system of the server performs state abnormal;
S16:If the basic input output system of the server performs abnormal state, the baseboard management controller is called and performs solution corresponding with abnormality.
Finally it should be noted that, above example is only to illustrate technical scheme and unrestricted, although being described in detail to the present invention with reference to preferred embodiment, it will be understood by those within the art that, technical scheme can be modified or equivalent, without deviating from the spirit and scope of technical solution of the present invention.

Claims (5)

1. a kind of server failure detection means, including:
One baseboard management controller, the baseboard management controller is connected with server basic input output system, the baseboard management controller is used for the execution state of the basic input output system of the reception server, the baseboard management controller is also preset with some presupposed solutions of some fault types of server basic input output system and correspondence some fault types, when the server basic input output system exports Fisrt fault signal, the baseboard management controller identification Fisrt fault signal determines failure for Fisrt fault type and performs corresponding first presupposed solution.
2. server failure detection means as claimed in claim 1, it is characterised in that:The Fisrt fault type is that cabinet invades failure.
3. server failure detection means as claimed in claim 1, it is characterised in that:When the server basic input output system exports the second fault-signal, the baseboard management controller recognizes that the second fault-signal determines failure for the second fault type and performs corresponding second presupposed solution.
4. server failure detection means as claimed in claim 3, it is characterised in that:Second fault type is CPU initialization failures.
5. a kind of server failure detection method, comprises the following steps:
Server is started shooting;
The signal of the execution state of the basic input output system of server basic input output system output representative server is to baseboard management controller;
The baseboard management controller judges whether the basic input output system of the server performs state abnormal;
If the basic input output system of the server performs abnormal state, the baseboard management controller is called and performs solution corresponding with abnormality.
CN201510634554.1A 2015-09-29 2015-09-29 Server failure detection means and method Pending CN106557392A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510634554.1A CN106557392A (en) 2015-09-29 2015-09-29 Server failure detection means and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510634554.1A CN106557392A (en) 2015-09-29 2015-09-29 Server failure detection means and method

Publications (1)

Publication Number Publication Date
CN106557392A true CN106557392A (en) 2017-04-05

Family

ID=58416060

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510634554.1A Pending CN106557392A (en) 2015-09-29 2015-09-29 Server failure detection means and method

Country Status (1)

Country Link
CN (1) CN106557392A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107885637A (en) * 2017-11-09 2018-04-06 紫光华山信息技术有限公司 A kind of server exception detection method and device
CN108427044A (en) * 2018-01-19 2018-08-21 广州视源电子科技股份有限公司 A kind of test method of failure protection function, device, equipment and storage medium
CN109032880A (en) * 2018-07-26 2018-12-18 郑州云海信息技术有限公司 A kind of hardware adjusting, measuring method and device cooperateing with BIOS self-test
CN110704219A (en) * 2019-09-02 2020-01-17 上海商米科技集团股份有限公司 Hardware fault reporting method and device and computer storage medium
CN117389781A (en) * 2023-10-18 2024-01-12 上海合芯数字科技有限公司 Abnormality detection and recovery method and system for server equipment, server and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102082781A (en) * 2009-11-27 2011-06-01 宏正自动科技股份有限公司 Server management system and method
US20110145634A1 (en) * 2009-12-16 2011-06-16 Nec Corporation Apparatus, a recovery method and a program thereof
CN103473167A (en) * 2013-09-09 2013-12-25 华为技术有限公司 Fault display method and device of server
CN104021054A (en) * 2014-06-11 2014-09-03 浪潮(北京)电子信息产业有限公司 Server fault visual detecting and processing method and system and programmable chip
CN104318879A (en) * 2014-10-20 2015-01-28 京东方科技集团股份有限公司 Display device and display device failure analysis system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102082781A (en) * 2009-11-27 2011-06-01 宏正自动科技股份有限公司 Server management system and method
US20110145634A1 (en) * 2009-12-16 2011-06-16 Nec Corporation Apparatus, a recovery method and a program thereof
CN103473167A (en) * 2013-09-09 2013-12-25 华为技术有限公司 Fault display method and device of server
CN104021054A (en) * 2014-06-11 2014-09-03 浪潮(北京)电子信息产业有限公司 Server fault visual detecting and processing method and system and programmable chip
CN104318879A (en) * 2014-10-20 2015-01-28 京东方科技集团股份有限公司 Display device and display device failure analysis system and method

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107885637A (en) * 2017-11-09 2018-04-06 紫光华山信息技术有限公司 A kind of server exception detection method and device
CN107885637B (en) * 2017-11-09 2019-11-08 新华三信息技术有限公司 A kind of server exception detection method and device
CN108427044A (en) * 2018-01-19 2018-08-21 广州视源电子科技股份有限公司 A kind of test method of failure protection function, device, equipment and storage medium
CN109032880A (en) * 2018-07-26 2018-12-18 郑州云海信息技术有限公司 A kind of hardware adjusting, measuring method and device cooperateing with BIOS self-test
CN110704219A (en) * 2019-09-02 2020-01-17 上海商米科技集团股份有限公司 Hardware fault reporting method and device and computer storage medium
CN110704219B (en) * 2019-09-02 2023-08-22 上海商米科技集团股份有限公司 Hardware fault reporting method and device and computer storage medium
CN117389781A (en) * 2023-10-18 2024-01-12 上海合芯数字科技有限公司 Abnormality detection and recovery method and system for server equipment, server and medium

Similar Documents

Publication Publication Date Title
CN106557392A (en) Server failure detection means and method
US9710255B1 (en) Updating system of firmware of complex programmable logic device and updating method thereof
CN108038019B (en) Automatic fault recovery method and system for substrate management controller
CN107171833B (en) Method for realizing batch upgrading of BMC and BIOS of server through BMC
TW201712543A (en) Method for detecting fault of server and device using the same
WO2019129022A1 (en) Error processing method, apparatus and system for device
US20120303940A1 (en) System, method and program product to manage firmware on a system board
US10783253B2 (en) Hardware structure of a trusted computer and trusted booting method for a computer
CN101853171A (en) On-line upgrade method and device of complicated programmable logical device
FI127566B (en) Rack having multiple rack management modules and firmware updating method for the same
US20170115996A1 (en) Reboot system and method for baseboard management controller
US20180210783A1 (en) Information processing apparatus, control method of the same, and storage medium
CN109408121B (en) EDID reading and configuring method, system and medium
US8826078B2 (en) Computer system and diagnostic method thereof
CN106055440A (en) Testing method and system for realizing abnormal power failure of server through BMC
US10762029B2 (en) Electronic apparatus and detection method using the same
CN107894935B (en) OPS computer module detection processing method and device and electronic equipment
CN111722965A (en) Computer system and debugging method thereof
TWI534609B (en) Automatic scanning and repair method for electronic devices
CN113867812B (en) Method, system, equipment and medium for BMC to acquire link information
CN107423168B (en) Test method and test device
CN115952122A (en) I2C device hot plug method, system, device, medium and product
CN107179911B (en) Method and equipment for restarting management engine
US20200159646A1 (en) Information processing apparatus
CN110908725B (en) Application program starting method and device, electronic equipment and readable medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20180205

Address after: The 300457 Tianjin economic and Technological Development Zone Haiyun Street No. 80

Applicant after: Hongfujin Precision Electronics (Tianjin) Co., Ltd.

Address before: 518109 Guangdong city of Shenzhen province Baoan District Longhua Town Industrial Zone tabulaeformis tenth East Ring Road No. 2 two

Applicant before: Hongfujin Precise Industry (Shenzhen) Co., Ltd.

Applicant before: Hon Hai Precision Industry Co., Ltd.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170405