CN109032867A - Fault diagnosis method, apparatus, and device - Google Patents

Fault diagnosis method, apparatus, and device

Info

Publication number
CN109032867A
Authority
CN
China
Prior art keywords
fault
mainboard device
log
basic input
output system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810826756.XA
Other languages
Chinese (zh)
Inventor
王龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201810826756.XA
Publication of CN109032867A
Legal status: Pending


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/22 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273 Test methods
    • G06F11/2205 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested

Abstract

This application discloses a fault diagnosis method applied to a baseboard management controller (BMC). The method includes: receiving an IPMI command, sent by a basic input/output system (BIOS) controller, that conveys fault information of a mainboard device, the IPMI command being generated by the BIOS controller after it detects a fault in the mainboard device; and parsing the fault information to generate a fault log that a user can retrieve for fault diagnosis. The application enables the BMC to generate fault logs for mainboard devices, such as the CPU, over which the BMC has no direct monitoring permission, so that faults in these devices can be diagnosed. This effectively extends the range of fault diagnosis objects and thereby improves fault diagnosis capability. The application also discloses a fault diagnosis apparatus, a device, and a computer-readable storage medium, which have the same beneficial effects.

Description

Fault diagnosis method, apparatus, and device
Technical field
This application relates to the field of computer technology, and in particular to a fault diagnosis method, apparatus, device, and computer-readable storage medium.
Background
A fault log is an important data file generated by the baseboard management controller (Baseboard Management Controller, BMC) on a server mainboard after it detects that a device has failed. Those skilled in the art can later retrieve and analyze it to perform a detailed fault diagnosis of the system.
However, the baseboard management controller can monitor only certain devices on the mainboard for faults. For important mainboard devices such as the CPU and memory, the baseboard management controller has no monitoring permission, so the prior art cannot yet generate fault logs for mainboard devices that the baseboard management controller does not monitor.
Therefore, how to effectively extend the range of fault diagnosis objects, and thereby improve fault diagnosis capability, is a technical problem that those skilled in the art urgently need to solve.
Summary of the invention
The purpose of this application is to provide a fault diagnosis method, apparatus, device, and computer-readable storage medium, so as to effectively extend the range of fault diagnosis objects and thereby effectively improve fault diagnosis capability.
To solve the above technical problem, this application provides a fault diagnosis method applied to a baseboard management controller, comprising:
Receiving an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device;
Parsing the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
Optionally, after receiving the IPMI command, sent by the basic input/output system controller, for conveying fault information of the mainboard device, the method further comprises:
Obtaining status information of the mainboard device, so that the fault information and the status information are parsed together to generate the fault log.
Optionally, the status information includes any one or any combination of the following:
Temperature information, type information, utilization information.
Optionally, the mainboard device includes any one or any combination of the following:
CPU, memory, PCI-e device.
Optionally, the content of the fault log includes any one or any combination of the following:
Error code, fault level, fault time, fault event type, fault description.
This application also provides a fault diagnosis apparatus applied to a baseboard management controller, comprising:
Receiving module: configured to receive an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device;
Parsing module: configured to parse the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
Optionally, the apparatus further comprises:
Obtaining module: configured to obtain status information of the mainboard device;
The parsing module is specifically configured to:
Parse the fault information and the status information together to generate the fault log, so that a user can retrieve the fault log for fault diagnosis.
Optionally, the mainboard device includes any one or any combination of the following:
CPU, memory, PCI-e device.
This application also provides a fault diagnosis device, comprising:
Memory: configured to store a computer program;
Processor: configured to execute the computer program to implement the steps of any fault diagnosis method described above.
This application also provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the steps of any fault diagnosis method described above.
The fault diagnosis method provided by this application is applied to a baseboard management controller and comprises: receiving an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device; and parsing the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
As can be seen, compared with the prior art, the fault diagnosis method provided by this application makes use of the basic input/output system controller's monitoring permission over mainboard devices such as the CPU: the basic input/output system controller sends the fault information it detects to the baseboard management controller, so that the baseboard management controller can generate fault logs for mainboard devices, such as the CPU, over which it has no direct monitoring permission, and fault diagnosis can be performed. This application effectively extends the range of fault diagnosis objects and thereby effectively improves fault diagnosis capability. The fault diagnosis apparatus, device, and computer-readable storage medium provided by this application can implement the above fault diagnosis method and have the same beneficial effects.
Brief description of the drawings
To describe the technical solutions in the prior art and in the embodiments of this application more clearly, the drawings needed for describing the prior art and the embodiments are briefly introduced below. The drawings described below relate to only some embodiments of this application; those of ordinary skill in the art can derive other drawings from them without creative effort, and such drawings also fall within the protection scope of this application.
Fig. 1 is a flowchart of a fault diagnosis method provided by this application;
Fig. 2 is a structural block diagram of a fault diagnosis apparatus provided by this application.
Detailed description of the embodiments
The core of this application is to provide a fault diagnosis method, apparatus, device, and computer-readable storage medium, so as to effectively extend the range of fault diagnosis objects and thereby effectively improve fault diagnosis capability.
To describe the technical solutions in the embodiments of this application more clearly and completely, they are introduced below with reference to the drawings in the embodiments. The described embodiments are only some, not all, of the embodiments of this application. All other embodiments obtained by those of ordinary skill in the art based on the embodiments in this application without creative effort fall within the protection scope of this application.
Referring to Fig. 1, Fig. 1 is a flowchart of a fault diagnosis method provided by this application. The method is applied to a baseboard management controller (Baseboard Management Controller, BMC) and mainly comprises the following steps:
Step 1: receive an IPMI command, sent by a basic input/output system (Basic Input Output System, BIOS) controller, for conveying fault information of a mainboard device.
The IPMI command is generated by the basic input/output system controller after it detects a fault in the mainboard device.
Step 2: parse the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
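The two steps can be illustrated with a minimal Python sketch. The source does not define the layout of the fault information carried by the IPMI command, so the four-byte payload assumed here (device type, severity, 16-bit error code), the `DEVICE_NAMES` table, and the function names are all hypothetical.

```python
import struct
from datetime import datetime, timezone

# Hypothetical payload layout (not defined by the source or by the IPMI
# specification): 1 byte device type, 1 byte severity, 2 bytes error code
# in little-endian order.
DEVICE_NAMES = {0x01: "CPU", 0x02: "memory", 0x03: "PCI-e device"}


def parse_fault_info(payload: bytes) -> dict:
    """Decode the assumed fault-information bytes into named fields."""
    device, severity, error_code = struct.unpack("<BBH", payload[:4])
    return {
        "device": DEVICE_NAMES.get(device, "unknown"),
        "severity": severity,
        "error_code": error_code,
    }


def handle_ipmi_command(payload: bytes, fault_log: list, now=None) -> dict:
    """Step 1: accept the command data. Step 2: parse it and log an entry."""
    entry = parse_fault_info(payload)
    entry["time"] = (now or datetime.now(timezone.utc)).isoformat()
    fault_log.append(entry)
    return entry
```

Here `fault_log` is a plain list standing in for whatever persistent log the BMC firmware keeps; a user-facing tool would read entries back from that store.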
Specifically, the baseboard management controller is an important control device on the server mainboard and a hardware component defined in the IPMI specification. Even when the machine is not powered on, it can perform baseboard-level control of the machine, including monitoring system status, rebooting, power cycling, and powering off; it can also upgrade certain hardware firmware. For some common components on the mainboard, such as power supplies and fans, the baseboard management controller has direct monitoring permission and can directly obtain their fault information. However, for devices such as the CPU, memory, and PCI-e devices, the baseboard management controller has no monitoring permission, cannot learn whether such a device has failed, and therefore cannot generate a fault log when it does.
For this reason, the fault diagnosis method provided by this application uses the basic input/output system controller to pass fault information of devices such as the CPU to the baseboard management controller. Specifically, the basic input/output system (BIOS) is a set of programs burned into a chip on the server mainboard. It holds the computer's most important basic input/output routines, the power-on self-test program, and the system boot program; its main function is to provide the lowest-level, most direct hardware configuration and control for the computer. For mainboard devices such as the CPU over which the baseboard management controller has no monitoring permission, the BIOS controller does have monitoring permission: it can obtain the fault information of these devices through the relevant BIOS programs and learn whether they have failed.
Therefore, by having the BIOS controller send the detected fault information to the baseboard management controller, the baseboard management controller can learn whether mainboard devices such as the CPU have failed and generate the corresponding fault log once a fault is confirmed.
As those skilled in the art will readily understand, the baseboard management controller communicates using the IPMI standard, so communication between the BIOS controller and the baseboard management controller is likewise carried out through IPMI commands. Those skilled in the art need to develop and configure the relevant IPMI commands for the BIOS controller and the baseboard management controller in advance, so that the two can communicate according to the agreed protocol. This process generally also involves converting the communication data format between the baseboard management controller and the BIOS controller. The specifics can be chosen and configured by those skilled in the art; this application does not limit them.
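One way to picture the agreed data format is a message type that the BIOS side encodes and the BMC side decodes. The following Python sketch is illustrative only: the `FaultMessage` class, its fields, and the five-byte header followed by an ASCII description are assumptions, not a format taken from the source or from the IPMI specification.

```python
import struct
from dataclasses import dataclass

# Hypothetical header: device type, severity, 16-bit error code,
# and the length of the description that follows.
_HEADER = struct.Struct("<BBHB")


@dataclass(frozen=True)
class FaultMessage:
    device: int
    severity: int
    error_code: int
    description: str

    def encode(self) -> bytes:
        """BIOS side: serialize the fault into command data bytes."""
        text = self.description.encode("ascii")
        if len(text) > 255:
            raise ValueError("description too long for a one-byte length field")
        return _HEADER.pack(self.device, self.severity, self.error_code, len(text)) + text

    @classmethod
    def decode(cls, data: bytes) -> "FaultMessage":
        """BMC side: rebuild the fault from the received bytes."""
        device, severity, error_code, length = _HEADER.unpack_from(data)
        text = data[_HEADER.size:_HEADER.size + length]
        if len(text) != length:
            raise ValueError("truncated fault message")
        return cls(device, severity, error_code, text.decode("ascii"))
```

Keeping the encoder and decoder in one definition makes it harder for the two sides to drift apart, which is the practical point of agreeing on the format in advance.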
As can be seen, the fault diagnosis method provided by this application makes use of the basic input/output system controller's monitoring permission over mainboard devices such as the CPU: the basic input/output system controller sends the fault information it detects to the baseboard management controller, so that the baseboard management controller can generate fault logs for mainboard devices, such as the CPU, over which it has no direct monitoring permission, and fault diagnosis can be performed. This application effectively extends the range of fault diagnosis objects and thereby effectively improves fault diagnosis capability.
The fault diagnosis method provided by this application, on the basis of the above embodiment:
As a preferred embodiment, after receiving the IPMI command, sent by the basic input/output system controller, for conveying fault information of the mainboard device, the method further comprises:
Obtaining status information of the mainboard device, so that the fault information and the status information are parsed together to generate the fault log.
Specifically, the fault information obtained by the basic input/output system controller can indicate whether the corresponding mainboard device has failed. Besides fault information, however, information about the operating state of the mainboard device is also commonly used to assess the fault and then generate the fault log. The baseboard management controller usually has direct permission to obtain this operating-state information. Therefore, in the fault diagnosis method provided by this application, the baseboard management controller can obtain the status information of the mainboard device in addition to its fault information, and parse the two together to generate the fault log.
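As a rough illustration of parsing the two kinds of information together, the Python sketch below attaches status readings to a fault record. The source does not say how the combination is performed, so the dictionary keys and the function name are hypothetical.

```python
# Hypothetical key names for the status readings the BMC collects itself.
STATUS_FIELDS = ("temperature_c", "type", "utilization_pct")


def build_fault_log_entry(fault: dict, status: dict) -> dict:
    """Merge BIOS-reported fault information with BMC-read status information."""
    entry = dict(fault)  # copy so the caller's dict is not modified
    # Keep only the recognized status fields that are actually present.
    entry["status"] = {key: status[key] for key in STATUS_FIELDS if key in status}
    return entry
```

Recording the status alongside the fault lets a later reader see, for example, that a CPU error was logged while the temperature reading was high, without consulting a second log.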
As a preferred embodiment, the status information includes any one or any combination of the following:
Temperature information, type information, utilization information.
Of course, those skilled in the art can also choose and configure other kinds of status information; this application does not limit this.
As a preferred embodiment, the content of the fault log includes any one or any combination of the following:
Error code, fault level, fault time, fault event type, fault description.
Specifically, when generating the fault log, the baseboard management controller can classify and organize the detailed fault condition of the mainboard device clearly. In general, the detailed fault condition can be recorded in terms of error code, fault level, fault time, fault event type, fault description, and so on. Of course, those skilled in the art can choose and configure these as needed; this application does not limit them.
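The five listed fields map naturally onto a small record type. The Python sketch below is one possible shape, not the format used by any particular BMC firmware; the `FaultLevel` values and the text layout produced by `to_line` are assumptions.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum


class FaultLevel(Enum):  # hypothetical levels; the source does not enumerate them
    WARNING = "warning"
    CRITICAL = "critical"


@dataclass(frozen=True)
class FaultLogEntry:
    error_code: int
    level: FaultLevel
    time: datetime
    event_type: str
    description: str

    def to_line(self) -> str:
        """Render the entry as one human-readable log line."""
        return (f"{self.time.isoformat()} [{self.level.value}] "
                f"code=0x{self.error_code:04X} type={self.event_type} {self.description}")
```

A fixed record like this keeps every entry sortable by time and filterable by level or event type, which is what later diagnosis usually relies on.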
The fault diagnosis apparatus provided by this application is introduced below.
Referring to Fig. 2, Fig. 2 is a structural block diagram of a fault diagnosis apparatus provided by this application. The apparatus is applied to a baseboard management controller and includes a receiving module 1 and a parsing module 2;
The receiving module 1 is configured to receive an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device;
The parsing module 2 is configured to parse the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
As can be seen, the fault diagnosis apparatus provided by this application makes use of the basic input/output system controller's monitoring permission over mainboard devices such as the CPU: the basic input/output system controller sends the fault information it detects to the baseboard management controller, so that the baseboard management controller can generate fault logs for mainboard devices, such as the CPU, over which it has no direct monitoring permission, and fault diagnosis can be performed. This application effectively extends the range of fault diagnosis objects and thereby effectively improves fault diagnosis capability.
The fault diagnosis apparatus provided by this application, on the basis of the above embodiment:
As a preferred embodiment, the apparatus further comprises:
Obtaining module: configured to obtain status information of the mainboard device;
The parsing module 2 is specifically configured to:
Parse the fault information and the status information together to generate the fault log, so that a user can retrieve the fault log for fault diagnosis.
As a preferred embodiment, the status information includes any one or any combination of the following:
Temperature information, type information, utilization information.
As a preferred embodiment, the mainboard device includes any one or any combination of the following:
CPU, memory, PCI-e device.
As a preferred embodiment, the content of the fault log includes any one or any combination of the following:
Error code, fault level, fault time, fault event type, fault description.
This application also provides a fault diagnosis device, comprising:
Memory: configured to store a computer program;
Processor: configured to execute the computer program to implement the steps of any fault diagnosis method described above.
This application also provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the steps of any fault diagnosis method described above.
The specific embodiments of the fault diagnosis apparatus, device, and computer-readable storage medium provided by this application and the fault diagnosis method described above can be cross-referenced, so they are not repeated here.
The embodiments in this application are described progressively; each embodiment focuses on its differences from the others, and the same or similar parts may be cross-referenced. Because the apparatus disclosed in the embodiments corresponds to the method disclosed in the embodiments, its description is relatively brief; see the method description for the relevant details.
It should be noted that, in this application, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. In addition, the terms "include", "comprise", and any other variants are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element introduced by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or device that includes the element.
The technical solutions provided by this application have been described in detail above. Specific examples are used herein to explain the principles and implementations of this application; the description of the above embodiments is intended only to help in understanding the method of this application and its core idea. It should be pointed out that those of ordinary skill in the art can make improvements and modifications to this application without departing from its principles, and these improvements and modifications also fall within the protection scope of the claims of this application.

Claims (10)

1. A fault diagnosis method, characterized in that it is applied to a baseboard management controller and comprises:
Receiving an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device;
Parsing the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
2. The fault diagnosis method according to claim 1, characterized in that, after receiving the IPMI command, sent by the basic input/output system controller, for conveying fault information of the mainboard device, the method further comprises:
Obtaining status information of the mainboard device, so that the fault information and the status information are parsed together to generate the fault log.
3. The fault diagnosis method according to claim 2, characterized in that the status information includes any one or any combination of the following:
Temperature information, type information, utilization information.
4. The fault diagnosis method according to any one of claims 1 to 3, characterized in that the mainboard device includes any one or any combination of the following:
CPU, memory, PCI-e device.
5. The fault diagnosis method according to claim 4, characterized in that the content of the fault log includes any one or any combination of the following:
Error code, fault level, fault time, fault event type, fault description.
6. A fault diagnosis apparatus, characterized in that it is applied to a baseboard management controller and comprises:
Receiving module: configured to receive an IPMI command, sent by a basic input/output system controller, for conveying fault information of a mainboard device, the IPMI command being generated by the basic input/output system controller after it detects a fault in the mainboard device;
Parsing module: configured to parse the fault information to generate a fault log, so that a user can retrieve the fault log for fault diagnosis.
7. The fault diagnosis apparatus according to claim 6, characterized in that it further comprises:
Obtaining module: configured to obtain status information of the mainboard device;
The parsing module is specifically configured to:
Parse the fault information and the status information together to generate the fault log, so that a user can retrieve the fault log for fault diagnosis.
8. The fault diagnosis apparatus according to claim 7, characterized in that the mainboard device includes any one or any combination of the following:
CPU, memory, PCI-e device.
9. A fault diagnosis device, characterized in that it comprises:
Memory: configured to store a computer program;
Processor: configured to execute the computer program to implement the steps of the fault diagnosis method according to any one of claims 1 to 5.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program that, when executed by a processor, implements the steps of the fault diagnosis method according to any one of claims 1 to 5.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810826756.XA CN109032867A (en) 2018-07-25 2018-07-25 Fault diagnosis method, apparatus, and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810826756.XA CN109032867A (en) 2018-07-25 2018-07-25 Fault diagnosis method, apparatus, and device

Publications (1)

Publication Number Publication Date
CN109032867A true CN109032867A (en) 2018-12-18

Family

ID=64645198

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810826756.XA Pending CN109032867A (en) 2018-07-25 2018-07-25 Fault diagnosis method, apparatus, and device

Country Status (1)

Country Link
CN (1) CN109032867A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110413435A (en) * 2019-07-12 2019-11-05 苏州浪潮智能科技有限公司 A kind of communication failure restoration methods, system and associated component
CN112213980A (en) * 2020-10-21 2021-01-12 苏州浪潮智能科技有限公司 Singlechip fault diagnosis board card and method
CN114690747A (en) * 2022-05-31 2022-07-01 深圳市星卡软件技术开发有限公司 Method, device, equipment and storage medium for troubleshooting and diagnosing equipment problems

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799506A (en) * 2012-06-29 2012-11-28 浪潮电子信息产业股份有限公司 Method for positioning fault memory
CN105183600A (en) * 2015-09-09 2015-12-23 浪潮电子信息产业股份有限公司 Device and method for remotely positioning hard disk faults
CN106407030A (en) * 2016-09-13 2017-02-15 郑州云海信息技术有限公司 Failure processing method and system for storage cluster system
CN106909382A (en) * 2017-02-24 2017-06-30 郑州云海信息技术有限公司 Output different type system starts the method and device of information
CN107665260A (en) * 2017-10-24 2018-02-06 郑州云海信息技术有限公司 A kind of log collection instrument based on Linux system
CN107832194A (en) * 2017-11-16 2018-03-23 郑州云海信息技术有限公司 A kind of server failure detecting system and method based on onboard BMC
CN108170476A (en) * 2018-01-26 2018-06-15 郑州云海信息技术有限公司 A kind of method and system for recording server B ios release information


Similar Documents

Publication Publication Date Title
CN106603265B (en) Management method, network device, and non-transitory computer-readable medium
US7844866B2 (en) Mechanism to report operating system events on an intelligent platform management interface compliant server
CN104639380B (en) server monitoring method
EP0474058A2 (en) Problem analysis of a node computer with assistance from a central site
CN109039733A (en) A kind of alarm method, system and electronic equipment and storage medium
CN105205003A (en) Automated testing method and device based on clustering system
CN105159964A (en) Log monitoring method and system
US11231944B2 (en) Alerting, diagnosing, and transmitting computer issues to a technical resource in response to a dedicated physical button or trigger
WO2012046293A1 (en) Fault monitoring device, fault monitoring method and program
CN109032867A (en) A kind of method for diagnosing faults, device and equipment
JP2011210064A (en) Log information collection system, device, method and program
CN106646186B (en) Batch test method and system for chips
WO2016197737A1 (en) Self-check processing method, apparatus and system
CN105183575A (en) Processor fault diagnosis method, device and system
JP2009294837A (en) Failure monitoring system and device, monitoring apparatus, and failure monitoring method
CN102253873B (en) Alarm system and method for BIOS (Basic Input Output System)
CN112817883A (en) Method, device and system for adapting interface platform and computer readable storage medium
CN109728957B (en) Interactive operation and maintenance method and device
CN104571098B (en) Long-range self-diagnosing method based on Atom platforms
CN111124828A (en) Data processing method, device, equipment and storage medium
CN113110970B (en) Method, device, equipment and medium for monitoring all parts in server working mode
CN110708286A (en) Supervision system for testing internet
CN104951389A (en) Server display management implementing system and method
JP2004220221A (en) Information processor, monitoring control method for information processor, and information processing system
CN111949477B (en) Management scheme of large equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181218
