CN108038019A - A kind of automatically restoring fault method and system of baseboard management controller - Google Patents

A kind of automatically restoring fault method and system of baseboard management controller Download PDF

Info

Publication number
CN108038019A
CN108038019A CN201711424949.4A CN201711424949A CN108038019A CN 108038019 A CN108038019 A CN 108038019A CN 201711424949 A CN201711424949 A CN 201711424949A CN 108038019 A CN108038019 A CN 108038019A
Authority
CN
China
Prior art keywords
logic device
programmable logic
management controller
signal
baseboard management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711424949.4A
Other languages
Chinese (zh)
Other versions
CN108038019B (en
Inventor
胡远明
赵熠琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dawning Information Industry Beijing Co Ltd
Dawning Information Industry Co Ltd
Original Assignee
Dawning Information Industry Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dawning Information Industry Beijing Co Ltd filed Critical Dawning Information Industry Beijing Co Ltd
Priority to CN201711424949.4A priority Critical patent/CN108038019B/en
Publication of CN108038019A publication Critical patent/CN108038019A/en
Application granted granted Critical
Publication of CN108038019B publication Critical patent/CN108038019B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/24Resetting means

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The present invention provides a kind of automatically restoring fault method and system of baseboard management controller, the described method includes:Initialization process is performed by the baseboard management controller;Enabling signal is received by the complicated programmable logic device;The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;When the heartbeat signal is not output predeterminated frequency, then Restart Signal is sent from the complicated programmable logic device to the baseboard management controller, so that the baseboard management controller restarts and completes fault recovery.The present invention can lift the maintenance efficiency of server and the stability of the Management Controller management.

Description

A kind of automatically restoring fault method and system of baseboard management controller
Technical field
The present invention relates to a kind of automatically restoring fault method of field of computer technology, more particularly to baseboard management controller And system.
Background technology
With the rise of the technologies such as internet, cloud computing and big data, server has become strategic infrastructure. Under the overall situation of server demands amount rapid growth, server manageability, maintainability, stability etc. are all more and more important. Wherein, server disposition and management use baseboard management controller (BMC:Baseboard Management Controller) Scheme as outband management system master account for absolute majority, this also proposes wanting for higher to BMC with external system stability Ask.BMC outband managements system also occurs low probability when feelings such as machines as a set of independent system as server system Condition, if there is the management and the fortune that will just influence whole server without a kind of automatically restoring fault method after situations such as machine Dimension, influences stablizing and causing customer care inconvenient for server system.
Current server system, can be soft by being designed in server product server B MC on BMC fault recovery methods Part watchdog pattern recovery BMC failures, restart BMC by software watchdog in the case of BMC function module exceptions, reach To the purpose of fault recovery.But above-mentioned software fault pattern needs to rely on BMC internal clockings, if BMC clocks go wrong, Software watchdog will be unable to come into force;Alternatively, designing BMC reboot buttons in the server, service and break down in BMC, can be with By restarting BMC by reboot button.But since server is different from desktop computer or notebook, server is all placed on computer room In, BMC is restarted using button just to be needed to be operated into computer room, and for O&M, the fault recovery scheme is very low Effect;Again or part whole machine cabinet server uses rack management control (RMC:Rack Management Control) module pair BMC carries out fault recovery, and still, since RMC modules are also a set of BMC Managed Solutions in fact, its core component is also BMC cores Piece, difference is simply that BMC only manages this calculating node (server), and RMC modules and the BMC of all nodes are led to Letter, manages all nodes (multiple servers) in whole rack, since all there are failure risk, same RMC can similarly deposit by RMC In failure risk, if RMC and BMC breaks down at the same time, then the problem of BMC fault recoveries will can not achieve.
The content of the invention
The automatically restoring fault method and system of baseboard management controller provided by the invention, can lift the dimension of server Protect efficiency and the stability of the Management Controller management.
In a first aspect, the present invention provides a kind of automatically restoring fault method of baseboard management controller, including:
Initialization process is performed by the baseboard management controller;
Enabling signal is received by the complicated programmable logic device;
The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and using the heartbeat signal as The monitoring signals of the complicated programmable logic device;
Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;
When the complicated programmable logic device detects the heartbeat signal output predeterminated frequency, then continue by the complexity Programmable logic device detects whether the heartbeat signal exports predeterminated frequency;
When it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then by the complexity Programmable logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and completed Fault recovery.
Alternatively, it is described to be included by complicated programmable logic device reception enabling signal:
From platform control unit enabling signal is sent to the complicated programmable logic device through universal input/output interface;
The event of baseboard management controller is turned on and off according to the enabling signal control complicated programmable logic device Hinder auto restore facility.
Alternatively, after the execution initialization process by the baseboard management controller, the method further includes:
Judge whether the initialization process runs succeeded, if the initialization process runs succeeded, to described multiple Miscellaneous programmable logic device sends and is initialized to function signal, and performs next step;If the initialization process is not carried out success, Initialization failure signal is sent to the complicated programmable logic device, and substrate management is closed by the complicated programmable logic device The automatically restoring fault function of controller.
Alternatively, it is described to the complicated programmable logic device send be initialized to function signal after, the method is also Including:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continue by the complex programmable logic Device detects whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, patrolled by the complex programmable Volume device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and to complete failure extensive It is multiple.
Second aspect, the present invention provide a kind of fault automatic recovery system of baseboard management controller, including:
Baseboard management controller, for performing initialization process and sending heartbeat signal to complex programmable logic Device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device, for receiving whether enabling signal and the detection heartbeat signal export default frequency Rate;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When described When heartbeat signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate pipe Reason controller restarts and completes fault recovery.
Alternatively, the system also includes:
Platform control unit, letter is opened for being sent through universal input/output interface to the complicated programmable logic device Number, and it is automatically extensive according to the failure that the signal controls the complicated programmable logic device to be turned on and off baseboard management controller Multiple function.
Alternatively, the complicated programmable logic device includes:
Signal receiving module, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module, for sending Restart Signal to the baseboard management controller.
The automatically restoring fault method and system of baseboard management controller provided in an embodiment of the present invention, can be compiled using complexity Journey logic device (CPLD:Complex Programmable Logic Device) the control baseboard management controller progress event Barrier is automatic to be recovered, wherein, mainly patrolled by regarding the heartbeat signal of the baseboard management controller as the complex programmable The monitoring signals of device are collected, for example, the monitoring signals are the watchdog signals of the complicated programmable logic device;And by described Complicated programmable logic device detects the heartbeat signal in real time, and controls the substrate management according to the heartbeat signal The automatically restoring fault function of controller.
Wherein, the method is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted And complete fault recovery.
Meanwhile method described in the present embodiment can also also detect itself each module shape by the baseboard management controller Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol (DHCP:Dynamic Host Configuration Protocol) state, and IP address can not be obtained, it is possible to control heartbeat signal no longer to export default frequency Rate, so that the complicated programmable logic device is completed to restart baseboard management controller and peripheral modules in a short time Complete the automatic recovery of failure.
Therefore, the present embodiment the method realizes baseboard management controller by using the complicated programmable logic device Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe Manage controller management stability.
Brief description of the drawings
Fig. 1 is the flow chart of the automatically restoring fault method of one embodiment of the invention baseboard management controller;
Fig. 2 is the flow chart of the automatically restoring fault method of another embodiment of the present invention baseboard management controller;
Fig. 3 is the structure diagram of the fault automatic recovery system of one embodiment of the invention baseboard management controller;
Fig. 4 is the structure diagram of the fault automatic recovery system of another embodiment of the present invention baseboard management controller.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, the technical solution in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only Only it is part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, ordinary skill Personnel's all other embodiments obtained without making creative work, belong to the scope of protection of the invention.
The embodiment of the present invention provides a kind of automatically restoring fault method of baseboard management controller, as shown in Figure 1, the side Method includes:
S01, by the baseboard management controller perform initialization process;
S10, by the complicated programmable logic device receive enabling signal;
S11, send the heartbeat signal of baseboard management controller to complicated programmable logic device, and by the heartbeat signal Monitoring signals as the complicated programmable logic device;
S12, by the complicated programmable logic device detect whether the heartbeat signal exports predeterminated frequency;
S13, when the complicated programmable logic device detects heartbeat signal output predeterminated frequency, then continue by described Complicated programmable logic device detects whether the heartbeat signal exports predeterminated frequency;
S14, when it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then by described Complicated programmable logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted simultaneously Complete fault recovery.
The automatically restoring fault method of baseboard management controller provided in an embodiment of the present invention utilizes complex programmable logic Device controls the baseboard management controller to carry out automatically restoring fault, wherein, mainly by by the baseboard management controller Monitoring signals of the heartbeat signal as the complicated programmable logic device, for example, the monitoring signals can be compiled for the complexity The watchdog signals of journey logic device;And the heartbeat signal is detected in real time by the complicated programmable logic device, and The automatically restoring fault function of the baseboard management controller is controlled according to the heartbeat signal.
Wherein, the method is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted And complete fault recovery.
Meanwhile method described in the present embodiment can also also detect itself each module shape by the baseboard management controller Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol state, and can not obtain IP address, it is possible to Control heartbeat signal no longer exports predeterminated frequency, so that the complicated programmable logic device is completed to substrate in a short time Management Controller and restarting for peripheral modules complete recovering automatically for failure.
Therefore, the present embodiment the method realizes baseboard management controller by using the complicated programmable logic device Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe Manage controller management stability.
Alternatively, as shown in Fig. 2, described included by complicated programmable logic device reception enabling signal:
S101, from platform control unit through universal input/output interface to the complicated programmable logic device send start Signal;
S102, according to the enabling signal control the complicated programmable logic device to be turned on and off baseboard management controller Automatically restoring fault function.
Specifically, platform control unit inputs to the complex programmable through universal input/output interface in the present embodiment One enabling signal of logic device, is confirmed whether to open automatic recovery (restarting) described baseboard management controller function;By institute Baseboard management controller is stated in substrate management controller firmware described in self-renewing, the heartbeat signal and it is described input/it is defeated Outgoing interface is all in nondeterministic statement, it is necessary to which the complicated programmable logic device closing baseboard management controller failure is automatic Recover function;Therefore, the present embodiment the method is sent by the platform control unit to the complicated programmable logic device Enabling signal is confirmed, prevents the complicated programmable logic device from receiving error signal false triggering and restarting the substrate management control Device processed, and then cause the substrate management controller firmware upgrading failure, the baseboard management controller will be unable to again normal work Make.
Wherein, universal input/output interface of platform control unit described in the present embodiment the method need to only pass through height The baseboard management controller automatically restoring fault is opened or closed to the i.e. controllable complicated programmable logic device of low level Function;For example, high level (assuming that 3.3 volts) is selected as opening auto restore facility, then (0 volt) of low level is to close certainly It is dynamic to recover function.
Alternatively, after the execution initialization process by the baseboard management controller, the method further includes:
S02, judge whether the initialization process runs succeeded, if the initialization process runs succeeded, to institute State complicated programmable logic device transmission and be initialized to function signal, and perform next step;If the initialization process is not carried out into Work(, then send initialization failure signal to the complicated programmable logic device, and closes base by the complicated programmable logic device The automatically restoring fault function of board management controller.
Specifically, the present embodiment the method is meeting that the initialization process runs succeeded and the enabling signal controls The complicated programmable logic device opens the automatically restoring fault function of baseboard management controller and then performs step S12.
Alternatively, it is described to the complicated programmable logic device send be initialized to function signal after, the method is also Including:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continue by the complex programmable logic Device detects whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, patrolled by the complex programmable Volume device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and to complete failure extensive It is multiple.
Specifically, method described in the present embodiment can also be initialized in described sent to the complicated programmable logic device After pass signal, whether there is output to preset by complicated programmable logic device detection heartbeat signal in setting time The automatically restoring fault function of baseboard management controller described in FREQUENCY CONTROL.
For example, the method, after the baseboard management controller has performed initialization process, the substrate management controls Device software takes over the heartbeat signal, and exports the square wave of default fixed frequency, such as 1HZ, while the substrate management controls Universal input/output interface of device sends to the complicated programmable logic device and is initialized to function signal, such as 3.3V high level (it is then 0V low levels that initialization, which does not complete).After the completion of baseboard management controller initialization process performs, if described It is not 1Hz square waves that heartbeat signal, which continues for some time, for example, persistently being detected in setting time 20S, then can be compiled by the complexity Journey logic device, which is signaled, restarts the baseboard management controller and the baseboard management controller related peripheral chip, described in completion Baseboard management controller automatically restoring fault.The method can also be by the universal input of the baseboard management controller/defeated Function signal is initialized to transmitted by outgoing interface, is further ensured that the baseboard management controller software initialized completion, Avoid needing certain time since the baseboard management controller powers on or restart initialization, if without this signal conduct Judge benchmark, cause the complicated programmable logic device false triggering to restart baseboard management controller, and then form endless loop to cause The baseboard management controller can not work.
The embodiment of the present invention also provides a kind of fault automatic recovery system of baseboard management controller, as shown in figure 3, described System includes:
Baseboard management controller 11, for performing initialization process and sending heartbeat signal to complex programmable logic Device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device 12, for receiving whether enabling signal and the detection heartbeat signal export default frequency Rate;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When described When heartbeat signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate pipe Reason controller restarts and completes fault recovery.
The fault automatic recovery system of baseboard management controller provided in an embodiment of the present invention utilizes complex programmable logic Device controls the baseboard management controller to carry out automatically restoring fault, wherein, mainly by by the baseboard management controller Monitoring signals of the heartbeat signal as the complicated programmable logic device, for example, the monitoring signals can be compiled for the complexity The watchdog signals of journey logic device;And the heartbeat signal is detected in real time by the complicated programmable logic device, and The automatically restoring fault function of the baseboard management controller is controlled according to the heartbeat signal.
Wherein, the system is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted And complete fault recovery.
Meanwhile system described in the present embodiment can also also detect itself each module shape by the baseboard management controller Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol state, and can not obtain IP address, it is possible to Control heartbeat signal no longer exports predeterminated frequency, so that the complicated programmable logic device is completed to substrate in a short time Management Controller and restarting for peripheral modules complete recovering automatically for failure.
Therefore, system described in the present embodiment realizes baseboard management controller by using the complicated programmable logic device Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe Manage controller management stability.
Alternatively, as shown in figure 4, the system also includes:
Platform control unit 13, is opened for being sent through universal input/output interface to the complicated programmable logic device Signal, and it is automatic according to the failure that the signal controls the complicated programmable logic device to be turned on and off baseboard management controller Recover function.
Alternatively, the complicated programmable logic device includes:
Signal receiving module 121, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module 122, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module 123, for sending Restart Signal to the baseboard management controller.
The system of the present embodiment, can be used for the technical solution for performing above method embodiment, its realization principle and technology Effect is similar, and details are not described herein again.
One of ordinary skill in the art will appreciate that realize all or part of flow in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the program can be stored in a computer read/write memory medium In, the program is upon execution, it may include such as the flow of the embodiment of above-mentioned each method.Wherein, the storage medium can be magnetic Dish, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..
The above description is merely a specific embodiment, but protection scope of the present invention is not limited thereto, any Those familiar with the art the invention discloses technical scope in, the change or replacement that can readily occur in, all should It is included within the scope of the present invention.Therefore, protection scope of the present invention should be subject to scope of the claims.

Claims (7)

1. a kind of automatically restoring fault method of baseboard management controller, it is characterised in that including:
Initialization process is performed by the baseboard management controller;
Enabling signal is received by the complicated programmable logic device;
The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and using the heartbeat signal as described in The monitoring signals of complicated programmable logic device;
Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;
When the complicated programmable logic device detects the heartbeat signal output predeterminated frequency, then continue to be compiled by the complexity Journey logic device detects whether the heartbeat signal exports predeterminated frequency;
When it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then can be compiled by the complexity Journey logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller restarts and completes failure Recover.
2. according to the method described in claim 1, it is characterized in that, described received by the complicated programmable logic device starts letter Number include:
From platform control unit enabling signal is sent to the complicated programmable logic device through universal input/output interface;
The complicated programmable logic device is controlled to be turned on and off the failure of baseboard management controller certainly according to the enabling signal It is dynamic to recover function.
3. method according to claim 1 or 2, it is characterised in that performed just by the baseboard management controller described After beginning process, the described method includes:
Judge whether the initialization process runs succeeded, can to the complexity if the initialization process runs succeeded Programmed logic device sends and is initialized to function signal, and performs next step;If the initialization process is not carried out success, to institute State complicated programmable logic device and send initialization failure signal, and substrate management control is closed by the complicated programmable logic device The automatically restoring fault function of device.
4. according to the method described in claim 3, it is characterized in that, sent initially to the complicated programmable logic device described It is melted into after function signal, the method further includes:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continuation is examined by the complicated programmable logic device Survey whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, by the complicated programmable logic device Restart Signal is sent to the baseboard management controller, so that the baseboard management controller restarts and completes fault recovery.
A kind of 5. fault automatic recovery system of baseboard management controller, it is characterised in that including:
Baseboard management controller, for performing initialization process and sending heartbeat signal to complicated programmable logic device, and Monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device, for receiving whether enabling signal and the detection heartbeat signal export predeterminated frequency;When During the heartbeat signal output predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When the heartbeat When signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate management control System, which is thought highly of, opens and completes fault recovery.
6. system according to claim 5, it is characterised in that the system also includes:
Platform control unit, for sending open signal to the complicated programmable logic device through universal input/output interface, and The automatically restoring fault work(of baseboard management controller is turned on and off according to the signal control complicated programmable logic device Energy.
7. the system according to claim 5 or 6, it is characterised in that the complicated programmable logic device includes:
Signal receiving module, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module, for sending Restart Signal to the baseboard management controller.
CN201711424949.4A 2017-12-25 2017-12-25 Automatic fault recovery method and system for substrate management controller Active CN108038019B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711424949.4A CN108038019B (en) 2017-12-25 2017-12-25 Automatic fault recovery method and system for substrate management controller

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711424949.4A CN108038019B (en) 2017-12-25 2017-12-25 Automatic fault recovery method and system for substrate management controller

Publications (2)

Publication Number Publication Date
CN108038019A true CN108038019A (en) 2018-05-15
CN108038019B CN108038019B (en) 2021-06-11

Family

ID=62101154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711424949.4A Active CN108038019B (en) 2017-12-25 2017-12-25 Automatic fault recovery method and system for substrate management controller

Country Status (1)

Country Link
CN (1) CN108038019B (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109032362A (en) * 2018-08-31 2018-12-18 苏州竹原信息科技有限公司 A kind of tutoring system and its control method based on VR
CN109032639A (en) * 2018-07-19 2018-12-18 郑州云海信息技术有限公司 A kind of complete machine flogic system upgrade method, system and independent logical device
CN109240851A (en) * 2018-08-24 2019-01-18 郑州云海信息技术有限公司 A kind of autonomous type realization self-healing method and system of batch BMC
CN109254894A (en) * 2018-08-20 2019-01-22 曙光信息产业(北京)有限公司 The heartbeat inspecting device and method of chip
CN109656739A (en) * 2018-12-10 2019-04-19 英业达科技有限公司 Method of servicing, system, mainboard and computer readable storage medium
CN109669711A (en) * 2018-12-14 2019-04-23 郑州云海信息技术有限公司 A kind of server independently refreshes the method and BMC of CPLD
CN110213136A (en) * 2019-06-24 2019-09-06 山信软件股份有限公司 A kind of communicating control method and system
CN111124849A (en) * 2019-11-08 2020-05-08 苏州浪潮智能科技有限公司 Method, device and medium for server fault warning
TWI697768B (en) * 2019-03-07 2020-07-01 神雲科技股份有限公司 Reset bmc control method
CN111367700A (en) * 2020-02-28 2020-07-03 苏州浪潮智能科技有限公司 Forced recovery method, system and related components after BMC downtime
CN111813600A (en) * 2020-06-29 2020-10-23 中国长城科技集团股份有限公司 Controller recovery method, device, terminal and medium
CN111913551A (en) * 2019-05-08 2020-11-10 佛山市顺德区顺达电脑厂有限公司 Control method for resetting baseboard management controller
CN111966559A (en) * 2020-07-14 2020-11-20 中国长城科技集团股份有限公司 Fault recovery method and device, electronic equipment and storage medium
CN111984464A (en) * 2020-07-25 2020-11-24 苏州浪潮智能科技有限公司 Programmable logic device monitoring and restarting method, device and system
CN112000995A (en) * 2020-08-06 2020-11-27 苏州浪潮智能科技有限公司 Novel case intrusion warning system and method
CN113359967A (en) * 2021-04-15 2021-09-07 山东英信计算机技术有限公司 Equipment starting method and device
CN113918383A (en) * 2021-10-12 2022-01-11 北京百度网讯科技有限公司 Core board resetting method, device, equipment, storage medium and program product
CN114691408A (en) * 2022-04-18 2022-07-01 苏州浪潮智能科技有限公司 Fault detection device for substrate management controller
CN115237644A (en) * 2022-06-16 2022-10-25 广州汽车集团股份有限公司 System failure processing method, central processing unit and vehicle
CN116820827A (en) * 2023-08-28 2023-09-29 苏州浪潮智能科技有限公司 Control method and system of substrate management controller of node server
TWI827031B (en) * 2022-04-24 2023-12-21 新加坡商鴻運科股份有限公司 Detection system and method of substrate management controller

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100017629A1 (en) * 2008-07-17 2010-01-21 Hitachi, Ltd. File sharing apparatus and file sharing system
TW201227272A (en) * 2010-12-22 2012-07-01 Inventec Corp A detect device of the peripheral component
CN103835972A (en) * 2012-11-20 2014-06-04 英业达科技有限公司 Fan rotating speed control system and method for control rotating speed of fan
CN103885860A (en) * 2014-03-21 2014-06-25 浪潮集团有限公司 Method for achieving BMC double-management hot redundancy by applying IPMI command
CN105959151A (en) * 2016-06-22 2016-09-21 中国工商银行股份有限公司 High availability stream processing system and method
CN107145428A (en) * 2017-05-26 2017-09-08 郑州云海信息技术有限公司 A kind of server and server monitoring method
CN206647293U (en) * 2017-03-03 2017-11-17 郑州云海信息技术有限公司 A kind of server fan rotating speed control system based on CPLD

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100017629A1 (en) * 2008-07-17 2010-01-21 Hitachi, Ltd. File sharing apparatus and file sharing system
TW201227272A (en) * 2010-12-22 2012-07-01 Inventec Corp A detect device of the peripheral component
CN103835972A (en) * 2012-11-20 2014-06-04 英业达科技有限公司 Fan rotating speed control system and method for control rotating speed of fan
CN103885860A (en) * 2014-03-21 2014-06-25 浪潮集团有限公司 Method for achieving BMC double-management hot redundancy by applying IPMI command
CN105959151A (en) * 2016-06-22 2016-09-21 中国工商银行股份有限公司 High availability stream processing system and method
CN206647293U (en) * 2017-03-03 2017-11-17 郑州云海信息技术有限公司 A kind of server fan rotating speed control system based on CPLD
CN107145428A (en) * 2017-05-26 2017-09-08 郑州云海信息技术有限公司 A kind of server and server monitoring method

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109032639A (en) * 2018-07-19 2018-12-18 郑州云海信息技术有限公司 A kind of complete machine flogic system upgrade method, system and independent logical device
CN109254894A (en) * 2018-08-20 2019-01-22 曙光信息产业(北京)有限公司 The heartbeat inspecting device and method of chip
CN109254894B (en) * 2018-08-20 2022-03-11 中科曙光信息产业成都有限公司 Device and method for monitoring heartbeat of chip
CN109240851A (en) * 2018-08-24 2019-01-18 郑州云海信息技术有限公司 A kind of autonomous type realization self-healing method and system of batch BMC
CN109032362A (en) * 2018-08-31 2018-12-18 苏州竹原信息科技有限公司 A kind of tutoring system and its control method based on VR
CN109656739A (en) * 2018-12-10 2019-04-19 英业达科技有限公司 Method of servicing, system, mainboard and computer readable storage medium
CN109669711A (en) * 2018-12-14 2019-04-23 郑州云海信息技术有限公司 A kind of server independently refreshes the method and BMC of CPLD
CN109669711B (en) * 2018-12-14 2021-10-29 郑州云海信息技术有限公司 Method for server to automatically refresh CPLD and BMC
TWI697768B (en) * 2019-03-07 2020-07-01 神雲科技股份有限公司 Reset bmc control method
CN111913551A (en) * 2019-05-08 2020-11-10 佛山市顺德区顺达电脑厂有限公司 Control method for resetting baseboard management controller
CN111913551B (en) * 2019-05-08 2024-04-19 佛山市顺德区顺达电脑厂有限公司 Control method for resetting baseboard management controller
CN110213136A (en) * 2019-06-24 2019-09-06 山信软件股份有限公司 A kind of communicating control method and system
CN111124849A (en) * 2019-11-08 2020-05-08 苏州浪潮智能科技有限公司 Method, device and medium for server fault warning
CN111367700A (en) * 2020-02-28 2020-07-03 苏州浪潮智能科技有限公司 Forced recovery method, system and related components after BMC downtime
CN111813600A (en) * 2020-06-29 2020-10-23 中国长城科技集团股份有限公司 Controller recovery method, device, terminal and medium
CN111966559A (en) * 2020-07-14 2020-11-20 中国长城科技集团股份有限公司 Fault recovery method and device, electronic equipment and storage medium
CN111966559B (en) * 2020-07-14 2023-12-15 中国长城科技集团股份有限公司 Fault recovery method and device, electronic equipment and storage medium
CN111984464A (en) * 2020-07-25 2020-11-24 苏州浪潮智能科技有限公司 Programmable logic device monitoring and restarting method, device and system
CN111984464B (en) * 2020-07-25 2023-01-10 苏州浪潮智能科技有限公司 Programmable logic device monitoring and restarting method, device and system
CN112000995A (en) * 2020-08-06 2020-11-27 苏州浪潮智能科技有限公司 Novel case intrusion warning system and method
CN112000995B (en) * 2020-08-06 2022-12-09 苏州浪潮智能科技有限公司 Novel case intrusion warning system and method
CN113359967B (en) * 2021-04-15 2022-04-22 山东英信计算机技术有限公司 Equipment starting method and device
CN113359967A (en) * 2021-04-15 2021-09-07 山东英信计算机技术有限公司 Equipment starting method and device
CN113918383A (en) * 2021-10-12 2022-01-11 北京百度网讯科技有限公司 Core board resetting method, device, equipment, storage medium and program product
CN114691408A (en) * 2022-04-18 2022-07-01 苏州浪潮智能科技有限公司 Fault detection device for substrate management controller
TWI827031B (en) * 2022-04-24 2023-12-21 新加坡商鴻運科股份有限公司 Detection system and method of substrate management controller
CN115237644A (en) * 2022-06-16 2022-10-25 广州汽车集团股份有限公司 System failure processing method, central processing unit and vehicle
CN115237644B (en) * 2022-06-16 2024-04-23 广州汽车集团股份有限公司 System fault processing method, central operation unit and vehicle
CN116820827A (en) * 2023-08-28 2023-09-29 苏州浪潮智能科技有限公司 Control method and system of substrate management controller of node server
CN116820827B (en) * 2023-08-28 2024-01-23 苏州浪潮智能科技有限公司 Control method and system of substrate management controller of node server

Also Published As

Publication number Publication date
CN108038019B (en) 2021-06-11

Similar Documents

Publication Publication Date Title
CN108038019A (en) A kind of automatically restoring fault method and system of baseboard management controller
CN107179957B (en) Physical machine fault classification processing method and device and virtual machine recovery method and system
CN102132523B (en) Device power management using network connections
CN104268061B (en) A kind of storage state monitoring method suitable for virtual machine
CN110807064B (en) Data recovery device in RAC distributed database cluster system
CN105204955B (en) A kind of virtual-machine fail restorative procedure and device
CN108427616A (en) background program monitoring method and monitoring device
CN110109782B (en) Method, device and system for replacing fault PCIe (peripheral component interconnect express) equipment
CN106032619B (en) Machine communicating with washing method
CN108228374A (en) A kind of fault handling method of equipment, apparatus and system
CN107819808A (en) Communicate to connect method for building up and device
CN110532096B (en) System and method for multi-node grouping parallel deployment
CN105260274A (en) Method for detecting random hot plug stability of hard disk based on linux
CN107070747B (en) Device, system and method for automatically testing network card network connection stability in network card binding mode
CN117251333A (en) Method, device, equipment and storage medium for acquiring hard disk information
CN1322422C (en) Automatic startup of cluster system after occurrence of recoverable error
CN110032465A (en) A kind of BMC restarts log recording method and device
US8421614B2 (en) Reliable redundant data communication through alternating current power distribution system
CN109783390A (en) PSU firmware promotion and demotion stability test method, apparatus, terminal and storage medium
CN108897646A (en) A kind of switching method and baseboard management controller of BIOS chip
US6973412B2 (en) Method and apparatus involving a hierarchy of field replaceable units containing stored data
CN109254894B (en) Device and method for monitoring heartbeat of chip
CN110413435A (en) A kind of communication failure restoration methods, system and associated component
CN105912414A (en) Method and system for server management
CN112148527A (en) Server power-off method, device and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20220810

Address after: 100089 building 36, courtyard 8, Dongbeiwang West Road, Haidian District, Beijing

Patentee after: Dawning Information Industry (Beijing) Co.,Ltd.

Patentee after: DAWNING INFORMATION INDUSTRY Co.,Ltd.

Address before: 100193 No. 36 Building, No. 8 Hospital, Wangxi Road, Haidian District, Beijing

Patentee before: Dawning Information Industry (Beijing) Co.,Ltd.