CN108038019A - A kind of automatically restoring fault method and system of baseboard management controller - Google Patents
A kind of automatically restoring fault method and system of baseboard management controller Download PDFInfo
- Publication number
- CN108038019A CN108038019A CN201711424949.4A CN201711424949A CN108038019A CN 108038019 A CN108038019 A CN 108038019A CN 201711424949 A CN201711424949 A CN 201711424949A CN 108038019 A CN108038019 A CN 108038019A
- Authority
- CN
- China
- Prior art keywords
- logic device
- programmable logic
- management controller
- signal
- baseboard management
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0706—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/24—Resetting means
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Debugging And Monitoring (AREA)
Abstract
The present invention provides a kind of automatically restoring fault method and system of baseboard management controller, the described method includes:Initialization process is performed by the baseboard management controller;Enabling signal is received by the complicated programmable logic device;The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;When the heartbeat signal is not output predeterminated frequency, then Restart Signal is sent from the complicated programmable logic device to the baseboard management controller, so that the baseboard management controller restarts and completes fault recovery.The present invention can lift the maintenance efficiency of server and the stability of the Management Controller management.
Description
Technical field
The present invention relates to a kind of automatically restoring fault method of field of computer technology, more particularly to baseboard management controller
And system.
Background technology
With the rise of the technologies such as internet, cloud computing and big data, server has become strategic infrastructure.
Under the overall situation of server demands amount rapid growth, server manageability, maintainability, stability etc. are all more and more important.
Wherein, server disposition and management use baseboard management controller (BMC:Baseboard Management Controller)
Scheme as outband management system master account for absolute majority, this also proposes wanting for higher to BMC with external system stability
Ask.BMC outband managements system also occurs low probability when feelings such as machines as a set of independent system as server system
Condition, if there is the management and the fortune that will just influence whole server without a kind of automatically restoring fault method after situations such as machine
Dimension, influences stablizing and causing customer care inconvenient for server system.
Current server system, can be soft by being designed in server product server B MC on BMC fault recovery methods
Part watchdog pattern recovery BMC failures, restart BMC by software watchdog in the case of BMC function module exceptions, reach
To the purpose of fault recovery.But above-mentioned software fault pattern needs to rely on BMC internal clockings, if BMC clocks go wrong,
Software watchdog will be unable to come into force;Alternatively, designing BMC reboot buttons in the server, service and break down in BMC, can be with
By restarting BMC by reboot button.But since server is different from desktop computer or notebook, server is all placed on computer room
In, BMC is restarted using button just to be needed to be operated into computer room, and for O&M, the fault recovery scheme is very low
Effect;Again or part whole machine cabinet server uses rack management control (RMC:Rack Management Control) module pair
BMC carries out fault recovery, and still, since RMC modules are also a set of BMC Managed Solutions in fact, its core component is also BMC cores
Piece, difference is simply that BMC only manages this calculating node (server), and RMC modules and the BMC of all nodes are led to
Letter, manages all nodes (multiple servers) in whole rack, since all there are failure risk, same RMC can similarly deposit by RMC
In failure risk, if RMC and BMC breaks down at the same time, then the problem of BMC fault recoveries will can not achieve.
The content of the invention
The automatically restoring fault method and system of baseboard management controller provided by the invention, can lift the dimension of server
Protect efficiency and the stability of the Management Controller management.
In a first aspect, the present invention provides a kind of automatically restoring fault method of baseboard management controller, including:
Initialization process is performed by the baseboard management controller;
Enabling signal is received by the complicated programmable logic device;
The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and using the heartbeat signal as
The monitoring signals of the complicated programmable logic device;
Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;
When the complicated programmable logic device detects the heartbeat signal output predeterminated frequency, then continue by the complexity
Programmable logic device detects whether the heartbeat signal exports predeterminated frequency;
When it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then by the complexity
Programmable logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and completed
Fault recovery.
Alternatively, it is described to be included by complicated programmable logic device reception enabling signal:
From platform control unit enabling signal is sent to the complicated programmable logic device through universal input/output interface;
The event of baseboard management controller is turned on and off according to the enabling signal control complicated programmable logic device
Hinder auto restore facility.
Alternatively, after the execution initialization process by the baseboard management controller, the method further includes:
Judge whether the initialization process runs succeeded, if the initialization process runs succeeded, to described multiple
Miscellaneous programmable logic device sends and is initialized to function signal, and performs next step;If the initialization process is not carried out success,
Initialization failure signal is sent to the complicated programmable logic device, and substrate management is closed by the complicated programmable logic device
The automatically restoring fault function of controller.
Alternatively, it is described to the complicated programmable logic device send be initialized to function signal after, the method is also
Including:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continue by the complex programmable logic
Device detects whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, patrolled by the complex programmable
Volume device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and to complete failure extensive
It is multiple.
Second aspect, the present invention provide a kind of fault automatic recovery system of baseboard management controller, including:
Baseboard management controller, for performing initialization process and sending heartbeat signal to complex programmable logic
Device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device, for receiving whether enabling signal and the detection heartbeat signal export default frequency
Rate;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When described
When heartbeat signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate pipe
Reason controller restarts and completes fault recovery.
Alternatively, the system also includes:
Platform control unit, letter is opened for being sent through universal input/output interface to the complicated programmable logic device
Number, and it is automatically extensive according to the failure that the signal controls the complicated programmable logic device to be turned on and off baseboard management controller
Multiple function.
Alternatively, the complicated programmable logic device includes:
Signal receiving module, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module, for sending Restart Signal to the baseboard management controller.
The automatically restoring fault method and system of baseboard management controller provided in an embodiment of the present invention, can be compiled using complexity
Journey logic device (CPLD:Complex Programmable Logic Device) the control baseboard management controller progress event
Barrier is automatic to be recovered, wherein, mainly patrolled by regarding the heartbeat signal of the baseboard management controller as the complex programmable
The monitoring signals of device are collected, for example, the monitoring signals are the watchdog signals of the complicated programmable logic device;And by described
Complicated programmable logic device detects the heartbeat signal in real time, and controls the substrate management according to the heartbeat signal
The automatically restoring fault function of controller.
Wherein, the method is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device
Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute
State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted
And complete fault recovery.
Meanwhile method described in the present embodiment can also also detect itself each module shape by the baseboard management controller
Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol (DHCP:Dynamic Host
Configuration Protocol) state, and IP address can not be obtained, it is possible to control heartbeat signal no longer to export default frequency
Rate, so that the complicated programmable logic device is completed to restart baseboard management controller and peripheral modules in a short time
Complete the automatic recovery of failure.
Therefore, the present embodiment the method realizes baseboard management controller by using the complicated programmable logic device
Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe
Manage controller management stability.
Brief description of the drawings
Fig. 1 is the flow chart of the automatically restoring fault method of one embodiment of the invention baseboard management controller;
Fig. 2 is the flow chart of the automatically restoring fault method of another embodiment of the present invention baseboard management controller;
Fig. 3 is the structure diagram of the fault automatic recovery system of one embodiment of the invention baseboard management controller;
Fig. 4 is the structure diagram of the fault automatic recovery system of another embodiment of the present invention baseboard management controller.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention
In attached drawing, the technical solution in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only
Only it is part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, ordinary skill
Personnel's all other embodiments obtained without making creative work, belong to the scope of protection of the invention.
The embodiment of the present invention provides a kind of automatically restoring fault method of baseboard management controller, as shown in Figure 1, the side
Method includes:
S01, by the baseboard management controller perform initialization process;
S10, by the complicated programmable logic device receive enabling signal;
S11, send the heartbeat signal of baseboard management controller to complicated programmable logic device, and by the heartbeat signal
Monitoring signals as the complicated programmable logic device;
S12, by the complicated programmable logic device detect whether the heartbeat signal exports predeterminated frequency;
S13, when the complicated programmable logic device detects heartbeat signal output predeterminated frequency, then continue by described
Complicated programmable logic device detects whether the heartbeat signal exports predeterminated frequency;
S14, when it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then by described
Complicated programmable logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted simultaneously
Complete fault recovery.
The automatically restoring fault method of baseboard management controller provided in an embodiment of the present invention utilizes complex programmable logic
Device controls the baseboard management controller to carry out automatically restoring fault, wherein, mainly by by the baseboard management controller
Monitoring signals of the heartbeat signal as the complicated programmable logic device, for example, the monitoring signals can be compiled for the complexity
The watchdog signals of journey logic device;And the heartbeat signal is detected in real time by the complicated programmable logic device, and
The automatically restoring fault function of the baseboard management controller is controlled according to the heartbeat signal.
Wherein, the method is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device
Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute
State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted
And complete fault recovery.
Meanwhile method described in the present embodiment can also also detect itself each module shape by the baseboard management controller
Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol state, and can not obtain IP address, it is possible to
Control heartbeat signal no longer exports predeterminated frequency, so that the complicated programmable logic device is completed to substrate in a short time
Management Controller and restarting for peripheral modules complete recovering automatically for failure.
Therefore, the present embodiment the method realizes baseboard management controller by using the complicated programmable logic device
Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe
Manage controller management stability.
Alternatively, as shown in Fig. 2, described included by complicated programmable logic device reception enabling signal:
S101, from platform control unit through universal input/output interface to the complicated programmable logic device send start
Signal;
S102, according to the enabling signal control the complicated programmable logic device to be turned on and off baseboard management controller
Automatically restoring fault function.
Specifically, platform control unit inputs to the complex programmable through universal input/output interface in the present embodiment
One enabling signal of logic device, is confirmed whether to open automatic recovery (restarting) described baseboard management controller function;By institute
Baseboard management controller is stated in substrate management controller firmware described in self-renewing, the heartbeat signal and it is described input/it is defeated
Outgoing interface is all in nondeterministic statement, it is necessary to which the complicated programmable logic device closing baseboard management controller failure is automatic
Recover function;Therefore, the present embodiment the method is sent by the platform control unit to the complicated programmable logic device
Enabling signal is confirmed, prevents the complicated programmable logic device from receiving error signal false triggering and restarting the substrate management control
Device processed, and then cause the substrate management controller firmware upgrading failure, the baseboard management controller will be unable to again normal work
Make.
Wherein, universal input/output interface of platform control unit described in the present embodiment the method need to only pass through height
The baseboard management controller automatically restoring fault is opened or closed to the i.e. controllable complicated programmable logic device of low level
Function;For example, high level (assuming that 3.3 volts) is selected as opening auto restore facility, then (0 volt) of low level is to close certainly
It is dynamic to recover function.
Alternatively, after the execution initialization process by the baseboard management controller, the method further includes:
S02, judge whether the initialization process runs succeeded, if the initialization process runs succeeded, to institute
State complicated programmable logic device transmission and be initialized to function signal, and perform next step;If the initialization process is not carried out into
Work(, then send initialization failure signal to the complicated programmable logic device, and closes base by the complicated programmable logic device
The automatically restoring fault function of board management controller.
Specifically, the present embodiment the method is meeting that the initialization process runs succeeded and the enabling signal controls
The complicated programmable logic device opens the automatically restoring fault function of baseboard management controller and then performs step S12.
Alternatively, it is described to the complicated programmable logic device send be initialized to function signal after, the method is also
Including:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continue by the complex programmable logic
Device detects whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, patrolled by the complex programmable
Volume device sends Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted and to complete failure extensive
It is multiple.
Specifically, method described in the present embodiment can also be initialized in described sent to the complicated programmable logic device
After pass signal, whether there is output to preset by complicated programmable logic device detection heartbeat signal in setting time
The automatically restoring fault function of baseboard management controller described in FREQUENCY CONTROL.
For example, the method, after the baseboard management controller has performed initialization process, the substrate management controls
Device software takes over the heartbeat signal, and exports the square wave of default fixed frequency, such as 1HZ, while the substrate management controls
Universal input/output interface of device sends to the complicated programmable logic device and is initialized to function signal, such as 3.3V high level
(it is then 0V low levels that initialization, which does not complete).After the completion of baseboard management controller initialization process performs, if described
It is not 1Hz square waves that heartbeat signal, which continues for some time, for example, persistently being detected in setting time 20S, then can be compiled by the complexity
Journey logic device, which is signaled, restarts the baseboard management controller and the baseboard management controller related peripheral chip, described in completion
Baseboard management controller automatically restoring fault.The method can also be by the universal input of the baseboard management controller/defeated
Function signal is initialized to transmitted by outgoing interface, is further ensured that the baseboard management controller software initialized completion,
Avoid needing certain time since the baseboard management controller powers on or restart initialization, if without this signal conduct
Judge benchmark, cause the complicated programmable logic device false triggering to restart baseboard management controller, and then form endless loop to cause
The baseboard management controller can not work.
The embodiment of the present invention also provides a kind of fault automatic recovery system of baseboard management controller, as shown in figure 3, described
System includes:
Baseboard management controller 11, for performing initialization process and sending heartbeat signal to complex programmable logic
Device, and the monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device 12, for receiving whether enabling signal and the detection heartbeat signal export default frequency
Rate;When the heartbeat signal exports predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When described
When heartbeat signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate pipe
Reason controller restarts and completes fault recovery.
The fault automatic recovery system of baseboard management controller provided in an embodiment of the present invention utilizes complex programmable logic
Device controls the baseboard management controller to carry out automatically restoring fault, wherein, mainly by by the baseboard management controller
Monitoring signals of the heartbeat signal as the complicated programmable logic device, for example, the monitoring signals can be compiled for the complexity
The watchdog signals of journey logic device;And the heartbeat signal is detected in real time by the complicated programmable logic device, and
The automatically restoring fault function of the baseboard management controller is controlled according to the heartbeat signal.
Wherein, the system is mainly that the output frequency of the heartbeat signal is detected by the complicated programmable logic device
Rate, and the output frequency of the heartbeat signal and output predeterminated frequency are contrasted, controlled whether according to comparing result by institute
State complicated programmable logic device and send Restart Signal to the baseboard management controller, so that the baseboard management controller is restarted
And complete fault recovery.
Meanwhile system described in the present embodiment can also also detect itself each module shape by the baseboard management controller
Whether state is abnormal, for example detects network and be constantly in dynamic host configuration protocol state, and can not obtain IP address, it is possible to
Control heartbeat signal no longer exports predeterminated frequency, so that the complicated programmable logic device is completed to substrate in a short time
Management Controller and restarting for peripheral modules complete recovering automatically for failure.
Therefore, system described in the present embodiment realizes baseboard management controller by using the complicated programmable logic device
Automatically restoring fault function, the method not only improve the maintenance efficiency of server;Meanwhile also improve and described take substrate pipe
Manage controller management stability.
Alternatively, as shown in figure 4, the system also includes:
Platform control unit 13, is opened for being sent through universal input/output interface to the complicated programmable logic device
Signal, and it is automatic according to the failure that the signal controls the complicated programmable logic device to be turned on and off baseboard management controller
Recover function.
Alternatively, the complicated programmable logic device includes:
Signal receiving module 121, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module 122, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module 123, for sending Restart Signal to the baseboard management controller.
The system of the present embodiment, can be used for the technical solution for performing above method embodiment, its realization principle and technology
Effect is similar, and details are not described herein again.
One of ordinary skill in the art will appreciate that realize all or part of flow in above-described embodiment method, being can be with
Relevant hardware is instructed to complete by computer program, the program can be stored in a computer read/write memory medium
In, the program is upon execution, it may include such as the flow of the embodiment of above-mentioned each method.Wherein, the storage medium can be magnetic
Dish, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access
Memory, RAM) etc..
The above description is merely a specific embodiment, but protection scope of the present invention is not limited thereto, any
Those familiar with the art the invention discloses technical scope in, the change or replacement that can readily occur in, all should
It is included within the scope of the present invention.Therefore, protection scope of the present invention should be subject to scope of the claims.
Claims (7)
1. a kind of automatically restoring fault method of baseboard management controller, it is characterised in that including:
Initialization process is performed by the baseboard management controller;
Enabling signal is received by the complicated programmable logic device;
The heartbeat signal of baseboard management controller is sent to complicated programmable logic device, and using the heartbeat signal as described in
The monitoring signals of complicated programmable logic device;
Detect whether the heartbeat signal exports predeterminated frequency by the complicated programmable logic device;
When the complicated programmable logic device detects the heartbeat signal output predeterminated frequency, then continue to be compiled by the complexity
Journey logic device detects whether the heartbeat signal exports predeterminated frequency;
When it is not output predeterminated frequency that the complicated programmable logic device, which detects the heartbeat signal, then can be compiled by the complexity
Journey logic device sends Restart Signal to the baseboard management controller, so that the baseboard management controller restarts and completes failure
Recover.
2. according to the method described in claim 1, it is characterized in that, described received by the complicated programmable logic device starts letter
Number include:
From platform control unit enabling signal is sent to the complicated programmable logic device through universal input/output interface;
The complicated programmable logic device is controlled to be turned on and off the failure of baseboard management controller certainly according to the enabling signal
It is dynamic to recover function.
3. method according to claim 1 or 2, it is characterised in that performed just by the baseboard management controller described
After beginning process, the described method includes:
Judge whether the initialization process runs succeeded, can to the complexity if the initialization process runs succeeded
Programmed logic device sends and is initialized to function signal, and performs next step;If the initialization process is not carried out success, to institute
State complicated programmable logic device and send initialization failure signal, and substrate management control is closed by the complicated programmable logic device
The automatically restoring fault function of device.
4. according to the method described in claim 3, it is characterized in that, sent initially to the complicated programmable logic device described
It is melted into after function signal, the method further includes:
Whether there is output predeterminated frequency by complicated programmable logic device detection heartbeat signal in setting time;
If the heartbeat signal has output predeterminated frequency in setting time, continuation is examined by the complicated programmable logic device
Survey whether the heartbeat signal exports predeterminated frequency;
If the heartbeat signal continues not to be output predeterminated frequency in setting time, by the complicated programmable logic device
Restart Signal is sent to the baseboard management controller, so that the baseboard management controller restarts and completes fault recovery.
A kind of 5. fault automatic recovery system of baseboard management controller, it is characterised in that including:
Baseboard management controller, for performing initialization process and sending heartbeat signal to complicated programmable logic device, and
Monitoring signals using the heartbeat signal as the complicated programmable logic device;
Complicated programmable logic device, for receiving whether enabling signal and the detection heartbeat signal export predeterminated frequency;When
During the heartbeat signal output predeterminated frequency, then continue to detect whether the heartbeat signal exports predeterminated frequency;When the heartbeat
When signal is not output predeterminated frequency, then Restart Signal is sent to the baseboard management controller, so that the substrate management control
System, which is thought highly of, opens and completes fault recovery.
6. system according to claim 5, it is characterised in that the system also includes:
Platform control unit, for sending open signal to the complicated programmable logic device through universal input/output interface, and
The automatically restoring fault work(of baseboard management controller is turned on and off according to the signal control complicated programmable logic device
Energy.
7. the system according to claim 5 or 6, it is characterised in that the complicated programmable logic device includes:
Signal receiving module, for receiving the heartbeat signal transmitted by the baseboard management controller;
Signal detection module, for detecting whether the heartbeat signal exports predeterminated frequency;
Signal transmitting module, for sending Restart Signal to the baseboard management controller.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711424949.4A CN108038019B (en) | 2017-12-25 | 2017-12-25 | Automatic fault recovery method and system for substrate management controller |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711424949.4A CN108038019B (en) | 2017-12-25 | 2017-12-25 | Automatic fault recovery method and system for substrate management controller |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108038019A true CN108038019A (en) | 2018-05-15 |
CN108038019B CN108038019B (en) | 2021-06-11 |
Family
ID=62101154
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711424949.4A Active CN108038019B (en) | 2017-12-25 | 2017-12-25 | Automatic fault recovery method and system for substrate management controller |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108038019B (en) |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109032362A (en) * | 2018-08-31 | 2018-12-18 | 苏州竹原信息科技有限公司 | A kind of tutoring system and its control method based on VR |
CN109032639A (en) * | 2018-07-19 | 2018-12-18 | 郑州云海信息技术有限公司 | A kind of complete machine flogic system upgrade method, system and independent logical device |
CN109240851A (en) * | 2018-08-24 | 2019-01-18 | 郑州云海信息技术有限公司 | A kind of autonomous type realization self-healing method and system of batch BMC |
CN109254894A (en) * | 2018-08-20 | 2019-01-22 | 曙光信息产业(北京)有限公司 | The heartbeat inspecting device and method of chip |
CN109656739A (en) * | 2018-12-10 | 2019-04-19 | 英业达科技有限公司 | Method of servicing, system, mainboard and computer readable storage medium |
CN109669711A (en) * | 2018-12-14 | 2019-04-23 | 郑州云海信息技术有限公司 | A kind of server independently refreshes the method and BMC of CPLD |
CN110213136A (en) * | 2019-06-24 | 2019-09-06 | 山信软件股份有限公司 | A kind of communicating control method and system |
CN111124849A (en) * | 2019-11-08 | 2020-05-08 | 苏州浪潮智能科技有限公司 | Method, device and medium for server fault warning |
TWI697768B (en) * | 2019-03-07 | 2020-07-01 | 神雲科技股份有限公司 | Reset bmc control method |
CN111367700A (en) * | 2020-02-28 | 2020-07-03 | 苏州浪潮智能科技有限公司 | Forced recovery method, system and related components after BMC downtime |
CN111813600A (en) * | 2020-06-29 | 2020-10-23 | 中国长城科技集团股份有限公司 | Controller recovery method, device, terminal and medium |
CN111913551A (en) * | 2019-05-08 | 2020-11-10 | 佛山市顺德区顺达电脑厂有限公司 | Control method for resetting baseboard management controller |
CN111966559A (en) * | 2020-07-14 | 2020-11-20 | 中国长城科技集团股份有限公司 | Fault recovery method and device, electronic equipment and storage medium |
CN111984464A (en) * | 2020-07-25 | 2020-11-24 | 苏州浪潮智能科技有限公司 | Programmable logic device monitoring and restarting method, device and system |
CN112000995A (en) * | 2020-08-06 | 2020-11-27 | 苏州浪潮智能科技有限公司 | Novel case intrusion warning system and method |
CN113359967A (en) * | 2021-04-15 | 2021-09-07 | 山东英信计算机技术有限公司 | Equipment starting method and device |
CN113918383A (en) * | 2021-10-12 | 2022-01-11 | 北京百度网讯科技有限公司 | Core board resetting method, device, equipment, storage medium and program product |
CN114691408A (en) * | 2022-04-18 | 2022-07-01 | 苏州浪潮智能科技有限公司 | Fault detection device for substrate management controller |
CN115237644A (en) * | 2022-06-16 | 2022-10-25 | 广州汽车集团股份有限公司 | System failure processing method, central processing unit and vehicle |
CN116820827A (en) * | 2023-08-28 | 2023-09-29 | 苏州浪潮智能科技有限公司 | Control method and system of substrate management controller of node server |
TWI827031B (en) * | 2022-04-24 | 2023-12-21 | 新加坡商鴻運科股份有限公司 | Detection system and method of substrate management controller |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100017629A1 (en) * | 2008-07-17 | 2010-01-21 | Hitachi, Ltd. | File sharing apparatus and file sharing system |
TW201227272A (en) * | 2010-12-22 | 2012-07-01 | Inventec Corp | A detect device of the peripheral component |
CN103835972A (en) * | 2012-11-20 | 2014-06-04 | 英业达科技有限公司 | Fan rotating speed control system and method for control rotating speed of fan |
CN103885860A (en) * | 2014-03-21 | 2014-06-25 | 浪潮集团有限公司 | Method for achieving BMC double-management hot redundancy by applying IPMI command |
CN105959151A (en) * | 2016-06-22 | 2016-09-21 | 中国工商银行股份有限公司 | High availability stream processing system and method |
CN107145428A (en) * | 2017-05-26 | 2017-09-08 | 郑州云海信息技术有限公司 | A kind of server and server monitoring method |
CN206647293U (en) * | 2017-03-03 | 2017-11-17 | 郑州云海信息技术有限公司 | A kind of server fan rotating speed control system based on CPLD |
-
2017
- 2017-12-25 CN CN201711424949.4A patent/CN108038019B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100017629A1 (en) * | 2008-07-17 | 2010-01-21 | Hitachi, Ltd. | File sharing apparatus and file sharing system |
TW201227272A (en) * | 2010-12-22 | 2012-07-01 | Inventec Corp | A detect device of the peripheral component |
CN103835972A (en) * | 2012-11-20 | 2014-06-04 | 英业达科技有限公司 | Fan rotating speed control system and method for control rotating speed of fan |
CN103885860A (en) * | 2014-03-21 | 2014-06-25 | 浪潮集团有限公司 | Method for achieving BMC double-management hot redundancy by applying IPMI command |
CN105959151A (en) * | 2016-06-22 | 2016-09-21 | 中国工商银行股份有限公司 | High availability stream processing system and method |
CN206647293U (en) * | 2017-03-03 | 2017-11-17 | 郑州云海信息技术有限公司 | A kind of server fan rotating speed control system based on CPLD |
CN107145428A (en) * | 2017-05-26 | 2017-09-08 | 郑州云海信息技术有限公司 | A kind of server and server monitoring method |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109032639A (en) * | 2018-07-19 | 2018-12-18 | 郑州云海信息技术有限公司 | A kind of complete machine flogic system upgrade method, system and independent logical device |
CN109254894A (en) * | 2018-08-20 | 2019-01-22 | 曙光信息产业(北京)有限公司 | The heartbeat inspecting device and method of chip |
CN109254894B (en) * | 2018-08-20 | 2022-03-11 | 中科曙光信息产业成都有限公司 | Device and method for monitoring heartbeat of chip |
CN109240851A (en) * | 2018-08-24 | 2019-01-18 | 郑州云海信息技术有限公司 | A kind of autonomous type realization self-healing method and system of batch BMC |
CN109032362A (en) * | 2018-08-31 | 2018-12-18 | 苏州竹原信息科技有限公司 | A kind of tutoring system and its control method based on VR |
CN109656739A (en) * | 2018-12-10 | 2019-04-19 | 英业达科技有限公司 | Method of servicing, system, mainboard and computer readable storage medium |
CN109669711A (en) * | 2018-12-14 | 2019-04-23 | 郑州云海信息技术有限公司 | A kind of server independently refreshes the method and BMC of CPLD |
CN109669711B (en) * | 2018-12-14 | 2021-10-29 | 郑州云海信息技术有限公司 | Method for server to automatically refresh CPLD and BMC |
TWI697768B (en) * | 2019-03-07 | 2020-07-01 | 神雲科技股份有限公司 | Reset bmc control method |
CN111913551A (en) * | 2019-05-08 | 2020-11-10 | 佛山市顺德区顺达电脑厂有限公司 | Control method for resetting baseboard management controller |
CN111913551B (en) * | 2019-05-08 | 2024-04-19 | 佛山市顺德区顺达电脑厂有限公司 | Control method for resetting baseboard management controller |
CN110213136A (en) * | 2019-06-24 | 2019-09-06 | 山信软件股份有限公司 | A kind of communicating control method and system |
CN111124849A (en) * | 2019-11-08 | 2020-05-08 | 苏州浪潮智能科技有限公司 | Method, device and medium for server fault warning |
CN111367700A (en) * | 2020-02-28 | 2020-07-03 | 苏州浪潮智能科技有限公司 | Forced recovery method, system and related components after BMC downtime |
CN111813600A (en) * | 2020-06-29 | 2020-10-23 | 中国长城科技集团股份有限公司 | Controller recovery method, device, terminal and medium |
CN111966559A (en) * | 2020-07-14 | 2020-11-20 | 中国长城科技集团股份有限公司 | Fault recovery method and device, electronic equipment and storage medium |
CN111966559B (en) * | 2020-07-14 | 2023-12-15 | 中国长城科技集团股份有限公司 | Fault recovery method and device, electronic equipment and storage medium |
CN111984464A (en) * | 2020-07-25 | 2020-11-24 | 苏州浪潮智能科技有限公司 | Programmable logic device monitoring and restarting method, device and system |
CN111984464B (en) * | 2020-07-25 | 2023-01-10 | 苏州浪潮智能科技有限公司 | Programmable logic device monitoring and restarting method, device and system |
CN112000995A (en) * | 2020-08-06 | 2020-11-27 | 苏州浪潮智能科技有限公司 | Novel case intrusion warning system and method |
CN112000995B (en) * | 2020-08-06 | 2022-12-09 | 苏州浪潮智能科技有限公司 | Novel case intrusion warning system and method |
CN113359967B (en) * | 2021-04-15 | 2022-04-22 | 山东英信计算机技术有限公司 | Equipment starting method and device |
CN113359967A (en) * | 2021-04-15 | 2021-09-07 | 山东英信计算机技术有限公司 | Equipment starting method and device |
CN113918383A (en) * | 2021-10-12 | 2022-01-11 | 北京百度网讯科技有限公司 | Core board resetting method, device, equipment, storage medium and program product |
CN114691408A (en) * | 2022-04-18 | 2022-07-01 | 苏州浪潮智能科技有限公司 | Fault detection device for substrate management controller |
TWI827031B (en) * | 2022-04-24 | 2023-12-21 | 新加坡商鴻運科股份有限公司 | Detection system and method of substrate management controller |
CN115237644A (en) * | 2022-06-16 | 2022-10-25 | 广州汽车集团股份有限公司 | System failure processing method, central processing unit and vehicle |
CN115237644B (en) * | 2022-06-16 | 2024-04-23 | 广州汽车集团股份有限公司 | System fault processing method, central operation unit and vehicle |
CN116820827A (en) * | 2023-08-28 | 2023-09-29 | 苏州浪潮智能科技有限公司 | Control method and system of substrate management controller of node server |
CN116820827B (en) * | 2023-08-28 | 2024-01-23 | 苏州浪潮智能科技有限公司 | Control method and system of substrate management controller of node server |
Also Published As
Publication number | Publication date |
---|---|
CN108038019B (en) | 2021-06-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108038019A (en) | A kind of automatically restoring fault method and system of baseboard management controller | |
CN107179957B (en) | Physical machine fault classification processing method and device and virtual machine recovery method and system | |
CN102132523B (en) | Device power management using network connections | |
CN104268061B (en) | A kind of storage state monitoring method suitable for virtual machine | |
CN110807064B (en) | Data recovery device in RAC distributed database cluster system | |
CN105204955B (en) | A kind of virtual-machine fail restorative procedure and device | |
CN108427616A (en) | background program monitoring method and monitoring device | |
CN110109782B (en) | Method, device and system for replacing fault PCIe (peripheral component interconnect express) equipment | |
CN106032619B (en) | Machine communicating with washing method | |
CN108228374A (en) | A kind of fault handling method of equipment, apparatus and system | |
CN107819808A (en) | Communicate to connect method for building up and device | |
CN110532096B (en) | System and method for multi-node grouping parallel deployment | |
CN105260274A (en) | Method for detecting random hot plug stability of hard disk based on linux | |
CN107070747B (en) | Device, system and method for automatically testing network card network connection stability in network card binding mode | |
CN117251333A (en) | Method, device, equipment and storage medium for acquiring hard disk information | |
CN1322422C (en) | Automatic startup of cluster system after occurrence of recoverable error | |
CN110032465A (en) | A kind of BMC restarts log recording method and device | |
US8421614B2 (en) | Reliable redundant data communication through alternating current power distribution system | |
CN109783390A (en) | PSU firmware promotion and demotion stability test method, apparatus, terminal and storage medium | |
CN108897646A (en) | A kind of switching method and baseboard management controller of BIOS chip | |
US6973412B2 (en) | Method and apparatus involving a hierarchy of field replaceable units containing stored data | |
CN109254894B (en) | Device and method for monitoring heartbeat of chip | |
CN110413435A (en) | A kind of communication failure restoration methods, system and associated component | |
CN105912414A (en) | Method and system for server management | |
CN112148527A (en) | Server power-off method, device and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220810 Address after: 100089 building 36, courtyard 8, Dongbeiwang West Road, Haidian District, Beijing Patentee after: Dawning Information Industry (Beijing) Co.,Ltd. Patentee after: DAWNING INFORMATION INDUSTRY Co.,Ltd. Address before: 100193 No. 36 Building, No. 8 Hospital, Wangxi Road, Haidian District, Beijing Patentee before: Dawning Information Industry (Beijing) Co.,Ltd. |