CN110187994A - A kind of failure separation method, equipment and fault isolation system - Google Patents
A kind of failure separation method, equipment and fault isolation system Download PDFInfo
- Publication number
- CN110187994A CN110187994A CN201910452699.8A CN201910452699A CN110187994A CN 110187994 A CN110187994 A CN 110187994A CN 201910452699 A CN201910452699 A CN 201910452699A CN 110187994 A CN110187994 A CN 110187994A
- Authority
- CN
- China
- Prior art keywords
- equipment
- launched
- hardware module
- microcontroller
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0793—Remedial or corrective actions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G06F11/1415—Saving, restoring, recovering or retrying at system level
- G06F11/1441—Resetting or repowering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/4401—Bootstrapping
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
- H04L41/0659—Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
Abstract
The application provides a kind of failure separation method, equipment and fault isolation system, for improving the robustness of system.Include: in microcontroller equipment to be launched n-th of hardware module whether break down, n-th of hardware module is any one in N number of identical hardware module that the equipment to be launched includes, and N is integer more than or equal to 2;If so, the microcontroller controls electricity under the power supplys of other modules in the equipment to be launched in addition to the storage medium ceased for memory plane information carrying;Wherein, the onboard information is the hardware information of n-th of hardware module;The microcontroller modifies the onboard information, so that the equipment to be launched skips n-th of hardware module in the initialization procedure of restarting.
Description
Technical field
This application involves field of computer technology, in particular to a kind of failure separation method, equipment and fault isolation system.
Background technique
Currently, for the system including more hardware modules, the frame type exchanger for example including multiple business boards, including it is more
The separate unit interchanger of a exchange media access control (Media Access Control, MAC) chip, when starting the system,
The hardware module (business board, exchange MAC chip) for including to system is needed to initialize.
In initialization procedure, if any one hardware module breaks down in the hardware module that system includes, it will lead to
Initialization failure, and then lead to system starting failure, it is lower so as to cause the robustness (Robust) of system.
Summary of the invention
The embodiment of the present application provides a kind of failure separation method, equipment and fault isolation system, for improving the Shandong of system
Stick.
In a first aspect, this application provides a kind of failure separation methods, comprising:
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, and n-th of hardware module is
Any one in N number of identical hardware module that the equipment to be launched includes, N are the integer more than or equal to 2;
If so, the microcontroller controls in the equipment to be launched in addition to the storage medium ceased for memory plane information carrying
It is electric under the power supply of other modules;Wherein, the onboard information is the hardware information of n-th of hardware module;
The microcontroller modifies the onboard information, so that initialization procedure of the equipment to be launched in restarting
In skip n-th of hardware module.
In the embodiment of the present application, determine that n-th of hardware module there are when failure, then controls equipment to be launched in microcontroller
On other modules in addition to storage medium where the onboard information of n-th hardware module power supply under electricity, and modify n-th of hardware
The purpose of n-th of hardware module is isolated to realize for the onboard information of module, to guarantee when n-th of hardware module breaks down
Also it can complete to initialize, and then can be improved the robustness of system.
On the other hand, the application be controlled by microcontroller the power supply that equipment to be launched includes each module power-on and power-off and
The isolation that faulty hardware module is realized to the modification of the onboard information of the hardware device of failure, it is complicated to avoid passing through design
Software realization faulty hardware equipment is evaded, to simplify the process of Fault Isolation.
In a possible embodiment, microcontroller judges whether n-th of hardware module event occurs in equipment to be launched
Barrier, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used for
N-th of hardware module is characterized to break down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The first finger that the microcontroller is sent according to the equipment to be launched whether is received in predetermined time period
Show information to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described the
N hardware module initializes successfully.
In the embodiment of the present application, microcontroller is used to indicate n-th of hardware mould receive that equipment to be launched sends
When the instruction information that block breaks down, then it can determine that n-th of hardware module breaks down;Or in predetermined time period not
Receive that equipment to be launched sends be used to indicate n-th of hardware module initialization successfully instruction information when, then can determine
N-th of hardware module breaks down, and the mode that microcontroller determines whether n-th of hardware module breaks down is more flexible.
In a possible embodiment, microcontroller receives the first instruction information that the equipment to be launched is sent, institute
State the first instruction information for characterize the failure of n-th hardware module, comprising:
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is used to indicate
M-th of memory modules RANK of n-th of dual inline memory module DIMM of the equipment to be launched breaks down, and described the
M RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is for characterizing
N-th of business board of the equipment to be launched breaks down;Or
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is for characterizing
N-th of exchange chip of the equipment to be launched breaks down.
In the embodiment of the present application, when equipment to be launched is different types of equipment, the first instruction information includes interior
Hold slightly different, therefore those of ordinary skill in the art can be adaptively adjusted in the first instruction information and wrap according to actual needs
The content included, so that failure separation method provided by the present application has wider array of application range.
In a possible embodiment, the microcontroller modifies the onboard information, so that the equipment to be launched
N-th of hardware module is skipped in the initialization procedure of restarting, comprising:
The microcontroller modifies the serial sensing SPD information on n-th of DIMM, so that the equipment to be launched exists
M-th of RANK of n-th of DIMM is skipped in the initialization procedure of restarting;Or
The hardware state that the microcontroller modifies n-th of business board is mistake, so that the equipment to be launched
N-th of business board is skipped in the initialization procedure of restarting;Or
The hardware state that the microcontroller modifies n-th of exchange chip is mistake, so that the equipment to be launched
N-th of exchange chip is skipped in the initialization procedure of restarting.
In a possible embodiment, the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is used for
It characterizes n-th of hardware module and restores normal;
The microcontroller modifies the onboard information according to the second indication information, so that the equipment weight to be launched
N number of identical hardware module can be successively initialized when new starting.
In the embodiment of the present application, it when microcontroller receives the second indication information that equipment to be launched is sent, then repairs
Change the onboard information of n-th of hardware module so that equipment to be launched restarting when can successively initialize it is N number of identical
Hardware module, to avoid when n-th of hardware module restores normal, the still not initialization to n-th of hardware module can be reduced
The utilization rate of hardware module.
Second aspect, the application provide a kind of failure separation method, comprising:
Equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, N be greater than or
Integer equal to 2;
When n-th of hardware module in N number of identical hardware module initializes failure, the equipment to be launched to
Microcontroller sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with
Indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is opening again
N-th of hardware module is skipped in dynamic initialization procedure;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to
Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the equipment to be launched sends the first instruction information to microcontroller, and described the
One instruction information breaks down for characterizing n-th of hardware module, comprising:
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing
M-th of memory modules RANK of n-th of dual inline memory module DIMM of equipment to be launched breaks down, and described m-th
RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing
N-th of business board of equipment to be launched breaks down;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing
N-th of exchange chip of equipment to be launched breaks down.
In a possible embodiment, the method also includes:
In the equipment to be launched restarting, whether the equipment to be launched detects n-th of hardware module extensive
It is multiple normal;
When n-th of hardware module restores normal, the equipment to be launched sends second to the microcontroller and refers to
Show information, the second indication information is used to indicate the microcontroller and modifies the onboard information, to be launched sets so that described
N number of identical hardware module can be successively initialized when standby restarting.
The third aspect, the application provide a kind of microcontroller, comprising:
Judging unit, for judging whether n-th of hardware module breaks down in equipment to be launched, n-th of the hardware
Module is any one in N number of identical hardware module that equipment to be launched includes, and N is the integer more than or equal to 2;
Control unit is used for for controlling to remove in the equipment to be launched when n-th of hardware module breaks down
Electricity under the power supply of other modules outside the storage medium of memory plane information carrying breath;Wherein, the onboard information is n-th of the hardware
The hardware information of module;
Unit is modified, for modifying the onboard information, so that initialization of the equipment to be launched in restarting
N-th of hardware module is skipped in journey.
In a possible embodiment, whether n-th of hardware module is judged in equipment to be launched in the judging unit
When failure, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is for characterizing described n-th
A hardware module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
When the judging unit judges whether n-th of hardware module breaks down in equipment to be launched, it is specifically used for:
Judge according to the first instruction information that the equipment to be launched is sent whether is received in predetermined time period
Whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate n-th of hardware module
It initializes successfully.
In a possible embodiment, it is used to receive the first finger that the equipment to be launched is sent in the judging unit
Show that information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is used to indicate described wait open
M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, and m is whole greater than zero
Number, m-th of RANK are any one in the M RANK that n-th of DIMM includes, and M is whole more than or equal to 2
Number;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing
N-th of business board of dynamic equipment breaks down;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing
N-th of exchange chip of dynamic equipment breaks down.
In a possible embodiment, the onboard information is modified in the modification unit, to be launched is set so that described
For when skipping n-th of hardware module in the initialization procedure of restarting, it is specifically used for:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in restarting
M-th of RANK of n-th of DIMM is skipped in initialization procedure;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in restarting
N-th of business board is skipped in initialization procedure;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in restarting
N-th of exchange chip is skipped in initialization procedure.
In a possible embodiment, the microcontroller further include:
Receiving unit, the second indication information sent for receiving the equipment to be launched, the second indication information are used
Restore normal in characterizing n-th of hardware module;
Correspondingly, the modification unit is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched is restarted
Successively initialize N number of identical hardware module.
Fourth aspect, the application provide a kind of equipment to be launched, comprising:
Initialization unit, for initializing to N number of identical hardware module that the equipment to be launched includes, N is
Integer more than or equal to 2;
Transmission unit, when initializing failure for n-th of hardware module in N number of identical hardware module, to
Microcontroller sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with
Indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is opening again
N-th of hardware module is skipped in dynamic initialization procedure;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to
Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the first instruction information is sent to microcontroller in the transmission unit, described the
When one instruction information breaks down for characterizing n-th of hardware module, it is specifically used for:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
M-th of memory modules RANK of n dual inline memory module DIMM breaks down, and m-th of RANK is described n-th
Any one in the M RANK that DIMM includes, M are the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
N business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
N exchange chip breaks down.
In a possible embodiment, the equipment to be launched further include:
Detection unit, for detecting whether n-th of hardware module is restored in the equipment to be launched restarting
Normally;
Correspondingly, the transmission unit is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller transmission second indication information, described second
Instruction information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched is restarted
Successively initialize N number of identical hardware module.
5th aspect, the application provides a kind of fault isolation system, including first aspect, any possibility of first aspect
Design and second aspect, the microcontroller in any possible design of second aspect and equipment to be launched.
6th aspect, the application provides a kind of computer storage medium, for being stored as described in the above-mentioned third aspect
Computer software instructions used in equipment to be launched described in microcontroller or fourth aspect, and include for executing above-mentioned
It on the one hand, is the program and second aspect, second party of microcontroller design in the possible embodiment of any one of first aspect
It is program designed by equipment to be launched in any one embodiment in face.
In the embodiment of the present application, determine that n-th of hardware module there are when failure, then controls equipment to be launched in microcontroller
On other modules in addition to storage medium where the onboard information of n-th hardware module power supply under electricity, and modify n-th of hardware
The purpose of n-th of hardware module is isolated to realize for the onboard information of module, to guarantee when n-th of hardware module breaks down
Also it can complete to initialize, and then can be improved the robustness of system.
On the other hand, the application be controlled by microcontroller the power supply that equipment to be launched includes each module power-on and power-off and
The isolation that faulty hardware module is realized to the modification of the onboard information of the hardware device of failure, it is complicated to avoid passing through design
Software realization faulty hardware equipment is evaded, to simplify the process of Fault Isolation.
Detailed description of the invention
Fig. 1 is a kind of schematic diagram for application scenarios that one embodiment of the application provides;
Fig. 2 be Fig. 1 provided by the present application in equipment to be launched be server schematic diagram;
Fig. 3 be Fig. 1 provided by the present application in equipment to be launched be rack-mount server schematic diagram;
Fig. 4 be Fig. 1 provided by the present application in equipment to be launched be single server schematic diagram;
Fig. 5 is a kind of flow diagram of failure separation method provided by the embodiments of the present application;
Fig. 6 is that process when equipment to be launched is server in a kind of failure separation method provided by the embodiments of the present application is shown
It is intended to;
Fig. 7 is the structural schematic diagram of microcontroller provided by the embodiments of the present application;
Fig. 8 is the structural schematic diagram of equipment to be launched provided by the present application.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application is described.
Hereinafter, the part term in the embodiment of the present application is explained, in order to those skilled in the art understand that.
(1), equipment to be launched can be the equipment including multiple same hardwares.Such as server, it may include more
A memory;Router or frame type exchanger, may include management board and multiple business boards, and management board is mainly responsible for whole
The management work of machine carries out initial configuration to business board card, and the working condition of monitoring business board is with the presence or absence of abnormal, industry
Business board mainly provides various interfaces, provides Ethernet (Gigabit Ethernet, GE) interface, Sonet/SDH as example
On data packet (Packet OverSonet/SDH, POS) interface, channelizing POS (Channelized POS, CPOS) interface,
Asynchronous transfer mode (Asynchronous Transfer Mode, ATM) interface etc., to carry various specific business;Separate unit
Interchanger may include multiple exchange chips, wherein exchange chip can be MAC chip.Here it will be understood that equipment to be launched
Be also possible to include multiple same hardwares modularized equipment.
(2) serial sensing (Serial Presence Detect, SPD) is one group of configuration information about memory modules,
Be generally stored inside one on memory bar using small outline integrated circuit (Small Outline Integrated Circuit,
SOIC) Erarable Programmable Read only Memory (the Erasable Programmable Read-Only of 8 needles encapsulated
Memory, EEPROM) in chip, capacity is 256 bytes.Many important informations of memory, such as memory are had recorded in SPD chip
Chip and the parameters such as Mo Zu manufacturer, working frequency, operating voltage, speed, capacity, voltage and row, column address bandwidth.
(3) Channel, the Memory control that can be central processing unit (Central Processing Unit, CPU) are logical
Road, a CPU can have multiple Channel, a Channel that can correspond to multiple dual inline memory module (Dual
Inline Memory Modules, DIMM), wherein there can be multiple memory modules RANK on a DIMM.
RANK, Memory Controller Hub control CPU every time can only with memory carry out one group of 64 bit (bit) data exchange, one
The bandwidth of a 64bit is exactly a RANK, i.e. 1RANK=64bit.
Each RANK, which includes memory grain 1 to memory grain M, M, can be 8,16 or for other numerical value.
(4) the terms "and/or", only a kind of incidence relation for describing affiliated partner, indicates may exist three
Kind relationship, for example, A and/or B, can indicate: individualism A exists simultaneously A and B, these three situations of individualism B.In addition,
Character "/" herein typicallys represent the relationship that forward-backward correlation object is a kind of "or" unless otherwise specified.And in the application reality
It applies in the description of example, the vocabulary such as " first ", " second ", is only used for distinguishing the purpose of description, be not understood to indicate or imply
Relative importance can not be interpreted as indication or suggestion sequence.
It referring to Figure 1, is a kind of application scenarios of the embodiment of the present application.It include: microcontroller 100, equipment to be launched
101.Microcontroller 100 is by spare (standby) power supply power supply, and corresponding microcontroller 100 is to each mould in equipment 101 to be launched
The power supply power supply of block, after microcontroller 100 is the power supply electrifying for each module that equipment 101 to be launched includes, equipment to be launched
In control module (such as CPU) treat the identical hardware module that starting device includes and initialized, it is to be launched to start
Equipment.
Microcontroller 100 can be STM32 chip, a similar single-chip microcontroller.
Equipment 101 to be launched can be server, specifically refer to Fig. 2, and server includes control module, Memory Controller Hub
And memory.Wherein, control module can be by CPU and basic input output system (Basic Input Output
System, BIOS) or Bootloader composition, CPU may include P Memory control channel (Channel);Memory is by multiple
Memory bar composition, by taking Channel1 as an example, Channel1 can correspond to DIMM 1 to DIMM N.One DIMM may include M
RANK.Wherein, under normal circumstances, the number of Channel of numerical value and CPU of N is identical, and M is 2 or 4.The EEPROM of each DIMM
In be stored with the SPD information of the DIMM.Further include chip selection signal (Chip Select, CS) in Fig. 2, is wrapped on DIMM for selecting
Which of RANK included and address signal and data-signal.
Fig. 3 is referred to, equipment 101 to be launched can be frame type exchanger, configured with management board and multiple business boards
Card.Management board is connect with multiple business boards, includes the storage medium for memory plane information carrying breath on each business board.Its
In, the onboard information of business board includes but is not limited to: the performance number of board, board type, board hardware state.
Under the scene, microcontroller is also possible to be integrated in management board, that is to say, that microcontroller and management
Board is integrated in one.
Fig. 4 is referred to, equipment 101 to be launched can be separate unit interchanger, including multiple exchange chips, in addition, separate unit is handed over
It further include the storage medium for memory plane information carrying breath on changing planes.Wherein, the onboard information of exchange chip includes but is not limited to: handing over
Change the power of chip, the power of interchanger, exchange chip quantity, the hardware signal of separate unit interchanger, separate unit exchange hardware shape
State etc..
Technical solution provided by the embodiments of the present application is introduced with reference to the accompanying drawing, during following introduction, incite somebody to action this
For the technical solution that application provides is applied in application scenarios shown in Fig. 1.
Fig. 5 is referred to, one embodiment of the application provides a kind of failure separation method, and the process of this method is described as follows:
S501: equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, and N is
Integer more than or equal to 2;
In the embodiment of the present application, as shown in figure 2 above-Fig. 4, equipment to be launched can be server, frame type exchanger
Or separate unit interchanger.When equipment to be launched is different type equipment, N number of identical hardware module that equipment to be launched includes
It is not identical.Such as when equipment to be launched is server, N number of identical hardware module can be memory, such as shown in Fig. 2
N number of DIMM;When equipment to be launched is frame type exchanger, N number of identical hardware module can be business board;To be launched
When equipment is separate unit interchanger, N number of identical hardware module can be exchange chip.
When equipment to be launched is different type equipment, initialization procedure when starting is slightly different, then distinguishes below
It describes in detail.
It in the embodiment of the present application, is first device power to be launched by microcontroller before equipment to be launched starting,
Microcontroller is the power supply electrifying for the storage medium of memory plane information carrying breath first, then to deposit in equipment to be launched except above-mentioned
The power supply electrifying of other modules outside storage media.
If microcontroller first controls the power supply electrifying of SPD, then controls the power supply of CPU so that equipment to be launched is server as an example
With the power supply electrifying of memory, wherein the electric sequence of the power supply of power supply and memory to CPU is with no restrictions.Correspondingly, servicing
After device starting, CPU, which reads the SPD on each DIMM by I2C bus using BIOS or startup program (Boot loader), to be believed
Breath, automatically configures corresponding memory working sequence according to SPD information by the Memory control channel of CPU by CPU and posts with control
Storage, and self-test is carried out, so as to give full play to the performance of memory bar.
If microcontroller first controls onboard for storing on business board so that equipment to be launched is frame type exchanger as an example
The power supply electrifying of the storage medium of information, then control on the power supply of the management power supply of board of frame type exchanger, business board
Electricity, wherein with no restrictions to the electric sequence of the power supply of the management power supply of board, business board.Correspondingly, being exchanged in rack
When device starts, the onboard information stored in the storage medium that management board card passes through reading business board carries out just business board card
Beginningization configuration, and establish corresponding complete machine control management strategy.
If microcontroller elder generation control panel is used for the storage of memory plane information carrying breath so that equipment to be launched is separate unit interchanger as an example
The power supply electrifying of medium, then control the power supply electrifying of the power supply of the CPU of single server, exchange chip, wherein to the electricity of CPU
Source, exchange chip power supply electric sequence with no restrictions.In the starting of separate unit interchanger, it is integrated in separate unit interchanger
CPU reads the onboard information stored in storage medium by I2C bus, initializes to exchange chip.
S502: microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, n-th of the hardware
Module is any one in N number of identical hardware module that the equipment to be launched includes.
In the embodiment of the present application, microcontroller judges the realization process whether n-th of hardware module breaks down, packet
It includes:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used for
N-th of hardware module is characterized to break down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The first finger that the microcontroller is sent according to the equipment to be launched whether is received in predetermined time period
Show information to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described the
N hardware module initializes successfully.
In the embodiment of the present application, initialization procedure successively is carried out to each hardware module after equipment to be launched starting
In, after one hardware module of successful initialization, just then initialize next hardware module.It is being initialised to n-th of hardware
When module, if n-th of hardware module initialization failure, equipment to be launched send the first instruction information to microcontroller, first refers to
Show and information is used to indicate n-th of hardware module there are failures.In this way, then can when microcontroller receives the first instruction information
Enough judge that n-th of hardware module breaks down.
During specific implementation, when equipment to be launched is different type equipment, the content of the first instruction information is also slightly
It is different.Specifically, n-th of biserial that the first instruction information is used to characterize server is straight when equipment to be launched is server
M-th of RANK for inserting formula memory module DIMM breaks down, and m-th of RANK is the M RANK that n-th of DIMM includes
In any one, M is integer more than or equal to 2;When equipment to be launched is frame type exchanger, the first instruction information is used
It breaks down in n-th of business board of characterization frame type exchanger;When equipment to be launched is separate unit interchanger, the first instruction
N-th of exchange chip that information is used to characterize the separate unit interchanger breaks down.
In the embodiment of the present application, initialization procedure is successively carried out to each hardware module after equipment to be launched starting
In, after one hardware module of successful initialization, equipment to be launched can to microcontroller send first instruction information, first
Instruction information is used to indicate the hardware module and initializes successfully, just then initializes next hardware module, is being initialised to n-th
When a hardware module, if n-th of hardware module initialization failure, equipment to be launched send the first instruction without normal direction microcontroller
Information.In this way, microcontroller is not when predetermined time period receives the first instruction information of the transmission of equipment to be launched, then table
Bright n-th of hardware module breaks down.Wherein, predetermined time period can be 5 received after one first instruction information
Millisecond (ms), 10ms or be other time length, those of ordinary skill in the art can be configured according to actual needs.
During specific implementation, when equipment to be launched is different type equipment, the content of the first instruction information is also slightly
It is different.Specifically, the first instruction information is used to indicate m-th of RANK of n-th of DIMM when equipment to be launched is server
It initializes successfully;When equipment to be launched is frame type exchanger, it is initial that the first instruction information is used to indicate n-th of business board
It is melted into function;When equipment to be launched is separate unit interchanger, the first instruction information is used to indicate n-th of exchange chip and is initialized to
Function.
S503: if so, the microcontroller controls in the equipment to be launched except the storage for memory plane information carrying breath is situated between
It is electric under the power supply of other modules outside matter;Wherein, the onboard information is the hardware information of n-th of hardware module.
To allow equipment to be launched during initialization to faulty hardware module unaware, then needing N number of hardware module
Electricity under the power supply of other modules in starting device in addition to the storage medium ceased for memory plane information carrying is treated, to modify storage
Onboard information on medium.During specific implementation, when equipment to be launched is server, microcontroller is then by the electricity of CPU
Source, memory power supply under electricity, retain the power supply of SPD;When equipment to be launched is frame type exchanger, microcontroller then will management
It is electric under the power supply of board, the power supply of business board, it is preserved for the power supply of the storage medium of memory plane information carrying breath;It is set to be launched
It is standby when being separate unit interchanger, the power supply for the CPU that microcontroller then integrates separate unit interchanger, electricity under the power supply of exchange chip, and
It is preserved for the power supply of the storage medium of memory plane information carrying breath.It should be understood that onboard information herein refers to n-th of hardware module
Onboard information.
S504: the microcontroller modifies the onboard information, so that the equipment to be launched is in the initial of restarting
N-th of hardware module is skipped during changing.
During specific implementation, when equipment to be launched is different types of equipment, microcontroller modifies onboard information
Content it is also slightly different, will be made below being discussed in detail.
When equipment to be launched is server, there is one piece of storage region in the EEPROM of each DIMM, it is every for storing
The number for the RANK for including on a DIMM, by taking n-th of DIMM as an example, if n-th of DIMM includes two RANK, the storage region
It is middle to record 11, if DIMM includes four RANK, 1111 are recorded in the storage region.It include below two with n-th of DIMM
For RANK, when first RANK on n-th of DIMM breaks down, microcontroller passes through I2C bus for the storage region
The SPD information of storage is revised as 01, only includes a RANK to characterize n-th of DIMM, second RANK on n-th of DIMM
When failure, then the SPD information that can be stored the storage region is revised as 10, only includes one to characterize n-th of DIMM
RANK.In this way, the initialization to n-th of DIMM, m-th of RANK can be skipped in server restarting, so that clothes
Business device can still be able to initialize successfully, to improve server when the memory grain of some memory breaks down
Robustness.
6 complete descriptions are when equipment to be launched is server with reference to the accompanying drawing, failure separation method provided by the present application.
As shown in fig. 6, microcontroller first controls the power supply electrifying of SPD, then controls the power supply and memory of CPU on a microcontroller after electricity
Power supply electrifying.After the power supply electrifying of each module in the server, CPU passes through I2C bus using BIOS or Boot loader
The SPD information on each DIMM is read, then each DIMM is initialized.Specifically, from first of first DIMM
RANK starts, if initializing successfully, successively carries out, normally to start server.During initialization, if n-th of DIMM
M-th of RANK initialize failure, it is abnormal that the CPU of server then indicates that m-th of RANK of n-th of DIMM of microcontroller occurs,
Microcontroller then controls electricity under the power supply of the power supply of CPU, memory, and retains the power supply of SPD, to modify the SPD letter of n-th of DIMM
Breath.After the completion of modification, then control electric under the power supply of SPD.Again to the power supply electrifying of the power supply of the power supply of SPD, CPU, memory,
Then DIMM is initialized according to modified SPD information after the completion of powering on.
When equipment to be launched is frame type exchanger, microcontroller is when determining that n-th of business board breaks down, then
The hardware state of n-th of business board of modification is wrong (FALL).Again to the failure when to avoid restarting
Business board card carries out power distribution and initialization action, causes board system to block or manages plan using the control of mistake
Slightly, such as failure board is distributed power to, causes the waste of resource.
When equipment to be launched is separate unit interchanger, microcontroller is then repaired when determining that n-th of exchange chip breaks down
The hardware state for changing n-th of exchange chip is mistake, to avoid next time when restarting again to the exchange chip of failure
Initialization so that separate unit interchanger part exchange chip work in the case where can also start.
Herein, it should be noted that, can be first to onboard for storing after microcontroller modifies to onboard information
It is electric under the power supply of the storage medium of information, in the power supply electrifying to other modules in addition to storage medium, so as to be launched
Equipment initializes N number of identical hardware module according to modified onboard information.It is of course also possible to directly to except storage
The power supply electrifying of other modules outside medium, so that equipment to be launched is according to modified onboard information to N number of identical
Hardware module is initialized.
In the embodiment of the present application, the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is used for
It characterizes n-th of hardware module and restores normal;
The microcontroller modifies the board information according to the second indication information, so that the equipment weight to be launched
N number of identical hardware module can be successively initialized when new starting.
It is still not initial to avoid when n-th of hardware module of failure restores normal during specific implementation
Change, reduce the service efficiency of equipment to be launched, then in each restarting, n-th of hardware module of detection is equipment to be launched
It is no to restore normal.When n-th of hardware module restores normal, then second indication information, the second instruction letter are sent to microcontroller
Breath restores normal for characterizing n-th of hardware module.
When microcontroller receives the second indication information that equipment to be launched is sent, then onboard information is modified, so that
When equipment to be launched restarts, N number of identical hardware module can be successively initialized.
Based on the same inventive concept, Fig. 7 is referred to, the application also provides a kind of microcontroller 700, comprising:
Judging unit 701, for judging whether n-th of hardware module breaks down in equipment to be launched, described n-th hard
Part module is any one in N number of identical hardware module that the equipment to be launched includes, and N is whole more than or equal to 2
Number;
Control unit 702, for controlling and being removed in the equipment to be launched when n-th of hardware module breaks down
For electricity under the power supply of other modules outside the storage medium of memory plane information carrying breath;Wherein, the onboard information is described n-th
The hardware information of hardware module;
Unit 703 is modified, for modifying the onboard information, so that initialization of the equipment to be launched in restarting
N-th of hardware module is skipped in the process.
In a possible embodiment, judge that n-th of hardware module is in equipment to be launched in the judging unit 701
When no failure, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is for characterizing described n-th
A hardware module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
It is specific to use when the judging unit 701 judges whether n-th of hardware module breaks down in equipment to be launched
In:
Judge according to the first instruction information that the equipment to be launched is sent whether is received in predetermined time period
Whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate n-th of hardware module
It initializes successfully.
In a possible embodiment, it is used to receive that the equipment to be launched sends the in the judging unit 701
One instruction information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is used to indicate described wait open
M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, m-th of RANK
Any one in M RANK for including for n-th of DIMM, M are the integer more than or equal to 2;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing
N-th of business board of dynamic equipment breaks down;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing
N-th of exchange chip of dynamic equipment breaks down.
In a possible embodiment, the onboard information is modified in the modification unit 703, so that described to be launched
When equipment skips n-th of hardware module in the initialization procedure of restarting, it is specifically used for:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in restarting
M-th of RANK of n-th of DIMM is skipped in initialization procedure;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in restarting
N-th of business board is skipped in initialization procedure;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in restarting
N-th of exchange chip is skipped in initialization procedure.
In a possible embodiment, the microcontroller further include:
Receiving unit 704, the second indication information sent for receiving the equipment to be launched, the second indication information
Restore normal for characterizing n-th of hardware module;
Correspondingly, the modification unit 703 is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched is restarted
Successively initialize N number of identical hardware module.
Based on the same inventive concept, Fig. 8 is referred to, the application provides a kind of equipment to be launched 800, comprising:
Initialization unit 801, for being initialized to N number of identical hardware module that the equipment to be launched includes, N
For the integer more than or equal to 2;
Transmission unit 802, when initializing failure for n-th of hardware module in N number of identical hardware module,
The first instruction information is sent to microcontroller, the first instruction information breaks down for characterizing n-th of hardware module,
To indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is again
N-th of hardware module is skipped in the initialization procedure of starting;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to
Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the first instruction information is sent to microcontroller in the transmission unit 802, it is described
When first instruction information breaks down for characterizing n-th of hardware module, it is specifically used for:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
M-th of memory modules RANK of n dual inline memory module DIMM breaks down, and m-th of RANK is described n-th
Any one in the M RANK that DIMM includes, M are the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
N business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched
N exchange chip breaks down.
In a possible embodiment, the equipment to be launched further include:
Detection unit 803, for whether detecting n-th of hardware module in the equipment to be launched restarting
Restore normal;
Correspondingly, the transmission unit 802 is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller transmission second indication information, described second
Instruction information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched is restarted
Successively initialize N number of identical hardware module.
The application also provides a kind of fault isolation system, including the microcontroller 100 as shown in Fig. 7 embodiment and such as Fig. 8
Equipment 101 to be launched shown in embodiment.
The application also provides a kind of computer storage medium, which may include memory, the storage
Device can be stored with program, which includes the microcontroller, to be launched as recorded in preceding embodiment of the method shown in fig. 5 when executing
Overall Steps performed by equipment.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program
Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application
Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more,
The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces
The form of product.
The application be referring to according to the present processes, equipment (system) and computer program product flow chart and/or
Block diagram describes.It should be understood that each process that can be realized by computer program instructions in flowchart and/or the block diagram and/or
The combination of process and/or box in box and flowchart and/or the block diagram.It can provide these computer program instructions to arrive
General purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices processor to generate one
Machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for realizing flowing
The device for the function of being specified in journey figure one process or multiple processes and/or block diagrams one box or multiple boxes.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates,
Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or
The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one
The step of function of being specified in a box or multiple boxes.
Obviously, those skilled in the art can carry out various modification and variations without departing from the essence of the application to the application
Mind and range.In this way, if these modifications and variations of the application belong to the range of the claim of this application and its equivalent technologies
Within, then the application is also intended to include these modifications and variations.
Claims (18)
1. a kind of failure separation method characterized by comprising
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, and n-th of hardware module is described
Any one in N number of identical hardware module that equipment to be launched includes, N are the integer more than or equal to 2;
If so, the microcontroller control it is other in addition to the storage medium ceased for memory plane information carrying in the equipment to be launched
It is electric under the power supply of module;Wherein, the onboard information is the hardware information of n-th of hardware module;
The microcontroller modifies the onboard information, so that the equipment to be launched is jumped in the initialization procedure of restarting
Cross n-th of hardware module.
2. the method according to claim 1, wherein microcontroller judges n-th of hardware mould in equipment to be launched
Whether block breaks down, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing
N-th of hardware module breaks down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The microcontroller is believed according to the first instruction that the equipment to be launched is sent whether is received in predetermined time period
Breath is to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described n-th
Hardware module initializes successfully.
3. according to the method described in claim 2, it is characterized in that, the microcontroller receives what the equipment to be launched was sent
First instruction information, the first instruction information break down for characterizing n-th of hardware module, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used to indicate
M-th of memory modules RANK of n-th of dual inline memory module DIMM of the equipment to be launched breaks down, and described the
M RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing
N-th of business board of the equipment to be launched breaks down;Or
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing
N-th of exchange chip of the equipment to be launched breaks down.
4. according to the method described in claim 3, it is characterized in that, the microcontroller modifies the onboard information, so that institute
It states equipment to be launched and skips n-th of hardware module in the initialization procedure of restarting, comprising:
The microcontroller modifies the serial sensing SPD information on n-th of DIMM, so that the equipment to be launched is again
M-th of RANK of n-th of DIMM is skipped in the initialization procedure of starting;Or
The hardware state that the microcontroller modifies n-th of business board is mistake, so that the equipment to be launched is in weight
N-th of business board is skipped in the initialization procedure newly started;Or
The hardware state that the microcontroller modifies n-th of exchange chip is mistake, so that the equipment to be launched is in weight
N-th of exchange chip is skipped in the initialization procedure newly started.
5. method according to claim 1-4, which is characterized in that the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is for characterizing
N-th of hardware module restores normal;
The microcontroller modifies the onboard information according to the second indication information, so that the equipment to be launched opens again
N number of identical hardware module can be successively initialized when dynamic.
6. a kind of failure separation method characterized by comprising
Equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, N be more than or equal to
2 integer;
When n-th of hardware module in N number of identical hardware module initializes failure, the equipment to be launched is to micro-control
Device processed sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with instruction
The microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is in restarting
N-th of hardware module is skipped in initialization procedure;
Wherein, the onboard information is the hardware information of n-th of hardware module, and n-th of hardware module is to be launched
Any one in N number of identical hardware module that equipment includes.
7. according to the method described in claim 6, it is characterized in that, the equipment to be launched sends the first instruction to microcontroller
Information, the first instruction information break down for characterizing n-th of hardware module, comprising:
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing
M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, m-th of RANK
Any one in M RANK for including for n-th of DIMM, M are the integer more than or equal to 2;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing
N-th of business board of dynamic equipment breaks down;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing
N-th of exchange chip of dynamic equipment breaks down.
8. method according to claim 6 or 7, which is characterized in that the method also includes:
In the equipment to be launched restarting, the equipment to be launched detects whether n-th of hardware module is restored just
Often;
When n-th of hardware module restores normal, the equipment to be launched sends the second instruction letter to the microcontroller
Breath, the second indication information is used to indicate the microcontroller and modifies the onboard information, so that the equipment weight to be launched
N number of identical hardware module can be successively initialized when new starting.
9. a kind of microcontroller characterized by comprising
Judging unit, for judging whether n-th of hardware module breaks down in equipment to be launched, n-th of hardware module
Any one in the N number of identical hardware module for including for the equipment to be launched, N are the integer more than or equal to 2;
Control unit, for controlling in the equipment to be launched except for storing when n-th of hardware module breaks down
It is electric under the power supply of other modules outside the storage medium of onboard information;Wherein, the onboard information is n-th of hardware module
Hardware information;
Unit is modified, for modifying the onboard information, so that the equipment to be launched is in the initialization procedure of restarting
Skip n-th of hardware module.
10. microcontroller according to claim 9, which is characterized in that judge in equipment to be launched in the judging unit
When whether n-th of hardware module breaks down, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is hard for characterizing described n-th
Part module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
When the judging unit judges whether n-th of hardware module breaks down in equipment to be launched, it is specifically used for:
Whether according to receiving in predetermined time period, the first instruction information that the equipment to be launched is sent is described to judge
Whether n-th of hardware module breaks down;Wherein, it is initial to be used to indicate n-th of hardware module for the first instruction information
It is melted into function.
11. microcontroller according to claim 10, which is characterized in that described wait open for receiving in the judging unit
The first instruction information that dynamic equipment is sent, when the first instruction information breaks down for characterizing n-th of hardware module,
It is specifically used for:
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is used to indicate described to be launched set
M-th of memory modules RANK of n-th standby of dual inline memory module DIMM breaks down, and m is the integer greater than zero, institute
Any one in the M RANK that m-th of RANK includes for n-th of DIMM is stated, M is the integer more than or equal to 2;Or
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is for characterizing described to be launched set
N-th standby of business board breaks down;Or
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is for characterizing described to be launched set
N-th standby of exchange chip breaks down.
12. microcontroller according to claim 11, which is characterized in that modify the onboard letter in the modification unit
Breath, it is specific to use when so that the equipment to be launched skipping n-th of hardware module in the initialization procedure of restarting
In:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in the initial of restarting
M-th of RANK of n-th of DIMM is skipped during changing;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in the initial of restarting
N-th of business board is skipped during changing;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in the initial of restarting
N-th of exchange chip is skipped during changing.
13. according to the described in any item microcontrollers of claim 9-12, which is characterized in that the microcontroller further include:
Receiving unit, the second indication information sent for receiving the equipment to be launched, the second indication information are used for table
It levies n-th of hardware module and restores normal;
Correspondingly, the modification unit is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched restarted successively
Initialize N number of identical hardware module.
14. a kind of equipment to be launched characterized by comprising
Initialization unit, for being initialized to N number of identical hardware module that the equipment to be launched includes, N be greater than
Or the integer equal to 2;
Transmission unit, when initializing failure for n-th of hardware module in N number of identical hardware module, to micro-control
Device processed sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with instruction
The microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is in restarting
N-th of hardware module is skipped in initialization procedure;
Wherein, the onboard information is the hardware information of n-th of hardware module, and n-th of hardware module is to be launched
Any one in N number of identical hardware module that equipment includes.
15. equipment according to claim 14, which is characterized in that send first to microcontroller in the transmission unit and refer to
Show that information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched
M-th of memory modules RANK of dual inline memory module DIMM breaks down, and m is the integer greater than zero, and described m-th
RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched
Business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched
Exchange chip breaks down.
16. equipment according to claim 14 or 15, which is characterized in that the equipment to be launched further include:
Detection unit, for detecting whether n-th of hardware module is restored just in the equipment to be launched restarting
Often;
Correspondingly, the transmission unit is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller sends second indication information, second instruction
Information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched restarted successively
Initialize N number of identical hardware module.
17. a kind of computer storage medium, which is characterized in that the computer storage medium is stored with instruction, works as described instruction
When running on computers, so that the computer executes such as the described in any item methods of claim 1-5 or 6-8.
18. a kind of fault isolation system, which is characterized in that including such as described in any item microcontrollers of claim 9-13 and such as
The described in any item equipment to be launched of claim 14-16.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910452699.8A CN110187994A (en) | 2019-05-28 | 2019-05-28 | A kind of failure separation method, equipment and fault isolation system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910452699.8A CN110187994A (en) | 2019-05-28 | 2019-05-28 | A kind of failure separation method, equipment and fault isolation system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110187994A true CN110187994A (en) | 2019-08-30 |
Family
ID=67718274
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910452699.8A Pending CN110187994A (en) | 2019-05-28 | 2019-05-28 | A kind of failure separation method, equipment and fault isolation system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110187994A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111782434A (en) * | 2020-06-30 | 2020-10-16 | 联想(北京)有限公司 | Abnormity detection method and device and electronic equipment |
CN111813731A (en) * | 2020-06-11 | 2020-10-23 | 中国长城科技集团股份有限公司 | Method, device, server and medium for reading memory information |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6463550B1 (en) * | 1998-06-04 | 2002-10-08 | Compaq Information Technologies Group, L.P. | Computer system implementing fault detection and isolation using unique identification codes stored in non-volatile memory |
JP2012190128A (en) * | 2011-03-09 | 2012-10-04 | Nec Corp | Fast startup method, fast startup device and fast startup program against degeneration of hardware configuration |
WO2016106965A1 (en) * | 2014-12-31 | 2016-07-07 | 中兴通讯股份有限公司 | Server self-healing method and device |
WO2017063505A1 (en) * | 2015-10-16 | 2017-04-20 | 中兴通讯股份有限公司 | Method for detecting hardware fault of server, apparatus thereof, and server |
CN109086151A (en) * | 2017-06-13 | 2018-12-25 | 中兴通讯股份有限公司 | The method and device of memory failure is isolated on a kind of server |
CN109151144A (en) * | 2018-08-14 | 2019-01-04 | 深圳市广和通无线通信软件有限公司 | Method of hardware management, device, system, computer equipment and storage medium |
-
2019
- 2019-05-28 CN CN201910452699.8A patent/CN110187994A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6463550B1 (en) * | 1998-06-04 | 2002-10-08 | Compaq Information Technologies Group, L.P. | Computer system implementing fault detection and isolation using unique identification codes stored in non-volatile memory |
JP2012190128A (en) * | 2011-03-09 | 2012-10-04 | Nec Corp | Fast startup method, fast startup device and fast startup program against degeneration of hardware configuration |
WO2016106965A1 (en) * | 2014-12-31 | 2016-07-07 | 中兴通讯股份有限公司 | Server self-healing method and device |
WO2017063505A1 (en) * | 2015-10-16 | 2017-04-20 | 中兴通讯股份有限公司 | Method for detecting hardware fault of server, apparatus thereof, and server |
CN109086151A (en) * | 2017-06-13 | 2018-12-25 | 中兴通讯股份有限公司 | The method and device of memory failure is isolated on a kind of server |
CN109151144A (en) * | 2018-08-14 | 2019-01-04 | 深圳市广和通无线通信软件有限公司 | Method of hardware management, device, system, computer equipment and storage medium |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111813731A (en) * | 2020-06-11 | 2020-10-23 | 中国长城科技集团股份有限公司 | Method, device, server and medium for reading memory information |
CN111813731B (en) * | 2020-06-11 | 2022-10-25 | 中国长城科技集团股份有限公司 | Method, device, server and medium for reading memory information |
CN111782434A (en) * | 2020-06-30 | 2020-10-16 | 联想(北京)有限公司 | Abnormity detection method and device and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102141940B (en) | Device, method and system for recovering firmware update | |
CN104750510A (en) | Chip start method and multi-core processor chip | |
US20060010282A1 (en) | Method and apparatus to boot a system by monitoring an operating status of a NAND flash memory | |
US20130246855A1 (en) | Error Location Specification Method, Error Location Specification Apparatus and Computer-Readable Recording Medium in Which Error Location Specification Program is Recorded | |
CN110187994A (en) | A kind of failure separation method, equipment and fault isolation system | |
CN103164316B (en) | Hardware monitor | |
CN108334372A (en) | firmware upgrade processing method, device and system | |
CN113190405B (en) | Node health detection method and device, electronic equipment and storage medium | |
CN111338698A (en) | Method and system for accurately booting server by BIOS (basic input output System) | |
CN106681877B (en) | Chip debugging system and method and system chip | |
CN100445953C (en) | Method and device for distinguish cool start and hot start | |
CN112199240B (en) | Method for switching nodes during node failure and related equipment | |
US20060173655A1 (en) | Information processing apparatus, failure monitoring method and failure monitoring computer program | |
CN110286852A (en) | Dual control framework distributed memory system, method for reading data, device and storage medium | |
JPH1091289A (en) | Memory initialization device and method | |
CN108897646A (en) | A kind of switching method and baseboard management controller of BIOS chip | |
US20200409813A1 (en) | System and Method to Derive Health Information for a General Purpose Processing Unit Through Aggregation of Board Parameters | |
CN109634796A (en) | A kind of method for diagnosing faults of computer, apparatus and system | |
CN109189638A (en) | A kind of GPU driving detection method, device, terminal and storage medium | |
US8069364B2 (en) | Method to recover from logical path failures | |
TW201328248A (en) | Method for updating system firmware and server system using the same | |
CN101158920A (en) | Method and apparatus for detecting fault of operating system | |
CN101739261B (en) | Switching system for basic input and output system and switching method thereof | |
CN113742198B (en) | Test method, test device, electronic equipment and storage medium | |
CN101311909A (en) | System peculiarity diagnose method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |