CN110187994A - A kind of failure separation method, equipment and fault isolation system - Google Patents

A kind of failure separation method, equipment and fault isolation system Download PDF

Info

Publication number
CN110187994A
CN110187994A CN201910452699.8A CN201910452699A CN110187994A CN 110187994 A CN110187994 A CN 110187994A CN 201910452699 A CN201910452699 A CN 201910452699A CN 110187994 A CN110187994 A CN 110187994A
Authority
CN
China
Prior art keywords
equipment
launched
hardware module
microcontroller
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910452699.8A
Other languages
Chinese (zh)
Inventor
李海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Star Net Ruijie Networks Co Ltd
Original Assignee
Beijing Star Net Ruijie Networks Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Star Net Ruijie Networks Co Ltd filed Critical Beijing Star Net Ruijie Networks Co Ltd
Priority to CN201910452699.8A priority Critical patent/CN110187994A/en
Publication of CN110187994A publication Critical patent/CN110187994A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1441Resetting or repowering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/4401Bootstrapping
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0659Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities

Abstract

The application provides a kind of failure separation method, equipment and fault isolation system, for improving the robustness of system.Include: in microcontroller equipment to be launched n-th of hardware module whether break down, n-th of hardware module is any one in N number of identical hardware module that the equipment to be launched includes, and N is integer more than or equal to 2;If so, the microcontroller controls electricity under the power supplys of other modules in the equipment to be launched in addition to the storage medium ceased for memory plane information carrying;Wherein, the onboard information is the hardware information of n-th of hardware module;The microcontroller modifies the onboard information, so that the equipment to be launched skips n-th of hardware module in the initialization procedure of restarting.

Description

A kind of failure separation method, equipment and fault isolation system
Technical field
This application involves field of computer technology, in particular to a kind of failure separation method, equipment and fault isolation system.
Background technique
Currently, for the system including more hardware modules, the frame type exchanger for example including multiple business boards, including it is more The separate unit interchanger of a exchange media access control (Media Access Control, MAC) chip, when starting the system, The hardware module (business board, exchange MAC chip) for including to system is needed to initialize.
In initialization procedure, if any one hardware module breaks down in the hardware module that system includes, it will lead to Initialization failure, and then lead to system starting failure, it is lower so as to cause the robustness (Robust) of system.
Summary of the invention
The embodiment of the present application provides a kind of failure separation method, equipment and fault isolation system, for improving the Shandong of system Stick.
In a first aspect, this application provides a kind of failure separation methods, comprising:
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, and n-th of hardware module is Any one in N number of identical hardware module that the equipment to be launched includes, N are the integer more than or equal to 2;
If so, the microcontroller controls in the equipment to be launched in addition to the storage medium ceased for memory plane information carrying It is electric under the power supply of other modules;Wherein, the onboard information is the hardware information of n-th of hardware module;
The microcontroller modifies the onboard information, so that initialization procedure of the equipment to be launched in restarting In skip n-th of hardware module.
In the embodiment of the present application, determine that n-th of hardware module there are when failure, then controls equipment to be launched in microcontroller On other modules in addition to storage medium where the onboard information of n-th hardware module power supply under electricity, and modify n-th of hardware The purpose of n-th of hardware module is isolated to realize for the onboard information of module, to guarantee when n-th of hardware module breaks down Also it can complete to initialize, and then can be improved the robustness of system.
On the other hand, the application be controlled by microcontroller the power supply that equipment to be launched includes each module power-on and power-off and The isolation that faulty hardware module is realized to the modification of the onboard information of the hardware device of failure, it is complicated to avoid passing through design Software realization faulty hardware equipment is evaded, to simplify the process of Fault Isolation.
In a possible embodiment, microcontroller judges whether n-th of hardware module event occurs in equipment to be launched Barrier, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used for N-th of hardware module is characterized to break down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The first finger that the microcontroller is sent according to the equipment to be launched whether is received in predetermined time period Show information to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described the N hardware module initializes successfully.
In the embodiment of the present application, microcontroller is used to indicate n-th of hardware mould receive that equipment to be launched sends When the instruction information that block breaks down, then it can determine that n-th of hardware module breaks down;Or in predetermined time period not Receive that equipment to be launched sends be used to indicate n-th of hardware module initialization successfully instruction information when, then can determine N-th of hardware module breaks down, and the mode that microcontroller determines whether n-th of hardware module breaks down is more flexible.
In a possible embodiment, microcontroller receives the first instruction information that the equipment to be launched is sent, institute State the first instruction information for characterize the failure of n-th hardware module, comprising:
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is used to indicate M-th of memory modules RANK of n-th of dual inline memory module DIMM of the equipment to be launched breaks down, and described the M RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is for characterizing N-th of business board of the equipment to be launched breaks down;Or
The microcontroller receives the first instruction information that equipment to be launched is sent, and the first instruction information is for characterizing N-th of exchange chip of the equipment to be launched breaks down.
In the embodiment of the present application, when equipment to be launched is different types of equipment, the first instruction information includes interior Hold slightly different, therefore those of ordinary skill in the art can be adaptively adjusted in the first instruction information and wrap according to actual needs The content included, so that failure separation method provided by the present application has wider array of application range.
In a possible embodiment, the microcontroller modifies the onboard information, so that the equipment to be launched N-th of hardware module is skipped in the initialization procedure of restarting, comprising:
The microcontroller modifies the serial sensing SPD information on n-th of DIMM, so that the equipment to be launched exists M-th of RANK of n-th of DIMM is skipped in the initialization procedure of restarting;Or
The hardware state that the microcontroller modifies n-th of business board is mistake, so that the equipment to be launched N-th of business board is skipped in the initialization procedure of restarting;Or
The hardware state that the microcontroller modifies n-th of exchange chip is mistake, so that the equipment to be launched N-th of exchange chip is skipped in the initialization procedure of restarting.
In a possible embodiment, the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is used for It characterizes n-th of hardware module and restores normal;
The microcontroller modifies the onboard information according to the second indication information, so that the equipment weight to be launched N number of identical hardware module can be successively initialized when new starting.
In the embodiment of the present application, it when microcontroller receives the second indication information that equipment to be launched is sent, then repairs Change the onboard information of n-th of hardware module so that equipment to be launched restarting when can successively initialize it is N number of identical Hardware module, to avoid when n-th of hardware module restores normal, the still not initialization to n-th of hardware module can be reduced The utilization rate of hardware module.
Second aspect, the application provide a kind of failure separation method, comprising:
Equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, N be greater than or Integer equal to 2;
When n-th of hardware module in N number of identical hardware module initializes failure, the equipment to be launched to Microcontroller sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with Indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is opening again N-th of hardware module is skipped in dynamic initialization procedure;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the equipment to be launched sends the first instruction information to microcontroller, and described the One instruction information breaks down for characterizing n-th of hardware module, comprising:
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing M-th of memory modules RANK of n-th of dual inline memory module DIMM of equipment to be launched breaks down, and described m-th RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing N-th of business board of equipment to be launched breaks down;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described for characterizing N-th of exchange chip of equipment to be launched breaks down.
In a possible embodiment, the method also includes:
In the equipment to be launched restarting, whether the equipment to be launched detects n-th of hardware module extensive It is multiple normal;
When n-th of hardware module restores normal, the equipment to be launched sends second to the microcontroller and refers to Show information, the second indication information is used to indicate the microcontroller and modifies the onboard information, to be launched sets so that described N number of identical hardware module can be successively initialized when standby restarting.
The third aspect, the application provide a kind of microcontroller, comprising:
Judging unit, for judging whether n-th of hardware module breaks down in equipment to be launched, n-th of the hardware Module is any one in N number of identical hardware module that equipment to be launched includes, and N is the integer more than or equal to 2;
Control unit is used for for controlling to remove in the equipment to be launched when n-th of hardware module breaks down Electricity under the power supply of other modules outside the storage medium of memory plane information carrying breath;Wherein, the onboard information is n-th of the hardware The hardware information of module;
Unit is modified, for modifying the onboard information, so that initialization of the equipment to be launched in restarting N-th of hardware module is skipped in journey.
In a possible embodiment, whether n-th of hardware module is judged in equipment to be launched in the judging unit When failure, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is for characterizing described n-th A hardware module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
When the judging unit judges whether n-th of hardware module breaks down in equipment to be launched, it is specifically used for:
Judge according to the first instruction information that the equipment to be launched is sent whether is received in predetermined time period Whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate n-th of hardware module It initializes successfully.
In a possible embodiment, it is used to receive the first finger that the equipment to be launched is sent in the judging unit Show that information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is used to indicate described wait open M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, and m is whole greater than zero Number, m-th of RANK are any one in the M RANK that n-th of DIMM includes, and M is whole more than or equal to 2 Number;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing N-th of business board of dynamic equipment breaks down;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing N-th of exchange chip of dynamic equipment breaks down.
In a possible embodiment, the onboard information is modified in the modification unit, to be launched is set so that described For when skipping n-th of hardware module in the initialization procedure of restarting, it is specifically used for:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in restarting M-th of RANK of n-th of DIMM is skipped in initialization procedure;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in restarting N-th of business board is skipped in initialization procedure;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in restarting N-th of exchange chip is skipped in initialization procedure.
In a possible embodiment, the microcontroller further include:
Receiving unit, the second indication information sent for receiving the equipment to be launched, the second indication information are used Restore normal in characterizing n-th of hardware module;
Correspondingly, the modification unit is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched is restarted Successively initialize N number of identical hardware module.
Fourth aspect, the application provide a kind of equipment to be launched, comprising:
Initialization unit, for initializing to N number of identical hardware module that the equipment to be launched includes, N is Integer more than or equal to 2;
Transmission unit, when initializing failure for n-th of hardware module in N number of identical hardware module, to Microcontroller sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with Indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is opening again N-th of hardware module is skipped in dynamic initialization procedure;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the first instruction information is sent to microcontroller in the transmission unit, described the When one instruction information breaks down for characterizing n-th of hardware module, it is specifically used for:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched M-th of memory modules RANK of n dual inline memory module DIMM breaks down, and m-th of RANK is described n-th Any one in the M RANK that DIMM includes, M are the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched N business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched N exchange chip breaks down.
In a possible embodiment, the equipment to be launched further include:
Detection unit, for detecting whether n-th of hardware module is restored in the equipment to be launched restarting Normally;
Correspondingly, the transmission unit is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller transmission second indication information, described second Instruction information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched is restarted Successively initialize N number of identical hardware module.
5th aspect, the application provides a kind of fault isolation system, including first aspect, any possibility of first aspect Design and second aspect, the microcontroller in any possible design of second aspect and equipment to be launched.
6th aspect, the application provides a kind of computer storage medium, for being stored as described in the above-mentioned third aspect Computer software instructions used in equipment to be launched described in microcontroller or fourth aspect, and include for executing above-mentioned It on the one hand, is the program and second aspect, second party of microcontroller design in the possible embodiment of any one of first aspect It is program designed by equipment to be launched in any one embodiment in face.
In the embodiment of the present application, determine that n-th of hardware module there are when failure, then controls equipment to be launched in microcontroller On other modules in addition to storage medium where the onboard information of n-th hardware module power supply under electricity, and modify n-th of hardware The purpose of n-th of hardware module is isolated to realize for the onboard information of module, to guarantee when n-th of hardware module breaks down Also it can complete to initialize, and then can be improved the robustness of system.
On the other hand, the application be controlled by microcontroller the power supply that equipment to be launched includes each module power-on and power-off and The isolation that faulty hardware module is realized to the modification of the onboard information of the hardware device of failure, it is complicated to avoid passing through design Software realization faulty hardware equipment is evaded, to simplify the process of Fault Isolation.
Detailed description of the invention
Fig. 1 is a kind of schematic diagram for application scenarios that one embodiment of the application provides;
Fig. 2 be Fig. 1 provided by the present application in equipment to be launched be server schematic diagram;
Fig. 3 be Fig. 1 provided by the present application in equipment to be launched be rack-mount server schematic diagram;
Fig. 4 be Fig. 1 provided by the present application in equipment to be launched be single server schematic diagram;
Fig. 5 is a kind of flow diagram of failure separation method provided by the embodiments of the present application;
Fig. 6 is that process when equipment to be launched is server in a kind of failure separation method provided by the embodiments of the present application is shown It is intended to;
Fig. 7 is the structural schematic diagram of microcontroller provided by the embodiments of the present application;
Fig. 8 is the structural schematic diagram of equipment to be launched provided by the present application.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application is described.
Hereinafter, the part term in the embodiment of the present application is explained, in order to those skilled in the art understand that.
(1), equipment to be launched can be the equipment including multiple same hardwares.Such as server, it may include more A memory;Router or frame type exchanger, may include management board and multiple business boards, and management board is mainly responsible for whole The management work of machine carries out initial configuration to business board card, and the working condition of monitoring business board is with the presence or absence of abnormal, industry Business board mainly provides various interfaces, provides Ethernet (Gigabit Ethernet, GE) interface, Sonet/SDH as example On data packet (Packet OverSonet/SDH, POS) interface, channelizing POS (Channelized POS, CPOS) interface, Asynchronous transfer mode (Asynchronous Transfer Mode, ATM) interface etc., to carry various specific business;Separate unit Interchanger may include multiple exchange chips, wherein exchange chip can be MAC chip.Here it will be understood that equipment to be launched Be also possible to include multiple same hardwares modularized equipment.
(2) serial sensing (Serial Presence Detect, SPD) is one group of configuration information about memory modules, Be generally stored inside one on memory bar using small outline integrated circuit (Small Outline Integrated Circuit, SOIC) Erarable Programmable Read only Memory (the Erasable Programmable Read-Only of 8 needles encapsulated Memory, EEPROM) in chip, capacity is 256 bytes.Many important informations of memory, such as memory are had recorded in SPD chip Chip and the parameters such as Mo Zu manufacturer, working frequency, operating voltage, speed, capacity, voltage and row, column address bandwidth.
(3) Channel, the Memory control that can be central processing unit (Central Processing Unit, CPU) are logical Road, a CPU can have multiple Channel, a Channel that can correspond to multiple dual inline memory module (Dual Inline Memory Modules, DIMM), wherein there can be multiple memory modules RANK on a DIMM.
RANK, Memory Controller Hub control CPU every time can only with memory carry out one group of 64 bit (bit) data exchange, one The bandwidth of a 64bit is exactly a RANK, i.e. 1RANK=64bit.
Each RANK, which includes memory grain 1 to memory grain M, M, can be 8,16 or for other numerical value.
(4) the terms "and/or", only a kind of incidence relation for describing affiliated partner, indicates may exist three Kind relationship, for example, A and/or B, can indicate: individualism A exists simultaneously A and B, these three situations of individualism B.In addition, Character "/" herein typicallys represent the relationship that forward-backward correlation object is a kind of "or" unless otherwise specified.And in the application reality It applies in the description of example, the vocabulary such as " first ", " second ", is only used for distinguishing the purpose of description, be not understood to indicate or imply Relative importance can not be interpreted as indication or suggestion sequence.
It referring to Figure 1, is a kind of application scenarios of the embodiment of the present application.It include: microcontroller 100, equipment to be launched 101.Microcontroller 100 is by spare (standby) power supply power supply, and corresponding microcontroller 100 is to each mould in equipment 101 to be launched The power supply power supply of block, after microcontroller 100 is the power supply electrifying for each module that equipment 101 to be launched includes, equipment to be launched In control module (such as CPU) treat the identical hardware module that starting device includes and initialized, it is to be launched to start Equipment.
Microcontroller 100 can be STM32 chip, a similar single-chip microcontroller.
Equipment 101 to be launched can be server, specifically refer to Fig. 2, and server includes control module, Memory Controller Hub And memory.Wherein, control module can be by CPU and basic input output system (Basic Input Output System, BIOS) or Bootloader composition, CPU may include P Memory control channel (Channel);Memory is by multiple Memory bar composition, by taking Channel1 as an example, Channel1 can correspond to DIMM 1 to DIMM N.One DIMM may include M RANK.Wherein, under normal circumstances, the number of Channel of numerical value and CPU of N is identical, and M is 2 or 4.The EEPROM of each DIMM In be stored with the SPD information of the DIMM.Further include chip selection signal (Chip Select, CS) in Fig. 2, is wrapped on DIMM for selecting Which of RANK included and address signal and data-signal.
Fig. 3 is referred to, equipment 101 to be launched can be frame type exchanger, configured with management board and multiple business boards Card.Management board is connect with multiple business boards, includes the storage medium for memory plane information carrying breath on each business board.Its In, the onboard information of business board includes but is not limited to: the performance number of board, board type, board hardware state.
Under the scene, microcontroller is also possible to be integrated in management board, that is to say, that microcontroller and management Board is integrated in one.
Fig. 4 is referred to, equipment 101 to be launched can be separate unit interchanger, including multiple exchange chips, in addition, separate unit is handed over It further include the storage medium for memory plane information carrying breath on changing planes.Wherein, the onboard information of exchange chip includes but is not limited to: handing over Change the power of chip, the power of interchanger, exchange chip quantity, the hardware signal of separate unit interchanger, separate unit exchange hardware shape State etc..
Technical solution provided by the embodiments of the present application is introduced with reference to the accompanying drawing, during following introduction, incite somebody to action this For the technical solution that application provides is applied in application scenarios shown in Fig. 1.
Fig. 5 is referred to, one embodiment of the application provides a kind of failure separation method, and the process of this method is described as follows:
S501: equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, and N is Integer more than or equal to 2;
In the embodiment of the present application, as shown in figure 2 above-Fig. 4, equipment to be launched can be server, frame type exchanger Or separate unit interchanger.When equipment to be launched is different type equipment, N number of identical hardware module that equipment to be launched includes It is not identical.Such as when equipment to be launched is server, N number of identical hardware module can be memory, such as shown in Fig. 2 N number of DIMM;When equipment to be launched is frame type exchanger, N number of identical hardware module can be business board;To be launched When equipment is separate unit interchanger, N number of identical hardware module can be exchange chip.
When equipment to be launched is different type equipment, initialization procedure when starting is slightly different, then distinguishes below It describes in detail.
It in the embodiment of the present application, is first device power to be launched by microcontroller before equipment to be launched starting, Microcontroller is the power supply electrifying for the storage medium of memory plane information carrying breath first, then to deposit in equipment to be launched except above-mentioned The power supply electrifying of other modules outside storage media.
If microcontroller first controls the power supply electrifying of SPD, then controls the power supply of CPU so that equipment to be launched is server as an example With the power supply electrifying of memory, wherein the electric sequence of the power supply of power supply and memory to CPU is with no restrictions.Correspondingly, servicing After device starting, CPU, which reads the SPD on each DIMM by I2C bus using BIOS or startup program (Boot loader), to be believed Breath, automatically configures corresponding memory working sequence according to SPD information by the Memory control channel of CPU by CPU and posts with control Storage, and self-test is carried out, so as to give full play to the performance of memory bar.
If microcontroller first controls onboard for storing on business board so that equipment to be launched is frame type exchanger as an example The power supply electrifying of the storage medium of information, then control on the power supply of the management power supply of board of frame type exchanger, business board Electricity, wherein with no restrictions to the electric sequence of the power supply of the management power supply of board, business board.Correspondingly, being exchanged in rack When device starts, the onboard information stored in the storage medium that management board card passes through reading business board carries out just business board card Beginningization configuration, and establish corresponding complete machine control management strategy.
If microcontroller elder generation control panel is used for the storage of memory plane information carrying breath so that equipment to be launched is separate unit interchanger as an example The power supply electrifying of medium, then control the power supply electrifying of the power supply of the CPU of single server, exchange chip, wherein to the electricity of CPU Source, exchange chip power supply electric sequence with no restrictions.In the starting of separate unit interchanger, it is integrated in separate unit interchanger CPU reads the onboard information stored in storage medium by I2C bus, initializes to exchange chip.
S502: microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, n-th of the hardware Module is any one in N number of identical hardware module that the equipment to be launched includes.
In the embodiment of the present application, microcontroller judges the realization process whether n-th of hardware module breaks down, packet It includes:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used for N-th of hardware module is characterized to break down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The first finger that the microcontroller is sent according to the equipment to be launched whether is received in predetermined time period Show information to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described the N hardware module initializes successfully.
In the embodiment of the present application, initialization procedure successively is carried out to each hardware module after equipment to be launched starting In, after one hardware module of successful initialization, just then initialize next hardware module.It is being initialised to n-th of hardware When module, if n-th of hardware module initialization failure, equipment to be launched send the first instruction information to microcontroller, first refers to Show and information is used to indicate n-th of hardware module there are failures.In this way, then can when microcontroller receives the first instruction information Enough judge that n-th of hardware module breaks down.
During specific implementation, when equipment to be launched is different type equipment, the content of the first instruction information is also slightly It is different.Specifically, n-th of biserial that the first instruction information is used to characterize server is straight when equipment to be launched is server M-th of RANK for inserting formula memory module DIMM breaks down, and m-th of RANK is the M RANK that n-th of DIMM includes In any one, M is integer more than or equal to 2;When equipment to be launched is frame type exchanger, the first instruction information is used It breaks down in n-th of business board of characterization frame type exchanger;When equipment to be launched is separate unit interchanger, the first instruction N-th of exchange chip that information is used to characterize the separate unit interchanger breaks down.
In the embodiment of the present application, initialization procedure is successively carried out to each hardware module after equipment to be launched starting In, after one hardware module of successful initialization, equipment to be launched can to microcontroller send first instruction information, first Instruction information is used to indicate the hardware module and initializes successfully, just then initializes next hardware module, is being initialised to n-th When a hardware module, if n-th of hardware module initialization failure, equipment to be launched send the first instruction without normal direction microcontroller Information.In this way, microcontroller is not when predetermined time period receives the first instruction information of the transmission of equipment to be launched, then table Bright n-th of hardware module breaks down.Wherein, predetermined time period can be 5 received after one first instruction information Millisecond (ms), 10ms or be other time length, those of ordinary skill in the art can be configured according to actual needs.
During specific implementation, when equipment to be launched is different type equipment, the content of the first instruction information is also slightly It is different.Specifically, the first instruction information is used to indicate m-th of RANK of n-th of DIMM when equipment to be launched is server It initializes successfully;When equipment to be launched is frame type exchanger, it is initial that the first instruction information is used to indicate n-th of business board It is melted into function;When equipment to be launched is separate unit interchanger, the first instruction information is used to indicate n-th of exchange chip and is initialized to Function.
S503: if so, the microcontroller controls in the equipment to be launched except the storage for memory plane information carrying breath is situated between It is electric under the power supply of other modules outside matter;Wherein, the onboard information is the hardware information of n-th of hardware module.
To allow equipment to be launched during initialization to faulty hardware module unaware, then needing N number of hardware module Electricity under the power supply of other modules in starting device in addition to the storage medium ceased for memory plane information carrying is treated, to modify storage Onboard information on medium.During specific implementation, when equipment to be launched is server, microcontroller is then by the electricity of CPU Source, memory power supply under electricity, retain the power supply of SPD;When equipment to be launched is frame type exchanger, microcontroller then will management It is electric under the power supply of board, the power supply of business board, it is preserved for the power supply of the storage medium of memory plane information carrying breath;It is set to be launched It is standby when being separate unit interchanger, the power supply for the CPU that microcontroller then integrates separate unit interchanger, electricity under the power supply of exchange chip, and It is preserved for the power supply of the storage medium of memory plane information carrying breath.It should be understood that onboard information herein refers to n-th of hardware module Onboard information.
S504: the microcontroller modifies the onboard information, so that the equipment to be launched is in the initial of restarting N-th of hardware module is skipped during changing.
During specific implementation, when equipment to be launched is different types of equipment, microcontroller modifies onboard information Content it is also slightly different, will be made below being discussed in detail.
When equipment to be launched is server, there is one piece of storage region in the EEPROM of each DIMM, it is every for storing The number for the RANK for including on a DIMM, by taking n-th of DIMM as an example, if n-th of DIMM includes two RANK, the storage region It is middle to record 11, if DIMM includes four RANK, 1111 are recorded in the storage region.It include below two with n-th of DIMM For RANK, when first RANK on n-th of DIMM breaks down, microcontroller passes through I2C bus for the storage region The SPD information of storage is revised as 01, only includes a RANK to characterize n-th of DIMM, second RANK on n-th of DIMM When failure, then the SPD information that can be stored the storage region is revised as 10, only includes one to characterize n-th of DIMM RANK.In this way, the initialization to n-th of DIMM, m-th of RANK can be skipped in server restarting, so that clothes Business device can still be able to initialize successfully, to improve server when the memory grain of some memory breaks down Robustness.
6 complete descriptions are when equipment to be launched is server with reference to the accompanying drawing, failure separation method provided by the present application. As shown in fig. 6, microcontroller first controls the power supply electrifying of SPD, then controls the power supply and memory of CPU on a microcontroller after electricity Power supply electrifying.After the power supply electrifying of each module in the server, CPU passes through I2C bus using BIOS or Boot loader The SPD information on each DIMM is read, then each DIMM is initialized.Specifically, from first of first DIMM RANK starts, if initializing successfully, successively carries out, normally to start server.During initialization, if n-th of DIMM M-th of RANK initialize failure, it is abnormal that the CPU of server then indicates that m-th of RANK of n-th of DIMM of microcontroller occurs, Microcontroller then controls electricity under the power supply of the power supply of CPU, memory, and retains the power supply of SPD, to modify the SPD letter of n-th of DIMM Breath.After the completion of modification, then control electric under the power supply of SPD.Again to the power supply electrifying of the power supply of the power supply of SPD, CPU, memory, Then DIMM is initialized according to modified SPD information after the completion of powering on.
When equipment to be launched is frame type exchanger, microcontroller is when determining that n-th of business board breaks down, then The hardware state of n-th of business board of modification is wrong (FALL).Again to the failure when to avoid restarting Business board card carries out power distribution and initialization action, causes board system to block or manages plan using the control of mistake Slightly, such as failure board is distributed power to, causes the waste of resource.
When equipment to be launched is separate unit interchanger, microcontroller is then repaired when determining that n-th of exchange chip breaks down The hardware state for changing n-th of exchange chip is mistake, to avoid next time when restarting again to the exchange chip of failure Initialization so that separate unit interchanger part exchange chip work in the case where can also start.
Herein, it should be noted that, can be first to onboard for storing after microcontroller modifies to onboard information It is electric under the power supply of the storage medium of information, in the power supply electrifying to other modules in addition to storage medium, so as to be launched Equipment initializes N number of identical hardware module according to modified onboard information.It is of course also possible to directly to except storage The power supply electrifying of other modules outside medium, so that equipment to be launched is according to modified onboard information to N number of identical Hardware module is initialized.
In the embodiment of the present application, the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is used for It characterizes n-th of hardware module and restores normal;
The microcontroller modifies the board information according to the second indication information, so that the equipment weight to be launched N number of identical hardware module can be successively initialized when new starting.
It is still not initial to avoid when n-th of hardware module of failure restores normal during specific implementation Change, reduce the service efficiency of equipment to be launched, then in each restarting, n-th of hardware module of detection is equipment to be launched It is no to restore normal.When n-th of hardware module restores normal, then second indication information, the second instruction letter are sent to microcontroller Breath restores normal for characterizing n-th of hardware module.
When microcontroller receives the second indication information that equipment to be launched is sent, then onboard information is modified, so that When equipment to be launched restarts, N number of identical hardware module can be successively initialized.
Based on the same inventive concept, Fig. 7 is referred to, the application also provides a kind of microcontroller 700, comprising:
Judging unit 701, for judging whether n-th of hardware module breaks down in equipment to be launched, described n-th hard Part module is any one in N number of identical hardware module that the equipment to be launched includes, and N is whole more than or equal to 2 Number;
Control unit 702, for controlling and being removed in the equipment to be launched when n-th of hardware module breaks down For electricity under the power supply of other modules outside the storage medium of memory plane information carrying breath;Wherein, the onboard information is described n-th The hardware information of hardware module;
Unit 703 is modified, for modifying the onboard information, so that initialization of the equipment to be launched in restarting N-th of hardware module is skipped in the process.
In a possible embodiment, judge that n-th of hardware module is in equipment to be launched in the judging unit 701 When no failure, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is for characterizing described n-th A hardware module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
It is specific to use when the judging unit 701 judges whether n-th of hardware module breaks down in equipment to be launched In:
Judge according to the first instruction information that the equipment to be launched is sent whether is received in predetermined time period Whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate n-th of hardware module It initializes successfully.
In a possible embodiment, it is used to receive that the equipment to be launched sends the in the judging unit 701 One instruction information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is used to indicate described wait open M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, m-th of RANK Any one in M RANK for including for n-th of DIMM, M are the integer more than or equal to 2;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing N-th of business board of dynamic equipment breaks down;Or
The first instruction information that the equipment to be launched is sent is received, the first instruction information is described wait open for characterizing N-th of exchange chip of dynamic equipment breaks down.
In a possible embodiment, the onboard information is modified in the modification unit 703, so that described to be launched When equipment skips n-th of hardware module in the initialization procedure of restarting, it is specifically used for:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in restarting M-th of RANK of n-th of DIMM is skipped in initialization procedure;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in restarting N-th of business board is skipped in initialization procedure;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in restarting N-th of exchange chip is skipped in initialization procedure.
In a possible embodiment, the microcontroller further include:
Receiving unit 704, the second indication information sent for receiving the equipment to be launched, the second indication information Restore normal for characterizing n-th of hardware module;
Correspondingly, the modification unit 703 is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched is restarted Successively initialize N number of identical hardware module.
Based on the same inventive concept, Fig. 8 is referred to, the application provides a kind of equipment to be launched 800, comprising:
Initialization unit 801, for being initialized to N number of identical hardware module that the equipment to be launched includes, N For the integer more than or equal to 2;
Transmission unit 802, when initializing failure for n-th of hardware module in N number of identical hardware module, The first instruction information is sent to microcontroller, the first instruction information breaks down for characterizing n-th of hardware module, To indicate that the microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is again N-th of hardware module is skipped in the initialization procedure of starting;
Wherein, the onboard information be n-th of hardware module hardware information, n-th of hardware module be to Any one in N number of identical hardware module that starting device includes.
In a possible embodiment, the first instruction information is sent to microcontroller in the transmission unit 802, it is described When first instruction information breaks down for characterizing n-th of hardware module, it is specifically used for:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched M-th of memory modules RANK of n dual inline memory module DIMM breaks down, and m-th of RANK is described n-th Any one in the M RANK that DIMM includes, M are the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched N business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize the of the equipment to be launched N exchange chip breaks down.
In a possible embodiment, the equipment to be launched further include:
Detection unit 803, for whether detecting n-th of hardware module in the equipment to be launched restarting Restore normal;
Correspondingly, the transmission unit 802 is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller transmission second indication information, described second Instruction information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched is restarted Successively initialize N number of identical hardware module.
The application also provides a kind of fault isolation system, including the microcontroller 100 as shown in Fig. 7 embodiment and such as Fig. 8 Equipment 101 to be launched shown in embodiment.
The application also provides a kind of computer storage medium, which may include memory, the storage Device can be stored with program, which includes the microcontroller, to be launched as recorded in preceding embodiment of the method shown in fig. 5 when executing Overall Steps performed by equipment.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The application be referring to according to the present processes, equipment (system) and computer program product flow chart and/or Block diagram describes.It should be understood that each process that can be realized by computer program instructions in flowchart and/or the block diagram and/or The combination of process and/or box in box and flowchart and/or the block diagram.It can provide these computer program instructions to arrive General purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices processor to generate one Machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for realizing flowing The device for the function of being specified in journey figure one process or multiple processes and/or block diagrams one box or multiple boxes.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Obviously, those skilled in the art can carry out various modification and variations without departing from the essence of the application to the application Mind and range.In this way, if these modifications and variations of the application belong to the range of the claim of this application and its equivalent technologies Within, then the application is also intended to include these modifications and variations.

Claims (18)

1. a kind of failure separation method characterized by comprising
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, and n-th of hardware module is described Any one in N number of identical hardware module that equipment to be launched includes, N are the integer more than or equal to 2;
If so, the microcontroller control it is other in addition to the storage medium ceased for memory plane information carrying in the equipment to be launched It is electric under the power supply of module;Wherein, the onboard information is the hardware information of n-th of hardware module;
The microcontroller modifies the onboard information, so that the equipment to be launched is jumped in the initialization procedure of restarting Cross n-th of hardware module.
2. the method according to claim 1, wherein microcontroller judges n-th of hardware mould in equipment to be launched Whether block breaks down, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing N-th of hardware module breaks down;
The microcontroller judges whether n-th of hardware module breaks down according to the first instruction information;Or
Microcontroller judges whether n-th of hardware module breaks down in equipment to be launched, comprising:
The microcontroller is believed according to the first instruction that the equipment to be launched is sent whether is received in predetermined time period Breath is to judge whether n-th of hardware module breaks down;Wherein, the first instruction information is used to indicate described n-th Hardware module initializes successfully.
3. according to the method described in claim 2, it is characterized in that, the microcontroller receives what the equipment to be launched was sent First instruction information, the first instruction information break down for characterizing n-th of hardware module, comprising:
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is used to indicate M-th of memory modules RANK of n-th of dual inline memory module DIMM of the equipment to be launched breaks down, and described the M RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing N-th of business board of the equipment to be launched breaks down;Or
The microcontroller receives the first instruction information that the equipment to be launched is sent, and the first instruction information is for characterizing N-th of exchange chip of the equipment to be launched breaks down.
4. according to the method described in claim 3, it is characterized in that, the microcontroller modifies the onboard information, so that institute It states equipment to be launched and skips n-th of hardware module in the initialization procedure of restarting, comprising:
The microcontroller modifies the serial sensing SPD information on n-th of DIMM, so that the equipment to be launched is again M-th of RANK of n-th of DIMM is skipped in the initialization procedure of starting;Or
The hardware state that the microcontroller modifies n-th of business board is mistake, so that the equipment to be launched is in weight N-th of business board is skipped in the initialization procedure newly started;Or
The hardware state that the microcontroller modifies n-th of exchange chip is mistake, so that the equipment to be launched is in weight N-th of exchange chip is skipped in the initialization procedure newly started.
5. method according to claim 1-4, which is characterized in that the method also includes:
The microcontroller receives the second indication information that the equipment to be launched is sent, and the second indication information is for characterizing N-th of hardware module restores normal;
The microcontroller modifies the onboard information according to the second indication information, so that the equipment to be launched opens again N number of identical hardware module can be successively initialized when dynamic.
6. a kind of failure separation method characterized by comprising
Equipment to be launched initializes N number of identical hardware module that the equipment to be launched includes, N be more than or equal to 2 integer;
When n-th of hardware module in N number of identical hardware module initializes failure, the equipment to be launched is to micro-control Device processed sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with instruction The microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is in restarting N-th of hardware module is skipped in initialization procedure;
Wherein, the onboard information is the hardware information of n-th of hardware module, and n-th of hardware module is to be launched Any one in N number of identical hardware module that equipment includes.
7. according to the method described in claim 6, it is characterized in that, the equipment to be launched sends the first instruction to microcontroller Information, the first instruction information break down for characterizing n-th of hardware module, comprising:
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing M-th of memory modules RANK of n-th of dual inline memory module DIMM of dynamic equipment breaks down, m-th of RANK Any one in M RANK for including for n-th of DIMM, M are the integer more than or equal to 2;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing N-th of business board of dynamic equipment breaks down;Or
The equipment to be launched sends the first instruction information to microcontroller, and the first instruction information is described wait open for characterizing N-th of exchange chip of dynamic equipment breaks down.
8. method according to claim 6 or 7, which is characterized in that the method also includes:
In the equipment to be launched restarting, the equipment to be launched detects whether n-th of hardware module is restored just Often;
When n-th of hardware module restores normal, the equipment to be launched sends the second instruction letter to the microcontroller Breath, the second indication information is used to indicate the microcontroller and modifies the onboard information, so that the equipment weight to be launched N number of identical hardware module can be successively initialized when new starting.
9. a kind of microcontroller characterized by comprising
Judging unit, for judging whether n-th of hardware module breaks down in equipment to be launched, n-th of hardware module Any one in the N number of identical hardware module for including for the equipment to be launched, N are the integer more than or equal to 2;
Control unit, for controlling in the equipment to be launched except for storing when n-th of hardware module breaks down It is electric under the power supply of other modules outside the storage medium of onboard information;Wherein, the onboard information is n-th of hardware module Hardware information;
Unit is modified, for modifying the onboard information, so that the equipment to be launched is in the initialization procedure of restarting Skip n-th of hardware module.
10. microcontroller according to claim 9, which is characterized in that judge in equipment to be launched in the judging unit When whether n-th of hardware module breaks down, it is specifically used for:
The first instruction information that the equipment to be launched is sent is received, the first instruction information is hard for characterizing described n-th Part module breaks down;
According to the first instruction information, judge whether n-th of hardware module breaks down;Or
When the judging unit judges whether n-th of hardware module breaks down in equipment to be launched, it is specifically used for:
Whether according to receiving in predetermined time period, the first instruction information that the equipment to be launched is sent is described to judge Whether n-th of hardware module breaks down;Wherein, it is initial to be used to indicate n-th of hardware module for the first instruction information It is melted into function.
11. microcontroller according to claim 10, which is characterized in that described wait open for receiving in the judging unit The first instruction information that dynamic equipment is sent, when the first instruction information breaks down for characterizing n-th of hardware module, It is specifically used for:
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is used to indicate described to be launched set M-th of memory modules RANK of n-th standby of dual inline memory module DIMM breaks down, and m is the integer greater than zero, institute Any one in the M RANK that m-th of RANK includes for n-th of DIMM is stated, M is the integer more than or equal to 2;Or
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is for characterizing described to be launched set N-th standby of business board breaks down;Or
Receive the first instruction information that the equipment to be launched is sent, the first instruction information is for characterizing described to be launched set N-th standby of exchange chip breaks down.
12. microcontroller according to claim 11, which is characterized in that modify the onboard letter in the modification unit Breath, it is specific to use when so that the equipment to be launched skipping n-th of hardware module in the initialization procedure of restarting In:
The serial sensing SPD information on n-th of DIMM is modified, so that the equipment to be launched is in the initial of restarting M-th of RANK of n-th of DIMM is skipped during changing;Or
The hardware state for modifying n-th of business board is mistake, so that the equipment to be launched is in the initial of restarting N-th of business board is skipped during changing;Or
The hardware state for modifying n-th of exchange chip is mistake, so that the equipment to be launched is in the initial of restarting N-th of exchange chip is skipped during changing.
13. according to the described in any item microcontrollers of claim 9-12, which is characterized in that the microcontroller further include:
Receiving unit, the second indication information sent for receiving the equipment to be launched, the second indication information are used for table It levies n-th of hardware module and restores normal;
Correspondingly, the modification unit is also used to:
The onboard information is modified according to the second indication information, when the equipment to be launched restarted successively Initialize N number of identical hardware module.
14. a kind of equipment to be launched characterized by comprising
Initialization unit, for being initialized to N number of identical hardware module that the equipment to be launched includes, N be greater than Or the integer equal to 2;
Transmission unit, when initializing failure for n-th of hardware module in N number of identical hardware module, to micro-control Device processed sends the first instruction information, and the first instruction information breaks down for characterizing n-th of hardware module, with instruction The microcontroller modifies the onboard information of n-th of hardware module, so that the equipment to be launched is in restarting N-th of hardware module is skipped in initialization procedure;
Wherein, the onboard information is the hardware information of n-th of hardware module, and n-th of hardware module is to be launched Any one in N number of identical hardware module that equipment includes.
15. equipment according to claim 14, which is characterized in that send first to microcontroller in the transmission unit and refer to Show that information is specifically used for when the first instruction information breaks down for characterizing n-th of hardware module:
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched M-th of memory modules RANK of dual inline memory module DIMM breaks down, and m is the integer greater than zero, and described m-th RANK is any one in the M RANK that n-th of DIMM includes, and M is the integer more than or equal to 2;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched Business board breaks down;Or
The first instruction information is sent to microcontroller, the first instruction information is used to characterize n-th of the equipment to be launched Exchange chip breaks down.
16. equipment according to claim 14 or 15, which is characterized in that the equipment to be launched further include:
Detection unit, for detecting whether n-th of hardware module is restored just in the equipment to be launched restarting Often;
Correspondingly, the transmission unit is also used to:
When n-th of hardware module restores normal, Xiang Suoshu microcontroller sends second indication information, second instruction Information is used to indicate the microcontroller and modifies the onboard information, when the equipment to be launched restarted successively Initialize N number of identical hardware module.
17. a kind of computer storage medium, which is characterized in that the computer storage medium is stored with instruction, works as described instruction When running on computers, so that the computer executes such as the described in any item methods of claim 1-5 or 6-8.
18. a kind of fault isolation system, which is characterized in that including such as described in any item microcontrollers of claim 9-13 and such as The described in any item equipment to be launched of claim 14-16.
CN201910452699.8A 2019-05-28 2019-05-28 A kind of failure separation method, equipment and fault isolation system Pending CN110187994A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910452699.8A CN110187994A (en) 2019-05-28 2019-05-28 A kind of failure separation method, equipment and fault isolation system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910452699.8A CN110187994A (en) 2019-05-28 2019-05-28 A kind of failure separation method, equipment and fault isolation system

Publications (1)

Publication Number Publication Date
CN110187994A true CN110187994A (en) 2019-08-30

Family

ID=67718274

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910452699.8A Pending CN110187994A (en) 2019-05-28 2019-05-28 A kind of failure separation method, equipment and fault isolation system

Country Status (1)

Country Link
CN (1) CN110187994A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111782434A (en) * 2020-06-30 2020-10-16 联想(北京)有限公司 Abnormity detection method and device and electronic equipment
CN111813731A (en) * 2020-06-11 2020-10-23 中国长城科技集团股份有限公司 Method, device, server and medium for reading memory information

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6463550B1 (en) * 1998-06-04 2002-10-08 Compaq Information Technologies Group, L.P. Computer system implementing fault detection and isolation using unique identification codes stored in non-volatile memory
JP2012190128A (en) * 2011-03-09 2012-10-04 Nec Corp Fast startup method, fast startup device and fast startup program against degeneration of hardware configuration
WO2016106965A1 (en) * 2014-12-31 2016-07-07 中兴通讯股份有限公司 Server self-healing method and device
WO2017063505A1 (en) * 2015-10-16 2017-04-20 中兴通讯股份有限公司 Method for detecting hardware fault of server, apparatus thereof, and server
CN109086151A (en) * 2017-06-13 2018-12-25 中兴通讯股份有限公司 The method and device of memory failure is isolated on a kind of server
CN109151144A (en) * 2018-08-14 2019-01-04 深圳市广和通无线通信软件有限公司 Method of hardware management, device, system, computer equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6463550B1 (en) * 1998-06-04 2002-10-08 Compaq Information Technologies Group, L.P. Computer system implementing fault detection and isolation using unique identification codes stored in non-volatile memory
JP2012190128A (en) * 2011-03-09 2012-10-04 Nec Corp Fast startup method, fast startup device and fast startup program against degeneration of hardware configuration
WO2016106965A1 (en) * 2014-12-31 2016-07-07 中兴通讯股份有限公司 Server self-healing method and device
WO2017063505A1 (en) * 2015-10-16 2017-04-20 中兴通讯股份有限公司 Method for detecting hardware fault of server, apparatus thereof, and server
CN109086151A (en) * 2017-06-13 2018-12-25 中兴通讯股份有限公司 The method and device of memory failure is isolated on a kind of server
CN109151144A (en) * 2018-08-14 2019-01-04 深圳市广和通无线通信软件有限公司 Method of hardware management, device, system, computer equipment and storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111813731A (en) * 2020-06-11 2020-10-23 中国长城科技集团股份有限公司 Method, device, server and medium for reading memory information
CN111813731B (en) * 2020-06-11 2022-10-25 中国长城科技集团股份有限公司 Method, device, server and medium for reading memory information
CN111782434A (en) * 2020-06-30 2020-10-16 联想(北京)有限公司 Abnormity detection method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN102141940B (en) Device, method and system for recovering firmware update
CN104750510A (en) Chip start method and multi-core processor chip
US20060010282A1 (en) Method and apparatus to boot a system by monitoring an operating status of a NAND flash memory
US20130246855A1 (en) Error Location Specification Method, Error Location Specification Apparatus and Computer-Readable Recording Medium in Which Error Location Specification Program is Recorded
CN110187994A (en) A kind of failure separation method, equipment and fault isolation system
CN103164316B (en) Hardware monitor
CN108334372A (en) firmware upgrade processing method, device and system
CN113190405B (en) Node health detection method and device, electronic equipment and storage medium
CN111338698A (en) Method and system for accurately booting server by BIOS (basic input output System)
CN106681877B (en) Chip debugging system and method and system chip
CN100445953C (en) Method and device for distinguish cool start and hot start
CN112199240B (en) Method for switching nodes during node failure and related equipment
US20060173655A1 (en) Information processing apparatus, failure monitoring method and failure monitoring computer program
CN110286852A (en) Dual control framework distributed memory system, method for reading data, device and storage medium
JPH1091289A (en) Memory initialization device and method
CN108897646A (en) A kind of switching method and baseboard management controller of BIOS chip
US20200409813A1 (en) System and Method to Derive Health Information for a General Purpose Processing Unit Through Aggregation of Board Parameters
CN109634796A (en) A kind of method for diagnosing faults of computer, apparatus and system
CN109189638A (en) A kind of GPU driving detection method, device, terminal and storage medium
US8069364B2 (en) Method to recover from logical path failures
TW201328248A (en) Method for updating system firmware and server system using the same
CN101158920A (en) Method and apparatus for detecting fault of operating system
CN101739261B (en) Switching system for basic input and output system and switching method thereof
CN113742198B (en) Test method, test device, electronic equipment and storage medium
CN101311909A (en) System peculiarity diagnose method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination