CN104657166B - Server system and node replacement method
- Publication number: CN104657166B
- Application number: CN201310597425.0A
- Authority: CN (China)
- Keywords: node, server system, hardware, preset time, module
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abstract
A node replacement method for a server system. The method proceeds as follows. Detect whether a node has been inserted into the server system, and generate a prompt signal when the insertion is detected; the prompt signal indicates that the node must not be removed from the server system. Detect a first identification code of the node and first hardware configuration information of the hardware in the node. Based on the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information recorded before the node was inserted into the server system, judge whether the node or the hardware has been replaced. If not, power off the node to perform a node replacement procedure. If so, install at least one of an operating system, software package data, and firmware package data on the node.
Description
Technical field
The present invention relates to a server system (for example, a container-type data center) and a node replacement method, and in particular to a server system and node replacement method that allow a node replacement procedure to be carried out quickly.
Background art
With the development of technology, computers all over the world can be linked to one another through the Internet. Through a network connection, one computer can exchange data with, and access data on, another computer. In a client-server architecture, the client and the server communicate over the network.
In general, a server system may be configured with multiple nodes, and each node may run multiple virtual machines (VMs) at the same time to give each user an independent operating environment. Each node can be regarded as an independent computer; that is, each node has its own memory, storage, computing capability, and network connectivity. Each node can therefore run its own operating system, and the nodes can communicate and transfer data with one another through network equipment.
After a server system has been set up, the nodes in it must be deployed; that is, the operating system, software package data, and firmware package data that each node requires must be installed so that the server system can operate and provide services to users. However, when the hardware of any node in the server system is damaged, the corresponding node can no longer operate normally. Because inspection personnel cannot tell precisely which hardware in which node is damaged, they can only remove and reinsert the nodes one by one to test them, which wastes inspection time.
Summary of the invention
The technical problem to be solved by the invention is to provide a server system and a node replacement method that can automatically judge, from the identification code of a node and the hardware configuration information of the hardware in the node, whether a specific node or specific hardware in that node needs to be replaced, so that inspection personnel can replace nodes quickly and easily.
To achieve the above object, the invention provides a node replacement method applicable to a server system. The steps of the method are as follows. Detect whether a node has been inserted into the server system, and generate a first prompt signal when the insertion is detected, where the first prompt signal indicates that the node must not be removed from the server system. Detect a first identification code of the node and first hardware configuration information of the hardware in the node. Based on the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information recorded before the node was inserted into the server system, judge whether the node or the hardware in the node has been replaced. If neither the node nor the hardware in the node has been replaced, power off the node to perform a node replacement procedure. If the node or the hardware in the node has been replaced, install at least one of an operating system, software package data, and firmware package data on the node.
In one embodiment, after the step of installing at least one of the operating system, the software package data, and the firmware package data on the node, the method further includes the following steps. Continuously monitor the condition of the hardware in the node to judge whether a hardware error has occurred. If the hardware produces an uncorrectable error, power off the node to perform the node replacement procedure. If the number of correctable errors produced by the hardware reaches a preset threshold, shut the node down normally and then perform the node replacement procedure.
In one embodiment, the node replacement procedure includes the following steps. Set the node to an initial mode. Generate a second prompt signal, where the second prompt signal indicates that the node may be removed from the server system. Detect whether the node has been removed from the server system. Detect whether the node or another node has been inserted into the server system. If the node or another node is detected to have been inserted into the server system, generate the first prompt signal and continue with the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node, and the steps that follow it.
Continuing the above embodiment, the step of detecting whether the node has been removed from the server system further includes the following steps. Set a first preset time and start timing. Judge whether the node has been removed from the server system. If the node has not yet been removed, reset the first preset time and restart timing. If the node has been removed and the first preset time has elapsed, perform the step of detecting whether the node or another node has been inserted into the server system, and the steps that follow it.
Continuing the above embodiment, the step of detecting whether the node or another node has been inserted into the server system further includes the following steps. Set a first preset time and a second preset time and start timing, where the second preset time immediately follows the first preset time. Judge whether the node has been inserted into the server system. If the node has not yet been inserted, reset the first preset time and restart timing. If the node has been inserted and the first preset time has elapsed, then judge whether the node remains inserted in the server system throughout the second preset time. If the node is removed from the server system within the second preset time, continue with the step of setting the node to the initial mode and the steps that follow it. If the node has still not been removed from the server system after the second preset time has elapsed, generate the first prompt signal and continue with the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node, and the steps that follow it.
To better achieve the above object, the invention also provides a server system that includes a node, a detection module, a prompt module, and a processing module. The node has hardware. The detection module is communicatively connected to the node; it detects whether the node has been inserted into or removed from the server system, and detects the first identification code of the node and the first hardware configuration information of the hardware. The prompt module is communicatively connected to the detection module and generates a first prompt signal when the detection module detects that the node has been inserted into the server system. The first prompt signal indicates that the node must not be removed from the server system. The processing module is communicatively connected between the detection module and the node. Based on the first identification code and the first hardware configuration information, together with the second identification code and second hardware configuration information recorded before the node was inserted into the server system, the processing module judges whether the node or the hardware has been replaced. If the processing module judges that neither the node nor the hardware has been replaced, it powers off the node to perform the node replacement procedure; if it judges that the node or the hardware has been replaced, it installs at least one of the operating system, software package data, and firmware package data on the node.
In one embodiment, after the processing module installs at least one of the operating system, software package data, and firmware package data on the node, the processing module also continuously judges whether a hardware error has occurred. If the processing module judges that the hardware has produced an uncorrectable error, it powers off the node to perform the node replacement procedure. If the processing module judges that the number of correctable errors produced by the hardware has reached a preset threshold, it shuts the node down normally and then performs the node replacement procedure.
In one embodiment, when the server system performs the node replacement procedure, the processing module sets the node to the initial mode. The prompt module then generates a second prompt signal, which indicates that the node may be removed from the server system. In addition, the detection module detects whether the node has been removed from the server system and, after detecting the removal, goes on to detect whether the node or another node has been inserted into the server system. If the detection module detects that the node or another node has been inserted into the server system, the prompt module generates the first prompt signal, and the detection module continues with detecting the first identification code of the node and the first hardware configuration information of the hardware, and with the processing that follows.
Continuing the above embodiment, the server system further includes a timing module communicatively connected to the detection module. When the detection module detects whether the node has been removed from the server system, the timing module sets a first preset time and starts timing. If the detection module detects that the node has not been removed from the server system within the first preset time, the timing module resets the first preset time and restarts timing. If the detection module detects that the node has been removed from the server system and the first preset time has elapsed, the detection module goes on to detect whether the node or another node has been inserted into the server system, and continues with the processing that follows.
Continuing the above embodiment, the server system further includes a timing module communicatively connected to the detection module. When the detection module detects whether the node or another node has been inserted into the server system, the timing module sets a first preset time and a second preset time and starts timing, where the second preset time immediately follows the first preset time. If the detection module detects that the node has not yet been inserted into the server system within the first preset time, the timing module resets the first preset time and restarts timing. If the detection module detects that the node has been inserted into the server system and the first preset time has elapsed, it goes on to detect whether the node remains inserted in the server system throughout the second preset time. If the detection module detects that the node has been removed from the server system within the second preset time, processing continues with the processing module setting the node to the initial mode. If the detection module detects that the node has still not been removed from the server system after the second preset time has elapsed, the prompt module generates the first prompt signal, and the detection module continues with detecting the first identification code of the node and the first hardware configuration information of the hardware, and with the processing that follows.
The technical effects of the invention are as follows:
The server system and node replacement method of the invention detect the identification code of a node and the hardware configuration information of the hardware in the node to judge whether the node or the hardware in the node has been replaced, and then selectively perform the node replacement procedure or install an operating system, software package data, or firmware package data on the node. In addition, after the operating system, software package data, or firmware package data has been installed, the server system and node replacement method can continuously monitor the condition of the hardware in the node and, when the hardware produces an error, use a prompt signal to let inspection personnel know that the node replacement procedure can be performed.
The invention is described in detail below with reference to the drawings and specific embodiments, which are not intended to limit the invention.
Brief description of the drawings
Fig. 1 is a functional block diagram of a server system according to an embodiment of the invention;
Fig. 2A is a flowchart of the node replacement method of a server system according to an embodiment of the invention;
Fig. 2B is a flowchart of the node replacement method of a server system according to another embodiment of the invention;
Fig. 3 is a flowchart of the node replacement procedure according to an embodiment of the invention;
Fig. 4 is a detailed flowchart of step S304 in Fig. 3;
Fig. 5 is a detailed flowchart of step S306 in Fig. 3.
Reference numerals:
1 server system
10 node
12 detection module
14 prompt module
16 processing module
18 timing module
S200~S214, S300~S308, S400~S404, S500~S506 steps
Detailed description of the embodiments
The structural and operating principles of the invention are described in detail below with reference to the drawings:
Please refer to Fig. 1, which is a functional block diagram of a server system according to an embodiment of the invention. As shown in Fig. 1, the server system 1 includes a node 10, a detection module 12, a prompt module 14, a processing module 16, and a timing module 18. The detection module 12 is communicatively connected to the node 10, the prompt module 14, the processing module 16, and the timing module 18, and the node 10 is also communicatively connected to the processing module 16. The communication connections described here may be implemented as physical connections or as wireless connections; the invention places no limitation on this. In practice, the server system 1 may be a container data center, but is not limited to this. The functional modules of the server system 1 are described in detail below.
The node 10 has at least one piece of hardware, which may include a baseboard management controller (BMC), a network interface controller (NIC, also called a network card), a hard disk drive (HDD), a dual in-line memory module (DIMM), a central processing unit (CPU), and so on, but is not limited to these. Although Fig. 1 shows only one node, the invention places no limitation on the number of nodes in the server system.
The detection module 12 detects whether the node 10 has been inserted into or removed from the server system 1, and detects the first identification code of the node 10 and the first hardware configuration information of the hardware in the node 10. In practice, the identification code of the node 10 may be a universally unique identifier (UUID), but is not limited to this. In general, a UUID is a 16-byte (128-bit) number, usually written in hexadecimal, that gives each node 10 unique identification information. The UUID can be obtained from the UUID field of the SMBIOS (System Management BIOS) Type 1 data structure. The hardware configuration information of the node 10 can be represented by a computed, unique 4-byte hardware signature: during POST (Power-On Self-Test), the BIOS (Basic Input/Output System) collects the hardware configuration information, computes the hardware signature, and stores it in the Hardware Signature field of the ACPI (Advanced Configuration and Power Interface) FACS (Firmware ACPI Control Structure) table. This Hardware Signature field can be used to determine quickly whether the hardware configuration information has changed. Furthermore, whether the node 10 has been inserted or removed can be detected by pinging the NIC of the BMC on the node 10.
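The idea of a 4-byte hardware signature can be illustrated with a minimal Python sketch. The patent does not say how the BIOS computes the signature, so the use of CRC32 over a serialized inventory, the `hardware_signature` function, and the inventory field names are all assumptions made for illustration only.

```python
import json
import zlib


def hardware_signature(inventory: dict) -> bytes:
    """Return a 4-byte signature for a hardware inventory.

    CRC32 is an assumption; the source only says the signature is 4 bytes.
    """
    # Serialize deterministically so equal inventories always give equal bytes.
    canonical = json.dumps(inventory, sort_keys=True).encode("utf-8")
    return zlib.crc32(canonical).to_bytes(4, "big")


# Hypothetical inventories: the second one has a different hard disk.
before = {"cpu": "CPU-A", "dimm": ["DIMM-1", "DIMM-2"], "hdd": "HDD-A"}
after = {"cpu": "CPU-A", "dimm": ["DIMM-1", "DIMM-2"], "hdd": "HDD-B"}

same = hardware_signature(before) == hardware_signature(dict(before))
changed = hardware_signature(before) != hardware_signature(after)
```

Comparing two 4-byte values is much cheaper than comparing full inventories, which is why a single signature field is enough to tell quickly whether the configuration differs.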
The prompt module 14 generates a first prompt signal when the detection module 12 detects that the node 10 has been inserted into the server system 1; the first prompt signal indicates that the node 10 must not be removed from the server system 1. In some cases, the prompt module 14 generates a second prompt signal, which indicates that the node 10 may be removed from the server system 1. In practice, the prompt module 14 may be a display module (for example, an electronic display element such as a light-emitting diode, a display panel, or a seven-segment display) or a sound module (for example, an electronic sound element such as a loudspeaker or a buzzer); the invention places no limitation on this. If the prompt module 14 is a display module, the prompt signal is presented to the user as an image or as light; if the prompt module 14 is a sound module, the prompt signal is presented to the user as sound.
The processing module 16 judges whether the node 10 or the hardware in the node 10 has been replaced, based on the identification code of the node 10 and the hardware configuration information of the hardware in the node 10, together with the second identification code and second hardware configuration information from before the node 10 was inserted into the server system 1. Note that "the second identification code and second hardware configuration information before the node 10 was inserted into the server system 1" means the identification code and hardware configuration information of the node 10 from the previous time it was inserted into the server system 1. If a new node 10 is inserted into the server system, its second identification code and second hardware configuration information are both empty.
The timing module 18 sets at least one preset time and starts timing. In some situations, the timing module 18 resets the timer to zero during timing so that timing restarts.
To explain the actual operation of the server system 1 and the node replacement method more clearly, please refer to Fig. 1 and Fig. 2A together. Fig. 2A is a flowchart of the node replacement method of a server system according to an embodiment of the invention. As shown in Fig. 2A, in step S200 the detection module 12 detects whether the node 10 has been inserted into the server system 1; when the insertion is detected, the prompt module 14 generates a first prompt signal and the method proceeds to step S202. If the detection module 12 does not detect that the node 10 has been inserted into the server system 1, step S200 is repeated until the insertion is detected.
In step S202, the detection module 12 then detects the first identification code of the node 10 and the first hardware configuration information of one piece of hardware in the node 10. In step S204, the processing module 16 judges whether the node 10 or the hardware in the node 10 has been replaced, based on the first identification code and the first hardware configuration information, together with the second identification code and second hardware configuration information from before the node 10 was inserted into the server system 1. If the processing module 16 judges that the node 10 or the hardware in the node 10 has been replaced, step S206 is performed; if it judges that neither the node 10 nor the hardware in the node 10 has been replaced, step S208 is performed. Note that even when neither the node 10 nor its hardware has been replaced, reinstallation of the operating system, software package data, or firmware package data on the node 10 can also be forced (not shown). This applies, for example, to the following situation: when hardware on the node 10 produces a hardware error because of poor contact, the node 10 can be pulled out, the hardware reseated so that it makes good contact, and the node 10 then reinserted into the server system 1.
In step S206, the processing module 16 installs at least one of an operating system (OS), software package data, and firmware package data on the node 10. In step S208, the processing module 16 powers off the node 10 to perform the node replacement procedure.
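The branch taken in steps S204 to S208 can be sketched in a few lines of Python. This is only an illustration under assumptions: the `NodeRecord` type and the `next_step` function are hypothetical names, and the stored record for a node that has never been inserted is modeled as `None` to reflect the "empty" second identification code and configuration described above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class NodeRecord:
    identifier: str            # e.g. the node UUID
    hardware_signature: bytes  # e.g. the 4-byte hardware signature


def next_step(current: NodeRecord, previous: Optional[NodeRecord]) -> str:
    """Return "S206" (install) or "S208" (power off, start replacement)."""
    if previous is None:
        # A brand-new node has no stored record, so it counts as replaced.
        return "S206"
    node_replaced = current.identifier != previous.identifier
    hardware_replaced = current.hardware_signature != previous.hardware_signature
    return "S206" if node_replaced or hardware_replaced else "S208"
```

For example, a node whose identifier and signature both match the stored record yields `"S208"`, while a node with a changed signature yields `"S206"`. The forced reinstallation mentioned in the text is not modeled here.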
Please refer to Fig. 2B, which is a flowchart of the node replacement method of a server system according to another embodiment of the invention. As shown in Fig. 2B, after the step of installing at least one of the operating system, software package data, and firmware package data on the node (step S206), the detection module 12 or a separate monitoring module (not shown) continuously monitors the condition of the hardware in the node so that the processing module 16 can judge whether a hardware error has occurred (step S210). If the detection module 12 or the separate monitoring module detects that the hardware in the node 10 has produced an uncorrectable error, step S212 is performed; if it detects that the number of correctable errors produced by the hardware in the node 10 has reached a preset threshold, step S214 is performed.
In step S212, because the hardware in the node 10 has produced an uncorrectable error, meaning that the node 10 is already damaged and cannot operate normally, the processing module 16 powers off the node 10 to perform the node replacement procedure. In step S214, because the number of correctable errors produced by the hardware in the node 10 has reached the preset threshold (for example, more than 10 correctable errors within one hour), meaning that the node 10 is about to fail and will soon be unable to operate normally, the processing module 16 shuts the node 10 down normally and then performs the node replacement procedure.
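One way to picture the error handling of steps S210 to S214 is a sliding-window counter, sketched below in Python. The `ErrorMonitor` class, its method name, and the choice of a sliding window are assumptions; the source only gives a threshold reached within a period (its example is about 10 errors in one hour) and does not describe how errors are counted.

```python
from collections import deque


class ErrorMonitor:
    """Classify hardware error reports into the actions of steps S212 and S214."""

    def __init__(self, threshold: int = 10, window_seconds: float = 3600.0):
        self.threshold = threshold            # illustrative default
        self.window_seconds = window_seconds  # illustrative default
        self._correctable = deque()           # timestamps of recent correctable errors

    def report(self, timestamp: float, correctable: bool) -> str:
        """Return "S212", "S214", or "continue" for one reported error."""
        if not correctable:
            return "S212"  # uncorrectable error: power off at once
        self._correctable.append(timestamp)
        # Drop errors that have fallen out of the sliding window.
        while timestamp - self._correctable[0] > self.window_seconds:
            self._correctable.popleft()
        if len(self._correctable) >= self.threshold:
            return "S214"  # threshold reached: shut down normally
        return "continue"
```

A caller would feed each error event to `report` and act on the returned step label. Errors spaced further apart than the window never accumulate, so an occasional correctable error does not trigger a shutdown.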
Please refer to Fig. 3, which is a flowchart of the node replacement procedure according to an embodiment of the invention. As shown in Fig. 3, in step S300 the processing module 16 sets the node 10 to the initial mode. In this embodiment, the initial mode is Dynamic Host Configuration Protocol (DHCP) mode. In actual operation, when the processing module 16 judges that the node 10 may be removed from the server system 1, it automatically sets the baseboard management controller of the node 10 back to DHCP mode so that the baseboard management controller obtains a new Internet Protocol address (IP address).
In step S302, the prompt module 14 generates a second prompt signal, which indicates that the node 10 may be removed from the server system 1. In step S304, the detection module 12 detects whether the node 10 has been removed from the server system 1. If the detection module 12 detects that the node 10 has not yet been removed, step S304 continues; if it detects that the node 10 has been removed, step S306 is performed. In step S306, the detection module 12 goes on to detect whether the node 10 or another node has been inserted into the server system 1. If the detection module 12 detects that a node (the node 10 or another node) has been inserted into the server system 1, step S308 is performed; if it detects that no node has yet been inserted, step S306 continues. In step S308, the prompt module 14 generates the first prompt signal, and step S202 is performed next.
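The sequence of steps S300 to S308 can be walked through with a small Python sketch driven by simulated presence readings. The `replacement_procedure` function and its log strings are hypothetical; real detection would come from the detection module rather than from a list of booleans.

```python
def replacement_procedure(presence_samples):
    """Walk S300-S308 over an iterable of booleans (True = a node is inserted)."""
    log = [
        "S300: set BMC to DHCP mode",
        "S302: second prompt signal (node may be removed)",
    ]
    samples = iter(presence_samples)
    # S304: wait until the node has been removed.
    for present in samples:
        if not present:
            break
    else:
        return log  # samples ran out while still waiting for removal
    log.append("S304: node removed")
    # S306: wait until the node or another node is inserted.
    for present in samples:
        if present:
            log.append("S306: node inserted")
            log.append("S308: first prompt signal (do not remove), go to S202")
            break
    return log
```

With the readings `[True, True, False, False, True]`, the log ends at the S308 entry; with readings that never show a removal, the log stops after S302.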
Please refer to Fig. 4, which is a detailed flowchart of step S304 in Fig. 3. As shown in Fig. 4, after the prompt module 14 generates the second prompt signal (step S302), the timing module 18 sets a first preset time (for example, one minute) and starts timing. In step S402, it is judged whether the node 10 has been removed from the server system 1. If the node 10 has not been removed from the server system 1 within the first preset time, step S404 is performed; if the node 10 has been removed from the server system 1 and the first preset time has elapsed, step S306 is performed. In step S404, the timing module 18 resets the first preset time and restarts timing, and the judgment of step S402 is performed again.
In addition, the judgment in step S402 of whether the node 10 has been removed from the server system 1 can be carried out through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention places no limitation on this. For example, removal of the node 10 can be detected by pinging the NIC of the BMC on the node 10. The judgment mechanism of Fig. 4 thus avoids wrongly concluding that the node 10 has been removed from the server system 1 because of an unstable network or poor contact; in other words, the judgment mechanism of Fig. 4 is a debounce mechanism.
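The debounce behavior of Fig. 4 can be sketched in Python as follows. This is one reading of the flow, in which the timer is reset every time the node is still seen as present, and removal is confirmed only after the node has been absent for the whole first preset time. The `removal_confirmed_at` function and the timestamped sample format are assumptions for illustration.

```python
def removal_confirmed_at(samples, first_preset=60.0):
    """Return the timestamp at which removal is confirmed, or None.

    samples: iterable of (timestamp_seconds, present) readings.
    """
    timer_start = None
    for timestamp, present in samples:
        if timer_start is None:
            timer_start = timestamp  # start timing on the first reading
        if present:
            timer_start = timestamp  # S404: reset and restart timing
        elif timestamp - timer_start >= first_preset:
            return timestamp  # absent for the whole preset time: go to S306
    return None


# A brief dropout at t=10 is ignored because the node reappears at t=20.
readings = [(0, True), (10, False), (20, True), (30, False),
            (60, False), (80, False), (90, False)]
confirmed = removal_confirmed_at(readings)
```

In this run the timer last restarts at t=20, so removal is confirmed at the t=80 reading, once 60 seconds of continuous absence have passed.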
Please refer to Fig. 5, which is a detailed flowchart of step S306 in Fig. 3. As shown in Fig. 5, after the step in which the detection module 12 detects whether the node 10 has been removed from the server system 1 (step S304), the timing module 18 sets a first preset time and a second preset time and starts timing, where the second preset time immediately follows the first preset time. For example, the first preset time is the first minute after the timing module 18 starts timing (seconds 0 to 60), and the second preset time is the second minute after the timing module 18 starts timing (seconds 61 to 120). Note that the first preset time and the second preset time may differ in length.
In step S502, it is judged whether the node 10 has been inserted into the server system 1. If the node 10 has not yet been inserted into the server system 1, step S504 is performed; if the node 10 has been inserted into the server system 1, step S506 is performed. In step S504, the timing module 18 resets the first preset time and restarts timing, and the judgment of step S502 is performed again, so the judgment mechanism of steps S502 and S504 is a debounce mechanism. In addition, the judgment performed in step S502 can be carried out through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention places no limitation on this. For example, insertion of the node 10 can be detected by pinging the NIC of the BMC on the node 10.
In step S506, if the node 10 is judged to have been inserted into the server system 1 and the first preset time has elapsed, it is then judged whether the node 10 remains inserted in the server system 1 throughout the second preset time. If the node 10 remains inserted during the second preset time, the node 10 and the position in the server system 1 into which it was inserted are both correct, and step S308 and the steps after it are performed. If the node 10 is removed from the server system 1 within the second preset time, the node 10 or its position in the server system 1 may have been wrong, or the wrong node 10 was inserted and then pulled out; step S300 and the steps after it are then performed so that the correct node 10 can be inserted into the correct position in the server system 1. The judgment mechanism of step S506 is therefore a fool-proofing mechanism against human error.
In addition, the judgment performed in step S506 can be carried out through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention places no limitation on this. For example, whether the node 10 remains inserted can be detected by pinging the NIC of the BMC on the node 10. The judgment mechanism of Fig. 5 thus not only avoids wrongly concluding that the node 10 has been inserted into the server system 1 because of an unstable network or poor contact, but also gives the user a chance, after inserting the wrong node, to pull that node out and insert the correct one. In other words, the judgment mechanism of Fig. 5 combines a debounce mechanism with a fool-proofing mechanism.
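The two-phase check of Fig. 5 can be sketched in Python as a debounce window followed by a fool-proofing window. This is an interpretation under assumptions: the `insertion_outcome` function, the sample format, and the rule that the second window starts at the reading where the first preset time is satisfied are illustrative choices, not details given in the source.

```python
def insertion_outcome(samples, first_preset=60.0, second_preset=60.0):
    """Return "S308", "S300", or None (undecided) for a run of readings.

    samples: iterable of (timestamp_seconds, present) readings.
    """
    timer_start = None
    stable_since = None  # reading at which the first preset time was satisfied
    for timestamp, present in samples:
        if timer_start is None:
            timer_start = timestamp
        if stable_since is None:
            if not present:
                timer_start = timestamp  # S504: reset and restart timing
            elif timestamp - timer_start >= first_preset:
                stable_since = timestamp  # S506: second window begins
        else:
            if not present:
                return "S300"  # pulled out again: restart the procedure
            if timestamp - stable_since >= second_preset:
                return "S308"  # stayed inserted: accept the node
    return None
```

For instance, readings that show the node present from t=10 through t=130 return `"S308"`, whereas a run in which the node disappears at t=90, during the second window, returns `"S300"`.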
In summary, the server system and node replacement method provided by the embodiments of the invention detect the identification code of a node and the hardware configuration information of the hardware in the node to judge whether the node or the hardware in the node has been replaced, or whether a new node has been added, and then selectively perform the node replacement procedure or install an operating system, software package data, or firmware package data on the node. Even when neither the node nor the hardware has been replaced, reinstallation of the operating system, software package data, or firmware package data on the node can be forced. In addition, after the operating system, software package data, or firmware package data has been installed on the node, the server system and node replacement method can continuously monitor the condition of the hardware in the node and, when the hardware produces an error, use a prompt signal to let the user know that the node replacement procedure can be performed. The server system and node replacement method of the invention can thus automatically handle the judgment of whether a node needs to be replaced; the user only needs to insert the node into, or remove it from, the server system according to the prompt signals, without performing any other inspection procedure, which makes the invention highly practical.
Of course, the invention may have various other embodiments. Without departing from the spirit and essence of the invention, those skilled in the art can make various corresponding changes and modifications according to the invention, but all such changes and modifications shall fall within the protection scope of the appended claims.
Claims (10)
1. A node replacement method, applicable to a server system, characterized in that the node replacement method comprises:
detecting whether a node is inserted into the server system, and generating a first prompt signal when the node is detected to be inserted into the server system, the first prompt signal indicating that the node must not be removed from the server system;
detecting a first identification code of the node and first hardware configuration information of hardware in the node;
determining, according to the first identification code and the first hardware configuration information, and according to a second identification code and second hardware configuration information from before the node was inserted into the server system, whether the node or the hardware in the node has been replaced;
if it is determined that neither the node nor the hardware in the node has been replaced, turning off the power of the node in order to perform a node replacement procedure; and
if it is determined that the node or the hardware in the node has been replaced, installing on the node at least one of an operating system, software package data, and firmware package data.
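For illustration only (this is not part of the claims), the comparison recited in claim 1 can be sketched in Python. The names `NodeSnapshot` and `decide_action`, the returned action strings, and the choice of a string and a dictionary to hold the identification code and hardware configuration information are all assumptions made for this sketch; the patent does not prescribe any data format.

```python
from dataclasses import dataclass, field


@dataclass
class NodeSnapshot:
    """Hypothetical record of what the detection step reads from a node."""
    identification_code: str
    hardware_config: dict = field(default_factory=dict)


def decide_action(current: NodeSnapshot, previous: NodeSnapshot) -> str:
    """Compare the first (current) snapshot with the second (pre-insertion) one.

    Returns "install" when the node or its hardware has been replaced,
    otherwise "power_off_and_replace", mirroring the two branches of claim 1.
    """
    node_replaced = current.identification_code != previous.identification_code
    hardware_replaced = current.hardware_config != previous.hardware_config
    if node_replaced or hardware_replaced:
        return "install"
    return "power_off_and_replace"
```

In this sketch, a node that comes back with the same identification code and the same hardware configuration is treated as not yet replaced, so it is powered off and sent back through the replacement procedure.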
2. The node replacement method according to claim 1, characterized in that, after the step of installing on the node at least one of the operating system, the software package data, and the firmware package data, the method further comprises:
continuously monitoring the condition of the hardware in the node to determine whether the hardware produces an error;
if it is determined that the hardware produces an unrecoverable error, turning off the power of the node in order to perform the node replacement procedure; and
if it is determined that the number of recoverable errors produced by the hardware reaches a preset threshold, performing a normal shutdown procedure on the node and then performing the node replacement procedure.
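As a rough illustration of the monitoring logic in claim 2 (not part of the claims), the sketch below counts recoverable errors against a threshold and reacts immediately to an unrecoverable one. The function name `monitor_hardware`, the event strings, and the returned action strings are hypothetical; how errors are actually reported by the hardware is not specified in the source.

```python
from typing import Iterable, Optional


def monitor_hardware(events: Iterable[str], threshold: int) -> Optional[str]:
    """Scan a stream of hardware status events.

    Returns "power_off" on the first unrecoverable error, "normal_shutdown"
    once the count of recoverable errors reaches the threshold, or None if
    the stream ends without either condition being met.
    """
    recoverable_count = 0
    for event in events:
        if event == "unrecoverable":
            return "power_off"
        if event == "recoverable":
            recoverable_count += 1
            if recoverable_count >= threshold:
                return "normal_shutdown"
    return None
```

Either returned action would then be followed by the node replacement procedure.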
3. The node replacement method according to claim 1, characterized in that the node replacement procedure comprises:
setting the node to an initial mode;
generating a second prompt signal, the second prompt signal indicating that the node may be removed from the server system;
detecting whether the node is removed from the server system;
if the node is detected to be removed from the server system, detecting whether the node or another node is inserted into the server system; and
if the node or the other node is detected to be inserted into the server system, generating the first prompt signal and continuing with the steps subsequent to the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node.
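The node replacement procedure of claim 3 can be read as a small state machine. The sketch below is one possible reading, offered for illustration only and not as part of the claims; the `State` names and the `next_state` function are hypothetical, and presence of a node in the slot is reduced to a single boolean.

```python
from enum import Enum, auto


class State(Enum):
    WAIT_REMOVAL = auto()    # initial mode set, second prompt signal: node may be removed
    WAIT_INSERTION = auto()  # slot is empty, waiting for this node or another node
    RESUME = auto()          # first prompt signal: node must not be removed, main flow resumes


def next_state(state: State, node_present: bool) -> State:
    """Advance the replacement procedure by one presence reading."""
    if state is State.WAIT_REMOVAL and not node_present:
        return State.WAIT_INSERTION
    if state is State.WAIT_INSERTION and node_present:
        return State.RESUME
    return state
```

A caller would start in `State.WAIT_REMOVAL` after setting the node to the initial mode and feed presence readings to `next_state` until `State.RESUME` is reached.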
4. The node replacement method according to claim 3, characterized in that the step of detecting whether the node is removed from the server system further comprises:
setting a first preset time and starting timing;
determining whether the node has been removed from the server system;
if it is determined that the node has not yet been removed from the server system, resetting the first preset time and restarting timing; and
if it is determined that the node has been removed from the server system and the first preset time has elapsed, performing the steps subsequent to the step of detecting whether the node or another node is inserted into the server system.
5. The node replacement method according to claim 3, characterized in that the step of detecting whether the node or another node is inserted into the server system further comprises:
setting a first preset time and a second preset time and starting timing, wherein the second preset time immediately follows the first preset time;
determining whether the node is inserted into the server system;
if it is determined that the node has not yet been inserted into the server system, resetting the first preset time and restarting timing;
if it is determined that the node has been inserted into the server system and the first preset time has elapsed, determining whether the node remains inserted in the server system throughout the second preset time;
if it is determined that the node is removed from the server system within the second preset time, continuing with the steps subsequent to the step of setting the node to the initial mode; and
if it is determined that the node has still not been removed from the server system after the second preset time has elapsed, generating the first prompt signal and continuing with the steps subsequent to the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node.
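The two-stage timing in claim 5 resembles a debounce: the node must stay inserted for the first preset time, then remain inserted through the second preset time before the insertion is accepted. The following sketch is illustrative only and not part of the claims. It assumes the presence of the node is sampled at regular ticks and that both preset times are positive tick counts; the function name `confirm_insertion` and the result strings are hypothetical.

```python
from typing import Iterable, Optional


def confirm_insertion(samples: Iterable[bool], first_ticks: int,
                      second_ticks: int) -> Optional[str]:
    """Classify a sequence of presence samples (True means node inserted).

    Returns "confirmed" if the node stays inserted through both preset times,
    "removed" if it is pulled out during the second preset time, or None if
    the samples run out before a decision is reached.
    """
    readings = iter(samples)

    # First preset time: any absent reading resets the timer.
    elapsed = 0
    for present in readings:
        if not present:
            elapsed = 0
            continue
        elapsed += 1
        if elapsed >= first_ticks:
            break
    else:
        return None  # first preset time never completed

    # Second preset time: the node must remain inserted throughout.
    held = 0
    for present in readings:
        if not present:
            return "removed"  # back to the steps after setting the initial mode
        held += 1
        if held >= second_ticks:
            return "confirmed"  # generate the first prompt signal and continue
    return None
```

The removal check of claim 4 could be sketched the same way with a single timer and the presence readings inverted.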
6. A server system, characterized by comprising:
a node having hardware;
a detection module, communicatively connected to the node, configured to detect whether the node is inserted into or removed from the server system, and to detect a first identification code of the node and first hardware configuration information of the hardware;
a prompt module, communicatively connected to the detection module, configured to generate a first prompt signal when the detection module detects that the node is inserted into the server system, the first prompt signal indicating that the node must not be removed from the server system; and
a processing module, communicatively connected between the detection module and the node, configured to determine, according to the first identification code and the first hardware configuration information, and according to a second identification code and second hardware configuration information from before the node was inserted into the server system, whether the node or the hardware has been replaced;
wherein, if the processing module determines that neither the node nor the hardware has been replaced, the power of the node is turned off in order to perform a node replacement procedure, and if the processing module determines that the node or the hardware has been replaced, at least one of an operating system, software package data, and firmware package data is installed on the node.
7. The server system according to claim 6, characterized in that, after the processing module installs on the node at least one of the operating system, the software package data, and the firmware package data, the processing module further continuously determines whether the hardware produces an error; if it is determined that the hardware produces an unrecoverable error, the power of the node is turned off in order to perform the node replacement procedure; and if it is determined that the number of recoverable errors produced by the hardware reaches a preset threshold, a normal shutdown procedure is performed on the node and the node replacement procedure is then performed.
8. The server system according to claim 6, characterized in that, when the server system performs the node replacement procedure, the processing module sets the node to an initial mode; the prompt module then generates a second prompt signal, the second prompt signal indicating that the node may be removed from the server system; the detection module then detects whether the node is removed from the server system and, after detecting that the node has been removed from the server system, detects whether the node or another node is inserted into the server system; and if the detection module detects that the node or the other node is inserted into the server system, the prompt module generates the first prompt signal, and the processing subsequent to the detection module detecting the first identification code of the node and the first hardware configuration information of the hardware continues.
9. The server system according to claim 8, characterized in that the server system further comprises a timing module communicatively connected to the detection module; when the detection module detects whether the node is removed from the server system, the timing module sets a first preset time and starts timing; if the detection module detects that the node has not yet been removed from the server system within the first preset time, the timing module resets the first preset time and restarts timing; and if the detection module detects that the node has been removed from the server system and the first preset time has elapsed, the processing subsequent to detecting whether the node or another node is inserted into the server system continues.
10. The server system according to claim 8, characterized in that the server system further comprises:
a timing module communicatively connected to the detection module, wherein, when the detection module detects whether the node or another node is inserted into the server system, the timing module sets a first preset time and a second preset time and starts timing, the second preset time immediately following the first preset time; if the detection module detects that the node has not yet been inserted into the server system within the first preset time, the timing module resets the first preset time and restarts timing; if the detection module detects that the node has been inserted into the server system and the first preset time has elapsed, the detection module then detects whether the node remains inserted in the server system throughout the second preset time; if the detection module detects that the node is removed from the server system within the second preset time, the processing subsequent to the processing module setting the node to the initial mode continues; and if the detection module detects that the node has still not been removed from the server system after the second preset time has elapsed, the prompt module generates the first prompt signal, and the processing subsequent to the detection module detecting the first identification code of the node and the first hardware configuration information of the hardware continues.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310597425.0A CN104657166B (en) | 2013-11-22 | 2013-11-22 | server system and node replacement method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310597425.0A CN104657166B (en) | 2013-11-22 | 2013-11-22 | server system and node replacement method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104657166A CN104657166A (en) | 2015-05-27 |
CN104657166B true CN104657166B (en) | 2018-03-20 |
Family
ID=53248348
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310597425.0A Active CN104657166B (en) | 2013-11-22 | 2013-11-22 | server system and node replacement method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104657166B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110457074A (en) * | 2019-07-26 | 2019-11-15 | 新华三技术有限公司成都分公司 | Configuration method, device, electronic equipment and the storage medium of calculate node |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001082678A2 (en) * | 2000-05-02 | 2001-11-08 | Sun Microsystems, Inc. | Cluster membership monitor |
CN102135932A (en) * | 2011-03-08 | 2011-07-27 | 浪潮(北京)电子信息产业有限公司 | Monitoring system and monitoring method thereof |
CN102769673A (en) * | 2012-07-25 | 2012-11-07 | 楚云汉智武汉网络存储系统有限公司 | Failure detection method suitable to large-scale storage cluster |
CN103186403A (en) * | 2011-12-28 | 2013-07-03 | 英业达股份有限公司 | Node replacement processing method and server system using same |
Also Published As
Publication number | Publication date |
---|---|
CN104657166A (en) | 2015-05-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6496790B1 (en) | Management of sensors in computer systems | |
US6976197B2 (en) | Apparatus and method for error logging on a memory module | |
US11748218B2 (en) | Methods, electronic devices, storage systems, and computer program products for error detection | |
CN106561018A (en) | Server monitoring method, monitoring device and monitoring system | |
CN106059783A (en) | Cabling connection method and cabling connection system | |
CA2503757A1 (en) | Method and apparatus for validation and error resolution of configuration data in a private branch exchange switch | |
US20080270827A1 (en) | Recovering diagnostic data after out-of-band data capture failure | |
CN106407059A (en) | Server node testing system and method | |
CN103164316B (en) | Hardware monitor | |
CN102710740B (en) | A kind of device identifier determines method | |
CN101989220A (en) | Pressure testing method | |
CN110674034A (en) | Health examination method and device, electronic equipment and storage medium | |
CN109918242A (en) | A kind of method and system of automatic detection server product configuration information | |
TW201305813A (en) | Computer system and diagnostic method thereof | |
CN109088744A (en) | Powerline network abnormal intrusion detection method, device, equipment and storage medium | |
CN116662091A (en) | Method, device, equipment and storage medium for detecting high-speed cable of server | |
CN104657166B (en) | server system and node replacement method | |
CN106775847A (en) | A kind of board software version updating method and device | |
CN103957130B (en) | Fault detect and restoration methods and system | |
US20120054391A1 (en) | Apparatus and method for testing smnp cards | |
CN103018617B (en) | Circuit board detection method and detection system thereof | |
CN107431459A (en) | Photovoltaic string combiner with modular platform framework | |
JP6217086B2 (en) | Information processing apparatus, error detection function diagnosis method, and computer program | |
JP5683354B2 (en) | Monitoring device and monitoring method | |
TWI518519B (en) | Server system and node replacement method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |