CN104657166B - server system and node replacement method - Google Patents



Publication number
CN104657166B
Authority
CN
China
Prior art keywords
node
server system
hardware
preset time
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310597425.0A
Other languages
Chinese (zh)
Other versions
CN104657166A (en)
Inventor
卢盈志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Pudong Technology Corp
Inventec Corp
Original Assignee
Inventec Pudong Technology Corp
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Pudong Technology Corp, Inventec Corp filed Critical Inventec Pudong Technology Corp
Priority to CN201310597425.0A priority Critical patent/CN104657166B/en
Publication of CN104657166A publication Critical patent/CN104657166A/en
Application granted granted Critical
Publication of CN104657166B publication Critical patent/CN104657166B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Abstract

A node replacement method suitable for a server system. The method proceeds as follows. Detect whether a node is inserted into the server system, and generate a prompt signal when the node is detected as inserted; the prompt signal indicates that the node must not be removed from the server system. Detect a first identification code of the node and first hardware configuration information of the hardware in the node. Based on the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information from before the node was inserted into the server system, determine whether the node or the hardware has been replaced. If not, power off the node in order to perform a node replacement procedure. If so, install at least one of an operating system, software package data, and firmware package data on the node.

Description

Server system and node replacement method
Technical field
The present invention relates to a server system (for example, a cabinet-type data center) and a node replacement method, and in particular to a server system and a node replacement method that can carry out a node replacement procedure quickly.
Background art
With advances in technology, computers around the world can be connected to one another over the Internet. Through a network connection, one computer can exchange data with, or access data on, another computer. In a client-server architecture, the client communicates with the server over the network.
In general, a server system may be configured with multiple nodes, each of which runs multiple virtual machines (VMs) at the same time to give each user an independent operating environment. Each node can be regarded as an independent computer; that is, each node has memory, storage, computing capability, and network connectivity. Each node can therefore run its own operating system, and the nodes can communicate and transfer data with one another through network equipment.
After the server system has been set up, the nodes in it must be deployed; that is, the operating system, software package data, and firmware package data required by each node must be installed so that the server system can operate and provide services to users. However, when the hardware of any node in the server system is damaged, the corresponding node can no longer operate normally. Because inspection personnel cannot tell exactly which hardware in which node is damaged, they can only remove and reinsert the nodes one by one to test them, which wastes inspection time.
Summary of the invention
The technical problem to be solved by the present invention is to provide a server system and a node replacement method that can automatically determine, from the identification code of a node and the hardware configuration information of the hardware in the node, whether a specific node or specific hardware in that node needs to be replaced, so that inspection personnel can replace the node quickly and easily.
To achieve the above object, the present invention provides a node replacement method suitable for a server system. The steps of the method are as follows. Detect whether a node is inserted into the server system, and generate a first prompt signal when the node is detected as inserted, where the first prompt signal indicates that the node must not be removed from the server system. Detect a first identification code of the node and first hardware configuration information of the hardware in the node. Based on the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information from before the node was inserted into the server system, determine whether the node or the hardware in the node has been replaced. If neither the node nor the hardware in the node has been replaced, power off the node in order to perform a node replacement procedure. If the node or the hardware in the node has been replaced, install at least one of an operating system, software package data, and firmware package data on the node.
In one embodiment, after the step of installing at least one of the operating system, the software package data, and the firmware package data on the node, the method further includes the following steps. Continuously monitor the condition of the hardware in the node to determine whether the hardware has produced an error. If the hardware has produced an uncorrectable error, power off the node in order to perform the node replacement procedure. If the number of correctable errors produced by the hardware reaches a preset threshold, perform a normal shutdown of the node and then perform the node replacement procedure.
In one embodiment, the node replacement procedure includes the following steps. Set the node to an initial mode. Generate a second prompt signal, where the second prompt signal indicates that the node may be removed from the server system. Detect whether the node has been removed from the server system. Detect whether the node or another node is inserted into the server system. If the node or another node is detected as inserted, generate the first prompt signal and continue with the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node, and the steps that follow.
Continuing the above embodiment, the step of detecting whether the node has been removed from the server system further includes the following steps. Set a first preset time and start timing. Determine whether the node has been removed from the server system. If the node has not yet been removed, reset the first preset time and restart timing. If the node has been removed and the first preset time has elapsed, perform the step of detecting whether the node or another node is inserted into the server system, and the steps that follow.
Continuing the above embodiment, the step of detecting whether the node or another node is inserted into the server system further includes the following steps. Set a first preset time and a second preset time and start timing, where the second preset time immediately follows the first preset time. Determine whether the node is inserted into the server system. If the node has not yet been inserted, reset the first preset time and restart timing. If the node has been inserted and the first preset time has elapsed, determine whether the node remains inserted throughout the second preset time. If the node is removed from the server system within the second preset time, continue with the step of setting the node to the initial mode and the steps that follow. If the node has still not been removed after the second preset time has elapsed, generate the first prompt signal and continue with the step of detecting the first identification code of the node and the first hardware configuration information of the hardware in the node, and the steps that follow.
To better achieve the above object, the present invention also provides a server system that includes a node, a detection module, a reminding module, and a processing module. The node has hardware. The detection module is communicatively connected to the node; it detects whether the node has been inserted into or removed from the server system, and detects the first identification code of the node and the first hardware configuration information of the hardware. The reminding module is communicatively connected to the detection module and generates a first prompt signal when the detection module detects that the node is inserted into the server system, where the first prompt signal indicates that the node must not be removed from the server system. The processing module is communicatively connected between the detection module and the node; based on the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information from before the node was inserted into the server system, it determines whether the node or the hardware has been replaced. If the processing module determines that neither the node nor the hardware has been replaced, it powers off the node in order to perform a node replacement procedure; if it determines that the node or the hardware has been replaced, it installs at least one of an operating system, software package data, and firmware package data on the node.
In one embodiment, after the processing module installs at least one of the operating system, the software package data, and the firmware package data on the node, the processing module also continuously determines whether the hardware has produced an error. If the processing module determines that the hardware has produced an uncorrectable error, it powers off the node in order to perform the node replacement procedure. If the processing module determines that the number of correctable errors produced by the hardware has reached a preset threshold, it performs a normal shutdown of the node and then performs the node replacement procedure.
In one embodiment, when the server system performs the node replacement procedure, the processing module sets the node to an initial mode. The reminding module then generates a second prompt signal, which indicates that the node may be removed from the server system. The detection module detects whether the node has been removed from the server system and, after detecting that it has, goes on to detect whether the node or another node is inserted into the server system. If the detection module detects that the node or another node is inserted, the reminding module generates the first prompt signal, and the detection module continues with detecting the first identification code of the node and the first hardware configuration information of the hardware, and the processing that follows.
Continuing the above embodiment, the server system further includes a timing module communicatively connected to the detection module. When the detection module detects whether the node has been removed from the server system, the timing module sets a first preset time and starts timing. If the detection module detects that the node has not yet been removed within the first preset time, the timing module resets the first preset time and restarts timing. If the detection module detects that the node has been removed and the first preset time has elapsed, the flow continues with detecting whether the node or another node is inserted into the server system, and the processing that follows.
Continuing the above embodiment, the server system further includes a timing module communicatively connected to the detection module. When the detection module detects whether the node or another node is inserted into the server system, the timing module sets a first preset time and a second preset time and starts timing, where the second preset time immediately follows the first preset time. If the detection module detects that the node has not yet been inserted within the first preset time, the timing module resets the first preset time and restarts timing. If the detection module detects that the node has been inserted and the first preset time has elapsed, it goes on to detect whether the node remains inserted throughout the second preset time. If the detection module detects that the node is removed from the server system within the second preset time, the flow continues with the processing module setting the node to the initial mode. If the detection module detects that the node has still not been removed after the second preset time has elapsed, the reminding module generates the first prompt signal, and the detection module continues with detecting the first identification code of the node and the first hardware configuration information of the hardware, and the processing that follows.
The technical effects of the invention are that:
The server system and node replacement method of the present invention detect the identification code of a node and the hardware configuration information of the hardware in the node to determine whether the node or the hardware in the node has been replaced, and then selectively perform the node replacement procedure or install the operating system, software package data, or firmware package data on the node. In addition, after the operating system, software package data, or firmware package data has been installed on the node, the server system and node replacement method can continuously monitor the condition of the hardware in the node and, when the hardware produces an error, use a prompt signal to let inspection personnel know that the node replacement procedure can be carried out.
The present invention is described in detail below with reference to the accompanying drawings and specific embodiments, which are not intended to limit the invention.
Brief description of the drawings
Fig. 1 is a functional block diagram of a server system according to one embodiment of the present invention;
Fig. 2A is a flow chart of a node replacement method for a server system according to one embodiment of the present invention;
Fig. 2B is a flow chart of a node replacement method for a server system according to another embodiment of the present invention;
Fig. 3 is a flow chart of a node replacement procedure according to one embodiment of the present invention;
Fig. 4 is a detailed flow chart of step S304 in Fig. 3;
Fig. 5 is a detailed flow chart of step S306 in Fig. 3.
Reference numerals:
1 server system
10 node
12 detection module
14 reminding module
16 processing module
18 timing module
S200~S214, S300~S308, S400~S404, S500~S506 steps
Detailed description of the embodiments
The structure and operating principle of the present invention are described in detail below with reference to the accompanying drawings:
Refer to Fig. 1, which is a functional block diagram of a server system according to one embodiment of the present invention. As shown in Fig. 1, the server system 1 includes a node 10, a detection module 12, a reminding module 14, a processing module 16, and a timing module 18. The detection module 12 is communicatively connected to the node 10, the reminding module 14, the processing module 16, and the timing module 18, and the node 10 is also communicatively connected to the processing module 16. The communication connections described in the present invention may be implemented as physical connections or as wireless connections; the invention is not limited in this respect. In practice, the server system 1 may be a container data center, but is not limited to this. The functional modules of the server system 1 are described in detail below.
The node 10 has at least one piece of hardware, which may include a baseboard management controller (BMC), a network interface controller (NIC, also called a network card), a hard disk drive (HDD), a dual in-line memory module (DIMM), a central processing unit (CPU), and so on, but is not limited to these. Although Fig. 1 shows only one node, the server system of the present invention does not limit the number of nodes.
The detection module 12 detects whether the node 10 has been inserted into or removed from the server system 1, and detects the first identification code of the node 10 and the first hardware configuration information of the hardware in the node 10. In practice, the identification code of the node 10 may be a universally unique identifier (UUID), but is not limited to this. In general, a UUID consists of 16 bytes (128 bits), usually written in hexadecimal, so that each node 10 has unique identification information. The UUID can be obtained from the UUID field of the SMBIOS (System Management BIOS) Type 1 data structure. The hardware configuration information of the node 10 can be condensed into a unique 4-byte hardware signature: during POST (Power-On Self-Test), the BIOS (Basic Input/Output System) gathers the hardware configuration information, computes the hardware signature, and stores it in the Hardware Signature field of the ACPI (Advanced Configuration and Power Interface) FACS (Firmware ACPI Control Structure) table. This field can be used to determine quickly whether the hardware configuration information has changed. In addition, whether the node 10 has been inserted or removed can be detected by pinging the NIC of the BMC on the node 10.
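As an illustration of the two identifiers described above, the following minimal sketch extracts the Hardware Signature from a raw FACS table and the UUID from a raw SMBIOS Type 1 structure. The function names are hypothetical and do not come from the patent; how the raw bytes are obtained from the platform is outside the sketch. It assumes the standard layouts (Hardware Signature at byte offset 8 of the FACS, UUID at byte offset 0x08 of the Type 1 structure) and the SMBIOS 2.6 or later convention that the first three UUID fields are stored little-endian.

```python
import struct
import uuid


def parse_facs_hardware_signature(facs: bytes) -> int:
    """Return the 4-byte Hardware Signature from a raw FACS table (hypothetical helper)."""
    if len(facs) < 12 or facs[0:4] != b"FACS":
        raise ValueError("not a FACS table")
    # Layout: 4-byte "FACS" signature, 4-byte length, 4-byte hardware signature.
    _length, hw_signature = struct.unpack_from("<II", facs, 4)
    return hw_signature


def parse_smbios_type1_uuid(structure: bytes) -> uuid.UUID:
    """Return the UUID from a raw SMBIOS Type 1 structure (hypothetical helper)."""
    if len(structure) < 0x18 or structure[0] != 1:
        raise ValueError("not an SMBIOS Type 1 structure with a UUID field")
    # The UUID occupies 16 bytes starting at offset 0x08.
    # Assumption: SMBIOS 2.6+ encoding, first three fields little-endian.
    return uuid.UUID(bytes_le=bytes(structure[0x08:0x18]))
```

Comparing the two values against those stored from the previous insertion is then a plain equality check, which is why the 4-byte signature allows a quick decision on whether the hardware configuration has changed.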
The reminding module 14 generates a first prompt signal when the detection module 12 detects that the node 10 is inserted into the server system 1; this first prompt signal indicates that the node 10 must not be removed from the server system 1. In certain situations, the reminding module 14 generates a second prompt signal, which indicates that the node 10 may be removed from the server system 1. In practice, the reminding module 14 may be a display module (for example, an electronic display element such as a light-emitting diode, a display panel, or a seven-segment display) or a sound module (for example, an electronic sound element such as a loudspeaker or a buzzer); the invention is not limited in this respect. If the reminding module 14 is a display module, the prompt signal is presented to the user as an image or light; if it is a sound module, the prompt signal is presented to the user as sound.
The processing module 16 determines whether the node 10 or the hardware in the node 10 has been replaced, based on the identification code of the node 10 and the hardware configuration information of the hardware in the node 10, together with the second identification code and the second hardware configuration information from before the node 10 was inserted into the server system 1. Note that "the second identification code and the second hardware configuration information before the node 10 was inserted into the server system 1" means the identification code and hardware configuration information of the node 10 at its previous insertion into the server system 1. If a new node 10 is inserted into the server system, its second identification code and second hardware configuration information are both empty. The timing module 18 sets at least one preset time and starts timing. In some situations, the timing module 18 can reset the timer to zero while timing is in progress, so that timing starts again.
To explain the actual operation of the server system 1 and the node replacement method more clearly, refer to Fig. 1 and Fig. 2A together. Fig. 2A is a flow chart of a node replacement method for a server system according to one embodiment of the present invention. As shown in Fig. 2A, in step S200 the detection module 12 detects whether the node 10 is inserted into the server system 1. When the node 10 is detected as inserted, the reminding module 14 generates a first prompt signal and the flow proceeds to step S202. If the detection module 12 does not detect that the node 10 is inserted into the server system 1, step S200 is repeated until it does.
In step S202, the detection module 12 detects the first identification code of the node 10 and the first hardware configuration information of at least one piece of hardware in the node 10. In step S204, the processing module 16 determines, based on the first identification code and the first hardware configuration information, together with the second identification code and the second hardware configuration information from before the node 10 was inserted into the server system 1, whether the node 10 or the hardware in the node 10 has been replaced. If the processing module 16 determines that the node 10 or the hardware in the node 10 has been replaced, step S206 is performed; if it determines that neither has been replaced, step S208 is performed. Note that even when neither the node 10 nor its hardware has been replaced, reinstallation of the operating system, software package data, or firmware package data on the node 10 can be forced (not shown). A practical case is as follows: when the hardware of the node 10 produces a hardware error because of poor contact, the node 10 can be pulled out, the hardware reseated so that it makes good contact, and the node 10 then inserted back into the server system 1.
In step S206, the processing module 16 installs at least one of an operating system (OS), software package data, and firmware package data on the node 10. In step S208, the processing module 16 powers off the node 10 in order to perform the node replacement procedure.
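The decision made in steps S204 to S208 can be sketched as follows. This is a minimal illustration, not the patented implementation: the names NodeRecord, Action, and decide are hypothetical, the stored record from the previous insertion is modelled as None when it is empty, and an empty record (a new node) is assumed to take the installation branch.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


@dataclass(frozen=True)
class NodeRecord:
    node_id: str       # identification code, e.g. a UUID string
    hw_signature: int  # hardware signature summarising the hardware configuration


class Action(Enum):
    INSTALL = "S206: install OS / software package / firmware package"
    POWER_OFF_AND_REPLACE = "S208: power off, run node replacement procedure"


def decide(current: NodeRecord,
           previous: Optional[NodeRecord],
           force_install: bool = False) -> Action:
    """Step S204: compare the detected record with the stored one."""
    # Assumption: an empty previous record (new node) counts as "replaced".
    replaced = previous is None or current != previous
    if replaced or force_install:
        return Action.INSTALL
    return Action.POWER_OFF_AND_REPLACE
```

The force_install flag models the forced reinstallation mentioned for the case where the same node is reseated without any hardware change.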
Refer to Fig. 2B, which is a flow chart of a node replacement method for a server system according to another embodiment of the present invention. As shown in Fig. 2B, after the step of installing at least one of the operating system, the software package data, and the firmware package data on the node (step S206), the detection module 12 or a separate monitoring module (not shown) continuously monitors the condition of the hardware in the node, so that the processing module 16 can determine whether the hardware has produced an error (step S210). If the detection module 12 or the monitoring module detects that the hardware in the node 10 has produced an uncorrectable error, step S212 is performed; if it detects that the number of correctable errors produced by the hardware in the node 10 has reached a preset threshold, step S214 is performed.
In step S212, because the hardware in the node 10 has produced an uncorrectable error, meaning that the node 10 is already damaged and cannot operate normally, the processing module 16 powers off the node 10 in order to perform the node replacement procedure. In step S214, because the number of correctable errors produced by the hardware in the node 10 has reached the preset threshold (for example, 10 or more correctable errors within one hour), meaning that the node 10 is about to fail and will soon be unable to operate normally, the processing module 16 performs a normal shutdown of the node 10 and then performs the node replacement procedure.
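The error handling of steps S210 to S214 can be sketched with a sliding time window. The class name ErrorMonitor and the returned strings are hypothetical; the defaults follow the example of 10 correctable errors within one hour, and the use of a sliding window (rather than fixed hourly buckets) is an assumption of this sketch.

```python
from collections import deque


class ErrorMonitor:
    """Track hardware errors for one node and suggest an action (steps S210 to S214)."""

    def __init__(self, threshold: int = 10, window_seconds: float = 3600.0):
        self.threshold = threshold
        self.window = window_seconds
        self._times = deque()  # timestamps of recent correctable errors

    def record(self, timestamp: float, correctable: bool) -> str:
        if not correctable:
            return "power_off"  # S212: uncorrectable error, power off immediately
        self._times.append(timestamp)
        # Drop correctable errors that fell out of the time window.
        while self._times and timestamp - self._times[0] > self.window:
            self._times.popleft()
        if len(self._times) >= self.threshold:
            return "shutdown"   # S214: normal shutdown, then replacement procedure
        return "ok"
```

Either non-"ok" result leads to the node replacement procedure; the difference is only whether the node is powered off directly or shut down normally first.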
Refer to Fig. 3, which is a flow chart of a node replacement procedure according to one embodiment of the present invention. As shown in Fig. 3, in step S300 the processing module 16 sets the node 10 to an initial mode. In this embodiment, the initial mode is the Dynamic Host Configuration Protocol (DHCP) mode. In actual operation, when the processing module 16 determines that the node 10 may be removed from the server system 1, it can automatically set the baseboard management controller of the node 10 back to DHCP mode, so that the baseboard management controller obtains a new Internet Protocol (IP) address.
In step S302, the reminding module 14 generates a second prompt signal, which indicates that the node 10 may be removed from the server system 1. In step S304, the detection module 12 detects whether the node 10 has been removed from the server system 1. If the detection module 12 detects that the node 10 has not yet been removed, step S304 continues; if it detects that the node 10 has been removed, step S306 is performed. In step S306, the detection module 12 then detects whether the node 10 or another node is inserted into the server system 1. If the detection module 12 detects that a node (the node 10 or another node) is inserted into the server system 1, step S308 is performed; if it detects that no node has yet been inserted, step S306 continues. In step S308, the reminding module 14 generates the first prompt signal, and the flow continues with step S202.
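The sequence of steps S300 to S308 can be traced with a small sketch driven by presence samples. The function name and the event strings are hypothetical, and the timing checks of Fig. 4 and Fig. 5 are deliberately left out here; each sample is simply True when a node is detected in the slot.

```python
def replacement_procedure(presence_samples):
    """Trace steps S300 to S308 over an iterable of booleans (True = node detected)."""
    events = [
        "S300: set node to initial (DHCP) mode",
        "S302: second prompt signal (node may be removed)",
    ]
    samples = iter(presence_samples)
    # S304: keep checking until the node is no longer detected.
    for present in samples:
        if not present:
            events.append("S304: node removed")
            break
    else:
        return events  # samples ran out while the node was still inserted
    # S306: keep checking until this node or another node is detected.
    for present in samples:
        if present:
            events.append("S306: node inserted")
            events.append("S308: first prompt signal (do not remove), continue at S202")
            break
    return events
```

For example, the samples True, True, False, False, True walk through every step and end with the S308 event, after which the flow resumes at step S202.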
Refer to Fig. 4, which is a detailed flow chart of step S304 in Fig. 3. As shown in Fig. 4, after the reminding module 14 generates the second prompt signal (step S302), the timing module 18 sets a first preset time (for example, one minute) and starts timing. In step S402, it is determined whether the node 10 has been removed from the server system 1. If the node 10 has not yet been removed within the first preset time, step S404 is performed; if the node 10 has been removed and the first preset time has elapsed, step S306 is performed. In step S404, the timing module 18 resets the first preset time and restarts timing, and the flow returns to the determination in step S402.
The determination in step S402 of whether the node 10 has been removed from the server system 1 may be made through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention is not limited in this respect. For example, removal of the node 10 can be detected by pinging the NIC of the BMC on the node 10. The mechanism of Fig. 4 thus avoids wrongly concluding that the node 10 has been removed when the cause is actually an unstable network or poor contact. In other words, the mechanism of Fig. 4 is a debounce mechanism.
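The debounce mechanism of Fig. 4 can be sketched as follows. The function name is hypothetical, and the samples are assumed to be (timestamp, present) pairs produced by some polling mechanism such as the ping described above; removal is confirmed only after the node has been absent for the whole first preset time since it was last seen.

```python
def removal_confirmed_at(samples, hold_seconds: float = 60.0):
    """Return the timestamp at which removal is confirmed, or None if it never is.

    samples: iterable of (timestamp, present) pairs in increasing time order.
    """
    timer_start = None
    for timestamp, present in samples:
        if timer_start is None:
            timer_start = timestamp      # set the first preset time, start timing
        if present:
            timer_start = timestamp      # S404: node still there, restart timing
        elif timestamp - timer_start >= hold_seconds:
            return timestamp             # removed and preset time elapsed, go to S306
    return None
```

A brief ping failure followed by a successful ping only restarts the timer, so a momentary network glitch is not mistaken for a removal.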
Refer to Fig. 5, which is a detailed flow chart of step S306 in Fig. 3. As shown in Fig. 5, after the step in which the detection module 12 detects whether the node 10 has been removed from the server system 1 (step S304), the timing module 18 sets a first preset time and a second preset time and starts timing, where the second preset time immediately follows the first preset time. For example, the first preset time is the first minute after the timing module 18 starts timing (seconds 0 to 60), and the second preset time is the second minute (seconds 61 to 120). Note that the first preset time and the second preset time may differ in length.
In step S502, it is determined whether the node 10 is inserted into the server system 1. If the node 10 has not yet been inserted, step S504 is performed; if the node 10 has been inserted, step S506 is performed. In step S504, the timing module 18 resets the first preset time and restarts timing, and the flow returns to the determination in step S502; the mechanism of steps S502 and S504 is therefore a debounce mechanism. The determination in step S502 may be made through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention is not limited in this respect. For example, insertion of the node 10 can be detected by pinging the NIC of the BMC on the node 10.
In step S506, if the node 10 is inserted into the server system 1 and the first preset time has elapsed, it is then determined whether the node 10 remains inserted throughout the second preset time. If it does, the node 10 and the position in the server system 1 into which it was inserted are both correct, and step S308 and the subsequent steps are performed. If the node 10 is removed from the server system 1 within the second preset time, the node 10 or the position into which it was inserted may have been wrong and the node was pulled out again; step S300 and the subsequent steps are then performed so that the correct node 10 can be inserted into the correct position in the server system 1. The mechanism of step S506 is therefore a fool-proofing mechanism against human error.
The determination in step S506 may likewise be made through the detection module 12, the processing module 16, or the network interface controller of the node 10; the invention is not limited in this respect. For example, whether the node 10 remains inserted can be detected by pinging the NIC of the BMC on the node 10. In addition to avoiding a wrong conclusion that the node 10 has been inserted when the cause is actually an unstable network or poor contact, the mechanism of Fig. 5 gives a user who has inserted the wrong node a chance to pull it out and insert the correct one. In other words, the mechanism of Fig. 5 combines a debounce mechanism with a fool-proofing mechanism.
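The combined debounce and fool-proofing check of Fig. 5 can be sketched in the same style. The function name and the three result strings are hypothetical, and the samples are again assumed to be (timestamp, present) pairs; a node is accepted only after it has stayed inserted through the first preset time and then through the second preset time.

```python
def confirm_insertion(samples, first_seconds: float = 60.0, second_seconds: float = 60.0) -> str:
    """Return "accepted" (go to S308), "restart" (back to S300), or "pending".

    samples: iterable of (timestamp, present) pairs in increasing time order.
    """
    timer_start = None
    debounced = False  # True once the node stayed inserted for the first preset time
    for timestamp, present in samples:
        if timer_start is None:
            timer_start = timestamp
        if not debounced:
            if not present:
                timer_start = timestamp   # S504: not inserted yet, restart timing
            elif timestamp - timer_start >= first_seconds:
                debounced = True          # first preset time elapsed, enter S506
        elif not present:
            return "restart"              # pulled out during the second preset time
        if debounced and timestamp - timer_start >= first_seconds + second_seconds:
            return "accepted"             # still inserted after the second preset time
    return "pending"
```

The "restart" result corresponds to the fool-proofing branch: a wrongly inserted node that is pulled out again sends the flow back to step S300 instead of triggering identification and installation.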
In summary, the server system and node replacement method provided by the embodiments of the present invention detect the identification code of a node and the hardware configuration information of the hardware in the node to determine whether the node or the hardware in the node has been replaced, or whether a new node has been added, and then selectively perform the node replacement procedure or install the operating system, software package data, or firmware package data on the node. Even when neither the node nor the hardware has been replaced, reinstallation of the operating system, software package data, or firmware package data on the node can be forced. In addition, after the operating system, software package data, or firmware package data has been installed on the node, the server system and node replacement method can continuously monitor the condition of the hardware in the node and, when the hardware produces an error, use a prompt signal to let the user know that the node replacement procedure can be carried out. The server system and node replacement method of the present invention can thus automatically handle the determination of whether a node needs to be replaced; the user only needs to insert the node into, or remove it from, the server system according to the prompt signals, without performing any other inspection, which makes the invention highly practical.
Of course, the present invention may have various other embodiments. Those skilled in the art can make corresponding changes and modifications according to the present invention without departing from its spirit and essence, and all such changes and modifications fall within the scope of protection of the appended claims.

Claims (10)

1. A node replacement method, applicable to a server system, characterized in that the node replacement method comprises:
detecting whether a node is inserted into the server system, and generating a first prompt signal when the node is detected to be inserted into the server system, the first prompt signal indicating that the node must not be removed from the server system;
detecting a first identification code of the node and first hardware configuration information of the hardware in the node;
determining, according to the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information from before the node was inserted into the server system, whether the node or the hardware in the node has been replaced;
if it is determined that neither the node nor the hardware in the node has been replaced, powering off the node in order to perform a node replacement procedure; and
if it is determined that the node or the hardware in the node has been replaced, installing at least one of an operating system, software package data, and firmware package data on the node.
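The comparison step of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `NodeRecord`, `Action`, and `decide`, and the representation of hardware configuration information as a set of (component, serial) pairs, are assumptions made only for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    """The two outcomes named in claim 1 (labels are illustrative)."""
    RUN_REPLACEMENT_PROCEDURE = "power off the node and run the node replacement procedure"
    INSTALL = "install operating system, software package data, or firmware package data"


@dataclass(frozen=True)
class NodeRecord:
    """Hypothetical record: an identification code plus hardware configuration."""
    identification_code: str
    hardware_config: frozenset  # assumed form: {(component, serial), ...}


def decide(first: NodeRecord, second: NodeRecord) -> Action:
    """Compare the freshly detected (first) record with the stored (second) one."""
    replaced = (
        first.identification_code != second.identification_code
        or first.hardware_config != second.hardware_config
    )
    # Replaced node or hardware -> install; nothing replaced -> replacement procedure.
    return Action.INSTALL if replaced else Action.RUN_REPLACEMENT_PROCEDURE
```

For example, a node whose identification code is unchanged but whose memory module carries a different serial number would be treated as having replaced hardware, so `decide` returns `Action.INSTALL`.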
2. The node replacement method as claimed in claim 1, characterized in that, after the step of installing at least one of the operating system, the software package data, and the firmware package data on the node, the method further comprises:
continuously monitoring the condition of the hardware in the node to determine whether the hardware has an error;
if it is determined that the hardware has an unrecoverable error, powering off the node in order to perform the node replacement procedure; and
if it is determined that the number of recoverable errors of the hardware has reached a preset threshold, performing a normal shutdown procedure on the node and then performing the node replacement procedure.
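The two error paths of claim 2 can be sketched as a small counter. This is an illustrative sketch only: `HardwareMonitor`, `ErrorKind`, and `Response` are hypothetical names, and how errors are actually reported by the hardware is not specified here.

```python
from enum import Enum


class ErrorKind(Enum):
    UNRECOVERABLE = "unrecoverable"
    RECOVERABLE = "recoverable"


class Response(Enum):
    CONTINUE = "keep monitoring"
    POWER_OFF = "power off, then node replacement procedure"
    NORMAL_SHUTDOWN = "normal shutdown, then node replacement procedure"


class HardwareMonitor:
    """Counts recoverable errors and maps each reported error to a response."""

    def __init__(self, threshold: int) -> None:
        if threshold < 1:
            raise ValueError("threshold must be at least 1")
        self.threshold = threshold      # preset threshold for recoverable errors
        self.recoverable_count = 0

    def report(self, kind: ErrorKind) -> Response:
        if kind is ErrorKind.UNRECOVERABLE:
            # An unrecoverable error leads straight to powering off the node.
            return Response.POWER_OFF
        self.recoverable_count += 1
        if self.recoverable_count >= self.threshold:
            # Enough recoverable errors: shut down normally before replacement.
            return Response.NORMAL_SHUTDOWN
        return Response.CONTINUE
```

With `HardwareMonitor(threshold=3)`, the first two recoverable errors leave the node running and the third triggers the normal shutdown path.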
3. The node replacement method as claimed in claim 1, characterized in that the node replacement procedure comprises:
setting the node to an initial mode;
generating a second prompt signal, the second prompt signal indicating that the node may be removed from the server system;
detecting whether the node has been removed from the server system;
if the node is detected to have been removed from the server system, detecting whether the node or another node is inserted into the server system; and
if the node or the other node is detected to be inserted into the server system, generating the first prompt signal, and proceeding with the detection of the first identification code of the node and the first hardware configuration information of the hardware in the node and the steps thereafter.
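The ordering of the steps in claim 3 can be sketched as a small state machine driven by presence events. The sketch is an assumption-laden illustration: the event strings `"removed"` and `"inserted"`, the `State` names, and the log messages are all hypothetical.

```python
from enum import Enum, auto


class State(Enum):
    WAIT_REMOVAL = auto()    # second prompt signal shown, waiting for removal
    WAIT_INSERTION = auto()  # node removed, waiting for this or another node
    DONE = auto()            # node inserted, detection steps resume


def run_replacement(events):
    """Walk the node replacement procedure over 'removed' / 'inserted' events."""
    log = [
        "set node to initial mode",
        "second prompt signal: node may be removed",
    ]
    state = State.WAIT_REMOVAL
    for event in events:
        if state is State.WAIT_REMOVAL and event == "removed":
            state = State.WAIT_INSERTION
        elif state is State.WAIT_INSERTION and event == "inserted":
            log.append("first prompt signal: node must not be removed")
            log.append("detect identification code and hardware configuration")
            state = State.DONE
            break
        # Any other event is ignored in the current state.
    return state, log
```

An `"inserted"` event that arrives before any removal is ignored, because the procedure is still waiting for the node to be pulled.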
4. The node replacement method as claimed in claim 3, characterized in that the step of detecting whether the node has been removed from the server system further comprises:
setting a first preset time and starting timing;
determining whether the node has been removed from the server system;
if it is determined that the node has not yet been removed from the server system, resetting the first preset time and restarting timing; and
if it is determined that the node has been removed from the server system and the first preset time has elapsed, proceeding with the detection of whether the node or another node is inserted into the server system and the steps thereafter.
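The timer reset described in claim 4 behaves like a de-bounce on the removal signal. The following sketch assumes presence is sampled as `(time, node_present)` pairs; the function name `confirm_removal` and that sampling model are assumptions, not part of the claim.

```python
def confirm_removal(samples, first_preset):
    """Return the sample time at which removal is confirmed, or None.

    samples: iterable of (time_in_seconds, node_present) pairs in time order.
    first_preset: the first preset time, in seconds.
    """
    timer_start = None
    for t, present in samples:
        if timer_start is None:
            timer_start = t          # set the first preset time and start timing
        if present:
            timer_start = t          # node still inserted: reset and restart timing
        elif t - timer_start >= first_preset:
            return t                 # node absent for the whole first preset time
    return None
```

A brief loss of contact followed by the node reappearing restarts the timer, so only an absence that lasts the full first preset time counts as a removal.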
5. The node replacement method as claimed in claim 3, characterized in that the step of detecting whether the node or another node is inserted into the server system further comprises:
setting a first preset time and a second preset time and starting timing, wherein the second preset time immediately follows the first preset time;
determining whether the node is inserted into the server system;
if it is determined that the node has not yet been inserted into the server system, resetting the first preset time and restarting timing;
if it is determined that the node has been inserted into the server system and the first preset time has elapsed, determining whether the node remains inserted in the server system throughout the second preset time;
if it is determined that the node has been removed from the server system within the second preset time, proceeding with the step of setting the node to the initial mode and the steps thereafter; and
if it is determined that the node has still not been removed from the server system after the second preset time has elapsed, generating the first prompt signal, and proceeding with the detection of the first identification code of the node and the first hardware configuration information of the hardware in the node and the steps thereafter.
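The two consecutive time windows of claim 5 can be sketched as follows. As before, presence is assumed to arrive as `(time, node_present)` samples; `confirm_insertion` and the `Outcome` labels are hypothetical names used only for this illustration.

```python
from enum import Enum


class Outcome(Enum):
    CONFIRMED = "generate first prompt signal and detect the node"
    RETURN_TO_INITIAL_MODE = "node pulled during second preset time"
    PENDING = "no decision yet"


def confirm_insertion(samples, first_preset, second_preset):
    """Apply the first (de-bounce) and second (fool-proofing) windows in order."""
    timer_start = None
    second_window_start = None
    for t, present in samples:
        if second_window_start is None:
            # First window: any sample without the node restarts the timer.
            if timer_start is None or not present:
                timer_start = t
                continue
            if t - timer_start >= first_preset:
                second_window_start = t   # the second window follows immediately
        else:
            # Second window: the user may still pull a wrongly inserted node.
            if not present:
                return Outcome.RETURN_TO_INITIAL_MODE
            if t - second_window_start >= second_preset:
                return Outcome.CONFIRMED
    return Outcome.PENDING
```

The first window filters out unstable contact, while the second gives the user time to pull a wrongly inserted node before any detection or installation begins.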
6. A server system, characterized by comprising:
a node having hardware;
a detection module, communicatively connected to the node, configured to detect whether the node is inserted into or removed from the server system, and to detect a first identification code of the node and first hardware configuration information of the hardware;
a prompt module, communicatively connected to the detection module, configured to generate a first prompt signal when the detection module detects that the node is inserted into the server system, the first prompt signal indicating that the node must not be removed from the server system; and
a processing module, communicatively connected between the detection module and the node, configured to determine whether the node or the hardware has been replaced according to the first identification code and the first hardware configuration information, together with a second identification code and second hardware configuration information from before the node was inserted into the server system;
wherein, if the processing module determines that neither the node nor the hardware has been replaced, the node is powered off in order to perform a node replacement procedure, and if the processing module determines that the node or the hardware has been replaced, at least one of an operating system, software package data, and firmware package data is installed on the node.
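How the modules of claim 6 might cooperate on an insertion event can be sketched with plain Python classes. Everything here is a hypothetical arrangement for illustration: the class and method names, the string return values, and the in-memory `Node` stand in for hardware that the claim does not describe at this level.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    identification_code: str
    hardware_config: tuple
    powered: bool = True
    installed: bool = False


class DetectionModule:
    """Reads the first identification code and hardware configuration."""

    def read(self, node: Node):
        return node.identification_code, node.hardware_config


@dataclass
class PromptModule:
    signals: list = field(default_factory=list)

    def first_signal(self) -> None:
        self.signals.append("do-not-remove")   # first prompt signal


@dataclass
class ProcessingModule:
    previous: tuple  # (second identification code, second hardware configuration)

    def handle(self, node: Node, current: tuple) -> str:
        if current == self.previous:
            node.powered = False                # nothing replaced: power off
            return "node replacement procedure"
        node.installed = True                   # replaced: install OS/software/firmware
        return "install"


@dataclass
class ServerSystem:
    detection: DetectionModule
    prompt: PromptModule
    processing: ProcessingModule

    def on_insert(self, node: Node) -> str:
        self.prompt.first_signal()
        current = self.detection.read(node)
        return self.processing.handle(node, current)
```

Keeping the comparison inside the processing module mirrors the claim's division of labor: the detection module only reads values, and the prompt module only signals the user.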
7. The server system as claimed in claim 6, characterized in that, after the processing module installs at least one of the operating system, the software package data, and the firmware package data on the node, the processing module further continuously determines whether the hardware has an error; if it is determined that the hardware has an unrecoverable error, the node is powered off in order to perform the node replacement procedure; and if it is determined that the number of recoverable errors of the hardware has reached a preset threshold, a normal shutdown procedure is performed on the node and the node replacement procedure is then performed.
8. The server system as claimed in claim 6, characterized in that, when the server system performs the node replacement procedure, the processing module sets the node to an initial mode, and the prompt module then generates a second prompt signal, the second prompt signal indicating that the node may be removed from the server system; the detection module then detects whether the node has been removed from the server system and, after detecting that the node has been removed from the server system, goes on to detect whether the node or another node is inserted into the server system; if the detection module detects that the node or the other node is inserted into the server system, the prompt module generates the first prompt signal, and the detection module proceeds with the detection of the first identification code of the node and the first hardware configuration information of the hardware and the processing thereafter.
9. The server system as claimed in claim 8, characterized in that the server system further comprises a timing module communicatively connected to the detection module; when the detection module detects whether the node has been removed from the server system, the timing module sets a first preset time and starts timing; if the detection module detects that the node has still not been removed from the server system within the first preset time, the timing module resets the first preset time and restarts timing; and if the detection module detects that the node has been removed from the server system and the first preset time has elapsed, the detection module proceeds with the detection of whether the node or another node is inserted into the server system and the processing thereafter.
10. The server system as claimed in claim 8, characterized in that the server system further comprises:
a timing module communicatively connected to the detection module, wherein, when the detection module detects whether the node or another node is inserted into the server system, the timing module sets a first preset time and a second preset time and starts timing, the second preset time immediately following the first preset time; if the detection module detects that the node has not yet been inserted into the server system within the first preset time, the timing module resets the first preset time and restarts timing; if the detection module detects that the node has been inserted into the server system and the first preset time has elapsed, the detection module goes on to detect whether the node remains inserted in the server system throughout the second preset time; if the detection module detects that the node has been removed from the server system within the second preset time, processing proceeds with the processing module setting the node to the initial mode and the processing thereafter; and if the detection module detects that the node has still not been removed from the server system after the second preset time has elapsed, the prompt module generates the first prompt signal, and the detection module proceeds with the detection of the first identification code of the node and the first hardware configuration information of the hardware and the processing thereafter.
CN201310597425.0A 2013-11-22 2013-11-22 server system and node replacement method Active CN104657166B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310597425.0A CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310597425.0A CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Publications (2)

Publication Number Publication Date
CN104657166A CN104657166A (en) 2015-05-27
CN104657166B true CN104657166B (en) 2018-03-20

Family

ID=53248348

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310597425.0A Active CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Country Status (1)

Country Link
CN (1) CN104657166B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110457074A (en) * 2019-07-26 2019-11-15 新华三技术有限公司成都分公司 Configuration method, device, electronic equipment and the storage medium of calculate node

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001082678A2 (en) * 2000-05-02 2001-11-08 Sun Microsystems, Inc. Cluster membership monitor
CN102135932A (en) * 2011-03-08 2011-07-27 浪潮(北京)电子信息产业有限公司 Monitoring system and monitoring method thereof
CN102769673A (en) * 2012-07-25 2012-11-07 楚云汉智武汉网络存储系统有限公司 Failure detection method suitable to large-scale storage cluster
CN103186403A (en) * 2011-12-28 2013-07-03 英业达股份有限公司 Node replacement processing method and server system using same

Also Published As

Publication number Publication date
CN104657166A (en) 2015-05-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant