The method and apparatus that is used for the two-node cluster hot backup of blade server
Technical field
The present invention relates in general to network field, more specifically, relates to the method and apparatus for the two-node cluster hot backup of blade server.
Background technology
In current blade server application, the effect of administration module is mathematical.Yet due to the stability of system self and such as reasons such as software and hardwares, there is the possibility breaking down in administration module.Once fault occurs, and will cause blade server not run well.And repair system needs spended time, for some important ingresses for service or access point (enterprise, bank etc.), when administration module breaks down, current system exists very large risk.Therefore, on market, being badly in need of a kind of blade server with back mechanism avoids this situation to occur.
And for the back mechanism of other field, if directly apply to the present invention, can obtain having the two-shipper mechanism of main and standby relation, that is, and two-node cluster hot backup mechanism.Yet still there is very large defect in such scheme.For example: fault that cannot automatic monitoring operational management module; Cannot automatically start standby management module; Data resource that receives before cannot retaining when operational management module breaks down and opens standby management module, processes etc.And these not enough meetings cause great waste aspect time and data resource, the operation of blade server produces significant adverse effect.
Summary of the invention
Consider the problems referred to above and make the present invention.
The invention provides a kind of double hot standby method, comprise the following steps: by Ethernet interface, operational management module is synchronizeed with the synchrodata of standby management module; The fault of monitoring operational management module; When breaking down, standby management module substitutes the work of operational management module according to synchrodata, and wherein, monitoring comprises: step 1, checks whether the Ethernet interface between operational management module and standby management module has data transmission; Step 2, when Ethernet interface does not have data transmission within first schedule time, checks whether the serial ports between operational management module and standby management module has data transmission; Step 3, when serial ports does not have data transmission within second schedule time, determines that operational management module breaks down.
Wherein, step 2 further comprises: when Ethernet interface has data transmission within first schedule time, return to step 1.
Wherein, step 3 further comprises: when serial ports has data transmission within second schedule time, return to step 1.
Wherein, synchrodata comprises IP, time.
Wherein, monitoring further comprises: whether the processor that checks operational management module works within the 3rd schedule time, when processor is not worked within the 3rd schedule time, determines and breaks down.
In addition, the present invention also provides a kind of two-node cluster hot backup device, comprising: synchronization module, for operational management module being synchronizeed with the synchrodata of standby management module by Ethernet interface; Monitoring module, for monitoring the fault of operational management module; And alternative module, for making standby management module substitute the work of operational management module according to synchrodata when breaking down, wherein, monitoring module comprises: first checks submodule, for checking whether the Ethernet interface between operational management module and standby management module has data transmission within first schedule time; Second checks submodule, for checking when there is no data transmission whether the serial ports between operational management module and standby management module has data transmission within second schedule time; And fault determines submodule, for determining that when there is no data transmission operational management module breaks down.
Wherein, synchrodata comprises IP, time.
Wherein, monitoring module further comprises: the 3rd checks submodule, for checking whether the processor of operational management module works within the 3rd schedule time, when processor is not worked within the 3rd schedule time, determines and breaks down.
Other features and advantages of the present invention will be set forth in the following description, and, partly from instructions, become apparent, or understand by implementing the present invention.Object of the present invention and other advantages can be realized and be obtained by specifically noted structure in the instructions write, claims and accompanying drawing.
Accompanying drawing explanation
Accompanying drawing described herein is used to provide a further understanding of the present invention, forms the application's a part, and schematic description and description of the present invention is used for explaining the present invention, does not form inappropriate limitation of the present invention.In the accompanying drawings:
Fig. 1 shows according to the process flow diagram of the double hot standby method of exemplary embodiment of the present invention;
Fig. 2 shows according to the process flow diagram of monitoring step in the double hot standby method of exemplary embodiment of the present invention;
Fig. 3 shows according to the block diagram of the two-node cluster hot backup device of exemplary embodiment of the present invention; And
Fig. 4 shows according to the process flow diagram of monitoring module in the two-node cluster hot backup device of exemplary embodiment of the present invention.
Embodiment
Below in conjunction with accompanying drawing, describe embodiments of the invention in detail.
Fig. 1 shows according to the process flow diagram of the double hot standby method of exemplary embodiment of the present invention.As shown in Figure 1, double hot standby method can comprise: S101, by Ethernet interface, operational management module is synchronizeed with the synchrodata of standby management module; S103, the fault of monitoring operational management module; S105, when breaking down, standby management module substitutes the work of operational management module according to synchrodata.Wherein, step S103 can have a plurality of sub-steps.
Fig. 2 shows according to the process flow diagram of monitoring step in the double hot standby method of exemplary embodiment of the present invention.As shown in Figure 2, monitoring step S103 comprises: S1031, checks whether the Ethernet interface between operational management module and standby management module has data transmission; S1033, when Ethernet interface does not have data transmission within first schedule time, checks whether the serial ports between operational management module and standby management module has data transmission; S1035, when serial ports does not have data transmission within second schedule time, determines that operational management module breaks down.
Wherein, step S1033 further comprises: when Ethernet interface has data transmission within first schedule time, return to step S1031.Step S1035 further comprises: when serial ports has data transmission within second schedule time, return to step S1031.
And monitoring step S103 further comprises: whether the processor that checks operational management module works within the 3rd schedule time, when processor is not worked within the 3rd schedule time, determine and break down.
Said method has following function:
1), by Ethernet interface, realize the synchronous of the data such as IP, time between two administration modules.
2) two administration modules are carried out to mutual double-pipe type module status through Ethernet and serial ports and monitor, while being out of order, correctly process.
3), when operational management module is out of order, standby management module can be promoted as operational management module smoothly according to the information of preserving, and guarantees system stable operation
The situation that hardware activates for two administration modules simultaneously also has mechanism and controls.That is,, when there is the situation that two administration modules all activate, can take at least one in following mechanism to control:
1) stop the work of all administration modules, and restart;
2) the active and standby priority that basis sets in advance keeps main management module wherein, and stops the work of another administration module;
3) performance when the first two administration module is marked, and be operational management module according to an administration module of predetermined policy selection, and the work that stops another administration module (for example, the performance of the first administration module is higher than the performance of the second administration module, and the first administration module is operational management module).
In addition, Fig. 3 shows according to the block diagram of the two-node cluster hot backup device of exemplary embodiment of the present invention.As shown in Figure 3, two-node cluster hot backup device can comprise: synchronization module 301, for operational management module being synchronizeed with the synchrodata of standby management module by Ethernet interface; Monitoring module 303, for monitoring the fault of operational management module; And alternative module 305, for making standby management module substitute the work of operational management module according to synchrodata when breaking down.
And Fig. 4 shows according to the process flow diagram of monitoring module in the two-node cluster hot backup device of exemplary embodiment of the present invention.As shown in Figure 4, monitoring module 303 comprises: first checks submodule 3031, for checking whether the Ethernet interface between operational management module and standby management module has data transmission within first schedule time; Second checks submodule 3033, for checking when there is no data transmission whether the serial ports between operational management module and standby management module has data transmission within second schedule time; And fault determines submodule 3035, for determining that when there is no data transmission operational management module breaks down.
In addition, monitoring module 303 can also comprise: the 3rd checks submodule (not shown), for checking whether the processor of operational management module works within the 3rd schedule time, when processor is not worked within the 3rd schedule time, determines and breaks down.
In the present invention, first schedule time can be 60 seconds, and second schedule time can be 5 seconds, and the 3rd schedule time can be to be less than 1 second.Certainly, according to design requirement, this three schedule times can be not equate or not etc.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various modifications and variations.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.