CN107395387A

CN107395387A - Method, device and system for dual-host service recovery

Info

Publication number: CN107395387A
Application number: CN201610332931.0A
Authority: CN
Inventors: 史骏
Original assignee: ZTE Corp
Current assignee: Shenzhen ZTE Technical Service Co.,Ltd.
Priority date: 2016-05-17
Filing date: 2016-05-17
Publication date: 2017-11-24

Abstract

The present invention relates to a method, device and system for dual-host service recovery. The method includes: monitoring the connection state of a storage device in real time; if the connection state is found to be abnormal, stopping the services on the primary and standby hosts and continuing to monitor the connection state of the storage device; if the connection is found to have been restored, starting the cluster service corresponding to the primary host; and monitoring the resource start state of the primary host and, if the resource start state is normal, starting the cluster service corresponding to the standby host. When the connection to the storage device becomes abnormal and the services on the primary and standby hosts have been stopped, the cluster services of the primary and standby hosts can be started automatically, according to the monitoring results, once the storage connection is restored, which ensures service continuity. In addition, monitoring the resource start state of the primary host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

Description

Method, device and system for dual-host service recovery

Technical field

The present invention relates to the field of communication technology, and in particular to a method, device and system for dual-host service recovery.

Background art

In the field of high-availability dual-host systems, the continuity of service resources and the security of data are the functions of greatest concern, and they are especially important in the communications industry.

At present, however, high-availability dual-host systems provide only simple monitoring and management. In storage resource management, for example, they merely check whether the storage is read-only, or merely apply timeout handling to the storage connection, and are not intelligent. That is, once the storage network is disconnected, the dual-host management software generally concludes that the storage has a fault, and the dual-host services are paralysed entirely. Even if the network recovers, the dual-host management software does not restore the services automatically; it can only raise an alarm or throw an exception and wait for manual intervention, so service continuity is poor.

Summary of the invention

In view of the above technical problem, it is necessary to provide a method, device and system for dual-host service recovery that can automatically restore the cluster services after the storage network fails and then recovers, thereby ensuring service continuity.

A method for dual-host service recovery, the method including:

monitoring the connection state of a storage device in real time;

if the connection state is found to be abnormal, stopping the services on the primary and standby hosts and continuing to monitor the connection state of the storage device;

if the connection is found to have been restored, starting the cluster service corresponding to the primary host; and

monitoring the resource start state of the primary host and, if the resource start state is normal, starting the cluster service corresponding to the standby host.

A device for dual-host service recovery, the device including:

a monitoring module, configured to monitor the connection state of a storage device in real time;

a stopping module, configured to stop the services on the primary and standby hosts if the connection state is found to be abnormal, and to continue monitoring the connection state of the storage device;

a primary host starting module, configured to start the cluster service corresponding to the primary host if the connection is found to have been restored; and

a standby host starting module, configured to monitor the resource start state of the primary host and, if the resource start state is normal, to start the cluster service corresponding to the standby host.

With the above method and device for dual-host service recovery, the connection state of the storage device is monitored in real time. If the connection state is found to be abnormal, the services on the primary and standby hosts are stopped and monitoring of the storage connection continues. If the connection is found to have been restored, the cluster service corresponding to the primary host is started and the resource start state of the primary host is monitored; if that state is normal, the cluster service corresponding to the standby host is started. Thus, when the storage connection becomes abnormal and the services on the primary and standby hosts have been stopped, the cluster services of both hosts can be started automatically, according to the monitoring results, once the storage connection is restored, which ensures service continuity. In addition, monitoring the resource start state of the primary host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

A system for dual-host service recovery, the system including a monitoring device and a storage device, wherein:

the monitoring device is configured to monitor the connection state of the storage device in real time and, if the connection state is found to be abnormal, to stop the services on the primary and standby hosts and continue monitoring the connection state of the storage device;

the monitoring device is further configured to start the cluster service corresponding to the primary host if the connection is found to have been restored; and

the monitoring device is further configured to monitor the resource start state of the primary host and, if the resource start state is normal, to start the cluster service corresponding to the standby host.

In the above system for dual-host service recovery, the monitoring device and the storage device work together. The monitoring device monitors the connection state of the storage device in real time. If the connection state is found to be abnormal, the services on the primary and standby hosts are stopped and monitoring of the storage connection continues. If the connection is found to have been restored, the cluster service corresponding to the primary host is started and the resource start state of the primary host is monitored; if that state is normal, the cluster service corresponding to the standby host is started. Thus, when the storage connection becomes abnormal and the services on the primary and standby hosts have been stopped, the cluster services of both hosts can be started automatically, according to the monitoring results, once the storage connection is restored, which ensures service continuity. In addition, monitoring the resource start state of the primary host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

Brief description of the drawings

Fig. 1 is a diagram of the application environment of the method for dual-host service recovery in one embodiment;

Fig. 2 is a flow chart of the method for dual-host service recovery in one embodiment;

Fig. 3 is a flow chart of starting the cluster service corresponding to the primary host according to the resource data check result in one embodiment;

Fig. 4 is a sequence diagram of the general interface between the monitoring device and a third-party device in one embodiment;

Fig. 5 is a sequence diagram of the method for dual-host service recovery in one embodiment;

Fig. 6 is a structural block diagram of the device for dual-host service recovery in one embodiment;

Fig. 7 is a structural block diagram of the device for dual-host service recovery in another embodiment;

Fig. 8 is a structural block diagram of the device for dual-host service recovery in a further embodiment;

Fig. 9 is a structural block diagram of the device for dual-host service recovery in yet another embodiment;

Fig. 10 is a structural block diagram of the system for dual-host service recovery in one embodiment;

Fig. 11 is a structural block diagram of the system for dual-host service recovery in a further embodiment;

Fig. 12 is a structural block diagram of the system for dual-host service recovery in another embodiment.

Detailed description of the embodiments

Fig. 1 shows the application environment in which the method for dual-host service recovery runs in one embodiment. As shown in Fig. 1, the application environment includes a monitoring device 110, a storage device 120, a cluster management server 130, a cluster primary host 140 and a cluster standby host 150. The monitoring device 110 may be deployed on any server. It may be configured and started directly, or configured and started with monitoring parameters received through a general interface; it can therefore perform monitoring independently or be controlled by a third party. It monitors the connection state of the storage device in real time and, according to the monitoring results, sends control instructions to the cluster management server 130; the control instructions are used to control the service state of the primary and standby hosts. The storage device 120 may be any device that provides storage over a connection, such as a server with a storage function or a disk array. Cluster software runs on the cluster management server 130 and manages the cluster primary host and the cluster standby host in a unified way, for example controlling the file systems of the primary and standby hosts, or controlling them to stop or start services. The cluster primary host 140 and the cluster standby host 150 together provide a highly available dual-host service.

In one embodiment, as shown in Fig. 2, a method for dual-host service recovery is provided. The method is applied to the monitoring device in the above application environment and includes the following steps:

Step S210: monitor the connection state of the storage device in real time.

Specifically, the connection state refers to the connectivity of network communication: a dropped link means the connection is abnormal, and a live link means the connection is normal. The storage device may be monitored at a fixed period, and the storage-state listening period can be customised through a configuration parameter so that monitoring is easy to control. The connection state can be monitored by performing a read/write check on the file system of the storage device or by sending test packets. A storage-state maximum abnormal count may also be set. If the number of abnormal monitoring results exceeds the storage-state maximum abnormal count, the connection is judged abnormal; otherwise the connection state is judged normal. Setting this count avoids misjudgment: when the storage connection appears abnormal, the check is retried several times, and if a retry succeeds, the connection only flapped briefly and the services are not affected.
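By way of illustration only, the tolerance described above can be sketched in Python as a counter of consecutive abnormal probe results. The class and attribute names are hypothetical and do not appear in this description. The sketch also assumes that the connection is judged abnormal once the count of consecutive abnormal results reaches the configured maximum, which matches the worked example of Fig. 5 (maximum of 4, services stopped after 4 consecutive abnormal results).

```python
from dataclasses import dataclass


@dataclass
class ConnectionJudge:
    """Turns raw probe results into a debounced connection verdict (hypothetical helper)."""

    max_abnormal: int              # storage-state maximum abnormal count
    consecutive_abnormal: int = 0  # abnormal results seen in a row so far

    def observe(self, probe_ok: bool) -> bool:
        """Record one probe result; return True while the connection counts as normal."""
        if probe_ok:
            # A successful retry means the fault was only a brief flap.
            self.consecutive_abnormal = 0
        else:
            self.consecutive_abnormal += 1
        # Assumption: the verdict flips once the count reaches the limit.
        return self.consecutive_abnormal < self.max_abnormal
```

A single successful probe resets the counter, so isolated failures between successful probes never accumulate to the limit.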

In one embodiment, step S210 includes: performing a read/write check on the file system of the storage device and determining the connection state of the storage device from the result of the read/write check; or sending a network test packet to the storage device and determining the connection state of the storage device from the response packet returned.

Specifically, a system command is used to check the read/write speed of the storage medium on the file system of the storage device, or a temporary file is created, in order to judge the read/write state of the storage medium. If the storage device is a disk array, the volumes allocated by the disk array appear on the server as file systems of the operating system. The disk read/write speed can be checked on those volumes, or a temporary file can be created on a disk-array volume to judge the disk read/write state. If the disk speed cannot be obtained or the disk cannot read or write files, the check result is that the connection state is abnormal; otherwise the check result is that the connection state is normal. Alternatively, a network test packet can be sent to the storage device by IP ping; if a response packet is received, the ping succeeds and the connection state of the storage device is normal. If a storage-state maximum abnormal count is set, the connection state can be judged from the number of consecutive abnormal check results and the number of consecutive abnormal ping responses.
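As a non-limiting sketch of the temporary-file variant of the read/write check, the following Python function writes a small payload to a temporary file on the monitored file system and reads it back. The function name and the `mount_point` parameter are assumptions made for illustration; the description does not prescribe a particular implementation or system command.

```python
import os
import tempfile


def storage_rw_ok(mount_point: str) -> bool:
    """Return True if a temporary file can be written to and read back from mount_point."""
    payload = b"storage-probe"
    try:
        # Create the temporary file on the monitored file system itself.
        with tempfile.NamedTemporaryFile(dir=mount_point) as probe:
            probe.write(payload)
            probe.flush()
            os.fsync(probe.fileno())  # push the write through to the device
            probe.seek(0)
            return probe.read() == payload
    except OSError:
        # Missing mount, read-only file system or I/O error: treat as abnormal.
        return False
```

A hung network mount may block instead of raising an error, so a practical check would likely run this probe under a timeout, in the spirit of the disk-state scan timeout parameter mentioned in this description.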

Step S220: if the connection state is found to be abnormal, stop the services on the primary and standby hosts, and continue monitoring the connection state of the storage device.

Specifically, if the connection state is found to be abnormal, the cluster management server is notified to stop the services, and the cluster management server controls the primary and standby hosts to stop their services. An alarm may be reported to indicate that the storage connection is abnormal. Monitoring of the storage connection state then continues. While the services on the primary and standby hosts are stopped, the monitoring parameters for the storage connection may differ from those used before, for example by using a different listening period: the storage abnormal-state scan period may be set to a value different from the storage-state listening period.

Step S230: if the connection is found to have been restored, start the cluster service corresponding to the primary host.

Specifically, a minimum normal count for the recovery state may be set, so that the connection is judged to be restored only when the number of normal connection results exceeds this minimum. The minimum normal count can be customised, for example set to 2, which further ensures that the restored connection is reliable. If the connection is restored, the cluster management server is notified to start the cluster service corresponding to the primary host.

Step S240: monitor the resource start state of the primary host and, if the resource start state is normal, start the cluster service corresponding to the standby host.

Specifically, an application-service resource listening period can be set for monitoring the resource start state of the primary host. The resource start state is monitored by using a process-check command to inspect the start state of the processes. When there are multiple processes, the resource start state is abnormal as long as the start state of any one process is abnormal. The start state of the processes reflects whether the primary host's cluster service has come up normally for the business. If the start state of a process is abnormal, the resources have a problem and the service of the primary host must be stopped; if the start state of the processes is normal, the resources are complete and usable, and the cluster service corresponding to the standby host can be started. When judging whether the resource start state is normal, an application-service resource listening maximum error count can be set: the resource start state is judged abnormal only when the number of consecutive abnormal results for the primary host exceeds this count, which reduces the probability of misjudgment.
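The rule that one abnormal process makes the whole resource start state abnormal, combined with the maximum error count, can be sketched as follows. This is an illustrative Python sketch only: the function and class names are hypothetical, the set of running process names is assumed to have been obtained elsewhere (for example from a process-check command), and the verdict is assumed to flip once the consecutive error count reaches the configured maximum.

```python
from collections.abc import Iterable, Set


def processes_started(required: Iterable[str], running: Set[str]) -> bool:
    """Resources count as started only if every monitored process is running."""
    return all(name in running for name in required)


class ResourceStartJudge:
    """Tolerates a limited number of consecutive failed checks (hypothetical helper)."""

    def __init__(self, max_errors: int) -> None:
        self.max_errors = max_errors  # application-service resource listening maximum error count
        self.consecutive_errors = 0

    def observe(self, required: Iterable[str], running: Set[str]) -> bool:
        """Record one check; return True while the start state still counts as normal."""
        if processes_started(required, running):
            self.consecutive_errors = 0
        else:
            self.consecutive_errors += 1
        # Assumption: the state is judged abnormal once the count reaches the limit.
        return self.consecutive_errors < self.max_errors
```

For example, with monitored processes `["db", "app"]` and only `db` running, `processes_started` returns `False` because a single missing process is enough to make the resource start state abnormal.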

In this embodiment, the connection state of the storage device is monitored in real time. If the connection state is found to be abnormal, the services on the primary and standby hosts are stopped and monitoring of the storage connection continues. If the connection is found to have been restored, the cluster service corresponding to the primary host is started and the resource start state of the primary host is monitored; if that state is normal, the cluster service corresponding to the standby host is started. Thus, when the storage connection becomes abnormal and the services on the primary and standby hosts have been stopped, the cluster services of both hosts can be started automatically, according to the monitoring results, once the storage connection is restored, which ensures service continuity. In addition, monitoring the resource start state of the primary host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

In one embodiment, before step S210, the method further includes: configuring monitoring parameters, the monitoring parameters including at least one of a storage-state listening period, a storage-state maximum abnormal count, a storage abnormal-state scan period, an application-service resource listening period, and an application-service resource listening maximum error count.

Specifically, freely configurable monitoring parameters allow the monitoring process to be customised and managed flexibly. The storage-state listening period controls the interval between successive checks of the storage device. The storage-state maximum abnormal count is the maximum tolerable number of abnormal results when the storage connection is found to be abnormal; within this count, the connection state of the storage device is still considered normal. The storage abnormal-state scan period is the listening period for the storage connection state while the services on the primary and standby hosts are stopped. The application-service resource listening period is the listening period for the resource start state of the primary host after the primary host has resumed service. The application-service resource listening maximum error count is the maximum tolerable number of abnormal results when the resource start state of the primary host is found to be abnormal; within this count, the resource start state of the primary host is still considered normal. Monitoring parameters can be freely added or removed and adjusted as needed. Further parameters may also be included. For example, a disk-state scan timeout defines the normal range of the time taken by the command that checks disk connectivity; if the command takes longer than the disk-state scan timeout, the connection is abnormal. A disk-state scan timeout count limits the acceptable number of disk-state scan timeouts; if this count is exceeded, the disk state is considered abnormal. A monitored server name identifies the server to be monitored when the storage device is deployed on a server. A monitored process name identifies the processes to be monitored.
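For illustration, the five core monitoring parameters might be held in a small immutable configuration object, sketched below in Python. The field names are hypothetical; the default values are those used in the worked example of Fig. 5 in this description. Ignoring unknown keys and rejecting non-positive values are assumptions of this sketch, not requirements stated in the description.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class MonitoringParams:
    """Monitoring parameters; defaults follow the Fig. 5 worked example."""

    storage_listen_period_s: int = 30         # storage-state listening period
    storage_max_abnormal: int = 4             # storage-state maximum abnormal count
    storage_abnormal_scan_period_s: int = 60  # storage abnormal-state scan period
    app_resource_listen_period_s: int = 40    # application-service resource listening period
    app_resource_max_errors: int = 5          # application-service resource listening maximum error count

    def __post_init__(self) -> None:
        # Assumption: every period and count must be a positive number.
        for f in fields(self):
            if getattr(self, f.name) <= 0:
                raise ValueError(f"{f.name} must be positive")

    @classmethod
    def from_mapping(cls, raw: dict) -> "MonitoringParams":
        """Build parameters from a mapping, ignoring keys this sketch does not know."""
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in raw.items() if k in known})
```

A caller could then write `MonitoringParams.from_mapping({"storage_max_abnormal": 3})` to override one value and keep the remaining defaults.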

In one embodiment, as shown in Fig. 3, before the step of starting the cluster service corresponding to the primary host in step S230, the method further includes:

Step S310: obtain the resource data corresponding to the primary host, and perform an integrity check on the resource data to obtain a first check result.

Specifically, the resource data corresponding to the primary host may be files stored on the primary host and the like. The integrity-check algorithm can be chosen as needed, for example an MD5 checksum, a cyclic redundancy check (CRC) or a parity check. The chosen algorithm is applied to the resource data to obtain the first check result, for example by computing the MD5 value of the resource data.

Step S320: obtain the standard integrity-check result of the resource data from the last time the storage device was in the normal connection state, and compare the first check result with the standard integrity-check result. If they are identical, proceed to the step of starting the cluster service corresponding to the primary host; otherwise, issue a data-abnormal warning.

Specifically, while the connection state of the storage device is being monitored, each time a listening period elapses the integrity-check result of the resource data is calculated and saved to a file, which is overwritten periodically. The standard integrity-check result of the resource data from the last time the storage device was in the normal connection state can therefore be obtained by reading this file. If the first check result is identical to the standard integrity-check result, the data are intact and the subsequent steps can proceed. If they differ, the data are damaged and a data-abnormal warning is issued.

In this embodiment, an integrity check is performed on the data before the step of starting the cluster service corresponding to the primary host, so that whether the data are intact can be determined in advance. If the data are damaged, a data-abnormal warning can be issued promptly, which improves response speed and accuracy.
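Steps S310 and S320 can be illustrated with the following Python sketch, which uses MD5 (one of the algorithms named above) via the standard `hashlib` module. The function names and the baseline-file layout are hypothetical. MD5 is used here only to detect accidental damage, not as a security measure.

```python
import hashlib
from pathlib import Path


def digest_of(paths) -> str:
    """MD5 over the contents of the given files, taken in sorted path order."""
    md5 = hashlib.md5()
    for path in sorted(Path(p) for p in paths):
        md5.update(path.read_bytes())
    return md5.hexdigest()


def save_baseline(paths, baseline_file) -> None:
    """Overwrite the stored standard result; called while the storage connection is normal."""
    Path(baseline_file).write_text(digest_of(paths))


def data_intact(paths, baseline_file) -> bool:
    """Compare the first check result with the stored standard result (step S320)."""
    return digest_of(paths) == Path(baseline_file).read_text().strip()
```

In this sketch, `save_baseline` would be called once per listening period while the connection is normal, and `data_intact` would be called after the connection is restored, before the cluster service of the primary host is started.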

In one embodiment, the method further includes: receiving, through a general interface, monitoring parameters issued by a third-party device, and reporting monitoring information to the third-party device.

Specifically, a general configuration interface is provided so that monitoring can be fully customised for a variety of business scenarios. The third-party device issues monitoring parameters according to its monitoring needs, and once monitoring information is obtained, such as the connection state of the storage device or the resource start state of the primary host, it is reported to the third-party device through the general interface. Fig. 4 is a sequence diagram of the general interface between the monitoring device and the third-party device, which includes the following steps:

S1: The third-party device passes monitoring parameters to the monitoring device through the general interface;

S2: The monitoring device updates its configuration parameters to the newly passed-in values;

S3: The monitoring device notifies the third-party device through the general interface that configuration is complete;

S4: The third-party device passes a start command to the monitoring device through the general interface;

S5: The monitoring device starts monitoring the cluster system according to the start command;

S6: The cluster system feeds back the relevant monitoring information to the monitoring device;

S7: The monitoring device notifies the third-party device through the general interface that start-up is complete, and a link is established;

S8: The third-party device sends a heartbeat message;

S9: The third-party device receives a heartbeat message;

S10: Heartbeat processing is carried out between the monitoring device and the third-party device;

S11: Through its monitoring of the disk array, the monitoring device receives a notification that the disk-array connection is abnormal and handles it accordingly;

S12: The monitoring device notifies the third-party device through the general interface that the disk-array connection is abnormal;

S13-14: The third-party device receives the abnormality notification through the general interface, and monitoring continues;

S15: The monitoring device detects that the disk-array connection has been restored and carries out the corresponding handling with the cluster system;

S16: The monitoring device notifies the third-party device of the disk-array recovery result through the general interface.
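The exchange in steps S1 to S16 can be pictured, purely as an illustration, as a small message handler on the monitoring-device side. The description does not define a wire format, so the message type names, the dictionary layout and the class name in the following Python sketch are all hypothetical.

```python
class MonitoringEndpoint:
    """Monitoring-device side of the general interface (message names are hypothetical)."""

    def __init__(self) -> None:
        self.params: dict = {}
        self.running = False

    def handle(self, message: dict) -> dict:
        """Process one request from the third-party device and return the reply."""
        kind = message.get("type")
        if kind == "configure":   # S1 to S3: accept parameters, confirm configuration
            self.params.update(message["params"])
            return {"type": "configured"}
        if kind == "start":       # S4 to S7: start monitoring, confirm start-up
            self.running = True
            return {"type": "started"}
        if kind == "heartbeat":   # S8 to S10: heartbeat handling
            return {"type": "heartbeat_ack", "running": self.running}
        return {"type": "error", "reason": f"unknown message type: {kind!r}"}

    def storage_event(self, connected: bool) -> dict:
        """Build the notification sent on a storage state change (S12 and S16)."""
        return {"type": "storage_restored" if connected else "storage_abnormal"}
```

Keeping the interface to a handful of message types is one way to let different third-party devices drive the same monitoring device; a real deployment would also need transport, authentication and error handling, which this sketch omits.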

The implementation of the method for dual-host service recovery is described in further detail below with reference to the accompanying drawings. As shown in Fig. 5, it includes the following steps:

A: The monitoring device receives monitoring parameters through the general interface: storage-state listening period 30 seconds, storage-state maximum abnormal count 4, storage abnormal-state scan period 60 seconds, application-service resource listening period 40 seconds, and application-service resource listening maximum error count 5. The monitoring device is started once configuration is complete.

B: The monitoring device continuously monitors the connection state of the storage device, checking once every 30 seconds.

C: After 4 consecutive monitoring results all show an abnormal connection state, the cluster software on the cluster management server is stopped, and the cluster management server is notified to stop the services of the primary and standby hosts.

D: The cluster management server controls the primary and standby hosts so that they stop service.

E: The monitoring device continues to monitor the connection state of the storage device, now once every 60 seconds. After 2 consecutive results show that the connection state has returned to normal, an integrity check is performed on the resource data corresponding to the primary host to obtain a check result. If the check result shows that the data are intact, proceed to F: notify the cluster management server to start the cluster service corresponding to the primary host. Otherwise, proceed to G: issue a data-abnormal warning.

H: The monitoring device monitors the resource start state of the primary host. If the resource start state is normal, proceed to I: notify the cluster management server to start the cluster service corresponding to the standby host, after which the cluster management server notifies the monitoring device that the cluster has recovered. If the resource start state is abnormal, proceed to J: send a cluster start failure notification, notify the cluster management server to stop the service of the whole cluster, and issue a data-damage warning.
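Steps A to J can be summarised, as a sketch under stated assumptions, by the small state machine below. The state names, action strings and class name are hypothetical; the actions stand in for the notifications sent to the cluster management server. The thresholds default to the values of the worked example (4 consecutive abnormal results, 2 consecutive normal results, 5 resource errors). Treating the data-abnormal warning of step G as a terminal state that waits for an operator is an assumption of this sketch, since the description does not say what happens afterwards.

```python
from enum import Enum


class State(Enum):
    RUNNING = "running"                    # primary and standby both in service
    STOPPED = "stopped"                    # services stopped, waiting for storage
    PRIMARY_STARTING = "primary_starting"  # primary started, resources being checked
    FAILED = "failed"                      # assumption: wait for manual intervention


class RecoveryMonitor:
    def __init__(self, max_abnormal=4, min_normal=2, max_resource_errors=5):
        self.max_abnormal = max_abnormal
        self.min_normal = min_normal
        self.max_resource_errors = max_resource_errors
        self.state = State.RUNNING
        self.actions = []   # commands that would go to the cluster management server
        self._streak = 0    # consecutive results of the kind currently being counted

    def _go(self, state, action):
        self.actions.append(action)
        self.state = state
        self._streak = 0

    def on_storage_probe(self, ok, data_intact=True):
        """Feed one storage probe result (steps B to G)."""
        if self.state is State.RUNNING:
            self._streak = 0 if ok else self._streak + 1
            if self._streak >= self.max_abnormal:                      # step C
                self._go(State.STOPPED, "stop_primary_and_standby")    # step D
        elif self.state is State.STOPPED:
            self._streak = self._streak + 1 if ok else 0
            if self._streak >= self.min_normal:                        # step E
                if data_intact:
                    self._go(State.PRIMARY_STARTING, "start_primary")  # step F
                else:
                    self._go(State.FAILED, "data_abnormal_warning")    # step G

    def on_resource_check(self, ok):
        """Feed one resource start-state check of the primary host (steps H to J)."""
        if self.state is not State.PRIMARY_STARTING:
            return
        if ok:
            self._go(State.RUNNING, "start_standby")                   # step I
            return
        self._streak += 1
        if self._streak >= self.max_resource_errors:
            self._go(State.FAILED, "stop_cluster")                     # step J
```

Feeding four failed probes, two successful probes and one successful resource check drives the monitor from `RUNNING` through `STOPPED` and `PRIMARY_STARTING` back to `RUNNING`, recording the stop, primary start and standby start actions in that order.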

In one embodiment, as shown in Fig. 6, a device for dual-host service recovery is provided, including:

a monitoring module 410, configured to monitor the connection state of the storage device in real time;

a stopping module 420, configured to stop the services on the primary and standby hosts if the connection state is found to be abnormal, and to continue monitoring the connection state of the storage device;

a primary host starting module 430, configured to start the cluster service corresponding to the primary host if the connection is found to have been restored; and

a standby host starting module 440, configured to monitor the resource start state of the primary host and, if the resource start state is normal, to start the cluster service corresponding to the standby host.

In one embodiment, as shown in Fig. 7, the device further includes:

a configuration module 450, configured to configure monitoring parameters, the monitoring parameters including at least one of a storage-state listening period, a storage-state maximum abnormal count, a storage abnormal-state scan period, an application-service resource listening period, and an application-service resource listening maximum error count.

In one embodiment, as shown in Fig. 8, the device further includes:

a check module 460, configured to obtain the resource data corresponding to the primary host, perform an integrity check on the resource data to obtain a first check result, obtain the standard integrity-check result of the resource data from the last time the storage device was in the normal connection state, and compare the first check result with the standard integrity-check result; if they are identical, control passes to the primary host starting module 430 to start the cluster service corresponding to the primary host; otherwise, a data-abnormal warning is issued.

In one embodiment, as shown in Fig. 9, the device further includes:

an interface module 470, configured to receive, through a general interface, monitoring parameters issued by a third-party device and to report monitoring information to the third-party device.

In one embodiment, as shown in Fig. 10, a system for dual-host service recovery is provided, including a monitoring device 510 and a storage device 520. The monitoring device 510 is configured to monitor the connection state of the storage device 520 in real time and, if the connection state is found to be abnormal, to stop the services on the primary and standby hosts and continue monitoring the connection state of the storage device. The monitoring device 510 is further configured to start the cluster service corresponding to the primary host if the connection is found to have been restored. The monitoring device 510 is further configured to monitor the resource start state of the primary host and, if the resource start state is normal, to start the cluster service corresponding to the standby host.

In this embodiment, the monitoring device and the storage device work together. The monitoring device monitors the connection state of the storage device in real time. If the connection state is found to be abnormal, the services on the primary and standby hosts are stopped and monitoring of the storage connection continues. If the connection is found to have been restored, the cluster service corresponding to the primary host is started and the resource start state of the primary host is monitored; if that state is normal, the cluster service corresponding to the standby host is started. Thus, when the storage connection becomes abnormal and the services on the primary and standby hosts have been stopped, the cluster services of both hosts can be started automatically, according to the monitoring results, once the storage connection is restored, which ensures service continuity. In addition, monitoring the resource start state of the primary host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

In one embodiment, as shown in Fig. 11, the system further includes:

a third-party device 530, configured to issue monitoring parameters to the monitoring device through a general interface, and further configured to receive the monitoring information reported by the monitoring device.

Specifically, through the general configuration interface the third-party device issues monitoring parameters according to its monitoring needs, so that monitoring can be fully customised for a variety of business scenarios. After the monitoring device obtains monitoring information, such as the connection state of the storage device or the resource start state of the primary host, the third-party device receives the monitoring information reported by the monitoring device through the general interface.

In one embodiment, as shown in Fig. 12, the system further includes:

a cluster management server 540, configured to receive the control instructions sent by the monitoring device and to control the primary host and the standby host according to the control instructions.

Specifically, after receiving the control instructions sent by the monitoring device, the cluster management server controls and manages the primary host and the standby host in a unified way, which is more convenient and orderly.

One of ordinary skill in the art will appreciate that realize all or part of flow in above-described embodiment method, being can be with The hardware of correlation is instructed to complete by computer program, described program can be stored in a computer read/write memory medium In, in the embodiment of the present invention, the program can be stored in the storage medium of computer system, and by the computer system At least one computing device, to realize the flow for including the embodiment such as above-mentioned each method.Wherein, the storage medium can be Magnetic disc, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..

Each technical characteristic of embodiment described above can be combined arbitrarily, to make description succinct, not to above-mentioned reality Apply all possible combination of each technical characteristic in example to be all described, as long as however, the combination of these technical characteristics is not deposited In contradiction, the scope that this specification is recorded all is considered to be.

The embodiments described above express only several implementations of the present invention, and their description is relatively specific and detailed, but they should not therefore be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept, and these all fall within the protection scope of the present invention. Therefore, the protection scope of this patent shall be determined by the appended claims.

Claims

1. A method for dual-machine service recovery, the method comprising:

monitoring the connection status of a storage device in real time;

if the connection status is detected to be abnormal, stopping the standby host service and continuing to monitor the connection status of the storage device;

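if the connection status is detected to be recovered, starting the cluster service corresponding to the main host;

monitoring the resource startup state of the main host, and if the resource startup state is normal, starting the cluster service corresponding to the standby host.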

2. The method according to claim 1, characterized in that the step of monitoring the connection status of the storage device in real time comprises:

performing a read-write check on the file system of the storage device, and determining the connection status of the storage device according to the result of the read-write check;

or sending a network test data packet to the storage device, and determining the connection status of the storage device according to the returned response data packet.

3. The method according to claim 1, characterized in that, before the step of monitoring the connection status of the storage device in real time, the method further comprises:

configuring monitoring parameters, the monitoring parameters including at least one of: a storage state monitoring period, a maximum number of storage state abnormalities, a storage abnormal state scan period, an application service resource monitoring period, and a maximum number of application service resource monitoring errors.

4. The method according to claim 1, characterized in that, before the step of starting the cluster service corresponding to the main host, the method further comprises:

obtaining resource data corresponding to the main host;

performing an integrity check on the resource data to obtain a first check result;

obtaining the standard integrity check result corresponding to the resource data from the last time the storage device was in the normal connection state;

comparing the first check result with the standard integrity check result; if they are identical, proceeding to the step of starting the cluster service corresponding to the main host; otherwise, issuing a data abnormality warning.

5. The method according to claim 3, characterized in that the method further comprises:

receiving, through a general interface, the monitoring parameters issued by a third-party device, and reporting monitoring information to the third-party device.

6. A device for dual-machine service recovery, characterized in that the device comprises:

a monitoring module, configured to monitor the connection status of a storage device in real time;

a stopping module, configured to stop the standby host service if the connection status is detected to be abnormal, and to continue monitoring the connection status of the storage device;

a main host starting module, configured to start the cluster service corresponding to the main host if the connection status is detected to be recovered;

a standby host starting module, configured to monitor the resource startup state of the main host and, if the resource startup state is normal, to start the cluster service corresponding to the standby host.

7. The device according to claim 6, characterized in that the device further comprises:

a configuration module, configured to configure monitoring parameters, the monitoring parameters including at least one of: a storage state monitoring period, a maximum number of storage state abnormalities, a storage abnormal state scan period, an application service resource monitoring period, and a maximum number of application service resource monitoring errors.

8. The device according to claim 6, characterized in that the device further comprises:

a check module, configured to obtain resource data corresponding to the main host, perform an integrity check on the resource data to obtain a first check result, obtain the standard integrity check result corresponding to the resource data from the last time the storage device was in the normal connection state, and compare the first check result with the standard integrity check result; if they are identical, the main host starting module is entered to start the cluster service corresponding to the main host; otherwise, a data abnormality warning is issued.

9. The device according to claim 7, characterized in that the device further comprises:

an interface module, configured to receive, through a general interface, the monitoring parameters issued by a third-party device, and to report monitoring information to the third-party device.

10. A system for dual-machine service recovery, characterized in that the system comprises a monitoring device and a storage device;

the monitoring device is configured to monitor the connection status of the storage device in real time and, if the connection status is detected to be abnormal, to stop the standby host service and continue monitoring the connection status of the storage device;

the monitoring device is further configured to start the cluster service corresponding to the main host if the connection status is detected to be recovered;

the monitoring device is further configured to monitor the resource startup state of the main host and, if the resource startup state is normal, to start the cluster service corresponding to the standby host.

11. The system according to claim 10, characterized in that the system further comprises:

a third-party device, configured to issue monitoring parameters to the monitoring device through a general interface;

the third-party device is further configured to receive the monitoring information reported by the monitoring device.

12. The system according to claim 10, characterized in that the system further comprises:

a cluster management server, configured to receive control instructions sent by the monitoring device and to control the main host and the standby host according to the control instructions.
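
For illustration only, the read-write check of claim 2 and the integrity comparison of claim 4 might be sketched as follows. The function names are hypothetical, SHA-256 is an assumed choice of integrity check (the text does not name an algorithm), and the read-back shown here may be served from the operating system cache, so a production check would need a stronger probe.

```python
import hashlib
import os
import tempfile


def storage_read_write_ok(mount_dir):
    """Write a probe file under mount_dir and read it back.

    Any OS-level error (missing mount, I/O failure) counts as abnormal.
    """
    probe = os.urandom(16)
    try:
        fd, path = tempfile.mkstemp(dir=mount_dir)
        try:
            with os.fdopen(fd, "wb") as f:
                f.write(probe)
                f.flush()
                os.fsync(f.fileno())
            with open(path, "rb") as f:
                return f.read() == probe
        finally:
            os.remove(path)  # leave no probe file behind
    except OSError:
        return False


def digest(resource_data: bytes) -> str:
    """Integrity check result for the resource data (SHA-256 is an assumption)."""
    return hashlib.sha256(resource_data).hexdigest()


def may_start_main_host(resource_data: bytes, standard_digest: str) -> bool:
    """True if the first check result matches the stored standard result."""
    return digest(resource_data) == standard_digest
```

In this sketch, `standard_digest` would be recorded while the storage device was last in the normal connection state; a mismatch corresponds to the case in which a data abnormality warning is issued instead of starting the main host's cluster service.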