CN107395387A - Method, device and system for dual-machine service recovery - Google Patents

Method, device and system for dual-machine service recovery

Info

Publication number
CN107395387A
Authority
CN
China
Prior art keywords
connection status
monitoring
storage device
host
resource
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610332931.0A
Other languages
Chinese (zh)
Inventor
史骏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen ZTE Technical Service Co.,Ltd.
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CN201610332931.0A
Publication of CN107395387A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0654 Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663 Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00 Arrangements for detecting or preventing errors in the information received
    • H04L1/22 Arrangements for detecting or preventing errors in the information received using redundant apparatus to increase reliability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/10 Active monitoring, e.g. heartbeat, ping or trace-route

Abstract

The invention relates to a method, device and system for dual-machine service recovery. The method comprises: monitoring the connection status of a storage device in real time; if the connection status is found to be abnormal, stopping the services on the host and the standby machine and continuing to monitor the connection status of the storage device; if the connection is found to have recovered, starting the cluster service corresponding to the host; and monitoring the resource start state of the host and, if the resource start state is normal, starting the cluster service corresponding to the standby machine. When the connection to the storage device becomes abnormal and the services on the host and the standby machine have been stopped, the cluster services of both machines can be started automatically according to the monitoring result once the connection recovers, which ensures service continuity. In addition, monitoring the resource start state of the host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.

Description

Method, device and system for dual-machine service recovery
Technical field
The present invention relates to the field of communication technology, and in particular to a method, device and system for dual-machine service recovery.
Background
In the field of high-availability dual-machine systems, the continuity of service resources and the security of data are the functions of greatest concern, and they are especially important in the communications industry.
At present, however, high-availability dual-machine systems offer only simple monitoring and management. In storage resource management, for example, they only check whether the storage has become read-only, or only apply timeout handling to the storage connection, and they are not intelligent. That is, once the storage network is disconnected, the dual-machine management software generally concludes that the storage has failed, which paralyzes all dual-machine services. Even when the network recovers, the software does not restore the services automatically; it can only raise an alarm or throw an exception and wait for manual intervention, so service continuity is poor.
Summary of the invention
In view of the above technical problem, it is necessary to provide a method, device and system for dual-machine service recovery that can automatically restore the cluster services after the storage network fails and then recovers, ensuring service continuity.
A method for dual-machine service recovery, the method comprising:
monitoring the connection status of a storage device in real time;
if the connection status is found to be abnormal, stopping the services on the host and the standby machine, and continuing to monitor the connection status of the storage device;
if the connection is found to have recovered, starting the cluster service corresponding to the host; and
monitoring the resource start state of the host and, if the resource start state is normal, starting the cluster service corresponding to the standby machine.
A device for dual-machine service recovery, the device comprising:
a monitoring module, configured to monitor the connection status of a storage device in real time;
a stopping module, configured to stop the services on the host and the standby machine if the connection status is found to be abnormal, and to continue monitoring the connection status of the storage device;
a host start module, configured to start the cluster service corresponding to the host if the connection is found to have recovered; and
a standby start module, configured to monitor the resource start state of the host and, if the resource start state is normal, to start the cluster service corresponding to the standby machine.
With the above method and device, the connection status of the storage device is monitored in real time. If the connection status is found to be abnormal, the services on the host and the standby machine are stopped and monitoring of the storage device continues. If the connection is found to have recovered, the cluster service corresponding to the host is started and the resource start state of the host is monitored; if the resource start state is normal, the cluster service corresponding to the standby machine is started. Thus, after the services on both machines have been stopped because of an abnormal storage connection, the cluster services of both machines can be started automatically according to the monitoring result once the connection recovers, which ensures service continuity. In addition, monitoring the resource start state of the host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.
A system for dual-machine service recovery, the system comprising a monitoring device and a storage device, wherein:
the monitoring device is configured to monitor the connection status of the storage device in real time and, if the connection status is found to be abnormal, to stop the services on the host and the standby machine and continue monitoring the connection status of the storage device;
the monitoring device is further configured to start the cluster service corresponding to the host if the connection is found to have recovered; and
the monitoring device is further configured to monitor the resource start state of the host and, if the resource start state is normal, to start the cluster service corresponding to the standby machine.
In the above system, the monitoring device and the storage device cooperate: the monitoring device monitors the connection status of the storage device in real time. If the connection status is found to be abnormal, the services on the host and the standby machine are stopped and monitoring of the storage device continues. If the connection is found to have recovered, the cluster service corresponding to the host is started and the resource start state of the host is monitored; if the resource start state is normal, the cluster service corresponding to the standby machine is started. Thus, after the services on both machines have been stopped because of an abnormal storage connection, the cluster services of both machines can be started automatically according to the monitoring result once the connection recovers, which ensures service continuity. In addition, monitoring the resource start state of the host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.
Brief description of the drawings
Fig. 1 is a diagram of the application environment of the dual-machine service recovery method in one embodiment;
Fig. 2 is a flow chart of the dual-machine service recovery method in one embodiment;
Fig. 3 is a flow chart of starting the cluster service corresponding to the host according to the resource data check result in one embodiment;
Fig. 4 is a sequence diagram of the general-purpose interface between the monitoring device and a third-party device in one embodiment;
Fig. 5 is a sequence diagram of the dual-machine service recovery method in one embodiment;
Fig. 6 is a structural block diagram of the dual-machine service recovery device in one embodiment;
Fig. 7 is a structural block diagram of the dual-machine service recovery device in another embodiment;
Fig. 8 is a structural block diagram of the dual-machine service recovery device in a further embodiment;
Fig. 9 is a structural block diagram of the dual-machine service recovery device in yet another embodiment;
Fig. 10 is a structural block diagram of the dual-machine service recovery system in one embodiment;
Fig. 11 is a structural block diagram of the dual-machine service recovery system in a further embodiment;
Fig. 12 is a structural block diagram of the dual-machine service recovery system in another embodiment.
Detailed description
Fig. 1 shows the application environment in which the dual-machine service recovery method runs in one embodiment. As shown in Fig. 1, the application environment includes a monitoring device 110, a storage device 120, a cluster management server 130, a cluster host 140 and a cluster standby machine 150. The monitoring device 110 may be deployed on any server. It may be configured and started directly, or configured and started with monitoring parameters received through a general-purpose interface, so it can perform monitoring independently or be controlled by a third party. It monitors the connection status of the storage device in real time and, according to the monitoring result, sends control instructions to the cluster management server 130; the control instructions control the service state of the host and the standby machine. The storage device 120 may be any device that provides storage over a connection, such as a server with a storage function or a disk array. The cluster management server 130 runs cluster software that manages the cluster host and the cluster standby machine in a unified way, for example controlling their file systems and controlling them to stop or start services. The cluster host 140 and the cluster standby machine 150 together provide the high-availability dual-machine service.
In one embodiment, as shown in Fig. 2, a method for dual-machine service recovery is provided. The method is applied to the monitoring device in the above application environment and comprises the following steps:
Step S210: monitor the connection status of the storage device in real time.
Specifically, the connection status refers to the connectivity of network communication: a dropped link is an abnormal connection, and a working link is a normal connection. The storage device may be monitored at a fixed period, and the storage-state monitoring period can be customized through a configuration parameter so that monitoring is easy to control. The connection status can be monitored by performing a read/write check on the file system of the storage device or by sending test data packets. A storage-state maximum abnormal count may also be set: if the number of abnormal monitoring results exceeds this count, the connection is judged abnormal; otherwise the connection status is judged normal. Setting this count avoids misjudgment, because when the connection to the storage device appears abnormal the monitoring is retried several times, and a successful retry indicates that the connection only dropped momentarily without affecting the service.
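The retry tolerance described above can be illustrated with a minimal Python sketch. The class name `AbnormalCounter` and its methods are hypothetical and do not come from the patent. The sketch also assumes that the abnormal results must be consecutive and that reaching the threshold (rather than strictly exceeding it) counts as abnormal, which matches the worked example later in the description, where services are stopped after 4 consecutive abnormal results with the maximum set to 4.

```python
class AbnormalCounter:
    """Counts consecutive failed probes against a configured threshold.

    Hypothetical helper: the threshold plays the role of the
    "storage-state maximum abnormal count" parameter.
    """

    def __init__(self, max_abnormal: int) -> None:
        self.max_abnormal = max_abnormal
        self.consecutive_failures = 0

    def record(self, probe_ok: bool) -> bool:
        """Record one probe result.

        Returns True when the connection should be treated as abnormal,
        that is, when max_abnormal failures have occurred in a row.
        A successful probe resets the counter (a momentary drop).
        """
        if probe_ok:
            self.consecutive_failures = 0
        else:
            self.consecutive_failures += 1
        return self.consecutive_failures >= self.max_abnormal


if __name__ == "__main__":
    counter = AbnormalCounter(max_abnormal=4)
    for result in [False, False, True, False, False, False, False]:
        print(result, counter.record(result))
```

A single successful retry resets the counter, so short interruptions never trigger a service stop.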
In one embodiment, step S210 includes: performing a read/write check on the file system of the storage device and determining the connection status of the storage device from the result of the check, or sending a network test packet to the storage device and determining the connection status of the storage device from the response packet returned.
Specifically, system commands are used to check the read/write speed of the storage medium on the file system of the storage device, or a temporary file is created, to judge the read/write state of the storage medium. If the storage device is a disk array, the volumes allocated by the disk array that are present on the server (these volumes appear as file systems of the operating system) are checked for disk read/write speed, or a temporary file is created on a disk array volume to judge the disk read/write state. If the disk speed cannot be obtained or the disk cannot read or write files, the check result is that the connection status is abnormal; otherwise the connection status is normal. Alternatively, a network test packet can be sent to the storage device with an IP ping; if a response packet is received, the ping succeeds and the connection status of the storage device is normal. If a storage-state maximum abnormal count is configured, the connection status can be judged from the number of consecutive abnormal check results and the number of consecutive abnormal ping responses.
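The two probing approaches described above (a temporary-file read/write check and an IP ping) might look roughly like the following Python sketch. The function names, the mount point argument and the timeout value are illustrative assumptions, and the ping helper assumes a Linux-style `ping` command that accepts `-c` and `-W`; the patent does not specify the commands actually used.

```python
import os
import subprocess
import tempfile


def storage_writable(mount_point: str) -> bool:
    """Create a temporary file under mount_point, write it and read it back.

    Returns False if the file cannot be created, written or read,
    which the description treats as an abnormal connection status.
    """
    try:
        with tempfile.NamedTemporaryFile(dir=mount_point) as f:
            f.write(b"probe")
            f.flush()
            os.fsync(f.fileno())  # push the data to the storage volume
            f.seek(0)
            return f.read() == b"probe"
    except OSError:
        return False


def storage_pingable(ip: str, timeout_s: int = 2) -> bool:
    """Send one echo request with the system ping command (assumed Linux flags)."""
    try:
        result = subprocess.run(
            ["ping", "-c", "1", "-W", str(timeout_s), ip],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 1,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0
```

A read/write probe on a hung network mount can block, so a real monitor would probably run it with its own timeout, in the spirit of the disk-state scan timeout parameter mentioned elsewhere in the description.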
Step S220: if the connection status is found to be abnormal, stop the services on the host and the standby machine, and continue to monitor the connection status of the storage device.
Specifically, if the connection status is found to be abnormal, the cluster management server is notified to stop the services, and the cluster management server stops the services on the host and the standby machine. Alarm information may be reported to indicate that the connection to the storage device has become abnormal. Monitoring of the connection status of the storage device continues. The monitoring parameters used while the services are stopped may differ from those used before, for example a different monitoring period: the storage abnormal-state scan period may be set to a value different from the storage-state monitoring period.
Step S230: if the connection is found to have recovered, start the cluster service corresponding to the host.
Specifically, a recovery-state minimum normal count may be set, so that the connection is judged to have recovered only when the number of times the connection status is found normal exceeds this count. The count can be customized, for example set to 2, which further ensures that the recovery is reliable. If the connection has recovered, the cluster management server is notified to start the cluster service corresponding to the host.
Step S240: monitor the resource start state of the host and, if the resource start state is normal, start the cluster service corresponding to the standby machine.
Specifically, an application service resource monitoring period may be set for monitoring the resource start state of the host. The resource start state of the host is monitored by using a process-inspection command to check the start state of the processes. When there are multiple processes, the resource start state is abnormal as long as the start state of any one process is abnormal. The start state of the processes reflects whether the host cluster service started normally or abnormally for the service. If the start state of a process is abnormal, the resources have a problem and the service of the host must be stopped; if the start state of the processes is normal, the resources are fully available and the cluster service corresponding to the standby machine can be started. When judging whether the resource start state is normal, an application service resource monitoring maximum error count may be set: the resource start state is judged abnormal only when the number of consecutive abnormal results exceeds this count, which reduces the probability of misjudgment.
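The rule described above (every monitored process must be running, with tolerance for a bounded number of consecutive failed checks) can be sketched as follows. The function names are hypothetical, and the process lookup is passed in as a callable so that the sketch does not depend on any particular process-inspection command. Treating "reaching" the maximum error count as abnormal is an assumption.

```python
from typing import Callable, Sequence


def resources_started(process_names: Sequence[str],
                      is_running: Callable[[str], bool]) -> bool:
    """The start state is normal only if every monitored process is running."""
    return all(is_running(name) for name in process_names)


def judge_start_state(check_results: Sequence[bool], max_errors: int) -> str:
    """Judge a series of periodic checks.

    Returns "abnormal" once max_errors consecutive checks have failed,
    otherwise "normal".
    """
    consecutive = 0
    for ok in check_results:
        consecutive = 0 if ok else consecutive + 1
        if consecutive >= max_errors:
            return "abnormal"
    return "normal"


if __name__ == "__main__":
    # Hypothetical process table standing in for a process-inspection command.
    running = {"db": True, "app": False}
    print(resources_started(["db", "app"], lambda n: running.get(n, False)))
    print(judge_start_state([False] * 5, max_errors=5))
```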
In this embodiment, the connection status of the storage device is monitored in real time. If the connection status is found to be abnormal, the services on the host and the standby machine are stopped and monitoring of the storage device continues. If the connection is found to have recovered, the cluster service corresponding to the host is started and the resource start state of the host is monitored; if the resource start state is normal, the cluster service corresponding to the standby machine is started. Thus, after the services on both machines have been stopped because of an abnormal storage connection, the cluster services of both machines can be started automatically according to the monitoring result once the connection recovers, which ensures service continuity. In addition, monitoring the resource start state of the host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.
In one embodiment, before step S210, the method further includes: configuring monitoring parameters, where the monitoring parameters include at least one of a storage-state monitoring period, a storage-state maximum abnormal count, a storage abnormal-state scan period, an application service resource monitoring period, and an application service resource monitoring maximum error count.
Specifically, freely configurable monitoring parameters allow the monitoring process to be customized and managed flexibly. The storage-state monitoring period controls the interval between successive checks of the storage device. The storage-state maximum abnormal count is the maximum tolerable number of abnormal results when the connection status of the storage device is found to be abnormal; within this count, the connection status of the storage device is considered normal. The storage abnormal-state scan period is the period at which the connection status of the storage device is monitored while the services on the host and the standby machine are stopped. The application service resource monitoring period is the period at which the resource start state of the host is monitored after the host has resumed service. The application service resource monitoring maximum error count is the maximum tolerable number of abnormal results when the resource start state of the host is found to be abnormal; within this count, the resource start state of the host is considered normal. Monitoring parameters can be freely added or removed and adjusted as needed. Other parameters may also be included. For example, a disk-state scan timeout defines the normal range of the time taken by the command that checks disk connectivity; if the time taken exceeds the disk-state scan timeout, the connection is abnormal. A disk-state scan timeout count limits the acceptable number of disk-state scan timeouts; if this count is exceeded, the disk state is considered abnormal. A monitored server name identifies the monitored server when the storage device is deployed on a server. A monitored process name identifies the processes to be monitored.
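One possible way to group the five core monitoring parameters is a small configuration object, sketched below in Python. The class and field names are hypothetical; the default values are borrowed from the worked example in the description (30 s, 4, 60 s, 40 s, 5) and are not mandated by the patent.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class MonitorParams:
    """Hypothetical container for the monitoring parameters."""

    storage_check_period_s: int = 30          # storage-state monitoring period
    storage_max_abnormal: int = 4             # storage-state maximum abnormal count
    storage_abnormal_scan_period_s: int = 60  # scan period while services are stopped
    resource_check_period_s: int = 40         # application service resource monitoring period
    resource_max_errors: int = 5              # resource monitoring maximum error count

    def __post_init__(self) -> None:
        # Periods and counts only make sense as positive integers.
        for name, value in asdict(self).items():
            if value <= 0:
                raise ValueError(f"{name} must be positive, got {value}")


if __name__ == "__main__":
    print(MonitorParams())
    print(MonitorParams(storage_check_period_s=10))
```

Optional parameters such as the disk-state scan timeout or the monitored process names could be added as further fields in the same way.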
In one embodiment, as shown in Fig. 3, before the step of starting the cluster service corresponding to the host in step S230, the method further includes:
Step S310: obtain the resource data corresponding to the host and perform an integrity check on the resource data to obtain a first check result.
Specifically, the resource data corresponding to the host may be, for example, files stored on the host. The integrity check algorithm can be chosen as needed, for example MD5, a cyclic redundancy check (CRC) or a parity check. The chosen algorithm is applied to the resource data to obtain the first check result, for example by calculating the MD5 value of the resource data.
Step S320: obtain the standard integrity check result that was last calculated for the resource data while the storage device was normally connected, and compare the first check result with the standard integrity check result. If they are identical, proceed to the step of starting the cluster service corresponding to the host; otherwise, issue a data exception warning.
Specifically, while the connection status of the storage device is being monitored, each time a monitoring period elapses the integrity check result of the resource data is calculated and saved to a file, which is periodically overwritten. The standard integrity check result last calculated while the storage device was normally connected can therefore be obtained by reading this file. If the first check result is identical to the standard integrity check result, the data is intact and the following steps can proceed. If they differ, the data is damaged and a data exception warning is issued.
In this embodiment, an integrity check is performed on the data before the cluster service corresponding to the host is started, so whether the data is intact can be judged in advance. If the data is damaged, a data exception warning can be issued promptly, which improves response speed and accuracy.
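Steps S310 and S320 can be sketched in Python with MD5, one of the algorithms the description mentions. The function names and the idea of storing the baseline as a small text file next to the data are illustrative assumptions; the patent only says that the result is saved to a file and periodically overwritten.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def save_baseline(data_path: Path, baseline_path: Path) -> None:
    """Called each monitoring period while the storage connection is normal.

    Overwrites the previously saved standard integrity check result.
    """
    baseline_path.write_text(md5_of_file(data_path))


def data_intact(data_path: Path, baseline_path: Path) -> bool:
    """Compare the current check result with the last saved baseline (S320)."""
    return md5_of_file(data_path) == baseline_path.read_text().strip()
```

If `data_intact` returns False, the flow in the description issues a data exception warning instead of starting the cluster service corresponding to the host.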
In one embodiment, the method further includes: receiving, through a general-purpose interface, monitoring parameters issued by a third-party device, and reporting monitoring information to the third-party device.
Specifically, a general-purpose configuration interface is provided so that a wide range of service monitoring scenarios can be customized. The third-party device issues the monitoring parameters appropriate to each monitoring need, and once monitoring information has been obtained, such as the connection status of the storage device or the resource start state of the host, it is reported to the third-party device through the general-purpose interface. Fig. 4 shows the sequence diagram of the general-purpose interface between the monitoring device and the third-party device, which includes the following steps:
S1: The third-party device passes monitoring parameters to the monitoring device through the general-purpose interface;
S2: The monitoring device updates its configuration parameters to the newly passed-in parameters;
S3: The monitoring device notifies the third-party device through the general-purpose interface that configuration is complete;
S4: The third-party device passes a start command to the monitoring device through the general-purpose interface;
S5: The monitoring device starts monitoring the cluster system according to the start command;
S6: The cluster system feeds back the relevant monitoring information to the monitoring device;
S7: The monitoring device notifies the third-party device through the general-purpose interface that start-up is complete, and a link is established between them;
S8: The third-party device sends a heartbeat message;
S9: The third-party device receives a heartbeat message;
S10: Heartbeat processing is carried out between the monitoring device and the third-party device;
S11: Through its monitoring of the disk array, the monitoring device receives a notification that the disk array connection is abnormal and handles it accordingly;
S12: The monitoring device notifies the third-party device through the general-purpose interface that the disk array connection is abnormal;
S13-14: The third-party device receives the abnormality notification through the general-purpose interface, and monitoring continues;
S15: The monitoring device detects that the disk array connection has recovered and handles the recovery together with the cluster system;
S16: The monitoring device notifies the third-party device of the disk array recovery result through the general-purpose interface.
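The interaction in steps S1 to S16 can be pictured, very roughly, as the monitoring device exposing a few entry points and reporting back through a callback. The Python sketch below is purely illustrative: the class name, method names and notification strings are all hypothetical, and the patent does not define the message format or transport of the general-purpose interface.

```python
from typing import Callable, Dict


class MonitoringDeviceInterface:
    """Hypothetical monitoring-device side of the general-purpose interface."""

    def __init__(self, notify: Callable[[str], None]) -> None:
        self.notify = notify            # callback towards the third-party device
        self.params: Dict[str, int] = {}
        self.started = False

    def configure(self, params: Dict[str, int]) -> None:
        """S1 to S3: accept new parameters and confirm the configuration."""
        self.params.update(params)
        self.notify("config_done")

    def start(self) -> None:
        """S4 to S7: begin monitoring and confirm start-up."""
        self.started = True
        self.notify("start_done")

    def heartbeat(self) -> str:
        """S8 to S10: answer a heartbeat from the third-party device."""
        return "alive" if self.started else "not_started"

    def on_storage_event(self, connected: bool) -> None:
        """S11 to S16: report a disk array abnormality or recovery."""
        self.notify("storage_recovered" if connected else "storage_abnormal")


if __name__ == "__main__":
    received = []
    device = MonitoringDeviceInterface(received.append)
    device.configure({"storage_check_period_s": 30})
    device.start()
    device.on_storage_event(connected=False)
    device.on_storage_event(connected=True)
    print(received, device.heartbeat())
```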
The implementation of the dual-machine service recovery method is described in further detail below with reference to the drawings. As shown in Fig. 5, it includes the following steps:
A: The monitoring device receives the monitoring parameters through the general-purpose interface: storage-state monitoring period 30 seconds, storage-state maximum abnormal count 4, storage abnormal-state scan period 60 seconds, application service resource monitoring period 40 seconds, and application service resource monitoring maximum error count 5. The monitoring device is started once the parameters are set.
B: The monitoring device continuously monitors the connection status of the storage device, checking once every 30 seconds.
C: After the connection status has been found abnormal 4 consecutive times, the cluster software on the cluster management server is stopped, and the cluster management server is notified to stop the services on the host and the standby machine.
D: The cluster management server controls the host and the standby machine so that they stop serving.
E: The monitoring device continues to monitor the connection status of the storage device, now once every 60 seconds. After the connection status has been found normal 2 consecutive times, an integrity check is performed on the resource data corresponding to the host to obtain a check result. If the check result shows that the data is intact, proceed to F: notify the cluster management server to start the cluster service corresponding to the host. Otherwise, proceed to G: issue a data exception warning.
H: The monitoring device monitors the resource start state of the host. If the resource start state is normal, proceed to I: notify the cluster management server to start the cluster service corresponding to the standby machine, after which the cluster management server notifies the monitoring device that the cluster has recovered. If the resource start state is abnormal, proceed to J: send a cluster start failure notification, notify the cluster management server to stop the whole cluster service, and issue a data corruption warning.
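Steps B to J above can be traced with a small simulation. The following Python sketch is an assumption-laden illustration, not the patented implementation: the function name and action strings are hypothetical, probe results are supplied as plain booleans instead of real checks, timing is ignored, and thresholds are treated as "reached" rather than "exceeded", in line with the 4 and 2 consecutive results used in the worked example.

```python
from typing import Iterable, List


def recovery_actions(storage_probes: Iterable[bool],
                     data_intact: bool,
                     resource_checks: Iterable[bool],
                     max_abnormal: int = 4,
                     min_normal: int = 2,
                     max_resource_errors: int = 5) -> List[str]:
    """Return the ordered control actions for one failure/recovery cycle."""
    actions: List[str] = []
    stopped = False
    failures = successes = 0
    for ok in storage_probes:
        if not stopped:
            # Steps B and C: count consecutive abnormal results.
            failures = 0 if ok else failures + 1
            if failures >= max_abnormal:
                actions.append("stop_host_and_standby_services")
                stopped = True
        else:
            # Step E: wait for enough consecutive normal results.
            successes = successes + 1 if ok else 0
            if successes >= min_normal:
                break
    else:
        # Storage never failed, or has not recovered yet.
        return actions
    if not data_intact:
        actions.append("data_exception_warning")            # step G
        return actions
    actions.append("start_host_cluster_service")            # step F
    errors = 0
    for ok in resource_checks:                               # step H
        if ok:
            actions.append("start_standby_cluster_service")  # step I
            return actions
        errors += 1
        if errors >= max_resource_errors:
            break
    # Step J; running out of checks is also treated as failure here.
    actions.append("cluster_start_failure_stop_all")
    return actions


if __name__ == "__main__":
    probes = [True, False, False, False, False, False, True, True]
    print(recovery_actions(probes, True, [False, True]))
```

With the probe sequence in the usage example, the services are stopped after the fourth consecutive failure, the host is started after two consecutive successful probes, and the standby machine is started on the first successful resource check.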
In one embodiment, as shown in Fig. 6, a device for dual-machine service recovery is provided, including:
a monitoring module 410, configured to monitor the connection status of the storage device in real time;
a stopping module 420, configured to stop the services on the host and the standby machine if the connection status is found to be abnormal, and to continue monitoring the connection status of the storage device;
a host start module 430, configured to start the cluster service corresponding to the host if the connection is found to have recovered; and
a standby start module 440, configured to monitor the resource start state of the host and, if the resource start state is normal, to start the cluster service corresponding to the standby machine.
In one embodiment, as shown in Fig. 7, the device further includes:
a configuration module 450, configured to configure monitoring parameters, where the monitoring parameters include at least one of a storage-state monitoring period, a storage-state maximum abnormal count, a storage abnormal-state scan period, an application service resource monitoring period, and an application service resource monitoring maximum error count.
In one embodiment, as shown in Fig. 8, the device further includes:
a check module 460, configured to obtain the resource data corresponding to the host, perform an integrity check on the resource data to obtain a first check result, obtain the standard integrity check result that was last calculated for the resource data while the storage device was normally connected, and compare the first check result with the standard integrity check result; if they are identical, control passes to the host start module 430 to start the cluster service corresponding to the host; otherwise, a data exception warning is issued.
In one embodiment, as shown in Fig. 9, the device further includes:
an interface module 470, configured to receive, through a general-purpose interface, monitoring parameters issued by a third-party device and to report monitoring information to the third-party device.
In one embodiment, as shown in Fig. 10, a system for dual-machine service recovery is provided, including a monitoring device 510 and a storage device 520. The monitoring device 510 is configured to monitor the connection status of the storage device 520 in real time and, if the connection status is found to be abnormal, to stop the services on the host and the standby machine and continue monitoring the connection status of the storage device. The monitoring device 510 is further configured to start the cluster service corresponding to the host if the connection is found to have recovered, and to monitor the resource start state of the host and, if the resource start state is normal, start the cluster service corresponding to the standby machine.
In this embodiment, the monitoring device and the storage device cooperate: the monitoring device monitors the connection status of the storage device in real time. If the connection status is found to be abnormal, the services on the host and the standby machine are stopped and monitoring of the storage device continues. If the connection is found to have recovered, the cluster service corresponding to the host is started and the resource start state of the host is monitored; if the resource start state is normal, the cluster service corresponding to the standby machine is started. Thus, after the services on both machines have been stopped because of an abnormal storage connection, the cluster services of both machines can be started automatically according to the monitoring result once the connection recovers, which ensures service continuity. In addition, monitoring the resource start state of the host while the services are being started verifies that the resources are valid, which ensures that the services start correctly.
In one embodiment, as shown in Figure 11, the system further includes:
Third-party device 530, configured to issue monitoring parameters to the monitoring device through a generic interface; the third-party device is further configured to receive the monitoring information reported by the monitoring device.
Specifically, through the generic configuration interface, the third-party device issues monitoring parameters according to different monitoring requirements, so that a wide range of service monitoring scenarios can be customized. After the monitoring device obtains monitoring information, such as the connection status of the storage device and the resource start-up state of the primary host, it reports that information through the generic interface, and the third-party device receives it.
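As an illustration of this exchange, the following Python sketch models the issued monitoring parameters and the reported monitoring information as JSON messages. The patent does not specify a message format, so JSON, the field names, the default values, and the functions `apply_issued_params` and `build_report` are all assumptions made for this example; the five parameters correspond to those listed in claim 3.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class MonitoringParams:
    # Hypothetical field names and defaults for the parameters listed in claim 3.
    storage_check_period_s: float = 5.0          # storage state monitoring period
    storage_max_abnormal_count: int = 3          # maximum number of storage state abnormalities
    storage_abnormal_scan_period_s: float = 1.0  # storage abnormal state scan period
    resource_check_period_s: float = 5.0         # application service resource monitoring period
    resource_max_error_count: int = 3            # maximum number of resource monitoring errors


def apply_issued_params(current: MonitoringParams, payload: str) -> MonitoringParams:
    """Merge a JSON object issued by the third-party device into the current parameters."""
    issued = json.loads(payload)
    merged = asdict(current)
    unknown = set(issued) - set(merged)
    if unknown:
        raise ValueError(f"unknown monitoring parameters: {sorted(unknown)}")
    merged.update(issued)
    return MonitoringParams(**merged)


def build_report(storage_connected: bool, primary_resources_started: bool) -> str:
    """Encode the monitoring information reported back to the third-party device."""
    return json.dumps(
        {
            "storage_connected": storage_connected,
            "primary_resources_started": primary_resources_started,
        },
        sort_keys=True,
    )
```

A third-party device that only wants to change one threshold could issue `{"storage_max_abnormal_count": 5}`; the remaining parameters keep their current values.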
In one embodiment, as shown in Figure 12, the system further includes:
Cluster management server 540, configured to receive control instructions sent by the monitoring device and to control the primary host and the standby host according to those instructions.
Specifically, after receiving a control instruction from the monitoring device, the cluster management server manages and controls the primary and standby hosts in a unified way, which is more convenient and orderly.
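The division of labor in this embodiment can be sketched as a simple dispatcher: the monitoring device decides what should happen, and the cluster management server applies it to the hosts. The Python below is a minimal sketch under stated assumptions; the instruction strings, the `Host` and `ClusterManager` classes, and the in-memory flags are hypothetical stand-ins for real cluster control operations, which the patent does not detail.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Host:
    name: str
    cluster_service_running: bool = True


@dataclass
class ClusterManager:
    """Applies control instructions from the monitoring device to both hosts (sketch)."""

    primary: Host
    standby: Host
    log: List[str] = field(default_factory=list)  # instructions handled, in order

    def handle(self, instruction: str) -> None:
        if instruction == "stop_all":
            # Storage connection abnormal: stop services on both hosts.
            self.primary.cluster_service_running = False
            self.standby.cluster_service_running = False
        elif instruction == "start_primary":
            # Storage connection restored: bring up the primary host first.
            self.primary.cluster_service_running = True
        elif instruction == "start_standby":
            # Primary resources started normally: bring up the standby host.
            self.standby.cluster_service_running = True
        else:
            raise ValueError(f"unknown control instruction: {instruction!r}")
        self.log.append(instruction)
```

Routing every instruction through one component keeps the start and stop order of the two hosts in a single place, which matches the "unified" control the embodiment describes.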
A person of ordinary skill in the art will understand that all or part of the flows in the method embodiments above can be implemented by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium; in the embodiments of the present invention, the program can be stored in a storage medium of a computer system and executed by at least one processor of that computer system to carry out the flows of the method embodiments described above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The technical features of the embodiments described above may be combined arbitrarily. For brevity, not every possible combination of these technical features has been described; however, any combination that involves no contradiction should be considered within the scope of this specification.
The embodiments described above represent only several implementations of the present invention. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept, and these all fall within the protection scope of the present invention. Therefore, the protection scope of this patent shall be determined by the appended claims.

Claims (12)

1. A method for dual-host service recovery, the method comprising:
monitoring the connection status of a storage device in real time;
if the monitored connection status is a connection abnormality, stopping the services of the primary and standby hosts, and continuing to monitor the connection status of the storage device;
if the monitored connection status is connection restored, starting the cluster service corresponding to the primary host;
monitoring the resource start-up state of the primary host and, if the resource start-up state is normal, starting the cluster service corresponding to the standby host.
2. The method according to claim 1, characterized in that the step of monitoring the connection status of the storage device in real time comprises:
performing a read/write check on the file system of the storage device, and determining the connection status of the storage device according to the result of the read/write check;
or sending a network test packet to the storage device, and determining the connection status of the storage device according to the returned response packet.
3. The method according to claim 1, characterized in that, before the step of monitoring the connection status of the storage device in real time, the method further comprises:
configuring monitoring parameters, the monitoring parameters comprising at least one of: a storage state monitoring period, a maximum number of storage state abnormalities, a storage abnormal state scan period, an application service resource monitoring period, and a maximum number of application service resource monitoring errors.
4. The method according to claim 1, characterized in that, before the step of starting the cluster service corresponding to the primary host, the method further comprises:
obtaining resource data corresponding to the primary host;
performing an integrity check on the resource data to obtain a first check result;
obtaining the standard integrity check result corresponding to the resource data from the last time the storage device was in a normal connection status;
comparing the first check result with the standard integrity check result; if they are identical, proceeding to the step of starting the cluster service corresponding to the primary host; otherwise, issuing a data abnormality warning.
5. The method according to claim 3, characterized in that the method further comprises:
receiving, through a generic interface, the monitoring parameters issued by a third-party device, and reporting monitoring information to the third-party device.
6. A device for dual-host service recovery, characterized in that the device comprises:
a monitoring module, configured to monitor the connection status of a storage device in real time;
a stopping module, configured to stop the services of the primary and standby hosts if the monitored connection status is a connection abnormality, and to continue to monitor the connection status of the storage device;
a primary host start-up module, configured to start the cluster service corresponding to the primary host if the monitored connection status is connection restored;
a standby host start-up module, configured to monitor the resource start-up state of the primary host and, if the resource start-up state is normal, to start the cluster service corresponding to the standby host.
7. The device according to claim 6, characterized in that the device further comprises:
a configuration module, configured to configure monitoring parameters, the monitoring parameters comprising at least one of: a storage state monitoring period, a maximum number of storage state abnormalities, a storage abnormal state scan period, an application service resource monitoring period, and a maximum number of application service resource monitoring errors.
8. The device according to claim 6, characterized in that the device further comprises:
a check module, configured to obtain resource data corresponding to the primary host, perform an integrity check on the resource data to obtain a first check result, obtain the standard integrity check result corresponding to the resource data from the last time the storage device was in a normal connection status, and compare the first check result with the standard integrity check result; if they are identical, the primary host start-up module is invoked to start the cluster service corresponding to the primary host; otherwise, a data abnormality warning is issued.
9. The device according to claim 7, characterized in that the device further comprises:
an interface module, configured to receive, through a generic interface, the monitoring parameters issued by a third-party device and to report monitoring information to the third-party device.
10. A system for dual-host service recovery, characterized in that the system comprises a monitoring device and a storage device;
the monitoring device is configured to monitor the connection status of the storage device in real time and, if the monitored connection status is a connection abnormality, to stop the services of the primary and standby hosts and continue to monitor the connection status of the storage device;
the monitoring device is further configured to start the cluster service corresponding to the primary host if the monitored connection status is connection restored;
the monitoring device is further configured to monitor the resource start-up state of the primary host and, if the resource start-up state is normal, to start the cluster service corresponding to the standby host.
11. The system according to claim 10, characterized in that the system further comprises:
a third-party device, configured to issue monitoring parameters to the monitoring device through a generic interface;
the third-party device is further configured to receive the monitoring information reported by the monitoring device.
12. The system according to claim 10, characterized in that the system further comprises:
a cluster management server, configured to receive control instructions sent by the monitoring device and to control the primary host and the standby host according to the control instructions.
CN201610332931.0A 2016-05-17 2016-05-17 The methods, devices and systems of two-shipper business recovery Pending CN107395387A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610332931.0A CN107395387A (en) 2016-05-17 2016-05-17 The methods, devices and systems of two-shipper business recovery

Publications (1)

Publication Number Publication Date
CN107395387A true CN107395387A (en) 2017-11-24

Family

ID=60338820

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610332931.0A Pending CN107395387A (en) 2016-05-17 2016-05-17 The methods, devices and systems of two-shipper business recovery

Country Status (1)

Country Link
CN (1) CN107395387A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109981459A (en) * 2019-02-28 2019-07-05 联想(北京)有限公司 A kind of method for sending information, client and computer readable storage medium
CN110221949A (en) * 2019-06-17 2019-09-10 深圳前海微众银行股份有限公司 Automate operation management method, apparatus, equipment and readable storage medium storing program for executing

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101160794A (en) * 2005-10-26 2008-04-09 华为技术有限公司 Disaster recovery system and method of service controlling device in intelligent network
JP2008287632A (en) * 2007-05-21 2008-11-27 Panasonic Corp Control device recovery system
CN101854253A (en) * 2010-05-07 2010-10-06 无锡中星微电子有限公司 Method for automatically recovering monitoring and storing and monitoring system thereof
CN103167517A (en) * 2011-12-14 2013-06-19 中国电信股份有限公司 Method and system for monitoring data recovery in internet of things
CN103500130A (en) * 2013-09-11 2014-01-08 上海爱数软件有限公司 Method for backing up dual-computer hot standby data in real time
CN105024879A (en) * 2015-07-15 2015-11-04 中国船舶重工集团公司第七0九研究所 Virtual machine fault detection and recovery system and virtual machine detection, recovery and starting method

Similar Documents

Publication Publication Date Title
CN110224858B (en) Log-based alarm method and related device
CN104065526B (en) A kind of method and apparatus of server failure alarm
CN110554930B (en) Data storage method and related equipment
CN103138988B (en) Positioning treatment method and positioning treatment device of network faults
CN107800783B (en) Method and device for remotely monitoring server
CN109508295B (en) Block chain consensus algorithm testing method and device, calculating device and storage medium
CN112395156A (en) Fault warning method and device, storage medium and electronic equipment
CN110209529A (en) The guard method of radio frequency parameter and electronic equipment
CN111930703A (en) Automatic log file capturing method and device and computer equipment
CN107395387A (en) The methods, devices and systems of two-shipper business recovery
CN112713996B (en) Block chain-based fault verification method, server and terminal
CN109460311A (en) The management method and device of firmware abnormality
CN115102862B (en) Automatic synchronization method and device for SDN equipment
JP2010147804A (en) Transmitting apparatus, and unit mounted on the same
CN106406963A (en) Initialization method and device for Linux system
CN113392079B (en) Distributed storage cluster log storage optimization method, system and terminal
CN106559249A (en) Check the method and device of security baseline
CN109445993A (en) A kind of detection method and relevant apparatus of file system health status
CN110968456A (en) Method and device for processing fault disk in distributed storage system
WO2021057855A1 (en) Program process monitoring method and apparatus, computer device, and readable storage medium
CN112069027A (en) Interface data processing method and device, electronic equipment and storage medium
CN114826884B (en) Method, device, equipment and readable medium for positioning communication faults of cross-equipment protocol
CN112530139B (en) Monitoring system, method, device, collector and storage medium
CN110674016A (en) Method for processing log and positioning error information in mobile terminal, mobile terminal and monitoring device thereof and storage medium
CN111130926B (en) State monitoring method, system and device suitable for encryption machine and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200624

Address after: 518057 Zhongxing building, A3-01, A3-02, Nanshan District hi tech Industrial Park, Shenzhen, Guangdong

Applicant after: Shenzhen ZTE Technical Service Co.,Ltd.

Address before: 518000 Zhongxing building, science and technology south road, Nanshan District hi tech Industrial Park, Guangdong, Shenzhen

Applicant before: ZTE Corp.

WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20171124
