CN107395387A - The methods, devices and systems of two-shipper business recovery - Google Patents
The methods, devices and systems of two-shipper business recovery Download PDFInfo
- Publication number
- CN107395387A CN107395387A CN201610332931.0A CN201610332931A CN107395387A CN 107395387 A CN107395387 A CN 107395387A CN 201610332931 A CN201610332931 A CN 201610332931A CN 107395387 A CN107395387 A CN 107395387A
- Authority
- CN
- China
- Prior art keywords
- connection status
- monitoring
- storage device
- main frame
- resource
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
- H04L41/0663—Performing the actions predefined by failover planning, e.g. switching to standby network elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L1/00—Arrangements for detecting or preventing errors in the information received
- H04L1/22—Arrangements for detecting or preventing errors in the information received using redundant apparatus to increase reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/10—Active monitoring, e.g. heartbeat, ping or trace-route
Abstract
The present invention relates to a kind of methods, devices and systems of two-shipper business recovery, including:The connection status of storage device is monitored in real time;If it is abnormal for connection to listen to the connection status, stop standby machine business, and continue to monitor the connection status of the storage device;If listening to the connection status for connection to recover, start cluster service corresponding to main frame;Monitor the resource starting state of main frame, if the resource starting state is normal, then start cluster service corresponding to standby host, when the connection status of storage device is abnormal, after the stopping of standby machine business, state can be recovered according to the connection of storage device, according to cluster service corresponding to snoop results automatic start standby machine, ensure that the continuity of business.Meanwhile the validity of resource is demonstrated by the monitoring of the resource starting state to main frame during startup business, it ensure that the correctness of initiation of services.
Description
Technical field
The present invention relates to communication technical field, more particularly to a kind of methods, devices and systems of two-shipper business recovery.
Background technology
In High Availabitity two-shipper field, the continuity of Service Source and the security of data are the functions of paying close attention to the most
Point, and it is particularly important in the communications industry.
But High Availabitity two-shipper only exists simple monitoring and management at present, such as in the management of storage resource, simply pin
The read-only of storage is judged, or overtime processing has only been done to the connection of storage, intellectuality, that is, do not work as storage
After network disconnects, two-shipper management software typically can all think that storage there is a problem, and cause two-shipper business all to be paralysed, even if
Network recovery, two-shipper management software also will not can only send alarm or throw exception wait artificially automatically by business recovery
Operation is gone to solve, the continuous poor performance of business.
The content of the invention
Based on this, it is necessary to for above-mentioned technical problem, there is provided a kind of methods, devices and systems of two-shipper business recovery,
After abnormal and recovery occurs for storage network, cluster service can be recovered automatically, ensure the continuity of business.
A kind of method of two-shipper business recovery, methods described include:
The connection status of storage device is monitored in real time;
If it is abnormal for connection to listen to the connection status, stop standby machine business, and continue to monitor the storage
The connection status of equipment;
If listening to the connection status for connection to recover, start cluster service corresponding to main frame;
The resource starting state of main frame is monitored, if the resource starting state is normal, starts cluster corresponding to standby host
Service.
A kind of device of two-shipper business recovery, described device include:
Module is monitored, for monitoring the connection status of storage device in real time;
Stopping modular, if abnormal for connection for listening to the connection status, stop standby machine business, and continue
Monitor the connection status of the storage device;
Host start-up module, if recovered for listening to the connection status for connection, start and collect corresponding to main frame
Group's service;
Standby host starting module, for monitoring the resource starting state of main frame, if the resource starting state is normal, open
Cluster service corresponding to dynamic standby host.
The method and apparatus of above-mentioned two-shipper business recovery, by monitoring the connection status of storage device in real time, if monitored
It is abnormal for connection to connection status, then stop standby machine business, and continue to monitor the connection status of storage device, if listened to
The connection status is recovered for connection, then cluster service corresponding to startup main frame, the resource starting state of main frame is monitored, if money
Source starting state is normal, then starts cluster service corresponding to standby host, when the connection status exception of storage device, in standby machine business
After stopping, state can be recovered according to the connection of storage device, according to cluster service corresponding to snoop results automatic start standby machine,
It ensure that the continuity of business.Meanwhile verified during startup business by the monitoring of the resource starting state to main frame
The validity of resource, ensure that the correctness of initiation of services.
A kind of system of two-shipper business recovery, the system include monitoring device and storage device;
The monitoring device is used for the connection status for monitoring the storage device in real time, if listening to the connection status
It is abnormal for connection, then stop standby machine business, and continue to monitor the connection status of the storage device;
If the monitoring device is additionally operable to listen to the connection status as connection recovery, starts and collect corresponding to main frame
Group's service;
The monitoring device is additionally operable to monitor the resource starting state of main frame, if the resource starting state is normal,
Start cluster service corresponding to standby host.
The system of above-mentioned two-shipper business recovery, by the cooperation of monitoring device and storage device, monitoring device is monitored in real time
The connection status of storage device, if listening to connection status as connection exception, stop standby machine business, and continue monitoring and deposit
The connection status of equipment is stored up, is recovered if listening to the connection status for connection, starts cluster service corresponding to main frame, prison
The resource starting state of main frame is listened, if resource starting state is normal, cluster service corresponding to startup standby host, works as storage device
Connection status it is abnormal, after the stopping of standby machine business, state can be recovered according to the connection of storage device, according to snoop results from
Cluster service corresponding to dynamic startup standby machine, ensure that the continuity of business.Meanwhile by master during startup business
The monitoring of the resource starting state of machine demonstrates the validity of resource, ensure that the correctness of initiation of services.
Brief description of the drawings
Fig. 1 is the applied environment figure of the method for two-shipper business recovery in one embodiment;
Fig. 2 is the flow chart of the method for two-shipper business recovery in one embodiment;
Fig. 3 is the flow chart of the cluster service according to corresponding to resource data check results start main frame in one embodiment;
Fig. 4 is general-purpose interface timing diagram between supervising device and third party device in one embodiment;
Fig. 5 is the timing diagram of the method for two-shipper business recovery in one embodiment;
Fig. 6 is the structured flowchart of the device of two-shipper business recovery in one embodiment;
Fig. 7 is the structured flowchart of the device of two-shipper business recovery in another embodiment;
Fig. 8 is the structured flowchart of the device of two-shipper business recovery in further embodiment;
Fig. 9 is the structured flowchart of the device of two-shipper business recovery in another embodiment;
Figure 10 is the structured flowchart of the system of two-shipper business recovery in one embodiment;
Figure 11 is the structured flowchart of the system of two-shipper business recovery in further embodiment;
Figure 12 is the structured flowchart of the system of two-shipper business recovery in another embodiment.
Embodiment
Fig. 1 is the applied environment figure of the method operation of two-shipper business recovery in one embodiment, as shown in figure 1, the application
Environment includes monitoring device 110, storage device 120, cluster management server 130, cluster system 140, cluster standby host 150, prison
Device for tone frequencies 110 can be arranged in any one server, and monitoring device 110 can directly carry out configuration startup, can also lead to
Cross the monitoring parameter that general-purpose interface reception is transmitted and carry out configuration startup, realization that can be independent is monitored, and can also be supplied to the 3rd
Side's control, which is realized, monitors, and monitors the connection status of storage device in real time and is sent out according to snoop results to cluster management server 130
Control instruction is sent, control instruction is used for the service state for controlling standby machine.Storage device 120 can possess connection store function
Equipment, such as with the server of store function, magnetic battle array.Clustered software is run in cluster management server 130, for unified
Cluster system and cluster standby host are managed, such as control the file system of standby machine, control standby machine to stop business or startup
Business etc..Cluster system 140 and cluster standby host 150 are used to combine the two-shipper business for providing High Availabitity.
In one embodiment, as shown in the figure, there is provided a kind of method of two-shipper business recovery, ring is applied applied to above-mentioned
Monitoring device in border, comprises the following steps:
Step S210, the connection status of storage device is monitored in real time.
Specifically, connection status refers to the UNICOM of network service, it is that connection is abnormal if broken string, is company if networking
Connect normal.Storage device can be monitored according to some cycles, week can be monitored by the self-defined storage state of configuration parameter
Phase, it is easy to easily be controlled monitoring.It can be tested by going the system file of storage device to be written and read verification or send
The mode of packet is monitored the connection status of storage device.Also settable storage state maximum frequency of abnormity, if prison
The abnormal connection number of result is listened then to be judged as that connection is abnormal more than storage state maximum frequency of abnormity, if being not above storing
State maximum frequency of abnormity, can determine whether for connection status it is normal, by setting storage state maximum frequency of abnormity to avoid judging by accident
It is disconnected, because when the connection of storage device occurs abnormal, it can attempt repeatedly to retry monitoring, the explanation connection hair if retrying successfully
Raw flash, but business will not have an impact.
In one embodiment, step S210 includes:Verification is written and read to the file system of storage device, according to read-write
The result of verification determines the connection status of storage device, or by sending network testing data bag to storage device, according to return
Response data packet determine the connection status of storage device.
Specifically, verification or the wound that storage medium reads and writes speed are carried out to the file system of storage device by system command
The mode for building temporary file judges the read-write state of storage medium, if storage device is magnetic battle array, then to magnetic present on server
The volume of battle array distribution, the volume of magnetic battle array distribution change into the file system of operating system, the verification of progress disk read-write speed, can be with
On the volume of magnetic battle array create temporary file mode judge disk read-write state, if disk speed can not obtain, disk can not
Then check results are that connection status is abnormal to reading and writing of files, and otherwise check results are that connection status is normal.IP PING can also be passed through
Network testing data bag is sent to storage device, if receiving the response data packet of return, IP PING lead to, storage device
Connection status is normal.If there is storage state maximum frequency of abnormity, then can according to the continuous abnormal numbers of check results and
The continuous response results abnormity number of PING orders judges connection status.
Step S220, if listening to connection status as connection exception, stop standby machine business, and continue to monitor storage
The connection status of equipment.
If specifically, listening to, connection status is abnormal for connection, and notice cluster management server stops business, cluster
The business of management server control standby machine stops, can report and alarm information alert storage device connection generation exception.Continue to supervise
Listen the connection status of storage device, when standby machine business stops to the monitoring parameter of the connection status of storage device can with before
Monitoring parameter it is different, such as there is different listening periods, storage abnormality scan period and storage state monitoring are such as set
Cycle is different.
Step S230, recover if listening to connection status for connection, start cluster service corresponding to main frame.
Specifically, the settable minimum normal number of recovery state, it is minimum more than recovery state only to listen to connection status
Normal number is just judged as that connection recovers, and can customize the minimum normal number of recovery state, is such as arranged to 2 times, is further ensured that
Connect the reliability recovered.If connection recovers, notice cluster management server starts cluster service corresponding to main frame.
Step S240, the resource starting state of main frame is monitored, if resource starting state is normal, started corresponding to standby host
Cluster service.
Specifically, application service resource listening period can be set, the resource starting state of main frame is monitored.Checked by process
Order, checking process initiation state to monitor the resource starting state of main frame, when multiple processes be present, entering simply by the presence of one
Journey starting state is abnormal, then resource starting state is abnormal.The starting state of process reflects mainframe cluster and taken for business
The result of business is normal or abnormal, if the starting state of process is abnormal, illustrates that resource has problem, it is necessary to stop
The service of main frame, if the starting state of process is normal, illustrate that resource completely can use, cluster corresponding to standby host can be started and taken
Business.When judging that resource starting state is normal, application service resource can be set and monitor maximum errors number, the only resource of main frame
Starting state continuous abnormal number exceedes the maximum errors number of application service resource monitoring and is just judged as resource starting state exception,
Monitoring maximum errors number by application service resource reduces probability of miscarriage of justice.
In the present embodiment, by monitoring the connection status of storage device in real time, if it is different to connect to listen to connection status
Often, then stop standby machine business, and continue to monitor the connection status of storage device, if listening to the connection status as connection
Recover, then cluster service corresponding to startup main frame, monitor the resource starting state of main frame, if resource starting state is normal,
Start cluster service corresponding to standby host,, can be according to storage after the stopping of standby machine business when the connection status exception of storage device
The connection of equipment recovers state, according to cluster service corresponding to snoop results automatic start standby machine, ensure that the continuous of business
Property.Meanwhile the validity of resource is demonstrated by the monitoring of the resource starting state to main frame during startup business, protect
The correctness of initiation of services is demonstrate,proved.
In one embodiment, before step S210, in addition to:Monitoring parameter is configured, monitoring parameter includes:Storage state
Listening period, storage state maximum frequency of abnormity, storage abnormality scan period, application service resource listening period, application
Service Source monitors at least one of maximum errors number.
Specifically, monitoring parameter freely configures the customizable flexible management that snoop procedure can be achieved, storage state monitors week
Phase is used to control the time cycle for monitoring storage device every time, and storage state maximum frequency of abnormity refers to listen to storage device
When connection status is abnormal, largest tolerable frequency of abnormity, in the range of storage state maximum frequency of abnormity, it is believed that storage is set
Standby connection status is normal.The storage abnormality scan period refers to when standby machine business stops to the connection status of storage device
Listening period.Application service resource listening period refers to monitor week to the resource starting state of main frame after main frame recovery business
Phase.When the maximum errors number of application service resource monitoring refers to the resource starting state exception for listening to main frame, largest tolerable
Frequency of abnormity, in the range of resource starting state maximum frequency of abnormity, it is believed that the resource starting state of main frame is normal.Can be free
The number of monitoring parameter is increased or decreased, adjusts monitoring parameter as needed.It may also include Disk State such as and scan time-out time
For determine disk whether UNICOM order perform after time-out time normal range (NR), if time-out time is swept more than Disk State
Retouch time-out time and then illustrate that connection is abnormal, what Disk State scanning expired times were used to limiting Disk State scanning expired times can
Receive scope, if it exceeds Disk State scans expired times, then it is assumed that Disk State is abnormal.Monitored server name is used
In when storage device deployment on the server when, it is determined that monitored server.Monitored process title, which is used for determination, to be needed to supervise
The process of control.
In one embodiment, as shown in figure 3, before the step of starting cluster service corresponding to main frame in step S230,
Also include:
Step S310, resource data corresponding to main frame is obtained, carrying out completeness check to resource data obtains the first verification
As a result.
Specifically, resource data corresponding to main frame can be file stored on main frame etc., the algorithm of completeness check can root
According to needing to select, such as md5 checking algorithms, crc CRCs algorithm, even-odd check method, pass through integrity check algorithm
Corresponding computing is carried out to resource data and obtains the first check results, MD5 values such as are calculated to resource data, obtain the first verification knot
Fruit.
Step S320, obtain last time standard integrality school corresponding to resource data in the normal connection status of storage device
Result is tested, by the first check results and standard completeness check results contrast, if identical, enter and starts collection corresponding to main frame
The step of group's service, otherwise, send data exception warning.
Specifically, when monitoring the connection status of storage device, often reach a listening period and just calculate first resource number
According to corresponding completeness check result and a file is saved as, and is periodically covered.So as to by reading file acquisition
Last time standard completeness check result corresponding to resource data in the normal connection status of storage device.If the first verification knot
Fruit is identical with standard completeness check result, then it is intact to illustrate data, can carry out following step.If not phase
Together, then illustrate that data are damaged, send data exception warning.
In the present embodiment, completeness check first is carried out to data before the step of starting cluster service corresponding to main frame,
It can judge whether data are intact in advance, if damaged, data exception warning can be sent in time, improve response speed and accurate
Degree.
In one embodiment, method also includes:By general-purpose interface receive the monitoring parameter that issues of third party device and
Monitoring information is reported to third party device.
Specifically, generic configuration interface is provided, can be with comprehensive customization miscellaneous service monitoring scene, according to different prisons
Control needs by third party device to issue corresponding monitoring parameter, while after obtaining monitoring information, such as the connection status of storage device
Information, resource starting state information of main frame etc. are reported by general-purpose interface to third party device.It is illustrated in figure 4 supervising device
The general-purpose interface timing diagram between third party device, comprises the following steps:
S1:Third party device is passed to monitoring parameter to monitoring device by general-purpose interface;
S2:Monitoring device renewal configuration parameter is the parameter of new incoming;
S3:Monitoring device notifies third party device configuration to complete by general-purpose interface;
S4:Third party device starts order to monitoring device by the way that general-purpose interface is incoming;
S5:The monitoring that monitoring device is ordered and started to group system according to starting;
S6:Group system feeds back associated monitoring information to monitoring device
S7:Monitoring device notifies third party device start completion by general-purpose interface, and produces link setup relation;
S8:Third party device sends heartbeat message;
S9:Third party device receives heartbeat message;
S10:Heartbeat processing is carried out between monitoring device and third party device;
S11:Monitoring device is received the abnormal notice of magnetic battle array connection and located accordingly by the monitoring to magnetic battle array
Reason;
S12:Monitoring device notifies that the connection of third party device magnetic battle array is abnormal by general-purpose interface;
S13-14:Third party device receives abnormal notice by general-purpose interface, and continues to monitor;
S15:Monitoring device listens to the connection of magnetic battle array and recovered, and journey is treated with group system;
S16:Monitoring device notifies the result of magnetic battle array recovery by general-purpose interface to third party device.
The implementation to the method for two-shipper business recovery is described in further detail below in conjunction with the accompanying drawings, as shown in figure 5, bag
Include following steps:
A:Monitoring device receives monitoring parameter by general-purpose interface, and storage state listening period 30 seconds, storage state are maximum
Frequency of abnormity 4 times, 60 seconds abnormality scan periods of storage, application service resource listening period 40 seconds, application service resource prison
Listen maximum errors number 5 times, start monitoring device after being provided with.
B:Monitoring device persistently monitors the connection status of storage device, checks once within every 30 seconds.
C:After continuous 4 connection status listened to are all exceptions, the clustered software in cluster management server is stopped
Only, while cluster management server is notified to stop the service of standby machine.
D:Cluster management server control standby machine makes it stop service.
E:Monitoring device continues to monitor the connection status of storage device, and it is 60 seconds 1 time to monitor frequency, judges continuous 2 companies
After connecing state recovery normally, completeness check is carried out to resource data corresponding to main frame, check results are obtained, if check results
It is complete for data, then into F:Cluster management server is notified to start cluster service corresponding to main frame, otherwise into G:Send number
According to abnormality warnings.
H:Monitoring device monitors the resource starting state of main frame, if resource starting state is normal, into I:Notice collection
Group's management server starts cluster service corresponding to standby host, and cluster management server notice monitoring device cluster has recovered.If
Resource starting state is abnormal, then into J:Send cluster and start failure notification, notice cluster management server stops whole cluster
Service, send corrupted data warning.
In one embodiment, as shown in Figure 6, there is provided a kind of device of two-shipper business recovery, including:
Module 410 is monitored, for monitoring the connection status of storage device in real time.
Stopping modular 420, if abnormal for connection for listening to connection status, stop standby machine business, and continue
Monitor the connection status of storage device.
Host start-up module 430, if recovered for listening to connection status for connection, start cluster corresponding to main frame
Service.
Standby host starting module 440, for monitoring the resource starting state of main frame, if resource starting state is normal, open
Cluster service corresponding to dynamic standby host.
In one embodiment, as shown in fig. 7, device also includes:
Configuration module 450, for configuring monitoring parameter, monitoring parameter includes:Storage state listening period, storage state are most
Big frequency of abnormity, storage abnormality scan period, application service resource listening period, application service resource monitor maximum mistake
At least one of number.
In one embodiment, as shown in figure 8, device also includes:
Correction verification module 460, for obtaining resource data corresponding to main frame, completeness check is carried out to resource data and obtains the
One check results, obtain last time standard completeness check knot corresponding to resource data in the normal connection status of storage device
Fruit, the first check results and standard completeness check results contrast if identical, start into host start-up module 430
Cluster service corresponding to main frame, otherwise, send data exception warning.
In one embodiment, as shown in figure 9, device also includes:
Interface module 470, set for the monitoring parameter issued by general-purpose interface reception third party device and to third party
It is standby to report monitoring information.
In one embodiment, as shown in Figure 10, there is provided a kind of system of two-shipper business recovery, including monitoring device
510 and storage device 520, monitoring device 510 is used for the connection status for monitoring storage device 520 in real time, if listening to connection
State is abnormal for connection, then stops standby machine business, and continues to monitor the connection status of storage device, and monitoring device 510 is also used
If recovered in listening to the connection status for connection, start cluster service corresponding to main frame, monitoring device 510 is additionally operable to
The resource starting state of main frame is monitored, if resource starting state is normal, starts cluster service corresponding to standby host.
In the present embodiment, by the cooperation of monitoring device and storage device, monitoring device monitors the company of storage device in real time
State is connect, if listening to connection status as connection exception, stops standby machine business, and continue to monitor the connection of storage device
State, recover if listening to the connection status for connection, cluster service corresponding to startup main frame, monitor the resource of main frame
Starting state, if resource starting state is normal, start cluster service corresponding to standby host, when the connection status of storage device is different
Often, after the stopping of standby machine business, state can be recovered according to the connection of storage device, according to snoop results automatic start standby machine
Corresponding cluster service, it ensure that the continuity of business.Meanwhile by starting to the resource of main frame during startup business
The monitoring of state demonstrates the validity of resource, ensure that the correctness of initiation of services.
In one embodiment, as shown in figure 11, system also includes:
Third party device 530, for issuing monitoring parameter to monitoring device by general-purpose interface, third party device is additionally operable to
Receive the monitoring information that monitoring device reports.
Specifically, by generic configuration interface, needed to issue corresponding monitoring by third party device according to different monitoring
Parameter, can be with comprehensive customization miscellaneous service monitoring scene.After monitoring device obtains monitoring information, such as the connection of storage device
Status information, resource starting state information of main frame etc., the prison that third party device is reported by general-purpose interface reception monitoring device
Control information.
In one embodiment, as shown in figure 12, system also includes:
Cluster management server 540, for receiving the control instruction of monitoring device transmission, according to control instruction to main frame and
Standby host is controlled.
Specifically, unification is to main frame and standby host after the control instruction sent by cluster management server reception monitoring device
Management is controlled, is more convenient orderly.
One of ordinary skill in the art will appreciate that realize all or part of flow in above-described embodiment method, being can be with
The hardware of correlation is instructed to complete by computer program, described program can be stored in a computer read/write memory medium
In, in the embodiment of the present invention, the program can be stored in the storage medium of computer system, and by the computer system
At least one computing device, to realize the flow for including the embodiment such as above-mentioned each method.Wherein, the storage medium can be
Magnetic disc, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access
Memory, RAM) etc..
Each technical characteristic of embodiment described above can be combined arbitrarily, to make description succinct, not to above-mentioned reality
Apply all possible combination of each technical characteristic in example to be all described, as long as however, the combination of these technical characteristics is not deposited
In contradiction, the scope that this specification is recorded all is considered to be.
Embodiment described above only expresses the several embodiments of the present invention, and its description is more specific and detailed, but simultaneously
Can not therefore it be construed as limiting the scope of the patent.It should be pointed out that come for one of ordinary skill in the art
Say, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to the protection of the present invention
Scope.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.
Claims (12)
1. a kind of method of two-shipper business recovery, methods described include:
The connection status of storage device is monitored in real time;
If it is abnormal for connection to listen to the connection status, stop standby machine business, and continue to monitor the storage device
Connection status;
If listening to the connection status for connection to recover, start cluster service corresponding to main frame;
The resource starting state of main frame is monitored, if the resource starting state is normal, starts cluster service corresponding to standby host.
2. according to the method for claim 1, it is characterised in that it is described in real time monitor storage device connection status the step of
Including:
Verification is written and read to the file system of the storage device, the storage device is determined according to the result of read-write check
Connection status;
Or by sending network testing data bag to the storage device, determine that the storage is set according to the response data packet of return
Standby connection status.
3. according to the method for claim 1, it is characterised in that in the step of the connection status for monitoring storage device in real time
Before rapid, in addition to:
Monitoring parameter is configured, the monitoring parameter includes:Storage state listening period, storage state maximum frequency of abnormity, storage
At least one in abnormality scan period, application service resource listening period, the maximum errors number of application service resource monitoring
Kind.
4. according to the method for claim 1, it is characterised in that it is described startup main frame corresponding to cluster service the step of it
Before, in addition to:
Obtain resource data corresponding to the main frame;
Completeness check is carried out to the resource data and obtains the first check results;
Obtain last time standard completeness check result corresponding to the resource data in the normal connection status of storage device;
By first check results and standard completeness check results contrast, if identical, enter the startup main frame pair
The step of cluster service answered, otherwise, send data exception warning.
5. according to the method for claim 3, it is characterised in that methods described also includes:
The monitoring parameter that issues of third party device is received by general-purpose interface and reports monitoring information to third party device.
6. a kind of device of two-shipper business recovery, it is characterised in that described device includes:
Module is monitored, for monitoring the connection status of storage device in real time;
Stopping modular, if abnormal for connection for listening to the connection status, stop standby machine business, and continue to monitor
The connection status of the storage device;
Host start-up module, if recovered for listening to the connection status for connection, start cluster clothes corresponding to main frame
Business;
Standby host starting module, for monitoring the resource starting state of main frame, if the resource starting state is normal, start standby
Cluster service corresponding to machine.
7. device according to claim 6, it is characterised in that described device also includes:
Configuration module, for configuring monitoring parameter, the monitoring parameter includes:Storage state listening period, storage state are maximum
Frequency of abnormity, storage abnormality scan period, application service resource listening period, application service resource monitor maximum mistake time
At least one of number.
8. device according to claim 6, it is characterised in that described device also includes:
Correction verification module, for obtaining resource data corresponding to the main frame, completeness check is carried out to the resource data and obtained
First check results, obtain last time standard integrality school corresponding to the resource data in the normal connection status of storage device
Result is tested, by first check results and standard completeness check results contrast, if identical, into host start-up module
Start cluster service corresponding to main frame, otherwise, send data exception warning.
9. device according to claim 7, it is characterised in that described device also includes:
Interface module, for receiving the monitoring parameter that issues of third party device and to third party device by general-purpose interface
Report monitoring information.
10. a kind of system of two-shipper business recovery, it is characterised in that the system includes monitoring device and storage device;
The monitoring device is used for the connection status for monitoring the storage device in real time, if listening to the connection status to connect
Exception is connect, then stops standby machine business, and continue to monitor the connection status of the storage device;
If the monitoring device is additionally operable to listen to the connection status as connection recovery, start cluster clothes corresponding to main frame
Business;
The monitoring device is additionally operable to monitor the resource starting state of main frame, if the resource starting state is normal, starts
Cluster service corresponding to standby host.
11. system according to claim 10, it is characterised in that the system also includes:
Third party device, for issuing monitoring parameter to monitoring device by general-purpose interface;
The third party device is additionally operable to receive the monitoring information that monitoring device reports.
12. system according to claim 10, it is characterised in that the system also includes:
Cluster management server, the control instruction sent for receiving the monitoring device, according to the control instruction to main frame
It is controlled with standby host.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610332931.0A CN107395387A (en) | 2016-05-17 | 2016-05-17 | The methods, devices and systems of two-shipper business recovery |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610332931.0A CN107395387A (en) | 2016-05-17 | 2016-05-17 | The methods, devices and systems of two-shipper business recovery |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107395387A true CN107395387A (en) | 2017-11-24 |
Family
ID=60338820
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610332931.0A Pending CN107395387A (en) | 2016-05-17 | 2016-05-17 | The methods, devices and systems of two-shipper business recovery |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107395387A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109981459A (en) * | 2019-02-28 | 2019-07-05 | 联想(北京)有限公司 | A kind of method for sending information, client and computer readable storage medium |
CN110221949A (en) * | 2019-06-17 | 2019-09-10 | 深圳前海微众银行股份有限公司 | Automate operation management method, apparatus, equipment and readable storage medium storing program for executing |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101160794A (en) * | 2005-10-26 | 2008-04-09 | 华为技术有限公司 | Disaster recovery system and method of service controlling device in intelligent network |
JP2008287632A (en) * | 2007-05-21 | 2008-11-27 | Panasonic Corp | Control device recovery system |
CN101854253A (en) * | 2010-05-07 | 2010-10-06 | 无锡中星微电子有限公司 | Method for automatically recovering monitoring and storing and monitoring system thereof |
CN103167517A (en) * | 2011-12-14 | 2013-06-19 | 中国电信股份有限公司 | Method and system for monitoring data recovery in internet of things |
CN103500130A (en) * | 2013-09-11 | 2014-01-08 | 上海爱数软件有限公司 | Method for backing up dual-computer hot standby data in real time |
CN105024879A (en) * | 2015-07-15 | 2015-11-04 | 中国船舶重工集团公司第七0九研究所 | Virtual machine fault detection and recovery system and virtual machine detection, recovery and starting method |
-
2016
- 2016-05-17 CN CN201610332931.0A patent/CN107395387A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101160794A (en) * | 2005-10-26 | 2008-04-09 | 华为技术有限公司 | Disaster recovery system and method of service controlling device in intelligent network |
JP2008287632A (en) * | 2007-05-21 | 2008-11-27 | Panasonic Corp | Control device recovery system |
CN101854253A (en) * | 2010-05-07 | 2010-10-06 | 无锡中星微电子有限公司 | Method for automatically recovering monitoring and storing and monitoring system thereof |
CN103167517A (en) * | 2011-12-14 | 2013-06-19 | 中国电信股份有限公司 | Method and system for monitoring data recovery in internet of things |
CN103500130A (en) * | 2013-09-11 | 2014-01-08 | 上海爱数软件有限公司 | Method for backing up dual-computer hot standby data in real time |
CN105024879A (en) * | 2015-07-15 | 2015-11-04 | 中国船舶重工集团公司第七0九研究所 | Virtual machine fault detection and recovery system and virtual machine detection, recovery and starting method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109981459A (en) * | 2019-02-28 | 2019-07-05 | 联想(北京)有限公司 | A kind of method for sending information, client and computer readable storage medium |
CN110221949A (en) * | 2019-06-17 | 2019-09-10 | 深圳前海微众银行股份有限公司 | Automate operation management method, apparatus, equipment and readable storage medium storing program for executing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110224858B (en) | Log-based alarm method and related device | |
CN104065526B (en) | A kind of method and apparatus of server failure alarm | |
CN110554930B (en) | Data storage method and related equipment | |
CN103138988B (en) | Positioning treatment method and positioning treatment device of network faults | |
CN107800783B (en) | Method and device for remotely monitoring server | |
CN109508295B (en) | Block chain consensus algorithm testing method and device, calculating device and storage medium | |
CN112395156A (en) | Fault warning method and device, storage medium and electronic equipment | |
CN110209529A (en) | The guard method of radio frequency parameter and electronic equipment | |
CN111930703A (en) | Automatic log file capturing method and device and computer equipment | |
CN107395387A (en) | The methods, devices and systems of two-shipper business recovery | |
CN112713996B (en) | Block chain-based fault verification method, server and terminal | |
CN109460311A (en) | The management method and device of firmware abnormality | |
CN115102862B (en) | Automatic synchronization method and device for SDN equipment | |
JP2010147804A (en) | Transmitting apparatus, and unit mounted on the same | |
CN106406963A (en) | Initialization method and device for Linux system | |
CN113392079B (en) | Distributed storage cluster log storage optimization method, system and terminal | |
CN106559249A (en) | Check the method and device of security baseline | |
CN109445993A (en) | A kind of detection method and relevant apparatus of file system health status | |
CN110968456A (en) | Method and device for processing fault disk in distributed storage system | |
WO2021057855A1 (en) | Program process monitoring method and apparatus, computer device, and readable storage medium | |
CN112069027A (en) | Interface data processing method and device, electronic equipment and storage medium | |
CN114826884B (en) | Method, device, equipment and readable medium for positioning communication faults of cross-equipment protocol | |
CN112530139B (en) | Monitoring system, method, device, collector and storage medium | |
CN110674016A (en) | Method for processing log and positioning error information in mobile terminal, mobile terminal and monitoring device thereof and storage medium | |
CN111130926B (en) | State monitoring method, system and device suitable for encryption machine and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200624 Address after: 518057 Zhongxing building, A3-01, A3-02, Nanshan District hi tech Industrial Park, Shenzhen, Guangdong Applicant after: Shenzhen ZTE Technical Service Co.,Ltd. Address before: 518000 Zhongxing building, science and technology south road, Nanshan District hi tech Industrial Park, Guangdong, Shenzhen Applicant before: ZTE Corp. |
|
TA01 | Transfer of patent application right | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20171124 |
|
WD01 | Invention patent application deemed withdrawn after publication |