A kind of application redundancy robotization switches control design case method
Technical field
The present invention relates to disaster tolerance control and management technical field, particularly a kind of application redundancy robotization switches control design case method.
Background technology
In the information age, computer information system is more and more important to human lives, and important infosystem is all concentrated and is deployed in data center.Infosystem, through continuous service for many years, have accumulated a large amount of valuable data.The disaster that disaster and human error cause, all may cause infosystem to be paralysed, and produces massive losses.Since system disaster cannot be avoided completely, positive carries out disaster tolerance system construction, just becomes the inevitable choice of important information system.
When production system disaster occurs, important is exactly that disaster tolerance system accurately and fast completes switching, substitutes original production system, continues externally to provide service, the impact that minimizing disaster is brought and loss.In order to tackle the destruction of disaster generation to infosystem, people have done disaster tolerance construction to some key service systems.When disaster occurs, when production system can not use, disaster tolerance system just replaces production system externally to provide information service.
Be switched to disaster tolerance system from production system, relate to many-sided technical matterss such as network address switching, data consistency.Operation steps is more, and Rule of judgment is complicated, and professional operation order is more, and slip-stick artist inputs switching command one by one and easily produces mistake, and spended time is more, extends disaster tolerance system enabling time.
For the deficiency of existing disaster tolerance system, the present invention devises a kind of application redundancy robotization and switches control design case method.When system disaster occurs, replace artificial manual input by one-touch automatic switching program, allow disaster tolerance system switch automatical and efficient completing.
Summary of the invention
The present invention, in order to make up the defect of prior art, provides a kind of application redundancy robotization based on expandable container and switches control design case method.
The present invention is achieved through the following technical solutions:
A kind of application redundancy robotization switches control design case method, it is characterized in that: Disaster Recover Manager Server comprises the WEB management service software as foreground and the BM management server two parts as backstage; Wherein WEB management service software possesses displaying interface and operating function, BM management server containing backstage primary control program disaster tolerance management Server and on each controlled main frame Agent Agent, for the communication service that realizes between disaster tolerance management host and controlled main frame with transmit switching command; Manage Server at each server node deploy Agent Agent with disaster tolerance to communicate, and receive the instruction from disaster tolerance management Server.
This application redundancy robotization switches control design case method, comprises the following steps:
(1) when switch start time, open WEB management service software, and from WEB management service software page invocation disaster tolerance management Server process, when switch stop time or the time of completing, can from WEB management service software page termination disaster tolerance management Server process;
When starting to switch, WEB management service software page is sent on the server of corresponding Agent Agent towards BM management server and starts instruction prepared in advance, BM management server starts AgentJob, until switched according to instruction on corresponding Agent Agent;
(2) initialize routine checks data mode in disaster tolerance management database, and reads in initialization data, implements the data mode in more new database, keep front page layout and background data base consistance with the change of switch step;
(3) the WEB management service software page is according to the data in disaster tolerance management database, represents switching state in real time, goes wrong in switching, during state display mistake, and status data in manual modification database;
(4) in handoff procedure, Server is as the bridge between disaster tolerance management database and Agent Agent client computer in disaster tolerance management, the instruction of next step operation of Agent Agent is obtained in disaster tolerance management database, and send to Agent Agent, then obtain execution result and the state value of instruction, be updated in the tables of data of disaster tolerance management database;
(5) manage Server with disaster tolerance after AgentJob in Agent Agent client computer starts carries out alternately, manages Server transmission current state or previous action result, and obtain next step operational order to disaster tolerance; After being finished, when disaster tolerance management Server process stops, Job also can stop.
The WEB management service software page that described step (1) uses JAVA to write, by arranging startup/mute key, control the disaster tolerance management Server process defined, the WEB management service software page initiate instruction by BM management server unified distribution task, then be delivered to Agent Agent end perform switch script accordingly; Described step (2) preserves switching flow state by disaster tolerance management database, and the progress status in real-time update switching flow, keep the consistance of foreground and background data base.
In described step (3), robotization switches each ingredient in control flow, comprises production system database, production system middleware, production system WEB, disaster tolerance system database, disaster tolerance system middleware and disaster tolerance system WEB represent its state in a database.
In described step (4), create process disaster tolerance management Server and transmit bridge as the data between disaster tolerance management database and Agent Agent client proxy, in time Agent Agent state transfer in disaster tolerance management database; In described step (5), the AgentJob process in Agent Agent client computer and disaster tolerance manage Server process and all can start with the startup of a task, stop with the end of task, and releasing resource.
The invention has the beneficial effects as follows: this application redundancy robotization switches control design case method, the each node state of analytic system can be shifted to an earlier date, when a disaster occurs, by disaster tolerance switching control program analysis judgment system state, select required switching command, complete disaster tolerance in the short period of time to switch, shorten business recovery, reduce the economic loss because long-time business disaster brings, because whole handoff procedure transfers to computing machine automatically to complete, accurately, efficiently, reduce and the professional technique of maintainer is required and maintenance cost, improve work efficiency.
Accompanying drawing explanation
Accompanying drawing 1 is disaster tolerance system robotization switching flow schematic diagram of the present invention.
Accompanying drawing 2 is Disaster Recover Manager Server logical organization schematic diagram of the present invention.
Accompanying drawing 3 is disaster tolerance robotization switching command conveying flow schematic diagram of the present invention.
Accompanying drawing 4 deletes disaster tolerance system schematic flow sheet for increasing in Disaster Recover Manager Server of the present invention.
Embodiment
In order to make technical matters to be solved by this invention, technical scheme and beneficial effect clearly understand, below in conjunction with drawings and Examples, the present invention will be described in detail.It should be noted that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.
This application redundancy robotization switches control design case method, and Disaster Recover Manager Server comprises the WEB management service software as foreground and the BM management server two parts as backstage; Wherein WEB management service software possesses displaying interface and operating function, BM management server containing backstage primary control program disaster tolerance management Server and on each controlled main frame Agent Agent, for the communication service that realizes between disaster tolerance management host and controlled main frame with transmit switching command; Manage Server at each server node deploy Agent Agent with disaster tolerance to communicate, and receive the instruction from disaster tolerance management Server.
This application redundancy robotization switches control design case method, comprises the following steps:
(1) when switch start time, open WEB management service software, and from WEB management service software page invocation disaster tolerance management Server process, when switch stop time or the time of completing, can from WEB management service software page termination disaster tolerance management Server process;
When starting to switch, WEB management service software page is sent on the server of corresponding Agent Agent towards BM management server and starts instruction prepared in advance, BM management server starts AgentJob, until switched according to instruction on corresponding Agent Agent;
(2) initialize routine checks data mode in disaster tolerance management database, and reads in initialization data, implements the data mode in more new database, keep front page layout and background data base consistance with the change of switch step;
(3) the WEB management service software page is according to the data in disaster tolerance management database, represents switching state in real time, goes wrong in switching, during state display mistake, and status data in manual modification database;
(4) in handoff procedure, Server is as the bridge between disaster tolerance management database and Agent Agent client computer in disaster tolerance management, the instruction of next step operation of Agent Agent is obtained in disaster tolerance management database, and send to Agent Agent, then obtain execution result and the state value of instruction, be updated in the tables of data of disaster tolerance management database;
(5) manage Server with disaster tolerance after AgentJob in Agent Agent client computer starts carries out alternately, manages Server transmission current state or previous action result, and obtain next step operational order to disaster tolerance; After being finished, when disaster tolerance management Server process stops, Job also can stop.
The WEB management service software page that described step (1) uses JAVA to write, by arranging startup/mute key, control the disaster tolerance management Server process defined, the WEB management service software page initiate instruction by BM management server unified distribution task, then be delivered to Agent Agent end perform switch script accordingly; Described step (2) preserves switching flow state by disaster tolerance management database, and the progress status in real-time update switching flow, keep the consistance of foreground and background data base.
In described step (3), robotization switches each ingredient in control flow, comprises production system database, production system middleware, production system WEB, disaster tolerance system database, disaster tolerance system middleware and disaster tolerance system WEB represent its state in a database.
In described step (4), create process disaster tolerance management Server and transmit bridge as the data between disaster tolerance management database and Agent Agent client proxy, in time Agent Agent state transfer in disaster tolerance management database; In described step (5), the AgentJob process in Agent Agent client computer and disaster tolerance manage Server process and all can start with the startup of a task, stop with the end of task, and releasing resource.
WEB management service software (foreground) possesses displaying interface and operating function, when production system system generation disaster, when needing to carry out disaster tolerance switching, operation maintenance personnel logs in WEB management service software interface, switch from WEB management service software page millet cake hits, start switching flow.After switching flow is opened, the WEB management service software page can show switching progress, and switching the display of Host Status, disaster tolerance system database update state and the WEB management service software page can automatic synchronization.Disaster recovery and backup systems keeper can check switching progress, and whether whole flow process has switched.By monitoring handoff procedure in real time.When switching is broken down, need manual operation intervention, the interactive entrance that the WEB management service software page provides manual operation to control and reference instruction.When switching completes or need to stop in advance, blocked operation flow process can be stopped.
BM management server (backstage) manages Server and Agent Agent on each controlled main frame containing backstage primary control program disaster tolerance, can realize the communication service between disaster tolerance management host and controlled main frame and transmit switching command.
Combing service logic and write application server, disaster tolerance system database server, interface server institute bearer service startup and close script.
Manage Server at each server node deploy Agent Agent with disaster tolerance to communicate, and can receive the instruction to disaster tolerance management Server.
Switching command conveying flow mentality of designing, fill order is initiated from WEB management service software interface, pass to BM management server background service process, connected by the Agent Agent service of background service process and host node, and send instructions and pass to the Agent Agent service of host node, initiate a Job task by Agent Agent, call and can perform script, executing state can be returned after script is complete to BM management server.
This Disaster Recover Manager Server is designed to support to manage many cover disaster tolerance systems simultaneously.Increase in Disaster Recover Manager Server and delete operation system, adopt Excel as visual authoring tool, by perl script, the content of regularly writing inside Exel is imported in database table, change the data of backstage disaster tolerance system database.Can perform by create.vbs the data that script reads background data base, generate create.js script file.Create.js file can be called when the display of WEB management service software interface, in the webpage representation excel file on foreground, write the business tine of change.
Disaster tolerance system database can adopt all kinds of total relation type database, by define system Basic Information Table, and Host Status table, main frame Basic Information Table, state updating table, operation log recording table.Associated by major key between each table.