CN105574590A - Adaptive general control disaster recovery switching device and system, and signal generation method - Google Patents
- Publication number
- CN105574590A (CN201510999459.1A)
- Authority
- CN
- China
- Prior art keywords
- fault
- data
- analysis
- production center
- fault data
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2035—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/027—Frames
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Databases & Information Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Hardware Redundancy (AREA)
Abstract
The invention discloses an adaptive general control disaster recovery switching device. The device comprises a fault data processing unit, a fault inference and judgment unit, and a switching signal emission unit. The fault data processing unit acquires operation state data from each production center, extracts fault data, stores the data by category, and obtains fault characteristic data by analyzing the fault data. The fault inference and judgment unit performs inference on the fault characteristic data using a knowledge base and obtains a fault switching judgment. The switching signal emission unit sends a switching control command to each production center according to the fault switching judgment and a manual command. The invention also discloses an adaptive general control disaster recovery switching system and an adaptive general control disaster recovery switching signal generation method. Switching signals are generated rapidly and accurately by employing intelligent technology, and adaptive switching of the system is realized.
Description
Technical field
The present invention relates to the fields of data center disaster recovery and intelligent control, and in particular to an adaptive general control disaster recovery switching device, system, and signal generation method.
Background art
With the trend toward data centralization, many enterprises and institutions have built their own data centers. Centralization brings great advantages but also concentrates risk, so the security and reliability of the data center become particularly important. Redundancy is an important way to ensure reliability. However, redundancy also makes data center construction more complex. Detecting data center faults promptly and accurately, providing sound expert advice, and switching seamlessly to a standby center are the keys to keeping business running continuously.
Traditional disaster recovery solutions follow three modes: "same-city disaster recovery", "remote disaster recovery", and "same-city plus remote disaster recovery". In the same-city mode, the disaster recovery center and the production center are in the same city, and either synchronous or asynchronous backup can be used. This mode has the lowest investment cost, the fastest disaster recovery speed, and a high degree of data protection, but it cannot cope with regional disasters. In the remote mode, the disaster recovery center and the production center are in different cities. Generally only asynchronous backup is possible, the investment cost is higher, and recovery speed and data protection are somewhat lower; the advantage is that regional disaster risk can be handled. The same-city plus remote mode combines the two modes above. It has the highest investment cost but the advantages of both. It can be implemented in two ways: either a same-city disaster recovery center is set up first and a remote disaster recovery center then backs up the same-city center, or the same-city center and the remote center each back up the production center independently. However, these three modes either fail to consider regional force majeure events (fire, power failure, earthquake) or use a single, simple switching mode that only switches from the production center to the standby center and ignores the security of the standby center. Once a disaster occurs, it is therefore difficult to meet the disaster recovery backup and switching requirements of a highly reliable, highly available data center.
In a newer disaster recovery solution with multiple production centers, the centers back each other up and each carries business independently, which greatly raises the disaster recovery level. Previous research on switching among multiple production centers has focused mostly on how the switching steps between production centers are carried out after a disaster occurs, and has seldom considered applying intelligent techniques to generate switching signals quickly and accurately so that the system can switch adaptively.
Monitoring systems are widely used for system security and system maintenance in civil aviation information systems. The alarm information produced by the monitored applications contains a large amount of useful information, but this information can only be uncovered through in-depth analysis. Most current monitoring platforms focus on the unified collection and storage of alarm information. Their ability to process and analyze alarm information is weak, and they cannot collect alarm data from multiple application systems at the same time. Faced with massive alarm data, they cannot accurately locate the source of a fault, so the information that maintenance personnel care about is often buried in a large amount of ordinary information. As a result, the platform cannot support early judgment or even timely handling in system maintenance, which makes maintenance work heavy and arduous.
Summary of the invention
To solve the existing technical problems, embodiments of the present invention provide an adaptive general control disaster recovery switching device, system, and signal generation method.
To achieve the above object, the technical solutions of the embodiments of the present invention are implemented as follows:
An adaptive general control disaster recovery switching device, comprising a fault data processing unit, a fault inference and judgment unit, and a switching signal emission unit, wherein:
the fault data processing unit is configured to collect fault data from each production center, store the fault data by category, analyze it, and obtain fault characteristic data;
the fault inference and judgment unit is configured to obtain a failover suggestion from the fault characteristic data through knowledge base inference;
the switching signal emission unit is configured to send a switching control command to each production center according to the failover suggestion and a manual command.
The fault data processing unit comprises a fault data collection module, configured to collect the fault data of each production center and store the fault data by category.
The fault data collection module is specifically configured to obtain the fault data of its production center through an agent program installed at each production center, to monitor the running status of the other production centers through a heartbeat detection device, and to collect the fault data of the other production centers.
The fault data collection module is specifically configured to store the fault data by category according to the application subsystem to which the data belongs.
The fault data processing unit further comprises a failure analysis module, configured to perform fault analysis separately on the fault data stored for each application subsystem and to perform fault association analysis across the application subsystems, obtaining fault characteristic data.
The failure analysis module comprises a single-system fault analysis submodule, configured to perform fault analysis separately on the fault data stored for each application subsystem, obtaining fault characteristic data.
The failure analysis module further comprises an associated-system fault analysis submodule, configured to perform fault association analysis across the application subsystems, obtaining fault characteristic data.
The fault data processing unit further comprises a fault characteristic database, configured to store the fault characteristic data.
The fault inference and judgment unit comprises a knowledge base and a knowledge base inference module. The knowledge base describes the logic for processing knowledge and solving problems. The knowledge base inference module is configured to perform knowledge base inference on the fault characteristic data with the knowledge base as background data, obtain a failover suggestion in combination with a preset switching strategy, and send the suggestion to the switching signal emission unit.
The switching signal emission unit comprises a transfer switching control module, configured to send a switching control command to each production center after the failover suggestion has gone through manual intervention and confirmation.
An adaptive general control disaster recovery switching system, comprising at least two production centers, a heartbeat detection device, and the adaptive general control disaster recovery switching device according to any one of claims 1 to 11, wherein each production center is connected to the adaptive general control disaster recovery switching device, and the production centers are connected to one another through the heartbeat detection device.
The production center comprises a status monitoring server and an access server.
The status monitoring server is configured to monitor the running status of the production center in real time through an agent program and to send the fault data of the production center to the adaptive general control disaster recovery switching device.
The access server is configured to wait for the switching control command sent by the adaptive general control disaster recovery switching device and to perform the corresponding failover operation.
The production center further comprises a web cluster, a database (DB) cluster, and a central node.
The heartbeat detection device is configured to monitor the running status of the production centers in real time and to send the fault data of the production centers to the adaptive general control disaster recovery switching device.
An adaptive general control disaster recovery switching signal generation method, comprising:
the fault data processing unit collects the fault data of each production center, stores the fault data by category, analyzes it, and obtains fault characteristic data;
the fault inference and judgment unit obtains a failover suggestion from the fault characteristic data through knowledge base inference;
the switching signal emission unit sends a switching control command to each production center according to the failover suggestion and a manual command.
The fault data collection module of the fault data processing unit collects the fault data of each production center and stores the fault data by category.
The fault data collection module obtains the running state data of its production center through the status monitoring server installed at each production center and obtains the running state data of the other production centers through the heartbeat detection device.
The fault data collection module stores the fault data by category according to the application subsystem to which the data belongs.
The failure analysis module of the fault data processing unit performs fault analysis separately on the fault data of each application subsystem and performs fault association analysis across the application subsystems, obtaining fault characteristic data.
The step in which the failure analysis module performs fault analysis separately on the fault data of each application subsystem and obtains fault characteristic data comprises: the single-system fault analysis submodule of the failure analysis module performs fault analysis separately on the fault data stored for each application subsystem, obtaining fault characteristic data.
The step in which the failure analysis module performs fault association analysis across the application subsystems and obtains fault characteristic data comprises: the associated-system fault analysis submodule of the failure analysis module performs fault association analysis across the application subsystems, obtaining fault characteristic data.
The method further comprises: saving the fault characteristic data to the fault characteristic database of the fault data processing unit.
Knowledge base inference is performed on the fault characteristic data with the knowledge base as background data, a failover suggestion is obtained in combination with a preset switching strategy, and the suggestion is sent to the switching signal emission unit. The knowledge base describes the logic for processing knowledge and solving problems.
After the failover suggestion has gone through manual intervention and confirmation, a switching control command is sent to each production center.
With the adaptive general control disaster recovery switching device, system, and signal generation method of the embodiments of the present invention, when a production center becomes abnormal, the adaptive general control disaster recovery switching system starts automatically and generates switching control commands that direct the production centers to perform failover, so that a normal production center can adaptively take over the users of the abnormal production center. Intelligent techniques are thus applied to generate switching signals quickly and accurately and to switch the system adaptively. This reduces the degree of human involvement: machine intelligence carries out the intelligent processing otherwise done by human experts and provides expert switching suggestions in time. In addition, alarm data from multiple application systems can be collected at the same time, the source of a fault can be located accurately even in massive alarm data, and system maintenance benefits from early judgment and timely handling.
Brief description of the drawings
In the drawings, which are not necessarily drawn to scale, similar reference numerals may describe similar components in different views. Similar reference numerals with different letter suffixes may represent different instances of similar components. The drawings illustrate the embodiments discussed herein by way of example and not limitation.
Fig. 1 is a schematic structural diagram of the adaptive general control disaster recovery switching device according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of the adaptive general control disaster recovery switching system according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of the adaptive general control disaster recovery switching device according to an embodiment of the present invention;
Fig. 4 is a schematic flowchart of the adaptive general control disaster recovery switching signal generation method according to an embodiment of the present invention.
Detailed description of the embodiments
The technical problem to be solved by the present invention is to propose an adaptive, intelligent, heartbeat-triggered general control disaster recovery switching signal mechanism for multiple production centers. Using machine learning and an expert system, it generates switching signals more intelligently and quickly, with more accurate expert opinions, and better improves the ability of the data center to keep business running continuously. The adaptive general control disaster recovery switching device of the embodiment acts directly on each production center: it obtains the running state data of each production center and sends switching control commands to the access server of each production center to perform the corresponding switch.
As shown in Fig. 1, the adaptive general control disaster recovery switching device of the embodiment comprises a fault data processing unit 11, a fault inference and judgment unit 12, and a switching signal emission unit 13.
The fault data processing unit 11 uses agent technology to collect the running state data of each production center, extracts the fault data from it, and stores the fault data by category. It then applies fault data processing algorithms to analyze the fault data further and obtain fault characteristic data.
The fault inference and judgment unit 12 uses the knowledge base as its data source to perform knowledge base inference on the fault characteristic data and, in combination with the established switching strategy, obtains the final failover suggestion, which may also be called switching control expert advice. At the same time, it uses historical switching control experience as the learning set and applies machine learning to revise the knowledge base through knowledge revision.
The switching signal emission unit 13 has two parts. One part directly accepts external manual commands and sends them to each production center so that the production center performs failover. The other part sends switching control commands to each production center after the failover suggestion has gone through manual intervention and confirmation, so that the production center performs failover when a fault occurs.
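The knowledge revision performed by the fault inference and judgment unit 12 can be illustrated with a minimal sketch. The text does not specify a learning technique, so the example below substitutes a simple smoothed frequency estimate: for each rule, it estimates how often operators confirmed the suggestions that rule produced. The function name `revise_rule_confidence`, the rule names, and the prior values are hypothetical.

```python
def revise_rule_confidence(history, prior=0.5, prior_weight=2):
    """Estimate, per rule, how often operators confirmed its suggestions.

    `history` is a list of (rule_name, confirmed) pairs taken from past
    switching decisions. The smoothing prior is an assumption, not part of
    the described method.
    """
    counts = {}
    for rule, confirmed in history:
        ok, total = counts.get(rule, (0, 0))
        counts[rule] = (ok + int(confirmed), total + 1)
    return {
        rule: (ok + prior * prior_weight) / (total + prior_weight)
        for rule, (ok, total) in counts.items()
    }


# Hypothetical learning set built from historical switching control experience.
history = [
    ("heartbeat-lost", True),
    ("heartbeat-lost", True),
    ("single-subsystem", False),
    ("heartbeat-lost", False),
]
confidence = revise_rule_confidence(history)
```

Rules whose suggestions are rarely confirmed end up with a low confidence value and could then be reviewed or demoted in the knowledge base.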
The adaptive general control disaster recovery switching signal mechanism of the embodiment is described in detail below with reference to the drawings and embodiments. It should be understood that the preferred examples described below are only intended to illustrate and explain the present invention, not to limit it. Where no conflict arises, the embodiments of the present invention and the features in the embodiments may be combined with one another.
The embodiment of the present invention is a heartbeat-triggered adaptive general control disaster recovery switching system for multiple production centers, based on a multi-production-center mode. Fig. 2 shows the structure of the system as deployed across multiple production centers. It consists of four parts in two broad classes. Three of the parts are production centers with identical planning and configuration: a first production center, a second production center, and a third production center. Each production center comprises a web cluster, a database (DB) cluster, an access server, a status monitoring server, a central node, and so on. Redundant heartbeat detection devices are deployed between the production centers. The heartbeat detection devices and the status monitoring servers monitor the running status of the production centers in real time and send fault data to the adaptive general control disaster recovery switching device. The access server waits for the switching signal (that is, the switching control command described below) generated by the adaptive general control disaster recovery switching system and performs the corresponding failover operation. The adaptive general control disaster recovery switching system generates the switching signal from the fault data it obtains; the specific generation method is described in detail below.
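The heartbeat detection between production centers can be sketched as a timeout check. This is a minimal illustration under stated assumptions: the class name `HeartbeatMonitor`, the 3-second timeout, and the fault record format are hypothetical and are not specified in the text.

```python
from dataclasses import dataclass, field


@dataclass
class HeartbeatMonitor:
    """Tracks the last heartbeat seen from each peer production center."""

    timeout_s: float = 3.0  # assumed threshold; the text does not give one
    last_seen: dict = field(default_factory=dict)

    def beat(self, center: str, now: float) -> None:
        """Record a heartbeat received from `center` at time `now`."""
        self.last_seen[center] = now

    def fault_data(self, now: float) -> list:
        """Return one fault record per center whose heartbeat has timed out."""
        return [
            {"center": c, "type": "heartbeat_timeout", "silent_for_s": now - t}
            for c, t in sorted(self.last_seen.items())
            if now - t > self.timeout_s
        ]


monitor = HeartbeatMonitor()
monitor.beat("center-1", now=0.0)
monitor.beat("center-2", now=0.0)
monitor.beat("center-1", now=4.0)   # center-2 stays silent
faults = monitor.fault_data(now=5.0)
```

In this sketch the timed-out records play the role of the fault data that the heartbeat detection device sends to the switching device.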
Here, the web cluster consists of multiple servers running the same web application simultaneously. The database (DB) cluster uses two or more database servers to form a virtual single-database logical image that, like a single database system, provides transparent data services to clients. The central node can be regarded as the computer on which each application is installed.
Fig. 3 shows the specific structure of the adaptive general control disaster recovery switching device and the process of generating a switching signal from fault data. The device as a whole is divided into three parts: the fault data processing unit 11, the fault inference and judgment unit 12, and the switching signal emission unit 13.
The fault data processing unit 11 comprises a fault collection module and a failure analysis module. The fault collection module collects the fault data of each production center and stores it by category. Specifically, the fault collection module collects two classes of fault data: the fault data of its own production center, obtained by the status monitoring server as shown in Fig. 1, and the fault data of other production centers, obtained by the heartbeat detection device. It stores the fault data by category according to the application subsystem to which the data belongs.
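The classified storage by application subsystem can be sketched as a simple grouping step. The record fields (`source`, `center`, `subsystem`, `msg`) and the function name `classify_by_subsystem` are hypothetical; an in-memory dictionary stands in for whatever storage an actual implementation would use.

```python
from collections import defaultdict


def classify_by_subsystem(records):
    """Group fault records by the application subsystem they belong to."""
    store = defaultdict(list)
    for rec in records:
        store[rec["subsystem"]].append(rec)
    return dict(store)


# Two classes of input: records from the local agent and from heartbeat detection.
records = [
    {"source": "agent", "center": "center-1", "subsystem": "web", "msg": "HTTP 503"},
    {"source": "heartbeat", "center": "center-2", "subsystem": "db", "msg": "no heartbeat"},
    {"source": "agent", "center": "center-1", "subsystem": "db", "msg": "replication lag"},
]
classified = classify_by_subsystem(records)
```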
The failure analysis module performs fault analysis separately on the fault data stored for each application subsystem and performs fault association analysis across the application subsystems, thereby obtaining fault characteristic data. Specifically, a distributed fault analysis subsystem is built into the failure analysis module. It comprises a single-system fault analysis submodule and an associated-system fault analysis submodule. The single-system fault analysis submodule performs fault analysis separately on the fault data stored for each application subsystem and obtains fault characteristic data. The associated-system fault analysis submodule performs fault association analysis across the application subsystems and likewise obtains fault characteristic data. The fault data processing unit saves the resulting fault characteristic data to the fault characteristic database.
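The two kinds of analysis can be illustrated as follows. The text does not name specific analysis algorithms, so this sketch makes simple assumptions: single-system analysis counts faults per production center within each subsystem, and association analysis finds centers with faults in more than one subsystem. The function names and record fields are hypothetical.

```python
from collections import Counter


def single_system_features(classified):
    """Per-subsystem analysis: count faults per production center."""
    return {
        subsystem: dict(Counter(rec["center"] for rec in recs))
        for subsystem, recs in classified.items()
    }


def associated_features(classified):
    """Cross-subsystem analysis: centers with faults in more than one subsystem."""
    subsystems_by_center = {}
    for subsystem, recs in classified.items():
        for rec in recs:
            subsystems_by_center.setdefault(rec["center"], set()).add(subsystem)
    return {c: sorted(s) for c, s in subsystems_by_center.items() if len(s) > 1}


classified = {
    "web": [{"center": "center-1"}, {"center": "center-1"}],
    "db": [{"center": "center-1"}, {"center": "center-2"}],
}
single = single_system_features(classified)
related = associated_features(classified)
```

Here `single` and `related` together stand in for the fault characteristic data that would be saved to the fault characteristic database.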
The fault inference and judgment unit relies mainly on expert system knowledge. With the knowledge base as background data, it performs knowledge base inference on the fault characteristic data obtained by the fault data processing unit. The knowledge base describes the logic for processing knowledge and solving problems, including setting-value adjustment logic for various kinds of equipment, fault location logic, and the logical relationships among decision conditions. In combination with the established switching strategy, the unit obtains the final failover suggestion and sends it to the transfer switching control module of the switching signal emission unit.
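Knowledge base inference combined with a switching strategy can be sketched as a small first-match rule engine. The rules, the feature fields, and the strategy of choosing the first healthy center are all illustrative assumptions; the text does not list the actual rules or strategy.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Rule:
    name: str
    condition: Callable[[dict], bool]
    conclusion: str  # "switch" or "observe" (hypothetical outcomes)


# Hypothetical knowledge base: ordered decision-condition rules.
KNOWLEDGE_BASE = [
    Rule("db-and-web-down",
         lambda f: {"db", "web"} <= set(f["failed_subsystems"]), "switch"),
    Rule("heartbeat-lost", lambda f: f["heartbeat_lost"], "switch"),
    Rule("single-subsystem",
         lambda f: len(f["failed_subsystems"]) == 1, "observe"),
]


def infer(features: dict, healthy_centers: list) -> Optional[dict]:
    """Return a failover suggestion from the first matching rule, if any."""
    for rule in KNOWLEDGE_BASE:
        if rule.condition(features):
            suggestion = {"center": features["center"], "rule": rule.name,
                          "action": rule.conclusion}
            if rule.conclusion == "switch":
                # Assumed switching strategy: take over on the first healthy center.
                suggestion["target"] = healthy_centers[0] if healthy_centers else None
            return suggestion
    return None


severe = infer({"center": "center-1", "failed_subsystems": ["db", "web"],
                "heartbeat_lost": False}, ["center-2", "center-3"])
minor = infer({"center": "center-1", "failed_subsystems": ["web"],
               "heartbeat_lost": False}, ["center-2"])
nothing = infer({"center": "center-1", "failed_subsystems": [],
                 "heartbeat_lost": False}, ["center-2"])
```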
The switching signal emission unit comprises a transfer switching control module. After the failover suggestion obtained by the fault inference and judgment unit has gone through manual intervention and confirmation, the module sends a switching control command to the access server of each production center. The module also receives manually entered commands and sends each manual command, as a switching control command, to the access server of each production center. The switching control commands therefore come from two sources: manual commands entered by an administrator, and commands converted from the failover suggestions obtained by the fault inference and judgment unit.
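The two command paths of the transfer switching control module can be sketched as follows. The class name `TransferSwitchingControl`, the command fields, and the use of in-memory lists as stand-ins for access servers are hypothetical.

```python
class TransferSwitchingControl:
    """Sends switching control commands to the access server of each center."""

    def __init__(self, access_servers):
        # Maps center name -> list used here as a stand-in for an access server inbox.
        self.access_servers = access_servers

    def _send(self, command):
        for inbox in self.access_servers.values():
            inbox.append(command)
        return command

    def manual_command(self, command):
        """Path 1: forward an administrator's manual command directly."""
        return self._send({"origin": "manual", **command})

    def from_suggestion(self, suggestion, confirmed_by=None):
        """Path 2: convert a failover suggestion only after manual confirmation."""
        if confirmed_by is None:
            return None  # not confirmed, so nothing is sent
        return self._send({
            "origin": "inference",
            "confirmed_by": confirmed_by,
            "action": suggestion["action"],
            "source": suggestion["center"],
            "target": suggestion["target"],
        })


servers = {"center-1": [], "center-2": []}
ctl = TransferSwitchingControl(servers)
suggestion = {"action": "switch", "center": "center-1", "target": "center-2"}
unconfirmed = ctl.from_suggestion(suggestion)
confirmed = ctl.from_suggestion(suggestion, confirmed_by="operator-a")
manual = ctl.manual_command({"action": "switch", "source": "center-2", "target": "center-1"})
```

Keeping confirmation as an explicit argument makes it hard to emit an inferred command without an operator being recorded.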
As shown in Fig. 4, the adaptive intelligent general control disaster recovery switching signal generation method of the embodiment, that is, the flow by which the adaptive general control disaster recovery switching system generates a switching signal, may comprise the following steps:
Step 401: the fault data processing unit collects the fault data of each production center through the agent mechanism and the heartbeat detection mechanism, and obtains fault characteristic data through classified storage and analysis of the fault data;
Step 402: the fault inference and judgment unit takes the fault characteristic data obtained in step 401 as input and, using the knowledge base, obtains a failover suggestion through knowledge base inference;
Step 403: the switching signal emission unit, based on the failover suggestion and manual commands, sends switching control commands to each production center through the transfer switching control module.
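Steps 401 to 403 can be sketched end to end as three small functions. Everything below is an illustrative assumption: the record fields, the threshold of two failed subsystems, the choice of the first fault-free center as the takeover target, and the `confirm` callback that stands in for manual intervention and confirmation.

```python
def process_fault_data(raw_records):
    """Step 401: keep fault records, classify by subsystem, derive features."""
    classified = {}
    for rec in raw_records:
        if rec["status"] != "ok":
            classified.setdefault(rec["subsystem"], []).append(rec)
    failed = {}
    for subsystem, recs in classified.items():
        for rec in recs:
            failed.setdefault(rec["center"], set()).add(subsystem)
    return failed  # center -> set of failed subsystems


def infer_suggestions(features, all_centers, min_failed=2):
    """Step 402: suggest failover for centers with enough failed subsystems."""
    healthy = [c for c in all_centers if c not in features]
    return [
        {"source": center, "target": healthy[0]}
        for center in sorted(features)
        if len(features[center]) >= min_failed and healthy
    ]


def emit_commands(suggestions, confirm, manual_commands=()):
    """Step 403: pass on manual commands and confirmed suggestions."""
    commands = list(manual_commands)
    commands += [dict(s, action="failover") for s in suggestions if confirm(s)]
    return commands


raw = [
    {"center": "center-1", "subsystem": "web", "status": "down"},
    {"center": "center-1", "subsystem": "db", "status": "down"},
    {"center": "center-2", "subsystem": "web", "status": "ok"},
    {"center": "center-3", "subsystem": "db", "status": "ok"},
]
features = process_fault_data(raw)
suggestions = infer_suggestions(features, ["center-1", "center-2", "center-3"])
commands = emit_commands(suggestions, confirm=lambda s: True)
```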
The fault data collection module of the fault data processing unit collects the fault data of each production center and stores the fault data by category. The fault data collection module obtains the fault data of its production center through the agent installed at each production center and obtains the fault data of the other production centers through the heartbeat detection device. The fault data collection module stores the fault data by category according to the application subsystem to which the data belongs.
The failure analysis module of the fault data processing unit performs fault analysis separately on the fault data stored for each application subsystem and performs fault association analysis across the application subsystems, obtaining fault characteristic data. Specifically, the single-system fault analysis submodule of the failure analysis module performs fault analysis separately on the fault data stored for each application subsystem and obtains fault characteristic data; the associated-system fault analysis submodule of the failure analysis module performs fault association analysis across the application subsystems and obtains fault characteristic data.
The method further comprises: saving the fault characteristic data to the fault characteristic database of the fault data processing unit.
Knowledge base inference is performed on the fault characteristic data with the knowledge base as background data, a failover suggestion is obtained in combination with a preset switching strategy, and the suggestion is sent to the switching signal emission unit. The knowledge base describes the logic for processing knowledge and solving problems.
The transfer switching control module of the switching signal emission unit receives a manual command and sends it to each production center as a switching control command. After the failover suggestion has gone through manual intervention and confirmation, the module sends a switching control command to each production center.
Compared with the prior art, the beneficial effects of the present invention are as follows:
In the embodiments of the present invention there are multiple production centers, all of which normally serve users. When an abnormality occurs, the adaptive general control disaster recovery switching system starts automatically, and a normal production center takes over the users of the abnormal production center;
In the embodiments of the present invention, each production center not only passes its own running state data to the adaptive general control disaster recovery switching device but also, through heartbeat detection, passes the running state data of the other production centers to the device;
In the embodiments of the present invention, the adaptive general control disaster recovery switching device introduces expert system inference: it analyzes and reasons over the fault data and performs switching in accordance with the operating procedures of the centers;
In the embodiments of the present invention, the adaptive general control disaster recovery switching device introduces machine learning: by continually learning from historical inference results, it adaptively revises the knowledge base;
In the embodiments of the present invention, the adaptive general control disaster recovery switching device reduces the degree of human involvement: machine intelligence carries out the intelligent processing otherwise done by human experts and provides expert switching suggestions in time.
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Therefore, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage and optical storage) containing computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce a computer-implemented process, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
The above are only preferred embodiments of the present invention and are not intended to limit the protection scope of the present invention.
Claims (24)
1. An adaptive general control disaster recovery switching device, characterized in that the device comprises a fault data processing unit, a fault reasoning judging unit, and a switching signal issue unit, wherein:
the fault data processing unit is configured to collect the fault data of each production center, and to classify, store, and analyze the fault data to obtain fault signature data;
the fault reasoning judging unit is configured to obtain a failover suggestion from the fault signature data by knowledge-base analysis and reasoning;
the switching signal issue unit is configured to send a switching control instruction to each production center according to the failover suggestion and a manual command.
2. The device according to claim 1, characterized in that the fault data processing unit comprises a fault data collection module, configured to collect the fault data of each production center and to classify and store the fault data.
3. The device according to claim 2, characterized in that the fault data collection module is specifically configured to obtain the fault data of a production center through an Agent program installed at each production center, to monitor the running status of the other production centers through a heartbeat detection apparatus, and to collect the fault data of the other production centers.
4. The device according to claim 3, characterized in that the fault data collection module is specifically configured to classify and store the fault data according to the application subsystem to which the fault data belongs.
5. The device according to claim 2, characterized in that the fault data processing unit further comprises a failure analysis module, configured to perform fault analysis separately on the fault data stored for each application subsystem, and to perform fault association analysis across the application subsystems, to obtain fault signature data.
6. The device according to claim 5, characterized in that the failure analysis module comprises a single-system fault analysis submodule, configured to perform fault analysis separately on the fault data stored for each application subsystem to obtain fault signature data.
7. The device according to claim 6, characterized in that the failure analysis module further comprises an associated-system fault analysis submodule, configured to perform fault association analysis across the application subsystems to obtain fault signature data.
8. The device according to claim 1, characterized in that the fault data processing unit further comprises a fault signature database for storing the fault signature data.
9. The device according to claim 1, characterized in that the fault reasoning judging unit comprises a knowledge base and a knowledge-base analysis and reasoning module; the knowledge base describes the knowledge processing and resolution logic; the knowledge-base analysis and reasoning module is configured to perform knowledge-base analysis and reasoning on the fault signature data with the knowledge base as background data, to obtain a failover suggestion in combination with the preset switching policy, and to send the failover suggestion to the switching signal issue unit.
10. The device according to claim 1, characterized in that the switching signal issue unit comprises a transfer switching control module, configured to send the switching control instruction to each production center after the failover suggestion has undergone manual intervention and confirmation.
11. An adaptive general control disaster recovery switching system, characterized in that the system comprises at least two production centers, a heartbeat detection apparatus, and the adaptive general control disaster recovery switching device according to any one of claims 1 to 11; each production center is connected to the adaptive general control disaster recovery switching device, and the production centers are connected to one another through the heartbeat detection apparatus.
12. The system according to claim 11, characterized in that the production center comprises a status monitoring server and an access server;
the status monitoring server is configured to monitor the running status of the production center in real time through an Agent program, and to send the fault data of the production center to the adaptive general control disaster recovery switching device;
the access server is configured to wait for the switching control instruction sent by the adaptive general control disaster recovery switching device and to carry out the corresponding failover operation.
13. The system according to claim 12, characterized in that the production center further comprises a WEB cluster, a database (DB) cluster, and a Centroid.
14. The system according to claim 11 or 12, characterized in that the heartbeat detection apparatus is configured to monitor the running status of the production centers in real time and to send the fault data of the production centers to the adaptive general control disaster recovery switching device.
15. A method for generating an adaptive general control disaster recovery switching signal, characterized in that the method comprises:
a fault data processing unit collects the fault data of each production center, and classifies, stores, and analyzes the fault data to obtain fault signature data;
a fault reasoning judging unit obtains a failover suggestion from the fault signature data by knowledge-base analysis and reasoning;
a switching signal issue unit sends a switching control instruction to each production center according to the failover suggestion and a manual command.
16. The method according to claim 15, characterized in that the fault data collection module of the fault data processing unit collects the fault data of each production center and classifies and stores the fault data.
17. The method according to claim 16, characterized in that the fault data collection module obtains the running state data of a production center through the status monitoring server installed at each production center, and obtains the running state data of the other production centers through a heartbeat detection apparatus.
18. The method according to claim 17, characterized in that the fault data collection module classifies and stores the fault data according to the application subsystem to which the fault data belongs.
19. The method according to claim 16, characterized in that the failure analysis module of the fault data processing unit performs fault analysis separately on the fault data of each application subsystem, and performs fault association analysis across the application subsystems, to obtain fault signature data.
20. The method according to claim 19, characterized in that the failure analysis module performing fault analysis separately on the fault data of each application subsystem to obtain fault signature data comprises:
the single-system fault analysis submodule of the failure analysis module performs fault analysis separately on the fault data stored for each application subsystem to obtain fault signature data.
21. The method according to claim 19, characterized in that the failure analysis module performing fault association analysis across the application subsystems to obtain fault signature data is: the associated-system fault analysis submodule of the failure analysis module performs fault association analysis across the application subsystems to obtain fault signature data.
22. The method according to claim 15, characterized in that the method further comprises: saving the fault signature data to the fault signature database of the fault data processing unit.
23. The method according to claim 15, characterized in that:
knowledge-base analysis and reasoning is performed on the fault signature data with the knowledge base as background data, a failover suggestion is obtained in combination with the preset switching policy, and the failover suggestion is sent to the switching signal issue unit; the knowledge base describes the knowledge processing and resolution logic.
24. The method according to claim 15, characterized in that the switching control instruction is sent to each production center after the failover suggestion has undergone manual intervention and confirmation.
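As an illustration of the data path recited in claims 18 to 21 (classification by application subsystem, per-subsystem analysis, then association analysis across subsystems), a minimal sketch follows. The record fields (`subsystem`, `type`, `ts`), the fault-count features, and the time-window test for association are assumptions chosen for the example; the claims do not specify them.

```python
from collections import defaultdict


def classify_by_subsystem(fault_records):
    """Group raw fault records by the application subsystem they belong to."""
    store = defaultdict(list)
    for record in fault_records:
        store[record["subsystem"]].append(record)
    return dict(store)


def single_system_analysis(store):
    """Per subsystem, count faults by type (a stand-in for real fault analysis)."""
    features = {}
    for subsystem, records in store.items():
        counts = defaultdict(int)
        for record in records:
            counts[record["type"]] += 1
        features[subsystem] = dict(counts)
    return features


def association_analysis(store, window_s):
    """Return subsystem pairs that had faults within window_s seconds of each other."""
    names = sorted(store)
    linked = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if any(abs(ra["ts"] - rb["ts"]) <= window_s
                   for ra in store[a] for rb in store[b]):
                linked.append((a, b))
    return linked


records = [
    {"subsystem": "web", "type": "timeout", "ts": 100},
    {"subsystem": "db", "type": "conn_refused", "ts": 103},
    {"subsystem": "web", "type": "timeout", "ts": 400},
    {"subsystem": "billing", "type": "disk_full", "ts": 900},
]
store = classify_by_subsystem(records)
features = single_system_analysis(store)
linked = association_analysis(store, window_s=5)
```

Here the per-subsystem counts and the linked pairs together play the role of the fault signature data handed to the reasoning step.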
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510999459.1A CN105574590A (en) | 2015-12-28 | 2015-12-28 | Adaptive general control disaster recovery switching device and system, and signal generation method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105574590A (en) | 2016-05-11 |
Family
ID=55884696
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105574590A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1474542A (en) * | 2002-08-06 | 2004-02-11 | 华为技术有限公司 | Telecommunication equipment fault information managing method |
CN101266279A (en) * | 2008-05-09 | 2008-09-17 | 东北大学 | Electric network failure diagnosis device and method |
CN101628628A (en) * | 2009-08-03 | 2010-01-20 | 北京航空航天大学 | Self-correcting redundancy switching mechanism for spacecraft system and verification method thereof |
CN105183937A (en) * | 2015-07-17 | 2015-12-23 | 中国运载火箭技术研究院 | Fault diagnosis method suitable for electrical system of unmanned aerial vehicle |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107733684A (en) * | 2017-08-31 | 2018-02-23 | 北京宇航系统工程研究所 | A kind of multi-controller computing redundancy cluster based on Loongson processor |
CN107733684B (en) * | 2017-08-31 | 2021-02-09 | 北京宇航系统工程研究所 | Multi-controller computing redundancy cluster based on Loongson processor |
CN107634863A (en) * | 2017-10-25 | 2018-01-26 | 北京百悟科技有限公司 | Distributed monitoring device and method for domain name mapping disaster tolerance service |
CN109617716A (en) * | 2018-11-30 | 2019-04-12 | 新华三技术有限公司合肥分公司 | Data center's abnormality eliminating method and device |
CN109617716B (en) * | 2018-11-30 | 2022-02-25 | 新华三技术有限公司合肥分公司 | Data center exception handling method and device |
CN110569149A (en) * | 2019-09-16 | 2019-12-13 | 上海新炬网络技术有限公司 | method for triggering automatic emergency switching of Oracle disaster tolerance based on fault detection |
CN110569149B (en) * | 2019-09-16 | 2023-07-25 | 上海新炬网络技术有限公司 | Method for triggering Oracle disaster recovery automatic emergency switching based on fault detection |
CN110635950A (en) * | 2019-09-30 | 2019-12-31 | 深圳供电局有限公司 | Double-data-center disaster recovery system |
CN112540876A (en) * | 2020-12-21 | 2021-03-23 | 中国人民解放军61623部队 | Remote disaster recovery backup method and system for artificial telephone exchange |
CN112540876B (en) * | 2020-12-21 | 2024-05-31 | 中国人民解放军61623部队 | Remote disaster recovery backup method and system for manual telephone exchange |
CN114338359A (en) * | 2021-12-29 | 2022-04-12 | 中国邮政储蓄银行股份有限公司 | Method and device for processing data center abnormity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20160511 |