CN105574590A - Adaptive general control disaster recovery switching device and system, and signal generation method - Google Patents

Adaptive general control disaster recovery switching device and system, and signal generation method Download PDF

Info

Publication number
CN105574590A
CN105574590A CN201510999459.1A CN201510999459A CN105574590A CN 105574590 A CN105574590 A CN 105574590A CN 201510999459 A CN201510999459 A CN 201510999459A CN 105574590 A CN105574590 A CN 105574590A
Authority
CN
China
Prior art keywords
fault
data
analysis
production center
fault data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510999459.1A
Other languages
Chinese (zh)
Inventor
王尧
杨健
丁建立
王斌强
李永华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Travelsky Technology Co Ltd
China Travelsky Holding Co
Original Assignee
China Travelsky Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Travelsky Technology Co Ltd filed Critical China Travelsky Technology Co Ltd
Priority to CN201510999459.1A priority Critical patent/CN105574590A/en
Publication of CN105574590A publication Critical patent/CN105574590A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/027Frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses an adaptive general control disaster recovery switching device. The device comprises a fault data processing unit, a fault inference and judgment unit, and a switching signal emission unit; wherein the fault data processing unit is used for acquiring operation state data of each production center, extracting fault data and performing classified storage of the data, and obtaining fault characteristic data via the analysis of the fault data; the fault inference and judgment unit is used for inferring the fault characteristic data via a knowledge base and obtaining a fault switching judgment; and the switching signal emission unit is used for emitting a switching control command to each production center according to the fault switching judgment and a manual command. The invention also discloses an adaptive general control disaster recovery switching system and an adaptive general control disaster recovery switching signal generation method, switching signals are rapidly and accurately generated by employing the intelligent technology, and adaptive switching of the system is realized.

Description

Self-adaptation master control calamity is for switching device shifter, system and signal generating method
Technical field
The present invention relates to data center's disaster recovery and field of intelligent control, particularly a kind of self-adaptation master control calamity is for switching device shifter, system and signal generating method.
Background technology
Under the trend of data centralization, a lot of enterprise institution has built oneself data center.After data centralization, while bringing big advantages, also bring concentration of risk, therefore the safe reliability of data center also just seems particularly important.For ensureing reliability, the redundancy scheme of data center is the important channel addressed this problem.But redundancy scheme result also in the complicacy of Constructing data center, detection data center fault promptly and accurately, and provide rational expert advice, seamlessly switching to center for subsequent use is the key that data center provides business continuity to run.
Traditional disaster recovery solution has " standby with city calamity ", " strange land calamity is standby " and " same to city-strange land calamity is standby " Three models, wherein, mainly refer to that Disaster Preparation Center and the production center are in same city with city calamity for pattern, backed up in synchronization or async backup can be adopted simultaneously, it has minimum cost of investment, the fastest disaster recovery speed, Data safeguard highly, but regional data disaster cannot be tackled; For pattern, strange land calamity mainly refers to that Disaster Preparation Center and the production center are in different cities, generally can only realize async backup, cost of investment is higher, disaster recovery speed and Data safeguard ability lower slightly, advantage to tackle regional disaster risk; Same city-strange land calamity is the combination of two kinds of patterns above for pattern, cost of investment is the highest, but there is the above two advantage, this kind of pattern is divided into again two kinds of implementations, one first sets up same city Disaster Preparation Center, set up that strange land calamity is standby to be backed up same city calamity is standby again, one is that center, same city and center, strange land are independently for the production center is backed up.But above Three models or do not have consideration to run into regional irresistible factor (fire, power-off, earthquake), or adopt single simple switching mode, be only switched to center for subsequent use from the production center, ignore the security at center for subsequent use.Like this once disaster occurs, highly reliable, the disaster-tolerant backup of High Availabitity data center and the requirement of switching will be difficult to meet.
In a kind of disaster recovery solution of many production centers newly, mutually can back up in the heart in each, and independently bear business, greatly improve calamity for grade.In the research that many production centers switched in the past, researcher is more after paying close attention to many production centers generation disaster, the enforcement of switch step between the production center, and seldom consider that application intellectual technology accurately and fast produces switching signal, the self-adaptation realizing system switches.
Supervisory system is widely used in the security of system, system maintenance of Civil aviation information system.Supervisory system application warning information in contain information useful in a large number, but these information only have through in-depth analysis after could be excavated out.Current most of monitor supervision platforms, just lay particular emphasis on the unified collection of warning information, store, the Treatment Analysis ability of warning information is more weak, and gather while multiple application system alert data can not be supported, in the face of huge alert data, cannot accurate localizing faults source, the information causing maintainer to pay close attention to often is submerged in information common in a large number, cannot play system maintenance and judge in advance to process even in time, this makes maintenance work heavy and arduous.
Summary of the invention
For solving the technical matters of existing existence, the embodiment of the present invention provides a kind of self-adaptation master control calamity for switching device shifter, system and signal generating method.
For achieving the above object, the technical scheme of the embodiment of the present invention is achieved in that
A kind of self-adaptation master control calamity is for switching device shifter, and described device comprises: fault data processing unit, fault reasoning judging unit and switching signal issue unit, wherein,
Fault data processing unit, for gathering the fault data of each production center, carrying out classification to described fault data and storing, analyze, obtaining fault signature data;
Fault reasoning judging unit, for obtaining failover suggestion by described fault signature data by Analysis of Knowledge Bases Reasoning;
Switching signal issue unit, for according to described failover suggestion and manual command, sends and switches steering order to each production center.
Wherein, described fault data processing unit comprises: fault data collection module, for collect each production center fault data and carry out fault data classification store.
Wherein, described fault data collection module, obtain the fault data of the production center specifically for the Agent program of acting on behalf of by being arranged on each production center, and monitor other production center running status by heartbeat detection apparatus, and collect the fault data of other production center.
Wherein, described fault data collection module, stores specifically for the classification carrying out fault data by affiliated different application subsystem.
Wherein, described fault data processing unit also comprises: failure analysis module, for carrying out fault analysis respectively according to the fault data being stored in different application subsystem, and carrying out the association analysis of fault to each application subsystem, obtaining fault signature data.
Wherein, described failure analysis module comprises single system fault analysis submodule; The fault data that single system fault analysis submodule is used for according to being stored in different application subsystem carries out fault analysis respectively, obtains fault signature data.
Wherein, described failure analysis module also comprises interconnected system fault analysis submodule; Described interconnected system fault analysis submodule, for carrying out the association analysis of fault to each application subsystem, obtains fault signature data.
Wherein, described fault data processing unit also comprises fault signature database, for preserving described fault signature data.
Wherein, described fault reasoning judging unit comprises knowledge base, Analysis of Knowledge Bases Reasoning module; Described knowledge base describes knowledge processing solution logic; Described Analysis of Knowledge Bases Reasoning module is used for, with described knowledge base for back-end data carries out Analysis of Knowledge Bases Reasoning to described fault signature data, obtaining failover suggestion, and send to described switching signal issue unit in conjunction with the switchover policy preset.
Wherein, described switching signal issue unit, comprising: transfer switches control module, for after described failover suggestion is by manual intervention and confirmation, sends and switches steering order to each production center.
A kind of self-adaptation master control calamity is for switched system, described system comprises at least two production centers, heartbeat detection apparatus and the self-adaptation master control calamity as described in any one of claim 1 to 11 for switching device shifter, each described production center is connected for switching device shifter with described self-adaptation master control calamity respectively, is connected with described heartbeat detection apparatus between each described production center.
Wherein, described generating center comprises: status monitor service device and access server;
Described status monitor service device, for monitoring the running status of the production center in real time by acting on behalf of Agent program, and sends to described self-adaptation master control calamity for switching device shifter by the fault data of generating center;
Described access server, for waiting for the switching steering order that described self-adaptation master control calamity sends for switching device shifter and carrying out corresponding Failure Transfer operation.
Wherein, described generating center also comprises: WEB cluster, database D B cluster and Centroid.
Wherein, described heartbeat detection apparatus, monitors for the real-time running status to the production center, and is sent to by the fault data of generating center described self-adaptation master control calamity for switching device shifter.
A kind of self-adaptation master control calamity is for switching signal production method, and described method comprises:
Fault data processing unit gathers the fault data of each production center, carries out classification and stores, analyzes, obtain fault signature data to described fault data;
Described fault signature data are obtained failover suggestion by Analysis of Knowledge Bases Reasoning by fault reasoning judging unit;
Switching signal issue unit, according to described failover suggestion and manual command, sends and switches steering order to each production center.
Wherein, each production center of fault data collection module collection of described fault data processing unit fault data and carry out fault data classification store.
Wherein, described fault data collection module obtains the running state data of the production center by the status monitor service device being arranged on each production center, and is obtained the running state data of other production center by heartbeat detection apparatus.
Wherein, the classification that described fault data collection module carries out fault data by affiliated different application subsystem stores.
Wherein, the failure analysis module of described fault data processing unit carries out fault analysis respectively according to the fault data of different application subsystem, and carries out the association analysis of fault to each application subsystem, obtains fault signature data.
Wherein, described failure analysis module carries out fault analysis respectively according to the fault data of different application subsystem, obtain fault signature data, comprise: the single system fault analysis submodule of described failure analysis module carries out fault analysis respectively according to the fault data being stored in different application subsystem, obtains fault signature data.
Wherein, described failure analysis module carries out the association analysis of fault to each application subsystem, obtain fault signature data, for the interconnected system fault analysis submodule of: described failure analysis module to carry out the association analysis of fault to each application subsystem, obtain fault signature data.
Wherein, described method also comprises: the fault signature database described fault signature data being saved in described fault data processing unit.
Wherein, be that back-end data carries out Analysis of Knowledge Bases Reasoning to described fault signature data with knowledge base, obtain failover suggestion in conjunction with the switchover policy preset, and send to described switching signal issue unit; Described knowledge base describes knowledge processing solution logic.
Wherein, in described failover suggestion by manual intervention with after confirming, send and switch steering order to each production center.
The self-adaptation master control calamity of the embodiment of the present invention is for switching device shifter, system and signal generating method, when there being the production center to occur abnormal, self-adaptation master control calamity will start automatically for switched system, produce and switch steering order production control center execution Failure Transfer, to make the normal production center can the user of the abnormal production center of adaptive adapter, thus application intellectual technology accurately and fast produces switching signal, the self-adaptation realizing system switches, alleviate the artificial degree participated in, by machine intelligence, realize the Intelligent treatment of human expert, provide expert in time and switch suggestion.Further, gather while multiple application system alert data can be supported, in the face of huge alert data, can accurate localizing faults source, system maintenance is served to the effect judging in advance, process in time.
Accompanying drawing explanation
In accompanying drawing (it is not necessarily drawn in proportion), similar Reference numeral can describe similar parts in different views.The similar reference numerals with different letter suffix can represent the different examples of similar parts.Accompanying drawing generally shows each embodiment discussed herein by way of example and not limitation.
Fig. 1 is the composition structural representation of embodiment of the present invention self-adaptation master control calamity for switching device shifter;
Fig. 2 is the composition structural representation of embodiment of the present invention self-adaptation master control calamity for switched system;
Fig. 3 is the composition structural representation of embodiment of the present invention self-adaptation master control calamity for switching device shifter;
Fig. 4 is the schematic flow sheet of embodiment of the present invention self-adaptation master control calamity for switching signal production method.
Embodiment
The self-adapting intelligent master control calamity that the technical problem to be solved in the present invention is a kind of many production centers of proposition, heartbeat triggers is for switching signal mechanism, utilize machine learning and expert system, realize the generation of more intelligent switching signal faster with more accurate expert opinion, meet the ability improving data center and provide business continuity to run more.The self-adaptation master control calamity of the embodiment of the present invention directly acts on each production center for switching device shifter, obtains the running state data of each production center and sends and switch steering order and switch accordingly to the access server of each production center.
As Fig. 1 shows, the self-adaptation master control calamity of the embodiment of the present invention comprises fault data processing unit 11, fault reasoning judging unit 12, switching signal issue unit 13 for switching device shifter.
Fault data processing unit 11 adopts Agent technology to gather the running state data of each production center, and therefrom extract fault data and storage of classifying, application and trouble data processing algorithm is analyzed fault data further, obtains fault signature data;
Fault reasoning judging unit 12 is that data source carries out Analysis of Knowledge Bases Reasoning to fault signature data with knowledge base, in conjunction with set switchover policy, obtains final failover suggestion, also can be described as to switch to control expert advice; Adopt the switching of history to control experience for study collection, knowledge base described in the correction of applied for machines learning art by knowledge revision simultaneously;
Switching signal issue unit 13 comprises two parts: a part directly accepts outside manual command and sends to each generating center, Failure Transfer is carried out to make the production center, another part is responsible in described failover suggestion by manual intervention with after confirming, sending and switch steering order to each production center, making the production center carry out Failure Transfer when breaking down.
Be described in detail for switching signal mechanism below in conjunction with the self-adaptation master control calamity of drawings and Examples to the embodiment of the present invention, should be appreciated that following illustrated preferred exemplary is only for instruction and explanation of the present invention, is not intended to limit the present invention.And when not conflicting, the embodiment in the present invention and the feature in embodiment can be combined with each other.
The self-adaptation master control calamity that the embodiment of the present invention is a kind of many production centers, heartbeat triggers is for switched system, this system is based on one many production centers pattern, the self-adaptation master control calamity being the deployment of many production centers as shown in Figure 2, for switched system structural drawing, is made up of two large classes, four parts.Wherein three parts are the production center that planning and configuration is identical: first production center, second production center and the 3rd production center, each production center comprises WEB cluster, database (DB, Database) cluster, access server, status monitor service device and Centroid etc., the heartbeat detection apparatus of redundancy is disposed between each production center, heartbeat detection apparatus and status monitor service device are monitored the running status of the production center in real time, and sent to by fault data self-adaptation master control calamity for switching device shifter, access server is responsible for waiting for that switching signal that self-adaptation master control calamity produces for switched system (namely switching steering order) hereinafter carries out corresponding Failure Transfer operation.Self-adaptation master control calamity is responsible for generating switching signal according to the fault data obtained for switched system, and the generation method that switching signal is concrete will hereafter be described in detail.
Here, web cluster is made up of multiple server running same web application simultaneously.Database (DB, Database) cluster utilizes at least two or multiple stage database server exactly, forms a virtual centralized database logical image, as single database system, provides transparent data, services to client.Centroid can be equivalent to the computing machine installing each application.
What Fig. 3 described is that self-adaptation master control calamity is for the concrete composition structure of switching device shifter and the schematic diagram generating switching signal process according to fault data.Self-adaptation master control calamity is divided into three parts for switching device shifter entirety: fault data processing unit 11, fault reasoning judging unit 12, switching signal issue unit 13.
Fault data processing unit 11 comprises fault collection module and failure analysis module.Fault collection module for collect each production center fault data and carry out fault data classification store.Particularly, fault collection module collects two class fault datas, one is the fault data of self production center obtained by status monitor service device as shown in Figure 1, another is the fault data of other production center obtained by heartbeat detection apparatus, and stores by the classification that affiliated different application subsystem carries out fault data.
Failure analysis module carries out fault analysis respectively according to the fault data being stored in different application subsystem, and carries out the association analysis of fault to each application subsystem, obtains fault signature data with this; Particularly, distributed fault analyzing subsystem is built in failure analysis module, this distributed fault analyzing subsystem comprises single system fault analysis submodule and interconnected system fault analysis submodule, single system fault analysis submodule carries out fault analysis respectively according to the fault data being stored in different application subsystem and obtains fault signature data, interconnected system fault analysis submodule can carry out the association analysis of fault to each application subsystem, obtain fault signature data with this.The fault signature data obtained are saved in fault signature database by fault data processing unit.
Fault reasoning judging unit is mainly according to the knowledge of expert system, be that back-end data carries out Analysis of Knowledge Bases Reasoning to the fault signature data that fault data processing unit obtains with knowledge base, knowledge base describes knowledge processing solution logic, comprises the different fixed value adjusting logic of plurality of devices, localization of fault logic, the decision condition logic of relations.In conjunction with set switchover policy, obtain final failover suggestion, and send to the transfer of switching signal issue unit to switch control module.
Switching signal issue unit comprises transfer and switches control module, and the failover suggestion being responsible for fault reasoning judging unit to obtain, by manual intervention with after confirming, sends and switches steering order to the access server of each production center; And, receive manually input manual command and using this manual command as the access server switching steering order and send to each production center.Here switching steering order comprises two parts: the instruction that the manual command manually inputted from keeper and the failover suggestion obtained by fault reasoning judging unit are converted to.
As shown in Figure 4, the self-adapting intelligent master control calamity of the embodiment of the present invention generates the flow process of switching signal for switching signal production method and self-adaptation master control calamity for switched system, can comprise the steps:
Step 401, fault data processing unit gathers the fault data of each production center by Agent mechanism and heartbeat detection mechanism, stores and failure data analyzing, obtain fault signature data through fault data classification;
Step 402, the fault signature data that fault reasoning judging unit obtains with step 401 are input, by knowledge base, obtain failover suggestion by Analysis of Knowledge Bases Reasoning;
Step 403, switching signal issue unit, based on failover suggestion and manual command, switches control module through transfer and sends switching steering order to each production center.
Wherein, each production center of fault data collection module collection of described fault data processing unit fault data and carry out fault data classification store.Described fault data collection module obtains the fault data of the production center by the agent equipment being arranged on each production center, and is obtained the fault data of other production center by heartbeat detection apparatus.The classification that described fault data collection module carries out fault data by affiliated different application subsystem stores.
Wherein, the failure analysis module of described fault data processing unit in carrying out fault analysis respectively according to the fault data being stored in different application subsystem, and carries out the association analysis of fault to each application subsystem, obtains fault signature data.Particularly, the single system fault analysis submodule of described failure analysis module obtains fault signature data for carrying out fault analysis respectively according to the fault data being stored in different application subsystem; The interconnected system fault analysis submodule of described failure analysis module carries out the association analysis of fault to each application subsystem, obtain fault signature data.
Wherein, described method also comprises: the fault signature database described fault signature data being saved in described fault data processing unit.
Wherein, be that back-end data carries out Analysis of Knowledge Bases Reasoning to described fault signature data with knowledge base, obtain failover suggestion in conjunction with the switchover policy preset, and send to described switching signal issue unit; Described knowledge base describes knowledge processing solution logic.
Wherein, the transfer of described switching signal issue unit switches control module reception manual command and described manual command is sent to each production center as switching steering order; Described failover suggestion is sent after manual intervention and confirmation and switches steering order to each production center.
Compared with prior art, beneficial effect of the present invention is:
In the embodiment of the present invention, there is multiple production center, under normal circumstances, all for user provides service, when occurring abnormal, self-adaptation master control calamity will start automatically for switched system, and the normal production center is by the user of the production center abnormal for adapter;
In the embodiment of the present invention, the running state data of self is not only passed to self-adaptation master control calamity for switching device shifter by each production center, and by heartbeat detection technology, the running state data of other production centers is also passed to self-adaptation master control calamity for switching device shifter;
In the embodiment of the present invention, self-adaptation master control calamity introduces expert system reasoning technology for switching device shifter, and by failure data analysis reasoning, the operational process in conjunction with center switches;
In the embodiment of the present invention, self-adaptation master control calamity introduces the technology of machine learning for switching device shifter, by the unceasing study to history inferred results data, and adaptation knowledge base;
In the embodiment of the present invention, self-adaptation master control calamity alleviates the artificial degree participated in for switching device shifter, by machine intelligence, realizes the Intelligent treatment of human expert, provides expert in time and switch suggestion.
Those skilled in the art should understand, embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt the form of hardware embodiment, software implementation or the embodiment in conjunction with software and hardware aspect.And the present invention can adopt in one or more form wherein including the upper computer program implemented of computer-usable storage medium (including but not limited to magnetic disk memory and optical memory etc.) of computer usable program code.
The present invention describes with reference to according to the process flow diagram of the method for the embodiment of the present invention, equipment (system) and computer program and/or block scheme.Should understand can by the combination of the flow process in each flow process in computer program instructions realization flow figure and/or block scheme and/or square frame and process flow diagram and/or block scheme and/or square frame.These computer program instructions can being provided to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, making the instruction performed by the processor of computing machine or other programmable data processing device produce device for realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be stored in can in the computer-readable memory that works in a specific way of vectoring computer or other programmable data processing device, the instruction making to be stored in this computer-readable memory produces the manufacture comprising command device, and this command device realizes the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be loaded in computing machine or other programmable data processing device, make on computing machine or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computing machine or other programmable devices is provided for the step realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
The above, be only preferred embodiment of the present invention, be not intended to limit protection scope of the present invention.

Claims (24)

1. self-adaptation master control calamity is for a switching device shifter, it is characterized in that, described device comprises: fault data processing unit, fault reasoning judging unit and switching signal issue unit, wherein,
Fault data processing unit, for gathering the fault data of each production center, carrying out classification to described fault data and storing, analyze, obtaining fault signature data;
Fault reasoning judging unit, for obtaining failover suggestion by described fault signature data by Analysis of Knowledge Bases Reasoning;
Switching signal issue unit, for according to described failover suggestion and manual command, sends and switches steering order to each production center.
2. device according to claim 1, is characterized in that, described fault data processing unit comprises: fault data collection module, for collect each production center fault data and carry out fault data classification store.
3. device according to claim 2, it is characterized in that, described fault data collection module, the fault data of the production center is obtained specifically for the Agent program of acting on behalf of by being arranged on each production center, and monitor other production center running status by heartbeat detection apparatus, and collect the fault data of other production center.
4. device according to claim 3, is characterized in that, described fault data collection module, stores specifically for the classification carrying out fault data by affiliated different application subsystem.
5. device according to claim 2, it is characterized in that, described fault data processing unit also comprises: failure analysis module, for carrying out fault analysis respectively according to the fault data being stored in different application subsystem, and each application subsystem is carried out to the association analysis of fault, obtain fault signature data.
6. device according to claim 5, is characterized in that, described failure analysis module comprises single system fault analysis submodule; The fault data that single system fault analysis submodule is used for according to being stored in different application subsystem carries out fault analysis respectively, obtains fault signature data.
7. device according to claim 6, is characterized in that, described failure analysis module also comprises interconnected system fault analysis submodule; Described interconnected system fault analysis submodule, for carrying out the association analysis of fault to each application subsystem, obtains fault signature data.
8. device according to claim 1, is characterized in that, described fault data processing unit also comprises fault signature database, for preserving described fault signature data.
9. device according to claim 1, is characterized in that, described fault reasoning judging unit comprises knowledge base, Analysis of Knowledge Bases Reasoning module; Described knowledge base describes knowledge processing solution logic; Described Analysis of Knowledge Bases Reasoning module is used for, with described knowledge base for back-end data carries out Analysis of Knowledge Bases Reasoning to described fault signature data, obtaining failover suggestion, and send to described switching signal issue unit in conjunction with the switchover policy preset.
10. device according to claim 1, is characterized in that, described switching signal issue unit, comprising: transfer switches control module, for after described failover suggestion is by manual intervention and confirmation, sends and switches steering order to each production center.
11. 1 kinds of self-adaptation master control calamities are for switched system, it is characterized in that, described system comprises at least two production centers, heartbeat detection apparatus and the self-adaptation master control calamity as described in any one of claim 1 to 11 for switching device shifter, each described production center is connected for switching device shifter with described self-adaptation master control calamity respectively, is connected with described heartbeat detection apparatus between each described production center.
12. systems according to claim 11, is characterized in that, described generating center comprises: status monitor service device and access server;
Described status monitor service device, for monitoring the running status of the production center in real time by acting on behalf of Agent program, and sends to described self-adaptation master control calamity for switching device shifter by the fault data of generating center;
Described access server, for waiting for the switching steering order that described self-adaptation master control calamity sends for switching device shifter and carrying out corresponding Failure Transfer operation.
13. systems according to claim 12, is characterized in that, described generating center also comprises: WEB cluster, database D B cluster and Centroid.
14. systems according to claim 11 or 12, is characterized in that, described heartbeat detection apparatus, monitor for the real-time running status to the production center, and sent to by the fault data of generating center described self-adaptation master control calamity for switching device shifter.
15. 1 kinds of self-adaptation master control calamities are for switching signal production method, and it is characterized in that, described method comprises:
Fault data processing unit gathers the fault data of each production center, carries out classification and stores, analyzes, obtain fault signature data to described fault data;
Described fault signature data are obtained failover suggestion by Analysis of Knowledge Bases Reasoning by fault reasoning judging unit;
Switching signal issue unit, according to described failover suggestion and manual command, sends and switches steering order to each production center.
16. methods according to claim 15, is characterized in that: the fault data of each production center of fault data collection module collection of described fault data processing unit the classification carrying out fault data store.
17. methods according to claim 16, it is characterized in that: described fault data collection module obtains the running state data of the production center by the status monitor service device being arranged on each production center, and is obtained the running state data of other production center by heartbeat detection apparatus.
18. methods according to claim 17, is characterized in that: the classification that described fault data collection module carries out fault data by affiliated different application subsystem stores.
19. methods according to claim 16, it is characterized in that: the failure analysis module of described fault data processing unit carries out fault analysis respectively according to the fault data of different application subsystem, and each application subsystem is carried out to the association analysis of fault, obtain fault signature data.
20. methods according to claim 19, is characterized in that, described failure analysis module carries out fault analysis respectively according to the fault data of different application subsystem, obtain fault signature data, comprising:
The single system fault analysis submodule of described failure analysis module carries out fault analysis respectively according to the fault data being stored in different application subsystem, obtains fault signature data.
21. methods according to claim 19, it is characterized in that, described failure analysis module carries out the association analysis of fault to each application subsystem, obtain fault signature data, for the interconnected system fault analysis submodule of: described failure analysis module to carry out the association analysis of fault to each application subsystem, obtain fault signature data.
22. methods according to claim 15, is characterized in that, described method also comprises: the fault signature database described fault signature data being saved in described fault data processing unit.
23. methods according to claim 15, is characterized in that:
Be that back-end data carries out Analysis of Knowledge Bases Reasoning to described fault signature data with knowledge base, obtain failover suggestion in conjunction with the switchover policy preset, and send to described switching signal issue unit; Described knowledge base describes knowledge processing solution logic.
24. methods according to claim 15, is characterized in that: in described failover suggestion by manual intervention with after confirming, send and switch steering order to each production center.
CN201510999459.1A 2015-12-28 2015-12-28 Adaptive general control disaster recovery switching device and system, and signal generation method Pending CN105574590A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510999459.1A CN105574590A (en) 2015-12-28 2015-12-28 Adaptive general control disaster recovery switching device and system, and signal generation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510999459.1A CN105574590A (en) 2015-12-28 2015-12-28 Adaptive general control disaster recovery switching device and system, and signal generation method

Publications (1)

Publication Number Publication Date
CN105574590A true CN105574590A (en) 2016-05-11

Family

ID=55884696

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510999459.1A Pending CN105574590A (en) 2015-12-28 2015-12-28 Adaptive general control disaster recovery switching device and system, and signal generation method

Country Status (1)

Country Link
CN (1) CN105574590A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107634863A (en) * 2017-10-25 2018-01-26 北京百悟科技有限公司 Distributed monitoring device and method for domain name mapping disaster tolerance service
CN107733684A (en) * 2017-08-31 2018-02-23 北京宇航系统工程研究所 A kind of multi-controller computing redundancy cluster based on Loongson processor
CN109617716A (en) * 2018-11-30 2019-04-12 新华三技术有限公司合肥分公司 Data center's abnormality eliminating method and device
CN110569149A (en) * 2019-09-16 2019-12-13 上海新炬网络技术有限公司 method for triggering automatic emergency switching of Oracle disaster tolerance based on fault detection
CN110635950A (en) * 2019-09-30 2019-12-31 深圳供电局有限公司 Double-data-center disaster recovery system
CN112540876A (en) * 2020-12-21 2021-03-23 中国人民解放军61623部队 Remote disaster recovery backup method and system for artificial telephone exchange
CN114338359A (en) * 2021-12-29 2022-04-12 中国邮政储蓄银行股份有限公司 Method and device for processing data center abnormity
CN112540876B (en) * 2020-12-21 2024-05-31 中国人民解放军61623部队 Remote disaster recovery backup method and system for manual telephone exchange

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1474542A (en) * 2002-08-06 2004-02-11 华为技术有限公司 Telecommunication equipment fault information managing method
CN101266279A (en) * 2008-05-09 2008-09-17 东北大学 Electric network failure diagnosis device and method
CN101628628A (en) * 2009-08-03 2010-01-20 北京航空航天大学 Self-correcting redundancy switching mechanism for spacecraft system and verification method thereof
CN105183937A (en) * 2015-07-17 2015-12-23 中国运载火箭技术研究院 Fault diagnosis method suitable for electrical system of unmanned aerial vehicle

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1474542A (en) * 2002-08-06 2004-02-11 华为技术有限公司 Telecommunication equipment fault information managing method
CN101266279A (en) * 2008-05-09 2008-09-17 东北大学 Electric network failure diagnosis device and method
CN101628628A (en) * 2009-08-03 2010-01-20 北京航空航天大学 Self-correcting redundancy switching mechanism for spacecraft system and verification method thereof
CN105183937A (en) * 2015-07-17 2015-12-23 中国运载火箭技术研究院 Fault diagnosis method suitable for electrical system of unmanned aerial vehicle

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107733684A (en) * 2017-08-31 2018-02-23 北京宇航系统工程研究所 A kind of multi-controller computing redundancy cluster based on Loongson processor
CN107733684B (en) * 2017-08-31 2021-02-09 北京宇航系统工程研究所 Multi-controller computing redundancy cluster based on Loongson processor
CN107634863A (en) * 2017-10-25 2018-01-26 北京百悟科技有限公司 Distributed monitoring device and method for domain name mapping disaster tolerance service
CN109617716A (en) * 2018-11-30 2019-04-12 新华三技术有限公司合肥分公司 Data center's abnormality eliminating method and device
CN109617716B (en) * 2018-11-30 2022-02-25 新华三技术有限公司合肥分公司 Data center exception handling method and device
CN110569149A (en) * 2019-09-16 2019-12-13 上海新炬网络技术有限公司 method for triggering automatic emergency switching of Oracle disaster tolerance based on fault detection
CN110569149B (en) * 2019-09-16 2023-07-25 上海新炬网络技术有限公司 Method for triggering Oracle disaster recovery automatic emergency switching based on fault detection
CN110635950A (en) * 2019-09-30 2019-12-31 深圳供电局有限公司 Double-data-center disaster recovery system
CN112540876A (en) * 2020-12-21 2021-03-23 中国人民解放军61623部队 Remote disaster recovery backup method and system for artificial telephone exchange
CN112540876B (en) * 2020-12-21 2024-05-31 中国人民解放军61623部队 Remote disaster recovery backup method and system for manual telephone exchange
CN114338359A (en) * 2021-12-29 2022-04-12 中国邮政储蓄银行股份有限公司 Method and device for processing data center abnormity

Similar Documents

Publication Publication Date Title
CN105574590A (en) Adaptive general control disaster recovery switching device and system, and signal generation method
CN102231681B (en) High availability cluster computer system and fault treatment method thereof
CN111209131A (en) Method and system for determining fault of heterogeneous system based on machine learning
EP3148116B1 (en) Information system fault scenario information collection method and system
CN104268061B (en) A kind of storage state monitoring method suitable for virtual machine
CN108199922B (en) System and method for diagnosing and repairing network equipment and server faults
US10316623B2 (en) Method and system for controlling well operations
CN107766502A (en) A kind of Oracle RAC databases disaster tolerance switches drilling method
CN104360868A (en) Multi-stage failure management method for use in large-sized plane comprehensive processing platform
CN113254279B (en) Intelligent disaster recovery and backup management platform system
CN103595572B (en) A kind of method of cloud computing cluster interior joint selfreparing
CN105162632A (en) Automatic processing system for server cluster failures
CN104657150A (en) Automatic operation and maintenance method under cluster environment
CN106789398A (en) A kind of method of media big data hadoop cluster monitoring
CN108337108A (en) A kind of cloud platform failure automation localization method based on association analysis
DE102017208293A1 (en) Industrial facility management systems and methods therefor
CN109698766A (en) The method and system of communication power supply accident analysis
CN106200615B (en) A kind of intelligent track-traffic early warning implementation method based on incidence relation
CN112117756A (en) Integrated operation and maintenance method and system for scheduling control system
CN109995554A (en) The control method and cloud dispatch control device of multi-stage data center active-standby switch
CN115833927A (en) Fiber core switching method and device, electronic equipment and storage medium
CN112711508A (en) Intelligent operation and maintenance service system facing large-scale client system
CN109742852A (en) A kind of controller switching equipment state-detection diagnostic system
CN113608750B (en) Deployment method and device of monitoring component, computer equipment and storage medium
CN110519393B (en) Self-service equipment supervision method, device, equipment, server and medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160511

RJ01 Rejection of invention patent application after publication