CN103701657A - Device and method for monitoring and processing dysfunction of continuously running data processing system - Google Patents
Device and method for monitoring and processing dysfunction of continuously running data processing system Download PDFInfo
- Publication number
- CN103701657A CN103701657A CN201210368459.8A CN201210368459A CN103701657A CN 103701657 A CN103701657 A CN 103701657A CN 201210368459 A CN201210368459 A CN 201210368459A CN 103701657 A CN103701657 A CN 103701657A
- Authority
- CN
- China
- Prior art keywords
- monitoring
- abnormal
- goal systems
- report information
- processing logic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Debugging And Monitoring (AREA)
Abstract
The invention provides a device and a method for monitoring and processing the dysfunction of a continuously running data processing system. The method comprises the following steps: monitoring the basic environment of a target system and generating basic environment report information; periodically sending messages for monitoring and testing application processing logic to the target system and generating application processing logic report information; based on monitoring rules, the basic environment report information and the application processing logic report information, judging whether the target system is dysfunctional or not and the property of the dysfunction, and automatically executing dysfunction processing operation which is related to the dysfunction based on a judgment result. Through the device and the method for monitoring and processing the dysfunction, disclosed by the invention, the dysfunction can be accurately monitored in a real-time manner, and a related emergent plan can be automatically implemented.
Description
Technical field
The present invention relates to abnormal monitoring and processing unit and method, more specifically, relate to abnormal monitoring and processing unit and the method for the data handling system for running without interruption.
Background technology
At present, along with becoming increasingly abundant of the class of business of the increasingly extensive and different field of cyber-net application, to the data handling system running without interruption (data handling system of moving continuously for 7 * 24 hours, for example transaction processing server in financial field) extremely monitor and processing becomes more and more important.
The existing abnormality monitoring system for the data handling system that runs without interruption and method are only for the monitoring state of goal systems, and the abnormal and alarm of finding for monitoring needs manpower intervention processing conventionally.
Therefore there are the following problems for the existing abnormality monitoring system for the data handling system that runs without interruption and method: (1) because needs manpower intervention is processed, thus human error can be caused, and ageing lower; (2) due to the conventional supervisory control system service logic of monitoring objective system not, relatively independent and there is versatility, therefore cannot set up specific monitoring rules with the service logic of monitoring objective system; (3) because shortage when carrying out abnormality processing comprehensively judges and need manpower intervention to process, therefore can not tackle fast extremely and accurately implement emergency preplan.
Therefore, there is following demand: provide abnormal monitoring and processing unit and the method for the data handling system that runs without interruption that can monitor real-time and accurately abnormal and emergency preplan that automatically implement to be associated.
Summary of the invention
In order to solve the existing problem of above-mentioned prior art scheme, the present invention proposes abnormal monitoring and processing unit and the method for the data handling system that runs without interruption that can monitor real-time and accurately abnormal and emergency preplan that automatically implement to be associated.
The object of the invention is to be achieved through the following technical solutions:
Abnormal monitoring and a processing unit, described abnormal monitoring and processing unit comprise:
The first monitoring unit, the basic environment of described the first monitoring unit monitoring objective system, and formation base environmental statement information, and described basic environment report information is sent to master controller;
The second monitoring unit, described the second monitoring unit periodically sends application processing logic control and measuring message to described goal systems, and generates application processing logic report information, and described application processing logic report information is sent to described master controller;
Master controller, described master controller judges based on monitoring rules and the described basic environment report information that receives and application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal;
Memory, monitoring rules described in described memory stores.
In the above in disclosed scheme, preferably, the basic environment that described the first monitoring unit is monitored described goal systems comprises at least one that carry out in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
In disclosed scheme, preferably, described the first monitoring unit is monitored the basic environment of described goal systems based at least one monitor control index in the above.
In disclosed scheme, preferably, described the second monitoring unit is monitored the application processing logic of described goal systems based at least one the service application monitor control index at least one applied business dimension in the above.
In the above in disclosed scheme, preferably, described the second monitoring unit is by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data of described goal systems, and set up baseline according to described goal systems historical behavior, thereby monitor the application processing logic of described goal systems.
In the above in disclosed scheme, preferably, described master controller is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, filtering rule described in described memory stores.
In disclosed scheme, preferably, user is by the user interface of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set in the above.
In disclosed scheme, preferably, described memory is the incidence relation of storage extremely and between abnormality processing operation further in the above.
In disclosed scheme, preferably, described master controller is monitored the result of implementation of described abnormality processing operation after executing described abnormality processing operation in the above.
Object of the present invention also can be achieved through the following technical solutions:
Abnormal monitoring and a processing method, described abnormal monitoring and processing method comprise the following steps:
(A1) basic environment of monitoring objective system, and formation base environmental statement information;
(A2) periodically to described goal systems, send application processing logic control and measuring message, and generate application processing logic report information;
(A3) based on monitoring rules and described basic environment report information, judge with application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal.
Abnormal monitoring for the data handling system that runs without interruption disclosed in this invention and processing unit and method have the following advantages: (1) is because abnormality processing operation is automatically carried out and without manpower intervention, therefore can not introduce human error, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Accompanying drawing explanation
By reference to the accompanying drawings, technical characterictic of the present invention and advantage will be understood better by those skilled in the art, wherein:
Fig. 1 is the schematic diagram of abnormal monitoring and processing unit according to an embodiment of the invention;
Fig. 2 is the flow chart of abnormal monitoring and processing method according to an embodiment of the invention.
Embodiment
Fig. 1 is the schematic diagram of abnormal monitoring and processing unit according to an embodiment of the invention.As shown in Figure 1, abnormal monitoring disclosed in this invention and processing unit comprise master controller 1, the first monitoring unit 2, the second monitoring unit 3 and memory 4.Wherein, the basic environment of described the first monitoring unit 2 monitoring objective systems (needing monitored data handling system), and formation base environmental statement information, and described basic environment report information is sent to master controller 1.Described the second monitoring unit 3 periodically (for example per minute) sends application processing logic control and measuring message (for example for detection of the business expression behaviour of transaction processing server whether conclude the business normally probe) to described goal systems, and generate application processing logic report information, and described application processing logic report information is sent to described master controller 1.Described master controller 1 judges that based on monitoring rules and the described basic environment report information that receives and application processing logic report information character that whether described goal systems is abnormal and abnormal (exemplarily, this decision operation completed in several seconds), and automatically carry out and the described abnormality processing operation (for example emergency preplan) being extremely associated based on judged result, so that described goal systems recovers normal.The described monitoring rules of described memory 4 storage.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, the basic environment of the described goal systems of described the first monitoring unit 2 monitoring comprises at least one that carry out in following: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described the first monitoring unit 2 is monitored the basic environment of described goal systems based at least one monitor control index.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described the second monitoring unit 3 at least one service application monitor control index based at least one applied business dimension (being applied business) is monitored the application processing logic of described goal systems.
Exemplarily, in abnormal monitoring disclosed in this invention and processing unit, described the second monitoring unit 3 for example, by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data (transaction data) of described goal systems, and set up the baseline basis of decision operation subsequently (for) according to described goal systems historical behavior, thereby monitor the application processing logic (for example trading processing logic) of described goal systems.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described master controller 1 is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, the described filtering rule of described memory 4 storage.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, user is by the user interface (not shown) of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described memory 4 is the incidence relation (i.e. one to one relation extremely and abnormality processing operation between) of storage extremely and between abnormality processing operation further.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, the result of implementation of described master controller 1 described abnormality processing operation of monitoring after executing described abnormality processing operation.
Exemplarily, in abnormal monitoring disclosed in this invention and processing unit, described master controller 1 is carried out described abnormality processing by telnet agreement or http protocol and is operated.
Therefore abnormal monitoring disclosed in this invention and processing unit tool have the following advantages: (1) because abnormality processing operation is automatically carried out and without manpower intervention, thus human error can not introduced, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Fig. 2 is the flow chart of abnormal monitoring and processing method according to an embodiment of the invention.As shown in Figure 2, abnormal monitoring disclosed in this invention and processing method comprise the following steps: the basic environment of (A1) monitoring objective system (needing monitored data handling system), and formation base environmental statement information; (A2) periodically (for example per minute) sends application processing logic control and measuring message (for example for detection of the business expression behaviour of transaction processing server whether conclude the business normally probe) to described goal systems, and generates application processing logic report information; (A3) based on monitoring rules and described basic environment report information and application processing logic report information, judge that character that whether described goal systems is abnormal and abnormal (exemplarily, this decision operation completed in several seconds), and automatically carry out and the described abnormality processing operation (for example emergency preplan) being extremely associated based on judged result, so that described goal systems recovers normal.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A1) further comprises: carry out at least one in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A1) further comprises: the basic environment of monitoring described goal systems based at least one monitor control index.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A2) further comprises: at least one the service application monitor control index based at least one applied business dimension (being applied business) is monitored the application processing logic of described goal systems.
Exemplarily, in abnormal monitoring disclosed in this invention and processing method, described step (A2) further comprises: for example, by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data (transaction data) of described goal systems, and set up the baseline basis of decision operation subsequently (for) according to described goal systems historical behavior, thereby monitor the application processing logic (for example trading processing logic) of described goal systems.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A3) further comprises: before carrying out decision operation based on described monitoring rules, based on filtering rule, described basic environment report information and application processing logic report information are carried out to filter operation, to remove irrelevant information.
Preferably, in abnormal monitoring disclosed in this invention and processing method, user arranges the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation by user interface or configuration file.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A3) further comprises: the result of implementation of the described abnormality processing operation of monitoring after executing described abnormality processing operation.
Exemplarily, in abnormal monitoring disclosed in this invention and processing method, by telnet agreement or http protocol, carry out described abnormality processing and operate.
Therefore abnormal monitoring disclosed in this invention and processing method tool have the following advantages: (1) because abnormality processing operation is automatically carried out and without manpower intervention, thus human error can not introduced, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Although the present invention is described by above-mentioned preferred implementation, its way of realization is not limited to above-mentioned execution mode.Should be realized that: in the situation that not departing from purport of the present invention and scope, those skilled in the art can make different variations and modification to the present invention.
Claims (10)
1. abnormal monitoring and a processing unit, described abnormal monitoring and processing unit comprise:
The first monitoring unit, the basic environment of described the first monitoring unit monitoring objective system, and formation base environmental statement information, and described basic environment report information is sent to master controller;
The second monitoring unit, described the second monitoring unit periodically sends application processing logic control and measuring message to described goal systems, and generates application processing logic report information, and described application processing logic report information is sent to described master controller;
Master controller, described master controller judges based on monitoring rules and the described basic environment report information that receives and application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal;
Memory, monitoring rules described in described memory stores.
2. abnormal monitoring according to claim 1 and processing unit, it is characterized in that, the basic environment that described the first monitoring unit is monitored described goal systems comprises at least one that carry out in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
3. abnormal monitoring according to claim 2 and processing unit, is characterized in that, described the first monitoring unit is monitored the basic environment of described goal systems based at least one monitor control index.
4. abnormal monitoring according to claim 3 and processing unit, is characterized in that, described the second monitoring unit is monitored the application processing logic of described goal systems based at least one the service application monitor control index at least one applied business dimension.
5. abnormal monitoring according to claim 4 and processing unit, it is characterized in that, described the second monitoring unit is by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data of described goal systems, and set up baseline according to described goal systems historical behavior, thereby monitor the application processing logic of described goal systems.
6. abnormal monitoring according to claim 5 and processing unit, it is characterized in that, described master controller is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, filtering rule described in described memory stores.
7. abnormal monitoring according to claim 6 and processing unit, it is characterized in that, user is by the user interface of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set.
8. abnormal monitoring according to claim 7 and processing unit, is characterized in that, described memory is the incidence relation of storage extremely and between abnormality processing operation further.
9. abnormal monitoring according to claim 8 and processing unit, is characterized in that, the result of implementation of described master controller described abnormality processing operation of monitoring after executing described abnormality processing operation.
10. abnormal monitoring and a processing method, described abnormal monitoring and processing method comprise the following steps:
(A1) basic environment of monitoring objective system, and formation base environmental statement information;
(A2) periodically to described goal systems, send application processing logic control and measuring message, and generate application processing logic report information;
(A3) based on monitoring rules and described basic environment report information, judge with application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210368459.8A CN103701657A (en) | 2012-09-28 | 2012-09-28 | Device and method for monitoring and processing dysfunction of continuously running data processing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210368459.8A CN103701657A (en) | 2012-09-28 | 2012-09-28 | Device and method for monitoring and processing dysfunction of continuously running data processing system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103701657A true CN103701657A (en) | 2014-04-02 |
Family
ID=50363060
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210368459.8A Pending CN103701657A (en) | 2012-09-28 | 2012-09-28 | Device and method for monitoring and processing dysfunction of continuously running data processing system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103701657A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104363113A (en) * | 2014-10-29 | 2015-02-18 | 中国建设银行股份有限公司 | Business continuity detection method |
CN104980962A (en) * | 2014-04-03 | 2015-10-14 | 中国移动通信集团设计院有限公司 | Line test period determining method and device |
CN106992900A (en) * | 2016-01-20 | 2017-07-28 | 北京国双科技有限公司 | The method and intelligent early-warning notification platform of monitoring and early warning |
CN108073499A (en) * | 2016-11-10 | 2018-05-25 | 腾讯科技(深圳)有限公司 | The test method and device of application program |
CN108509321A (en) * | 2017-02-24 | 2018-09-07 | 北京京东尚科信息技术有限公司 | Generate the monitoring method and system of data cube |
CN108683639A (en) * | 2018-04-23 | 2018-10-19 | 丙申南京网络技术有限公司 | A kind of computer network abnormality detection and automatic repair system, method and mobile terminal |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040153704A1 (en) * | 2001-02-09 | 2004-08-05 | Jurgen Bragulla | Automatic startup of a cluster system after occurrence of a recoverable error |
CN101482849A (en) * | 2009-02-24 | 2009-07-15 | 北京星网锐捷网络技术有限公司 | Test monitoring method and apparatus |
CN101556679A (en) * | 2009-05-21 | 2009-10-14 | 中国建设银行股份有限公司 | Method for processing failures in integrated front-end system and computer equipment |
CN102043682A (en) * | 2011-01-27 | 2011-05-04 | 中国农业银行股份有限公司 | Workflow exception handing method and system |
-
2012
- 2012-09-28 CN CN201210368459.8A patent/CN103701657A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040153704A1 (en) * | 2001-02-09 | 2004-08-05 | Jurgen Bragulla | Automatic startup of a cluster system after occurrence of a recoverable error |
CN101482849A (en) * | 2009-02-24 | 2009-07-15 | 北京星网锐捷网络技术有限公司 | Test monitoring method and apparatus |
CN101556679A (en) * | 2009-05-21 | 2009-10-14 | 中国建设银行股份有限公司 | Method for processing failures in integrated front-end system and computer equipment |
CN102043682A (en) * | 2011-01-27 | 2011-05-04 | 中国农业银行股份有限公司 | Workflow exception handing method and system |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104980962A (en) * | 2014-04-03 | 2015-10-14 | 中国移动通信集团设计院有限公司 | Line test period determining method and device |
CN104980962B (en) * | 2014-04-03 | 2019-04-30 | 中国移动通信集团设计院有限公司 | A kind of determination method and device in field testing period |
CN104363113A (en) * | 2014-10-29 | 2015-02-18 | 中国建设银行股份有限公司 | Business continuity detection method |
CN106992900A (en) * | 2016-01-20 | 2017-07-28 | 北京国双科技有限公司 | The method and intelligent early-warning notification platform of monitoring and early warning |
CN108073499A (en) * | 2016-11-10 | 2018-05-25 | 腾讯科技(深圳)有限公司 | The test method and device of application program |
CN108073499B (en) * | 2016-11-10 | 2020-09-29 | 腾讯科技(深圳)有限公司 | Application program testing method and device |
CN108509321A (en) * | 2017-02-24 | 2018-09-07 | 北京京东尚科信息技术有限公司 | Generate the monitoring method and system of data cube |
CN108683639A (en) * | 2018-04-23 | 2018-10-19 | 丙申南京网络技术有限公司 | A kind of computer network abnormality detection and automatic repair system, method and mobile terminal |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103701657A (en) | Device and method for monitoring and processing dysfunction of continuously running data processing system | |
CN104639380B (en) | server monitoring method | |
CN105659528B (en) | A kind of method and device for realizing fault location | |
CN103490917B (en) | The detection method of troubleshooting situation and device | |
CN108092836A (en) | The monitoring method and device of a kind of server | |
CN104022904A (en) | Unified management platform for IT devices in distributed computer rooms | |
WO2012157471A1 (en) | Fault sensing system for sensing fault in plurality of control systems | |
US10931533B2 (en) | System for network incident management | |
CN103713981A (en) | Database server performance detection and early warning method | |
CN103490919A (en) | Fault management system and fault management method | |
CN105471932A (en) | Front-end application monitoring method, front-end application and front-end application monitoring system | |
CN104461820A (en) | Equipment monitoring method and device | |
CN104065526A (en) | Server fault alarming method and device thereof | |
CN102404141A (en) | Method and device of alarm inhibition | |
CN102609350A (en) | Server memory failure alarm method | |
US20210287523A1 (en) | Method, apparatus, and system for managing alarms | |
CN111679950B (en) | Interface-level dynamic data sampling method and device | |
CN105025179A (en) | Method and system for monitoring service agents of call center | |
TWI591489B (en) | Intelligent monitoring and warning device and method for distributed software defined storage system | |
CN111124818B (en) | Monitoring method, device and equipment for Expander | |
CN105955864A (en) | Power supply fault processing method, power supply module, monitoring management module and server | |
CN104346233A (en) | Fault recovery method and device for computer system | |
CN103457755A (en) | IEC 61850 system communication fault detection method and system | |
CN116074180A (en) | Fault location method, fault repair method, device and storage medium | |
CN105550094B (en) | A kind of high-availability system state automatic monitoring method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140402 |
|
RJ01 | Rejection of invention patent application after publication |