CN103701657A - Device and method for monitoring and processing dysfunction of continuously running data processing system - Google Patents

Device and method for monitoring and processing dysfunction of continuously running data processing system Download PDF

Info

Publication number
CN103701657A
CN103701657A CN201210368459.8A CN201210368459A CN103701657A CN 103701657 A CN103701657 A CN 103701657A CN 201210368459 A CN201210368459 A CN 201210368459A CN 103701657 A CN103701657 A CN 103701657A
Authority
CN
China
Prior art keywords
monitoring
abnormal
goal systems
report information
processing logic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210368459.8A
Other languages
Chinese (zh)
Inventor
戚跃民
胡文斌
程军
陈根
吴正中
黄明雄
王昊
冀乃庚
杨燕明
蒋群华
张凉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd filed Critical China Unionpay Co Ltd
Priority to CN201210368459.8A priority Critical patent/CN103701657A/en
Publication of CN103701657A publication Critical patent/CN103701657A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides a device and a method for monitoring and processing the dysfunction of a continuously running data processing system. The method comprises the following steps: monitoring the basic environment of a target system and generating basic environment report information; periodically sending messages for monitoring and testing application processing logic to the target system and generating application processing logic report information; based on monitoring rules, the basic environment report information and the application processing logic report information, judging whether the target system is dysfunctional or not and the property of the dysfunction, and automatically executing dysfunction processing operation which is related to the dysfunction based on a judgment result. Through the device and the method for monitoring and processing the dysfunction, disclosed by the invention, the dysfunction can be accurately monitored in a real-time manner, and a related emergent plan can be automatically implemented.

Description

Abnormal monitoring and processing unit and method for the data handling system that runs without interruption
Technical field
The present invention relates to abnormal monitoring and processing unit and method, more specifically, relate to abnormal monitoring and processing unit and the method for the data handling system for running without interruption.
Background technology
At present, along with becoming increasingly abundant of the class of business of the increasingly extensive and different field of cyber-net application, to the data handling system running without interruption (data handling system of moving continuously for 7 * 24 hours, for example transaction processing server in financial field) extremely monitor and processing becomes more and more important.
The existing abnormality monitoring system for the data handling system that runs without interruption and method are only for the monitoring state of goal systems, and the abnormal and alarm of finding for monitoring needs manpower intervention processing conventionally.
Therefore there are the following problems for the existing abnormality monitoring system for the data handling system that runs without interruption and method: (1) because needs manpower intervention is processed, thus human error can be caused, and ageing lower; (2) due to the conventional supervisory control system service logic of monitoring objective system not, relatively independent and there is versatility, therefore cannot set up specific monitoring rules with the service logic of monitoring objective system; (3) because shortage when carrying out abnormality processing comprehensively judges and need manpower intervention to process, therefore can not tackle fast extremely and accurately implement emergency preplan.
Therefore, there is following demand: provide abnormal monitoring and processing unit and the method for the data handling system that runs without interruption that can monitor real-time and accurately abnormal and emergency preplan that automatically implement to be associated.
Summary of the invention
In order to solve the existing problem of above-mentioned prior art scheme, the present invention proposes abnormal monitoring and processing unit and the method for the data handling system that runs without interruption that can monitor real-time and accurately abnormal and emergency preplan that automatically implement to be associated.
The object of the invention is to be achieved through the following technical solutions:
Abnormal monitoring and a processing unit, described abnormal monitoring and processing unit comprise:
The first monitoring unit, the basic environment of described the first monitoring unit monitoring objective system, and formation base environmental statement information, and described basic environment report information is sent to master controller;
The second monitoring unit, described the second monitoring unit periodically sends application processing logic control and measuring message to described goal systems, and generates application processing logic report information, and described application processing logic report information is sent to described master controller;
Master controller, described master controller judges based on monitoring rules and the described basic environment report information that receives and application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal;
Memory, monitoring rules described in described memory stores.
In the above in disclosed scheme, preferably, the basic environment that described the first monitoring unit is monitored described goal systems comprises at least one that carry out in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
In disclosed scheme, preferably, described the first monitoring unit is monitored the basic environment of described goal systems based at least one monitor control index in the above.
In disclosed scheme, preferably, described the second monitoring unit is monitored the application processing logic of described goal systems based at least one the service application monitor control index at least one applied business dimension in the above.
In the above in disclosed scheme, preferably, described the second monitoring unit is by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data of described goal systems, and set up baseline according to described goal systems historical behavior, thereby monitor the application processing logic of described goal systems.
In the above in disclosed scheme, preferably, described master controller is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, filtering rule described in described memory stores.
In disclosed scheme, preferably, user is by the user interface of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set in the above.
In disclosed scheme, preferably, described memory is the incidence relation of storage extremely and between abnormality processing operation further in the above.
In disclosed scheme, preferably, described master controller is monitored the result of implementation of described abnormality processing operation after executing described abnormality processing operation in the above.
Object of the present invention also can be achieved through the following technical solutions:
Abnormal monitoring and a processing method, described abnormal monitoring and processing method comprise the following steps:
(A1) basic environment of monitoring objective system, and formation base environmental statement information;
(A2) periodically to described goal systems, send application processing logic control and measuring message, and generate application processing logic report information;
(A3) based on monitoring rules and described basic environment report information, judge with application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal.
Abnormal monitoring for the data handling system that runs without interruption disclosed in this invention and processing unit and method have the following advantages: (1) is because abnormality processing operation is automatically carried out and without manpower intervention, therefore can not introduce human error, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Accompanying drawing explanation
By reference to the accompanying drawings, technical characterictic of the present invention and advantage will be understood better by those skilled in the art, wherein:
Fig. 1 is the schematic diagram of abnormal monitoring and processing unit according to an embodiment of the invention;
Fig. 2 is the flow chart of abnormal monitoring and processing method according to an embodiment of the invention.
Embodiment
Fig. 1 is the schematic diagram of abnormal monitoring and processing unit according to an embodiment of the invention.As shown in Figure 1, abnormal monitoring disclosed in this invention and processing unit comprise master controller 1, the first monitoring unit 2, the second monitoring unit 3 and memory 4.Wherein, the basic environment of described the first monitoring unit 2 monitoring objective systems (needing monitored data handling system), and formation base environmental statement information, and described basic environment report information is sent to master controller 1.Described the second monitoring unit 3 periodically (for example per minute) sends application processing logic control and measuring message (for example for detection of the business expression behaviour of transaction processing server whether conclude the business normally probe) to described goal systems, and generate application processing logic report information, and described application processing logic report information is sent to described master controller 1.Described master controller 1 judges that based on monitoring rules and the described basic environment report information that receives and application processing logic report information character that whether described goal systems is abnormal and abnormal (exemplarily, this decision operation completed in several seconds), and automatically carry out and the described abnormality processing operation (for example emergency preplan) being extremely associated based on judged result, so that described goal systems recovers normal.The described monitoring rules of described memory 4 storage.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, the basic environment of the described goal systems of described the first monitoring unit 2 monitoring comprises at least one that carry out in following: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described the first monitoring unit 2 is monitored the basic environment of described goal systems based at least one monitor control index.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described the second monitoring unit 3 at least one service application monitor control index based at least one applied business dimension (being applied business) is monitored the application processing logic of described goal systems.
Exemplarily, in abnormal monitoring disclosed in this invention and processing unit, described the second monitoring unit 3 for example, by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data (transaction data) of described goal systems, and set up the baseline basis of decision operation subsequently (for) according to described goal systems historical behavior, thereby monitor the application processing logic (for example trading processing logic) of described goal systems.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described master controller 1 is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, the described filtering rule of described memory 4 storage.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, user is by the user interface (not shown) of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, described memory 4 is the incidence relation (i.e. one to one relation extremely and abnormality processing operation between) of storage extremely and between abnormality processing operation further.
Preferably, in abnormal monitoring disclosed in this invention and processing unit, the result of implementation of described master controller 1 described abnormality processing operation of monitoring after executing described abnormality processing operation.
Exemplarily, in abnormal monitoring disclosed in this invention and processing unit, described master controller 1 is carried out described abnormality processing by telnet agreement or http protocol and is operated.
Therefore abnormal monitoring disclosed in this invention and processing unit tool have the following advantages: (1) because abnormality processing operation is automatically carried out and without manpower intervention, thus human error can not introduced, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Fig. 2 is the flow chart of abnormal monitoring and processing method according to an embodiment of the invention.As shown in Figure 2, abnormal monitoring disclosed in this invention and processing method comprise the following steps: the basic environment of (A1) monitoring objective system (needing monitored data handling system), and formation base environmental statement information; (A2) periodically (for example per minute) sends application processing logic control and measuring message (for example for detection of the business expression behaviour of transaction processing server whether conclude the business normally probe) to described goal systems, and generates application processing logic report information; (A3) based on monitoring rules and described basic environment report information and application processing logic report information, judge that character that whether described goal systems is abnormal and abnormal (exemplarily, this decision operation completed in several seconds), and automatically carry out and the described abnormality processing operation (for example emergency preplan) being extremely associated based on judged result, so that described goal systems recovers normal.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A1) further comprises: carry out at least one in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A1) further comprises: the basic environment of monitoring described goal systems based at least one monitor control index.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A2) further comprises: at least one the service application monitor control index based at least one applied business dimension (being applied business) is monitored the application processing logic of described goal systems.
Exemplarily, in abnormal monitoring disclosed in this invention and processing method, described step (A2) further comprises: for example, by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data (transaction data) of described goal systems, and set up the baseline basis of decision operation subsequently (for) according to described goal systems historical behavior, thereby monitor the application processing logic (for example trading processing logic) of described goal systems.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A3) further comprises: before carrying out decision operation based on described monitoring rules, based on filtering rule, described basic environment report information and application processing logic report information are carried out to filter operation, to remove irrelevant information.
Preferably, in abnormal monitoring disclosed in this invention and processing method, user arranges the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation by user interface or configuration file.
Preferably, in abnormal monitoring disclosed in this invention and processing method, described step (A3) further comprises: the result of implementation of the described abnormality processing operation of monitoring after executing described abnormality processing operation.
Exemplarily, in abnormal monitoring disclosed in this invention and processing method, by telnet agreement or http protocol, carry out described abnormality processing and operate.
Therefore abnormal monitoring disclosed in this invention and processing method tool have the following advantages: (1) because abnormality processing operation is automatically carried out and without manpower intervention, thus human error can not introduced, and abnormality processing is ageing higher; (2) due to the application processing logic of monitoring objective system, therefore can by setting up specific monitoring rules, whether the application processing logic of monitoring objective system occurs extremely; (3) owing to comprehensively judging based on basic environment report information and application processing logic report information, therefore can tackle fast the abnormal emergency preplan of also implementing exactly.
Although the present invention is described by above-mentioned preferred implementation, its way of realization is not limited to above-mentioned execution mode.Should be realized that: in the situation that not departing from purport of the present invention and scope, those skilled in the art can make different variations and modification to the present invention.

Claims (10)

1. abnormal monitoring and a processing unit, described abnormal monitoring and processing unit comprise:
The first monitoring unit, the basic environment of described the first monitoring unit monitoring objective system, and formation base environmental statement information, and described basic environment report information is sent to master controller;
The second monitoring unit, described the second monitoring unit periodically sends application processing logic control and measuring message to described goal systems, and generates application processing logic report information, and described application processing logic report information is sent to described master controller;
Master controller, described master controller judges based on monitoring rules and the described basic environment report information that receives and application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal;
Memory, monitoring rules described in described memory stores.
2. abnormal monitoring according to claim 1 and processing unit, it is characterized in that, the basic environment that described the first monitoring unit is monitored described goal systems comprises at least one that carry out in following operation: monitoring state of network, monitoring host computer running status, monitoring process state, monitoring application daily record.
3. abnormal monitoring according to claim 2 and processing unit, is characterized in that, described the first monitoring unit is monitored the basic environment of described goal systems based at least one monitor control index.
4. abnormal monitoring according to claim 3 and processing unit, is characterized in that, described the second monitoring unit is monitored the application processing logic of described goal systems based at least one the service application monitor control index at least one applied business dimension.
5. abnormal monitoring according to claim 4 and processing unit, it is characterized in that, described the second monitoring unit is by described application processing logic control and measuring message obtaining information pay close attention to the output of described goal systems from the application data of described goal systems, and set up baseline according to described goal systems historical behavior, thereby monitor the application processing logic of described goal systems.
6. abnormal monitoring according to claim 5 and processing unit, it is characterized in that, described master controller is carried out filter operation based on filtering rule to the described basic environment report information receiving and application processing logic report information before carrying out decision operation based on described monitoring rules, to remove irrelevant information, wherein, filtering rule described in described memory stores.
7. abnormal monitoring according to claim 6 and processing unit, it is characterized in that, user is by the user interface of described abnormal monitoring and processing unit or by configuration file, the incidence relation between described monitoring rules and/or filtering rule and/or abnormal and abnormality processing operation is set.
8. abnormal monitoring according to claim 7 and processing unit, is characterized in that, described memory is the incidence relation of storage extremely and between abnormality processing operation further.
9. abnormal monitoring according to claim 8 and processing unit, is characterized in that, the result of implementation of described master controller described abnormality processing operation of monitoring after executing described abnormality processing operation.
10. abnormal monitoring and a processing method, described abnormal monitoring and processing method comprise the following steps:
(A1) basic environment of monitoring objective system, and formation base environmental statement information;
(A2) periodically to described goal systems, send application processing logic control and measuring message, and generate application processing logic report information;
(A3) based on monitoring rules and described basic environment report information, judge with application processing logic report information the character whether described goal systems is abnormal and abnormal, and automatically carry out and the described abnormality processing operation being extremely associated based on judged result, so that described goal systems recovers normal.
CN201210368459.8A 2012-09-28 2012-09-28 Device and method for monitoring and processing dysfunction of continuously running data processing system Pending CN103701657A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210368459.8A CN103701657A (en) 2012-09-28 2012-09-28 Device and method for monitoring and processing dysfunction of continuously running data processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210368459.8A CN103701657A (en) 2012-09-28 2012-09-28 Device and method for monitoring and processing dysfunction of continuously running data processing system

Publications (1)

Publication Number Publication Date
CN103701657A true CN103701657A (en) 2014-04-02

Family

ID=50363060

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210368459.8A Pending CN103701657A (en) 2012-09-28 2012-09-28 Device and method for monitoring and processing dysfunction of continuously running data processing system

Country Status (1)

Country Link
CN (1) CN103701657A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104363113A (en) * 2014-10-29 2015-02-18 中国建设银行股份有限公司 Business continuity detection method
CN104980962A (en) * 2014-04-03 2015-10-14 中国移动通信集团设计院有限公司 Line test period determining method and device
CN106992900A (en) * 2016-01-20 2017-07-28 北京国双科技有限公司 The method and intelligent early-warning notification platform of monitoring and early warning
CN108073499A (en) * 2016-11-10 2018-05-25 腾讯科技(深圳)有限公司 The test method and device of application program
CN108509321A (en) * 2017-02-24 2018-09-07 北京京东尚科信息技术有限公司 Generate the monitoring method and system of data cube
CN108683639A (en) * 2018-04-23 2018-10-19 丙申南京网络技术有限公司 A kind of computer network abnormality detection and automatic repair system, method and mobile terminal

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040153704A1 (en) * 2001-02-09 2004-08-05 Jurgen Bragulla Automatic startup of a cluster system after occurrence of a recoverable error
CN101482849A (en) * 2009-02-24 2009-07-15 北京星网锐捷网络技术有限公司 Test monitoring method and apparatus
CN101556679A (en) * 2009-05-21 2009-10-14 中国建设银行股份有限公司 Method for processing failures in integrated front-end system and computer equipment
CN102043682A (en) * 2011-01-27 2011-05-04 中国农业银行股份有限公司 Workflow exception handing method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040153704A1 (en) * 2001-02-09 2004-08-05 Jurgen Bragulla Automatic startup of a cluster system after occurrence of a recoverable error
CN101482849A (en) * 2009-02-24 2009-07-15 北京星网锐捷网络技术有限公司 Test monitoring method and apparatus
CN101556679A (en) * 2009-05-21 2009-10-14 中国建设银行股份有限公司 Method for processing failures in integrated front-end system and computer equipment
CN102043682A (en) * 2011-01-27 2011-05-04 中国农业银行股份有限公司 Workflow exception handing method and system

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104980962A (en) * 2014-04-03 2015-10-14 中国移动通信集团设计院有限公司 Line test period determining method and device
CN104980962B (en) * 2014-04-03 2019-04-30 中国移动通信集团设计院有限公司 A kind of determination method and device in field testing period
CN104363113A (en) * 2014-10-29 2015-02-18 中国建设银行股份有限公司 Business continuity detection method
CN106992900A (en) * 2016-01-20 2017-07-28 北京国双科技有限公司 The method and intelligent early-warning notification platform of monitoring and early warning
CN108073499A (en) * 2016-11-10 2018-05-25 腾讯科技(深圳)有限公司 The test method and device of application program
CN108073499B (en) * 2016-11-10 2020-09-29 腾讯科技(深圳)有限公司 Application program testing method and device
CN108509321A (en) * 2017-02-24 2018-09-07 北京京东尚科信息技术有限公司 Generate the monitoring method and system of data cube
CN108683639A (en) * 2018-04-23 2018-10-19 丙申南京网络技术有限公司 A kind of computer network abnormality detection and automatic repair system, method and mobile terminal

Similar Documents

Publication Publication Date Title
CN103701657A (en) Device and method for monitoring and processing dysfunction of continuously running data processing system
CN104639380B (en) server monitoring method
CN105659528B (en) A kind of method and device for realizing fault location
CN103490917B (en) The detection method of troubleshooting situation and device
CN108092836A (en) The monitoring method and device of a kind of server
CN104022904A (en) Unified management platform for IT devices in distributed computer rooms
WO2012157471A1 (en) Fault sensing system for sensing fault in plurality of control systems
US10931533B2 (en) System for network incident management
CN103713981A (en) Database server performance detection and early warning method
CN103490919A (en) Fault management system and fault management method
CN105471932A (en) Front-end application monitoring method, front-end application and front-end application monitoring system
CN104461820A (en) Equipment monitoring method and device
CN104065526A (en) Server fault alarming method and device thereof
CN102404141A (en) Method and device of alarm inhibition
CN102609350A (en) Server memory failure alarm method
US20210287523A1 (en) Method, apparatus, and system for managing alarms
CN111679950B (en) Interface-level dynamic data sampling method and device
CN105025179A (en) Method and system for monitoring service agents of call center
TWI591489B (en) Intelligent monitoring and warning device and method for distributed software defined storage system
CN111124818B (en) Monitoring method, device and equipment for Expander
CN105955864A (en) Power supply fault processing method, power supply module, monitoring management module and server
CN104346233A (en) Fault recovery method and device for computer system
CN103457755A (en) IEC 61850 system communication fault detection method and system
CN116074180A (en) Fault location method, fault repair method, device and storage medium
CN105550094B (en) A kind of high-availability system state automatic monitoring method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140402

RJ01 Rejection of invention patent application after publication