CN103166778A - Method and device for automatically and intelligently processing malfunction - Google Patents

Method and device for automatically and intelligently processing malfunction Download PDF

Info

Publication number
CN103166778A
CN103166778A CN2011104136172A CN201110413617A CN103166778A CN 103166778 A CN103166778 A CN 103166778A CN 2011104136172 A CN2011104136172 A CN 2011104136172A CN 201110413617 A CN201110413617 A CN 201110413617A CN 103166778 A CN103166778 A CN 103166778A
Authority
CN
China
Prior art keywords
fault
processing
malfunction
troubleshooting
strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011104136172A
Other languages
Chinese (zh)
Inventor
舒刚
廖昕
陈松
杨涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Qinzhi Digital Technology Co Ltd
Original Assignee
Chengdu Qinzhi Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Qinzhi Digital Technology Co Ltd filed Critical Chengdu Qinzhi Digital Technology Co Ltd
Priority to CN2011104136172A priority Critical patent/CN103166778A/en
Publication of CN103166778A publication Critical patent/CN103166778A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a method and a device for automatically and intelligently processing a malfunction. The method mainly comprises the steps: (1) a malfunction monitor monitors a controlled terminal; (2) malfunction information is submitted to and filtered by a malfunction filter; (3) the malfunction information filtered through the malfunction filter is submitted to a malfunction processor, the malfunction processor receives the malfunction information and processes the malfunction automatically according to a processing regular of a malfunction processing strategy storage and returns a processing result back; (4) whether the malfunction is recovered or not is judged by the malfunction processor, if the malfunction is recovered, a malfunction processing recorder is noticed to record a processing condition; and if the malfunction is not recovered, an operation and maintenance worker is noticed by the malfunction processor; (5) the operation and maintenance worker obtains the malfunction information and the malfunction of the terminal is manually processed; (6) processing information is submitted to the malfunction processing recorder by the processing worker; and (7) and the malfunction processing recorder records the malfunction processing condition. According to the method and the device for automatically and intelligently processing the malfunction, the malfunction occurs in a network can be automatically and intelligently processed rapidly and efficiently.

Description

A kind of fault automatic intelligent processing method and device thereof
Technical field
The present invention relates to IT O﹠M field, relate in particular to a kind of fault automatic intelligent processing method.
Background technology
In IT O﹠M field, for guaranteeing the normal operation of whole O﹠M network, not only will in time grasp the ruuning situation of O﹠M network, but also the fault that needs in time network to be occurred is made processing fast and efficiently, reaching fast quick-recovery fault, thereby reduce the purpose of economic loss.The automation of fault and intelligent processing method are particularly important in the IT O﹠M.
Summary of the invention
The object of the present invention is to provide a kind of fault automatic intelligent processing method and device thereof, can make automation fast and efficiently and intelligent processing method to the fault that network occurs, reaching fast quick-recovery fault, thereby reduce the purpose of economic loss.
The invention provides a kind of fault automatic intelligent processing method, 1), fault monitor monitors controlled terminal the key step of the method is as follows:, in time obtain the operation conditions of controlled end, if find that certain station terminal breaks down, and generates corresponding fault message.2), this fault message is transferred to the fault filtering device, by fault filtering device filter faults information, needing to obtain fault to be processed.3), the fault message of fault filtering device after filtering submit to failure processor, after failure processor receives fault message, according to the processing rule in troubleshooting strategy warehouse, handling failure automatically, and return to result.4), failure processor resolve fault processing result information, whether failure judgement is recovered, if fault is restored, notice troubleshooting register records this time troubleshooting situation; If fault is not restored, failure processor notice O﹠M personnel do artificial processing.5), the O﹠M personnel obtain fault message, the artificial treatment terminal fault.6), treatment people fills in the processing method of this kind fault, an and newly-built processing rule together with the information of this troubleshooting situation, is submitted to the troubleshooting register.7), troubleshooting recorder trace troubleshooting situation, and do corresponding operation.
The invention provides a kind of fault automatic processing device, wherein mainly comprised fault monitor, fault filtering device, failure processor, troubleshooting strategy warehouse and troubleshooting register.
Each controlled terminal of fault monitor charge of overseeing, the ruuning situation of moment monitor terminal, and at the fault message of finding to produce when controlled terminal breaks down response, this information can be used for the position, fault category of positioning terminal equipment etc., makes failure processor can accurately locate the controlled terminal that the troubleshooting strategy need to affect.
After fault monitor discovery terminal broke down the generation fault message, at first fault monitor passed to the fault filtering device with this information, in the fault filtering device, is depositing the user according to the demand of oneself, a series of fault filterings rules that pre-define.The fault filtering device filters out according to these filtering rules the fault message that those meet the demands, and needing to obtain the fault message of recovery automatically.
The user is according to the own fail-over policy of understanding, these strategies are pre-defined in troubleshooting strategy warehouse, after the filtration of fault message through the fault filtering device, these information will pass to failure processor, after failure processor is received fault message, the failure processor treatment step is as described below: (A), resolve fault information, locate the information such as type, terminal location of this fault; (B), failure processor judges in the fail-over policy warehouse according to the type of fault whether strategy corresponding to this fault of processing is arranged; (C) if the processing policy of processing this fault is arranged, failure processor is from wherein taking out best processing policy; (D), failure processor is again according to the manner of execution of processing policy definition, makes corresponding processing action, affects the controlled terminal of locating in the A step; (E), after failure processor carries out processing policy, obtain the effect of this strategy execution, judge whether this fault is restored, if be not restored, get back to described step B, if fault is restored, notify this time of troubleshooting recorder trace recovery situation; (F) if, judge in step B when not recovered this fault tactful in troubleshooting strategy warehouse, failure processor is notified relevant fault recovery personnel, has the recovery personnel to do artificial recovery.
After the strategy of system by tactful warehouse successfully recovers fault, the fault recovery register can obtain the failure processor notice, register is according to this time of information recording/recovery situations such as the fault type of this time fault recovery, failure strategy signs, and promote the weight of the troubleshooting strategy of successfully processing this fault, select best processing policy for failure processor precondition is provided, thus the intelligent processing ability of continuous elevator system.
When the fault of transferring to O﹠M personnel manual reversion, the O﹠M personnel are after successfully recovering fault, the failure strategy typing interface that can provide by the fault recovery register, the strategy of processing this kind fault is submitted to register, register is recorded to the troubleshooting strategy of this new interpolation in tactful warehouse, thus the automatic processing capabilities of continuous elevator system.
Adopt this fault automatic intelligent processing method and device thereof, can make automation fast and efficiently and intelligent processing method to the fault that network occurs, fast quick-recovery fault reduces the economic loss that the information system fault is brought.
Description of drawings
Fig. 1: be fault automatic intelligent processing method schematic diagram;
Fig. 2: be fault automatic intelligent process flow figure;
Fig. 3: be fault automatic intelligent processing method sequential chart.
Embodiment
Disclosed all features in this specification, or the step in disclosed all methods or process except mutually exclusive feature and/or step, all can make up by any way.
Disclosed arbitrary feature in this specification (comprising any accessory claim, summary and accompanying drawing) is unless special narration all can be replaced by other equivalences or the alternative features with similar purpose.That is, unless special narration, each feature is an example in a series of equivalences or similar characteristics.
For fault filtering device definition filtering rule, be used for filtering the fault message that does not need system to recover.
According to the troubleshooting technology that reality is grasped, add predefined troubleshooting strategy to tactful warehouse.
Start the failure monitoring device, the ruuning situation of timing scan controlled terminal, failure monitoring device scanning controlled terminal, when finding that this terminal breaks down, fault monitor is classified this fault according to the fault that controlled terminal occurs, and resolve the IP address of this terminal, the information such as terminal type, system type, the failure monitoring device generates according to the fault message that collects the information format that failure processor can be identified, and this information is passed to the fault filtering device.
The fault filtering device is according to predefined a series of filtering rules, contrast one by one, judge each rule, get rid of those legal fault messages, needing to obtain at last the fault message of recovery, and need fault message to be processed submit to failure processor these.
after failure processor is obtained fault message, failure processor resolve fault information, the fault message of failure processor by parsing, the IP address of the equipment that breaks down in the location, terminal type, system type and fault category etc., failure processor is searched the processing policy of processing such fault by these information in troubleshooting strategy warehouse, at first judged whether predefined processing policy, the processing policy that there is no such fault, failure processor is just notified relevant O﹠M personnel, and do artificial processing, if there is the predefine strategy of processing such fault in the fault warehouse, failure processor is selected best processing policy according to the tactful weight in the warehouse so, after failure processor gets best processing policy, just carry out the predefined relevant treatment action of this strategy, affect the terminal of locating in fault message, after affecting terminal, failure processor obtains this strategy processing result information, and resolve, whether judgement has recovered by the processing of this troubleshooting strategy the terminal fault that fault message indicates, if do not recover, what failure processor repeated so obtains other best processing policies from troubleshooting strategy warehouse, and carry out, until it is complete to process the strategy execution of this fault in tactful warehouse, if fault is restored, notify this time fault recovery situation of fault recovery recorder trace, if failure processor does not all recover this fault after the strategy of processing such fault in tactful warehouse is all carried out, the relevant O﹠M personnel of failure processor notice do artificial processing.
after failure processor is complete to troubleshooting process, may have two kinds of situations, a kind of situation is that the fault processor is disposed automatically and has recovered fault, in such cases, failure processor is directly notified the fault recovery register, register can record the fault type of this time processing, and promotes the weight of the strategy that this time recovers this fault in tactful warehouse, but the second situation is the fault processor automatically to be disposed and not to recover fault, or because do not process the strategy of this fault in the fail-over policy warehouse, failure processor all can notify relevant O﹠M personnel to carry out artificial treatment, in such cases, after the O﹠M personnel have successfully recovered this fault, the Data Enter interface that should provide by the fault recovery register, the situation of this time of typing recovery, and according to the concrete steps of recovering, write predefined troubleshooting strategy and submit in the lump the fault recovery register, the fault recovery register is according to fault message, record this time fault recovery situation, and be added with the processing policy of such fault that the O﹠M personnel submit in troubleshooting strategy warehouse.

Claims (8)

1. fault automatic intelligent processing method is characterized in that the method comprises the following steps:
1), fault monitor monitors and in time to obtain the operation conditions of controlled end, if find that certain station terminal breaks down, and generates corresponding fault message by controlled terminal;
2), this fault message is transferred to the fault filtering device, by fault filtering device filter faults information, needing to obtain fault to be processed;
3), the fault message of fault filtering device after filtering submit to failure processor, after failure processor receives fault message, according to the processing rule in troubleshooting strategy warehouse, handling failure automatically, and return to result;
4), failure processor resolve fault processing result information, whether failure judgement is recovered, if fault is restored, notice troubleshooting register records this time troubleshooting situation; If fault is not restored, failure processor notice O﹠M personnel do artificial processing;
5), the O﹠M personnel obtain fault message, the artificial treatment terminal fault;
6), treatment people fills in the processing method of this kind fault, an and newly-built processing rule together with the information of this troubleshooting situation, is submitted to the troubleshooting register;
7), troubleshooting recorder trace troubleshooting situation, and do corresponding operation.
2. any fault automatic intelligent processing method according to claim 1 is characterized in that: described failure processor by for troubleshooting strategy division weight, is the foundation that the fault processor is chosen the optimization process strategy; To the fault that some system can't recover, process register and provide interface for the corresponding processing policy of fault recovery personnel typing.
3. fault automatic intelligent processing unit is characterized in that: the method has comprised fault monitor, fault filtering device, troubleshooting strategy warehouse, failure processor, troubleshooting register.
4. a kind of fault automatic intelligent processing unit according to claim 2, it is characterized in that: described fault monitor is monitored the ruuning situation of controlled terminal constantly, find the fault of operation terminal, and can produce corresponding fault message according to terminal equipment failure.
5. a kind of fault automatic intelligent processing unit according to claim 2 is characterized in that: described fault monitor, and the user can carry out the self-defined of filtering rule to it according to the actual requirements; And filter can filtering rule predefined according to these, filter faults information.
6. a kind of fault automatic intelligent processing method according to claim 2, it is characterized in that: described failure processor is selected best processing policy in troubleshooting strategy warehouse according to fault message, and make corresponding processing action according to this strategy, to affect controlled terminal; Can obtain the situation as a result that strategy is processed after affecting controlled terminal, and make corresponding subsequent action according to result: if not success of Recovery processing, whether also have other strategies can process this fault in the determination strategy warehouse, if have, therefrom take out best processing policy, and the action of handling it, if there is no other processing policies, notify artificial treatment; If fault recorder is notified in the Recovery processing success, record this time disposition.
7. a kind of fault automatic intelligent processing method according to claim 2, it is characterized in that: described failure processor is made corresponding record according to fault recovery mode difference, if being system, fault recovers, record this time fault recovery situation, and promote the tactful weight that is used for this time troubleshooting; If in tactful warehouse, all strategies of processing this kind fault all do not make this fault recovery, or do not process the strategy of this fault, and transfer the situation of manual reversion to, the troubleshooting register will be according to the disposition for the treatment of people, record this time processing, and the strategy of this kind of processing fault that treatment people is submitted to joins in troubleshooting strategy warehouse.
8. according to claim 2 or 6 described a kind of fault automatic intelligent processing unit is characterized in that: described failure processor by for troubleshooting strategy division weight, is the foundation that the fault processor is chosen the optimization process strategy; To the fault that some system can't recover, process register and provide interface for the corresponding processing policy of fault recovery personnel typing.
CN2011104136172A 2011-12-13 2011-12-13 Method and device for automatically and intelligently processing malfunction Pending CN103166778A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011104136172A CN103166778A (en) 2011-12-13 2011-12-13 Method and device for automatically and intelligently processing malfunction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011104136172A CN103166778A (en) 2011-12-13 2011-12-13 Method and device for automatically and intelligently processing malfunction

Publications (1)

Publication Number Publication Date
CN103166778A true CN103166778A (en) 2013-06-19

Family

ID=48589531

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011104136172A Pending CN103166778A (en) 2011-12-13 2011-12-13 Method and device for automatically and intelligently processing malfunction

Country Status (1)

Country Link
CN (1) CN103166778A (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104538336A (en) * 2015-01-07 2015-04-22 海太半导体(无锡)有限公司 Alarm recognizing and processing system and method for semiconductor encapsulation equipment
CN105262616A (en) * 2015-09-21 2016-01-20 浪潮集团有限公司 Failure repository-based automated failure processing system and method
CN106161079A (en) * 2015-04-28 2016-11-23 小米科技有限责任公司 Fault feedback method and device
CN106383760A (en) * 2016-09-19 2017-02-08 郑州云海信息技术有限公司 Computer fault management method and apparatus
CN106998256A (en) * 2016-01-22 2017-08-01 腾讯科技(深圳)有限公司 A kind of communication failure localization method and server
CN107122254A (en) * 2016-02-25 2017-09-01 阿里巴巴集团控股有限公司 A kind of computer repairs control method and system, restorative procedure and system
CN108121642A (en) * 2017-12-20 2018-06-05 维沃移动通信有限公司 A kind of failure solves method, server and mobile terminal
CN108429629A (en) * 2017-02-14 2018-08-21 腾讯科技(深圳)有限公司 Equipment fault restoration methods and device
CN108540308A (en) * 2018-03-02 2018-09-14 中国银行股份有限公司 A kind of windows application platform fault self-recovery system and methods based on SCOM
CN108833187A (en) * 2018-06-29 2018-11-16 上海瀚之友信息技术服务有限公司 A kind of document self-cure monitoring system and method
CN109034415A (en) * 2018-07-20 2018-12-18 郑州云海信息技术有限公司 A kind of fault handling method of self study, apparatus and system
CN109597397A (en) * 2018-11-28 2019-04-09 北京星航机电装备有限公司 A kind of fault diagnosis system and method based on ForceControl configuration software
CN109669402A (en) * 2018-09-25 2019-04-23 平安普惠企业管理有限公司 Abnormality monitoring method, unit and computer readable storage medium
CN110095144A (en) * 2018-01-30 2019-08-06 中电长城(长沙)信息技术有限公司 A kind of terminal device local fault recognition method and system
CN111297243A (en) * 2019-05-09 2020-06-19 湖北润格卫浴股份有限公司 Intelligent closestool cover operation master control fault processing system based on Internet of things
CN111355610A (en) * 2020-02-25 2020-06-30 网宿科技股份有限公司 Exception handling method and device based on edge network
CN113179180A (en) * 2021-04-23 2021-07-27 杭州安恒信息技术股份有限公司 Basalt client disaster fault repairing method, basalt client disaster fault repairing device and basalt client disaster storage medium
CN113572637A (en) * 2021-07-16 2021-10-29 中盈优创资讯科技有限公司 Network fault automatic preprocessing method and device
CN114002543A (en) * 2021-09-27 2022-02-01 中盈优创资讯科技有限公司 STN relay quality difference circuit autonomous method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605346A (en) * 2008-06-10 2009-12-16 中兴通讯股份有限公司 The fault restoration method and apparatus
CN101958536A (en) * 2010-09-20 2011-01-26 中国电力科学研究院 Distribution network failure isolation and quick power service restoration decision support system
US20110138224A1 (en) * 2009-12-09 2011-06-09 Electronics And Telecommunications Research Institute Method and system for tracepoint-based fault diagnosis and recovery
WO2011135968A1 (en) * 2010-04-28 2011-11-03 新日鉄ソリューションズ株式会社 Information processing system, information processing method, and program

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605346A (en) * 2008-06-10 2009-12-16 中兴通讯股份有限公司 The fault restoration method and apparatus
US20110138224A1 (en) * 2009-12-09 2011-06-09 Electronics And Telecommunications Research Institute Method and system for tracepoint-based fault diagnosis and recovery
WO2011135968A1 (en) * 2010-04-28 2011-11-03 新日鉄ソリューションズ株式会社 Information processing system, information processing method, and program
CN101958536A (en) * 2010-09-20 2011-01-26 中国电力科学研究院 Distribution network failure isolation and quick power service restoration decision support system

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104538336A (en) * 2015-01-07 2015-04-22 海太半导体(无锡)有限公司 Alarm recognizing and processing system and method for semiconductor encapsulation equipment
CN106161079A (en) * 2015-04-28 2016-11-23 小米科技有限责任公司 Fault feedback method and device
CN105262616A (en) * 2015-09-21 2016-01-20 浪潮集团有限公司 Failure repository-based automated failure processing system and method
CN106998256A (en) * 2016-01-22 2017-08-01 腾讯科技(深圳)有限公司 A kind of communication failure localization method and server
CN107122254A (en) * 2016-02-25 2017-09-01 阿里巴巴集团控股有限公司 A kind of computer repairs control method and system, restorative procedure and system
CN107122254B (en) * 2016-02-25 2020-08-21 阿里巴巴集团控股有限公司 Computer repair control method and system and repair method and system
CN106383760A (en) * 2016-09-19 2017-02-08 郑州云海信息技术有限公司 Computer fault management method and apparatus
CN108429629A (en) * 2017-02-14 2018-08-21 腾讯科技(深圳)有限公司 Equipment fault restoration methods and device
CN108121642A (en) * 2017-12-20 2018-06-05 维沃移动通信有限公司 A kind of failure solves method, server and mobile terminal
CN110095144B (en) * 2018-01-30 2021-07-09 中电长城(长沙)信息技术有限公司 Method and system for identifying local fault of terminal equipment
CN110095144A (en) * 2018-01-30 2019-08-06 中电长城(长沙)信息技术有限公司 A kind of terminal device local fault recognition method and system
CN108540308A (en) * 2018-03-02 2018-09-14 中国银行股份有限公司 A kind of windows application platform fault self-recovery system and methods based on SCOM
CN108833187A (en) * 2018-06-29 2018-11-16 上海瀚之友信息技术服务有限公司 A kind of document self-cure monitoring system and method
CN109034415A (en) * 2018-07-20 2018-12-18 郑州云海信息技术有限公司 A kind of fault handling method of self study, apparatus and system
CN109669402A (en) * 2018-09-25 2019-04-23 平安普惠企业管理有限公司 Abnormality monitoring method, unit and computer readable storage medium
CN109597397A (en) * 2018-11-28 2019-04-09 北京星航机电装备有限公司 A kind of fault diagnosis system and method based on ForceControl configuration software
CN111297243A (en) * 2019-05-09 2020-06-19 湖北润格卫浴股份有限公司 Intelligent closestool cover operation master control fault processing system based on Internet of things
CN111355610A (en) * 2020-02-25 2020-06-30 网宿科技股份有限公司 Exception handling method and device based on edge network
CN113179180A (en) * 2021-04-23 2021-07-27 杭州安恒信息技术股份有限公司 Basalt client disaster fault repairing method, basalt client disaster fault repairing device and basalt client disaster storage medium
CN113572637A (en) * 2021-07-16 2021-10-29 中盈优创资讯科技有限公司 Network fault automatic preprocessing method and device
CN114002543A (en) * 2021-09-27 2022-02-01 中盈优创资讯科技有限公司 STN relay quality difference circuit autonomous method and device
CN114002543B (en) * 2021-09-27 2024-01-05 中盈优创资讯科技有限公司 Autonomous method and device for STN relay quality difference circuit

Similar Documents

Publication Publication Date Title
CN103166778A (en) Method and device for automatically and intelligently processing malfunction
CN108833188B (en) Alarm information management method, device, equipment and storage medium
CN105159964B (en) A kind of log monitoring method and system
CN202282837U (en) Video quality diagnosis system
CN109308252B (en) Fault positioning processing method and device
CN102750350B (en) Monitoring system and method
CN103490917B (en) The detection method of troubleshooting situation and device
CN107332722A (en) The method for removing and system of a kind of fault message
CN102638378B (en) Mass storage system monitoring method integrating heterogeneous storage devices
CN103116531A (en) Storage system failure predicting method and storage system failure predicting device
CN104935456B (en) The alarm information transmission of communication network warning system and processing method
CN101197621A (en) Method and system for remote diagnosing and locating failure of network management system
CN103475696A (en) System and method for monitoring state of cloud computing cluster server
CN111143167B (en) Alarm merging method, device, equipment and storage medium for multiple platforms
JP3085844B2 (en) Fault indication method in centralized monitoring system
CN111327685A (en) Data processing method, device and equipment of distributed storage system and storage medium
CN104065526A (en) Server fault alarming method and device thereof
CN104753712A (en) Alarming report method, alarming report node and alarming report system
CN110659147B (en) Self-repairing method and system based on module self-checking behavior
CN102457400B (en) Method for preventing split brain phenomenon from occurring on distributed replicated block device (DRBD) resource
CN103714060A (en) Interrupt-period historical data processing method and front-end collecting sub system equipment
CN101247265A (en) Alarm processing method, device and system
CN102387210B (en) Distribution type file system monitoring method based on rapid synchronization network
CN113055203B (en) Method and device for recovering exception of SDN control plane
CN102377619A (en) Automatic detecting and processing method for communication abnormality of simple network management protocol (SNMP) agent

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130619