CN104932978A - System running fault self-detection and self-recovery method and system - Google Patents

Info

Publication number
CN104932978A
Authority
CN
China
Prior art keywords
data information
software
monitored
unit
alarm
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510365115.5A
Other languages
Chinese (zh)
Other versions
CN104932978B (en)
Inventor
李文强
张永彬
查一昆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Astronavigation Age Development In Science And Technology Co Ltd
Original Assignee
Beijing Astronavigation Age Development In Science And Technology Co Ltd
Application filed by Beijing Astronavigation Age Development In Science And Technology Co Ltd
Priority to CN201510365115.5A
Publication of CN104932978A
Application granted
Publication of CN104932978B
Legal status: Active

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a method and system for system running fault self-detection and self-recovery. The method comprises the following steps: acquiring data information of monitored equipment; filtering out valid data information and storing it; filtering out the valid data information that requires an alarm according to a preset alarm policy, and recording the alarm; and recovering from the abnormality according to the valid data information that requires an alarm. The acquisition step comprises a sub-step of acquiring running data information of the monitored equipment software and a sub-step of acquiring abnormal data information of the monitored equipment software. The technical scheme has the following beneficial effects: the state parameters of multiple pieces of software can be processed centrally and managed uniformly, which effectively reduces the amount of operation; each data storage unit can support data acquisition from multiple pieces of software, which broadens the compatibility of the system; and the running of the monitored software can be controlled according to the abnormal data information that requires an alarm and the software running information, so that abnormalities are resolved and the function of the monitored software is recovered.

Description

A method and system for system running fault self-detection and self-recovery
Technical field
The present invention relates to the field of communication monitoring, and in particular to a method and system for system running fault self-detection and self-recovery.
Background
In the current information age, information is processed by software systems running on computers. The server equipment of most systems is located in server rooms or in field environments, and maintenance personnel must visit each site periodically to check the state of the software running on the equipment. This manual monitoring not only consumes considerable manpower, it also fails to capture abnormal information about the running software in time, which can easily lead to significant losses.
In response, engineers have developed monitoring systems that replace manual checks of the software running on the equipment. Existing monitoring systems still have a shortcoming:
Existing monitoring systems monitor the threads of the monitored software, which is effective for tracking basic information about whether the software is running. However, running software occasionally enters a "hung" state, in which its threads appear normal but the software no longer performs its function. Existing monitoring systems cannot identify this "hung" state, let alone solve the technical problem that the monitored software loses its function when it hangs.
In view of this, the present invention is proposed.
Summary of the invention
The technical problem to be solved by the present invention is to overcome the deficiencies of the prior art and to provide a method and system for system operation and maintenance that effectively identifies the "hung" state and achieves better monitoring.
To solve the above technical problem, the basic concept of the technical solution adopted by the present invention is as follows:
A method for system running fault self-detection and self-recovery, characterized in that it comprises:
S1, collecting data information of monitored equipment;
S2, filtering valid data information out of the collected data information according to a preset filtering policy, and storing it;
S3, filtering out the valid data information that requires an alarm according to a preset alarm policy, raising the alarm and recording it; and
S4, controlling the monitored software to restart or start according to the valid data information that requires an alarm;
wherein step S1 comprises a sub-step S12 of collecting running data information of the monitored equipment software and a sub-step S13 of collecting abnormal data information of the monitored equipment software;
Step S12 comprises:
S121, calling the task manager of the monitored equipment;
S122, recording the start time point, close time point and running time of the monitored software according to its process in the task manager, and generating the running data information;
Step S13 comprises:
S131, calling the task manager of the monitored equipment;
S132, determining the process of the monitored software in the task manager according to the configured software name and address;
S133, monitoring the corresponding ini file according to the name and address of the monitored software, and generating the abnormal data information.
In the above method for system running fault self-detection and self-recovery, the valid running data information that requires an alarm is filtered in step S3 as follows:
the valid running data information is retrieved and compared with a preset value; when the deviation exceeds a threshold, this valid running data information is valid running data information that requires an alarm;
in step S4, when valid running data information that requires an alarm occurs, a command of the task manager of the monitored equipment is called to start the monitored software.
In the above method for system running fault self-detection and self-recovery, the valid abnormal data information that requires an alarm is filtered in step S3 as follows:
the valid abnormal data information is retrieved;
the time point of the most recent change to the ini file content, relative to the collection time point of this abnormal data information, is determined;
when the interval between that time point and the collection time point exceeds a threshold,
the corresponding valid running data information is retrieved to determine the running state of the monitored software at the collection time; if the monitored software is running, this valid abnormal data information is valid abnormal data information that requires an alarm;
in step S4, when valid abnormal data information that requires an alarm occurs, a command of the task manager of the monitored equipment is called to restart the monitored software.
In the above method for system running fault self-detection and self-recovery, step S1 further comprises a sub-step S11 of collecting connection data information between the monitored equipment and its external equipment;
S11 comprises:
S111, calling the Ping command of the monitored equipment to send an ICMP message to the external equipment connected to the monitored equipment;
S112, obtaining the content of the ICMP echo reply, and generating the connection data information.
A system for running fault self-detection and self-recovery, comprising a running state acquisition unit, an abnormal data monitoring unit, a data storage unit and a running software restart unit;
the running state acquisition unit is configured to collect running data information of the software in the monitored equipment;
the abnormal data monitoring unit is configured to collect abnormal data information that arises while designated software of the monitored equipment is running;
the data storage unit is configured to obtain the running data information and the abnormal data information, and to process, parse and store them;
the running software restart unit is configured to restart the monitored software according to the running data information and the abnormal data information, and to control the running state of the monitored software according to a preset time.
The above system for running fault self-detection and self-recovery further comprises a connection state acquisition unit;
the connection state acquisition unit is configured to obtain connection data information between the monitored equipment and the peripherals connected to it; this connection data information is also obtained by the data storage unit.
The above system for running fault self-detection and self-recovery further comprises a data query and export unit;
the data query and export unit is configured to aggregate information data and supports exporting data by time range.
The above system for running fault self-detection and self-recovery further comprises a first network communication unit and a second network communication unit;
the first network communication unit is configured to obtain the running data information and the abnormal data information and pass them to the second network communication unit;
the second network communication unit is configured to receive the running data information and the abnormal data information and pass them to the data storage unit.
The above system for running fault self-detection and self-recovery further comprises a communication detection unit;
the communication detection unit is configured to detect the connection state between the first network communication unit and the second network communication unit.
In the above system for running fault self-detection and self-recovery, the running state acquisition unit, the abnormal data monitoring unit, the first network communication unit and the running software restart unit are arranged at the client; the data storage unit and the second network communication unit are arranged at the server; and the server identifies data information from different clients by the IP address and port of each client.
With the above technical solution, the present invention has the following beneficial effects compared with the prior art:
1. Thread monitoring is combined with configuration file monitoring, which effectively solves the problem that the software "hung" phenomenon cannot be monitored;
2. The running of the monitored software can be controlled according to the abnormal data information that requires an alarm and the software running information, so that abnormalities are resolved and the function of the monitored software is recovered;
3. The state parameters of multiple pieces of software can be processed centrally and managed uniformly, which effectively reduces the amount of operation;
4. A connection state monitoring function is provided, which ensures that monitoring works throughout the whole process;
5. Each data storage unit can support the collection of data from multiple pieces of software, which effectively broadens the compatibility of the system.
Brief description of the drawings
Fig. 1 is a structural block diagram of the system for running fault self-detection and self-recovery of the present invention.
Fig. 2 is a block diagram of the method for running fault self-detection and self-recovery of the present invention.
In the above drawings: 1, client; 2, server; 3, configuration module.
Detailed description
The invention is further described below with reference to the drawings and specific embodiments, to help in understanding its content.
As shown in Fig. 1, the invention provides a system for running fault self-detection and self-recovery, comprising a running state acquisition unit, an abnormal data monitoring unit, a data storage unit, a running software restart unit, a connection state acquisition unit, a data query and export unit, a first network communication unit, a second network communication unit and a communication detection unit;
The running state acquisition unit is configured to collect running data information of the software in the monitored equipment. It calls the task manager of the monitored equipment and obtains the running data information of the monitored software from the process of that software in the task manager. This basic running state information comprises the software start time, close time and running time.
The abnormal data monitoring unit is configured to collect abnormal data information that arises while designated software of the monitored equipment is running. Because the monitored software periodically writes content to a fixed ini file, the abnormal data monitoring unit monitors the corresponding ini file according to the address and name of the monitored software and derives the abnormal data information from changes in the ini file content. Combined with the corresponding running data information obtained by the running state acquisition unit, this makes it possible to judge whether the monitored software is in the "hung" state.
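The ini-file monitoring described above can be sketched as follows. This is only an illustration under stated assumptions: the patent does not say how a content change is detected, so the sketch compares SHA-256 digests of successive samples, and the class name `IniChangeTracker` and its method are hypothetical.

```python
import hashlib


class IniChangeTracker:
    """Remember when the content of a monitored ini file last changed.

    Assumption: a change is detected by comparing content digests; the
    patent only states that content changes of the ini file are monitored.
    """

    def __init__(self):
        self._last_digest = None
        self.last_change_at = None

    def observe(self, content, observed_at):
        """Record one sample of the file content (bytes); return True if it changed."""
        digest = hashlib.sha256(content).hexdigest()
        changed = digest != self._last_digest
        if changed:
            self._last_digest = digest
            self.last_change_at = observed_at
        return changed
```

A caller would read the ini file in binary mode at each collection cycle and pass the bytes together with the collection time; `last_change_at` then gives the time point that the alarm policy compares against the collection time.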
The data storage unit is configured to obtain the running data information and the abnormal data information, and to process, parse and store them. After receiving data, the data storage unit judges according to a preset standard whether the data is valid. Invalid data is discarded; for valid data, the packet content is parsed to obtain the software running data information and the abnormal data information, which are stored separately according to their content. The criterion for valid data is configured as needed; for example, verification data with no practical meaning can be regarded as invalid data.
The connection state acquisition unit is configured to obtain connection data information between the monitored equipment and the peripherals (external equipment) connected to it; this connection data information is also obtained by the data storage unit. The connection state acquisition unit periodically calls the Ping command (Packet Internet Groper) of the monitored equipment's system to send an ICMP (Internet Control Message Protocol) message to the external equipment, and judges from the content of the ICMP echo reply whether the connection between the equipment and the connected peripheral is normal. If the connection is normal, the echo reply reports a normal network delay; if the connection is abnormal, the Ping command returns the error Request Timed Out or Destination Host Unreachable. From the information returned by the Ping command, the connection state acquisition unit judges the connection state between the monitored equipment and its peripherals and generates the connection data information.
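A minimal sketch of this Ping-based check, assuming the system `ping` command is available on the path. The function names and the state labels are hypothetical; the only facts taken from the text are that a normal reply indicates a working connection and that the error messages are Request Timed Out or Destination Host Unreachable. The exact output wording of `ping` differs between operating systems, so the text matching here is a simplification.

```python
import platform
import subprocess


def build_ping_command(host, count=1):
    """Build a ping command line; the count flag is -n on Windows and -c elsewhere."""
    flag = "-n" if platform.system().lower() == "windows" else "-c"
    return ["ping", flag, str(count), host]


def classify_ping(returncode, output):
    """Map the exit code and text output of ping to a connection state label."""
    text = output.lower()
    # Check the text first: some systems exit with 0 even when the reply
    # comes from a router reporting that the destination is unreachable.
    if "unreachable" in text:
        return "unreachable"
    if "timed out" in text:
        return "timeout"
    return "ok" if returncode == 0 else "error"


def check_connection(host):
    """Run one ping and return a connection-data record for the host."""
    result = subprocess.run(build_ping_command(host), capture_output=True, text=True)
    return {"host": host, "state": classify_ping(result.returncode, result.stdout)}
```

Keeping the classification separate from the subprocess call means the policy can be exercised with recorded output, without network access.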
The data query and export unit is configured to aggregate information data and supports exporting data by time range. At the user's request, the data query and export unit reads the valid data in the data storage unit and exports it to an Excel spreadsheet by year, by month or by a specified time range.
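The time-range export can be illustrated with the sketch below. It writes CSV, which Excel can open, rather than a native Excel file; that substitution, the record fields (`timestamp`, `software`, `kind`, `detail`) and the function name are all assumptions made for the example.

```python
import csv
from datetime import datetime


def export_range(records, start, end, out):
    """Write records whose timestamp falls in [start, end) to `out` as CSV.

    `records` is an iterable of dicts with hypothetical keys
    'timestamp' (datetime), 'software', 'kind' and 'detail'.
    Returns the number of exported rows.
    """
    writer = csv.writer(out)
    writer.writerow(["timestamp", "software", "kind", "detail"])
    count = 0
    for rec in records:
        if start <= rec["timestamp"] < end:
            writer.writerow([rec["timestamp"].isoformat(), rec["software"],
                             rec["kind"], rec["detail"]])
            count += 1
    return count
```

Exporting by year or by month then reduces to choosing `start` and `end` as the boundaries of that year or month.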
The first network communication unit is configured to obtain the running data information and the abnormal data information and pass them to the second network communication unit; the second network communication unit is configured to receive the running data information and the abnormal data information and pass them to the data storage unit. The first network communication unit transmits the obtained data information to the second network communication unit as data packets. The two network communication units are connected by TCP/IP (Transmission Control Protocol/Internet Protocol). The format of the data packet is:
To ensure the network connection between the first network communication unit and the second network communication unit, the communication detection unit periodically calls the Ping command of the second network communication unit to send an ICMP message to the first network communication unit, and judges from the content of the ICMP echo reply whether the connection between the two units is normal. If the connection is normal, the echo reply reports a normal network delay; if the connection is abnormal, the Ping command returns the error Request Timed Out or Destination Host Unreachable. When the connection is abnormal, the first and second network communication units start a disconnection-and-reconnection mechanism and re-establish the network connection. The network communication unit also generates data information about the network abnormality and stores it for later query.
In addition, to implement the alarm function and the configuration function, this embodiment further comprises an alarm unit and a basic configuration module;
The alarm unit is configured to obtain the valid data information in the data storage unit that requires an alarm. Each type of valid data information has its own preset filtering policy for deciding whether it requires an alarm. For example, the filtering policy for running data information is: the valid running data information is compared with a preset value, and when the deviation exceeds a threshold, this valid running data information is valid running data information that requires an alarm. As another example, the filtering policy for abnormal data information is: the valid abnormal data information is retrieved; the time point of the most recent change to the ini file content, relative to the collection time point of this abnormal data information, is determined; when the interval between that time point and the collection time point exceeds a threshold, the corresponding valid running data information is retrieved to determine the running state of the monitored software at the collection time; if the monitored software is running, this valid abnormal data information is valid abnormal data information that requires an alarm.
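The two filtering policies above can be sketched as plain predicates. The function and parameter names are hypothetical, times are assumed to be numeric timestamps in seconds, and the running-data comparison is assumed to be an absolute numeric deviation; the text does not fix the representation of the data.

```python
def running_data_needs_alarm(observed, expected, threshold):
    """Alarm when the observed value deviates from the preset value by more than the threshold."""
    return abs(observed - expected) > threshold


def abnormal_data_needs_alarm(collected_at, last_ini_change_at, threshold, software_running):
    """Alarm when the ini file has gone unchanged for longer than the threshold
    while the monitored software is reported as running (the "hung" case)."""
    stale = (collected_at - last_ini_change_at) > threshold
    return stale and software_running
```

For instance, with a 60-second threshold, an ini file last changed 100 seconds before collection triggers an alarm only if the running data shows the software as open; if the software is closed, the stale file is expected and no alarm is raised by this policy.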
The running software restart unit is configured to restart the monitored software according to the running data information and the abnormal data information, and to control the running state of the monitored software according to a preset time. For example, when the alarm unit obtains valid running data information that requires an alarm as described above (for instance, monitored software that should be running according to the preset value is closed), the running software restart unit receives the corresponding control signal, calls the task manager command of the monitored equipment and reopens the monitored software, thereby repairing the running fault. As another example, when the alarm unit obtains valid abnormal data information that requires an alarm as described above (for instance, monitored software is in the "hung" state), the running software restart unit calls the task manager command of the monitored equipment to restart the monitored software, thereby repairing its "hung" state. In addition, the running software restart unit can obtain the local time of the monitored equipment. When the difference between this local time and the most recent start time of the monitored software (obtained by the running state acquisition unit) is greater than a preset interval (configured as needed through the basic configuration module), the running software restart unit receives a control signal, calls the task manager command of the monitored equipment and closes the monitored software, thereby implementing timed closing.
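The three cases handled by the restart unit (start, restart, timed close) can be summarized as a decision function. This is a sketch only: the function name, the argument names and the order in which the cases are checked are assumptions, and the actual task manager commands are left out because the text does not specify them.

```python
def choose_action(should_be_running, is_running, hung, now, last_start, max_run_seconds):
    """Return 'start', 'restart', 'close' or 'none' for one monitored program.

    All parameter names are hypothetical. Times are numeric timestamps in
    seconds; max_run_seconds may be None to disable timed closing.
    """
    if should_be_running and not is_running:
        return "start"      # running-data alarm: software is closed but should be open
    if is_running and hung:
        return "restart"    # abnormal-data alarm: software is open but "hung"
    if is_running and max_run_seconds is not None and now - last_start > max_run_seconds:
        return "close"      # timed close after the preset interval
    return "none"
```

Separating the decision from the command that carries it out keeps the policy testable on its own.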
The basic configuration module comprises a user interface operation unit and a device management unit, and is used to configure the state parameters of the whole self-check system. These state parameters include the execution period of each functional unit, the filtering policy for each kind of data information, and so on.
Of the functional units above, the running state acquisition unit, abnormal data monitoring unit, running software restart unit, first network communication unit and connection state acquisition unit are located at the client 1, and the data storage unit, data query and export unit, second network communication unit and communication detection unit are located at the server 2. The second network communication unit at the server 2 identifies the first network communication unit of each client 1 by IP address and port number, so that when multiple clients 1 are connected to the same server 2, the data transmitted by each client 1 is not confused.
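One way to picture the server-side separation of clients is a store keyed by the (IP address, port) pair. The class below is a hypothetical sketch of that idea; it is not taken from the patent, which does not describe the server's data structures.

```python
from collections import defaultdict


class ClientRegistry:
    """Keep the data received from each client separate, keyed by (ip, port)."""

    def __init__(self):
        self._by_client = defaultdict(list)

    def record(self, ip, port, message):
        """Store one message under the client that sent it."""
        self._by_client[(ip, port)].append(message)

    def messages_from(self, ip, port):
        """Return a copy of the messages received from one client."""
        return list(self._by_client.get((ip, port), []))
```

Two clients on the same host but different ports, or on different hosts with the same port, map to different keys, so their data never lands in the same list.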
Fig. 2 shows the flow of the method for running fault self-detection and self-recovery of the present invention. With reference to the above system to which the method is applied, the concrete steps of the invention are:
S1, collecting data information of monitored equipment;
S2, filtering valid data information out of the collected data information according to a preset filtering policy, and storing it;
S3, filtering out the valid data information that requires an alarm according to a preset alarm policy, raising the alarm and recording it; and
S4, controlling the monitored software to restart or start according to the valid data information that requires an alarm;
Step S1 comprises a sub-step S11 of collecting connection data information between the monitored equipment and its external equipment, a sub-step S12 of collecting running data information of the monitored equipment software, and a sub-step S13 of collecting abnormal data information of the monitored equipment software;
Step S11 comprises:
S111, the connection state acquisition unit calls the Ping command of the monitored equipment to send an ICMP message to the external equipment connected to the monitored equipment;
S112, obtaining the content of the ICMP echo reply, and generating the connection data information.
Step S11 is executed cyclically at a preset period. Once the connection state between the monitored equipment and its external equipment does not match the preset state, the operation and maintenance self-check system of this embodiment stops automatically and raises an alarm.
Step S12 comprises:
S121, the running state acquisition unit calls the task manager of the monitored equipment;
S122, recording the start time point, close time point and running time of the monitored software according to its process in the task manager, and generating the running data information;
Step S13 comprises:
S131, the abnormal data monitoring unit calls the task manager of the monitored equipment;
S132, determining the process of the monitored software in the task manager according to the configured software name and address;
S133, monitoring the corresponding ini file according to the name and address of the monitored software, and generating the abnormal data information.
In step S2, the data storage unit filters valid running data information, valid abnormal data information and valid connection data information out of the data information obtained in steps S11, S12 and S13 according to the preset policy, and stores each separately.
In step S3, the valid running data information that requires an alarm is filtered as follows:
the valid running data information is retrieved and compared with a preset value; when the deviation exceeds a threshold, this valid running data information is valid running data information that requires an alarm.
In step S4, when valid running data information that requires an alarm occurs, the running software restart unit receives a control signal and calls a command of the task manager of the monitored equipment to restart the monitored software.
In step S3, the valid abnormal data information that requires an alarm is filtered as follows:
the valid abnormal data information is retrieved; the time point of the most recent change to the ini file content, relative to the collection time point of this abnormal data information, is determined; when the interval between that time point and the collection time point exceeds a threshold, the corresponding valid running data information is retrieved to determine the running state of the monitored software at the collection time; if the monitored software is running, this valid abnormal data information is valid abnormal data information that requires an alarm.
In step S4, when valid abnormal data information that requires an alarm occurs, the running software restart unit receives a control signal and calls a command of the task manager of the monitored equipment to restart the monitored software.
The present invention combines thread monitoring with configuration file monitoring, which effectively solves the problem that the software "hung" phenomenon cannot be monitored. The running of the monitored software can be controlled according to the abnormal data information that requires an alarm and the software running information, so that abnormalities are resolved and the function of the monitored software is recovered. The state parameters of multiple pieces of software can be processed centrally and managed uniformly in the system of the invention, which effectively reduces the amount of operation. The system also has a connection state monitoring function, so abnormal conditions of the self-check system itself can be detected in time, which more fully ensures the reliability of monitoring.
The above is only a preferred embodiment of the present invention. It should be noted that those skilled in the art can make improvements and modifications without departing from the principle of the invention, and such improvements and modifications shall also be regarded as falling within the protection scope of the present invention.

Claims (10)

1. A method for system running fault self-detection and self-recovery, characterized in that it comprises:
S1, collecting data information of monitored equipment;
S2, filtering valid data information out of the collected data information according to a preset filtering policy, and storing it;
S3, filtering out the valid data information that requires an alarm according to a preset alarm policy, raising the alarm and recording it; and
S4, controlling the monitored software to restart or start according to the valid data information that requires an alarm;
wherein step S1 comprises a sub-step S12 of collecting running data information of the monitored equipment software and a sub-step S13 of collecting abnormal data information of the monitored equipment software;
Step S12 comprises:
S121, calling the task manager of the monitored equipment;
S122, recording the start time point, close time point and running time of the monitored software according to its process in the task manager, and generating the running data information;
Step S13 comprises:
S131, calling the task manager of the monitored equipment;
S132, determining the process of the monitored software in the task manager according to the configured software name and address;
S133, monitoring the corresponding ini file according to the name and address of the monitored software, and generating the abnormal data information.
2. The method for system running fault self-detection and self-recovery according to claim 1, characterized in that the valid running data information that requires an alarm is filtered in step S3 as follows:
the valid running data information is retrieved and compared with a preset value; when the deviation exceeds a threshold, this valid running data information is valid running data information that requires an alarm;
in step S4, when valid running data information that requires an alarm occurs, a command of the task manager of the monitored equipment is called to start the monitored software.
3. The method for system running fault self-detection and self-recovery according to claim 1, characterized in that the valid abnormal data information that requires an alarm is filtered in step S3 as follows:
the valid abnormal data information is retrieved;
the time point of the most recent change to the ini file content, relative to the collection time point of this abnormal data information, is determined;
when the interval between that time point and the collection time point exceeds a threshold,
the corresponding valid running data information is retrieved to determine the running state of the monitored software at the collection time; if the monitored software is running, this valid abnormal data information is valid abnormal data information that requires an alarm;
in step S4, when valid abnormal data information that requires an alarm occurs, a command of the task manager of the monitored equipment is called to restart the monitored software.
4., according to the method for the arbitrary described system cloud gray model automatic fault selftesting of claim 1-3 and selfreparing, it is characterized in that, described step S1 also comprises the sub-step S11 gathering monitored equipment and its external equipment connection data information;
S11 comprises:
S111, the communication protocol Ping order of calling monitored equipment send an ICMP to the external equipment be connected with this monitored equipment;
The ICMP echo content of S112, acquisition; Generate connection data information.
5. A system for running fault self-detection and self-recovery, characterized in that it comprises a running state acquisition unit, an abnormal data monitoring unit, a data storage unit and a running software restart unit;
the running state acquisition unit is configured to collect running data information of the software in the monitored equipment;
the abnormal data monitoring unit is configured to collect abnormal data information that arises while designated software of the monitored equipment is running;
the data storage unit is configured to obtain the running data information and the abnormal data information, and to process, parse and store them;
the running software restart unit is configured to restart the monitored software according to the running data information and the abnormal data information, and to control the running state of the monitored software according to a preset time.
6. the system of operation troubles Autonomous test according to claim 5 and selfreparing, is characterized in that, also comprises connection status acquiring unit;
Described connection status acquiring unit, for obtaining the connection data information between peripheral hardware that monitored equipment is connected with it; This connection data information is also obtained by described data storage cell.
7. the system of operation troubles Autonomous test according to claim 5 and selfreparing, is characterized in that, also comprises data query lead-out unit;
Described data query lead-out unit, for gathering information data, supports that temporally scope carries out statistical conversion.
8., according to the system of the arbitrary described operation troubles Autonomous test of claim 5-7 and selfreparing, it is characterized in that, also comprise first network communication unit and second network communication unit;
Described first network communication unit, for obtaining service data information and abnormal data information, and the second network communication unit passed to;
Described second network communication unit, for receiving service data information and abnormal data information and passing to described data storage cell.
9. the system of operation troubles Autonomous test according to claim 8 and selfreparing, is characterized in that, also comprises communication check unit;
Communication check unit, for detecting the connection state information of first network communication unit and second network communication unit.
10. the system of operation troubles Autonomous test according to claim 8 and selfreparing, it is characterized in that, described running status acquiring unit, described abnormal data monitoring means, first network communication unit and described operating software are restarted unit and are arranged at client, described data storage cell and second network communication unit are arranged at service end, and described service end is by the IP address of client and the port identification data message from different clients.
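As an illustration of how a server might tell clients apart by IP address and port, as claim 10 describes, the sketch below groups incoming messages under an `(ip, port)` key. The class name `DataStore`, its methods and the message fields are hypothetical and are not taken from the patent; with Python's standard `socket` module, the address returned by `accept()` on an IPv4 TCP socket is such an `(ip, port)` tuple.

```python
from collections import defaultdict


class DataStore:
    """Server-side store that groups messages by client (ip, port)."""

    def __init__(self):
        self._by_client = defaultdict(list)

    def save(self, client_addr, kind, payload):
        """Store one message.

        client_addr: (ip, port) tuple identifying the sending client.
        kind: "running" or "abnormal" (hypothetical labels).
        """
        ip, port = client_addr
        self._by_client[(ip, port)].append({"kind": kind, "payload": payload})

    def messages_from(self, ip, port):
        """Return a copy of all messages received from one client."""
        return list(self._by_client.get((ip, port), []))
```

Two clients on the same host are still distinguished because their source ports differ.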
CN201510365115.5A 2015-06-29 2015-06-29 Method and system for automatic self-detection and self-recovery of system running faults Active CN104932978B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510365115.5A CN104932978B (en) 2015-06-29 2015-06-29 Method and system for automatic self-detection and self-recovery of system running faults

Publications (2)

Publication Number Publication Date
CN104932978A true CN104932978A (en) 2015-09-23
CN104932978B CN104932978B (en) 2018-04-13

Family

ID=54120150

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510365115.5A Active CN104932978B (en) 2015-06-29 2015-06-29 Method and system for automatic self-detection and self-recovery of system running faults

Country Status (1)

Country Link
CN (1) CN104932978B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675315B1 (en) * 2000-05-05 2004-01-06 Oracle International Corp. Diagnosing crashes in distributed computing systems
CN101201786A (en) * 2006-12-13 2008-06-18 中兴通讯股份有限公司 Method and device for monitoring fault log
CN101873616A (en) * 2010-06-21 2010-10-27 宇龙计算机通信科技(深圳)有限公司 Mobile terminal self-check method and system and mobile terminal
CN101883026A (en) * 2010-07-05 2010-11-10 优视科技有限公司 Method for maintaining data acquisition system
CN102387040A (en) * 2011-11-01 2012-03-21 深圳市航天泰瑞捷电子有限公司 Method and system for keeping high-speed stable running of front-end processor
CN102638378A (en) * 2012-02-22 2012-08-15 中国人民解放军国防科学技术大学 Mass storage system monitoring method integrating heterogeneous storage devices
CN102739452A (en) * 2012-06-28 2012-10-17 浪潮(北京)电子信息产业有限公司 Method and system for monitoring resources
CN103166773A (en) * 2011-12-09 2013-06-19 国家电网公司 Method and system for monitoring operation state of server
CN103544091A (en) * 2013-10-31 2014-01-29 北京国双科技有限公司 Method and device for monitoring Windows process
CN104461830A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for monitored progress

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106384297A (en) * 2016-09-09 2017-02-08 深圳市汇拓新邦科技有限公司 Method and system for operating and maintaining photovoltaic power generation
CN107122685A (en) * 2017-04-27 2017-09-01 国信优易数据有限公司 A kind of big data method for secure storing and equipment
CN111147818A (en) * 2019-12-29 2020-05-12 航天信息股份有限公司 Grain depot video monitoring method and system
CN111240949A (en) * 2020-01-13 2020-06-05 奇安信科技集团股份有限公司 Method and device for determining software use frequency in domestic operating system
CN111240949B (en) * 2020-01-13 2024-04-26 奇安信科技集团股份有限公司 Method and device for determining software use frequency in domestic operating system
CN111474942A (en) * 2020-05-09 2020-07-31 烟台市地摩动力科技有限公司 Fault self-checking method and system of intelligent transportation device
CN111694687A (en) * 2020-06-05 2020-09-22 中国第一汽车股份有限公司 Vehicle software fault detection method, device, equipment and storage medium
CN113886213A (en) * 2020-06-29 2022-01-04 腾讯科技(深圳)有限公司 Program data processing method, device, computer readable storage medium and equipment
CN111708613A (en) * 2020-08-18 2020-09-25 广东睿江云计算股份有限公司 Method and system for repairing boot failure card task of VM virtual machine
CN111708613B (en) * 2020-08-18 2020-12-11 广东睿江云计算股份有限公司 Method and system for repairing boot failure card task of VM virtual machine
CN112149823A (en) * 2020-08-20 2020-12-29 汉威科技集团股份有限公司 Combined implementation method for filtering alarm information
CN112104508A (en) * 2020-09-23 2020-12-18 沈阳奥普泰光通信有限公司 Intelligent fault monitoring and self-repairing method for network data acquisition equipment, storage medium and computer equipment
CN112104508B (en) * 2020-09-23 2023-04-18 辽宁奥普泰通信股份有限公司 Intelligent fault monitoring and self-repairing method for network data acquisition equipment, storage medium and computer equipment

Also Published As

Publication number Publication date
CN104932978B (en) 2018-04-13

Similar Documents

Publication Publication Date Title
CN104932978A (en) System running fault self-detection and self-recovery method and system
US7213179B2 (en) Automated and embedded software reliability measurement and classification in network elements
CN105119767A (en) Data self-check and self-cleaning software operation state monitoring method and system
CN107995049B (en) Cross-region synchronous fault monitoring method, device and system for power safety region
CN103200050B (en) The hardware state monitoring method and system of server
CN101345663B (en) Heartbeat detection method and heartbeat detection apparatus
CA2493525C (en) Method and apparatus for outage measurement
CN105099762B (en) A kind of self checking method and self-checking system of system O&M function
CN107632918B (en) Monitoring system and method for computing storage equipment
CN106789306B (en) Method and system for detecting, collecting and recovering software fault of communication equipment
CN105610648B (en) A kind of acquisition method and server of O&M monitoring data
CN101197621A (en) Method and system for remote diagnosing and locating failure of network management system
US20060085680A1 (en) Network monitoring method and apparatus
CN111698127A (en) System, method and device for monitoring state of equipment in network
CN111953542B (en) System for guaranteeing stable operation of gateway
CN114039900A (en) Efficient network data packet protocol analysis method and system
CN108174400B (en) Data processing method, system and equipment of terminal equipment
CN103634166B (en) Equipment survival detection method and equipment survival detection device
CN113676723B (en) Non-homologous network video monitoring fault positioning method and device based on Internet of things
CN109547271B (en) Network state real-time monitoring alarm system based on big data
CN105490847A (en) Real-time detecting and processing method of node failure in private cloud storage system
CN106897189A (en) A kind of daily record monitoring system based on data real time propelling movement
CN110138628B (en) Real-time diagnosis and recovery method and device for network fault of camera and camera
CN108174398B (en) Data processing method, system and equipment of terminal equipment
CN110858813A (en) Network camera safety detection method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant