CN104270268B - A kind of distributed system network performance evaluation and method for diagnosing faults - Google Patents

A kind of distributed system network performance evaluation and method for diagnosing faults Download PDF

Info

Publication number
CN104270268B
CN104270268B CN201410508685.0A CN201410508685A CN104270268B CN 104270268 B CN104270268 B CN 104270268B CN 201410508685 A CN201410508685 A CN 201410508685A CN 104270268 B CN104270268 B CN 104270268B
Authority
CN
China
Prior art keywords
network
node
management service
distributed system
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410508685.0A
Other languages
Chinese (zh)
Other versions
CN104270268A (en
Inventor
张攀勇
彭成
季旻
苗艳超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHINESE CORPORATION DAWNING INFORMATION INDUSTRY CHENGDU CO., LTD.
Dawning Information Industry Co Ltd
Original Assignee
Dawning Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dawning Information Industry Co Ltd filed Critical Dawning Information Industry Co Ltd
Priority to CN201410508685.0A priority Critical patent/CN104270268B/en
Publication of CN104270268A publication Critical patent/CN104270268A/en
Application granted granted Critical
Publication of CN104270268B publication Critical patent/CN104270268B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention provides a kind of distributed system network performance evaluation and method for diagnosing faults, comprises the following steps:Monitoring service is disposed in monitored distributed system;According to the feature of distributed system, operational management service;Carry out discovering network topology;Determine the monitor node set of monitored node;Management service collector node status information is simultaneously analyzed;Network performance detects;Network state is analyzed, and determines failure that may be present.The present invention take into account the state of the all-network equipment and link that participate on communication path, communication performance between node, according to network topological information, it can analyze and the particular location of the localization of faults, the precision of fault detect is improved, reduces the expense of fault detect.Simultaneously for the performance evaluation of distributed system, this method can provide the actual performance between distributed system node, rather than the theoretical performance of network system, it is possible to increase the precision of Performance Prediction.

Description

A kind of distributed system network performance evaluation and method for diagnosing faults
Technical field
The present invention relates to a kind of diagnostic method, and in particular to a kind of distributed system network performance evaluation and fault diagnosis side Method.
Background technology
Distributed system is referred to establishing on network system, and each different node is passed through into the message between node One or more services are completed in communication, cooperation.Because service is distributed to different nodes, therefore distributed system by distributed system System is with good expansibility, Fault Isolation, and application transparency.Widely should it be obtained in the IT system of reality With typical service is distributed formula file system, distributed data base, website service etc..
Each service node is interconnected together because distributed system relies on the network equipment, the performance of the network equipment and steady Qualitative performance and stability to distributed system serves conclusive effect.With the expansion of distributed system scale, make The scale of network, device type are obtained, the connected mode of equipment becomes extremely complex, can be direct if some equipment break down Have influence on the quality of top service.How efficient fault diagnosis and performance evaluation carried out to network system by instrument, had Very important meaning.
For current fault diagnosis mechanism, it is divided into hardware fault diagnosis mechanism and Software Testing Tool.
Hardware fault diagnosis mechanism includes the performance counter provided on the network equipment, there is provided various performances and failure count Device, including messaging, abandon message, hardware error message etc. and count, be able to detect that hardware device is by these countings It is no exception to be present.
Software Testing Tool calculates the network delay and band of point-to-point by carrying out the information receiving and transmitting of point-to-point on one's own initiative Width, and then judge that network whether there is failure.Typical testing tool has Iperf, netperf etc..
The problem of existing distributed system network performance evaluation and fault diagnosis are present following aspects:
● breakdown judge source is simple:Hardware counter can only detect the source of trouble of hardware in itself, can not be for network link The failure such as state, software protocol layer mistake judged;The net that software point-to-point testing tool can only be tested between two points Network performance, it can not quickly judge network failure by data.
● keeper participates in by hand:The various possible situations of keeper's manual test are needed, and may be deposited according to interpretation of result Which kind of handled in failure.As network size caused by distributed system popularization is huge, it is necessary to failure diagnosis tool Simplify and the possible breakdown point of overall network is quickly provided, be easy to keeper to carry out the judgement and exclusion of failure.
The content of the invention
In order to overcome the above-mentioned deficiencies of the prior art, the present invention provides a kind of distributed system network performance evaluation and failure Diagnostic method, it is contemplated that participate in the state of the all-network equipment and link on communication path, the communication performance between node, According to network topological information, the particular location of the simultaneously localization of faults can be analyzed, the precision of fault detect is improved, reduces event Hinder the expense of detection.
Simultaneously for the performance evaluation of distributed system, this method can provide the actual property between distributed system node Can, rather than the theoretical performance of network system, it is possible to increase the precision of Performance Prediction.
In order to realize foregoing invention purpose, the present invention adopts the following technical scheme that:
The present invention, which provides a kind of distributed system network performance evaluation and method for diagnosing faults, methods described, includes following step Suddenly:
Step 1:Monitoring service is disposed in monitored distributed system;
Step 2:According to the feature of distributed system, operational management service;
Step 3:Carry out discovering network topology;
Step 4:Determine the monitor node set of monitored node;
Step 5:Management service collector node status information is simultaneously analyzed;
Step 6:Network performance detects;
Step 7:Network state is analyzed, and determines failure that may be present.
In the step 1, according to monitored distributed system scale, monitored node is determined, and in monitored node Upper deployment monitoring service;The monitored node is defined as node where monitored service is needed in distributed system, including Server and the network equipment etc..
The network state of node where monitoring service is responsible for monitoring, including the hardware state of network interface card and operating system provide Performance count information etc.;
Monitoring service receives the order of management service and execution, and order includes network detection order and applied in network performance test life Order;
The network detection order that monitoring service is sent according to management service, carry out network detection;And sent out according to management service The applied in network performance test order gone out, carry out the applied in network performance test between node.
In the step 2, the operational management service in management node, management service is according to distributed system feature, selection Monitored node, start monitoring service, and be connected with the monitoring service on monitored node.
The connected mode of management service and monitoring service is depending on the scale of distributed system:
For small-scale distributed system, management service is directly connected with all monitoring services;
For large scale distributed system, management service is connected using tree hierarchy mode, i.e. tension management service management The management service of different subregions, the node and network of a single partition management service management configuration quantity.
In the step 3, management service initiates discovering network topology to the all-network equipment of distributed system, to determine Network topological information, and by network topology information storage into management service;If the network equipment residing for distributed system Topology Discovery is not supported, then the topological arrangement provided according to keeper builds network topological information.
In the step 4, monitored node supports following three kinds of monitor modes:
(1) total system scan mode:All nodes and the network equipment of distributed system are scanned, then monitor node Collection is combined into all nodes of internal system and the network equipment;
(2) keeper's specific mode:Keeper is by configuring specified monitor node set;
(3) application program is specified, monitoring set scan mode during failure:Application program specifies monitor node collection by API Close, system scans after suspected fault is found for specific node;The detailed process of the monitor mode is as follows:
3-1):Application program specifies the node for needing to monitor;
3-2):The state of monitoring service regular monitoring node, if it find that network state is abnormal, then by the exception of this node Communications status proactive notification is to management service;
3-3):Management service calculates communication lines after node abnormal communication states notice is received according to network topology Footpath, by the all-network equipment and node on communication path, add monitor node list.
The step 5 comprises the following steps:
Step 5-1:Monitoring service of the management service into monitor node set initiates node status information and collects order;
Step 5-2:After monitoring service receives node status information collection order, the shape of this meshed network equipment is collected State, and return result to management service;
Step 5-3:The status information that management service is collected into all nodes is analyzed, and the network for confirming to have failure is set It is standby, and marked there will be the network equipment of failure in the network topological information of management service;
Step 5-4:There will be the list of the network equipment of failure to report keeper for management service, notifies keeper to carry out Safeguard.
The step 6 comprises the following steps:
Step 6-1:Monitor node of the management service into monitor node set initiates the detection of Active Networks performance, property in pairs Energy index includes bilateral network delay, network bandwidth and network performance stability, and the all-network on collector node path is set Standby counter;
Step 6-2:Monitoring service on node actively is initiated to visit after network performance probe requests thereby is received to corresponding node Message Opcode is surveyed, and returns result to management service;
Step 6-3:Management service is chosen to the algorithm to monitor node, including permutation and combination algorithm and greedy algorithm etc..
In the step 7, management service is after the result of step 5 and step 6 is received, according to the net of step 3 acquisition Network topology information carries out network state analysis, the communication test between the counter and node of comprehensive all-network equipment Can, it is determined that the network equipment or link of failure be present, it is understood that there may be failure include network card equipment hardware fault, network interface card work Pattern-Fault, network card interface and node interface mismatch, connection cables disconnect, connection cables are unstable and exchange fault.
Compared with prior art, the beneficial effects of the present invention are:
Distributed system network performance evaluation provided by the invention and method for diagnosing faults, due to consideration that distributed system The state of the all-network equipment of system, the path detection between node is actively carried out, and performance is gone out according to detection Analysis of conclusion and asked Topic or trouble point, specific to some network equipment, link, or node rank, greatly reduce grid performance point Analysis and the expense of fault diagnosis, alleviate the manual intervention of keeper;Support the fault-finding of total system and application specified path.
Brief description of the drawings
Fig. 1 is the connection diagram of management service and monitoring service in the embodiment of the present invention;
Fig. 2 is the schematic diagram that management service judges network equipment failure according to result in the embodiment of the present invention.
Embodiment
The present invention is described in further detail below in conjunction with the accompanying drawings.
The present invention, which provides a kind of distributed system network performance evaluation and method for diagnosing faults, methods described, includes following step Suddenly:
Step 1:Monitoring service is disposed in monitored distributed system;
Step 2:According to the feature of distributed system, operational management service;
Step 3:Carry out discovering network topology;
Step 4:Determine the monitor node set of monitored node;
Step 5:Management service collector node status information is simultaneously analyzed;
Step 6:Network performance detects;
Step 7:Network state is analyzed, and determines failure that may be present.
In the step 1, according to monitored distributed system scale, monitored node is determined, and in monitored node Upper deployment monitoring service;The monitored node is defined as node where monitored service is needed in distributed system, including Server and the network equipment etc..
The network state of node where monitoring service is responsible for monitoring, including the hardware state of network interface card and operating system provide Performance count information etc.;
Monitoring service receives the order of management service and execution, and order includes network detection order and applied in network performance test life Order;
The network detection order that monitoring service is sent according to management service, carry out network detection;And sent out according to management service The applied in network performance test order gone out, carry out the applied in network performance test between node.
In the step 2, the operational management service in management node, management service is according to distributed system feature, selection Monitored node, start monitoring service, and be connected (such as Fig. 1) with the monitoring service on monitored node.
The connected mode of management service and monitoring service is depending on the scale of distributed system:
For small-scale distributed system, management service is directly connected with all monitoring services;
For large scale distributed system, management service is connected using tree hierarchy mode, i.e. tension management service management The management service of different subregions, the node and network of a single partition management service management configuration quantity.
In the step 3, management service initiates discovering network topology to the all-network equipment of distributed system, to determine Network topological information, and by network topology information storage into management service;If the network equipment residing for distributed system Topology Discovery is not supported, then the topological arrangement provided according to keeper builds network topological information.
In the step 4, monitored node supports following three kinds of monitor modes:
(1) total system scan mode:All nodes and the network equipment of distributed system are scanned, then monitor node Collection is combined into all nodes of internal system and the network equipment;
(2) keeper's specific mode:Keeper is by configuring specified monitor node set;
(3) application program is specified, monitoring set scan mode during failure:Application program specifies monitor node collection by API Close, system scans after suspected fault is found for specific node;The detailed process of the monitor mode is as follows:
3-1):Application program specifies the node for needing to monitor;
3-2):The state of monitoring service regular monitoring node, if it find that network state is abnormal, then by the exception of this node Communications status proactive notification is to management service;
3-3):Management service calculates communication lines after node abnormal communication states notice is received according to network topology Footpath, by the all-network equipment and node on communication path, add monitor node list.
The step 5 comprises the following steps:
Step 5-1:Monitoring service of the management service into monitor node set initiates node status information and collects order;
Step 5-2:After monitoring service receives node status information collection order, the shape of this meshed network equipment is collected State, and return result to management service;
Step 5-3:The status information that management service is collected into all nodes is analyzed, and the network for confirming to have failure is set It is standby, and marked there will be the network equipment of failure in the network topological information of management service;
Step 5-4:There will be the list of the network equipment of failure to report keeper for management service, notifies keeper to carry out Safeguard.
The step 6 comprises the following steps:
Step 6-1:Monitor node of the management service into monitor node set initiates the detection of Active Networks performance, property in pairs Energy index includes bilateral network delay, network bandwidth and network performance stability, and the all-network on collector node path is set Standby counter;
Step 6-2:Monitoring service on node actively is initiated to visit after network performance probe requests thereby is received to corresponding node Message Opcode is surveyed, and returns result to management service;
Step 6-3:Management service is chosen to the algorithm to monitor node, including permutation and combination algorithm and greedy algorithm etc..
In the step 7, management service is after the result of step 5 and step 6 is received, according to the net of step 3 acquisition Network topology information carries out network state analysis, the communication test between the counter and node of comprehensive all-network equipment Can, it is determined that the network equipment or link of failure be present, it is understood that there may be failure include network card equipment hardware fault, network interface card work Pattern-Fault, network card interface and node interface mismatch, connection cables disconnect, connection cables are unstable and exchange fault.
Basis for estimation may have following several but be not limited to following method:
● the outside all link performances of some node are abnormal, judge that the network card equipment of the node or node are outside Connection cables failure;
● it is abnormal by communication performance on the link of some switching equipment, judge the switching equipment operation irregularity;
● the communication performance using the node-to-node of some link is abnormal, judges link exception.
Fig. 2 is the example that step 7 management service judges equipment fault according to result:
■ judges equal normal work for node 1, node 2, node 3, interchanger 1, interchanger 2 according to unit count device;
The ■ network performances that node 1 is arrived between node 2 simultaneously are normal, but node 1 arrives node 3, node 2 to node 3 State is abnormal;
■ analyzes according to the network topological information of management service, due to the public network of node 1- nodes 3 and node 2- nodes 3 Network path is interchanger 1- interchangers 2, while normal according to the unit count device of interchanger, and failure judgement may be interchanger 1- Link failure between interchanger 2, notify trouble point corresponding to keeper.
If necessary to obtain influence of the distributed system network performance to application performance, according to the application pattern of offer, Calculate expected performance number.Analysis result is reported keeper by management service, is judged by keeper, and failure is entered The corresponding processing of row.
Finally it should be noted that:The above embodiments are merely illustrative of the technical scheme of the present invention and are not intended to be limiting thereof, institute The those of ordinary skill in category field with reference to above-described embodiment still can to the present invention embodiment modify or Equivalent substitution, these are applying for this pending hair without departing from any modification of spirit and scope of the invention or equivalent substitution Within bright claims.

Claims (1)

1. a kind of distributed system network performance evaluation and method for diagnosing faults, it is characterised in that:Methods described includes following step Suddenly:
Step 1:Monitoring service is disposed in monitored distributed system;
Step 2:According to the feature of distributed system, operational management service;
Step 3:Carry out discovering network topology;
Step 4:Determine the monitor node set of monitored node;
Step 5:Management service collector node status information is simultaneously analyzed;
Step 6:Network performance detects;
Step 7:Network state is analyzed, and determines failure that may be present;
In the step 1, according to monitored distributed system scale, monitored node is determined, and on monitored node top Affix one's name to monitoring service;The monitored node is defined as node where monitored service is needed in distributed system, including service Device and the network equipment;
The network state of node where monitoring service is responsible for monitoring, including the performance that the hardware state of network interface card and operating system provide Count information;
Monitoring service receives the order of management service and execution, and order includes network detection order and applied in network performance test order;
The network detection order that monitoring service is sent according to management service, carry out network detection;And sent according to management service Applied in network performance test order, carry out the applied in network performance test between node;
In the step 2, the operational management service in management node, management service selects to be supervised according to distributed system feature Node is controlled, starts monitoring service, and be connected with the monitoring service on monitored node;
The connected mode of management service and monitoring service is depending on the scale of distributed system:
For small-scale distributed system, management service is directly connected with all monitoring services;
For large scale distributed system, management service is connected using tree hierarchy mode, i.e. tension management service management is different The management service of subregion, the node and network of a single partition management service management configuration quantity;
In the step 3, management service initiates discovering network topology to the all-network equipment of distributed system, to determine network Topology information, and by network topology information storage into management service;If the network equipment residing for distributed system does not prop up Topology Discovery is held, then the topological arrangement provided according to keeper builds network topological information;
In the step 4, monitored node supports following three kinds of monitor modes:
(1) total system scan mode:All nodes and the network equipment of distributed system are scanned, then monitor node set For all nodes of internal system and the network equipment;
(2) keeper's specific mode:Keeper is by configuring specified monitor node set;
(3) application program is specified, monitoring set scan mode during failure:Application program specifies monitor node set by API, is System scans after suspected fault is found for specific node;The detailed process of the monitor mode is as follows:
3-1):Application program specifies the node for needing to monitor;
3-2):The state of monitoring service regular monitoring node, if it find that network state is abnormal, then by the exceptional communication of this node State proactive notification is to management service;
3-3):Management service calculates communication path after node abnormal communication states notice is received according to network topology, will All-network equipment and node on communication path, add monitor node list;
The step 5 comprises the following steps:
Step 5-1:Monitoring service of the management service into monitor node set initiates node status information and collects order;
Step 5-2:After monitoring service receives node status information collection order, the state of this meshed network equipment is collected, and Return result to management service;
Step 5-3:The status information that management service is collected into all nodes is analyzed, and confirms the network equipment of failure be present, And marked there will be the network equipment of failure in the network topological information of management service;
Step 5-4:There will be the list of the network equipment of failure to report keeper for management service, notifies keeper to be tieed up Shield;
The step 6 comprises the following steps:
Step 6-1:Monitor node of the management service into monitor node set initiates the detection of Active Networks performance in pairs, and performance refers to Marking includes bilateral network delay, network bandwidth and network performance stability, and the all-network equipment on collector node path Counter;
Step 6-2:Monitoring service on node is actively initiated detection to corresponding node and disappeared after network performance probe requests thereby is received Breath operation, and return result to management service;
Step 6-3:Management service is chosen to the algorithm to monitor node, including permutation and combination algorithm and greedy algorithm;
In the step 7, management service is opened up after the result of step 5 and step 6 is received according to the network that step 3 obtains Flutter information and carry out network state analysis, integrate the communication test performance between the counter and node of all-network equipment, really Surely the network equipment or link of failure be present, it is understood that there may be failure include network card equipment hardware fault, network interface card mode of operation Mistake, network card interface and node interface mismatch, connection cables disconnect, connection cables are unstable and exchange fault.
CN201410508685.0A 2014-09-28 2014-09-28 A kind of distributed system network performance evaluation and method for diagnosing faults Active CN104270268B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410508685.0A CN104270268B (en) 2014-09-28 2014-09-28 A kind of distributed system network performance evaluation and method for diagnosing faults

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410508685.0A CN104270268B (en) 2014-09-28 2014-09-28 A kind of distributed system network performance evaluation and method for diagnosing faults

Publications (2)

Publication Number Publication Date
CN104270268A CN104270268A (en) 2015-01-07
CN104270268B true CN104270268B (en) 2017-12-05

Family

ID=52161762

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410508685.0A Active CN104270268B (en) 2014-09-28 2014-09-28 A kind of distributed system network performance evaluation and method for diagnosing faults

Country Status (1)

Country Link
CN (1) CN104270268B (en)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104935458B (en) * 2015-04-29 2018-05-29 中国人民解放军国防科学技术大学 A kind of performance bottleneck analysis method and device based on distributed automatization measurement
CN105227395B (en) * 2015-08-28 2018-09-28 北京奇艺世纪科技有限公司 A kind of method, apparatus and system of distribution JVM performance evaluations
CN105227354A (en) * 2015-09-07 2016-01-06 浪潮软件集团有限公司 Log-based method for monitoring and managing distributed system
CN106598800A (en) * 2015-10-14 2017-04-26 中兴通讯股份有限公司 Hardware fault analysis system and method
CN105812210A (en) * 2016-05-25 2016-07-27 赵鹏 Distributed network performance measuring system
CN106130761B (en) * 2016-06-22 2019-06-18 北京百度网讯科技有限公司 The recognition methods of the failed network device of data center and device
CN107545129B (en) * 2016-06-27 2021-06-22 西门子(深圳)磁共振有限公司 Fault checking method and device for medical equipment
CN106506196A (en) * 2016-10-19 2017-03-15 上海携程商务有限公司 Enterprise-level online troubleshooting method and system
CN108664346A (en) * 2017-03-27 2018-10-16 中国移动通信集团福建有限公司 The localization method of the node exception of distributed memory system, device and system
CN108933708B (en) * 2017-05-27 2021-03-09 中国互联网络信息中心 Multi-dimensional checking method and system for distributed DNS service
CN109559583B (en) * 2017-09-27 2022-04-05 华为技术有限公司 Fault simulation method and device
CN107634863A (en) * 2017-10-25 2018-01-26 北京百悟科技有限公司 Distributed monitoring device and method for domain name mapping disaster tolerance service
CN108337114A (en) * 2018-01-16 2018-07-27 中车青岛四方机车车辆股份有限公司 Network state processing equipment, method and train
US10795756B2 (en) * 2018-04-24 2020-10-06 EMC IP Holding Company LLC System and method to predictively service and support the solution
CN109088766B (en) * 2018-08-15 2021-10-29 无锡江南计算技术研究所 Interconnection network fault detection and positioning method based on pairing test
CN109450729A (en) * 2018-11-05 2019-03-08 郑州云海信息技术有限公司 A kind of method and system of automatic test whole machine cabinet server network stability
CN109802855B (en) * 2018-12-28 2020-08-07 华为技术有限公司 Fault positioning method and device
CN111092747A (en) * 2019-10-25 2020-05-01 苏州浪潮智能科技有限公司 Method, device and medium for network performance diagnosis
CN112751689B (en) * 2019-10-30 2023-12-05 北京京东振世信息技术有限公司 Network connectivity detection method, monitoring server and monitoring proxy device
CN110837453B (en) * 2019-11-01 2023-09-01 山东中创软件商用中间件股份有限公司 Method and related device for monitoring document exchange platform
CN111044936A (en) * 2019-11-28 2020-04-21 中国航空工业集团公司西安航空计算技术研究所 Airborne GJB289A bus cable fault rapid positioning method
CN111682976B (en) * 2020-04-26 2022-03-01 合肥中科类脑智能技术有限公司 Method for ensuring distributed multi-machine communication monitoring
CN113839827B (en) * 2020-06-24 2023-09-12 维谛技术有限公司 Data monitoring system, equipment and method
CN111817913B (en) * 2020-06-30 2022-05-17 北京红山信息科技研究院有限公司 Distributed network performance test method, system, server and storage medium
CN114500244A (en) * 2020-11-13 2022-05-13 中兴通讯股份有限公司 Network fault diagnosis method and device, computer equipment and readable medium
CN112491464B (en) * 2020-12-01 2022-08-09 凯睿星通信息科技(南京)股份有限公司 Distributed fault real-time monitoring and standby equipment switching method for satellite communication
CN113179182B (en) * 2021-04-27 2022-11-22 中国联合网络通信集团有限公司 Network supervision method, device, equipment and storage medium
CN113708995B (en) * 2021-08-20 2023-04-07 深圳市风云实业有限公司 Network fault diagnosis method, system, electronic equipment and storage medium
WO2023225886A1 (en) * 2022-05-25 2023-11-30 Intel Corporation Low latency and deterministic node failure detection
CN115378830B (en) * 2022-08-19 2024-03-26 百倍云(浙江)物联科技有限公司 Ecological environment monitoring system stability monitoring method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103607297A (en) * 2013-11-07 2014-02-26 上海爱数软件有限公司 Fault processing method of computer cluster system
CN103699111A (en) * 2013-09-26 2014-04-02 青岛海信网络科技股份有限公司 Failure detection method and device for distributed monitoring system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8295178B2 (en) * 2008-05-19 2012-10-23 Solarwinds Worldwide Llc Manual configuration for sites that cannot give read/write credentials to a voice over internet protocol (VOIP) monitor

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103699111A (en) * 2013-09-26 2014-04-02 青岛海信网络科技股份有限公司 Failure detection method and device for distributed monitoring system
CN103607297A (en) * 2013-11-07 2014-02-26 上海爱数软件有限公司 Fault processing method of computer cluster system

Also Published As

Publication number Publication date
CN104270268A (en) 2015-01-07

Similar Documents

Publication Publication Date Title
CN104270268B (en) A kind of distributed system network performance evaluation and method for diagnosing faults
CN106789177B (en) A kind of system of dealing with network breakdown
US8443074B2 (en) Constructing an inference graph for a network
CN102158360B (en) Network fault self-diagnosis method based on causal relationship positioning of time factors
JP4421645B2 (en) Communication apparatus and network information collection program
CN104796298B (en) A kind of method and device of SDN network accident analysis
Ramanathan et al. Towards a debugging system for sensor networks
CN111030873A (en) Fault diagnosis method and device
CN102195857A (en) Network topology structure and node information gathering method
KR20160147957A (en) Verification in self-organizing networks
CN101667941A (en) Method for detecting link performance and device therefor
CN110224883A (en) A kind of Grey Fault Diagnosis method applied to telecommunications bearer network
CN108933694A (en) Data center network Fault Node Diagnosis method and system based on testing data
CN105812210A (en) Distributed network performance measuring system
CN112333020A (en) Network security monitoring and data message analyzing system based on quintuple
US11012290B2 (en) Systems and methods for node outage determination and reporting
Nie et al. Passive diagnosis for WSNs using data traces
JP2005237018A (en) Data transmission to network management system
CN108123752B (en) EPON precise loop detection method based on geographic information positioning
KR100500836B1 (en) Fault management system of metro ethernet network and method thereof
CN111654413B (en) Method, equipment and storage medium for selecting effective measurement points of network flow
CN113300914A (en) Network quality monitoring method, device, system, electronic equipment and storage medium
Han et al. Research of network monitoring based on SNMP
CN114338103B (en) Abnormal flow position method and system based on TR069 protocol combined log analysis
Ye et al. Providing diagnostic network feedback to end users on smartphones

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20190909

Address after: 610000 Sichuan city of Chengdu province Tianfu Tianfu Avenue South Huayang Street No. 846

Co-patentee after: Sugon Information Industry Co., Ltd.

Patentee after: CHINESE CORPORATION DAWNING INFORMATION INDUSTRY CHENGDU CO., LTD.

Address before: 300384 Tianjin city Xiqing District Huayuan Industrial Zone (outer ring) Haitai Huake Street No. 15 1-3

Patentee before: Sugon Information Industry Co., Ltd.