CN107835098A - Network fault detection method and system - Google Patents

Network fault detection method and system

Info

Publication number
CN107835098A
Authority
CN
China
Prior art keywords
network
data
switch
network server
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711216127.7A
Other languages
Chinese (zh)
Other versions
CN107835098B (en)
Inventor
杨龙
王金龙
邓谦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Che Zhi Interconnect (beijing) Technology Co Ltd
Original Assignee
Che Zhi Interconnect (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Che Zhi Interconnect (beijing) Technology Co Ltd
Priority to CN201711216127.7A
Publication of CN107835098A
Application granted
Publication of CN107835098B
Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/04 Processing captured monitoring data, e.g. for logfile generation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0811 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity

Abstract

The invention discloses a network fault detection method and system. The method is suitable for execution in an aggregation server that is communicatively connected to one or more network servers and to a database server. The network servers are connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts connected to it, and each host is provided in advance with a corresponding data probe that is communicatively connected to a network server. The method includes: receiving one or more pieces of metric data reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe; detecting network connections according to each piece of metric data whose state is abnormal; sending an alarm if the detection result indicates a network connection fault; performing aggregation calculation on each piece of metric data whose state is normal to obtain corresponding network quality data; and sending the network quality data to the database server for storage.

Description

Network fault detection method and system
Technical field
The present invention relates to the field of computer networks, and in particular to a network fault detection method and system.
Background
Internet companies often own thousands of servers. These servers are distributed across multiple machine rooms nationwide and are interconnected to form the company intranet. Within the intranet there is extensive network communication between machine rooms and between servers, so the importance of communication quality is self-evident.
Many factors cause communication quality problems within and across machine rooms, such as failures of individual devices like network cards, network cables and servers. The probability of any single such failure is very small, but it cannot be ignored once multiplied by a huge base. Switch single points of failure, network congestion, dedicated line failures and the like also occur frequently, so failures are almost unavoidable. It is therefore necessary to monitor cross-machine-room network quality and locate faults accurately and in time, which is essential to keeping the company's business running normally.
Existing network fault detection schemes mostly fall into two classes. The first is manual fault location: after a problem has occurred, operations engineers rely on experience to narrow the scope step by step and check suspicious lines, switches or servers, which is time-consuming and laborious, lacks data support and has low accuracy. The second actively probes network conditions by deploying probes on all online servers, but it suffers from incomplete coverage of the network topology and from probe data that is difficult to use comprehensively. A new network fault detection scheme is therefore needed to improve the above process.
Summary of the invention
To this end, the present invention provides a technical solution for network fault detection, in an effort to solve, or at least alleviate, the problems above.
According to one aspect of the present invention, there is provided a network fault detection method suitable for execution in an aggregation server. The aggregation server is communicatively connected to one or more network servers and to a database server. The network servers are connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts connected to it, and each host is provided in advance with a corresponding data probe that is communicatively connected to a network server. The method comprises the following steps: first, receiving one or more pieces of metric data reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe; detecting network connections according to each piece of metric data whose state is abnormal; sending an alarm if the detection result indicates a network connection fault; performing aggregation calculation on each piece of metric data whose state is normal to obtain corresponding network quality data; and sending the network quality data to the database server for storage.
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: according to each piece of abnormal-state metric data, detecting, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network fault in connecting to the other switch (the host count ratio); and, for each switch whose host count ratio exceeds a preset first ratio, determining that it has a network connection fault between switches.
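The switch-level check above can be sketched as follows. This is a minimal illustration and not the patented implementation: the record layout `(src_host, src_switch, dst_switch)`, the `hosts_per_switch` mapping and the function name are assumptions made for the example; only the 0.5 default comes from the text.

```python
from collections import defaultdict

FIRST_RATIO = 0.5  # the text presets the first ratio to 0.5


def faulty_switch_pairs(abnormal, hosts_per_switch, first_ratio=FIRST_RATIO):
    """Return sorted (src_switch, dst_switch) pairs judged to have a fault.

    abnormal: iterable of (src_host, src_switch, dst_switch) tuples, one per
              abnormal-state metric record (hypothetical record layout).
    hosts_per_switch: dict mapping a switch id to the number of hosts under it.
    """
    # Collect the distinct hosts under each source switch that failed to
    # reach a given destination switch.
    failing_hosts = defaultdict(set)
    for src_host, src_switch, dst_switch in abnormal:
        failing_hosts[(src_switch, dst_switch)].add(src_host)

    faulty = []
    for (src_switch, dst_switch), hosts in failing_hosts.items():
        host_ratio = len(hosts) / hosts_per_switch[src_switch]
        if host_ratio > first_ratio:  # strictly "exceeds" the preset ratio
            faulty.append((src_switch, dst_switch))
    return sorted(faulty)


records = [
    ("h1", "sw1", "sw2"),
    ("h2", "sw1", "sw2"),
    ("h3", "sw1", "sw2"),
    ("h1", "sw1", "sw3"),
]
result = faulty_switch_pairs(records, {"sw1": 4})
```

Here three of the four hosts under `sw1` fail to reach `sw2`, so that pair is flagged, while the single failure toward `sw3` stays below the threshold and is left to the later checks.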
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: for each switch whose host count ratio does not exceed the preset first ratio, detecting, under the Internet data center to which the switch belongs, the proportion of switches that have a network fault in connecting to each switch under other Internet data centers (the first switch count ratio); and, for each Internet data center whose first switch count ratio exceeds a preset second ratio, determining that it has a network connection fault between the Internet data center and a switch.
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: for each switch whose host count ratio does not exceed the preset first ratio, detecting, for the Internet data center to which the switch belongs, the proportion of switches that have a network fault in connecting to switches under other Internet data centers (the second switch count ratio); and, for each Internet data center whose second switch count ratio exceeds a preset third ratio, determining that it has a network connection fault between Internet data centers.
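The two data-center-level checks can be sketched with one helper. This is one possible reading of the text, offered under stated assumptions: the dictionary keys, the `target` switch and the grouping by remote switch (second-ratio check) versus remote data center (third-ratio check) are illustrative choices, and the caller is assumed to pass only records from switches that did not already exceed the first ratio.

```python
from collections import defaultdict


def idc_level_faults(records, switches_per_idc, ratio=0.5, target="idc"):
    """Return sorted (src_idc, remote) keys whose failing-switch share exceeds ratio.

    records: iterable of dicts with the hypothetical keys
             src_idc, src_switch, dst_idc, dst_switch.
    switches_per_idc: dict mapping an IDC id to its number of switches.
    target: "switch" groups failures by remote switch (IDC-to-switch fault),
            "idc" groups them by remote IDC (IDC-to-IDC fault).
    """
    failing = defaultdict(set)
    for r in records:
        if r["src_idc"] == r["dst_idc"]:
            continue  # only cross-IDC probes matter at this level
        remote = r["dst_switch"] if target == "switch" else r["dst_idc"]
        failing[(r["src_idc"], remote)].add(r["src_switch"])

    return sorted(
        key
        for key, switches in failing.items()
        if len(switches) / switches_per_idc[key[0]] > ratio
    )


def rec(src_switch, dst_switch):
    # Helper for the example: every source is in IDC "A", every target in "B".
    return {"src_idc": "A", "src_switch": src_switch,
            "dst_idc": "B", "dst_switch": dst_switch}


records = [rec("s1", "b1"), rec("s2", "b1"), rec("s3", "b1"), rec("s1", "b2")]
to_switch = idc_level_faults(records, {"A": 4}, target="switch")
to_idc = idc_level_faults(records, {"A": 4}, target="idc")
```

With four switches in data center `A`, three of them failing toward `b1` flags the pair `("A", "b1")`, and the same three failing switches flag `("A", "B")` at the data-center level.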
Optionally, in the network fault detection method according to the present invention, the step of performing aggregation calculation on each piece of normal-state metric data to obtain corresponding network quality data includes: generating quantile data from the normal-state metric data at a preset fourth time interval; aggregating the normal-state metric data by corresponding switch and computing the network quality metric of each switch; aggregating the normal-state metric data by corresponding Internet data center and computing the network quality metric of each Internet data center; and combining the generated quantile data with the network quality metrics of each switch and each Internet data center to form the corresponding network quality data.
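The aggregation step can be illustrated with a short sketch. The text does not say which metric is aggregated or which statistic is used, so latency in milliseconds, the mean as the per-switch and per-data-center metric, and the p50/p90/p99 quantiles are all assumptions; the samples are taken to belong to one already-selected time window.

```python
from collections import defaultdict
from statistics import mean, quantiles


def aggregate_quality(samples):
    """Aggregate normal-state samples of the form (idc, switch, latency_ms).

    Needs at least two samples, because statistics.quantiles requires that.
    """
    samples = list(samples)
    latencies = [latency for _, _, latency in samples]

    # quantiles(n=100) returns 99 cut points; index i - 1 holds percentile i.
    cuts = quantiles(latencies, n=100, method="inclusive")

    by_switch, by_idc = defaultdict(list), defaultdict(list)
    for idc, switch, latency in samples:
        by_switch[switch].append(latency)
        by_idc[idc].append(latency)

    return {
        "quantiles": {"p50": cuts[49], "p90": cuts[89], "p99": cuts[98]},
        "switch": {name: mean(values) for name, values in by_switch.items()},
        "idc": {name: mean(values) for name, values in by_idc.items()},
    }


quality = aggregate_quality([
    ("A", "s1", 1.0),
    ("A", "s1", 3.0),
    ("A", "s2", 5.0),
    ("B", "s3", 7.0),
])
```

The resulting dictionary is the kind of combined record that could then be sent to the database server; a real deployment would likely keep more quantiles and more than one metric.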
Optionally, in the network fault detection method according to the present invention, the first ratio, the second ratio and the third ratio are all preset to 0.5.
According to a further aspect of the present invention, there is provided a computing device comprising one or more processors, a memory and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, and the one or more programs include instructions for performing the network fault detection method according to the present invention.
According to a further aspect of the present invention, there is provided a computer-readable storage medium storing one or more programs, the one or more programs including instructions which, when executed by a computing device, cause the computing device to perform the network fault detection method according to the present invention.
According to a further aspect of the present invention, there is provided a network fault detection method suitable for execution in a network fault detection system. The system includes one or more network servers, an aggregation server, a database server and multiple data probes, and is connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts connected to it, and each host is provided in advance with a corresponding data probe. The network server stores the network probe list corresponding to the host associated with each data probe. The method comprises the following steps: first, a data probe obtains from the network server the network probe list corresponding to its associated host, performs network state probing at a preset first time interval according to the obtained network probe list, and sends the probe results to the network server; the network server receives the probe results reported by each data probe, forms corresponding metric data from the probe results and sends the metric data to the corresponding aggregation server; the aggregation server receives the one or more pieces of metric data reported by the network server, detects network connections according to each piece of abnormal-state metric data, sends an alarm if the detection result indicates a network connection fault, performs aggregation calculation on each piece of normal-state metric data to obtain corresponding network quality data, and sends the network quality data to the database server; and the database server receives and stores the network quality data sent by the aggregation server.
Optionally, in the network fault detection method according to the present invention, the step in which the data probe obtains from the network server the network probe list corresponding to its associated host includes: sending, at a preset second time interval, a list update request to the network server to obtain the version of the network probe list corresponding to the associated host; and, if that version is newer than the version of the data probe's current network probe list, obtaining from the network server the network probe list corresponding to the associated host to replace the data probe's current network probe list.
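The version check above amounts to a compare-then-fetch loop on the probe side. The sketch below assumes integer version numbers and stands in for the two network requests with plain callables (`fetch_version`, `fetch_list`); those names and the dictionary layout are hypothetical, and the text does not specify the transport.

```python
def refresh_probe_list(current, fetch_version, fetch_list):
    """Return the probe list to use after one update check.

    current: dict with keys "version" (int) and "targets" (list of IPs).
    fetch_version: callable returning the server-side list version.
    fetch_list: callable returning the server-side target list; it is only
                called when the server version is newer.
    """
    remote_version = fetch_version()
    if remote_version > current["version"]:
        return {"version": remote_version, "targets": fetch_list()}
    return current  # unchanged: keep probing the existing targets


current = {"version": 3, "targets": ["10.0.0.2"]}
updated = refresh_probe_list(current, lambda: 4, lambda: ["10.0.0.2", "10.0.1.1"])
same = refresh_probe_list(current, lambda: 3, lambda: ["unused"])
```

Fetching only the version on each tick keeps the periodic request small, and the full list is transferred only when it has actually changed.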
Optionally, the network fault detection method according to the present invention further includes: the data probe judging, according to a preset resource occupancy rule, whether the system resources it consumes are overloaded; stopping probing if the consumed system resources are overloaded; and continuing probing if the consumed system resources are not overloaded.
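A resource occupancy rule of this kind could look like the sketch below. The text does not define the rule, so the CPU and memory limits, the class name and the function names are illustrative assumptions; measuring the actual usage is left to the caller.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResourceRule:
    """Hypothetical resource occupancy rule; the default limits are illustrative."""
    max_cpu_percent: float = 5.0
    max_memory_mb: float = 50.0


def is_overloaded(cpu_percent, memory_mb, rule):
    # The probe counts as overloaded as soon as either limit is exceeded.
    return cpu_percent > rule.max_cpu_percent or memory_mb > rule.max_memory_mb


def next_action(cpu_percent, memory_mb, rule=ResourceRule()):
    """Return "stop" when the probe is overloaded, otherwise "continue"."""
    return "stop" if is_overloaded(cpu_percent, memory_mb, rule) else "continue"


light = next_action(cpu_percent=1.0, memory_mb=10.0)
heavy = next_action(cpu_percent=9.0, memory_mb=10.0)
```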
Optionally, in the network fault detection method according to the present invention, the network server is provided with a configuration management database and a cache database. The configuration management database stores the latest network topology information, which includes the numbers and connection relationships of each host, each switch and each Internet data center. The method further includes the network server generating in advance the network probe list corresponding to the host associated with each data probe, and this step includes, according to the network topology information and for each host: setting all other hosts under the switch to which the host belongs as destination hosts to be probed; setting, under the other switches included in the Internet data center to which the host belongs, the hosts with the same number as the host as destination hosts to be probed; setting, under each switch included in the other Internet data centers to which the host does not belong, the hosts with the same number as the host as destination hosts to be probed; and generating the corresponding network probe list from the IP addresses of the host and of each destination host, and storing the network probe list in the cache database.
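The three target-selection rules above can be sketched as follows. The `Host` record and its field names are assumptions for the example, and the host "number" is taken to be a per-switch index, which the text does not spell out.

```python
from typing import NamedTuple


class Host(NamedTuple):
    ip: str
    idc: str
    switch: str
    number: int  # assumed to be the host's index under its switch


def build_probe_lists(hosts):
    """Map each host IP to the list of destination IPs it should probe."""
    lists = {}
    for host in hosts:
        targets = []
        for other in hosts:
            if other.ip == host.ip:
                continue
            same_idc = other.idc == host.idc
            same_switch = same_idc and other.switch == host.switch
            if same_switch:
                # Rule 1: every other host under the same switch.
                targets.append(other.ip)
            elif other.number == host.number:
                # Rules 2 and 3: the same-numbered host under every other
                # switch, whether in the same IDC or in another IDC.
                targets.append(other.ip)
        lists[host.ip] = targets
    return lists


topology = [
    Host("10.0.0.1", "A", "s1", 1),
    Host("10.0.0.2", "A", "s1", 2),
    Host("10.0.1.1", "A", "s2", 1),
    Host("10.0.1.2", "A", "s2", 2),
    Host("10.1.0.1", "B", "s3", 1),
]
probe_lists = build_probe_lists(topology)
```

Pairing hosts by number across switches appears intended to keep every switch pair covered while avoiding an all-to-all probe mesh, which matches the resource-saving claim made later in the description.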
Optionally, the network fault detection method according to the present invention further includes: the network server obtaining, at a preset third time interval, the network topology information stored in the configuration management database; and, if the obtained network topology information has changed compared with the previous network topology information, generating new network probe lists according to the obtained network topology information and storing them in the cache database as an update.
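One way to detect that the topology "has changed" is to compare a fingerprint of each snapshot, as in the sketch below. The fingerprint approach is a technique swapped in for illustration and is not taken from the text; the topology is assumed to be JSON-serializable, and the `cache` dictionary and `build` callable are hypothetical stand-ins for the cache database and the list generator.

```python
import hashlib
import json


def topology_fingerprint(topology):
    """Stable SHA-256 digest of a JSON-serializable topology snapshot."""
    payload = json.dumps(topology, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def sync_probe_lists(cache, topology, build):
    """Regenerate probe lists only when the topology changed.

    cache: dict standing in for the cache database.
    build: callable that turns a topology into probe lists.
    Returns True when the cache was updated.
    """
    fingerprint = topology_fingerprint(topology)
    if cache.get("fingerprint") == fingerprint:
        return False
    cache["fingerprint"] = fingerprint
    cache["lists"] = build(topology)
    return True


cache = {}
calls = []


def build(topology):
    calls.append(1)  # count how often the lists are regenerated
    return {host: [] for host in topology["hosts"]}


first = sync_probe_lists(cache, {"hosts": ["10.0.0.1"]}, build)
second = sync_probe_lists(cache, {"hosts": ["10.0.0.1"]}, build)
third = sync_probe_lists(cache, {"hosts": ["10.0.0.1", "10.0.0.2"]}, build)
```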
Optionally, in the network fault detection method according to the present invention, the step of forming corresponding metric data from the probe results and sending the metric data to the corresponding aggregation server includes: performing verification on each probe result; and converting the probe results that pass verification into corresponding metric data, and sending the metric data to the corresponding aggregation server according to a preset metric transmission rule.
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: according to each piece of abnormal-state metric data, detecting, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network fault in connecting to the other switch (the host count ratio); and, for each switch whose host count ratio exceeds a preset first ratio, determining that it has a network connection fault between switches.
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: for each switch whose host count ratio does not exceed the preset first ratio, detecting, under the Internet data center to which the switch belongs, the proportion of switches that have a network fault in connecting to each switch under other Internet data centers (the first switch count ratio); and, for each Internet data center whose first switch count ratio exceeds a preset second ratio, determining that it has a network connection fault between the Internet data center and a switch.
Optionally, in the network fault detection method according to the present invention, the step of detecting network connections according to each piece of abnormal-state metric data includes: for each switch whose host count ratio does not exceed the preset first ratio, detecting, for the Internet data center to which the switch belongs, the proportion of switches that have a network fault in connecting to switches under other Internet data centers (the second switch count ratio); and, for each Internet data center whose second switch count ratio exceeds a preset third ratio, determining that it has a network connection fault between Internet data centers.
Optionally, in the network fault detection method according to the present invention, the step of performing aggregation calculation on each piece of normal-state metric data to obtain corresponding network quality data includes: generating quantile data from the normal-state metric data at a preset fourth time interval; aggregating the normal-state metric data by corresponding switch and computing the network quality metric of each switch; aggregating the normal-state metric data by corresponding Internet data center and computing the network quality metric of each Internet data center; and combining the generated quantile data with the network quality metrics of each switch and each Internet data center to form the corresponding network quality data.
According to a further aspect of the present invention, there is also provided a network fault detection system. The system includes one or more network servers, an aggregation server, a database server and multiple data probes, and is connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts connected to it, and each host is provided in advance with a corresponding data probe. The network server stores the network probe list corresponding to the host associated with each data probe. In this system: the data probe is adapted to obtain from the network server the network probe list corresponding to its associated host, to perform network state probing at a preset first time interval according to the obtained network probe list, and to send the probe results to the network server; the network server is adapted to receive the probe results reported by each data probe, to form corresponding metric data from the probe results and to send the metric data to the corresponding aggregation server; the aggregation server is adapted to receive the one or more pieces of metric data reported by the network server, to detect network connections according to each piece of abnormal-state metric data, to send an alarm if the detection result indicates a network connection fault, to perform aggregation calculation on each piece of normal-state metric data to obtain corresponding network quality data, and to send the network quality data to the database server; and the database server is adapted to receive and store the network quality data sent by the aggregation server.
According to the technical solution for network fault detection of the present invention, one or more pieces of metric data reported by the network server are first received, network connections are detected according to each piece of abnormal-state metric data, an alarm is sent if the detection result indicates a network connection fault, and aggregation calculation is performed on each piece of normal-state metric data to obtain corresponding network quality data. In this technical solution, abnormal network connection states trigger active alarms, which greatly shortens the fault response time, and by aggregating the distribution of the metric data, network quality data between any two switches and between any two Internet data centers can be obtained in real time.
Further, the metric data is formed by the network server from the probe results reported by each data probe, and each data probe probes according to the network probe list issued by the network server. The network probe list is generated by the network server from the network topology information, which includes the numbers and connection relationships of each host, each switch and each Internet data center. Network probe lists generated from this topology information can cover all hosts, which achieves full network coverage while reducing the amount of computation and the consumption of system resources. The network connection status between any two hosts can therefore be detected in real time, and the system adapts to changes in the network topology by issuing updated network probe lists.
Brief description of the drawings
To accomplish the foregoing and related ends, certain illustrative aspects are described herein in connection with the following description and the accompanying drawings. These aspects indicate various ways in which the principles disclosed herein can be practiced, and all aspects and their equivalents are intended to fall within the scope of the claimed subject matter. The above and other objects, features and advantages of the present disclosure will become more apparent from the following detailed description read in conjunction with the accompanying drawings. Throughout the disclosure, the same reference numerals generally refer to the same parts or elements.
Fig. 1 shows a schematic diagram of a network fault detection system 100 according to an embodiment of the present invention;
Fig. 2 shows a structural block diagram of a computing device 200 according to an embodiment of the present invention;
Fig. 3 shows a flowchart of a network fault detection method 300 according to an embodiment of the present invention; and
Fig. 4 shows a flowchart of a network fault detection method 400 according to another embodiment of the present invention.
Detailed description
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure can be understood more thoroughly and so that its scope can be fully conveyed to those skilled in the art.
Fig. 1 shows a schematic diagram of a network fault detection system 100 according to an embodiment of the present invention. It should be noted that the network fault detection system 100 in Fig. 1 is only exemplary. In practice, the network fault detection system 100 may contain different numbers of network servers, aggregation servers, database servers and data probes, and the present invention places no limit on the number of network servers, aggregation servers, database servers and data probes included in the network fault detection system 100. As shown in Fig. 1, the network fault detection system 100 includes a network server 700, an aggregation server 800, a database server 900 and N data probes, namely data probe 1, data probe 2, ..., data probe N, where N is a positive integer.
The network fault detection system 100 is connected to multiple Internet data centers (IDC: Internet Data Center). Each Internet data center is deployed with multiple switches, each switch has multiple hosts connected to it, and each host is provided in advance with a corresponding data probe. The number of data probes is therefore set according to the number of hosts, so the number of hosts here is also N. In addition, the network server 700 stores the network probe list corresponding to the host associated with each data probe.
Specifically, for data probes 1 to N, each data probe obtains from the network server 700 the network probe list corresponding to its associated host, performs network state probing at a preset first time interval according to the obtained network probe list, and sends the probe results to the network server 700. The network server 700 receives the probe results reported by each data probe, forms corresponding metric data from the probe results and sends the metric data to the corresponding aggregation server 800. The aggregation server 800 receives the one or more pieces of metric data reported by the network server 700, detects network connections according to each piece of abnormal-state metric data, sends an alarm if the detection result indicates a network connection fault, performs aggregation calculation on each piece of normal-state metric data to obtain corresponding network quality data, and sends the network quality data to the database server 900. The database server 900 receives and stores the network quality data sent by the aggregation server 800.
Fig. 2 shows a structural block diagram of a computing device 200 according to an embodiment of the present invention. In a basic configuration 202, the computing device 200 typically includes a system memory 206 and one or more processors 204. A memory bus 208 can be used for communication between the processor 204 and the system memory 206.
Depending on the desired configuration, the processor 204 can be any type of processor, including but not limited to a microprocessor (μP), a microcontroller (μC), a digital signal processor (DSP) or any combination thereof. The processor 204 can include one or more levels of cache, such as a level-one cache 210 and a level-two cache 212, a processor core 214 and registers 216. An example processor core 214 can include an arithmetic logic unit (ALU), a floating-point unit (FPU), a digital signal processing core (DSP core) or any combination thereof. An example memory controller 218 can be used together with the processor 204, or in some implementations the memory controller 218 can be an internal part of the processor 204.
Depending on the desired configuration, the system memory 206 can be any type of memory, including but not limited to volatile memory (such as RAM), non-volatile memory (such as ROM or flash memory) or any combination thereof. The system memory 206 can include an operating system 220, one or more programs 222 and program data 224. In some embodiments, the programs 222 can be arranged to be executed on the operating system by the one or more processors 204 using the program data 224.
The computing device 200 can also include an interface bus 240 that facilitates communication from various interface devices (for example, output devices 242, peripheral interfaces 244 and communication devices 246) to the basic configuration 202 via a bus/interface controller 230. Example output devices 242 include a graphics processing unit 248 and an audio processing unit 250, which can be configured to facilitate communication with various external devices such as a display or speakers via one or more A/V ports 252. Example peripheral interfaces 244 can include a serial interface controller 254 and a parallel interface controller 256, which can be configured to facilitate communication, via one or more I/O ports 258, with external devices such as input devices (for example, a keyboard, mouse, pen, voice input device or touch input device) or other peripherals (for example, a printer or scanner). An example communication device 246 can include a network controller 260, which can be arranged to facilitate communication with one or more other computing devices 262 over a network communication link via one or more communication ports 264.
A network communication link can be one example of a communication medium. A communication medium can typically be embodied as computer-readable instructions, data structures or program modules in a modulated data signal, such as a carrier wave or other transport mechanism, and can include any information delivery medium. A "modulated data signal" can be a signal in which one or more of its characteristics are set or changed in such a manner as to encode information in the signal. As a non-limiting example, communication media can include wired media such as a wired network or a dedicated-line network, and various wireless media such as acoustic, radio frequency (RF), microwave, infrared (IR) or other wireless media. The term computer-readable medium as used herein can include both storage media and communication media.
The computing device 200 can be implemented as a server, such as a file server, database server, application server or web server. It can also be implemented as part of a small-sized portable (or mobile) electronic device, such as a cell phone, personal digital assistant (PDA), personal media player device, wireless web-browsing device, personal headset device, application-specific device or a hybrid device that includes any of the above functions. The computing device 200 can also be implemented as a personal computer, including desktop and notebook configurations. In some embodiments, the computing device 200 can be implemented as a network server, an aggregation server and/or a database server, and is configured to perform the network fault detection method according to the present invention. In that case, the one or more programs 222 of the computing device 200 include instructions for performing the network fault detection method according to the present invention.
Fig. 3 shows the flow chart of network fault detecting method 300 according to an embodiment of the invention.Network service Device 700, aggregate server 800, database server 900 and 1~N of data probe are configured to perform the network according to the invention During fault detection method 300, communicated by mutual data to complete the processing of network fault detecting method 300 jointly, Now be embodied as respectively the webserver 700, aggregate server 800, one of computing device 200 of database server 900 Or multiple programs 222 include being used for the instruction for performing the network according to the invention fault detection method 300.
As shown in Fig. 3, method 300 starts at step S311. In step S311, a data probe obtains from network server 700 the network detection list corresponding to its associated host. The network detection list is generated in advance by network server 700 according to network topology information. For ease of the subsequent description, the way network server 700 pre-generates the network detection list corresponding to the host associated with each data probe is explained first.
Specifically, network server 700 is provided with a configuration management database and a cache database. The configuration management database stores the latest network topology information, which includes the numbers and connection relationships of each host, each switch, and each Internet data center. According to one embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, according to that information, for each host, sets all other hosts under the switch to which the host belongs as target hosts to be detected, then generates the corresponding network detection list according to the IP addresses of the host and of each target host. According to another embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, for each host, sets as target hosts to be detected the hosts that have the same number as the host and sit under the other switches of the Internet data center to which the host belongs, then generates the corresponding network detection list according to the IP addresses of the host and of each target host. According to still another embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, for each host, sets as target hosts to be detected the hosts that have the same number as the host and sit under each switch of the other Internet data centers to which the host does not belong, then generates the corresponding network detection list according to the IP addresses of the host and of each target host.
Considering that the more target hosts a network detection list covers, the better the network state can be detected, according to still another embodiment of the present invention the network detection list corresponding to the host associated with each data probe can be pre-generated as follows. First, the network topology information is obtained from the configuration management database and, for each host, all other hosts under the switch to which the host belongs are set as target hosts to be detected. Then, the hosts that have the same number as the host and sit under the other switches of the Internet data center to which the host belongs are set as target hosts to be detected. Next, the hosts that have the same number as the host and sit under each switch of the other Internet data centers to which the host does not belong are set as target hosts to be detected. Finally, the corresponding network detection list is generated according to the IP addresses of the host and of each target host, and the network detection list is stored in the cache database.
In this embodiment, the network topology information is specifically as follows. Network fault detection system 100 accesses 2 Internet data centers, denoted IDC1 and IDC2. IDC1 and IDC2 each have 2 switches deployed: the switches deployed in IDC1 are denoted SW1 and SW2, and the switches deployed in IDC2 are denoted SW3 and SW4. Each of switches SW1, SW2, SW3, and SW4 has 4 hosts connected under it. The hosts connected to SW1 are denoted H1, H2, H3, and H4, numbered 1, 2, 3, and 4 in turn; the hosts connected to SW2 are denoted H5, H6, H7, and H8, numbered 1, 2, 3, and 4 in turn; the hosts connected to SW3 are denoted H9, H10, H11, and H12, numbered 1, 2, 3, and 4 in turn; and the hosts connected to SW4 are denoted H13, H14, H15, and H16, numbered 1, 2, 3, and 4 in turn. It follows that there are 16 hosts in total, so the number N of data probes is also 16, and hosts H1 to H16 are deployed with data probes 1 to 16, respectively. Table 1 shows a storage example of the network topology information according to one embodiment of the present invention, as follows:
Table 1
The process of pre-generating the corresponding network detection list is described below, taking host H1 associated with data probe 1 as an example. For host H1, the other hosts H2, H3, and H4 under switch SW1, to which the host belongs, are all set as target hosts to be detected. Host H5, which has the same number as H1 and sits under the other switch SW2 of Internet data center IDC1 to which the host belongs, is set as a target host to be detected. Hosts H9 and H13, which have the same number as H1 and sit under switches SW3 and SW4 of the other Internet data center IDC2 to which the host does not belong, are set as target hosts to be detected. The corresponding network detection list L1 is generated according to the IP addresses of host H1 and of target hosts H5, H9, and H13, and network detection list L1 is stored in the cache database. In addition, for the feasibility and convenience of detection, network detection list L1 also includes the IP addresses of the switches and Internet data centers to which host H1 and target hosts H5, H9, and H13 are connected. Unless otherwise noted below, data probe 1 and its associated host H1 are used as the example to further illustrate the technical solution.
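The target selection for H1 described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the `Host` record, the function names, and the use of host names in place of IP addresses are assumptions made for the example, and the three target groups are returned separately so that a caller can combine them as a given embodiment requires.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Host:
    name: str    # e.g. "H1" (stands in for the host's IP address)
    number: int  # per-switch number, 1..4 in the example topology
    switch: str  # e.g. "SW1"
    idc: str     # e.g. "IDC1"


def build_topology():
    """Rebuild the 2-IDC / 4-switch / 16-host example topology."""
    switches = [("SW1", "IDC1"), ("SW2", "IDC1"), ("SW3", "IDC2"), ("SW4", "IDC2")]
    hosts = []
    for s_idx, (switch, idc) in enumerate(switches):
        for number in range(1, 5):
            hosts.append(Host(f"H{s_idx * 4 + number}", number, switch, idc))
    return hosts


def target_hosts(source, hosts):
    """Group the targets of `source` by the three selection rules."""
    same_switch = [h.name for h in hosts
                   if h.switch == source.switch and h != source]
    same_idc = [h.name for h in hosts
                if h.idc == source.idc and h.switch != source.switch
                and h.number == source.number]
    other_idc = [h.name for h in hosts
                 if h.idc != source.idc and h.number == source.number]
    return {"same_switch": same_switch, "same_idc": same_idc, "other_idc": other_idc}


hosts = build_topology()
targets = target_hosts(hosts[0], hosts)  # hosts[0] is H1
```

Selecting only same-numbered hosts across switches keeps the list for each host short while still exercising every switch-to-switch path.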
Because the network detection list changes as the network topology information changes, according to another embodiment of the present invention, network server 700 obtains the network topology information stored in the configuration management database at a preset third time interval. If the obtained network topology information has changed relative to the previous network topology information, a new network detection list is generated according to the obtained network topology information and stored in the cache database as an update. In this embodiment, the network topology information stored in the configuration management database can be updated in real time, and the third time interval is preset to 30 minutes.
According to one embodiment of the present invention, the process in step S311 by which a data probe obtains from network server 700 the network detection list corresponding to its associated host can be implemented as follows. First, at a preset second time interval, the data probe sends a list update request to network server 700 to obtain the version of the network detection list corresponding to its associated host. Network server 700 then responds to the list update request by sending the data probe the version of the network detection list corresponding to its associated host, so that the data probe can decide from that version whether to update its network detection list. The data probe receives the version of the latest network detection list returned by network server 700 and compares it with the version of its current network detection list. If the returned version is newer than the version of the data probe's current network detection list, the data probe sends update confirmation information to network server 700. Network server 700 receives the update confirmation information fed back by the data probe and sends the data probe the network detection list corresponding to its associated host. The data probe thus obtains from network server 700 the network detection list corresponding to its associated host and uses it to replace its current network detection list.
In this embodiment, the second time interval is preset to 10 minutes, and data probe 1 sends a list update request to network server 700 every 10 minutes to obtain the version of network detection list L1 corresponding to its associated host H1. Network server 700 responds to the list update request sent by data probe 1 by sending data probe 1 the version of network detection list L1 corresponding to its associated host H1, which is 3.1.5. Data probe 1 receives the version of the latest network detection list L1 returned by network server 700 and compares it with the version of its current network detection list. Because the version of the current network detection list is 3.1.4 and version 3.1.5 is newer than version 3.1.4, the network detection list needs to be updated, and data probe 1 sends update confirmation information to network server 700. Network server 700 receives the update confirmation information fed back by data probe 1 and sends the data probe the latest network detection list L1 corresponding to its associated host H1. Finally, data probe 1 replaces its current network detection list with network detection list L1.
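The version comparison in this exchange can be sketched as below. The text does not say how versions are compared, so the sketch assumes dotted numeric versions compared component by component; `parse_version` and `needs_update` are hypothetical names.

```python
def parse_version(version: str) -> tuple:
    """Turn a dotted version such as "3.1.5" into a tuple of integers."""
    return tuple(int(part) for part in version.split("."))


def needs_update(current: str, latest: str) -> bool:
    """Return True when the server's version is newer than the probe's."""
    return parse_version(latest) > parse_version(current)


# The example from the text: the probe holds 3.1.4, the server reports 3.1.5.
update_required = needs_update("3.1.4", "3.1.5")
```

Comparing integer tuples rather than raw strings avoids treating "3.1.10" as older than "3.1.9".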
Then, step S312 is performed: the data probe detects the network state at a preset first time interval according to the obtained network detection list. According to one embodiment of the present invention, the first time interval is preset to 120 seconds, and data probe 1, according to the obtained network detection list L1, detects the network state between host H1 and target hosts H5, H9, and H13 every 120 seconds, typically using the PING (Packet Internet Groper) command to check whether the network is connected. The protocol used in the detection process is not restricted; common choices include HTTP, TCP, UDP, and ICMP. Different detection protocols serve different detection needs: if indicators of service quality matter more, HTTP or TCP can be used, and if indicators of the network itself matter more, ICMP can be used. Detection protocols can also be combined arbitrarily, for example by using HTTP and ICMP at the same time. The detection protocol is generally specified by network server 700 in the network detection list, so that the data probe can read the corresponding detection protocol from the network detection list after obtaining it.
Next, in step S313, the data probe sends the detection results obtained from detecting the network state to network server 700. According to one embodiment of the present invention, each time data probe 1 finishes detecting one target host, it generates one corresponding detection result. A detection result generally includes the following fields: operating system, detection start time, detection end time, detection protocol, target host, the switch to which the target host belongs, and/or the Internet data center to which the target host belongs. Because transmitting each detection result individually would place an unnecessary burden on the system, the detection results of a round can be transmitted in batches, for example by compressing and packing every 128 detection results before sending them to network server 700.
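A minimal sketch of such batching follows. The text specifies only the batch size of 128 and that results are compressed and packed; the JSON encoding, the use of zlib, and the choice to flush a final partial batch are assumptions made for this example.

```python
import json
import zlib

BATCH_SIZE = 128  # batch size given in the text


def pack_batches(results, batch_size=BATCH_SIZE):
    """Yield compressed payloads, each holding up to `batch_size` results."""
    for start in range(0, len(results), batch_size):
        batch = results[start:start + batch_size]
        # Assumption: results are JSON-serializable dictionaries.
        yield zlib.compress(json.dumps(batch).encode("utf-8"))


def unpack_batch(payload):
    """Reverse of pack_batches for a single payload (server side)."""
    return json.loads(zlib.decompress(payload).decode("utf-8"))


# 300 hypothetical results produce two full batches and one partial batch.
results = [{"target": f"H{i}", "protocol": "ICMP"} for i in range(300)]
payloads = list(pack_batches(results))
```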
Further, to control the occupation of server resources and avoid interfering with the normal execution of online services, each data probe needs to check the system resources it consumes continuously and act on the consumption. According to one embodiment of the present invention, the data probe judges, according to a preset resource occupation rule, whether the system resources it consumes are overloaded. If the consumed system resources are overloaded, detection stops; if they are not overloaded, detection continues. In this embodiment, the resource occupation rule can be preset as CPU usage below 99% and memory usage below 95%. For data probe 1, its CPU usage on host H1 is 75% and its memory usage is 50%, so the system resources consumed by data probe 1 are not overloaded and detection can continue.
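The resource occupation rule can be expressed as a simple predicate. This sketch uses the thresholds from the text; how the probe measures its own CPU and memory usage is not described, so the two percentages are taken as plain inputs, and `within_limits` is a hypothetical name.

```python
CPU_LIMIT_PERCENT = 99.0     # CPU usage must stay below this value
MEMORY_LIMIT_PERCENT = 95.0  # memory usage must stay below this value


def within_limits(cpu_percent: float, memory_percent: float) -> bool:
    """Return True when the probe is not overloaded and may keep detecting."""
    return cpu_percent < CPU_LIMIT_PERCENT and memory_percent < MEMORY_LIMIT_PERCENT


# Data probe 1 in the text: 75% CPU, 50% memory.
probe_1_may_continue = within_limits(75.0, 50.0)
```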
After receiving the detection results reported by each data probe, network server 700 performs step S321 and converts each detection result into corresponding metric data. According to one embodiment of the present invention, when forming the metric data, each detection result is first validated, and the detection results that pass validation are then converted into corresponding metric data. The metric data includes the current host, the target host, a Metric value, the network interaction time, and/or a timestamp. The current host is the host that executes the probe command, the target host is the host being detected, and the Metric value can be a field that measures the optimal path, such as hop count. Validation usually checks first whether the value of each field in a detection result is within a reasonable range; if the values of all fields in the detection result are within a reasonable range, the detection result passes validation.
After forming the metric data, network server 700 proceeds to step S322 and sends the metric data generated in step S321 to the corresponding aggregation server according to a preset metric sending rule. There can be one or more aggregation servers. Although Fig. 1 shows only aggregation server 800, multiple aggregation servers can be provided in practice to speed up the subsequent diagnosis of network faults. When there is only 1 aggregation server, for example only aggregation server 800, network server 700 sends the metric data directly to aggregation server 800. When there are multiple aggregation servers, according to one embodiment of the present invention, the metric sending rule is preset to select the aggregation server that receives the metric data by means of a hash algorithm: the addresses of the aggregation servers are sorted to obtain their order, a hash value is first computed from the Metric value in each item of metric data, the hash value is reduced by a modulo operation to obtain an index value, and the metric data is sent to the aggregation server whose position in the order equals that index value. Hash algorithms are mature existing technology and are not described further here. It should be noted that the metric sending rule can be preset with a different algorithm as actual conditions require. Such variations are readily apparent to those skilled in the art who understand the solution of the present invention, fall within the protection scope of the present invention, and are not described further here.
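The hash-based selection can be sketched as follows. The text names neither the hash function nor the modulus, so this example assumes MD5 from the standard library (chosen only because it is stable across processes) and a modulus equal to the number of aggregation servers; the addresses are hypothetical.

```python
import hashlib


def pick_aggregator(metric_value: str, addresses):
    """Map a Metric value to one aggregation server address."""
    ordered = sorted(addresses)  # fixed order shared by all senders
    digest = hashlib.md5(metric_value.encode("utf-8")).hexdigest()
    index = int(digest, 16) % len(ordered)  # assumption: modulus = server count
    return ordered[index]


# Hypothetical aggregation server addresses, given in two different orders.
servers = ["10.0.0.3", "10.0.0.1", "10.0.0.2"]
first_choice = pick_aggregator("H1->H5", servers)
second_choice = pick_aggregator("H1->H5", list(reversed(servers)))
```

Sorting the addresses before indexing means every network server routes the same Metric value to the same aggregation server, regardless of the order in which it learned the addresses.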
Aggregation server 800 receives one or more items of metric data reported by network server 700, the metric data having been formed by network server 700 from the detection results reported by each data probe. Aggregation server 800 then first filters the metric data: items whose network interaction time exceeds a preset threshold, or whose network interaction time indicates a network connection failure, are filtered out and marked as metric data of abnormal status, and the remaining items are marked as metric data of normal status. According to one embodiment of the present invention, a network interaction time exceeding the preset threshold means that the PING value obtained after executing the PING command exceeds the threshold, and a network interaction time indicating a network connection failure means that the PING did not get through.
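A minimal sketch of this filtering step is shown below. The field name `rtt_ms`, the use of `None` to represent a PING that did not get through, and the 200 ms threshold are all assumptions for illustration; the text does not give a threshold value.

```python
def split_metrics(metrics, threshold_ms):
    """Separate metric items into (normal, abnormal) lists."""
    normal, abnormal = [], []
    for item in metrics:
        rtt = item["rtt_ms"]  # None stands for a failed connection
        if rtt is None or rtt > threshold_ms:
            abnormal.append(item)
        else:
            normal.append(item)
    return normal, abnormal


# Hypothetical metric items for host H1.
metrics = [
    {"source": "H1", "target": "H5", "rtt_ms": 0.4},
    {"source": "H1", "target": "H9", "rtt_ms": 350.0},
    {"source": "H1", "target": "H13", "rtt_ms": None},
]
normal, abnormal = split_metrics(metrics, threshold_ms=200.0)
```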
Aggregation server 800 then performs step S331 and detects network connectivity according to the metric data of abnormal status. According to one embodiment of the present invention, when detecting network connectivity, aggregation server 800 uses the metric data of abnormal status to determine, for every two switches under each Internet data center, the proportion of hosts under one switch that show a network failure when connecting to the other switch (the host quantity ratio). For each switch whose host quantity ratio exceeds a preset first ratio, it is judged that a network connection fault between switches exists. The first ratio is preset to 0.5. In this embodiment, for Internet data center IDC1, the metric data of abnormal status indicates that hosts H1, H2, and H3 under switch SW1 show a network failure when connecting to switch SW2. SW1 has 4 hosts in total, so the host quantity ratio of hosts under SW1 that show a network failure when connecting to SW2 is 3/4 = 0.75, which exceeds the first ratio, and it is judged that SW1 has a network connection fault between switches. For Internet data center IDC2, the metric data of abnormal status indicates that host H10 under switch SW3 shows a network failure when connecting to switch SW4. SW3 has 4 hosts in total, so the host quantity ratio of hosts under SW3 that show a network failure when connecting to SW4 is 1/4 = 0.25, which does not exceed the first ratio, and it is judged that SW3 has no network connection fault between switches.
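The host quantity ratio check can be sketched as follows, using the SW1 and SW3 figures from the text. The tuple layout of the abnormal records and the function name are assumptions made for the example.

```python
from collections import defaultdict

FIRST_RATIO = 0.5  # preset first ratio from the text


def switch_faults(abnormal, hosts_per_switch, first_ratio=FIRST_RATIO):
    """Compute, per (source switch, target switch) pair, the share of
    source-switch hosts failing to reach the target switch.

    `abnormal` holds (source_host, source_switch, target_switch) tuples.
    Returns {(src, dst): (ratio, is_fault)}.
    """
    failing = defaultdict(set)
    for host, src_switch, dst_switch in abnormal:
        failing[(src_switch, dst_switch)].add(host)  # count each host once
    report = {}
    for (src_switch, dst_switch), hosts in failing.items():
        ratio = len(hosts) / hosts_per_switch[src_switch]
        report[(src_switch, dst_switch)] = (ratio, ratio > first_ratio)
    return report


abnormal = [
    ("H1", "SW1", "SW2"), ("H2", "SW1", "SW2"), ("H3", "SW1", "SW2"),
    ("H10", "SW3", "SW4"),
]
report = switch_faults(abnormal, {"SW1": 4, "SW2": 4, "SW3": 4, "SW4": 4})
```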
For each switch whose host quantity ratio does not exceed the preset first ratio, according to one embodiment of the present invention, aggregation server 800 determines, under the Internet data center to which the switch belongs, the proportion of switches that show a network failure when connecting to each switch under other Internet data centers (the first switch quantity ratio). For each Internet data center whose first switch quantity ratio exceeds a preset second ratio, it is judged that a network connection fault between the Internet data center and a switch exists. The second ratio is preset to 0.5. In this embodiment, the host quantity ratio of switch SW3 does not exceed the preset first ratio. For SW3, the metric data of abnormal status indicates that, under its Internet data center IDC2, switches SW3 and SW4 both show a network failure when connecting to switch SW2 under Internet data center IDC1. With 2 switches in total under IDC1, the first switch quantity ratio of switches under IDC2 that show a network failure when connecting to switch SW2 under IDC1 is 2/2 = 100%, which exceeds the second ratio, and it is judged that IDC2 has a network connection fault between an Internet data center and a switch.
For each switch whose host quantity ratio does not exceed the preset first ratio, according to another embodiment of the present invention, aggregation server 800 determines the proportion of switches under other Internet data centers to which the switch's own Internet data center shows a network failure when connecting (the second switch quantity ratio). For each Internet data center whose second switch quantity ratio exceeds a preset third ratio, it is judged that a network connection fault between Internet data centers exists. The third ratio is preset to 0.5. In this embodiment, the host quantity ratio of switch SW3 does not exceed the preset first ratio. For SW3, the metric data of abnormal status indicates that its Internet data center IDC2 shows a network failure when connecting to switches SW1 and SW2 under Internet data center IDC1. With 2 switches in total under IDC1, the second switch quantity ratio of switches under IDC1 to which IDC2 shows a network failure is 2/2 = 100%, which exceeds the preset third ratio, and it is judged that IDC2 has a network connection fault between Internet data centers.
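The check between Internet data centers can be sketched in the same style, using the IDC2 to IDC1 figures from the text. The function name and input shapes are assumptions; the sketch covers only the second switch quantity ratio.

```python
THIRD_RATIO = 0.5  # preset third ratio from the text


def idc_fault(unreachable_switches, remote_idc_switches, third_ratio=THIRD_RATIO):
    """Return (ratio, is_fault) for one local IDC against one remote IDC.

    `unreachable_switches` are the remote switches the local IDC fails to
    reach; `remote_idc_switches` are all switches of the remote IDC.
    """
    remote = set(remote_idc_switches)
    ratio = len(set(unreachable_switches) & remote) / len(remote)
    return ratio, ratio > third_ratio


# IDC2 fails to reach both SW1 and SW2 under IDC1.
idc2_to_idc1 = idc_fault({"SW1", "SW2"}, ["SW1", "SW2"])
```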
After aggregation server 800 completes the network connectivity detection, step S332 is performed: if the detection result indicates a network connection fault, an alarm is issued. According to one embodiment of the present invention, SW1 has a network connection fault between switches, and IDC2 has both a network connection fault between an Internet data center and a switch and a network connection fault between Internet data centers. A network fault is thereby determined, and an alarm is sent to the relevant staff and systems.
In step S333, aggregation server 800 aggregates the metric data of normal status to obtain corresponding network quality data. According to one embodiment of the present invention, when generating the network quality data, quantile data is first generated from the metric data of normal status at a preset fourth time interval. The metric data of normal status is then aggregated by corresponding switch, and the network quality indicators of each switch are computed. The metric data of normal status is also aggregated by corresponding Internet data center, and the network quality indicators of each Internet data center are computed. The generated quantile data and the network quality indicators of each switch and each Internet data center are then combined to form the corresponding network quality data. In this embodiment, the fourth time interval can be preset to 5 minutes and 10 minutes. For the metric data of normal status, the metric data can first be classified by source, that is, by current host, using a hash algorithm and the Metric value in the metric data. Then, for each current host acting as a source, the item of metric data ranked at the 50% position is produced by aggregation every 5 minutes as the 50th quantile, and the item of metric data ranked at the 99% position is produced by aggregation every 10 minutes as the 99th quantile. The 50th and 99th quantiles together form the quantile data. The metric data of normal status corresponding to switches SW1, SW2, SW3, and SW4 is aggregated separately, and the network quality indicators of each switch are generated from the corresponding statistics. The metric data of normal status corresponding to Internet data centers IDC1 and IDC2 is aggregated separately, and the network quality indicators of each Internet data center are generated from the corresponding statistics. Finally, the quantile data and the network quality indicators of switches SW1, SW2, SW3, and SW4 and of Internet data centers IDC1 and IDC2 are combined to form the corresponding network quality data.
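The quantile step can be sketched as below. The text does not specify how the quantile position is computed, so this example assumes the nearest-rank method over the network interaction times collected in one window; the sample values are hypothetical.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile: the value at position ceil(pct% * n)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical round-trip times (ms) for one source host in one window.
window = list(range(1, 101))
p50 = percentile(window, 50)
p99 = percentile(window, 99)
```

In a deployment following the text, `p50` would be recomputed for each source host every 5 minutes and `p99` every 10 minutes.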
After obtaining the network quality data, aggregation server 800 performs step S334 and sends the network quality data to database server 900. According to one embodiment of the present invention, aggregation server 800 sends the network quality data to database server 900 for storage.
After database server 900 receives the network quality data sent by aggregation server 800, it proceeds to step S341 and stores that network quality data. According to one embodiment of the present invention, to speed up queries, the network quality data generated by aggregation is first written to the memory cache of database server 900. The cache is kept for a certain period, such as one day, after which the corresponding network quality data is moved to persistent storage, for example OpenTSDB (Open Time Series Database, an open-source time series database).
Fig. 4 shows a flowchart of a network fault detection method 400 according to still another embodiment of the present invention. When computing device 200 is implemented as aggregation server 800, one or more programs 222 of computing device 200 include instructions for performing the network fault detection method 400 according to the present invention. According to one embodiment of the present invention, aggregation server 800 is communicatively connected to one or more network servers and to database server 900. Network server 700 accesses multiple Internet data centers, each Internet data center has multiple switches deployed, each switch has multiple hosts connected to it, and each host has a corresponding data probe installed in advance that is communicatively connected to network server 700.
As shown in Fig. 4, method 400 starts at step S410. In step S410, one or more items of metric data reported by network server 700 are received, the metric data having been formed by network server 700 from the detection results reported by each data probe. According to one embodiment of the present invention, the metric data includes the current host, the target host, a Metric value, the network interaction time, and/or a timestamp. The current host is the host that executes the probe command, the target host is the host being detected, and the Metric value can be a field that measures the optimal path, such as hop count. After receiving the metric data, aggregation server 800 first filters it: items whose network interaction time exceeds a preset threshold, or whose network interaction time indicates a network connection failure, are filtered out and marked as metric data of abnormal status, and the remaining items are marked as metric data of normal status. For the specific processing in step S410 and in filtering the metric data, refer to the description of how aggregation server 800 receives and filters metric data in method 300; it is not repeated here.
Then, step S420 is performed: network connectivity is detected according to the metric data of abnormal status. According to one embodiment of the present invention, when detecting network connectivity, the metric data of abnormal status is used to determine, for every two switches under each Internet data center, the proportion of hosts under one switch that show a network failure when connecting to the other switch (the host quantity ratio). For each switch whose host quantity ratio exceeds a preset first ratio, it is judged that a network connection fault between switches exists. For each switch whose host quantity ratio does not exceed the preset first ratio, according to one embodiment of the present invention, the proportion of switches under the switch's Internet data center that show a network failure when connecting to each switch under other Internet data centers (the first switch quantity ratio) is determined, and for each Internet data center whose first switch quantity ratio exceeds a preset second ratio, it is judged that a network connection fault between the Internet data center and a switch exists. For each switch whose host quantity ratio does not exceed the preset first ratio, according to still another embodiment of the present invention, the proportion of switches under other Internet data centers to which the switch's own Internet data center shows a network failure when connecting (the second switch quantity ratio) is determined, and for each Internet data center whose second switch quantity ratio exceeds a preset third ratio, it is judged that a network connection fault between Internet data centers exists. The first ratio, the second ratio, and the third ratio are all preset to 0.5. For the specific processing in step S420, refer to the description of step S331 in method 300; it is not repeated here.
In step S430, if the detection result indicates a network connection fault, an alarm is issued. For the specific processing in step S430, refer to the description of step S332 in method 300; it is not repeated here.
Next, in step S440, the metric data of normal status is aggregated to obtain corresponding network quality data. According to one embodiment of the present invention, when generating the network quality data, quantile data is first generated from the metric data of normal status at a preset fourth time interval. The metric data of normal status is aggregated by corresponding switch, and the network quality indicators of each switch are computed. The metric data of normal status is also aggregated by corresponding Internet data center, and the network quality indicators of each Internet data center are computed. The generated quantile data and the network quality indicators of each switch and each Internet data center are then combined to form the corresponding network quality data. For the specific processing in step S440, refer to the description of step S333 in method 300; it is not repeated here.
Finally, in step S450, the network quality data is sent to database server 900 for storage. For the specific processing in step S450, refer to the description of step S341 in method 300; it is not repeated here.
Existing network fault detection schemes mostly fall into two classes. In the first, operations engineers locate the fault manually, relying on experience, after a problem has already occurred; this is time-consuming and laborious, lacks supporting data, and has low accuracy. In the second, probes are deployed on all online servers to probe network conditions actively, but network topology coverage is incomplete and the detection data is difficult to exploit fully. In the technical solution for network fault detection according to embodiments of the present invention, one or more items of metric data reported by the network server are first received, network connectivity is detected according to the metric data of abnormal status, an alarm is issued if the detection result indicates a network connection fault, and the metric data of normal status is aggregated to obtain corresponding network quality data. In this technical solution, abnormal network connection states trigger an active alarm, which greatly shortens the fault response time, and by aggregating the metric data in a distributed manner, the network quality data between any two switches and between any two Internet data centers can be obtained in real time. Further, the metric data is formed by the network server from the detection results reported by each data probe, and each data probe performs detection according to the network detection list issued by the network server. The network detection list is generated by the network server from network topology information, which includes the numbers and connection relationships of each host, each switch, and each Internet data center. A network detection list generated from this network topology information can cover all hosts, achieving full network coverage while reducing the amount of computation and the consumption of system resources. The network connection status between any two hosts can therefore be detected in real time, and issuing network detection lists allows the system to adapt to changes in network topology.
B10. The method of B9, wherein the step in which the data probe obtains, from the network server, the network probe list corresponding to its associated host comprises:
sending a list update request to the network server at a preset second time interval to obtain the version of the network probe list corresponding to the associated host;
if that version is newer than the version of the data probe's current network probe list, obtaining the network probe list corresponding to the associated host from the network server to replace the data probe's current network probe list.
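The version check in B10 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `ProbeList` type, the integer version field, and the two callables standing in for requests to the network server are all assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class ProbeList:
    """Hypothetical in-memory form of a network probe list."""
    version: int                      # assumed to increase with every update
    targets: list = field(default_factory=list)


def refresh_probe_list(current, fetch_version, fetch_list):
    """Run one update cycle (called once per second time interval).

    fetch_version and fetch_list are placeholders for the requests the
    data probe would send to the network server.
    """
    remote_version = fetch_version()
    if remote_version > current.version:
        # The server holds a newer list: replace the local copy.
        return fetch_list()
    # Versions match (or the server is older): keep probing with the current list.
    return current
```

Fetching only the version first keeps the periodic request small; the full list is transferred only when it has actually changed.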
B11. The method of B9 or B10, further comprising:
the data probe judging, according to a preset resource occupation rule, whether the system resources it consumes are overloaded;
if they are overloaded, stopping probing;
if the consumed system resources are not overloaded, continuing to probe.
B12. The method of any one of B9-B11, wherein the network server is provided with a configuration management database and a cache database, the configuration management database stores the latest network topology information, and the network topology information includes the numbers and connection relationships of each host, each switch, and each Internet data center; the method further comprises the network server pre-generating the network probe list corresponding to the host associated with each data probe, and the pre-generating step comprises:
for each host, according to the network topology information, setting all other hosts under the switch to which the host belongs as target hosts to be probed;
setting, as target hosts to be probed, the hosts that have the same number as the host under the other switches included in the Internet data center to which the host belongs;
setting, as target hosts to be probed, the hosts that have the same number as the host under each switch included in the other Internet data centers, to which the host does not belong;
generating the corresponding network probe list from the IP addresses of the host and of each target host, and storing the network probe list in the cache database.
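The three target-selection rules of B12 can be sketched as a small function. The nested-dictionary topology layout (data center, then switch, then host number mapped to IP address) and the returned dictionary shape are assumptions made for illustration; the source does not specify how the topology or the list is stored.

```python
def build_probe_list(topology, idc, switch, number):
    """Build the probe list for one host.

    topology: {idc_name: {switch_name: {host_number: ip_address}}}
    (hypothetical layout; the source only says the topology holds
    numbers and connection relationships).
    """
    source_ip = topology[idc][switch][number]
    targets = []

    # Rule 1: every other host under the same switch.
    for n, ip in topology[idc][switch].items():
        if n != number:
            targets.append(ip)

    # Rule 2: the same-numbered host under the other switches of the same data center.
    for sw, hosts in topology[idc].items():
        if sw != switch and number in hosts:
            targets.append(hosts[number])

    # Rule 3: the same-numbered host under every switch of the other data centers.
    for other_idc, switches in topology.items():
        if other_idc == idc:
            continue
        for hosts in switches.values():
            if number in hosts:
                targets.append(hosts[number])

    return {"source": source_ip, "targets": targets}
```

Because only same-numbered hosts are probed across switches and data centers, each host probes far fewer targets than a full mesh would require, while every switch pair and data-center pair is still exercised.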
B13. The method of B12, further comprising:
the network server obtaining the network topology information stored in the configuration management database at a preset third time interval;
if the obtained network topology information has changed relative to the previous network topology information, generating a new network probe list from the obtained network topology information and storing it in the cache database as an update.
B14. The method of any one of B9-B13, wherein the step of forming each probe result into corresponding metric data and sending it to the corresponding aggregation server comprises:
validating each probe result;
converting the probe results that pass validation into corresponding metric data, and sending the metric data to the corresponding aggregation server according to a preset metric sending rule.
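The validate-convert-send sequence of B14 might look like the sketch below. The source does not define the probe result fields, the validation criteria, or the metric sending rule, so the field names (`source`, `target`, `reachable`, `rtt_ms`) and the CRC-based choice of aggregation server are purely illustrative assumptions.

```python
import zlib

# Hypothetical fields a probe result is assumed to carry.
REQUIRED = ("source", "target", "reachable", "rtt_ms")


def is_valid(result):
    """Reject results with missing fields or nonsensical values."""
    if any(key not in result for key in REQUIRED):
        return False
    rtt = result["rtt_ms"]
    if isinstance(rtt, bool) or not isinstance(rtt, (int, float)):
        return False
    return isinstance(result["reachable"], bool) and rtt >= 0


def to_metric(result):
    """Convert a validated probe result into a metric record with a state."""
    return {
        "source": result["source"],
        "target": result["target"],
        "rtt_ms": float(result["rtt_ms"]),
        "state": "normal" if result["reachable"] else "abnormal",
    }


def pick_server(metric, servers):
    """Assumed sending rule: a stable hash of the source picks the server."""
    key = metric["source"].encode("utf-8")
    return servers[zlib.crc32(key) % len(servers)]


def process(results, servers):
    """Group the metrics that would be sent to each aggregation server."""
    outbox = {server: [] for server in servers}
    for result in results:
        if is_valid(result):
            metric = to_metric(result)
            outbox[pick_server(metric, servers)].append(metric)
    return outbox
```

A stable hash is used here so that all metrics from one source host reach the same aggregation server; any other deterministic rule would serve equally well in this sketch.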
B15. The method of any one of B9-B14, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
according to the metric data whose state is abnormal, determining, for every two switches under each Internet data center, the proportion of hosts under one switch that experience a network failure when connecting to the other switch;
for each switch whose host proportion exceeds a preset first ratio, judging that a network connection failure exists between switches.
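The host-proportion test of B15 can be sketched as follows. The metric field names and the `hosts_per_switch` lookup are assumptions introduced for illustration; the default threshold of 0.5 follows the value given in claim 6. The caller is assumed to pass only abnormal metrics whose two switches belong to the same Internet data center.

```python
from collections import defaultdict


def failed_switch_pairs(abnormal, hosts_per_switch, first_ratio=0.5):
    """Return (src_switch, dst_switch, ratio) for switch pairs judged faulty.

    abnormal: abnormal-state metrics with hypothetical keys
              src_host, src_switch, dst_switch.
    hosts_per_switch: total number of hosts under each switch.
    """
    failing_hosts = defaultdict(set)
    for metric in abnormal:
        pair = (metric["src_switch"], metric["dst_switch"])
        # A set counts each failing host once, however many probes failed.
        failing_hosts[pair].add(metric["src_host"])

    flagged = []
    for (src, dst), hosts in failing_hosts.items():
        ratio = len(hosts) / hosts_per_switch[src]
        if ratio > first_ratio:
            flagged.append((src, dst, ratio))
    return flagged
```

Requiring a proportion of hosts rather than a single failed probe keeps one misbehaving host from being reported as a switch-level fault.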
B16. The method of B15, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
for each switch whose host proportion does not exceed the preset first ratio, determining, under the Internet data center to which the switch belongs, a first switch proportion, namely the proportion of switches that experience a network failure when connecting to each switch under other Internet data centers;
for each Internet data center whose first switch proportion exceeds a preset second ratio, judging that a network connection failure exists between the Internet data center and a switch.
B17. The method of B15 or B16, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
for each switch whose host proportion does not exceed the preset first ratio, determining a second switch proportion, namely the proportion of switches for which the Internet data center to which the switch belongs experiences a network failure when connecting to switches under other Internet data centers;
for each Internet data center whose second switch proportion exceeds a preset third ratio, judging that a network connection failure exists between Internet data centers.
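One possible reading of the data-center-level checks in B16 and B17 is sketched below. The translated text leaves the exact definitions of the first and second switch proportions open, so this is an interpretation rather than the patented logic: the first proportion is taken per (source data center, remote switch), the second per (source data center, remote data center). Field names and the `switch_idc` mapping are assumptions; the 0.5 defaults follow claim 6. The caller is assumed to have already removed switches flagged by the B15 check.

```python
from collections import defaultdict


def idc_level_faults(abnormal, switch_idc, second_ratio=0.5, third_ratio=0.5):
    """Return (idc_to_switch_faults, idc_to_idc_faults).

    abnormal: abnormal-state metrics with hypothetical keys src_switch, dst_switch.
    switch_idc: maps every switch name to its Internet data center.
    """
    switches_in = defaultdict(set)
    for sw, idc in switch_idc.items():
        switches_in[idc].add(sw)

    to_switch = defaultdict(set)  # (src_idc, dst_switch) -> failing source switches
    to_idc = defaultdict(set)     # (src_idc, dst_idc)    -> failing source switches
    for metric in abnormal:
        src, dst = metric["src_switch"], metric["dst_switch"]
        src_idc, dst_idc = switch_idc[src], switch_idc[dst]
        if src_idc == dst_idc:
            continue  # same-data-center failures belong to the B15 check
        to_switch[(src_idc, dst)].add(src)
        to_idc[(src_idc, dst_idc)].add(src)

    idc_switch = [key for key, sws in to_switch.items()
                  if len(sws) / len(switches_in[key[0]]) > second_ratio]
    idc_idc = [key for key, sws in to_idc.items()
               if len(sws) / len(switches_in[key[0]]) > third_ratio]
    return idc_switch, idc_idc
```

Under this reading, a fault that affects most switches of a data center when reaching one remote switch is attributed to the path toward that switch, while a fault that affects most switches when reaching anything in a remote data center is attributed to the link between the two data centers.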
B18. The method of any one of B9-B17, wherein the step of aggregating the metric data whose state is normal to obtain corresponding network quality data comprises:
generating quantile data from the metric data whose state is normal at a preset fourth time interval;
aggregating the metric data whose state is normal by its corresponding switch, and computing the network quality metrics of each switch;
aggregating the metric data whose state is normal by its corresponding Internet data center, and computing the network quality metrics of each Internet data center;
combining the generated quantile data with the network quality metrics of each switch and of each Internet data center to form the corresponding network quality data.
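The aggregation in B18 can be sketched with the standard library. The choice of round-trip time as the measured quantity, the mean as the per-switch and per-data-center quality metric, and the p50/p90/p99 selection are assumptions; the source names only "quantile data" and "network quality metrics". The function is meant to be called once per fourth time interval with the normal-state metrics collected in that window.

```python
from collections import defaultdict
from statistics import mean, quantiles


def aggregate(normal, switch_idc):
    """Aggregate one window of normal-state metrics into network quality data.

    normal: metrics with hypothetical keys src_switch and rtt_ms
            (needs at least two entries for quantiles()).
    switch_idc: maps every switch name to its Internet data center.
    """
    rtts = [m["rtt_ms"] for m in normal]
    # n=100 yields 99 cut points; index i-1 holds the i-th percentile.
    cuts = quantiles(rtts, n=100, method="inclusive")

    by_switch = defaultdict(list)
    by_idc = defaultdict(list)
    for m in normal:
        by_switch[m["src_switch"]].append(m["rtt_ms"])
        by_idc[switch_idc[m["src_switch"]]].append(m["rtt_ms"])

    return {
        "quantiles": {"p50": cuts[49], "p90": cuts[89], "p99": cuts[98]},
        "switch": {sw: mean(values) for sw, values in by_switch.items()},
        "idc": {idc: mean(values) for idc, values in by_idc.items()},
    }
```

The returned dictionary combines the three parts named in B18 into one record, which is the form that would then be sent to the database server.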
Numerous specific details are set forth in the description provided here. It will be understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
Similarly, it should be appreciated that in the above description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof in order to streamline the disclosure and aid in the understanding of one or more of the various inventive aspects. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art should understand that the modules, units, or components of the device in the examples disclosed herein may be arranged in a device as described in this embodiment, or alternatively may be located in one or more devices different from the device in the example. The modules in the foregoing examples may be combined into one module or may additionally be divided into multiple sub-modules.
Those skilled in the art will appreciate that the modules in the device of an embodiment may be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units, or components of the embodiments may be combined into one module, unit, or component, and may additionally be divided into multiple sub-modules, sub-units, or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract, and drawings) and all processes or units of any method or device so disclosed may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract, and drawings) may be replaced by an alternative feature serving the same, equivalent, or similar purpose.
Furthermore, those skilled in the art will appreciate that although some embodiments described herein include some features included in other embodiments and not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
Furthermore, some of the embodiments are described herein as a method, or a combination of method elements, that can be implemented by a processor of a computer system or by other means of carrying out the function. Thus, a processor with the necessary instructions for carrying out such a method or method element forms a means for carrying out the method or method element. Furthermore, an element of an apparatus embodiment described herein is an example of a means for carrying out the function performed by the element for the purpose of carrying out the invention.
The various techniques described herein may be implemented in connection with hardware or software, or a combination thereof. Thus, the method and apparatus of the present invention, or certain aspects or portions thereof, may take the form of program code (i.e., instructions) embodied in tangible media, such as floppy disks, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program is loaded into and executed by a machine such as a computer, the machine becomes an apparatus for practicing the invention.
Where the program code executes on programmable computers, the computing device generally includes a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. The memory is configured to store the program code; the processor is configured to perform the network fault detection method of the present invention according to the instructions in the program code stored in the memory.
By way of example and not limitation, computer-readable media include computer storage media and communication media. Computer storage media store information such as computer-readable instructions, data structures, program modules, or other data. Communication media typically embody computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and include any information delivery media. Combinations of any of the above are also included within the scope of computer-readable media.
As used herein, unless otherwise specified, the use of the ordinals "first", "second", "third", and so on to describe a common object merely indicates that different instances of like objects are being referred to, and is not intended to imply that the objects so described must be in a given sequence, whether temporally, spatially, in ranking, or in any other manner.
Although the invention has been described with respect to a limited number of embodiments, those skilled in the art, having the benefit of the above description, will appreciate that other embodiments can be devised within the scope of the invention as thus described. It should also be noted that the language used in this specification has been selected principally for readability and instructional purposes, and not to delineate or circumscribe the inventive subject matter. Accordingly, many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the appended claims. The disclosure of the present invention is intended to be illustrative, not limiting, of the scope of the invention, which is defined by the appended claims.

Claims (10)

1. A network fault detection method, adapted to be executed in an aggregation server, the aggregation server being communicatively connected to one or more network servers and to a database server, the network servers being connected to multiple Internet data centers, each Internet data center being deployed with multiple switches, each switch being connected to multiple hosts, and each host being preconfigured with a corresponding data probe and communicatively connected to the network server, the method comprising:
receiving one or more metric data items reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe;
detecting network connections according to each metric data item whose state is abnormal;
if the detection result indicates that a network connection failure has occurred, issuing an alarm;
aggregating the metric data whose state is normal to obtain corresponding network quality data;
sending the network quality data to the database server for storage.
2. The method of claim 1, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
according to the metric data whose state is abnormal, determining, for every two switches under each Internet data center, the proportion of hosts under one switch that experience a network failure when connecting to the other switch;
for each switch whose host proportion exceeds a preset first ratio, judging that a network connection failure exists between switches.
3. The method of claim 1 or 2, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
for each switch whose host proportion does not exceed the preset first ratio, determining, under the Internet data center to which the switch belongs, a first switch proportion, namely the proportion of switches that experience a network failure when connecting to each switch under other Internet data centers;
for each Internet data center whose first switch proportion exceeds a preset second ratio, judging that a network connection failure exists between the Internet data center and a switch.
4. The method of any one of claims 1-3, wherein the step of detecting network connections according to each metric data item whose state is abnormal comprises:
for each switch whose host proportion does not exceed the preset first ratio, determining a second switch proportion, namely the proportion of switches for which the Internet data center to which the switch belongs experiences a network failure when connecting to switches under other Internet data centers;
for each Internet data center whose second switch proportion exceeds a preset third ratio, judging that a network connection failure exists between Internet data centers.
5. The method of any one of claims 1-4, wherein the step of aggregating the metric data whose state is normal to obtain corresponding network quality data comprises:
generating quantile data from the metric data whose state is normal at a preset fourth time interval;
aggregating the metric data whose state is normal by its corresponding switch, and computing the network quality metrics of each switch;
aggregating the metric data whose state is normal by its corresponding Internet data center, and computing the network quality metrics of each Internet data center;
combining the generated quantile data with the network quality metrics of each switch and of each Internet data center to form the corresponding network quality data.
6. The method of any one of claims 1-5, wherein the first ratio, the second ratio, and the third ratio are preset to 0.5.
7. A computing device, comprising:
one or more processors;
a memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for performing any of the methods of claims 1-6.
8. A computer-readable storage medium storing one or more programs, the one or more programs comprising instructions which, when executed by a computing device, cause the computing device to perform any of the methods of claims 1-6.
9. A network fault detection method, adapted to be executed in a network fault detection system, the system comprising one or more network servers, an aggregation server, a database server, and multiple data probes, the system being connected to multiple Internet data centers, each Internet data center being deployed with multiple switches, each switch being connected to multiple hosts, each host being preconfigured with a corresponding data probe, and the network server storing the network probe list corresponding to the host associated with each data probe, the method comprising:
the data probe obtaining, from the network server, the network probe list corresponding to its associated host, probing the network state at a preset first time interval according to the obtained network probe list, and sending the probe results to the network server;
the network server receiving the probe results reported by each data probe, forming each probe result into corresponding metric data, and sending the metric data to the corresponding aggregation server;
the aggregation server receiving the one or more metric data items reported by the network server, detecting network connections according to each metric data item whose state is abnormal, issuing an alarm if the detection result indicates that a network connection failure has occurred, aggregating the metric data whose state is normal to obtain corresponding network quality data, and sending the network quality data to the database server;
the database server receiving and storing the network quality data sent by the aggregation server.
10. A network fault detection system, the system comprising one or more network servers, an aggregation server, a database server, and multiple data probes, the system being connected to multiple Internet data centers, each Internet data center being deployed with multiple switches, each switch being connected to multiple hosts, each host being preconfigured with a corresponding data probe, and the network server storing the network probe list corresponding to the host associated with each data probe, wherein in the system:
the data probe is adapted to obtain, from the network server, the network probe list corresponding to its associated host, to probe the network state at a preset first time interval according to the obtained network probe list, and to send the probe results to the network server;
the network server is adapted to receive the probe results reported by each data probe, to form each probe result into corresponding metric data, and to send the metric data to the corresponding aggregation server;
the aggregation server is adapted to receive the one or more metric data items reported by the network server, to detect network connections according to each metric data item whose state is abnormal, to issue an alarm if the detection result indicates that a network connection failure has occurred, to aggregate the metric data whose state is normal to obtain corresponding network quality data, and to send the network quality data to the database server;
the database server is adapted to receive and store the network quality data sent by the aggregation server.
CN201711216127.7A 2017-11-28 2017-11-28 Network fault detection method and system Active CN107835098B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711216127.7A CN107835098B (en) 2017-11-28 2017-11-28 Network fault detection method and system

Publications (2)

Publication Number Publication Date
CN107835098A true CN107835098A (en) 2018-03-23
CN107835098B CN107835098B (en) 2021-01-29

Family

ID=61646152

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711216127.7A Active CN107835098B (en) 2017-11-28 2017-11-28 Network fault detection method and system

Country Status (1)

Country Link
CN (1) CN107835098B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101291232A (en) * 2008-06-03 2008-10-22 北京星网锐捷网络技术有限公司 Ethernet port, Ethernet switch and signal receiving and sending method of Ethernet equipment
CN102055626A (en) * 2010-12-31 2011-05-11 北京中创信测科技股份有限公司 Internet protocol (IP) network quality detecting method and system
CN102891779A (en) * 2012-09-27 2013-01-23 北京网瑞达科技有限公司 Large-scale network performance measuring system and method for IP network
WO2016015041A1 (en) * 2014-07-25 2016-01-28 Blockchain Technologies Corporation System and method for creating a multi-branched blockchain with configurable protocol rules
CN105871634A (en) * 2016-06-01 2016-08-17 北京蓝海讯通科技股份有限公司 Method and application for detecting cluster anomalies and cluster managing system
CN106991033A (en) * 2017-04-01 2017-07-28 北京蓝海讯通科技股份有限公司 Notify method, device, server and the readable storage medium storing program for executing of alarm information

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111989897B (en) * 2018-04-10 2023-09-01 瞻博网络公司 Measuring index of computer network
US11595273B2 (en) 2018-04-10 2023-02-28 Juniper Networks, Inc. Measuring metrics of a computer network
CN111989897A (en) * 2018-04-10 2020-11-24 奈特朗茨公司 Measurement indicators for computer networks
CN109155745B (en) * 2018-07-16 2019-08-30 威富通科技有限公司 Payment gateway is connected to the network detection method and terminal device
CN109155745A (en) * 2018-07-16 2019-01-04 威富通科技有限公司 Payment gateway is connected to the network detection method and terminal device
CN112636942B (en) * 2019-10-08 2022-09-27 中国移动通信集团浙江有限公司 Method and device for monitoring service host node
CN112636942A (en) * 2019-10-08 2021-04-09 中国移动通信集团浙江有限公司 Method and device for monitoring service host node
CN110830324A (en) * 2019-10-28 2020-02-21 烽火通信科技股份有限公司 Method and device for detecting network connectivity of data center and electronic equipment
CN110830324B (en) * 2019-10-28 2021-09-03 烽火通信科技股份有限公司 Method and device for detecting network connectivity of data center and electronic equipment
CN110852387A (en) * 2019-11-13 2020-02-28 江苏能来能源互联网研究院有限公司 Energy internet super real-time state studying and judging algorithm
CN110852387B (en) * 2019-11-13 2022-04-22 江苏能来能源互联网研究院有限公司 Energy internet super real-time state studying and judging algorithm
CN110932894A (en) * 2019-11-22 2020-03-27 北京金山云网络技术有限公司 Network fault positioning method and device of cloud storage system and electronic equipment
CN111049691A (en) * 2019-12-25 2020-04-21 中国联合网络通信集团有限公司 Network fault positioning method, server, acquisition probe and storage medium
CN111049691B (en) * 2019-12-25 2022-06-10 中国联合网络通信集团有限公司 Network fault positioning method, server, acquisition probe and storage medium
CN111343049A (en) * 2020-03-02 2020-06-26 网联清算有限公司 Detection method, device, system and medium for transaction information data transmission special line
CN111817911A (en) * 2020-06-23 2020-10-23 腾讯科技(深圳)有限公司 Method and device for detecting network quality, computing equipment and storage medium
CN111817911B (en) * 2020-06-23 2023-08-08 腾讯科技(深圳)有限公司 Method, device, computing equipment and storage medium for detecting network quality
CN114095808A (en) * 2020-08-24 2022-02-25 华为技术有限公司 Network fault detection method, device, equipment and computer readable storage medium
CN114095808B (en) * 2020-08-24 2023-04-28 华为技术有限公司 Network fault detection method, device, equipment and computer readable storage medium
CN112165400A (en) * 2020-09-25 2021-01-01 天津大学 System for troubleshooting data network based on network delay
CN112994972B (en) * 2021-02-02 2022-05-20 成都卓源网络科技有限公司 Distributed probe monitoring platform
CN113676376A (en) * 2021-08-20 2021-11-19 北京交通大学 In-band network telemetering method based on clustering
CN113783752A (en) * 2021-08-26 2021-12-10 四川新网银行股份有限公司 Network quality monitoring method during mutual access of intranet cross-network inter-segment service systems
CN114157554A (en) * 2021-12-21 2022-03-08 唯品会(广州)软件有限公司 Troubleshooting method and device, storage medium and computer equipment
CN114157554B (en) * 2021-12-21 2024-02-23 唯品会(广州)软件有限公司 Fault checking method and device, storage medium and computer equipment
CN114172796A (en) * 2021-12-24 2022-03-11 中国工商银行股份有限公司 Fault positioning method and related device for communication network
CN114172796B (en) * 2021-12-24 2024-01-30 中国工商银行股份有限公司 Fault positioning method and related device for communication network
CN117880055A (en) * 2024-03-12 2024-04-12 灵长智能科技(杭州)有限公司 Network fault diagnosis method, device, equipment and medium based on transmission layer index

Also Published As

Publication number Publication date
CN107835098B (en) 2021-01-29

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant