CN107835098A - Network fault detection method and system - Google Patents
Network fault detection method and system
- Publication number
- CN107835098A CN201711216127.7A
- Authority
- CN
- China
- Prior art keywords
- network
- data
- switch
- network server
- detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/04—Processing captured monitoring data, e.g. for logfile generation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0811—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
Abstract
The invention discloses a network fault detection method and system. The method is suitable for execution in an aggregation server that is communicatively connected to one or more network servers and a database server. The network servers are connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts attached, and each host is provided in advance with a corresponding data probe that is communicatively connected to a network server. The method includes: receiving one or more pieces of metric data reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe; detecting network connections according to the metric data whose state is abnormal; sending an alarm if the detection result indicates a network connection fault; performing aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data; and sending the network quality data to the database server for storage.
Description
Technical field
The present invention relates to the field of computer networks, and in particular to a network fault detection method and system.
Background
Internet companies often own thousands of servers, distributed across multiple machine rooms nationwide and interconnected to form the company intranet. Extensive network communication takes place between machine rooms and between servers on the intranet, so the importance of communication quality is self-evident.
Many causes lead to communication quality problems across machine rooms, such as failures of individual devices like network cards, network cables, and servers. The probability of any one such failure is very small, but it cannot be ignored once multiplied by a huge base. Switch single points of failure, network congestion, leased-line faults, and the like also occur often. Faults are therefore almost unavoidable, so cross-machine-room network quality must be monitored accurately and promptly and faults must be located, which is essential to keeping the company's business running normally.
Existing network fault detection schemes mostly fall into two classes. One is manual fault location: after a problem has occurred, operations engineers narrow the scope step by step based on experience, checking suspect lines, switches, or servers. This is time-consuming and laborious, lacks supporting data, and has low accuracy. The other is active probing of network conditions by deploying probes on all servers, but it suffers from incomplete coverage of the network topology and from probe data that is difficult to use comprehensively. A new network fault detection scheme is therefore needed to improve on the above.
Summary of the invention
To this end, the present invention provides a technical solution for network fault detection that seeks to solve, or at least alleviate, the problems described above.
According to one aspect of the invention, a network fault detection method is provided, suitable for execution in an aggregation server. The aggregation server is communicatively connected to one or more network servers and a database server. The network servers are connected to multiple Internet data centers; each Internet data center is deployed with multiple switches, each switch has multiple hosts attached, and each host is provided in advance with a corresponding data probe that is communicatively connected to a network server. The method comprises the following steps: first, receiving one or more pieces of metric data reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe; detecting network connections according to the metric data whose state is abnormal; sending an alarm if the detection result indicates a network connection fault; performing aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data; and sending the network quality data to the database server for storage.
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: according to that metric data, detecting, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network fault when connecting to the other switch; and, for each switch whose host proportion exceeds a preset first ratio, determining that a network connection fault exists between the switches.
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: for each switch whose host proportion does not exceed the preset first ratio, detecting a first switch proportion, namely the proportion of switches under the Internet data center to which that switch belongs that have a network fault when connecting to each switch under other Internet data centers; and, for each Internet data center whose first switch proportion exceeds a preset second ratio, determining that a network connection fault exists between that Internet data center and switches.
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: for each switch whose host proportion does not exceed the preset first ratio, detecting a second switch proportion, namely the proportion of switches for which the Internet data center to which that switch belongs has a network fault when connecting to switches under other Internet data centers; and, for each Internet data center whose second switch proportion exceeds a preset third ratio, determining that a network connection fault exists between Internet data centers.
Optionally, in the network fault detection method according to the invention, the step of performing aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data includes: generating quantile data from that metric data at a preset fourth time interval; aggregating the metric data by corresponding switch and computing the network quality indicators of each switch; aggregating the metric data by corresponding Internet data center and computing the network quality indicators of each Internet data center; and combining the generated quantile data with the network quality indicators of each switch and each Internet data center to form the corresponding network quality data.
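The aggregation step can be illustrated with a minimal Python sketch. The record fields (`switch`, `idc`, `latency_ms`), the nearest-rank quantile definition, the chosen quantiles, and the use of mean latency as the quality indicator are all assumptions made for illustration; the patent does not specify them. The function is meant to be called once per aggregation interval.

```python
import math
from collections import defaultdict
from statistics import mean


def nearest_rank(values, q):
    """Nearest-rank quantile of a non-empty list, for 0 < q <= 1."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def aggregate(normal_metrics, quantile_points=(0.5, 0.9, 0.99)):
    """Combine quantiles with per-switch and per-IDC indicators.

    Each record is a dict with hypothetical keys:
    'switch', 'idc', and 'latency_ms'.
    """
    latencies = [m["latency_ms"] for m in normal_metrics]
    quantiles = {q: nearest_rank(latencies, q) for q in quantile_points}

    by_switch = defaultdict(list)
    by_idc = defaultdict(list)
    for m in normal_metrics:
        by_switch[m["switch"]].append(m["latency_ms"])
        by_idc[m["idc"]].append(m["latency_ms"])

    return {
        "quantiles": quantiles,
        "per_switch": {k: mean(v) for k, v in by_switch.items()},
        "per_idc": {k: mean(v) for k, v in by_idc.items()},
    }
```

Grouping by switch and by Internet data center in a single pass keeps the cost linear in the number of records; a production system would likely track loss rate and other indicators as well.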
Optionally, in the network fault detection method according to the invention, the first ratio, the second ratio, and the third ratio are each preset to 0.5.
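As a rough illustration of the first-ratio check, the Python sketch below flags switch pairs for which more than half of the hosts under the source switch report a fault toward the destination switch. The record layout (`src_host`, `src_switch`, `dst_switch`) and the `hosts_per_switch` mapping are hypothetical names introduced here; only the 0.5 threshold comes from the text.

```python
from collections import defaultdict

FIRST_RATIO = 0.5  # preset first ratio stated in the text


def faulty_switch_pairs(abnormal_metrics, hosts_per_switch,
                        first_ratio=FIRST_RATIO):
    """Return (src_switch, dst_switch, ratio) for pairs over the threshold.

    abnormal_metrics: dicts with hypothetical keys 'src_host',
    'src_switch', and 'dst_switch'.
    hosts_per_switch: total number of hosts attached to each switch.
    """
    failing_hosts = defaultdict(set)
    for rec in abnormal_metrics:
        pair = (rec["src_switch"], rec["dst_switch"])
        failing_hosts[pair].add(rec["src_host"])

    flagged = []
    for (src, dst), hosts in sorted(failing_hosts.items()):
        ratio = len(hosts) / hosts_per_switch[src]
        if ratio > first_ratio:  # strictly exceeds the preset ratio
            flagged.append((src, dst, ratio))
    return flagged
```

Pairs that stay at or below the threshold would then be passed on to the data-center-level checks that use the second and third ratios.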
According to a further aspect of the invention, a computing device is provided, comprising one or more processors, a memory, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, and the one or more programs include instructions for performing the network fault detection method according to the invention.
According to a further aspect of the invention, a computer-readable storage medium storing one or more programs is provided. The one or more programs include instructions that, when executed by a computing device, cause the computing device to perform the network fault detection method according to the invention.
According to a further aspect of the invention, a network fault detection method is provided, suitable for execution in a network fault detection system. The system includes one or more network servers, an aggregation server, a database server, and multiple data probes, and is connected to multiple Internet data centers. Each Internet data center is deployed with multiple switches, each switch has multiple hosts attached, and each host is provided in advance with a corresponding data probe. The network server stores the network probe list corresponding to the host associated with each data probe. The method comprises the following steps: first, a data probe obtains from the network server the network probe list corresponding to its associated host, performs network state probing at a preset first time interval according to the obtained list, and sends the probe results to the network server; the network server receives the probe results reported by each data probe, forms corresponding metric data from them, and sends the metric data to the corresponding aggregation server; the aggregation server receives the one or more pieces of metric data reported by the network server, detects network connections according to the metric data whose state is abnormal, sends an alarm if the detection result indicates a network connection fault, performs aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data, and sends the network quality data to the database server; and the database server receives and stores the network quality data sent by the aggregation server.
Optionally, in the network fault detection method according to the invention, the step in which the data probe obtains from the network server the network probe list corresponding to its associated host includes: sending a list update request to the network server at a preset second time interval to obtain the version of the network probe list corresponding to the associated host; and, if that version is newer than the version of the data probe's current network probe list, obtaining the network probe list corresponding to the associated host from the network server to replace the data probe's current network probe list.
Optionally, the network fault detection method according to the invention further includes: the data probe judges, according to a preset resource occupation rule, whether the system resources it consumes are overloaded; if the consumed system resources are overloaded, it stops probing; if they are not overloaded, it continues probing.
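A resource occupation rule of this kind might look like the short Python sketch below. The resource names and limit values are hypothetical; the patent does not say which resources are measured or what the limits are.

```python
def is_overloaded(usage, limits):
    """True if any measured resource exceeds its configured limit."""
    return any(usage[name] > limit for name, limit in limits.items())


def next_action(usage, limits):
    """Decide whether the probe keeps probing or stops."""
    return "stop" if is_overloaded(usage, limits) else "continue"


# Hypothetical rule: at most 5% CPU and 50 MB of memory for the probe.
EXAMPLE_LIMITS = {"cpu_percent": 5.0, "memory_mb": 50.0}
```

Keeping the rule as plain data makes it easy to tune limits per host without changing the probe itself.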
Optionally, in the network fault detection method according to the invention, the network server is provided with a configuration management database and a cache database. The configuration management database stores the latest network topology information, which includes the numbers and connection relationships of each host, each switch, and each Internet data center. The method further includes the network server generating in advance the network probe list corresponding to the host associated with each data probe. The step of generating these lists in advance includes: according to the network topology information, for each host, setting all other hosts under the switch to which the host belongs as target hosts to be probed; setting, under the other switches of the Internet data center to which the host belongs, the hosts with the same number as the host as target hosts to be probed; setting, under each switch of the other Internet data centers to which the host does not belong, the hosts with the same number as the host as target hosts to be probed; and generating the corresponding network probe list from the IP addresses of the host and each target host, and storing the network probe list in the cache database.
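The three target-selection rules can be sketched in Python as below. The topology layout (a nested dict mapping data center to switch to host number to IP address) is an assumption chosen for illustration, not the format stored in the configuration management database.

```python
def build_probe_lists(topology):
    """Map each host IP to the list of target IPs it should probe.

    topology: {idc: {switch: {host_number: ip}}} (hypothetical layout).
    """
    probe_lists = {}
    for idc, switches in topology.items():
        for switch, hosts in switches.items():
            for number, host_ip in hosts.items():
                # Rule 1: every other host under the same switch.
                targets = [ip for n, ip in hosts.items() if n != number]
                # Rules 2 and 3: the same-numbered host under every other
                # switch, in this data center and in all other data centers.
                for other_idc, other_switches in topology.items():
                    for other_switch, other_hosts in other_switches.items():
                        if other_idc == idc and other_switch == switch:
                            continue
                        if number in other_hosts:
                            targets.append(other_hosts[number])
                probe_lists[host_ip] = targets
    return probe_lists
```

Probing only the same-numbered host under each remote switch keeps the list short: a host probes roughly (hosts per switch) plus (number of switches) targets rather than every host in the network, while every switch pair is still covered.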
Optionally, the network fault detection method according to the invention further includes: the network server obtains the network topology information stored in the configuration management database at a preset third time interval; and, if the obtained network topology information has changed compared with the previous network topology information, generates new network probe lists according to the obtained network topology information and stores them in the cache database as an update.
Optionally, in the network fault detection method according to the invention, the step of forming corresponding metric data from each probe result and sending it to the corresponding aggregation server includes: performing validation on each probe result; and converting each probe result that passes validation into corresponding metric data, and sending the metric data to the corresponding aggregation server according to a preset metric sending rule.
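A minimal Python sketch of this validate, convert, and route step follows. The probe result fields, the normal/abnormal rule based on reachability, and the sending rule (a CRC32 hash of the source IP modulo the number of aggregation servers) are all assumptions for illustration; the patent only says that a preset rule exists.

```python
import zlib

REQUIRED_KEYS = ("src_ip", "dst_ip", "reachable", "rtt_ms")


def is_valid(result):
    """Basic validation of one probe result (hypothetical checks)."""
    if not all(key in result for key in REQUIRED_KEYS):
        return False
    if not isinstance(result["reachable"], bool):
        return False
    if result["reachable"]:
        rtt = result["rtt_ms"]
        return isinstance(rtt, (int, float)) and rtt >= 0
    return True


def to_metric(result):
    """Convert a validated probe result into a metric record."""
    return {
        "src_ip": result["src_ip"],
        "dst_ip": result["dst_ip"],
        "state": "normal" if result["reachable"] else "abnormal",
        "rtt_ms": result["rtt_ms"],
    }


def pick_aggregator(metric, aggregators):
    """Hypothetical sending rule: stable hash of the source IP."""
    digest = zlib.crc32(metric["src_ip"].encode("utf-8"))
    return aggregators[digest % len(aggregators)]


def process(results, aggregators):
    """Group valid metrics by the aggregation server that should get them."""
    batches = {name: [] for name in aggregators}
    for result in results:
        if not is_valid(result):
            continue  # drop results that fail validation
        metric = to_metric(result)
        batches[pick_aggregator(metric, aggregators)].append(metric)
    return batches
```

Hashing on the source IP sends all metrics from one host to the same aggregation server, which is one way to keep per-switch counts in a single place; other rules, such as routing by data center, would serve the same purpose.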
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: according to that metric data, detecting, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network fault when connecting to the other switch; and, for each switch whose host proportion exceeds a preset first ratio, determining that a network connection fault exists between the switches.
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: for each switch whose host proportion does not exceed the preset first ratio, detecting a first switch proportion, namely the proportion of switches under the Internet data center to which that switch belongs that have a network fault when connecting to each switch under other Internet data centers; and, for each Internet data center whose first switch proportion exceeds a preset second ratio, determining that a network connection fault exists between that Internet data center and switches.
Optionally, in the network fault detection method according to the invention, the step of detecting network connections according to the metric data whose state is abnormal includes: for each switch whose host proportion does not exceed the preset first ratio, detecting a second switch proportion, namely the proportion of switches for which the Internet data center to which that switch belongs has a network fault when connecting to switches under other Internet data centers; and, for each Internet data center whose second switch proportion exceeds a preset third ratio, determining that a network connection fault exists between Internet data centers.
Optionally, in the network fault detection method according to the invention, the step of performing aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data includes: generating quantile data from that metric data at a preset fourth time interval; aggregating the metric data by corresponding switch and computing the network quality indicators of each switch; aggregating the metric data by corresponding Internet data center and computing the network quality indicators of each Internet data center; and combining the generated quantile data with the network quality indicators of each switch and each Internet data center to form the corresponding network quality data.
According to a further aspect of the invention, a network fault detection system is also provided. The system includes one or more network servers, an aggregation server, a database server, and multiple data probes, and is connected to multiple Internet data centers. Each Internet data center is deployed with multiple switches, each switch has multiple hosts attached, and each host is provided in advance with a corresponding data probe. The network server stores the network probe list corresponding to the host associated with each data probe. In this system: the data probe is adapted to obtain from the network server the network probe list corresponding to its associated host, perform network state probing at a preset first time interval according to the obtained list, and send the probe results to the network server; the network server is adapted to receive the probe results reported by each data probe, form corresponding metric data from them, and send the metric data to the corresponding aggregation server; the aggregation server is adapted to receive the one or more pieces of metric data reported by the network server, detect network connections according to the metric data whose state is abnormal, send an alarm if the detection result indicates a network connection fault, perform aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data, and send the network quality data to the database server; and the database server is adapted to receive and store the network quality data sent by the aggregation server.
According to the technical solution for network fault detection of the invention, one or more pieces of metric data reported by the network server are first received, network connections are detected according to the metric data whose state is abnormal, an alarm is sent if the detection result indicates a network connection fault, and aggregation calculations are performed on the metric data whose state is normal to obtain corresponding network quality data. In this technical solution, active alarms are raised for abnormal network connection states, which greatly shortens fault response time, and aggregating the distribution of the metric data makes it possible to obtain, in real time, network quality data between any two switches and between any two Internet data centers.
Further, the metric data is formed by the network server from the probe results reported by each data probe, and the data probes probe according to the network probe lists issued by the network server. The network probe lists are generated by the network server from the network topology information, which includes the numbers and connection relationships of each host, each switch, and each Internet data center. The network probe lists generated from this topology information can cover all hosts, achieving full network coverage while reducing computation and system resource consumption, so that the network connection status between any two hosts can be detected in real time. Issuing network probe lists also allows the system to adapt to changes in network topology.
Brief description of the drawings
To achieve the foregoing and related ends, certain illustrative aspects are described herein in connection with the following description and the accompanying drawings. These aspects indicate various ways in which the principles disclosed herein can be practiced, and all aspects and their equivalents are intended to fall within the scope of the claimed subject matter. The above and other objects, features, and advantages of the disclosure will become more apparent from the following detailed description read in conjunction with the accompanying drawings. Throughout the disclosure, the same reference numerals generally refer to the same parts or elements.
Fig. 1 shows a schematic diagram of a network fault detection system 100 according to an embodiment of the invention;
Fig. 2 shows a structural block diagram of a computing device 200 according to an embodiment of the invention;
Fig. 3 shows a flow chart of a network fault detection method 300 according to an embodiment of the invention; and
Fig. 4 shows a flow chart of a network fault detection method 400 according to another embodiment of the invention.
Detailed description
Exemplary embodiments of the disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure will be understood more thoroughly and its scope can be fully conveyed to those skilled in the art.
Fig. 1 shows a schematic diagram of a network fault detection system 100 according to an embodiment of the invention. It should be noted that the network fault detection system 100 in Fig. 1 is only exemplary. In practice, the system may have different numbers of network servers, aggregation servers, database servers, and data probes, and the invention places no limit on the number of each included in the network fault detection system 100. As shown in Fig. 1, the network fault detection system 100 includes a network server 700, an aggregation server 800, a database server 900, and N data probes, namely data probe 1, data probe 2, ..., data probe N, where N is a positive integer.
The network fault detection system 100 is connected to multiple Internet data centers (IDC: Internet Data Center). Each Internet data center is deployed with multiple switches, each switch has multiple hosts attached, and each host is provided in advance with a corresponding data probe. The number of data probes therefore follows the number of hosts, so the number of hosts here is likewise N. In addition, the network server 700 stores the network probe list corresponding to the host associated with each data probe.
Specifically, each of data probes 1 to N obtains from the network server 700 the network probe list corresponding to its associated host, performs network state probing at a preset first time interval according to the obtained list, and sends the probe results to the network server 700. The network server 700 receives the probe results reported by each data probe, forms corresponding metric data from them, and sends the metric data to the corresponding aggregation server 800. The aggregation server 800 receives the one or more pieces of metric data reported by the network server 700, detects network connections according to the metric data whose state is abnormal, sends an alarm if the detection result indicates a network connection fault, performs aggregation calculations on the metric data whose state is normal to obtain corresponding network quality data, and sends the network quality data to the database server 900. The database server 900 receives and stores the network quality data sent by the aggregation server 800.
Fig. 2 shows a structural block diagram of a computing device 200 according to an embodiment of the invention. In a basic configuration 202, the computing device 200 typically includes a system memory 206 and one or more processors 204. A memory bus 208 can be used for communication between the processors 204 and the system memory 206.
Depending on the desired configuration, the processor 204 can be any type of processor, including but not limited to a microprocessor (µP), a microcontroller (µC), a digital signal processor (DSP), or any combination thereof. The processor 204 can include one or more levels of cache, such as a level-one cache 210 and a level-two cache 212, a processor core 214, and registers 216. An example processor core 214 can include an arithmetic logic unit (ALU), a floating-point unit (FPU), a digital signal processing core (DSP core), or any combination thereof. An example memory controller 218 can be used with the processor 204, or in some implementations the memory controller 218 can be an internal part of the processor 204.
Depending on the desired configuration, the system memory 206 can be any type of memory, including but not limited to volatile memory (such as RAM), non-volatile memory (such as ROM or flash memory), or any combination thereof. The system memory 206 can include an operating system 220, one or more programs 222, and program data 224. In some embodiments, the programs 222 can be arranged to be executed on the operating system by the one or more processors 204 using the program data 224.
The computing device 200 can also include an interface bus 240 that facilitates communication from various interface devices (for example, output devices 242, peripheral interfaces 244, and communication devices 246) to the basic configuration 202 via a bus/interface controller 230. Example output devices 242 include a graphics processing unit 248 and an audio processing unit 250, which can be configured to communicate with various external devices such as a display or speakers via one or more A/V ports 252. Example peripheral interfaces 244 can include a serial interface controller 254 and a parallel interface controller 256, which can be configured to communicate via one or more I/O ports 258 with external devices such as input devices (for example, a keyboard, mouse, pen, voice input device, or touch input device) or other peripherals (for example, a printer or scanner). An example communication device 246 can include a network controller 260, which can be arranged to facilitate communication with one or more other computing devices 262 over a network communication link via one or more communication ports 264.
A network communication link can be one example of a communication medium. Communication media can typically be embodied as computer-readable instructions, data structures, or program modules in a modulated data signal such as a carrier wave or other transport mechanism, and can include any information delivery media. A "modulated data signal" can be a signal in which one or more of its characteristics are set or changed in such a manner as to encode information in the signal. As a non-limiting example, communication media can include wired media such as a wired network or a dedicated-line network, and various wireless media such as acoustic, radio frequency (RF), microwave, infrared (IR), or other wireless media. The term computer-readable media as used herein can include both storage media and communication media.
The computing device 200 can be implemented as a server, such as a file server, database server, application server, or web server, or as part of a small-form-factor portable (or mobile) electronic device, such as a cell phone, personal digital assistant (PDA), personal media player, wireless web-browsing device, personal headset, application-specific device, or hybrid device that includes any of the above functions. The computing device 200 can also be implemented as a personal computer, including desktop and notebook configurations. In some embodiments, the computing device 200 can be implemented as a network server, aggregation server, and/or database server and is configured to perform the network fault detection method according to the invention, in which case the one or more programs 222 of the computing device 200 include instructions for performing that method.
Fig. 3 shows a flow chart of a network fault detection method 300 according to an embodiment of the invention. When the network server 700, the aggregation server 800, the database server 900, and data probes 1 to N are configured to perform the network fault detection method 300 according to the invention, they complete the processing of the method 300 jointly by exchanging data with one another. In this case, the one or more programs 222 of the computing devices 200 implemented respectively as the network server 700, the aggregation server 800, and the database server 900 include instructions for performing the network fault detection method 300 according to the invention.
As shown in Fig. 3, the method 300 starts at step S311. In step S311, a data probe obtains from the network server 700 the network probe list corresponding to its associated host. The network probe list is generated in advance by the network server 700 according to the network topology information. To ease the later description, how the network server 700 generates in advance the network probe list corresponding to the host associated with each data probe is explained here first.
Specifically, network server 700 is provided with a configuration management database and a cache database. The configuration management database stores the latest network topology information, which includes the numbers and connection relationships of each host, each switch and each Internet data center. According to one embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, for each host, sets all other hosts under the switch to which the host belongs as destination hosts to be probed, and generates the corresponding network detection list from the IP addresses of the host and of each destination host. According to another embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, for each host, sets as destination hosts to be probed the hosts that have the same number as the host under the other switches of the Internet data center to which the host belongs, and generates the corresponding network detection list from the IP addresses of the host and of each destination host. According to yet another embodiment of the present invention, when generating a network detection list, network server 700 obtains the network topology information from the configuration management database and, for each host, sets as destination hosts to be probed the hosts that have the same number as the host under each switch of the other Internet data centers, to which the host does not belong, and generates the corresponding network detection list from the IP addresses of the host and of each destination host.
The more destination hosts a network detection list covers, the better the network state can be detected. Therefore, according to yet another embodiment of the present invention, the network detection list corresponding to the host associated with each data probe may be pre-generated as follows. First, the network topology information is obtained from the configuration management database and, according to it, for each host, all other hosts under the switch to which the host belongs are set as destination hosts to be probed. Then the hosts that have the same number as the host under the other switches of the Internet data center to which the host belongs are set as destination hosts to be probed, and the hosts that have the same number as the host under each switch of the other Internet data centers, to which the host does not belong, are likewise set as destination hosts to be probed. Finally, the corresponding network detection list is generated from the IP addresses of the host and of each destination host, and the network detection list is stored in the cache database.
In this embodiment, the network topology information is as follows. Network fault detection system 100 is connected to 2 Internet data centers, denoted IDC1 and IDC2, each of which deploys 2 switches. The switches deployed in IDC1 are denoted SW1 and SW2, and the switches deployed in IDC2 are denoted SW3 and SW4. Each of switches SW1, SW2, SW3 and SW4 connects 4 hosts. The hosts connected to SW1 are denoted H1, H2, H3 and H4 and are numbered 1, 2, 3 and 4 in turn; the hosts connected to SW2 are denoted H5, H6, H7 and H8 and are numbered 1, 2, 3 and 4 in turn; the hosts connected to SW3 are denoted H9, H10, H11 and H12 and are numbered 1, 2, 3 and 4 in turn; and the hosts connected to SW4 are denoted H13, H14, H15 and H16 and are numbered 1, 2, 3 and 4 in turn. There are therefore 16 hosts in total, so the number N of data probes is also 16, and data probes 1 to 16 are deployed on hosts H1 to H16 respectively. Table 1 shows a storage example of the network topology information according to one embodiment of the present invention, as follows:
Table 1
The following takes host H1, with which data probe 1 is associated, as an example to describe how the corresponding network detection list is pre-generated. For host H1, the other hosts H2, H3 and H4 under switch SW1, to which the host belongs, are all set as destination hosts to be probed; host H5, which has the same number as H1 under the other switch SW2 of Internet data center IDC1, to which the host belongs, is set as a destination host to be probed; and hosts H9 and H13, which have the same number as H1 under switches SW3 and SW4 of the other Internet data center IDC2, to which the host does not belong, are set as destination hosts to be probed. The corresponding network detection list L1 is generated from the IP addresses of host H1 and destination hosts H5, H9 and H13, and network detection list L1 is stored in the cache database. In addition, for the feasibility and convenience of probing, network detection list L1 also includes the IP addresses of the switches and Internet data centers to which host H1 and destination hosts H5, H9 and H13 are connected. Unless otherwise specified, data probe 1 and its associated host H1 are used as the example below to further illustrate the technical solution.
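The target selection described above for host H1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the dictionary `TOPOLOGY` and the function `build_targets` are hypothetical names, and the topology is the 2-data-center, 4-switch, 16-host example. The two "same number" rules (other switches of the same data center, switches of other data centers) both reduce to "a host with the same number under a different switch", which is how the sketch expresses them.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding of the example topology:
# host -> (Internet data center, switch, number under that switch).
TOPOLOGY: Dict[str, Tuple[str, str, int]] = {}
_SWITCHES = [("IDC1", "SW1"), ("IDC1", "SW2"), ("IDC2", "SW3"), ("IDC2", "SW4")]
for i, (idc, sw) in enumerate(_SWITCHES):
    for number in range(1, 5):
        TOPOLOGY[f"H{i * 4 + number}"] = (idc, sw, number)


def build_targets(host: str, topology: Dict[str, Tuple[str, str, int]]) -> List[str]:
    """Return the destination hosts that `host` should probe."""
    _, switch, number = topology[host]
    targets = []
    for other, (_, other_switch, other_number) in topology.items():
        if other == host:
            continue
        if other_switch == switch:
            # Same switch: every other host is a destination host.
            targets.append(other)
        elif other_number == number:
            # Different switch (same or other data center): only the
            # host with the same number is a destination host.
            targets.append(other)
    return targets


print(build_targets("H1", TOPOLOGY))
```

With this rule each host probes only a handful of peers instead of all 15 others, while every switch and data center is still reached from every switch.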
Because the network detection list changes as the network topology information changes, according to yet another embodiment of the present invention, network server 700 obtains the network topology information stored in the configuration management database at a preset third time interval. If the obtained network topology information has changed compared with the previous network topology information, network server 700 generates a new network detection list from the obtained network topology information and stores it in the cache database as an update. In this embodiment, the network topology information stored in the configuration management database can be updated in real time, and the third time interval is preset to 30 minutes.
As for the process in step S311 by which a data probe obtains from network server 700 the network detection list corresponding to its associated host, according to one embodiment of the present invention it can be implemented as follows. First, the data probe sends a list update request to network server 700 at a preset second time interval, in order to obtain the version of the network detection list corresponding to its associated host. Network server 700 then responds to the list update request by sending the data probe the version of the network detection list corresponding to its associated host, so that the data probe can decide from that version whether to update its network detection list. The data probe receives the version of the latest network detection list returned by network server 700 and compares it with the version of its current network detection list. If the returned version is newer than the version of the data probe's current network detection list, the data probe sends update confirmation information to network server 700. Network server 700 receives the update confirmation information fed back by the data probe and sends the data probe the network detection list corresponding to its associated host. The data probe thus obtains the network detection list from network server 700 and uses it to replace its current network detection list.
In this embodiment, the second time interval is preset to 10 minutes: every 10 minutes, data probe 1 sends a list update request to network server 700 to obtain the version of network detection list L1 corresponding to its associated host H1. Network server 700 responds to the list update request from data probe 1 by sending it the version of network detection list L1 corresponding to host H1, which is 3.1.5. Data probe 1 receives the version of the latest network detection list L1 returned by network server 700 and compares it with the version of its current network detection list. Because the version of the current network detection list is 3.1.4 and version 3.1.5 is newer than version 3.1.4, data probe 1 determines that the network detection list needs to be updated and sends update confirmation information to network server 700. Network server 700 receives the update confirmation information fed back by data probe 1 and sends the data probe the latest network detection list L1 corresponding to host H1. Finally, data probe 1 replaces its current network detection list with network detection list L1.
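The version check performed by the data probe can be sketched as below. The text does not say how versions are compared, so this sketch assumes dotted numeric versions compared component by component; `parse_version` and `needs_update` are hypothetical helper names.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted version such as '3.1.5' into (3, 1, 5)."""
    return tuple(int(part) for part in version.split("."))


def needs_update(current: str, latest: str) -> bool:
    """True if the version returned by the server is newer than the local one."""
    return parse_version(latest) > parse_version(current)


# Values from the example: the probe holds 3.1.4, the server reports 3.1.5.
print(needs_update("3.1.4", "3.1.5"))
```

Comparing integer tuples rather than raw strings avoids ranking 3.1.10 below 3.1.9.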
Then step S312 is performed: the data probe performs network state detection at a preset first time interval according to the obtained network detection list. According to one embodiment of the present invention, the first time interval is preset to 120 seconds. According to the obtained network detection list L1, data probe 1 probes the network state between host H1 and destination hosts H5, H9 and H13 every 120 seconds, typically using the PING (Packet Internet Groper) command to check whether the network is connected. The protocol used for probing is not restricted; common choices include HTTP, TCP, UDP and ICMP. Different detection protocols serve different detection needs: if the emphasis is on service-quality metrics, HTTP or TCP can be used; if the emphasis is on metrics of the network itself, ICMP can be used. Detection protocols can of course be combined arbitrarily, for example by using HTTP and ICMP at the same time. The detection protocol is generally specified by network server 700 in the network detection list, so that the data probe can read the corresponding detection protocol from the network detection list once it has obtained the list.
Next, in step S313, the data probe sends the detection results obtained from network state detection to network server 700. According to one embodiment of the present invention, each time data probe 1 finishes probing one destination host, it generates one corresponding detection result. A detection result generally includes the following fields: operating system, detection start time, detection end time, detection protocol, destination host, the switch to which the destination host belongs and/or the Internet data center to which the destination host belongs. Because transmitting each detection result individually would place an unnecessary burden on the system, the detection results of a round can be transmitted in batches; for example, every 128 detection results are compressed and packed before being sent to network server 700.
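Batch transmission of detection results can be sketched as follows. The serialization (JSON) and compression (zlib) are assumptions chosen for illustration, since the text only says the results are compressed and packed; `pack_batches` and `unpack_batch` are hypothetical names. Flushing a final partial batch is also an assumption.

```python
import json
import zlib
from typing import Dict, Iterable, Iterator, List

BATCH_SIZE = 128  # batch size given in the example


def pack_batches(results: Iterable[Dict], batch_size: int = BATCH_SIZE) -> Iterator[bytes]:
    """Group detection results into batches and compress each batch."""
    batch: List[Dict] = []
    for result in results:
        batch.append(result)
        if len(batch) == batch_size:
            yield zlib.compress(json.dumps(batch).encode("utf-8"))
            batch = []
    if batch:
        # Assumption: leftover results are sent as a smaller final batch.
        yield zlib.compress(json.dumps(batch).encode("utf-8"))


def unpack_batch(payload: bytes) -> List[Dict]:
    """Inverse of the packing step, as the receiving server might apply it."""
    return json.loads(zlib.decompress(payload).decode("utf-8"))


results = [{"target": f"H{i}", "protocol": "ICMP"} for i in range(300)]
batches = list(pack_batches(results))
print(len(batches), [len(unpack_batch(b)) for b in batches])
```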
Further, to control the use of server resources and avoid interfering with the normal execution of online services, each data probe needs to check frequently how much system resource it is consuming and act on the result. According to one embodiment of the present invention, the data probe judges, according to a preset resource occupation rule, whether the system resources it consumes are overloaded. If the consumed system resources are overloaded, detection is stopped; if they are not overloaded, detection continues. In this embodiment, the resource occupation rule may be preset as CPU usage below 99% and memory usage below 95%. For data probe 1, its CPU usage on host H1 is 75% and its memory usage is 50%, so the system resources consumed by data probe 1 are not overloaded and detection can continue.
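The resource occupation rule amounts to a simple threshold check, sketched below. The function name `is_overloaded` is hypothetical, and how the usage figures are measured is left out; the limits are the 99% and 95% values from the example.

```python
CPU_LIMIT = 99.0     # percent, from the example rule
MEMORY_LIMIT = 95.0  # percent, from the example rule


def is_overloaded(cpu_percent: float, memory_percent: float,
                  cpu_limit: float = CPU_LIMIT,
                  memory_limit: float = MEMORY_LIMIT) -> bool:
    """True if the probe violates the rule 'CPU below 99% and memory below 95%'."""
    return cpu_percent >= cpu_limit or memory_percent >= memory_limit


# Data probe 1 in the example: 75% CPU, 50% memory, so probing continues.
print("stop probing" if is_overloaded(75.0, 50.0) else "continue probing")
```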
After receiving the detection results reported by the data probes, network server 700 performs step S321 and converts each detection result into corresponding metric data. According to one embodiment of the present invention, when forming metric data, each detection result is first validated, and the detection results that pass validation are then converted into corresponding metric data. The metric data include current host, destination host, Metric value, network interaction time and/or timestamp. The current host is the host that executes the probe command, the destination host is the host being probed, and the Metric value may be a field that measures the optimal path, such as hop count. Validation usually first judges whether the value of each field in a detection result is within a reasonable range; if the values of all fields in the detection result are within a reasonable range, the detection result is determined to have passed validation.
After forming the metric data, network server 700 proceeds to step S322 and sends the metric data generated in step S321 to the corresponding aggregation server according to a preset metric transmission rule. There may be one or more aggregation servers. Although Fig. 1 shows only aggregation server 800, in practice multiple aggregation servers may be provided to speed up the subsequent diagnosis of network faults. When there is only one aggregation server, for example only aggregation server 800, network server 700 sends the metric data directly to aggregation server 800. When there are multiple aggregation servers, according to one embodiment of the present invention, the metric transmission rule is preset to select the aggregation server that receives the metric data by means of a hash algorithm. The addresses of the aggregation servers are first sorted to obtain their order; a hash value is computed from the Metric value in each item of metric data; the hash value is taken modulo to obtain an index value; and the item of metric data is sent to the aggregation server whose position in the order equals that index value. Hash algorithms are existing mature technology and are not described further here. It should be noted that the metric transmission rule may be preset using whatever algorithm suits the actual situation; such variations are readily apparent to those skilled in the art who understand the solution of the present invention, fall within the protection scope of the present invention, and are not described further here.
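The hash-and-modulo selection of an aggregation server can be sketched as follows. The text does not name a hash function or a modulus; this sketch assumes SHA-256 and takes the modulus to be the number of aggregation servers. The function name and the server addresses are hypothetical.

```python
import hashlib
from typing import List


def pick_aggregation_server(metric_value: str, servers: List[str]) -> str:
    """Choose the aggregation server for one item of metric data."""
    ordered = sorted(servers)  # sort addresses to get a stable order
    digest = hashlib.sha256(metric_value.encode("utf-8")).hexdigest()
    index = int(digest, 16) % len(ordered)  # hash value modulo server count
    return ordered[index]


# Hypothetical addresses, listed in arbitrary order.
servers = ["10.0.0.12", "10.0.0.10", "10.0.0.11"]
print(pick_aggregation_server("hop_count:H1->H5", servers))
```

Because the addresses are sorted before indexing, every network server that knows the same set of aggregation servers routes the same Metric value to the same aggregation server.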
Aggregation server 800 receives one or more items of metric data reported by network server 700; the metric data are formed by network server 700 from the detection results reported by the data probes. Aggregation server 800 then first filters the metric data: the metric data whose network interaction time exceeds a preset threshold or indicates a network connection failure are filtered out and marked as abnormal-state metric data, and the remaining metric data are marked as normal-state metric data. According to one embodiment of the present invention, a network interaction time that exceeds the preset threshold means that the PING value obtained by executing the PING command exceeds the threshold, and a network interaction time that indicates a network connection failure means that the PING did not get through.
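The filtering step can be sketched as below. The record layout is an assumption: each item of metric data is a dictionary whose hypothetical `rtt_ms` field holds the PING time in milliseconds, with `None` standing for a PING that did not get through. The threshold value used in the example call is likewise illustrative.

```python
from typing import Dict, List, Tuple


def split_metrics(metrics: List[Dict], threshold_ms: float) -> Tuple[List[Dict], List[Dict]]:
    """Split metric data into (abnormal, normal) items."""
    abnormal: List[Dict] = []
    normal: List[Dict] = []
    for item in metrics:
        rtt = item["rtt_ms"]
        if rtt is None or rtt > threshold_ms:
            abnormal.append(item)  # failed PING or interaction time too long
        else:
            normal.append(item)
    return abnormal, normal


metrics = [
    {"src": "H1", "dst": "H5", "rtt_ms": 0.8},
    {"src": "H1", "dst": "H9", "rtt_ms": 250.0},
    {"src": "H1", "dst": "H13", "rtt_ms": None},
]
abnormal, normal = split_metrics(metrics, threshold_ms=100.0)
print(len(abnormal), len(normal))
```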
Aggregation server 800 then performs step S331 and detects network connectivity according to the abnormal-state metric data. According to one embodiment of the present invention, when detecting network connectivity, for every pair of switches under each Internet data center, the proportion of hosts under one switch that have a network failure when connecting to the other switch is determined from the abnormal-state metric data. Each switch whose host-count proportion exceeds a preset first ratio is determined to have a network connection failure between switches. The first ratio is preset to 0.5. In this embodiment, for Internet data center IDC1, the abnormal-state metric data indicate that hosts H1, H2 and H3 under switch SW1 have network failures when connecting to switch SW2. SW1 has 4 hosts in total, so the proportion of hosts under SW1 with a network failure when connecting to SW2 is 3/4 = 0.75, which exceeds the first ratio, and SW1 is determined to have a network connection failure between switches. For Internet data center IDC2, the abnormal-state metric data indicate that host H10 under switch SW3 has a network failure when connecting to switch SW4. SW3 has 4 hosts in total, so the proportion of hosts under SW3 with a network failure when connecting to SW4 is 1/4 = 0.25, which does not exceed the first ratio, and SW3 is determined not to have a network connection failure between switches.
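The host-count proportion check can be sketched as follows, using the figures from the example. The data layout is an assumption: abnormal-state metric data are reduced to hypothetical (current host, destination host) pairs, and `HOSTS_BY_SWITCH` restates the example topology.

```python
from typing import Dict, Iterable, List, Tuple

FIRST_RATIO = 0.5  # preset first ratio from the example

HOSTS_BY_SWITCH: Dict[str, List[str]] = {
    "SW1": ["H1", "H2", "H3", "H4"],
    "SW2": ["H5", "H6", "H7", "H8"],
    "SW3": ["H9", "H10", "H11", "H12"],
    "SW4": ["H13", "H14", "H15", "H16"],
}


def failing_host_ratio(abnormal_pairs: Iterable[Tuple[str, str]],
                       hosts_by_switch: Dict[str, List[str]],
                       src_switch: str, dst_switch: str) -> float:
    """Share of hosts under src_switch that failed to reach dst_switch."""
    src_hosts = set(hosts_by_switch[src_switch])
    dst_hosts = set(hosts_by_switch[dst_switch])
    failing = {src for src, dst in abnormal_pairs
               if src in src_hosts and dst in dst_hosts}
    return len(failing) / len(src_hosts)


# Abnormal pairs consistent with the example: H1, H2, H3 fail towards SW2,
# and H10 fails towards SW4.
abnormal_pairs = [("H1", "H5"), ("H2", "H6"), ("H3", "H7"), ("H10", "H14")]
for src, dst in [("SW1", "SW2"), ("SW3", "SW4")]:
    ratio = failing_host_ratio(abnormal_pairs, HOSTS_BY_SWITCH, src, dst)
    print(src, dst, ratio, ratio > FIRST_RATIO)
```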
For each switch whose host-count proportion does not exceed the preset first ratio, according to one embodiment of the present invention, a first switch-count proportion is determined for the Internet data center to which the switch belongs, namely the proportion of switches under that Internet data center that have a network failure when connecting to a switch under another Internet data center. Each Internet data center whose first switch-count proportion exceeds a preset second ratio is determined to have a network connection failure between an Internet data center and a switch. The second ratio is preset to 0.5. In this embodiment, the host-count proportion of switch SW3 does not exceed the preset first ratio. For SW3, the abnormal-state metric data indicate that, under its Internet data center IDC2, switches SW3 and SW4 both have network failures when connecting to switch SW2 under Internet data center IDC1. IDC1 has 2 switches in total, so the first switch-count proportion of switches under IDC2 with a network failure when connecting to switch SW2 under IDC1 is 2/2 = 100%, which exceeds the second ratio, and IDC2 is determined to have a network connection failure between an Internet data center and a switch.
For each switch whose host-count proportion does not exceed the preset first ratio, according to yet another embodiment of the present invention, a second switch-count proportion is determined for the Internet data center to which the switch belongs, namely the proportion of switches under another Internet data center to which that Internet data center has a network failure when connecting. Each Internet data center whose second switch-count proportion exceeds a preset third ratio is determined to have a network connection failure between Internet data centers. The third ratio is preset to 0.5. In this embodiment, the host-count proportion of switch SW3 does not exceed the preset first ratio. For SW3, the abnormal-state metric data indicate that its Internet data center IDC2 has network failures when connecting to switches SW1 and SW2 under Internet data center IDC1. IDC1 has 2 switches in total, so the second switch-count proportion of IDC2 with a network failure when connecting to switches under IDC1 is 2/2 = 100%, which exceeds the preset third ratio, and IDC2 is determined to have a network connection failure between Internet data centers.
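The check against the third ratio can be sketched in the same style. The input format is an assumption: abnormal-state metric data are taken to be already reduced to hypothetical (source data center, unreachable switch) pairs, and the denominator is the number of switches under the destination data center, as in the 2/2 example.

```python
from typing import Dict, Iterable, List, Tuple

THIRD_RATIO = 0.5  # preset third ratio from the example

SWITCHES_BY_IDC: Dict[str, List[str]] = {
    "IDC1": ["SW1", "SW2"],
    "IDC2": ["SW3", "SW4"],
}


def unreachable_switch_ratio(failed_links: Iterable[Tuple[str, str]],
                             switches_by_idc: Dict[str, List[str]],
                             src_idc: str, dst_idc: str) -> float:
    """Share of switches under dst_idc that src_idc failed to reach."""
    dst_switches = set(switches_by_idc[dst_idc])
    unreachable = {switch for idc, switch in failed_links
                   if idc == src_idc and switch in dst_switches}
    return len(unreachable) / len(dst_switches)


# Example: IDC2 fails to reach both SW1 and SW2 under IDC1.
failed_links = [("IDC2", "SW1"), ("IDC2", "SW2")]
ratio = unreachable_switch_ratio(failed_links, SWITCHES_BY_IDC, "IDC2", "IDC1")
print(ratio, ratio > THIRD_RATIO)
```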
After aggregation server 800 completes network connectivity detection, it performs step S332: if the detection result indicates that a network connection failure has occurred, an alarm is issued. According to one embodiment of the present invention, SW1 has a network connection failure between switches, and IDC2 has a network connection failure between an Internet data center and a switch as well as a network connection failure between Internet data centers. A network fault is thereby confirmed, and an alarm is sent to the relevant staff and systems.
In step S333, aggregation server 800 aggregates the normal-state metric data to obtain corresponding network quality data. According to one embodiment of the present invention, when generating network quality data, quantile data are first generated from the normal-state metric data at a preset fourth time interval. The normal-state metric data are aggregated by their corresponding switches to compute the network quality metrics of each switch, and aggregated by their corresponding Internet data centers to compute the network quality metrics of each Internet data center. The generated quantile data and the network quality metrics of each switch and each Internet data center are then combined to form the corresponding network quality data. In this embodiment, the fourth time interval may be preset to 5 minutes and 10 minutes. The normal-state metric data are first classified by source, that is, by current host, using the hash algorithm and the Metric value in the metric data. Then, for each current host acting as a source, the item of metric data ranked at the 50% position is aggregated every 5 minutes as the 50th percentile, and the item of metric data ranked at the 99% position is aggregated every 10 minutes as the 99th percentile; the 50th and 99th percentiles serve as the quantile data. The normal-state metric data corresponding to switches SW1, SW2, SW3 and SW4 are aggregated separately, and the network quality metrics of each switch are generated from the resulting statistics. The normal-state metric data corresponding to Internet data centers IDC1 and IDC2 are aggregated separately, and the network quality metrics of each Internet data center are generated from the resulting statistics. Finally, the quantile data and the network quality metrics of switches SW1, SW2, SW3 and SW4 and of Internet data centers IDC1 and IDC2 are combined to form the corresponding network quality data.
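Picking "the item ranked at the 50% position" and "the item ranked at the 99% position" can be sketched with a nearest-rank percentile. The text does not state which percentile definition is used, so nearest-rank is an assumption here, and `percentile` is a hypothetical helper operating on the values collected for one source host within one time window.

```python
from typing import List


def percentile(values: List[float], pct: int) -> float:
    """Nearest-rank percentile: the value at position ceil(pct% * n), 1-based."""
    if not values:
        raise ValueError("no metric data in this window")
    ordered = sorted(values)
    rank = max(1, (pct * len(ordered) + 99) // 100)  # integer ceiling
    return ordered[rank - 1]


# Hypothetical PING times (ms) from one source host within one window.
window = [float(v) for v in range(1, 101)]
print(percentile(window, 50), percentile(window, 99))
```

The 50th percentile describes typical latency, while the 99th percentile exposes the slow tail that an average would hide.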
After obtaining the network quality data, aggregation server 800 performs step S334 and sends the network quality data to database server 900. According to one embodiment of the present invention, aggregation server 800 sends the network quality data to database server 900 for storage.
After database server 900 receives the network quality data sent by aggregation server 800, it proceeds to step S341 and stores the network quality data sent by aggregation server 800. According to one embodiment of the present invention, to speed up queries, the network quality data generated by aggregation are first written to the in-memory cache of database server 900. The cache is kept for a certain period, for example one day, after which the corresponding network quality data are moved to persistent storage, for example OpenTSDB (Open Time Series Database, an open-source time series database).
Fig. 4 shows a flowchart of a network fault detection method 400 according to yet another embodiment of the present invention. When computing device 200 is implemented as aggregation server 800, the one or more programs 222 of computing device 200 include instructions for performing network fault detection method 400 according to the present invention. According to one embodiment of the present invention, aggregation server 800 is communicatively connected to one or more network servers and to database server 900. Network server 700 is connected to multiple Internet data centers, each Internet data center deploys multiple switches, each switch connects multiple hosts, and each host is provided in advance with a corresponding data probe that is communicatively connected to network server 700.
As shown in Fig. 4, method 400 starts at step S410. In step S410, one or more items of metric data reported by network server 700 are received; the metric data are formed by network server 700 from the detection results reported by the data probes. According to one embodiment of the present invention, the metric data include current host, destination host, Metric value, network interaction time and/or timestamp. The current host is the host that executes the probe command, the destination host is the host being probed, and the Metric value may be a field that measures the optimal path, such as hop count. After receiving the metric data, aggregation server 800 first filters them: the metric data whose network interaction time exceeds a preset threshold or indicates a network connection failure are filtered out and marked as abnormal-state metric data, and the remaining metric data are marked as normal-state metric data. For the specific processing of step S410 and of the filtering of metric data, refer to the description in method 300 of how aggregation server 800 receives and filters metric data; it is not repeated here.
Then step S420 is performed: network connectivity is detected according to the abnormal-state metric data. According to one embodiment of the present invention, when detecting network connectivity, for every pair of switches under each Internet data center, the proportion of hosts under one switch that have a network failure when connecting to the other switch is determined from the abnormal-state metric data, and each switch whose host-count proportion exceeds a preset first ratio is determined to have a network connection failure between switches. For each switch whose host-count proportion does not exceed the preset first ratio, according to one embodiment of the present invention, a first switch-count proportion is determined, namely the proportion of switches under the Internet data center to which the switch belongs that have a network failure when connecting to a switch under another Internet data center, and each Internet data center whose first switch-count proportion exceeds a preset second ratio is determined to have a network connection failure between an Internet data center and a switch. For each switch whose host-count proportion does not exceed the preset first ratio, according to yet another embodiment of the present invention, a second switch-count proportion is determined, namely the proportion of switches under another Internet data center to which the switch's Internet data center has a network failure when connecting, and each Internet data center whose second switch-count proportion exceeds a preset third ratio is determined to have a network connection failure between Internet data centers. The first ratio, the second ratio and the third ratio are each preset to 0.5. For the specific processing in step S420, refer to the description of step S331 in method 300; it is not repeated here.
In step S430, if the detection result indicates that a network connection failure has occurred, an alarm is issued. For the specific processing in step S430, refer to the description of step S332 in method 300; it is not repeated here.
Next, in step S440, the normal-state metric data are aggregated to obtain corresponding network quality data. According to one embodiment of the present invention, when generating network quality data, quantile data are first generated from the normal-state metric data at a preset fourth time interval. The normal-state metric data are aggregated by their corresponding switches to compute the network quality metrics of each switch, and aggregated by their corresponding Internet data centers to compute the network quality metrics of each Internet data center. The generated quantile data and the network quality metrics of each switch and each Internet data center are then combined to form the corresponding network quality data. For the specific processing in step S440, refer to the description of step S333 in method 300; it is not repeated here.
Finally, in step S450, the network quality data are sent to database server 900 for storage. For the specific processing in step S450, refer to the description of step S341 in method 300; it is not repeated here.
Existing network fault detection schemes mostly fall into two classes. In the first, operations engineers locate the fault manually from experience after a problem has already occurred, which is time-consuming and laborious, lacks supporting data and has low accuracy. In the second, probes are deployed on all online servers to probe network conditions actively, but network topology coverage is incomplete and the detection data are difficult to exploit fully. In the technical solution for network fault detection according to embodiments of the present invention, one or more items of metric data reported by the network server are first received; network connectivity is detected according to the abnormal-state metric data; if the detection result indicates that a network connection failure has occurred, an alarm is issued; and the normal-state metric data are aggregated to obtain corresponding network quality data. In this technical solution, abnormal network connection states trigger alarms proactively, which greatly shortens the fault response time, and distributed aggregation of the metric data makes it possible to obtain, in real time, network quality data between any two switches and between any two Internet data centers. Further, the metric data are formed by the network server from the detection results reported by the data probes, and the data probes probe according to the network detection lists issued by the network server. The network detection lists are generated by the network server from network topology information, which includes the numbers and connection relationships of each host, each switch and each Internet data center. Network detection lists generated from this network topology information can cover all hosts, achieving full network coverage while reducing the amount of computation and the consumption of system resources. The network connection status between any two hosts can therefore be detected in real time, and issuing network detection lists allows the system to adapt to changes in network topology.
B10. The method of B9, wherein the step of obtaining, from the network server, the network probe list corresponding to the host associated with the data probe comprises:
At a preset second time interval, sending a list update request to the network server to obtain the version of the network probe list corresponding to the associated host;
If that version is newer than the version of the data probe's current network probe list, obtaining the network probe list corresponding to the associated host from the network server to replace the data probe's current network probe list.
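The version check in B10 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `ProbeList` and `DataProbe` names, the integer version numbers, and the two callables that stand in for requests to the network server are all hypothetical, and the timer for the second time interval is left out.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ProbeList:
    version: int        # assumption: versions are comparable integers
    targets: List[str]  # IP addresses of the target hosts to probe


class DataProbe:
    """Hypothetical probe that keeps a local copy of its probe list."""

    def __init__(self, host_ip: str, current: Optional[ProbeList] = None):
        self.host_ip = host_ip
        self.current = current

    def refresh(self,
                fetch_version: Callable[[str], int],
                fetch_list: Callable[[str], ProbeList]) -> bool:
        """Replace the local list if the server holds a newer version.

        Returns True when the local list was replaced.
        """
        latest = fetch_version(self.host_ip)
        if self.current is not None and latest <= self.current.version:
            return False  # local list is already up to date
        self.current = fetch_list(self.host_ip)
        return True
```

Asking for the version first keeps the periodic request small; the full list is transferred only when it has actually changed.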
B11. The method of B9 or B10, further comprising:
The data probe determines, according to a preset resource occupation rule, whether the system resources it consumes are overloaded;
If the consumed system resources are overloaded, probing stops;
If the consumed system resources are not overloaded, probing continues.
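A resource occupation rule of this kind might look like the sketch below. The text does not say which resources the rule covers or what the limits are, so the CPU and memory fields, their default values, and the function name are assumptions made only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResourceRule:
    # Hypothetical limits; the patent does not specify them.
    max_cpu_percent: float = 5.0
    max_memory_mb: float = 100.0


def should_continue_probing(cpu_percent: float,
                            memory_mb: float,
                            rule: ResourceRule = ResourceRule()) -> bool:
    """Return False when the probe's own resource usage is overloaded."""
    overloaded = (cpu_percent > rule.max_cpu_percent
                  or memory_mb > rule.max_memory_mb)
    return not overloaded
```

The probe would evaluate this before each probing round and skip the round when the function returns False.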
B12. The method of any one of B9-B11, wherein the network server is provided with a configuration management database and a cache database, the configuration management database stores the latest network topology information, and the network topology information includes the numbers and connection relationships of each host, each switch, and each Internet data center; the method further comprises the network server generating in advance the network probe list corresponding to the host associated with each data probe, and the step of generating in advance the network probe list corresponding to the host associated with each data probe comprises:
According to the network topology information, for each host, setting all other hosts under the switch to which the host belongs as target hosts to be probed;
Setting, as target hosts to be probed, the hosts that have the same number as the host under the other switches of the Internet data center to which the host belongs;
Setting, as target hosts to be probed, the hosts that have the same number as the host under each switch of the other Internet data centers, to which the host does not belong;
Generating the corresponding network probe list from the IP addresses of the host and of each target host, and storing the network probe list in the cache database.
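The three target-selection rules in B12 can be sketched as one pass over the topology. The nested dictionary layout (data center, then switch, then host number mapped to IP address) and the function name are assumptions chosen for the example; the cache database is replaced by a returned dictionary.

```python
from typing import Dict, List

# Hypothetical topology shape: data center -> switch -> host number -> host IP.
Topology = Dict[str, Dict[str, Dict[int, str]]]


def build_probe_lists(topology: Topology) -> Dict[str, List[str]]:
    """Map each host IP to the IPs of the target hosts it should probe."""
    lists: Dict[str, List[str]] = {}
    for idc, switches in topology.items():
        for switch, hosts in switches.items():
            for number, ip in hosts.items():
                # Rule 1: every other host under the same switch.
                targets = [other for n, other in hosts.items() if n != number]
                # Rules 2 and 3: the same-numbered host under every other
                # switch, in the same data center and in the other ones.
                for other_idc, other_switches in topology.items():
                    for other_switch, other_hosts in other_switches.items():
                        if other_idc == idc and other_switch == switch:
                            continue
                        if number in other_hosts:
                            targets.append(other_hosts[number])
                lists[ip] = targets
    return lists
```

With this selection each host probes roughly (hosts per switch) + (number of other switches) targets rather than every other host, while every pair of switches is still exercised by the same-numbered hosts.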
B13. The method of B12, further comprising:
The network server obtains, at a preset third time interval, the network topology information stored in the configuration management database;
If the obtained network topology information has changed relative to the previous network topology information, generating new network probe lists from the obtained network topology information and storing them in the cache database as an update.
B14. The method of any one of B9-B13, wherein the step of forming each probe result into corresponding metric data and sending it to the corresponding aggregation server comprises:
Performing verification on each probe result;
Converting the probe results that pass verification into corresponding metric data, and sending the metric data to the corresponding aggregation server according to a preset metric sending rule.
B15. The method of any one of B9-B14, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
According to the metric data whose state is abnormal, determining, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network failure in connecting to the other switch;
For each switch whose host proportion exceeds a preset first ratio, determining that a network connection fault exists between the switches.
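The switch-level check in B15 can be sketched as a ratio computed per ordered pair of switches. The record shape `(source host, source switch, destination switch)`, the `hosts_per_switch` mapping, and the function name are assumptions; the default threshold of 0.5 is the preset value given in claim 6. Only the inter-switch step is shown, not the data-center-level steps that follow it.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical abnormal record: (source host, source switch, destination switch).
AbnormalRecord = Tuple[str, str, str]


def detect_switch_faults(abnormal: Iterable[AbnormalRecord],
                         hosts_per_switch: Dict[str, int],
                         first_ratio: float = 0.5) -> List[Tuple[str, str]]:
    """Return the (source switch, destination switch) pairs judged faulty."""
    failing: Dict[Tuple[str, str], Set[str]] = defaultdict(set)
    for host, src_switch, dst_switch in abnormal:
        if src_switch != dst_switch:
            failing[(src_switch, dst_switch)].add(host)

    faults = []
    for (src_switch, dst_switch), hosts in failing.items():
        total = hosts_per_switch.get(src_switch, 0)
        # A fault is reported only when the share of failing hosts
        # strictly exceeds the preset first ratio.
        if total and len(hosts) / total > first_ratio:
            faults.append((src_switch, dst_switch))
    return sorted(faults)
```

Counting distinct hosts, rather than failed probes, keeps one noisy host from pushing a switch pair over the threshold on its own.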
B16. The method of B15, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
For each switch whose host proportion does not exceed the preset first ratio, determining, under the Internet data center to which the switch belongs, a first switch proportion, namely the proportion of switches that have a network failure in connecting to each switch under the other Internet data centers;
For each Internet data center whose first switch proportion exceeds a preset second ratio, determining that a network connection fault exists between the Internet data center and a switch.
B17. The method of B15 or B16, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
For each switch whose host proportion does not exceed the preset first ratio, determining a second switch proportion for network failures in which the Internet data center to which the switch belongs fails to connect to switches under the other Internet data centers;
For each Internet data center whose second switch proportion exceeds a preset third ratio, determining that a network connection fault exists between Internet data centers.
B18. The method of any one of B9-B17, wherein the step of aggregating the metric data whose state is normal to obtain corresponding network quality data comprises:
Generating quantile data from the metric data whose state is normal at a preset fourth time interval;
Aggregating the metric data whose state is normal by corresponding switch, and computing the network quality metrics of each switch;
Aggregating the metric data whose state is normal by corresponding Internet data center, and computing the network quality metrics of each Internet data center;
Combining the generated quantile data with the network quality metrics of each switch and each Internet data center to form the corresponding network quality data.
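The aggregation in B18 can be sketched for a single time window as follows. The text does not name the metric or the statistic, so latency in milliseconds, mean latency as the per-switch and per-data-center quality metric, the nearest-rank percentile definition, and all names below are assumptions.

```python
import math
from collections import defaultdict
from statistics import mean
from typing import Dict, Iterable, List, Sequence, Tuple

# Hypothetical normal-state record: (data center, switch, latency in ms).
Sample = Tuple[str, str, float]


def percentile(values: Sequence[float], p: float) -> float:
    """Nearest-rank percentile of a non-empty sequence, for 0 < p <= 100."""
    ordered = sorted(values)
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]


def aggregate_window(samples: Iterable[Sample],
                     percentiles: Sequence[int] = (50, 90, 99)) -> Dict[str, Dict]:
    """Aggregate the samples collected during one time window."""
    all_values: List[float] = []
    by_switch: Dict[str, List[float]] = defaultdict(list)
    by_idc: Dict[str, List[float]] = defaultdict(list)
    for idc, switch, latency in samples:
        all_values.append(latency)
        by_switch[switch].append(latency)
        by_idc[idc].append(latency)
    return {
        # Quantile data over the whole window (empty if there were no samples).
        "quantiles": ({p: percentile(all_values, p) for p in percentiles}
                      if all_values else {}),
        # Mean latency per switch and per data center.
        "per_switch": {s: mean(v) for s, v in by_switch.items()},
        "per_idc": {d: mean(v) for d, v in by_idc.items()},
    }
```

The returned dictionary plays the role of the combined network quality data that would be sent on to the database server.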
Numerous specific details are set forth in the specification provided here. It should be understood, however, that embodiments of the invention can be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid understanding of one or more of the inventive aspects, the features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof in the above description of exemplary embodiments. However, this method of disclosure should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single embodiment disclosed above. The claims following the detailed description are therefore expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art should understand that the modules, units, or components of the device in the examples disclosed herein may be arranged in a device as described in the embodiment, or alternatively may be located in one or more devices different from the device in the example. The modules in the foregoing examples may be combined into one module or may be further divided into multiple sub-modules.
Those skilled in the art will appreciate that the modules in the device of an embodiment can be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units, or components of an embodiment may be combined into one module, unit, or component, and may furthermore be divided into multiple sub-modules, sub-units, or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract, and drawings) and all processes or units of any method or device so disclosed may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract, and drawings) may be replaced by an alternative feature serving the same, equivalent, or similar purpose.
In addition, those skilled in the art will understand that although some embodiments described herein include some features included in other embodiments but not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
In addition, some of the embodiments are described herein as a method, or a combination of method elements, that can be implemented by a processor of a computer system or by other means of carrying out the function. A processor with the necessary instructions for carrying out such a method or method element therefore forms a means for carrying out the method or method element. Furthermore, an element of a device embodiment described herein is an example of a means for carrying out the function performed by the element for the purpose of carrying out the invention.
The various techniques described herein may be implemented in hardware or software, or a combination of them. Thus, the methods and devices of the invention, or certain aspects or portions of them, may take the form of program code (that is, instructions) embedded in tangible media, such as floppy disks, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program is loaded into and executed by a machine such as a computer, the machine becomes a device for practicing the invention.
Where the program code executes on programmable computers, the computing device generally includes a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. The memory is configured to store the program code; the processor is configured to perform the network fault detection method of the invention according to the instructions in the program code stored in the memory.
By way of example and not limitation, computer-readable media include computer storage media and communication media. Computer storage media store information such as computer-readable instructions, data structures, program modules, or other data. Communication media typically embody computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and include any information delivery media. Combinations of any of the above are also included within the scope of computer-readable media.
As used herein, unless otherwise specified, the use of the ordinal numbers "first", "second", "third", and so on to describe an ordinary object merely indicates that different instances of like objects are being referred to, and is not intended to imply that the objects so described must be in a given sequence, whether temporally, spatially, in ranking, or in any other manner.
Although the invention has been described with respect to a limited number of embodiments, those skilled in the art, having the benefit of the above description, will appreciate that other embodiments can be envisaged within the scope of the invention thus described. It should also be noted that the language used in this specification has been selected principally for readability and instructional purposes, and not to delineate or circumscribe the inventive subject matter. Many modifications and variations will therefore be apparent to those of ordinary skill in the art without departing from the scope and spirit of the appended claims. With respect to the scope of the invention, the disclosure made is illustrative and not restrictive, and the scope of the invention is defined by the appended claims.
Claims (10)
1. A network fault detection method, suitable for execution in an aggregation server, the aggregation server being communicatively connected to one or more network servers and to a database server, the network servers being connected to multiple Internet data centers, each Internet data center having multiple switches deployed in it, each switch having multiple hosts connected to it, and each host being provided in advance with a corresponding data probe that is communicatively connected to the network server, the method comprising:
Receiving one or more items of metric data reported by the network server, the metric data being formed by the network server from the probe results reported by each data probe;
Checking network connections according to the metric data whose state is abnormal;
If the check result indicates that a network connection fault exists, sending an alarm;
Aggregating the metric data whose state is normal to obtain corresponding network quality data;
Sending the network quality data to the database server for storage.
2. The method of claim 1, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
According to the metric data whose state is abnormal, determining, for every two switches under each Internet data center, the proportion of hosts under one switch that have a network failure in connecting to the other switch;
For each switch whose host proportion exceeds a preset first ratio, determining that a network connection fault exists between the switches.
3. The method of claim 1 or 2, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
For each switch whose host proportion does not exceed the preset first ratio, determining, under the Internet data center to which the switch belongs, a first switch proportion, namely the proportion of switches that have a network failure in connecting to each switch under the other Internet data centers;
For each Internet data center whose first switch proportion exceeds a preset second ratio, determining that a network connection fault exists between the Internet data center and a switch.
4. The method of any one of claims 1-3, wherein the step of checking network connections according to the metric data whose state is abnormal comprises:
For each switch whose host proportion does not exceed the preset first ratio, determining a second switch proportion for network failures in which the Internet data center to which the switch belongs fails to connect to switches under the other Internet data centers;
For each Internet data center whose second switch proportion exceeds a preset third ratio, determining that a network connection fault exists between Internet data centers.
5. The method of any one of claims 1-4, wherein the step of aggregating the metric data whose state is normal to obtain corresponding network quality data comprises:
Generating quantile data from the metric data whose state is normal at a preset fourth time interval;
Aggregating the metric data whose state is normal by corresponding switch, and computing the network quality metrics of each switch;
Aggregating the metric data whose state is normal by corresponding Internet data center, and computing the network quality metrics of each Internet data center;
Combining the generated quantile data with the network quality metrics of each switch and each Internet data center to form the corresponding network quality data.
6. The method of any one of claims 1-5, wherein the first ratio, the second ratio, and the third ratio are preset to 0.5.
7. A computing device, comprising:
One or more processors;
A memory; and
One or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for performing any of the methods according to claims 1-6.
8. A computer-readable storage medium storing one or more programs, the one or more programs including instructions that, when executed by a computing device, cause the computing device to perform any of the methods according to claims 1-6.
9. A network fault detection method, suitable for execution in a network fault detection system, the system comprising one or more network servers, an aggregation server, a database server, and multiple data probes, the system being connected to multiple Internet data centers, each Internet data center having multiple switches deployed in it, each switch having multiple hosts connected to it, each host being provided in advance with a corresponding data probe, and the network server storing the network probe list corresponding to the host associated with each data probe, the method comprising:
The data probe obtains, from the network server, the network probe list corresponding to its associated host, performs network state probing at a preset first time interval according to the obtained network probe list, and sends the probe results to the network server;
The network server receives the probe results reported by each data probe, forms each probe result into corresponding metric data, and sends the metric data to the corresponding aggregation server;
The aggregation server receives one or more items of metric data reported by the network server, checks network connections according to the metric data whose state is abnormal, sends an alarm if the check result indicates that a network connection fault exists, aggregates the metric data whose state is normal to obtain corresponding network quality data, and sends the network quality data to the database server;
The database server receives and stores the network quality data sent by the aggregation server.
10. A network fault detection system, the system comprising one or more network servers, an aggregation server, a database server, and multiple data probes, the system being connected to multiple Internet data centers, each Internet data center having multiple switches deployed in it, each switch having multiple hosts connected to it, each host being provided in advance with a corresponding data probe, and the network server storing the network probe list corresponding to the host associated with each data probe, wherein in the system:
The data probe is adapted to obtain, from the network server, the network probe list corresponding to its associated host, to perform network state probing at a preset first time interval according to the obtained network probe list, and to send the probe results to the network server;
The network server is adapted to receive the probe results reported by each data probe, to form each probe result into corresponding metric data, and to send the metric data to the corresponding aggregation server;
The aggregation server is adapted to receive one or more items of metric data reported by the network server, to check network connections according to the metric data whose state is abnormal, to send an alarm if the check result indicates that a network connection fault exists, to aggregate the metric data whose state is normal to obtain corresponding network quality data, and to send the network quality data to the database server;
The database server is adapted to receive and store the network quality data sent by the aggregation server.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711216127.7A CN107835098B (en) | 2017-11-28 | 2017-11-28 | Network fault detection method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711216127.7A CN107835098B (en) | 2017-11-28 | 2017-11-28 | Network fault detection method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107835098A true CN107835098A (en) | 2018-03-23 |
CN107835098B CN107835098B (en) | 2021-01-29 |
Family
ID=61646152
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711216127.7A Active CN107835098B (en) | 2017-11-28 | 2017-11-28 | Network fault detection method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107835098B (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109155745A (en) * | 2018-07-16 | 2019-01-04 | 威富通科技有限公司 | Payment gateway is connected to the network detection method and terminal device |
CN110830324A (en) * | 2019-10-28 | 2020-02-21 | 烽火通信科技股份有限公司 | Method and device for detecting network connectivity of data center and electronic equipment |
CN110852387A (en) * | 2019-11-13 | 2020-02-28 | 江苏能来能源互联网研究院有限公司 | Energy internet super real-time state studying and judging algorithm |
CN110932894A (en) * | 2019-11-22 | 2020-03-27 | 北京金山云网络技术有限公司 | Network fault positioning method and device of cloud storage system and electronic equipment |
CN111049691A (en) * | 2019-12-25 | 2020-04-21 | 中国联合网络通信集团有限公司 | Network fault positioning method, server, acquisition probe and storage medium |
CN111343049A (en) * | 2020-03-02 | 2020-06-26 | 网联清算有限公司 | Detection method, device, system and medium for transaction information data transmission special line |
CN111817911A (en) * | 2020-06-23 | 2020-10-23 | 腾讯科技(深圳)有限公司 | Method and device for detecting network quality, computing equipment and storage medium |
CN111989897A (en) * | 2018-04-10 | 2020-11-24 | 奈特朗茨公司 | Measurement indicators for computer networks |
CN112165400A (en) * | 2020-09-25 | 2021-01-01 | 天津大学 | System for troubleshooting data network based on network delay |
CN112636942A (en) * | 2019-10-08 | 2021-04-09 | 中国移动通信集团浙江有限公司 | Method and device for monitoring service host node |
CN113676376A (en) * | 2021-08-20 | 2021-11-19 | 北京交通大学 | In-band network telemetering method based on clustering |
CN113783752A (en) * | 2021-08-26 | 2021-12-10 | 四川新网银行股份有限公司 | Network quality monitoring method during mutual access of intranet cross-network inter-segment service systems |
CN114095808A (en) * | 2020-08-24 | 2022-02-25 | 华为技术有限公司 | Network fault detection method, device, equipment and computer readable storage medium |
CN114157554A (en) * | 2021-12-21 | 2022-03-08 | 唯品会(广州)软件有限公司 | Troubleshooting method and device, storage medium and computer equipment |
CN114172796A (en) * | 2021-12-24 | 2022-03-11 | 中国工商银行股份有限公司 | Fault positioning method and related device for communication network |
CN112994972B (en) * | 2021-02-02 | 2022-05-20 | 成都卓源网络科技有限公司 | Distributed probe monitoring platform |
CN117880055A (en) * | 2024-03-12 | 2024-04-12 | 灵长智能科技(杭州)有限公司 | Network fault diagnosis method, device, equipment and medium based on transmission layer index |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101291232A (en) * | 2008-06-03 | 2008-10-22 | 北京星网锐捷网络技术有限公司 | Ethernet port, Ethernet switch and signal receiving and sending method of Ethernet equipment |
CN102055626A (en) * | 2010-12-31 | 2011-05-11 | 北京中创信测科技股份有限公司 | Internet protocol (IP) network quality detecting method and system |
CN102891779A (en) * | 2012-09-27 | 2013-01-23 | 北京网瑞达科技有限公司 | Large-scale network performance measuring system and method for IP network |
WO2016015041A1 (en) * | 2014-07-25 | 2016-01-28 | Blockchain Technologies Corporation | System and method for creating a multi-branched blockchain with configurable protocol rules |
CN105871634A (en) * | 2016-06-01 | 2016-08-17 | 北京蓝海讯通科技股份有限公司 | Method and application for detecting cluster anomalies and cluster managing system |
CN106991033A (en) * | 2017-04-01 | 2017-07-28 | 北京蓝海讯通科技股份有限公司 | Notify method, device, server and the readable storage medium storing program for executing of alarm information |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111989897B (en) * | 2018-04-10 | 2023-09-01 | 瞻博网络公司 | Measuring index of computer network |
US11595273B2 (en) | 2018-04-10 | 2023-02-28 | Juniper Networks, Inc. | Measuring metrics of a computer network |
CN111989897A (en) * | 2018-04-10 | 2020-11-24 | 奈特朗茨公司 | Measurement indicators for computer networks |
CN109155745B (en) * | 2018-07-16 | 2019-08-30 | 威富通科技有限公司 | Payment gateway is connected to the network detection method and terminal device |
CN109155745A (en) * | 2018-07-16 | 2019-01-04 | 威富通科技有限公司 | Payment gateway is connected to the network detection method and terminal device |
CN112636942B (en) * | 2019-10-08 | 2022-09-27 | 中国移动通信集团浙江有限公司 | Method and device for monitoring service host node |
CN112636942A (en) * | 2019-10-08 | 2021-04-09 | 中国移动通信集团浙江有限公司 | Method and device for monitoring service host node |
CN110830324A (en) * | 2019-10-28 | 2020-02-21 | 烽火通信科技股份有限公司 | Method and device for detecting network connectivity of data center and electronic equipment |
CN110830324B (en) * | 2019-10-28 | 2021-09-03 | 烽火通信科技股份有限公司 | Method and device for detecting network connectivity of data center and electronic equipment |
CN110852387A (en) * | 2019-11-13 | 2020-02-28 | 江苏能来能源互联网研究院有限公司 | Energy internet super real-time state studying and judging algorithm |
CN110852387B (en) * | 2019-11-13 | 2022-04-22 | 江苏能来能源互联网研究院有限公司 | Energy internet super real-time state studying and judging algorithm |
CN110932894A (en) * | 2019-11-22 | 2020-03-27 | 北京金山云网络技术有限公司 | Network fault positioning method and device of cloud storage system and electronic equipment |
CN111049691A (en) * | 2019-12-25 | 2020-04-21 | 中国联合网络通信集团有限公司 | Network fault positioning method, server, acquisition probe and storage medium |
CN111049691B (en) * | 2019-12-25 | 2022-06-10 | 中国联合网络通信集团有限公司 | Network fault positioning method, server, acquisition probe and storage medium |
CN111343049A (en) * | 2020-03-02 | 2020-06-26 | 网联清算有限公司 | Detection method, device, system and medium for transaction information data transmission special line |
CN111817911A (en) * | 2020-06-23 | 2020-10-23 | 腾讯科技(深圳)有限公司 | Method and device for detecting network quality, computing equipment and storage medium |
CN111817911B (en) * | 2020-06-23 | 2023-08-08 | 腾讯科技(深圳)有限公司 | Method, device, computing equipment and storage medium for detecting network quality |
CN114095808A (en) * | 2020-08-24 | 2022-02-25 | 华为技术有限公司 | Network fault detection method, device, equipment and computer readable storage medium |
CN114095808B (en) * | 2020-08-24 | 2023-04-28 | 华为技术有限公司 | Network fault detection method, device, equipment and computer readable storage medium |
CN112165400A (en) * | 2020-09-25 | 2021-01-01 | 天津大学 | System for troubleshooting data network based on network delay |
CN112994972B (en) * | 2021-02-02 | 2022-05-20 | 成都卓源网络科技有限公司 | Distributed probe monitoring platform |
CN113676376A (en) * | 2021-08-20 | 2021-11-19 | 北京交通大学 | In-band network telemetering method based on clustering |
CN113783752A (en) * | 2021-08-26 | 2021-12-10 | 四川新网银行股份有限公司 | Network quality monitoring method during mutual access of intranet cross-network inter-segment service systems |
CN114157554A (en) * | 2021-12-21 | 2022-03-08 | 唯品会(广州)软件有限公司 | Troubleshooting method and device, storage medium and computer equipment |
CN114157554B (en) * | 2021-12-21 | 2024-02-23 | 唯品会(广州)软件有限公司 | Fault checking method and device, storage medium and computer equipment |
CN114172796A (en) * | 2021-12-24 | 2022-03-11 | 中国工商银行股份有限公司 | Fault positioning method and related device for communication network |
CN114172796B (en) * | 2021-12-24 | 2024-01-30 | 中国工商银行股份有限公司 | Fault positioning method and related device for communication network |
CN117880055A (en) * | 2024-03-12 | 2024-04-12 | 灵长智能科技(杭州)有限公司 | Network fault diagnosis method, device, equipment and medium based on transmission layer index |
Also Published As
Publication number | Publication date |
---|---|
CN107835098B (en) | 2021-01-29 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |