CN108494594A - A kind of analysis method and system of EIGRP route networks failure - Google Patents

A kind of analysis method and system of EIGRP route networks failure Download PDF

Info

Publication number
CN108494594A
CN108494594A CN201810233540.2A CN201810233540A CN108494594A CN 108494594 A CN108494594 A CN 108494594A CN 201810233540 A CN201810233540 A CN 201810233540A CN 108494594 A CN108494594 A CN 108494594A
Authority
CN
China
Prior art keywords
sample
training
eigrp
fdemc
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810233540.2A
Other languages
Chinese (zh)
Inventor
钱叶魁
叶立新
杜江
黄浩
杨瑞朋
雒朝峰
王丙坤
李宇翀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Campus Of Chinese People's Liberation Army Army Artillery Air Defense Academy
Original Assignee
Zhengzhou Campus Of Chinese People's Liberation Army Army Artillery Air Defense Academy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Campus Of Chinese People's Liberation Army Army Artillery Air Defense Academy filed Critical Zhengzhou Campus Of Chinese People's Liberation Army Army Artillery Air Defense Academy
Priority to CN201810233540.2A priority Critical patent/CN108494594A/en
Publication of CN108494594A publication Critical patent/CN108494594A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/2155Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823Errors, e.g. transmission errors

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a kind of FDEMC failure analysis methods of EIGRP route networks, the FDEMC failure analysis methods are the clustering methods based on EM algorithms and gauss hybrid models GMM, realize the unsupervised analysis that data are route to EIGRP, this approach includes the following steps:The routing message acquired in the EIGRP route networks carries out feature extraction and data prediction, forms training sample set;The gauss hybrid models GMM is trained using the training sample set to obtain the Clustering Model of training completion;The test sample for obtaining the EIGRP route networks inputs the test sample in the Clustering Model that the training is completed, to obtain the classification results of the test sample.The analytical judgment to EIGRP route network fault types can be achieved by the execution of this method, it is particularly possible to analyze and determine to the unknown failure in EIGRP route networks.In addition, the invention also discloses a kind of FDEMC trouble analysis systems of EIGRP route networks.

Description

A kind of analysis method and system of EIGRP route networks failure
Technical field
The present invention relates to field of communication technology more particularly to a kind of analysis methods and system of route network failure.
Background technology
With the fast development of Internet technology, networking products are more and more universal in production and living, the network scale of construction and multiple Miscellaneous degree constantly increases, and IP network is gradually to offer high-quality traffic direction development, quality and network product of the user to network service More stringent requirements are proposed for matter, and IP network monitors a kind of important means for having become network management, due to the service quality of routing Directly influence the running quality of whole network, thus the monitoring of routing failure is become one in network monitor it is important in Hold.
Routing failure refers to the case where Routing Protocol operation deviates normal condition.Route network fault diagnosis can be divided by Autonomous Domain For intra-area routes identification and inter-domain routing identification, enhanced interior gateway routing protoc EIGRP (Enhanced Interior Gateway Protocol, EIGRP) belong to intra-area routes agreement, have many advantages, such as that deployment is simple, highly reliable, robustness is good, It is commonly applied to high reliability, the network field of high security, such as banking system and military system.If eigrp routing protocol It is abnormal in the process of running or failure, it will cause very serious consequence, therefore effectively in identification EIGRP route networks The failure and exception of appearance, for safeguarding that the normal operation of network is of great significance.
ShaikhA et al. proposes two different ospfs (Open Shortest Path First, OSPF) Routing Protocol monitoring method, one is the SNMP methods based on management platform, another kind be based on LSAR with Control platform method has been applied to analyze OSPF in actual enterprise's ospf network with monitoring by the control platform method of LSAG Various actions performance in the actual environment;The specific framework of this monitoring method has also been set forth in detail in ShaikhA et al., and Realize this recognition methods.In recent years, machine learning is introduced in network fault diagnosis field, there is researcher will Neural network algorithm and support vector machines in machine learning are applied in Network Fault Detecting and obtain good experimental result, Demonstrate the feasibility that machine learning algorithm is applied in this field.However, the research in route network intelligent fault diagnosis field OSPF Routing Protocols are all mainly concentrated on, the intelligent fault diagnosis of eigrp routing protocol network is studied still in missing shape State;Further, the existing method based on neural network and support vector machines is on the sample basis including class label Fault type is analyzed, i.e., they are a kind of recognition methods having supervision, being limited in that for this method can only identify Some classifications (i.e. the class label), cannot identify new fault type in time, this can according to the differentiation of practical application It can lead to the inaccurate and unreasonable of classification.
Information, which is transmitted, in eigrp routing protocol mainly finds by reliable transport protocol (RTP) and neighbours and restores two Module is realized;The routing iinformation of router both is from direct-connected neighbor router in eigrp routing protocol network, each to route Only there is neighbor router port routing information in device, and neighbor router can be in the same way from the adjoining direct-connected with them Router obtains routing iinformation, and the operating mechanism of itself is different with traditional OSPF Routing Protocols, rip routing protocol, cannot The method for diagnosing faults of other Routing Protocols is simply indiscriminately imitated, therefore, there is an urgent need for one kind can either diagnosing EIGRP routings by network administrator Failure, and the intellectual analysis identification technology of unknown failure type can be identified in time.
Invention content
The present invention discloses a kind of FDEMC failures of the EIGRP route networks based on EM algorithms and gauss hybrid models GMM point Method and system is analysed, solving can not be analyzed in the prior art for EIGRP routing failures, and cannot be identified in time unknown The technical issues of routing failure.
In order to solve the above technical problems, one aspect of the present invention is to provide a kind of EIGRP route networks FDEMC failure analysis methods, the FDEMC failure analysis methods are the cluster sides based on EM algorithms and gauss hybrid models GMM Method is realized the unsupervised analysis to the EIGRP route networks known fault and unknown failure, is included the following steps:Step S1, The routing message acquired in the EIGRP route networks carries out feature extraction and data prediction, forms training sample set;Step S2 trains the gauss hybrid models GMM to obtain the Clustering Model of training completion using the training sample set;The training is complete At Clustering Model be the gauss hybrid models GMM for having optimized parameter, and the optimized parameter is calculated by using EM algorithms It arrives;Step S3 obtains the test sample of the EIGRP route networks, and the test sample is inputted the poly- of the training completion In class model, to obtain the classification results of the test sample and then identify the operation conditions of the EIGRP route networks;Into one It walks, the feature extracted in the step S1 includes at least operation code, label, confirms sequence number, self-aid system number, K value fields, AS Domain field, summarize automatically, IP address is discontinuous, source router, destination router, the feature for indicating not same area router setting.
In another embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention, the step S2 includes Following steps:The training sample set is divided into m sample set by step S21, and each sample set indicates a classification, Three mean value of each classification, covariance and weight initial parameter values are calculated on the basis of the m sample set to be formed The initial parameter vector set of the m classificationIts InThe weight initial value of the m classification is indicated respectively,The m is indicated respectively The mean value initial value of a classification,The covariance initial value of the m classification is indicated respectively;Step S22, Following two steps operation is repeatedly carried out, until the log-likelihood function of the gauss hybrid models GMM is restrained:E steps:Described Training sample set and the estimation of parameter current vector setGiven situation Under, calculate the conditional expectation of the log-likelihood function of the training sample set:
The wijIndicate that i-th of sample belongs to the probability of jth class, x indicates that the sample of d dimensions, θ indicate Gaussian Mixture The parameter vector set of model, t=0,1 ..., J indicate that the current iteration number of EM algorithms, J indicate that the maximum in EM algorithms changes Generation number;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class in current t values Weight under iteration;Indicate sample average of the kth class under current t values iteration;Indicate kth class in current t values iteration Under covariance matrix;
M steps:Update the parameter of gauss hybrid modelsMake the conditional expectation of the log-likelihood function Value maximizes:
WhereinIt is the weight of jth class,It is the mean value of jth class, is d dimensional vectors,It is the covariance matrix of jth class, For d rank symmetric positive definite matrixs, d indicates that the dimension of training sample, N indicate the number of training sample in the training sample.
In another embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention, the step S21 is adopted The model start-up parameter that the gauss hybrid models GMM is obtained with random initializtion method, specifically includes following steps:First, The center for being arbitrarily designated m sample as cluster is concentrated in training sample, m is classification sum herein, and m is positive integer;Second, point Not Ji Suan training sample concentrate at a distance from m sample of other samples and this, each sample is divided into m using apart from nearest principle In one type in class, and distribute identical class label;Third is calculated according to all kinds of all sample values per a kind of power Weight, mean value and covariance matrix.
In another embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention, when progress Gaussian Mixture When model GM M training, the maximum iteration of EM algorithms is set as 50 times.
In another embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention, according to training sample set Actual conditions, cluster cluster quantity be arranged ranging from 3 to 10.
Further, the present invention also proposes a kind of FDEMC trouble analysis systems of EIGRP route networks, the FDEMC failures Analysis system includes preprocessing module, model execution module, cluster result output module, to realize to the EIGRP routing networks The unsupervised analysis of network known fault and unknown failure, wherein:Preprocessing module, for acquiring in the EIGRP route networks Routing message carry out feature extraction and data prediction, form training sample set;Model execution module, for using the instruction Practice sample set and trains the gauss hybrid models GMM to obtain the Clustering Model of training completion;The Clustering Model that the training is completed It is there is the gauss hybrid models GMM of optimized parameter, and the optimized parameter is calculated by using EM algorithms;Cluster result is defeated Go out module, the test sample is inputted the poly- of the training completion by the test sample for obtaining the EIGRP route networks In class model, to obtain the classification results of the test sample and then identify the operation conditions of the EIGRP route networks;Into one Step, the feature of preprocessing module extraction include at least operation code, label, confirm sequence number, self-aid system number, K value fields, The domains AS field summarizes, the spy that IP address is discontinuous, router is arranged in source router, destination router and representative domain automatically Sign.
In another embodiment of the FDEMC trouble analysis systems of EIGRP route networks of the present invention, the model executes mould Block is specifically included with lower unit:The training sample set is divided into m sample set, each sample by initial computation unit One classification of set representations, at the beginning of the mean value, covariance and weight three of each classification are calculated on the basis of the m sample set Beginning parameter value is to form the initial parameter vector set of the m classificationWhereinThe weight initial value of the m classification is indicated respectively,The m is indicated respectively The mean value initial value of a classification,The covariance initial value of the m classification is indicated respectively;Convergence calculates Unit repeatedly carries out following two steps operation, until the log-likelihood function of the gauss hybrid models GMM is restrained:E steps: Estimate in the training sample set and parameter current vector setIt gives In the case of fixed, the conditional expectation of the log-likelihood function of the training sample set is calculated:
The wijIndicate that i-th of sample belongs to the probability of jth class, x indicates that the sample of d dimensions, θ indicate Gaussian Mixture The parameter vector set of model, t=0,1 ..., J indicate that the current iteration number of EM algorithms, J indicate that the maximum in EM algorithms changes Generation number;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class in current t values Weight under iteration;Indicate sample average of the kth class under current t values iteration;Indicate kth class in current t values iteration Under covariance matrix;
M steps:Update the parameter of gauss hybrid modelsMake the conditional expectation of the log-likelihood function Value maximizes:
WhereinIt is the weight of jth class,It is the mean value of jth class, is d dimensional vectors,It is the covariance matrix of jth class, For d rank symmetric positive definite matrixs, d indicates that the dimension of training sample, N indicate the number of training sample in the training sample.
In another embodiment of the FDEMC trouble analysis systems of EIGRP route networks of the present invention, the initial calculation list Member obtains the model start-up parameter of the gauss hybrid models GMM using random initializtion method, specifically includes following steps:The One, the center for being arbitrarily designated m sample as cluster is concentrated in training data, m is classification sum herein;Second, it calculates separately At a distance from m sample of other samples and this, each sample is divided into the one type in m classes using apart from nearest principle, And distribute identical class label;Third is calculated according to all kinds of all sample values per a kind of weight, mean value and covariance square Battle array.
In another embodiment of the FDEMC trouble analysis systems of EIGRP route networks of the present invention, when progress Gaussian Mixture When model GM M training, the maximum iteration of EM algorithms is set as 50 times.
In another embodiment of the FDEMC trouble analysis systems of EIGRP route networks of the present invention, according to training sample set Actual conditions, cluster cluster quantity be arranged ranging from 3 to 10.
The beneficial effects of the invention are as follows:The present invention provides a kind of FDEMC analysis methods of EIGRP route networks failure, The analysis method is a kind of unsupervised clustering based on EM algorithms and gauss hybrid models GMM, is not only realized to EIGRP The real-time analytical judgment of route network failure can also analyze the unknown failure in eigrp network.Further, this hair The bright feature being also creatively extracted for clustering improves reliability, the robustness of this method in practical applications And adaptability.In addition, the invention also provides a kind of FDEMC analysis systems of EIGRP route networks failure.
Description of the drawings
Fig. 1 is the route network monitoring mould for the FDEMC failure analysis methods of application EIGRP route networks of the present invention Type;
Fig. 2 is the flow chart of an embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention;
Fig. 3 is the initial method flow chart according to the FDEMC failure analysis methods of EIGRP route networks of the present invention;
Fig. 4 is the classification under different iterations according to the FDEMC failure analysis methods of EIGRP route networks of the present invention Precision schematic diagram;
Fig. 5 is the classification essence under different number of clusters amounts according to the FDEMC failure analysis methods of EIGRP route networks of the present invention Spend schematic diagram;
Fig. 6 is the FDEMC trouble analysis system composition figures of EIGRP route networks of the present invention.
Specific implementation mode
To facilitate the understanding of the present invention, in the following with reference to the drawings and specific embodiments, the present invention will be described in more detail. The preferred embodiment of the present invention is given in attached drawing.But the present invention can realize in many different forms, and it is unlimited In this specification described embodiment.Make to the disclosure on the contrary, purpose of providing these embodiments is Understand more thorough and comprehensive.
It should be noted that unless otherwise defined, all technical and scientific terms used in this specification with belong to The normally understood meaning of those skilled in the art of the present invention is identical.Used term in the description of the invention It is to be not intended to the limitation present invention to describe the purpose of specific embodiment.Term "and/or" packet used in this specification Include any and all combinations of one or more relevant Listed Items.
Gauss hybrid models GMM (Gaussian Mixed Model, GMM) refers to the linear of multiple gauss of distribution function Combination, theoretically gauss hybrid models GMM can fit any type of distribution, commonly used in solving the number under identity set The case where according to comprising multiple and different distributions, wherein cluster is one of its common scene, clustering algorithm refers to not having a pile The data of label are divided into classes of method automatically.Gauss hybrid models GMM is mixed by the Gaussian Profile with different model parameters It is combined into, by calculating the weighted sum of several Gaussian probability-density functions, the distribution situation of input data set is described, It can be with the Density Distribution of smoothed approximation arbitrary shape.
When using gauss hybrid models GMM come when clustering, it is assumed that the data that we have had are by gauss hybrid models What GMM was generated, as long as releasing the probability distribution of gauss hybrid models GMM according to data come then high at this time The K Component of this mixed model GMM has actually just corresponded to K group or class (cluster).It is calculated according to data Probability density is commonly referred to as density estimation (density estimation), particularly, when we are at known (or hypothesis) The form of probability density function, and to estimate the process of parameter therein and be referred to as " parameter Estimation ".
It is now assumed that we have N number of data point, and assume that they obey some distribution (being denoted as p (x)), it is now in determining The value of some parameters in face, for example, in gauss hybrid models GMM, we are just it needs to be determined that πk、μkAnd ΣkThese parameters.It is main The idea wanted is to find such one group of parameter, and probability distribution determined by it generates the maximum probability of these given data points, And this probability is actually equal toThis product is referred to as likelihood function (likelihood by we function).Probability all very littles of usual a single point, the number of many very littles be multiplied get up easily caused in computer it is floating Points underflow, therefore we would generally take logarithm to it, and product is become to sum it upObtain log-likelihood function log‐likelihood function.Next we as long as by this function maximization, (common practice is that derivation and enabling is led Number is equal to zero, then solves equation), that is, such one group of parameter value is found, it allows likelihood function to obtain maximum value, we are considered as This is most suitable parameter, and this completes the processes of parameter Estimation.
According to the above, we can obtain the log-likelihood function log- of gauss hybrid models GMM Likelihood function are:
Due to having adduction inside log-likelihood function (1), the method that we cannot directly be solved equation with derivation is directly asked Obtain maximum value.Therefore, EM (Expectation Maximization Algorithm) algorithms are introduced to solve Gaussian Mixture The log-likelihood function of model GM M.
EM algorithms it is expected very big algorithm, are a kind of iterative algorithms.It is expected that very big algorithm is suitable for containing hidden variable probability Model carries out maximum a posteriori estimate to the parameter in model.The each iterative process of EM algorithms is divided into two steps, referred to as E Step and M steps, E steps it is expected that M steps are used for maximizing for asking, and E steps and M walk continuous alternating iteration to increase total log-likelihood Until convergence, while solving the problems, such as that maximum value is absorbed in local optimum by way of successive ignition.E is walked:For training sample Each sample concentrated calculates the probability that the sample belongs to each cluster, and is weighed result of calculation as the sample of the sample Weight:As soon as if the possibility that sample belongs to some cluster is very big, the corresponding probability value of the sample is disposed proximate to 1;When going out When a case where existing data sample belongs to two or more clusters occurs, it is necessary to establish a probability for different clusters point Cloth.M is walked:The sample weights that each sample is concentrated in conjunction with input sample estimate the relevant parameters such as variance, mean value in each cluster Meter;The probability that each sample is calculated using in back passes through calculating as weight, similar K-MEANS cluster algorithms The mean value and variance of each cluster are obtained, and further calculates the maximum likelihood of cluster, obtains final result.That is, ought be seemingly When right function is bounded function, it is expected that very big algorithm makes likelihood function value increase by iteration calculating, until likelihood function Value converges to a stability boundaris.Due to the extremely simple calculating process of EM algorithms and conveniently characteristic, in medical diagnosis and sample The various fields such as this cluster are used widely.
Specifically, it is assumed that mixed distribution P, P, which are distributed by k independent Gaussian and (are referred to as Component), to be formed, and is led to The mode for crossing mixed distribution creates the set of data samples D that sample is tieed up comprising N number of d, then by the weighted average of k Gaussian density function The distribution P of represented probability density function description is:
Wherein
Wherein, formula (2) is referred to as gauss hybrid models, may also be referred to as Gaussian Mixture Model Probability Density, i.e., by multiple gaussian densities The limited set of composition.In the gauss hybrid models, x indicates random sample, θ expression parameter vector set, πjIndicate Gauss The weight of basic mode type Gaussian density function, μ in mixed modeljRepresent the mean value of jth class sample;ΣjIndicate the association of jth class sample Variance matrix.
It is similar with clustering method, there is different initiation parameters, each different Gauss in Gaussian density function Model is considered as a cluster or classification.After data sample x inputs, by model formation (2) and (3) result of calculation, then It is compared with result of calculation using a preset threshold value, judges whether data sample x belongs to the Gauss model.
As shown in the above, the gauss hybrid models GMM based on EM algorithms can be used for Unsupervised clustering analysis.This A kind of FDEMC (the Fault Detection based on Expectation of EIGRP route networks of disclosure of the invention Maximization Clustering) failure analysis methods are exactly a kind of event based on EM algorithms and gauss hybrid models GMM Hinder analysis method, data can be route to the EIGRP of collection and carry out unsupervised analysis, it can also be to EIGRP route networks Unknown failure carries out diagnostic analysis, improves the reasonability and accuracy to EIGRP route network accident analyses.
Fig. 1 shows the route network monitoring of the EIGRP route network FDEMC failure analysis methods for the application present invention Model.
Route network monitoring model in the present invention by more routers, monitoring node PC, eigrp routing protocol and Other procotols such as OSPF, routing information protocol RIP (Routing Information Protocol) form.In the present invention Route network it is more complicated, more meet the situation in true environment.Route network is made of multiple domains AS, disposes OSPF, RIP With the Routing Protocols such as Border Gateway Protocol (BGP) (Border Gateway Protocol).With the increase of route network scale, According to the characteristic of eigrp routing protocol, monitoring node quantity is consequently increased.When disposing monitoring node, by itself and route network In core router be connected, so as to collect the routing message for flowing through this network to greatest extent.
Information, which is transmitted, in eigrp routing protocol mainly finds by reliable transport protocol (RTP) and neighbours and restores two Module is realized;Routing table in each router only has the Port IP of neighbor router, the router in EIGRP route systems Neighbor router is all relied on routing iinformation, and neighbor router obtains routing letter from their neighbor router there Breath.Therefore when disposing monitoring node, generally it is connected with the center router of route network, to realize capture quantities of messages It maximizes.In practical operation, monitoring node is incorporated to by monitored route system by hub, monitoring node is avoided to influence road By the normal operation of system.
In route network monitoring model shown in Fig. 1, EIGRP route monitorings network is by 20 Cisco's C3640 routers It is formed with 2 monitoring hosts, wherein 16 routers separately constitute the domains AS1 171, the domains AS2 172, the domains AS3 173 and the domains AS4 174, There are 4 routers 10 in each domains AS;Wherein eigrp routing protocol, 174 middle part of the domains AS4 are disposed in the domains AS1 171 and the domains AS3 173 OSPF Routing Protocols are affixed one's name to, RIP agreements are disposed in the domains AS2 172;In addition to intra-area routes, also router 11, router 12, road By inter domain routers such as device 13, routers 14, BGP inter-domain routing protocols are disposed in these inter domain routers;In whole network Two monitoring hosts of middle deployment, monitoring node 15 are connected with router 12, and monitoring node 16 is connected with router 11, collects respectively All routing messages transmitted by core router 12,11.
Further, it manually resets the relevant parameter in each router, reappears failure in EIRGP route systems and different Often, and while failure occurs, the routing data message being monitored in router is collected using sniffer, after carrying out Continuous clustering.
Fig. 2 is the flow chart of an embodiment of the FDEMC failure analysis methods of EIGRP route networks of the present invention.In Fig. 2 In, the FDEMC failure analysis methods of the EIGRP route networks include the following steps:
Step S1, the routing message acquired in the EIGRP route networks carry out feature extraction and data prediction, are formed Training sample set;
For machine learning algorithm, since sample data is unable to the sample data under limit all situations, Therefore feature extraction it is appropriate whether to a certain extent on the learning effect of machine learning influence it is very big, in the prior art There are many articles to show for same training sample, when the feature difference of extraction, the classifying quality brought also can be different, Therefore, correctly the feature of extraction EIGRP route networks is just particularly important.The feature extraction of EIGRP route networks and data Pretreatment refer to be by the EIGRP of capture routing Message processing can be with the choosing of the sample set of training machine learning model, including feature It takes and the arrangement of data format.Collected routing message amount is more in the present invention, and is also contaminated with OSPF routing reports The noise datas such as text, so the data progress data prediction to acquisition is most important, on the one hand pretreatment can improve data Quality, on the other hand data can be allowed to be better suited for analyzing.The routing message acquired in the present invention is from as shown in Figure 1 It is acquired in EIGRP route monitoring networks.
Preferably, EIGRP, which route the field in message included, version, operation code, verification and label, sequence number, confirmation The field of the various description routing iinformations such as sequence number, self-aid system number, failure route version in message, verification and sequence number etc. This kind of field is not different with normal routing message, so this kind of field will not be chosen as data sample feature.
Preferably, existing known fault type has six kinds, is that syntople establishes failure, routing publication failure, road respectively Failure is redistributed by installation failure, route flapping failure, Route summary failure, routing.The reason of syntople can not be established has K Value is different or not in the same domains AS, the K value fields routeing in message and the domains AS field will be selected as sample characteristics at this time, To identify adjacent failure.Remaining failure, which all refers to router, can not receive or issue rational routing entry, and reason has certainly It is dynamic summarize, IP address is discontinuous etc., so summarize automatically, whether IP address is continuously chosen as sample characteristics, to distinguish these Routing failure caused by reason.In addition, message length field contributes to fault distinguishing, therefore the field is also chosen as sample spy Sign.
Preferably, correlated characteristic is extracted in conjunction with primary routing message and EIGRP phenomena of the failure, in the routing message of acquisition, The field values such as sourceIP, destinationIP are invalid for actual analysis, and actual IP can constantly change, and can not pass through tool The IP address of body judges EIGRP routing failure reasons, so this kind of field is converted into useful feature information, such as will Two Continuous valued attributes of sourceIP and destinationIP be converted to sourceRoute and adjacencyRoute it is this kind of from Scattered classification value;In addition, in EIGRP route networks, the router setting in the different domains AS is different, these settings can't be It is embodied in routing message, but route network can be caused to be abnormal, this kind of setting needs are manually extracted as sample characteristics.Feature After the completion of extraction, initial data message is converted to the CSV formatted data files that can carry out machine learning analysis.Table 1 is shown All sample characteristics and its meaning selected in the present invention.
Table 1
By the analysis to routeing message in EIGRP route networks, the present invention proposes EIGRP route networks for identification The feature of failure includes at least operation code in the feature, label, confirms sequence number, self-aid system number, K value fields, the domains AS word Section, summarize automatically, IP address is discontinuous, source router, destination router, the setting of same area router.
In addition, when route network breaks down or is abnormal, by the data packet of capture it is found that there was only a small number of data packets In information can change, cause exceptional sample quantity be far less than normal sample quantity, make in sample set occur data not Equilibrium appearance has an impact accident analysis and model accuracy.There are many kinds of the methods of data balancing, can generally pass through receipts Collect more data and carry out balance sample collection, but eigrp routing protocol is in usual stable operation, even if breaking down or abnormal When, it is to be still HELLO data packets to be transmitted in route network at most, and fault data packet quantity does not have too big variation. So the method for taking oversampling in the present invention carries out data balancing, oversampling is by increasing minority class sample in sample set For quantity come to realize sample equilibrium, simplest method be exactly to replicate minority class sample to form a plurality of record, this method is easy, straight It connects, but the disadvantage is that if sample characteristics are less, is likely to result in over-fitting.Under normal circumstances, imbalanced training sets can also be led Cause feature distribution unbalanced, but group very this measurer has certain scale, then the distribution of its characteristic value is more uniform, passes through The cooperation of the feature with notable type is selected to participate in solving the problems, such as imbalanced training sets.In the data packet acquired in the present invention in addition to The data packet of EIGRP also wants the data packet of the Routing Protocols such as OSPF, RIP and BGP, these data packets are medium in training sample set It is same as noise data.Due to the presence of these noises, over-fitting can be prevented to a certain extent.
Step S2:The gauss hybrid models GMM is trained using the training sample set to obtain the cluster of training completion Model;The Clustering Model that the training is completed is the gauss hybrid models GMM for having optimized parameter, and the optimized parameter by using EM algorithms are calculated.
Preferably, the step S2 includes:
The training sample set is divided into m sample set by step S21, and each sample set indicates a classification, Three mean value of each classification, covariance and weight initial parameter values are calculated on the basis of the m sample set to be formed State the initial parameter vector set of m classificationIts InThe weight initial value of the m classification is indicated respectively,The m is indicated respectively The mean value initial value of a classification,The covariance initial value of the m classification is indicated respectively;
Step S22 repeatedly carries out following two steps operation, until the log-likelihood function of the gauss hybrid models GMM Convergence:
E steps:Estimate in the training sample set and parameter current vector set In the case of given, the conditional expectation of the log-likelihood function of the training sample set is calculated:
The wijIndicate that i-th of sample belongs to the probability of jth class;θ expression parameter vector set;X is indicated in sample set A sample;T=0,1 ..., J indicate that the current iteration number of EM algorithms, J indicate the maximum iteration in EM algorithms;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class under current t values iteration Weight;Indicate sample average of the kth class under current t values iteration;Indicate association of the kth class under current t values iteration Variance matrix;
M steps:Update the parameter of gauss hybrid modelsMake the conditional expectation of the log-likelihood function Value maximizes:
WhereinIt is the weight of jth class,It is the mean value of jth class, is d dimensional vectors,It is the covariance matrix of jth class, For d rank symmetric positive definite matrixs, d indicates that the dimension of training sample, N indicate the number of training sample in the training sample.
Step S3 obtains the test sample of the EIGRP route networks, and the test sample input training is completed Clustering Model in, to obtain the classification results of the test sample and then identify the operation conditions of the EIGRP route networks. The wherein test sample, can be identical with training sample, can also be different with training sample.It has been trained generally for determination The test effect of the Clustering Model of completion, can select the test sample different with training sample to be tested.
The FDEMC failure analysis methods of EIGRP routing networks proposed by the present invention are a kind of unsupervised clustering learning sides Method can analyze the fault type and reason of EIGRP route networks in real time, can also to unknown failure and reason into On the one hand row automatic identification can help the exception occurred in network related personnel's real-time oversight network and failure etc., another party Face can improve people to unknown abnormal understanding in network, management and maintenance of the optimum management personnel to network.Further, it ties Close the feature extracted of the present invention, can improve FDEMC failure analysis methods adaptability in practical applications, reliability and Robustness.
Preferably, when presorting to training sample set, the type number presorted can be pre-set.When setting It is 1 to set the type number presorted, at this time the pseudocode of the FDEMC failure analysis methods of the EIGRP route networks It can indicate as follows:
Fig. 3 shows the specific implementation of the initial method of the FDEMC failure analysis methods of EIGRP route networks of the present invention Mode.When the FDEMC failure analysis methods initialization operation, in order to obtain the initial parameter value of each sample class, i.e. mean value With covariance matrix etc. it is necessary to carrying out operation of presorting to sample set.The performance of FDEMC failure analysis methods is largely It can be influenced by initiation parameter, on the one hand appropriate initiation parameter can accelerate the establishment speed of model, on the other hand Model table can be made to reveal excellent clustering performance.
FDEMC failure analysis methods are when calculating the initiation parameter of gauss hybrid models GMM, using random initializtion side Method, this method, which has, calculates the characteristics such as simple, operation is convenient.FDEMC failure analysis methods specifically include following steps:Step S31, it is assumed that the classification sum that input sample is concentrated is m, and center of the m sample as m cluster is randomly selected from sample set;Step Rapid S32 calculates in sample set all remaining samples with m cluster center at a distance from, relatively more each residue sample and m cluster center The residue sample is divided with apart from nearest cluster center in same class, and marks identical category by distance;Step S33, according to Data sample included in each cluster, calculates the weight of each cluster, mean value and covariance matrix parameter.
In another embodiment of the invention, in order to evaluate the clustering performance of FDEMC failure analysis methods, based on identical Training sample set, iterations and test sample collection are carried out at the same time the experimental analysis of K-MEANS algorithms and DBSCAN algorithms;K‐ MEANS is most common cluster algorithm, when the sample concentrated to training sample carries out practical cluster, according to sample and sample Euclidean distance size between this is classified;DBSCAN belongs to a kind of density clustering algorithm, the data sample selected from some It sets out as cluster center, constantly calculates the density at cluster center and other samples, expand to the reachable region of density to cluster range , finally obtain a maximization region for including core sample point and boundary sample point, any two in the same area Density is connected (i.e. density is reachable) between data sample.Experimental result is as shown in table 2, from Table 2, it can be seen that K-MEANS and The classification error rate of DBSCAN is both greater than 50%, and Clustering Effect is undesirable;Compared to the common clustering algorithm model of first two, The classification performance of FDEMC cluster algorithms is excellent, and error rate is far below first two Clustering Model.It can also be seen that from table 2 The number of clusters that three kinds of models finally create is essentially identical.
2 model experiment results of table compare
Fig. 4 shows the FDEMC cluster analysis results under different iterations.Figure 4, it is seen that FDEMC failures The sample misclassification rate of the analysis not linear decrease with the increase of iterations, when iterations are set as 50 times and 500 times, Lower value is presented in sample misclassification rate;When iterations rise to 200 times to 450 times, sample misclassification rate highest, Clustering Model Classification performance show at this time it is worst;When iterations are set greater than 500 times, sample misclassification rate also rises therewith.Cause This, when carrying out gauss hybrid models GMM training, in order to obtain optimum cluster performance, operation iterations should be set as 50 times.
Fig. 5 shows influence result of the different number of clusters amounts to FDEMC failure analysis methods.In clustering, cluster (cluster) quantity can both be automatically generated by algorithm, can also be needed manually to be arranged according to experiment.It can from Fig. 5 To find out, when cluster quantity differences, model misclassification rate also will produce variation.When cluster number is 5, model Misclassification rate is minimum, and only 24% or so;According to the actual conditions of training sample set, the zone of reasonableness of cluster quantity arrives for 3 10, when cluster number is other values, misclassification rate is all higher than 30%, illustrates in current training set and test set, gathers When the number of clusters of class model is 5, Clustering Model best performance.
Fig. 6 shows an a kind of embodiment of the FDEMC trouble analysis systems of EIGRP route networks of the present invention.In Fig. 6 In, the FDEMC trouble analysis systems 61 of EIGRP route networks include preprocessing module 62, model execution module 63, cluster result Output module 64.The routing message that wherein preprocessing module 62 is used to acquire in the EIGRP route networks carries out feature extraction And data prediction, form training sample set;
For machine learning algorithm, since sample data is unable to the sample data under limit all situations, Therefore feature extraction it is appropriate whether to a certain extent on the learning effect of machine learning influence it is very big, in the prior art There are many articles to show for same training sample, when the feature difference of extraction, the classifying quality brought also can be different, Therefore, correctly the feature of extraction EIGRP route networks is just particularly important.The feature extraction of EIGRP route networks and data Pretreatment refer to be by the EIGRP of capture routing Message processing can be with the choosing of the sample set of training machine learning model, including feature It takes and the arrangement of data format.Collected routing message amount is more in the present invention, and is also contaminated with OSPF routing reports The noise datas such as text, so the data progress data prediction to acquisition is most important, on the one hand pretreatment can improve data Quality, on the other hand data can be allowed to be better suited for analyzing.The routing message acquired in the present invention is from as shown in Figure 1 It is acquired in EIGRP route monitoring networks.
Preferably, EIGRP, which route the field in message included, version, operation code, verification and label, sequence number, confirmation The field of the various description routing iinformations such as sequence number, self-aid system number, failure route version in message, verification and sequence number etc. This kind of field is not different with normal routing message, so this kind of field will not be chosen as data sample feature.
Preferably, existing known fault type has six kinds, is that syntople establishes failure, routing publication failure, road respectively Failure is redistributed by installation failure, route flapping failure, Route summary failure, routing.The reason of syntople can not be established has K Value is different or not in the same domains AS, the K value fields routeing in message and the domains AS field will be selected as sample characteristics at this time, To identify adjacent failure.Remaining failure, which all refers to router, can not receive or issue rational routing entry, and reason has certainly It is dynamic summarize, IP address is discontinuous etc., so summarize automatically, whether IP address is continuously chosen as sample characteristics, to distinguish these Routing failure caused by reason.In addition, message length field contributes to fault distinguishing, therefore the field is also chosen as sample spy Sign.
Preferably, correlated characteristic is extracted in conjunction with primary routing message and EIGRP phenomena of the failure, in the routing message of acquisition, The field values such as sourceIP, destinationIP are invalid for actual analysis, and actual IP can constantly change, and can not pass through tool The IP address of body judges EIGRP routing failure reasons, so this kind of field is converted into useful feature information, such as will Two Continuous valued attributes of sourceIP and destinationIP be converted to sourceRoute and adjacencyRoute it is this kind of from Scattered classification value;In addition, in EIGRP route networks, the router setting in the different domains AS is different, these settings can't be It is embodied in routing message, but route network can be caused to be abnormal, this kind of setting needs are manually extracted as sample characteristics.Feature After the completion of extraction, initial data message is converted to the CSV formatted data files that can carry out machine learning analysis.In the present invention All sample characteristics and its meaning of selection such as aforementioned table 1 are shown.
By the analysis to routeing message in EIGRP route networks, the present invention proposes EIGRP route networks for identification The feature of failure includes at least operation code in the feature, label, confirms sequence number, self-aid system number, K value fields, the domains AS word Section, summarize automatically, IP address is discontinuous, source router, destination router, the setting of same area router.
In addition, when route network breaks down or is abnormal, by the data packet of capture it is found that there was only a small number of data packets In information can change, cause exceptional sample quantity be far less than normal sample quantity, make in sample set occur data not Equilibrium appearance has an impact accident analysis and model accuracy.There are many kinds of the methods of data balancing, can generally pass through receipts Collect more data and carry out balance sample collection, but eigrp routing protocol is in usual stable operation, even if breaking down or abnormal When, it is to be still HELLO data packets to be transmitted in route network at most, and fault data packet quantity does not have too big variation. So the method for taking oversampling in the present invention carries out data balancing, oversampling is by increasing minority class sample in sample set For quantity come to realize sample equilibrium, simplest method be exactly to replicate minority class sample to form a plurality of record, this method is easy, straight It connects, but the disadvantage is that if sample characteristics are less, is likely to result in over-fitting.Under normal circumstances, imbalanced training sets can also be led Cause feature distribution unbalanced, but group very this measurer has certain scale, then the distribution of its characteristic value is more uniform, passes through The cooperation of the feature with notable type is selected to participate in solving the problems, such as imbalanced training sets.In the data packet acquired in the present invention in addition to The data packet of EIGRP also wants the data packet of the Routing Protocols such as OSPF, RIP and BGP, these data packets are medium in training sample set It is same as noise data.Due to the presence of these noises, over-fitting can be prevented to a certain extent.
Model execution module 63, for training the gauss hybrid models GMM using the training sample set to obtain instruction Practice the Clustering Model completed;The Clustering Model that the training is completed is the gauss hybrid models GMM for having optimized parameter, and this is optimal Parameter is calculated by using EM algorithms.
Preferably, the model execution module 63 is specifically included with lower unit:
The training sample set is divided into m sample set by initial computation unit 631, and each sample set indicates one A classification calculates three mean value, covariance and weight initial parameter values of each classification on the basis of the m sample set To form the initial parameter vector set of the m classificationIts InThe weight initial value of the m classification is indicated respectively,The m is indicated respectively The mean value initial value of a classification,The covariance initial value of the m classification is indicated respectively;
Computing unit 632 is restrained, following two steps operation is repeatedly carried out, until the logarithm of the gauss hybrid models GMM Likelihood function is restrained:
E steps:Estimate in the training sample set and parameter current vector setIn the case of given, the training is calculated The conditional expectation of the log-likelihood function of sample set:
The wijIndicate that i-th of sample belongs to the probability of jth class;θ expression parameter vector set;X is indicated in sample set A sample;T=0,1 ..., J indicate the current iteration number of EM algorithms;J indicates the maximum iteration in EM algorithms;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class under current t values iteration Weight;Indicate sample average of the kth class under current t values iteration;Indicate association of the kth class under current t values iteration Variance matrix;
M steps:Update the parameter of gauss hybrid modelsMake the conditional expectation of the log-likelihood function Value maximizes:
WhereinIt is the weight of jth class,It is the mean value of jth class, is d dimensional vectors,It is the covariance matrix of jth class, For d rank symmetric positive definite matrixs, d indicates that the dimension of training sample, N indicate the number of training sample in the training sample.
Cluster result output module 64, the test sample for obtaining the EIGRP route networks, by the test sample It inputs in the Clustering Model that the training is completed, to obtain the classification results of the test sample and then identify the roads EIGRP By the operation conditions of network.The wherein test sample, can be identical with training sample, can also be different with training sample. Generally for the test effect for having trained the Clustering Model completed is determined, the test sample different with training sample can be selected to carry out Test.
The FDEMC trouble analysis systems of EIGRP routing networks proposed by the present invention are a kind of based on Unsupervised clustering study The system that method is realized, can analyze the fault type and reason of EIGRP route networks in real time, can also be to unknown Failure and reason carry out automatic identification, on the one hand can help the exception occurred in network related personnel's real-time oversight network and event Barrier etc., on the other hand can improve people to unknown abnormal understanding in network, management and dimension of the optimum management personnel to network Shield.Further, in conjunction with the feature extracted of the present invention, the FDEMC failure analysis methods in practical applications suitable can be improved Ying Xing, reliability and robustness.
Preferably, the execution modules of FDEMC models described in Fig. 6 63 are in the initiation parameter for calculating gauss hybrid models GMM When, initial computation unit 631 uses random initializtion method, and this method, which has, calculates the characteristics such as simple, operation is convenient.Specific step It is rapid as follows:The first step, it is assumed that the classification sum that input sample is concentrated is m, and m sample is randomly selected from sample set as m The center of cluster;Second step calculates all remaining samples in sample set and compares each remaining sample and m at a distance from m cluster center The residue sample is divided with apart from nearest cluster center in same class, and marks identical category by the distance at a cluster center;The Three steps carry out the parameters such as the weight, mean value and covariance matrix of each cluster according to data sample included in each cluster It calculates.
Fig. 4 shows the FDEMC cluster analysis results under different iterations.Figure 4, it is seen that FDEMC failures The sample misclassification rate of the analysis not linear decrease with the increase of iterations, when iterations are set as 50 times and 500 times, Lower value is presented in sample misclassification rate;When iterations rise to 200 times to 450 times, sample misclassification rate highest, point of model Class performance shows worst at this time;When iterations are set greater than 500 times, sample misclassification rate also rises therewith.Therefore, when When carrying out gauss hybrid models GMM training, in order to obtain optimum cluster performance, operation iterations should be set as 50 times.
Fig. 5 shows influence result of the different number of clusters amounts to FDEMC accident analyses.In clustering, cluster (cluster) quantity can both be automatically generated by algorithm, can also be needed manually to be arranged according to experiment.It can from Fig. 5 To find out, when cluster quantity differences, Clustering Model misclassification rate also will produce variation.When cluster number is 5, gather The misclassification rate of class model is minimum, and only 24% or so;According to the actual conditions of training sample set, cluster quantity it is reasonable Ranging from 3 to 10, when cluster number is other values, misclassification rate is all higher than 30%, illustrates in current training set and survey Examination is concentrated, and when the number of clusters of Clustering Model is 5, model performance is optimal.
It can be seen that the present invention provides a kind of FDEMC routing failures based on EM algorithms and gauss hybrid models GMM point Analyse method and system.FDEMC methods belong to unsupervised learning, and the sample that training and test sample are concentrated is without category label;Due to The Clustering features that FDEMC has can not only diagnose known fault, while can also gather to unknown failure sample Class, it can be found that unknown route network failure.I.e. the present invention is realized by FDEMC methods to the real-time of routing network failure Analysis, meets the requirement in practical application.In addition, by the feature in reasonable drawing EIGRP route networks, effectively increase The recognition effect of FDEMC clusters, improves reliability, robustness and the adaptability of the FDEMC failure analysis methods and system.
Example the above is only the implementation of the present invention is not intended to limit the scope of the invention, every to utilize this hair Equivalent structure transformation made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant technical fields, Similarly it is included within the scope of the present invention.

Claims (10)

1. a kind of FDEMC failure analysis methods of EIGRP route networks, the FDEMC failure analysis methods are to be based on EM algorithms With the clustering method of gauss hybrid models GMM, realize to the unsupervised of the EIGRP route networks known fault and unknown failure Analysis, which is characterized in that include the following steps:
Step S1, the routing message acquired in the EIGRP route networks carry out feature extraction and data prediction, form training Sample set;
Step S2 trains the gauss hybrid models GMM to obtain the Clustering Model of training completion using the training sample set; The Clustering Model that the training is completed is the gauss hybrid models GMM for having optimized parameter, and the optimized parameter is calculated by using EM Method is calculated;
Step S3 obtains the test sample of the EIGRP route networks, and the test sample is inputted the poly- of the training completion In class model, to obtain the classification results of the test sample and then identify the operation conditions of the EIGRP route networks;
Further, the feature extracted in the step S1 includes at least operation code, label, confirms sequence number, self-aid system number, K Value field, the domains AS field, summarize automatically, IP address is discontinuous, source router, destination router, indicates that same area router is not set The feature set.
2. a kind of FDEMC failure analysis methods of EIGRP route networks according to claim 1, which is characterized in that described Step S2 includes the following steps:
The training sample set is divided into m sample set by step S21, and each sample set indicates a classification, described Three mean value, covariance and weight initial parameter values of each classification are calculated on the basis of m sample set to form the m The initial parameter vector set of classification WhereinThe weight initial value of the m classification is indicated respectively,Institute is indicated respectively The mean value initial value of m classification is stated,The covariance initial value of the m classification is indicated respectively;
Step S22 repeatedly carries out following two steps operation, until the log-likelihood function of the gauss hybrid models GMM is restrained:
E steps:Estimate in the training sample set and parameter current vector set
In the case of given, the trained sample is calculated The conditional expectation of the log-likelihood function of this collection:
The wijIndicate that i-th of sample belongs to the probability of jth class, x indicates that the sample of d dimensions, θ indicate gauss hybrid models Parameter vector set;T=0,1 ..., J indicate that the current iteration number of EM algorithms, J indicate the greatest iteration time in EM algorithms Number;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class in current t values iteration Under weight;Indicate sample average of the kth class under current t values iteration;Indicate kth class under current t values iteration Covariance matrix;
M steps:Update the parameter of gauss hybrid models GMMMake the conditional expectation of the log-likelihood function It maximizes:
WhereinIt is the weight of jth class,It is the mean value of jth class,It is the covariance matrix of jth class, N indicates the trained sample The number of training sample in this.
3. the FDEMC failure analysis methods of EIGRP route networks according to claim 2, which is characterized in that step S21 The model start-up parameter that the gauss hybrid models GMM is obtained using random initializtion method, specifically includes following steps:
First, it concentrates the center for being arbitrarily designated m sample as cluster, wherein m to indicate classification sum in training sample, is just whole Number;
Second, it calculates separately training sample and concentrates at a distance from m sample of other samples and this, using will be each apart from nearest principle Sample is divided into the one type in m classes, and distributes identical class label;
Third is calculated according to all kinds of all sample values per a kind of weight, mean value and covariance matrix.
4. the FDEMC failure analysis methods of EIGRP route networks according to claim 1, which is characterized in that high when carrying out When the GMM training of this mixed model, the maximum iteration of EM algorithms is set as 50 times.
5. the FDEMC failure analysis methods of EIGRP route networks according to claim 1, which is characterized in that according to training The actual conditions of sample set, the quantity for clustering cluster are arranged ranging from 3 to 10.
6. a kind of FDEMC trouble analysis systems of EIGRP route networks, which is characterized in that the FDEMC trouble analysis systems packet Preprocessing module, model execution module, cluster result output module are included, to realize to the EIGRP route networks known fault With the unsupervised analysis of unknown failure, wherein:
Preprocessing module carries out feature extraction and data prediction for acquiring the routing message in the EIGRP route networks, Form training sample set;
Model execution module, for training the gauss hybrid models GMM to obtain trained completion using the training sample set Clustering Model;The Clustering Model that the training is completed is the gauss hybrid models GMM for having optimized parameter, and the optimized parameter is logical It crosses and is calculated using EM algorithms;
The test sample is inputted institute by cluster result output module, the test sample for obtaining the EIGRP route networks In the Clustering Model for stating training completion, to obtain the classification results of the test sample and then identify the EIGRP route networks Operation conditions;
Further, the feature of preprocessing module extraction include at least operation code, label, confirm sequence number, self-aid system number, K value fields, the domains AS field, summarize automatically, IP address is discontinuous, router in source router, destination router and representative domain The feature of setting.
7. the FDEMC trouble analysis systems of EIGRP route networks according to claim 6, which is characterized in that the model Execution module is specifically included with lower unit:
The training sample set is divided into m sample set by initial computation unit, and each sample set indicates a classification, Three mean value of each classification, covariance and weight initial parameter values are calculated on the basis of the m sample set to be formed The initial parameter vector set of the m classificationIts InThe weight initial value of the m classification is indicated respectively,The m is indicated respectively The mean value initial value of a classification,The covariance initial value of the m classification is indicated respectively;
Computing unit is restrained, following two steps operation is repeatedly carried out, until the log-likelihood function of the gauss hybrid models GMM Convergence:
E steps:Estimate in the training sample set and parameter current vector setIt gives In the case of fixed, the conditional expectation of the log-likelihood function of the training sample set is calculated:
The wijIndicate that i-th of sample belongs to the probability of jth class, x indicates that the sample of d dimensions, θ indicate gauss hybrid models Parameter vector set, t=0,1 ..., J indicate that the current iteration number of EM algorithms, J indicate the greatest iteration time in EM algorithms Number;Parameter value of the kth class under current t values iteration in expression parameter vector set;Indicate kth class in current t values iteration Under weight;Indicate sample average of the kth class under current t values iteration;Indicate kth class under current t values iteration Covariance matrix;
M steps:Update the parameter of gauss hybrid modelsKeep the conditional expectation of the log-likelihood function maximum Change:
WhereinIt is the weight of jth class,It is the mean value of jth class,It is the covariance matrix of jth class, N indicates the trained sample The number of training sample in this.
8. the FDEMC trouble analysis systems of EIGRP route networks according to claim 7, which is characterized in that described initial Computing unit obtains the model start-up parameter of the gauss hybrid models GMM using random initializtion method, specifically includes following Step:
First, the center for being arbitrarily designated m sample as cluster is concentrated in training data, N is classification sum herein;
Second, it calculates separately at a distance from m sample of other samples and this, each sample is divided into m using apart from nearest principle In one type in class, and distribute identical class label;
Third is calculated according to all kinds of all sample values per a kind of weight, mean value and covariance matrix.
9. the FDEMC trouble analysis systems of EIGRP route networks according to claim 6, which is characterized in that high when carrying out When the GMM training of this mixed model, the maximum iteration of EM algorithms is set as 50 times.
10. the FDEMC trouble analysis systems of EIGRP route networks according to claim 6, which is characterized in that according to instruction The actual conditions for practicing sample set, the quantity for clustering cluster are arranged ranging from 3 to 10.
CN201810233540.2A 2018-03-21 2018-03-21 A kind of analysis method and system of EIGRP route networks failure Pending CN108494594A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810233540.2A CN108494594A (en) 2018-03-21 2018-03-21 A kind of analysis method and system of EIGRP route networks failure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810233540.2A CN108494594A (en) 2018-03-21 2018-03-21 A kind of analysis method and system of EIGRP route networks failure

Publications (1)

Publication Number Publication Date
CN108494594A true CN108494594A (en) 2018-09-04

Family

ID=63318832

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810233540.2A Pending CN108494594A (en) 2018-03-21 2018-03-21 A kind of analysis method and system of EIGRP route networks failure

Country Status (1)

Country Link
CN (1) CN108494594A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110309492A (en) * 2019-06-29 2019-10-08 河北工业大学 Wind power generating set health degree appraisal procedure based on scatter diagram Data Dimensionality Reduction technology
CN110879971A (en) * 2019-10-23 2020-03-13 上海宝信软件股份有限公司 Method and system for predicting abnormal operation condition of industrial production equipment
CN111077876A (en) * 2019-12-11 2020-04-28 湖南大唐先一科技有限公司 Power station equipment state intelligent evaluation and early warning method, device and system
CN111614518A (en) * 2020-05-20 2020-09-01 中国电子科技集团公司第五十四研究所 Automatic safety testing method based on OSPF protocol
CN112488371A (en) * 2020-11-20 2021-03-12 国网福建省电力有限公司厦门供电公司 Pipeline intelligent early warning method and system based on big data
CN114630352A (en) * 2020-12-11 2022-06-14 中国移动通信集团湖南有限公司 Fault monitoring method and device of access equipment
CN115430815A (en) * 2022-08-09 2022-12-06 衡阳镭目科技有限责任公司 Crystallizer liquid level control method and device, electronic equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107483267A (en) * 2017-09-19 2017-12-15 中国人民解放军防空兵学院 A kind of EIGRP routing failures recognition methods

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107483267A (en) * 2017-09-19 2017-12-15 中国人民解放军防空兵学院 A kind of EIGRP routing failures recognition methods

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张德喜等: "一种适合于大数据集处理的混合EM算法", 《计算机应用》 *
陈英: "高斯混合模型聚类及其优化算法研究", 《中国优秀硕士毕业论文全文数据库信息科技辑》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110309492A (en) * 2019-06-29 2019-10-08 河北工业大学 Wind power generating set health degree appraisal procedure based on scatter diagram Data Dimensionality Reduction technology
CN110309492B (en) * 2019-06-29 2020-08-04 河北工业大学 Wind generating set health degree evaluation method based on diffusion map data dimension reduction technology
CN110879971A (en) * 2019-10-23 2020-03-13 上海宝信软件股份有限公司 Method and system for predicting abnormal operation condition of industrial production equipment
CN110879971B (en) * 2019-10-23 2023-06-13 上海宝信软件股份有限公司 Industrial production equipment operation abnormal condition prediction method and system
CN111077876A (en) * 2019-12-11 2020-04-28 湖南大唐先一科技有限公司 Power station equipment state intelligent evaluation and early warning method, device and system
CN111077876B (en) * 2019-12-11 2021-06-08 湖南大唐先一科技有限公司 Power station equipment state intelligent evaluation and early warning method, device and system
CN111614518A (en) * 2020-05-20 2020-09-01 中国电子科技集团公司第五十四研究所 Automatic safety testing method based on OSPF protocol
CN111614518B (en) * 2020-05-20 2021-08-31 中国电子科技集团公司第五十四研究所 Automatic safety testing method based on OSPF protocol
CN112488371A (en) * 2020-11-20 2021-03-12 国网福建省电力有限公司厦门供电公司 Pipeline intelligent early warning method and system based on big data
CN114630352A (en) * 2020-12-11 2022-06-14 中国移动通信集团湖南有限公司 Fault monitoring method and device of access equipment
CN114630352B (en) * 2020-12-11 2023-08-15 中国移动通信集团湖南有限公司 Fault monitoring method and device for access equipment
CN115430815A (en) * 2022-08-09 2022-12-06 衡阳镭目科技有限责任公司 Crystallizer liquid level control method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN108494594A (en) A kind of analysis method and system of EIGRP route networks failure
Verma et al. Evaluation of network intrusion detection systems for RPL based 6LoWPAN networks in IoT
Zhang et al. Social influence locality for modeling retweeting behaviors
Wang et al. On credibility estimation tradeoffs in assured social sensing
De Donato et al. Traffic identification engine: an open platform for traffic classification
CN110414555A (en) Detect the method and device of exceptional sample
US20160212163A1 (en) Method and Apparatus to Identify the Source of Information or Misinformation in Large-Scale Social Media Networks
Le et al. Data analytics on network traffic flows for botnet behaviour detection
Sun et al. An intrusion detection model for wireless sensor networks with an improved V-detector algorithm
CN107241358A (en) A kind of smart home intrusion detection method based on deep learning
Zhang et al. A data set accuracy weighted random forest algorithm for IoT fault detection based on edge computing and blockchain
CN108632269A (en) Detecting method of distributed denial of service attacking based on C4.5 decision Tree algorithms
CN108683564B (en) Network simulation system reliability evaluation method based on multidimensional decision attributes
CN108650649B (en) Abnormal data detection method suitable for wireless sensor network
Du et al. LTrust: An adaptive trust model based on LSTM for underwater acoustic sensor networks
CN106060039B (en) A kind of classification and Detection method of network-oriented abnormal data stream
Xiao et al. A traffic classification method with spectral clustering in SDN
Atli Anomaly-based intrusion detection by modeling probability distributions of flow characteristics
Eunice et al. Network anomaly detection technology based on deep learning
Guo et al. FullSight: A feasible intelligent and collaborative framework for service function chains failure detection
CN112291226B (en) Method and device for detecting abnormity of network flow
Tran Network anomaly detection
Choukri et al. A novel deep learning-based framework for blackhole attack detection in unsecured RPL networks
CN114866301B (en) Encryption traffic identification and classification method and system based on direct push graph
Rahman et al. ANFIS based cyber physical attack detection system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180904