CN118075090A

CN118075090A - Network fault prediction method based on machine learning

Info

Publication number: CN118075090A
Application number: CN202410148841.0A
Authority: CN
Inventors: 郭俊涛; 郑昌国
Original assignee: Inspur Communication Information System Co Ltd
Current assignee: Inspur Communication Information System Co Ltd
Priority date: 2024-02-02
Filing date: 2024-02-02
Publication date: 2024-05-24

Abstract

The invention provides a network fault prediction method based on machine learning, which belongs to the technical field of networks and collects data in the modes of SNMP protocol, netFlow protocol and the like; and preprocessing the collected data, including data cleaning, feature extraction and other operations. The data cleaning can remove interference factors such as abnormal values, missing values and the like, and the feature extraction can extract the features related to network faults from the data; model training is carried out on the preprocessed data by using a machine learning algorithm, and a network fault prediction model is obtained; the network is monitored and optimized according to the prediction result, and when a certain device is predicted to possibly fail, the device can be maintained or replaced in advance.

Description

Network fault prediction method based on machine learning

Technical Field

The invention relates to the technical field of networks, in particular to a network fault prediction method based on machine learning.

Background

With the rapid development of network technology, the scale and complexity of the network are continuously increased, and network faults are also increased. The network fault prediction is used as an effective technical means, so that potential faults can be found in advance, and the occurrence of the faults is avoided or the influence of the faults is reduced. However, the current network fault prediction method has some problems, such as low prediction accuracy, poor real-time performance, and the like, and cannot meet the requirements of practical application. Therefore, a method capable of accurately predicting network failure in real time is needed.

Disclosure of Invention

In order to solve the technical problems, the invention provides a network fault prediction method based on machine learning, which is used for solving the problems of low prediction accuracy and poor real-time performance existing in the existing network fault prediction method.

The technical scheme of the invention is as follows:

a machine learning based network failure prediction method, comprising:

Step 1) the acquisition and cleaning of data,

Step 2) training and verification of the model,

Step 3) network fault real-time prediction and monitoring

Further, the method specifically comprises the following steps:

collecting network operation data including alarm log data of network equipment and the like;

Preprocessing the collected data, including data cleaning, feature extraction and other operations;

model training is carried out on the preprocessed data by using a machine learning algorithm, and a network fault prediction model is obtained;

inputting network operation data acquired in real time into a trained network fault prediction model to obtain a prediction result;

And monitoring and optimizing the network according to the prediction result, finding potential faults in time and taking corresponding measures.

Still further, the processing includes:

(01) Collecting data by SNMP protocol and NetFlow protocol;

(02) The collected data comprises alarm log data;

(03) Extracting and reading each field in the collected original alarm log data;

(04) Preliminary pretreatment of original alarm log data:

(05) The pretreatment comprises the following steps:

(06) Denoising: removing data which cannot be interpreted in the system;

(07) Filling up missing values: and deducing whether the information has a missing condition according to the corresponding format of the alarm log data field. If the field is missing, filling is needed to be carried out manually, and a complete alarm log is formed;

(08) The original alarm log after preliminary pretreatment passes through a rule interpretation module;

(09) Inquiring alarm lists of all versions in an alarm log database;

(10) Resolving the corresponding field of the extracted original alarm log;

(11) The alarm log data with high readability is converted into alarm log data;

(12) Designing and building a network fault prediction model; the method comprises the following steps:

(13) Collecting an original alarm log;

(14) Cleaning dirty data after analyzing the log;

(15) Clustering the initial data set to obtain clustering features and extracting features according to the attribute of the alarm log data;

(16) Taking an alarm title, a manufacturer alarm level and alarm interpretation as judging factors;

(17) Defining the network fault risk degree;

(18) Feature conversion: converting the multi-vector into a new feature set;

(19) Dividing alarm log data into a training set, a verification set and a test set;

(20) Setting model parameters such as weight w and bias b;

(21) Training a training set by using different algorithm models;

(22) Taking the cross entropy loss function as a standard for evaluating the performance of the model;

(23) Adjusting model parameters;

(24) Until the loss function is minimum, finding out the most suitable w and b;

(25) Obtaining model parameters of each algorithm;

(26) Verifying the performances of the different models through a verification set;

(27) Evaluating the performance of each algorithm according to the indexes of the accuracy rate, the recall rate and the F value;

(28) And determining a network fault prediction algorithm model.

The invention has the beneficial effects that

The network operation data is modeled and predicted through a machine learning algorithm, so that potential network faults can be accurately found in real time. Meanwhile, the invention can also carry out customized model training according to actual requirements so as to meet the requirements of different scenes. In addition, the invention can also apply the prediction result to the monitoring and optimization of the network, and improve the reliability and performance of the network.

Drawings

FIG. 1 is a schematic workflow diagram of the present invention;

FIG. 2 is a training flow diagram of the predictive model of the present invention.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments, and all other embodiments obtained by those skilled in the art without making any inventive effort based on the embodiments of the present invention are within the scope of protection of the present invention.

The invention provides a network fault prediction method based on machine learning, which is realized by the following technical scheme:

(1) Collecting data through SNMP protocol;

(2) The collected data is specifically alarm log data;

(3) Extracting and reading each field in the collected original alarm log data;

(4) Preliminary pretreatment of original alarm log data:

(5) The pretreatment comprises the following steps:

(6) Denoising: removing data which cannot be interpreted in the system;

(7) Filling up missing values: and deducing whether the information has a missing condition according to the corresponding format of the alarm log data field. If the field is missing, filling is needed to be carried out manually, and a complete alarm log is formed;

(8) The original alarm log after preliminary pretreatment passes through a rule interpretation module;

(9) Inquiring alarm lists of all versions in an alarm log database;

(10) Resolving the corresponding field of the extracted original alarm log;

(11) The alarm log data with high readability is converted into alarm log data;

(12) Designing a network fault prediction model; the method comprises the following steps:

(13) Collecting an original alarm log;

(14) Cleaning dirty data after analyzing the log;

(15) Clustering the initial data set by using a K-means method to obtain clustering features and extracting features according to the attribute of the alarm log data;

(17) Defining the network fault risk degree (fault) as 0, 1 and 2;

(18) fault is 0, and the corresponding network system has no fault; fault is 1, 2 corresponding to the network system having failed;

(19) Feature conversion: converting the multi-vector into a new feature set through an RTE algorithm;

(20) Dividing alarm log data into a training set, a testing set and a verification set according to a ratio of 6:2:2;

(21) 60% of log samples are training sets, 20% of log samples are verification sets, and 20% of log samples are test sets;

(22) Setting model parameters such as weight w and bias b;

(23) Selecting XGBoost, random forests, bayes and convolutional neural network algorithm models, and utilizing training sets to respectively model parameters;

(24) Taking the cross entropy loss function as a standard for evaluating the performance of the model;

(25) Adjusting model parameters;

(26) Until the loss function is minimum, finding out the most suitable w and b of the model;

(27) Obtaining model parameters of each algorithm;

(28) Verifying the performances of the different models through a verification set;

(29) Evaluating the performance of each algorithm according to the indexes of the accuracy rate, the recall rate and the F value;

(30) Determining a network fault prediction algorithm model;

(31) Network operation data acquired in real time are input into a trained network fault prediction model;

(32) Obtaining a network fault prediction result;

(33) Monitoring and optimizing the network according to the prediction result;

(34) Finding a potential fault;

(35) And adopting corresponding measures to solve the faults.

The foregoing description is only illustrative of the preferred embodiments of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention are included in the protection scope of the present invention.

Claims

1. A network fault prediction method based on machine learning is characterized in that,

Comprising

Step 1) data acquisition and cleaning;

Step 2) training and verifying a model;

And 3) predicting and monitoring network faults in real time.

2. The method of claim 1, wherein the step of determining the position of the substrate comprises,

Collecting network operation data, including alarm log data of network equipment;

preprocessing the collected data, including data cleaning and feature extraction;

3. The method of claim 2, wherein the step of determining the position of the substrate comprises,

Data are collected through SNMP protocol and NetFlow protocol modes.

4. The method of claim 3, wherein the step of,

The data is cleaned to remove interference factors of abnormal values and missing value classes, and the characteristics are extracted to extract the characteristics related to network faults from the data.

5. A method according to claim 2 or 3, characterized in that,

Data acquisition and cleaning) specifically includes the following steps:

(1-1) extracting and interpreting each field in the collected original alarm log data;

(1-2) preliminary preprocessing of original alarm log data:

(1-3) the original alarm log after preliminary pretreatment passes through a rule interpretation module;

(1-4) querying an alarm list of each version in an alarm log database;

(1-5) resolving the corresponding field of the extracted original alarm log;

(1-6) converting the alarm log data into alarm log data with high readability.

6. The method of claim 5, wherein the step of determining the position of the probe is performed,

The preliminary pretreatment includes the following steps:

Denoising: removing data which cannot be interpreted in the system;

Filling up missing values: deducing whether the information has a missing condition according to the corresponding format of the alarm log data field; if the field is missing, the field needs to be filled manually to form a complete alarm log.

7. The method of claim 2, wherein the step of determining the position of the substrate comprises,

Step 2) training and verifying a model, which specifically comprises the following steps:

(2-1) collecting an original alarm log;

(2-2) cleaning dirty data after analyzing the log;

(2-3) clustering the initial data set to obtain clustering features and extracting features according to the attribute of the alarm log data;

(2-4) taking an alarm title, a manufacturer alarm level and an alarm interpretation as judging factors;

(2-5) defining a network failure risk level;

(2-6) feature transformation: converting the multi-vector into a new feature set;

(2-7) dividing the alarm log data into a training set, a verification set and a test set;

(2-8) setting model parameters such as weight w and bias b;

(2-9) training the training set using different algorithmic models;

(2-10) taking the cross entropy loss function as a standard of model performance evaluation;

(2-11) adjusting model parameters;

(2-12) until the loss function is minimum, finding out the most suitable w and b;

(2-13) obtaining model parameters of each algorithm;

(2-14) validating the performance of the different models by the validation set;

(2-15) evaluating the performance of each algorithm based on the precision, recall and F-value multiple indicators; (2-16) determining a network failure prediction algorithm model.

8. The method of claim 7, wherein the step of determining the position of the probe is performed,

The initial dataset was clustered using the K-means method.