CN112187820B

CN112187820B - Power distribution terminal DTU intrusion detection method and system based on machine learning

Info

Publication number: CN112187820B
Application number: CN202011073339.6A
Authority: CN
Inventors: 吕志宁; 邓巍; 宁柏锋; 刘威; 罗伟峰; 徐文渊; 冀晓宇; 蒋燕; 李鹏; 习伟
Original assignee: China South Power Grid International Co ltd; Shenzhen Power Supply Co ltd; Zhejiang University ZJU
Current assignee: China South Power Grid International Co ltd; Shenzhen Power Supply Co ltd; Zhejiang University ZJU
Priority date: 2020-10-09
Filing date: 2020-10-09
Publication date: 2022-10-21
Anticipated expiration: 2040-10-09
Also published as: CN112187820A

Abstract

The invention discloses a power distribution terminal DTU intrusion detection method and system based on machine learning, which belong to the field of intelligent power grid safety. The method adopts a principal component analysis method to reduce high-dimensional characteristic data, and then utilizes the characteristic after dimensional reduction to establish a model; secondly, double verification is carried out by adopting a least square support vector machine and a neural network algorithm so as to improve the detection accuracy and reduce the false alarm rate; finally, the intrusion detection system framework adopts a modular design, is suitable for intrusion detection in the field of smart power grids, and has good portability and universality.

Description

Power distribution terminal DTU intrusion detection method and system based on machine learning

Technical Field

The invention belongs to the field of intelligent power grid safety, and particularly relates to a power distribution terminal DTU intrusion detection method and system based on machine learning.

Background

The automation and intellectualization of the power distribution network can be used for optimizing the allocation of national energy resources, ensuring the safe and stable operation of an electric power system and promoting the development of the national strategic emerging industry. In recent years, as the combination of an electric power system and a communication network is more and more compact, the security threats from the internet are more complex and diversified, the information security problem of a power distribution network becomes more and more prominent, and especially, the microgrid controller device of a power distribution terminal is frequently attacked by the network, so that the normal production and operation of the electric power system are seriously hindered. An intelligent power Distribution Terminal DTU (Distribution Terminal Unit) is used as a core device in a power Distribution network and is used for monitoring the operation state of a transformer area in real time to ensure that a power Distribution system can operate safely and reliably. With the high-speed development of the intelligent power distribution network, the network environment and the network attack types are more and more complex and changeable, and the vulnerability of the security defense mechanism for the distribution transformer terminal is more and more prominent at present. The intelligent power distribution terminal in the power engineering control system is subjected to intrusion detection, so that network attacks can be timely discovered and processed, the current situation of passive defense of a power distribution network system is changed, and the power utilization safety risk and the economic loss are reduced.

At present, an intrusion detection system mainly detects hacker attacks and network viruses by analyzing network data packets in an industrial control system environment, and triggers an alarm system once an anomaly is detected, and generally consists of three modules, namely a data collection module, a transmission module and a processing module. However, in the field of smart power grids, with the increase of the number of power distribution terminals, more and more data are required to be processed by computers, the traditional intrusion detection system is difficult to meet the requirements, and it is necessary to ensure the safety of a power grid system and improve the response speed and accuracy of the intrusion detection system.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a DTU intrusion detection method and a DTU intrusion detection system based on machine learning, wherein the network flow and related network information of a power distribution terminal are utilized to carry out intrusion detection on the electric power industrial control attack, and an evolutionary algorithm combining a neural network algorithm and a least square support vector machine algorithm is used, so that the defect of local optimization of the traditional neural network algorithm is overcome, and meanwhile, the accuracy of DTU intrusion detection is greatly improved. The system consists of three subsystems of data collection, data transmission and data processing, each submodule in each subsystem has better independence, and the system has better universality and mobility in the field of electric power industrial control.

In order to achieve the purpose, the invention adopts the following technical scheme:

a power distribution terminal DTU intrusion detection method based on machine learning comprises the following steps:

step 1: establishing a C/S communication framework of a server-client, creating a socket object, and collecting DTU information data of a power distribution terminal;

step 2: preprocessing DTU information data to obtain an original feature set, screening out a preset number of features from the original feature set through a principal component analysis method to serve as final features, and obtaining a training sample set;

and step 3: building a neural network model composed of an input layer, a hidden layer and an output layer as a first classifier, wherein the input layer is responsible for receiving the screened final characteristics, the hidden layer is used for processing characteristic values, the hidden layer comprises initial weights, network objective functions and activation functions of the characteristic values, and the output layer is responsible for outputting neural network results;

and 4, step 4: building a least square support vector machine model as a second classifier, mapping the screened final features to a high-dimensional feature space through nonlinear mapping, then constructing an optimal decision function in the high-dimensional feature space based on a structural risk minimization principle, replacing dot product operation in the high-dimensional feature space with a kernel function of an original space, and outputting a result of the least square support vector machine;

and 5: performing ensemble learning on the first classifier and the second classifier to form a strong classifier evolution model; in ensemble learning, firstly, training and verifying a first classifier in an 8-fold cross verification mode by using a training sample set to obtain a classification error rate of the first classifier, and further calculating a weight coefficient of the first classifier in a strong classifier evolution model;

then updating the weight distribution of the training sample set to increase the weight of the sample with wrong prediction in the first classifier and decrease the weight with correct prediction, and then normalizing all weights; training and verifying the second classifier by using the training sample set with the updated weight distribution in an 8-fold cross verification mode to obtain a classification error rate of the second classifier, and further calculating a weight coefficient of the second classifier in a strong classifier evolution model;

finally, forming a trained strong classifier evolution model;

step 6: and (3) acquiring DTU information data of the power distribution terminal in real time through a C/S communication framework of a server-client, extracting features according to the screening result of the step (3), carrying out real-time intrusion detection on the DTU feature data by using a trained strong classifier evolution model, judging whether the DTU of the power distribution terminal is in a normal working state or in an abnormal working state suffering from attack, and giving an alarm if the DTU is in the abnormal working state.

Further, in step 1, if the acquired numerical characteristic variable has a default value, the characteristic is complemented by using a linear difference method, that is, the characteristic is complemented by using a linear difference method

In the formula y ₀ And x ₀ Respectively record the characteristic value of the previous strip of the dataAnd the number of rows of the corresponding feature, y ₁ And x ₁ The characteristic value and the line number of the corresponding characteristic are recorded for the next piece of the data respectively.

Further, obtaining original feature samples of DTU feature data, performing eigenvalue decomposition on covariance matrixes of the collected original feature samples of the DTU feature data through a principal component analysis method, solving eigenvectors, selecting the first q principal component features as final features according to the magnitude of the eigenvalue, and obtaining a training sample set.

Further, the step 5 specifically includes:

step 5.1: dividing the training sample set into 8 parts in equal proportion; using 7 of the samples for training, 1 sample for testing, the classification error rate e is obtained _i (x) (ii) a Go through a round of training to obtain 8 times of prediction output results in total, will

The classification error rate as the first classifier is denoted as e _NN ；

Step 5.2: calculating the weight coefficient alpha of the first classifier in the strong classifier evolution model _NN ，

Step 5.3: the weight distribution of the training sample set is updated,

D ₂ ＝(w _2,1 ,…,w _2,i ,…,w _2,N )

wherein N refers to the number of samples; d ₂ Representing the updated weight set; w is a _2,i Represents the weight of the updated ith sample data,w _1,i weight of ith sample data to initialize, w _1,i =1/N, i =1,2, …, N; z is a normalization factor for ensuring D ₂ The sum of the total weights is 1,y _i To true value, G ₁ (x _i ) Is the predicted value of the first classifier, when the prediction is correct, y _i G ₁ (x _i ) =1, when prediction error, y _i G ₁ (x _i ) = -1; alpha is a weight parameter, 0<α<1, the larger alpha, w _2,i The more obvious the updating effect is;

step 5.4: dividing the training sample set after updating weight distribution into 8 parts in equal proportion, using 7 parts of the training samples to train, using 1 part of the testing samples to train and verify the second classifier, traversing one round of training to obtain the classification error rate of the second classifier, and marking as e _LSSVM ；

Step 5.5: calculating the weight coefficient alpha of the second classifier in the strong classifier evolution model _LSSVM ，

Step 5.6: constructing a trained strong classifier evolution model, and expressing as follows:

G(x)＝sign(f(x))

f(x)＝α _NN G _NN (x)+α _LSSVM G _LSSVM (x)

wherein G (x) represents the evolution model of the strong classifier, f (x) represents the linear combination of the two classifiers, and alpha _NN And alpha _LSSVM A weight coefficient representing the degree of importance of the first classifier and the second classifier; sign (·) indicates that the system is judged to be normal as 1, and the system is judged to be abnormal as-1, so as to finally achieve the purpose of classification.

Another objective of the present invention is to provide a power distribution terminal DTU intrusion detection system based on the above method, including:

the data collection subsystem is used for collecting DTU information data of the power distribution terminal;

the data transmission subsystem is used for transmitting the data collected by the data collection subsystem to the data processing subsystem;

and the data processing subsystem is used for preprocessing the DTU information data, extracting the characteristic value, constructing and training a strong classifier evolution model, detecting the working state of the DTU of the power distribution terminal in real time by using the trained strong classifier evolution model, and sending an alarm if the state is abnormal.

Compared with the prior art, the invention has the beneficial effects that:

(1) The invention discloses a power distribution terminal intrusion detection process which comprises the following steps: the method comprises the steps of data collection, preprocessing, feature extraction, establishment of an evolution model combining a neural network algorithm and a least square support vector machine algorithm, a training model and intrusion detection of a power distribution terminal. The neural network is simple in structure and high in operation speed, and the problem that the neural network is easy to fall into a local minimum value exists when an optimized solution is solved at a high speed. Therefore, the quadratic programming problem in the support vector machine is changed into a solution equation set by further adopting a least square support vector machine, so that the great workload is simplified, the calculation speed is high under the condition of large-scale data, and the local optimization can be avoided.

(2) According to the invention, the network information of the power distribution network core device power distribution terminal is collected and the characteristics of the power distribution network core device power distribution terminal are extracted by the high-performance host computer through constructing the C/S communication architecture, and besides, the characteristic dimension is reduced in the characteristic selection by adopting the principal component analysis, so that the method is beneficial to extracting important information and discarding useless information.

(3) The intrusion detection system adopts a frame type design, each submodule has better independence, and the system has better universality and mobility in the field of electric power industrial control.

(4) The machine learning algorithm in the invention uses an evolutionary algorithm combining a neural network algorithm and a least square support vector machine algorithm, and introduces weight distribution in a training data set based on a training result of a first classifier, thereby realizing large weight for a basic classifier with small classification error rate and small weight for a basic classifier with large classification error rate, breaking through the defect of local optimum of the traditional neural network algorithm, and simultaneously greatly improving the accuracy of DTU intrusion detection.

Drawings

FIG. 1 is a block diagram of an intrusion detection system according to the present invention;

FIG. 2 is a flow chart of a method of the present invention;

FIG. 3 is a model cross-validation flow diagram;

fig. 4 is an overall operation block diagram of the intrusion detection system facing the power distribution terminal.

Detailed Description

The invention is further explained below with reference to the figures and examples.

The invention provides a DTU intrusion detection method and a DTU intrusion detection system based on machine learning, as shown in figure 2, the DTU intrusion detection system is composed of three subsystems of data collection, data transmission and data processing, intrusion detection is carried out on power industrial control attacks by utilizing network flow and related network information of a power distribution terminal, and a working flow chart of the intrusion detection system is shown in figure 2.

The specific working method of the system is as follows:

step 1: aiming at the requirements in the application of a power grid system, an intrusion detection system framework based on machine learning is constructed. The method comprises the following specific steps:

step 1.1: and establishing a data collection subsystem with the DTU as a client.

Step 1.2: a data transmission subsystem based on a socket interface technology is established.

Step 1.3: and establishing a data processing subsystem taking a high-performance PC as a server side.

And 2, step: and establishing a C/S communication framework of a server-client, creating a socket object, and collecting DTU information data of the power distribution terminal. The method comprises the following specific steps:

step 2.1: respectively creating socket objects of the DTU and the host;

step 2.2: binding a server address to realize communication between the power distribution terminal and the host;

step 2.3: the method comprises the steps that a host periodically collects DTU information data of a power distribution terminal;

step 2.4: recording the collected data as D; for the collected DTU information data of the power distribution terminal, the method comprises the following steps:

send _ byte: the number of bits of data transmitted from the power distribution terminal;

receive _ byte: the number of bits received by the power distribution terminal;

memory _ use: memory occupancy rate;

cpu _ use: the CPU utilization rate;

real _ time: a time stamp;

rcv _ des: a packet destination address;

src _ des: a packet source address;

length: a packet length;

pow _ csp: power consumption;

temp: (ii) temperature;

link _ flag: a connected normal or wrong state;

and (2) land: whether a connection is from/to the same host/port), if there is a default value for the numerical characteristic variable, the characteristic is complemented using a linear difference method, i.e. the connection is from/to the same host/port)

In the formula y ₀ And x ₀ Respectively record the feature value for the previous strip of the data and the number of rows, y, of the corresponding feature ₁ And x ₁ The feature value and the number of rows for the corresponding feature are recorded for the next piece of data, respectively.

And step 3: and constructing characteristics capable of representing attack characteristics according to the priori knowledge of the electric power industrial control message. The method comprises the following specific steps:

step 3.1: calculating the connection duration of the DTU and the host of the power distribution terminal, wherein t _link Indicating the duration of the connection, t, at which data was collected _cls Time stamp indicating disconnection, t _str A time stamp indicating when the connection is started;

t _link ＝t _cls -t _str

step 3.2: calculating the average received data byte number of the DTU of the power distribution terminal, wherein d _{receive_bit} Represents t _link Number of bits of received data in time, d _{receive_byte} To representThe average number of received bytes in the period of time;

step 3.3: calculating the average sending data byte number of the DTU of the power distribution terminal, wherein d _{send_bit} Represents t _link Number of bits of data transmitted in time, d _{send_byte} Indicating the average number of transmitted bytes in the period of time;

step 3.4: calculating an average network flow of the DTU of the power distribution terminal, wherein d _flow Is shown at t _link Average network flow of a power distribution terminal DTU within time;

d _flow ＝|d _{send_byte} -d _{save_byte} |

and 4, step 4: and reducing the characteristic dimension of the high latitude of the safety data in the intrusion detection system by using a principal component analysis method. Firstly, eigenvalue decomposition is carried out on a covariance matrix of an acquired DTU data sample, eigenvectors are solved, and the first 3 principal component characteristics are selected according to the magnitude of the eigenvalue value, so that the purpose of reducing data dimensionality is achieved. The principal component characteristics finally obtained are: memory occupancy rate memory _ usage; CPU utilization CPU _ usage, DTU average network traffic d _flow ；

And 5: and building a neural network model by using a library in Python. The neural network model consists of an input layer, a hidden layer and an output layer, wherein the input layer is responsible for receiving and inputting characteristic values of the power distribution terminal after dimensionality reduction: memory _ use, cpu _ use, d _flow The output layer is responsible for outputting the neural network result, namely the output of the terminal state tag state _ flag, and the hidden layer comprises the initial weight of each characteristic value, a network target function, an activation function and the like.

Step 5.1: and initializing parameters. Since the number of features in the neural network model is 3, the number of initialized weights is also 3, and random sampling is adoptedThe way of generating the initialization weight, the first time according to the weight of each neuron

And offset value b ⁰ Initialized to a random number close to zero and continuously updated during later training.

Step 5.2: and calculating a neural network activation value. The activation value of the neural network is the output of the first layer:

where n denotes the number of iterations and i (i =1,2,3) denotes the number of DTU network feature information, where X ₁ Representing the memory occupancy rate memory _ usage; x ₂ Denotes CPU _ usage, X, CPU usage ₃ Mean network traffic d representing DTU _flow 。

Representing the weight of the ith eigenvalue at the nth iteration, b ⁿ Representing a neural network bias value.

Step 5.3: an activation function is set. A Logistic function is taken as an activation function, also called a Sigmoid function, and is used for hidden layer neuron output, the value range of the Logistic function is (0,1), any real number can be mapped into a (0,1) interval, the Logistic function is usually used for binary classification, and the derivative function can be represented by the Logistic function. The expression of the Sigmoid function and its derivative function is as follows:

step 5.4: a loss function is defined. The loss function is used for measuring the deviation between the actual DTU state and the predicted DTU state, and generally, the larger the loss function value is, the larger the error of the neural network model is, and the worse the robustness is, so that the neural network takes the minimum loss function as the optimal target in the training process. In the present invention, the loss function is defined as:

step 5.5: and optimizing parameters by adopting a gradient descent method. The weights and bias values in the neural network model are solved, usually in an iterative fashion:

step 5.6: and judging the state of the power distribution terminal. The output value of the neural network model is a numerical value in the (0,1) interval, when the output value is higher than the threshold value, the state of the power distribution terminal is safe and does not suffer from malicious network attacks, otherwise, the system is abnormal.

Step 6: and building a least square support vector machine model by using Python. The LSSVM maps an input vector to a high-dimensional feature space by realizing selected nonlinear mapping, then constructs an optimal decision function in the feature space based on a structure risk minimization principle, and replaces dot product operation in the high-dimensional feature space with a kernel function of an original space. The method comprises the following specific steps:

step 6.1: and determining a classification surface and an optimal hyperplane equation of the DTU state. The classification surface and the hyperplane satisfy the following conditions:

H:w·x+b＝0

where i (i =1,2,3) denotes the serial number of DTU network feature information, where X ₁ Express memory occupancy memory_usage；X ₂ Denotes the CPU usage rate CPU _ usage, X ₃ Mean network traffic d representing DTU _flow 。w _i Represents the weight of the ith feature value, and b represents the offset value of the plane.

Step 6.2: the LSSVM model converts non-equality constraints in the SVM optimization problem into equality constraints, and meanwhile, error variables are introduced into each sample in order to solve the situation that partial special points exist. And if the regular term of the error variable is supposed in the function, the optimization problem of the LSSVM is converted into the following steps:

step 6.3: firstly, the LSSVM optimization problem is converted into a Lagrange function of the optimization problem. Wherein alpha is _i Represents a correspondence x _i Lagrange multiplier.

The Lagrange function is then derived for each variable and its derivative is zero:

finally, writing the equation set into a block matrix equation form, and solving Lagrange multiplier alpha = [ alpha ] by utilizing a kernel function ₁ ,α ₂ ,...,α _N ] ^T And b.

Step 6.4: and outputting the state of the power distribution terminal. The output result of the least square support vector machine is a numerical value in the (-1,1) interval, and when the output of the LSSVM model is less than 0, the system is abnormal, otherwise, the system is normal.

And 7: and performing integrated learning on the neural network model and the minimum quadratic support vector machine model by adopting an Adaboost algorithm, thereby forming a strong classifier for judging the state of the DTU of the power distribution terminal.

Step 7.1: as shown in fig. 3. And the model parameters are adjusted through the training results, so that the performance of the model is optimal in the classification of the industrial power control attack, and the intrusion detection of the DTU of the power distribution terminal is realized. The method comprises the following specific steps:

the sample data is divided into 8 parts in equal proportion and recorded as a sample S1, a sample S2, a sample S3, a sample S4, a sample S5, a sample S6, a sample S7 and a sample S8.

Training was performed using 7 samples, and 1 sample was tested. Specifically, firstly, samples S2, S3, S4, S5, S6, S7 and S8 are used for training a classifier model, a sample S1 is used for testing an evolution model, and an output model of the evolution model is marked as H1; training a classifier model by using samples S1, S3, S4, S5, S6, S7 and S8, testing the two models by using a sample S2, and marking an output model as H2; and in the same way, the rest samples (S3, S4, S5, S6, S7 and S8) are used as the test data set, and the rest samples are used as the training data set to obtain output models H3, H4, H5, H6, H7 and H8.

In conclusion, 8 times of prediction output results are obtained through one round of training, and the result is to be obtained

The classification error rate as the first classifier is denoted as e _NN ；

And 7.2: calculating the weight coefficient alpha of the first classifier in the strong classifier evolution model _NN ，

Step 7.3: the weight distribution of the training sample set is updated,

D ₂ ＝(w _2,1 ,…,w _2,i ,…,w _2,N )

wherein N refers to the number of samples; d ₂ Representing the updated weight set; w is a _2,i Weight, w, representing updated ith sample data _1,i Weight of ith sample data to initialize, w _1,i =1/N, i =1,2, …, N; z is a normalization factor for ensuring D ₂ The sum of the total weights is 1,y _i To true value, G ₁ (x _i ) Is the predicted value of the first classifier, when the prediction is correct, y _i G ₁ (x _i ) =1, when prediction error, y _i G ₁ (x _i ) = -1; alpha is weight parameter, 0 < alpha < 1, alpha is larger, w is _2,i The more obvious the updating effect is;

step 7.4: dividing the training sample set with updated weight distribution into 8 parts in equal proportion, training 7 parts of the training samples, testing 1 part of the samples, training and verifying the second classifier, traversing one round of training to obtain the classification error rate of the second classifier, and recording as e _LSSVM ；

Step 7.5: calculating the weight coefficient alpha of the second classifier in the strong classifier evolution model _LSSVM ，

Step 7.6: constructing a trained strong classifier evolution model, and expressing as follows:

G(x)＝sign(f(x))

f(x)＝α _NN G _NN (x)+α _LSSVM G _LSSVM (x)

wherein G (x) represents a strong classifier evolution model, f (x) represents a linear combination of two classifiers, and alpha _NN And alpha _LSSVM A weight coefficient representing the degree of importance of the first classifier and the second classifier; sign (·) indicates that the system is judged to be normal as 1 and the system is judged to be abnormal as-1, so as to achieve the purpose of classification finally.

The above classification error rate (weighted error function) is calculated by:

wherein N refers to the number of samples; g _NN (x _i ) And G _LSSVM (x _i ) Respectively representing NN and LSSVM models with respect to a sample x _i (x _i1 ,x _i2 ,x _i3 ) An output of (d); y is _i A label (normal is 1, abnormal is-1) indicating the actual state of the sample; p (G) _NN (x _i )≠y _i ) And P (G) _LSSVM (x _i )≠y _i ) Representing two models versus sample x _i (x _i1 ,x _i2 ,x _i3 ) The probability of a false positive; w is a _NNi And w _LSSVMi Representing the DTU sample x of the current round _i (x _i1 ,x _i2 ,x _i3 ) The weight distribution of the data set, rather than the parameters internal to the classifier.

And 8: a working block diagram of the intrusion detection system facing the DTU is shown in fig. 4, and the specific method is to perform intrusion detection on DT U data by using an evolution model, determine whether a power distribution terminal is in a normal working state or in an abnormal state subject to attack, and send an alarm if the state is abnormal, thereby implementing intrusion detection and active defense for the power distribution terminal.

In one embodiment of the present invention, a machine learning based DTU intrusion detection system for a power distribution terminal is further described. The method comprises the following steps:

Wherein, the data processing subsystem includes:

the data preprocessing module is used for preprocessing DTU information data to obtain an original feature set, screening out a preset number of features from the original feature set through a principal component analysis method to serve as final features, and obtaining a training sample set;

the first classifier module is configured with a neural network model consisting of an input layer, a hidden layer and an output layer;

a second classifier module configured with a least squares support vector machine model;

the classifier training model is used for respectively training the first classifier module and the second classifier module, and the training process is as follows:

in the training process of a first classifier module, an original training sample set is used as training data, training and verification are carried out on a first classifier in an 8-fold cross verification mode, a first classifier weight coefficient is obtained, and a trained first classifier model file is stored;

then, updating weight distribution of an original training sample set according to the training effect of the first classifier, training and verifying a second classifier by using the updated training sample set as training data in an 8-fold cross verification mode to obtain a weight coefficient of the second classifier, and storing a trained model file of the second classifier;

and the strong classifier evolution model building module is used for loading the trained first classifier model file and the trained second classifier model file and building a strong classifier evolution model according to the weight coefficients of the two classifiers so as to carry out real-time detection on the working state of the DTU of the power distribution terminal.

The DTU intrusion detection system for the power distribution terminal based on the machine learning specifically comprises port identification, data acquisition, transmission, data processing and dimension reduction of the power distribution terminal, construction of a classifier based on a neural network and a least square support vector machine, intrusion behavior detection experiments of the power distribution terminal, and timely alarming when abnormality occurs. The method adopts a principal component analysis method to reduce high-dimensional characteristic data, and then utilizes the characteristic after dimensional reduction to establish a model; secondly, a strong classifier is constructed by adopting a least square support vector machine and a neural network algorithm so as to improve the detection accuracy and reduce the false alarm rate; finally, the intrusion detection system framework adopts a modular design, is suitable for intrusion detection in the field of smart power grids, and has good portability and universality.

The foregoing lists merely illustrate specific embodiments of the invention. It is obvious that the invention is not limited to the above embodiments, but that many variations are possible. All modifications which can be derived or suggested by a person skilled in the art from the disclosure of the present invention are to be considered within the scope of the invention.

Claims

1. A DTU intrusion detection method of a power distribution terminal based on machine learning is characterized by comprising the following steps:

and step 3: building a neural network model composed of an input layer, a hidden layer and an output layer as a first classifier, wherein the input layer is responsible for receiving the screened final characteristics, the hidden layer is used for processing characteristic values and comprises initial weights, network objective functions and activation functions of the characteristic values, and the output layer is responsible for outputting neural network results;

and 5: performing ensemble learning on the first classifier and the second classifier to form a strong classifier evolution model; in ensemble learning, firstly, training and verifying a first classifier in an 8-fold cross verification mode by using a training sample set to obtain a classification error rate of the first classifier, and further calculating a weight coefficient of the first classifier in a strong classifier evolution model; then updating the weight distribution of the training sample set to increase the weight of the sample with wrong prediction in the first classifier and decrease the weight with correct prediction, and then normalizing all weights; training and verifying the second classifier by using the training sample set with the updated weight distribution in an 8-fold cross verification mode to obtain a classification error rate of the second classifier, and further calculating a weight coefficient of the second classifier in a strong classifier evolution model; finally, forming a trained strong classifier evolution model;

step 6: acquiring DTU information data of the power distribution terminal in real time through a C/S communication framework of a server-client, extracting features according to the screening result of the step 3, carrying out real-time intrusion detection on the DTU feature data by using a trained strong classifier evolution model, judging whether the DTU of the power distribution terminal is in a normal working state or in an abnormal working state suffering from attack, and giving an alarm if the DTU is in the abnormal working state.

2. The machine learning-based DTU intrusion detection method for the power distribution terminal according to claim 1, wherein the step 1 specifically comprises:

step 1.1: respectively creating socket objects of the DTU and the host;

step 1.2: binding a server address to realize communication between the power distribution terminal and the host;

step 1.3: the method comprises the steps that a host periodically collects DTU information data of a power distribution terminal;

if the acquired numerical characteristic variable has a default value, the characteristic is complemented by using a linear difference method, namely

In the formula y ₀ And x ₀ Respectively the previous note of DTU characteristic dataRecording the characteristic values and the number of lines, y, of the corresponding characteristic ₁ And x ₁ Respectively, the characteristic value of the next record of the DTU characteristic data and the line number of the corresponding characteristic.

3. The machine learning-based DTU intrusion detection method for the power distribution terminal according to claim 1, wherein the step 2 specifically comprises:

step 2.1: calculating the connection duration t of the DTU and the host of the power distribution terminal _link ，

t _link ＝t _cls -t _str

Wherein, t _cls Time stamp indicating disconnection, t _str A time stamp indicating when the connection is started;

step 2.2: calculating the average received data byte number d of the DTU of the power distribution terminal _{receive_byte} ，

Wherein d is _{receive_bit} Represents t _link The number of bits of the received data in time;

step 2.3: calculating the average sending data byte number d of the DTU of the power distribution terminal _{send_byte} ，

Wherein d is _{send_bit} Represents t _link The number of bits of the transmitted data in time;

step 2.4: calculating average network flow d of DTU of power distribution terminal _flow ，

d _flow ＝|d _{send_byte} -d _{receive_byte} |

Wherein d is _flow Is shown at t _link Average network flow of a power distribution terminal DTU within time;

step 2.5: taking the memory occupancy rate, the CPU utilization rate, the destination address of the data packet, the source address of the data packet, the length of the data packet, the power consumption, the temperature, the continuous duration, the number of bytes of average received data, the number of bytes of average sent data and the average network flow as original characteristics; and (3) performing eigenvalue decomposition on the covariance matrix of the acquired DTU characteristic data original characteristic sample by a principal component analysis method, solving an eigenvector, selecting the first q principal component characteristics as final characteristics according to the magnitude of the eigenvector value, and acquiring a training sample set.

4. The machine learning-based DTU intrusion detection method for the power distribution terminal according to claim 1, wherein the step 3 specifically comprises:

step 3.1: building a neural network model composed of an input layer, a hidden layer and an output layer as a first classifier;

step 3.2: initializing parameters of the neural network model, wherein the weight of each neuron is randomly generated to generate initialization weights

And an offset value b ⁰ Initializing the random number; setting an activation function and a loss function;

step 3.3: pre-training a neural network model using a first sample set, first computing a neural network activation value,

wherein n represents the number of iterations, X _i Representing the ith feature in the training sample set, q is the total number of features in the training sample set,

representing the weight of the ith eigenvalue at the nth iteration, b ⁿ Representing a neural network bias value; the range of the activation value is (1, -1), when the final output neural network result is higher than the threshold value,the power distribution terminal is in a safe state, otherwise, the power distribution terminal is abnormal;

step 3.4: performing iterative training on the neural network model according to the loss function value, optimizing parameters by adopting a gradient descent method,

wherein, w ⁿ⁺¹ Is the weight at the n +1 th iteration, w ⁿ Is the weight at the nth iteration, x represents the feature data vector of a sample, J ⁿ (w, b) represents a loss function, i.e. the square of the difference between the predicted value and the actual value,

an output value representing the neural network model,

representing a predicted value of the terminal state by the activation function; b ⁿ⁺¹ Is the neural network bias value at the n +1 th iteration, b ⁿ Is the neural network bias value at the nth iteration.

5. The machine learning-based DTU intrusion detection method for the power distribution terminal according to claim 1, wherein the step 4 specifically comprises:

step 4.1: building a least square support vector machine model as a second classifier;

step 4.2: determining a classification surface and an optimal hyperplane equation of the DTU state, wherein the classification surface and the optimal hyperplane satisfy the following conditions:

H:w·x+b＝0

wherein, X _i Representing the ith feature in the training sample set, q being the total number of features in the training sample set, w _i Representing the weight of the ith characteristic value, b representing the offset value of the plane, x representing the characteristic data vector of a sample, and w representing the hyperplane parameter;

step 4.3: the least square support vector machine model converts non-equality constraint in SVM optimization problem into equality constraint, introduces error variable aiming at each sample, adds regular item of the error variable in function, and converts the optimization problem into:

wherein | · | purple sweet ² Denotes the L2 norm, λ denotes the regularized norm, N denotes the number of samples, e _i An error variable representing the sample is determined,

representing the geometric spacing of the samples, y _i Representing the true value of the ith sample;

step 4.4: pre-training a least square support vector machine model by using a first sample set;

firstly, the optimization problem is firstly converted into Lagrange function, wherein alpha _i Represents a correspondence x _i The Lagrange multiplier of (a) is,

finally, the equation set is written into block momentsIn the form of an array equation, solving Lagrange multiplier alpha = [ alpha ] by using a kernel function ₁ ,α ₂ ,...,α _N ] ^T And b;

step 4.5: the output result of the least square support vector machine model is a numerical value in the (-1,1) interval, when the final output result of the least square support vector machine is higher than 0, the power distribution terminal is in a safe state, otherwise, the power distribution terminal is abnormal.

6. The machine learning-based DTU intrusion detection method for the power distribution terminal according to claim 1, wherein the step 5 specifically comprises:

The classification error rate as the first classifier is denoted as e _NN ；

Step 5.3: the weight distribution of the training sample set is updated,

D ₂ ＝(w _2,1 ,…,w _2,i ,…,w _2,N )

wherein, N refers to the number of samples;D ₂ representing the updated weight set; w is a _2,i Weight, w, representing updated ith sample data _1,i Weight of ith sample data to initialize, w _1,i =1/N, i =1,2, …, N; z is a normalization factor for ensuring D ₂ The sum of the total weights is 1,y _i To true value, G ₁ (x _i ) Is the predicted value of the first classifier, when the prediction is correct, y _i G ₁ (x _i ) =1, when prediction error, y _i G ₁ (x _i ) = -1; alpha is weight parameter, alpha is more than 0 and less than 1, alpha is larger, w is _2,i The more obvious the updating effect is;

step 5.4: dividing the training sample set with updated weight distribution into 8 parts in equal proportion, training 7 parts of the training samples, testing 1 part of the samples, training and verifying the second classifier, traversing one round of training to obtain the classification error rate of the second classifier, and recording as e _LSSVM ；

G(x)＝sign(f(x))

f(x)＝α _NN G _NN (x)+α _LSSVM G _LSSVM (x)

wherein G (x) represents the evolution model of the strong classifier, f (x) represents the linear combination of the two classifiers, G _NN (x) And G _LSSVM (x) Respectively representing the output of the NN model and the output of the LSSVM model; alpha (alpha) ("alpha") _NN And alpha _LSSVM A weight coefficient representing the degree of importance of the first classifier and the second classifier; sign (·) indicates that the system is judged to be normal as 1, and the system is judged to be abnormal as-1, so as to finally achieve the purpose of classification.

7. A DTU intrusion detection system for a power distribution terminal based on machine learning based on the method of claim 1, comprising:

8. The DTU intrusion detection system according to claim 7, wherein the data processing subsystem comprises:

in the training process of a first classifier module, an original training sample set is used as training data, a first classifier is trained and verified in an 8-fold cross validation mode, a first classifier weight coefficient is obtained, and a trained first classifier model file is stored;