CN116488325A

CN116488325A - Smart power grid anomaly detection and classification method, smart power grid anomaly detection and classification equipment and readable storage medium

Info

Publication number: CN116488325A
Application number: CN202310240818.XA
Authority: CN
Inventors: 卢丹; 张琳娟; 孙合法; 许长清; 李翼铭; 丁博; 王奕萱; 李亚男; 夏旻; 马冲; 郑征; 郭璞; 陈婧华; 韩军伟; 周志恒; 邱超
Original assignee: Nanjing University of Information Science and Technology; Economic and Technological Research Institute of State Grid Henan Electric Power Co Ltd
Current assignee: Nanjing University of Information Science and Technology; Economic and Technological Research Institute of State Grid Henan Electric Power Co Ltd
Priority date: 2023-03-14
Filing date: 2023-03-14
Publication date: 2023-07-25

Abstract

The invention belongs to the technical field of power grid anomaly detection, and in particular relates to a smart grid anomaly detection and classification method, device and readable storage medium. The method comprises: step 1, acquiring a suitable training data set; step 2, constructing an Anomaly Detection and Classification System (ADCS), which combines an autoencoder with a generative adversarial network architecture and is divided into an anomaly detection system and an anomaly classification system; step 3, training the ADCS network, in which the training data are preprocessed and formatted using a sliding window; step 4, inputting a smart grid test data set, performing encoding and decoding operations with the trained ADCS network on data consisting of normal and abnormal time-series electrical measurements, and outputting the anomaly detection and anomaly classification results. The method and device address the technical problem that intrusion detection for smart grid systems in the prior art is neither fast enough nor accurate enough.

Description

Smart power grid anomaly detection and classification method, smart power grid anomaly detection and classification equipment and readable storage medium

Technical Field

The invention belongs to the technical field of power grid anomaly detection, and in particular relates to a smart grid anomaly detection and classification method, device and readable storage medium.

Background

The rapid development of the industrial Internet of Things has brought the traditional power grid into a new digital paradigm called the smart grid, which offers significant benefits such as better utilization of existing resources, pervasive control and self-healing. According to related research, the smart grid constitutes the largest application of the Internet of Things. However, the adoption of smart technologies raises serious cybersecurity problems, because of: unavoidable insecure legacy systems such as industrial control systems and supervisory control and data acquisition (SCADA) systems, the vulnerabilities of the Transmission Control Protocol/Internet Protocol (TCP/IP), and the new attack surface introduced by the smart technologies themselves.

Denial of service, unauthorized access and false data injection constitute the expected attack vectors against smart grids and can have disastrous consequences. The first targets the availability of the relevant systems, while the others exploit vulnerabilities in industrial protocols to compromise the confidentiality, integrity and authenticity of the exchanged information.

In the current era of big data, deep learning is an emerging technology that learns to identify targets autonomously by training on large amounts of data, and it plays an important role in defending against rapidly evolving cyber threats and in detecting abnormal operation in time. Deep learning relies on large amounts of labeled data; however, most previous work has not been validated in real smart grid environments or with real data.

Disclosure of Invention

The invention aims to overcome the shortcomings of the prior art and provides a smart grid anomaly detection and classification method that addresses the technical problem that intrusion detection for smart grid systems in the prior art is insufficiently fast and insufficiently accurate.

The purpose of the invention is realized in the following way: a smart grid anomaly detection and classification method comprises the following steps:

step 1, acquiring a suitable training data set, which covers two cases:

in the first case, statistically created anomaly samples are manually injected into the database of the master terminal unit, creating, for each of a plurality of smart grid environments, a data set consisting of normal and abnormal time-series electrical measurements;

in the second case, an intrusion detection data set is combined with normal DNP3 network flows from a substation environment, producing a data set consisting of normal and malicious Modbus/TCP and DNP3 network flows;

step 2, constructing an Anomaly Detection and Classification System (ADCS), which combines an autoencoder with a generative adversarial network architecture and has one architecture for each of two cases, anomaly detection and anomaly classification;

step 3, training the ADCS network: the training data are preprocessed, formatted using a sliding window and normalized to the range [0,1], and are then input into the ADCS network for training;

step 4, inputting a smart grid test data set, performing encoding and decoding operations with the trained ADCS network on data consisting of normal and abnormal time-series electrical measurements, and outputting the anomaly detection and anomaly classification results.

The generative adversarial network in step 2 relies on two neural sub-networks, a generator G and a discriminator D. The generator G takes random noise data and generates data similar to real data, while the discriminator D attempts to classify each input data sample as real or fake. The two sub-networks are trained in competition with each other, so that the generator G eventually produces data that the discriminator D cannot distinguish from real data. The relation between G and D is expressed as follows:

G samples noise z from the space Z and maps it to the space X in which the discriminator input x lies; $p_{data}(x)$ and $p_z(z)$ denote the probability distributions over the spaces X and Z, respectively;
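
The relation equation referred to above is not reproduced in this text. As a sketch only, assuming it is the standard minimax objective of a generative adversarial network (this is an assumption, and the value-function name V is not taken from the original), it can be written with the symbols defined here as:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{data}(x)}\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_z(z)}\left[\log\left(1 - D(G(z))\right)\right]
```

Under this reading, D is trained to assign a high probability to real samples x and a low probability to generated samples G(z), while G is trained so that D assigns a high probability to G(z).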

the autoencoder is a deep learning network that learns to reproduce its input data by compressing and then expanding the data through a multi-layer channel; it consists of two sub-networks, an encoder and a decoder: the encoder sub-network compresses input data from the space X onto a manifold F, and the decoder sub-network conversely expands data from the manifold F into samples P; the goal of the autoencoder architecture is to train the network to generate samples similar to the given real data; after training, the network can be fed new data similar to the training data; the data pipeline of the autoencoder architecture is given by the mappings r and p:

$r: X \to F, \quad p: F \to P$

The anomaly detection and classification system combines the autoencoder with the generative adversarial network by encapsulating the autoencoder architecture within the structure of the generative adversarial network; the generator takes the form of a decoder and the discriminator takes the form of an encoder;

the generator-decoder accepts noise samples of size N × M as input, where N is the number of noise points in each sample and M is the number of input samples; the generator-decoder expands these samples to produce samples that mimic the desired data; the discriminator-encoder compresses the output of the generator-decoder to a single point, which is the validity label of the sample and is used to distinguish real samples from fake ones; after training, an intermediate model is derived from the discriminator-encoder sub-network; this model is the part of the discriminator-encoder used in the anomaly detection process and comprises the layers from the input layer up to the last hidden layer before the network output;

the adversarial loss is the difference between the generated sample and the real sample; because the generator-decoder learns to generate normal samples, the greater the adversarial loss, the higher the probability that the real sample is anomalous; the adversarial loss is given by the following equation:

$\mathrm{AdvL}(d_r, d_p) = \lVert d_r - d_p \rVert$

where AdvL is the adversarial loss score of the generative adversarial network, and $d_r$ and $d_p$ are the predictions of the intermediate (latent) model for the real sample and the generated sample, respectively.
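
The scoring step can be illustrated with a minimal sketch. It assumes the norm in the equation is the Euclidean norm and that a sample is flagged when its score exceeds a threshold; the norm type, the `threshold` parameter and the example latent vectors are all hypothetical and are not specified in the text.

```python
import math

def adversarial_loss(d_r, d_p):
    """Euclidean norm ||d_r - d_p|| between two latent vectors (assumed L2)."""
    if len(d_r) != len(d_p):
        raise ValueError("latent vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(d_r, d_p)))

def is_anomalous(d_r, d_p, threshold):
    """Flag the real sample when its loss exceeds a chosen threshold.

    The threshold is a hypothetical tuning parameter, not given in the text.
    """
    return adversarial_loss(d_r, d_p) > threshold

# Hypothetical latent vectors produced by the intermediate model.
d_generated = [0.0, 0.0, 0.0]
d_normal = [0.1, 0.0, -0.1]
d_attack = [3.0, 4.0, 0.0]

score_normal = adversarial_loss(d_normal, d_generated)
score_attack = adversarial_loss(d_attack, d_generated)
```

With these made-up vectors the second score is much larger than the first, which is the behaviour the text relies on: a real sample that lies far from what the generator-decoder produces is more likely to be anomalous.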

The ADCS architecture for anomaly detection is trained with only a set of normal samples and identifies outliers in a data set containing both normal and abnormal samples; the whole network is divided into three parts: an input layer, a generator-decoder and a discriminator-encoder;

input layer for anomaly detection: the input layer represents the input of the proposed deep neural network; it takes a noise vector of size N, generated from a uniform distribution with mean μ and standard deviation σ;

generator-decoder for anomaly detection: the generator-decoder expands a random noise input vector of size z=10 to size M, where M is the number of features, so that the generated data mimic real data; it is trained to produce normal samples; the calculation process is as follows:

$F_1 = \sigma(\mathrm{Conv}(x))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \tanh(\mathrm{Conv}(x))$

where σ denotes the nonlinear activation function ReLU, tanh denotes the nonlinear activation function Tanh, and Dr denotes regularization;

discriminator-encoder for anomaly detection: the role of the discriminator-encoder is to distinguish real data samples from generated data samples, i.e. the samples produced by the generator-decoder; it takes as input a vector of the M features of a data sample and compresses the data through a multi-layer channel into a single point representing the validity label, i.e. the binary classification of the sample as real or fake; the discriminator-encoder is trained together with the generator-decoder and receives both real and generated samples, each with its ground-truth label; the calculation process is as follows:

$F_1 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$

where σ denotes the nonlinear activation function ReLU, Dr denotes regularization, and Sigmoid denotes the nonlinear activation function Sigmoid.

In the anomaly classification case, the ADCS architecture for anomaly classification is derived from the ADCS architecture for anomaly detection; here the anomaly detection and anomaly classification processes are combined into a single deep neural network that produces three basic ground-truth points: one for the validity of the sample, one for the anomaly approximation, and one describing the anomaly class of the sample; the architecture is likewise divided into three parts: an input layer, a generator-decoder and a discriminator-encoder; the main difference is that this network is designed to handle multiple classes of data with fewer features, whereas the ADCS structure for anomaly detection is designed to handle a single class of data with a large number of features;

input layer for anomaly classification: the input layer receives a noise vector of size N and a vector containing the class of the sample; the elements of the random noise vector follow a normal distribution with μ=0 and σ=1; the class vector has dimension [1 × C] and is a zero vector with a 1 at the position of the class, where C is the number of classes present in the given data set; the class of the sample is denoted by $c_p$, which is derived from the following formula:

$c_p = \arg\max(V_{label})$

where $V_{label}$ is the label vector;
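
The relationship between the class vector and the class index can be shown with a short sketch. The helper names `one_hot` and `argmax` are illustrative only; C = 14 is taken from the 14-class problem mentioned in the text, and class index 3 is an arbitrary example.

```python
def one_hot(class_index, num_classes):
    """Build a [1 x C] zero vector with a 1 at the class position."""
    if not 0 <= class_index < num_classes:
        raise ValueError("class_index out of range")
    return [1.0 if i == class_index else 0.0 for i in range(num_classes)]

def argmax(vector):
    """Return the index of the largest element (the first one on ties)."""
    return max(range(len(vector)), key=lambda i: vector[i])

C = 14                      # e.g. 13 attack classes plus the normal class
v_label = one_hot(3, C)     # label vector for a sample of class 3
c_p = argmax(v_label)       # recovers the class index from the label vector
```

The same `argmax` applies unchanged to a vector of predicted class probabilities, where it returns the most likely class.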

generator-decoder for anomaly classification: this is a modified version of the generator-decoder for anomaly detection; in this case, the generator-decoder takes the two vectors described for the input layer and concatenates them before passing them through the structure of the generator-decoder; the calculation process is as follows:

$F_0 = \sigma(\mathrm{Conv}(x_t, x_f))$

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 1, 2, 3$

where $x_t$ is the noise vector and $x_f$ is the label vector;

discriminator-encoder for anomaly classification: the discriminator-encoder takes as input a vector of the M features of a data sample; the proposed structure produces not only a validity approximation but also an anomaly classification of the incoming sample, so the output of the discriminator-encoder comprises two parts; the first part is the validity label of the given sample, which is used to decide whether the sample is real or generated; the second part is a label vector representing the multi-class classification of the sample over the classes given in the data set; the calculation process is as follows:

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 0, 1, 2$

$F_{out1} = \mathrm{Softmax}(\mathrm{Conv}(x))$

$F_{out2} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$

where Softmax denotes the Softmax function, $F_{out1}$ is the classification result for the sample data, and $F_{out2}$ is the anomaly detection result for the sample data.
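
The two output activations can be illustrated in isolation. The sketch below applies Softmax and Sigmoid to made-up pre-activation values; the variable names and the numbers are hypothetical, and the convolutional layers that would produce them are not modelled.

```python
import math

def softmax(logits):
    """Numerically stable softmax: probabilities that sum to 1."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(logit):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-logit))

# Hypothetical pre-activation outputs of the last Conv layers.
class_logits = [0.2, 2.5, -1.0]   # one value per class
validity_logit = -0.8             # single value for the validity output

f_out1 = softmax(class_logits)    # class probabilities
f_out2 = sigmoid(validity_logit)  # score between 0 and 1
predicted_class = max(range(len(f_out1)), key=lambda i: f_out1[i])
```

Softmax suits the multi-class output because its values form a probability distribution over the C classes, while Sigmoid suits the single binary output.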

The four real smart grid evaluation environments used to evaluate and verify the smart grid anomaly detection and classification method are a smart grid laboratory, a distribution substation, a hydropower station and a power plant.

A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the smart grid anomaly detection and classification method when executing the computer program.

A computer-readable storage medium storing a computer program for executing the smart grid anomaly detection and classification method.

The beneficial effects of the invention are as follows: in use, the smart grid anomaly detection and classification method uses an autoencoder network to extract features from the input power measurement data and integrates the autoencoder with the structure of a generative adversarial network. The model provides an ADCS architecture for anomaly detection, with its generator-decoder and discriminator-encoder structures, and an ADCS architecture for anomaly classification, with its generator-decoder and discriminator-encoder structures. This deep learning model performs anomaly detection and classification for the smart grid: it addresses the anomaly detection problem by distinguishing five network attacks against DNP3 as well as potential anomalies in operational data (i.e. time-series power measurements), and it addresses a challenging multi-class classification problem consisting of 14 classes (13 Modbus/TCP network attacks and the normal class). Its recognition accuracy in several real smart grid evaluation environments is better than that of existing methods.

Drawings

Fig. 1 is the architecture of the autoencoder and generative adversarial network of the invention for anomaly detection.

Fig. 2 is the generator-decoder structure of the invention for anomaly detection.

Fig. 3 is the discriminator-encoder structure of the invention for anomaly detection.

Fig. 4 is the architecture of the autoencoder and generative adversarial network of the invention for anomaly classification.

Fig. 5 is the generator-decoder structure of the invention for anomaly classification.

Fig. 6 is the discriminator-encoder structure of the invention for anomaly classification.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

Example 1

The invention relates to a smart grid anomaly detection and classification method, as shown in fig. 1, which comprises the following steps:

step 1, acquiring a proper training data set, which comprises two cases:


In summary, in use, the smart grid anomaly detection and classification method of the invention uses an autoencoder network to extract features from the input power measurement data and integrates the autoencoder with the structure of a generative adversarial network. The model provides an ADCS architecture for anomaly detection, with its generator-decoder and discriminator-encoder structures, and an ADCS architecture for anomaly classification, with its generator-decoder and discriminator-encoder structures. This deep learning model performs anomaly detection and classification for the smart grid: it addresses the anomaly detection problem by distinguishing five network attacks against DNP3 as well as potential anomalies in operational data (i.e. time-series power measurements), and it addresses a challenging multi-class classification problem consisting of 14 classes (13 Modbus/TCP network attacks and the normal class). Its recognition accuracy in several real smart grid evaluation environments is better than that of existing methods.

Example 2

The invention discloses a smart grid anomaly detection and classification method, which comprises the following steps:

1. Acquisition of the training data sets:

First, a suitable data set is constructed. Power data from four real smart grid environments are used: a smart grid laboratory, a distribution substation, a hydropower station and a power plant. Each of these environments generates different operational data and has its own regional management unit, which manages the operation of industrial components such as generators, turbines and transformers. In the first case, statistically created anomaly samples are manually injected into the database of the master terminal unit, creating for each smart grid environment a data set consisting of normal and abnormal time-series electrical measurements. These data differ for each smart grid environment. In the preprocessing step, the data are formatted using a sliding window of 30 instances and normalized to the range [0,1]. In the second case, an intrusion detection data set is combined with normal DNP3 network flows from the substation environment, yielding a data set consisting of normal and malicious Modbus/TCP and DNP3 network flows. Both data sets are labeled: in the first case the anomalous instances are known, and in the second case the malicious IP addresses are known.
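
The preprocessing step can be sketched as follows. The window length of 30 comes from the text; the stride of 1, the per-feature min-max scaling and the synthetic two-feature series are assumptions made for illustration only.

```python
def min_max_normalize(series):
    """Scale each feature (column) of a list of rows into [0, 1]."""
    columns = list(zip(*series))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    scaled = []
    for row in series:
        scaled.append([
            (v - lo) / (hi - lo) if hi > lo else 0.0  # constant column -> 0
            for v, lo, hi in zip(row, lows, highs)
        ])
    return scaled

def sliding_windows(series, window=30, step=1):
    """Cut a time series into overlapping windows of `window` rows."""
    return [series[i:i + window]
            for i in range(0, len(series) - window + 1, step)]

# Hypothetical measurements: 100 time steps, 2 features per step.
raw = [[230.0 + (t % 7), 10.0 + 0.5 * (t % 5)] for t in range(100)]
normalized = min_max_normalize(raw)
windows = sliding_windows(normalized, window=30)
```

Because the scaling here uses the minimum and maximum of each feature over the whole series, applying it before or after windowing gives the same values; scaling each window separately would be a different design choice.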

2. Construction of the smart grid anomaly detection and classification system architecture:

The architecture combines the autoencoder with the generative adversarial network by encapsulating the autoencoder architecture within the architecture of the generative adversarial network. The generator takes the form of a decoder and the discriminator takes the form of an encoder. The generator-decoder accepts noise samples of size N × M as input, where N is the number of noise points in each sample and M is the number of input samples. The generator-decoder then expands these samples to produce samples that mimic the desired data. The discriminator-encoder compresses the output of the generator-decoder to a single point, which is the validity label of the sample and is used to distinguish real samples from fake ones. After training, an intermediate model is derived from the discriminator-encoder sub-network. This model is the part of the discriminator-encoder used in the anomaly detection process; it comprises the layers from the input layer up to the last hidden layer before the network output and serves to reduce the input to a specified latent space. Two samples are run through the intermediate model: the real data sample and a generated sample. At this point the generator-decoder has learned to generate near-real data that mimic normal samples. To calculate the anomaly score of a real sample, an adversarial loss function is used. The adversarial loss is the difference between the generated sample and the real sample. Since the generator-decoder has learned to produce normal samples, the greater the adversarial loss, the higher the probability that the real sample is anomalous. The adversarial loss is given by the following equation:

$\mathrm{AdvL}(d_r, d_p) = \lVert d_r - d_p \rVert$

The ADCS architecture for anomaly detection is shown in Fig. 1. It is trained with only a set of normal samples and can identify outliers in a data set containing both normal and abnormal samples. The structure of the entire network can be divided into three parts: an input layer, a generator-decoder and a discriminator-encoder.

Input layer for anomaly detection: the input layer represents the input of the proposed deep neural network (DNN). It takes a noise vector of size N, generated from a uniform distribution with mean μ and standard deviation σ.
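
A uniform distribution is usually parameterized by its endpoints rather than by a mean and a standard deviation. As a sketch of one way to obtain the stated parameters: a uniform distribution on [a, b] has variance (b - a)^2 / 12, so the interval [μ - √3·σ, μ + √3·σ] has mean μ and standard deviation σ. The function name, the seed and the values μ = 0, σ = 1 are assumptions; N = 10 follows the noise size quoted for the generator-decoder.

```python
import math
import random

def uniform_noise(n, mu=0.0, sigma=1.0, rng=random):
    """Draw n values from U(mu - sqrt(3)*sigma, mu + sqrt(3)*sigma).

    A uniform distribution on [a, b] has variance (b - a)^2 / 12, so this
    interval gives mean mu and standard deviation sigma.
    """
    half_width = math.sqrt(3.0) * sigma
    return [rng.uniform(mu - half_width, mu + half_width) for _ in range(n)]

rng = random.Random(0)   # fixed seed so the sketch is repeatable
z = uniform_noise(10, mu=0.0, sigma=1.0, rng=rng)   # N = 10 noise points
```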

Generator-decoder for anomaly detection: the generator-decoder expands a random noise input vector of size z=10 to size M, where M is the number of features, so that the generated data mimic real data. It is trained to produce normal samples. The calculation process is as follows:

$F_1 = \sigma(\mathrm{Conv}(x))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \tanh(\mathrm{Conv}(x))$

where σ denotes the nonlinear activation function ReLU, tanh denotes the nonlinear activation function Tanh, and Dr denotes regularization.

Discriminator-encoder for anomaly detection: the role of the discriminator-encoder is to distinguish real data samples from generated data samples (i.e. the samples produced by the generator-decoder). It takes as input a vector of the M features of a data sample and compresses the data through a multi-layer channel into a single point representing the validity label (i.e. the binary classification of the sample as real or fake). The discriminator-encoder is trained together with the generator-decoder; it receives both real and generated samples, each with its ground-truth label. The calculation process is as follows:

$F_1 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$

The ADCS architecture for anomaly classification is shown in Fig. 4. In the anomaly classification case, the ADCS architecture is derived from the architecture used for anomaly detection, and the anomaly detection and anomaly classification processes are combined into a single deep neural network. In particular, it produces three basic ground-truth points: one for the validity of the sample, one for the anomaly approximation and one describing the anomaly class of the sample. The architecture can likewise be divided into three parts: an input layer, a generator-decoder and a discriminator-encoder. The main difference is that this network is designed to handle multiple classes of data with fewer features, whereas the ADCS structure for anomaly detection is designed to handle a single class of data with a large number of features.

Input layer for anomaly classification: the input layer accepts a noise vector of size N and a vector containing the class of the sample. The elements of the random noise vector follow a normal distribution with μ=0 and σ=1. The class vector has dimension [1 × C] and is a zero vector with a 1 at the position of the class, where C is the number of classes present in the given data set. The class of the sample is denoted by $c_p$, which is derived from the following formula:

$c_p = \arg\max(V_{label})$

where $V_{label}$ is the label vector.

Generator-decoder for anomaly classification: this is a modified version of the generator-decoder for anomaly detection. In this case, the generator-decoder takes the two vectors described for the input layer and concatenates them before passing them through the structure of the generator-decoder. The calculation process is as follows:

$F_0 = \sigma(\mathrm{Conv}(x_t, x_f))$

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 1, 2, 3$

where $x_t$ is the noise vector and $x_f$ is the label vector.
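
How the two input vectors might be joined can be sketched as follows, assuming that "connecting" them means plain concatenation. The function name, the noise size of 10 and the class index 2 are hypothetical; the normal distribution with μ=0 and σ=1 and the 14 classes follow the text.

```python
import random

def generator_input(noise_size, class_index, num_classes, rng):
    """Concatenate a N(0, 1) noise vector x_t with a one-hot label x_f."""
    x_t = [rng.gauss(0.0, 1.0) for _ in range(noise_size)]
    x_f = [1.0 if i == class_index else 0.0 for i in range(num_classes)]
    return x_t + x_f   # simple concatenation; length is N + C

rng = random.Random(42)   # fixed seed for repeatability
x = generator_input(noise_size=10, class_index=2, num_classes=14, rng=rng)
```

Conditioning the generator on the label in this way lets a single network produce samples for a requested class, which is what the classification variant needs.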

Discriminator-encoder for anomaly classification: the discriminator-encoder takes as input a vector of the M features of a data sample. Since the proposed structure produces not only a validity approximation but also an anomaly classification of the incoming sample, the output of the discriminator-encoder consists of two parts. The first part is the validity label of the given sample, which is used to decide whether the sample is real or generated. The second part is a label vector representing the multi-class classification of the sample over the classes given in the data set. The calculation process is as follows:

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 0, 1, 2$

$F_{out1} = \mathrm{Softmax}(\mathrm{Conv}(x))$

$F_{out2} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$

3. Training of the network model using the data set:

The invention adopts supervised training. The original power data and the corresponding labels are first converted into tensors and input into the model for anomaly detection and sample generation training; the generated power data obtained from the generative adversarial network, together with the corresponding labels, are then input into the model for anomaly detection and classification training. The network loss is computed with the binary cross-entropy function and the batch size is set to 16. A step learning-rate (StepLR) schedule reduces the learning rate as training progresses in order to achieve a better training effect: the initial learning rate is 0.0002, the decay factor is 0.98, the learning rate is updated every 5 training epochs, and training runs for 500 epochs in total. The RMSprop optimizer is used during training.
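
The learning-rate schedule and the loss can be worked through with a small sketch. The numeric settings come from the text; reading the training counts as epochs, the function names and the example targets are assumptions. The StepLR name matches PyTorch's `torch.optim.lr_scheduler.StepLR`, but the framework is not stated in the text, so the sketch uses only the standard library.

```python
import math

INITIAL_LR = 0.0002   # initial learning rate from the text
GAMMA = 0.98          # decay factor from the text
STEP_SIZE = 5         # learning rate is updated every 5 epochs
EPOCHS = 500          # total number of training epochs

def step_lr(epoch):
    """Learning rate used during `epoch` (0-based) under step decay."""
    return INITIAL_LR * GAMMA ** (epoch // STEP_SIZE)

def binary_cross_entropy(targets, predictions, eps=1e-12):
    """Mean binary cross-entropy between 0/1 targets and probabilities."""
    total = 0.0
    for y, p in zip(targets, predictions):
        p = min(max(p, eps), 1.0 - eps)   # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(targets)

schedule = [step_lr(e) for e in range(EPOCHS)]
final_lr = schedule[-1]   # rate in the last epoch, after 99 decay steps
loss = binary_cross_entropy([1, 0], [0.9, 0.2])
```

With a decay factor this close to 1 the rate falls gently: after 99 decay steps it is 0.98^99 times the initial value, roughly 13.5% of it.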

4. Predicting the smart grid anomaly detection and classification results with the trained network model:

After training, the weights of the model are obtained and the model enters the prediction stage.

In summary, in use, the smart grid anomaly detection and classification method of the invention uses an autoencoder network to extract features from the input power measurement data and integrates the autoencoder with the structure of a generative adversarial network. The model provides an ADCS architecture for anomaly detection, with its generator-decoder and discriminator-encoder structures. This deep learning model performs anomaly detection and classification for the smart grid: it addresses the anomaly detection problem by distinguishing five network attacks against DNP3 as well as potential anomalies in operational data (i.e. time-series power measurements), and it addresses a challenging multi-class classification problem consisting of 14 classes (13 Modbus/TCP network attacks and the normal class). Its recognition accuracy in several real smart grid evaluation environments is better than that of existing methods.

Claims

1. A smart grid anomaly detection and classification method, characterized by comprising the following steps:

step 1, acquiring a suitable training data set, which covers two cases:

2. The smart grid anomaly detection and classification method of claim 1, wherein: the generative adversarial network in step 2 relies on two neural sub-networks, a generator G and a discriminator D; the generator G takes random noise data and generates data similar to real data, while the discriminator D attempts to classify each input data sample as real or fake; the two sub-networks are trained in competition with each other, so that the generator G eventually produces data that the discriminator D cannot distinguish from real data; and the relation between G and D is expressed as follows:

$r: X \to F, \quad p: F \to P$.

3. The smart grid anomaly detection and classification method of claim 1, wherein:

$\mathrm{AdvL}(d_r, d_p) = \lVert d_r - d_p \rVert$

4. The smart grid anomaly detection and classification method of claim 1, wherein:

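$F_1 = \sigma(\mathrm{Conv}(x))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \tanh(\mathrm{Conv}(x))$

$F_1 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_2 = \mathrm{Dr}(\sigma(\mathrm{Conv}(x)))$

$F_3 = \mathrm{Conv}(x)$

$F_{out} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$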

5. The smart grid anomaly detection and classification method of claim 1, wherein:

$c_p = \arg\max(V_{label})$

where $V_{label}$ is the label vector;

$F_0 = \sigma(\mathrm{Conv}(x_t, x_f))$

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 1, 2, 3$

where $x_t$ is the noise vector and $x_f$ is the label vector;

$F_i = \sigma(\mathrm{Conv}(x)), \quad i = 0, 1, 2$

$F_{out1} = \mathrm{Softmax}(\mathrm{Conv}(x))$

$F_{out2} = \mathrm{Sigmoid}(\mathrm{Conv}(x))$

6. The smart grid anomaly detection and classification method of claim 1, wherein: the four real smart grid evaluation environments used to evaluate and verify the smart grid anomaly detection and classification method are a smart grid laboratory, a distribution substation, a hydropower station and a power plant.

7. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the smart grid anomaly detection and classification method of any one of claims 1 to 6 when the computer program is executed.

8. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program that performs the smart grid anomaly detection and classification method of any one of claims 1 to 6.