CN113902102A

CN113902102A - Non-invasive load decomposition method based on seq2seq

Info

Publication number: CN113902102A
Application number: CN202111215244.8A
Authority: CN
Inventors: 卞海红; 孙鑫; 徐懂理; 裔传仁; 高瑞阳
Original assignee: Nanjing Institute of Technology
Current assignee: Nanjing Institute of Technology
Priority date: 2021-10-19
Filing date: 2021-10-19
Publication date: 2022-01-07

Abstract

The invention provides a non-invasive load decomposition method based on seq2seq, which comprises the following steps: the first step is as follows: designing a seq2seq model; the second step is that: function extraction; performing convolution and pooling on the power sequence on a one-dimensional scale by using Conv1D, and extracting power features by means of a plurality of convolution kernels with the same weight; the third step: (3) load identification based on LSTM; the fourth step: seq2seqBCL load decomposition. Aiming at the problem that the decomposition accuracy of the existing non-invasive load decomposition method is low under the low-frequency sampling condition (1Hz or below), the invention provides a non-invasive load decomposition algorithm (seq2seq base on CNN and LSTM, seq2seq BCL) of seq2seq Based on the combination of a Convolutional Neural Network (CNN) and a long-short term memory network (LSTM). The deep learning model takes the power time series as the input of the network and performs characteristic extraction through CNN. In consideration of the time sequence of the power data, an LSTM layer is added for electrical appliance identification, and compared with a seq2seq model in NILMTK, the number of network layers is reduced, and the network structure is simplified.

Description

Non-invasive load decomposition method based on seq2seq

Technical Field

The invention belongs to the field of non-invasive load detection, and relates to a non-invasive load decomposition method based on seq2 seq.

Background

The development of Non-invasive Load Monitoring (NILM) is roughly divided into three stages: a proposing stage, a machine learning stage and a deep learning stage. Non-invasive load monitoring was first proposed by professor Hart in the 80's 20 th century, and in 1992 professor Hart first proposed a non-invasive load monitoring system. Until 2008, no scholars have proposed an integer-based planning method. Kolter et al used FHMM model for non-invasive load splitting in 2011, which achieved the best monitoring performance at the time by testing on the REDD data set. Non-intuitive Load Monitoring Toolkit (NILTK) issued in 2014, which is an open source tool specifically designed to compare energy decomposition algorithms in a repeatable manner, was the first study to compare multiple decomposition methods across multiple publicly available data sets. In 2015, a deep learning model is applied to the NILM field, students do not perform load decomposition according to the conventional four steps of data processing, event detection, feature extraction and load identification, and the steps of event detection and feature extraction are omitted through self-learning of the deep learning model. Mauch et al propose a two-layer bidirectional recurrent neural network (LSTM) architecture and a scheme combining HMM and Deep Neural Network (DNN) for load decomposition, which are improved compared with conventional FHMM, but because the data used in training the model is not sufficient, the generalization capability of the algorithm is not fully verified. The scholars in 2019 proposed a Convolutional Neural Network (CNN) based architecture that takes inputs and outputs as data sequences, while taking into account the previous state of the device to better estimate its current state. Furthermore, to better capture the correlation of the energy signal, the model gives the CNN model a recursive property. By adopting the multichannel CNN structure, additional variables related to power consumption (current, reactive power and apparent power) are added on the basis of the multichannel CNN structure, and the overall performance, the anti-noise capability and the convergence time of the system are improved. The NILM load identification algorithm has high requirement on the quality of the data set, and researchers use ENERTAK to research and discover that the performance of the NILM is greatly influenced and the decomposition precision is low when the sampling frequency of the data set is lower than 1-3 Hz.

Disclosure of Invention

1. The technical problem to be solved is as follows:

the existing non-invasive load monitoring research method only focuses on a certain point in a characteristic and time relation when a deep learning model is applied, does not consider the characteristics of characteristic extraction and electric power data based on a time sequence at the same time, and has lower decomposition precision under low-frequency data.

2. The technical scheme is as follows:

in order to solve the above problems, the present invention provides a seq2 seq-based non-intrusive load decomposition method, comprising the following steps: the first step is as follows: designing a seq2seq model; the second step is that: function extraction; performing convolution and pooling on the power sequence on a one-dimensional scale by using Conv1D, and extracting power features by means of a plurality of convolution kernels with the same weight; the third step: (3) load identification based on LSTM; the fourth step: seq2seqBCL load decomposition.

The method for designing the seq2seq model in the first step comprises the following steps: firstly, inputting the total power of the household power into a one-dimensional convolutional neural network (Conv1D) for feature self-extraction, placing the extracted distributed power features in a full connection layer with a fixed length for storage, and outputting the features integrated into a sample space to the next layer through an activation function, wherein the next layer is used for carrying out load identification on an electric appliance.

In the second step, the convolution operation is shown in the following formula.

Wherein Xi represents the input vector of the ith layer; f represents an activation function, and the introduction of the activation function can enable the model to have nonlinear processing and enhance the expression capability of the model.

Represents a convolution operation; wi represents a weight matrix of the ith layer of convolution kernel; bi represents the bias value of the weight matrix in the i-th layer convolution kernel.The distributed features are further pooled and mapped to a full connection layer to obtain a final feature vector, and the mathematical formula of the pooling operation is as follows:

X_i＝Maxpooling(X_i-1)

wherein X_iRepresenting the pooled vectors; x_i-1Representing the vector before pooling; maxpooling stands for maximum pooling operation.

In the third step, the LSTM has two transfer states: cell State (C)^t) And hidden state (H)^t) The calculation formula for each state is as follows:

C^t＝Z^f⊙C^t-1+Zⁱ⊙Z

H^t＝Z^o⊙tanh(C^t)

Y^t＝σ(W′H^t)，X^trepresenting the total power vector input at the time t; y is^tRepresenting the load output at the time t to identify an electric appliance vector; h^tAnd H^t-1Respectively representing the hidden state at the time t and the hidden state at the previous time; c^tAnd C^t-1Respectively representing the cell state at time t and the cell state at the previous time, Z^f，Zⁱ， Z^oIs three gates and Z is a new candidate vector.

Z is^f，Zⁱ，Z^oThe mathematical formula for Z is:

Z^f＝σ(W^f⊙[X^t，H^t-1]+b^f)

Z^t＝σ(Wⁱ⊙[X^t，H^t-1]+bⁱ)

Z^o＝σ(W^o⊙[X^t，H^t-1]+b^o)

Z＝tanh(W⊙[X^t，H^t-1]+ b) wherein W^f，Wⁱ，W^oW represents a weight matrix; an operation representing multiplication of two matrices; [ X ]^t，H^t-1]Represents X^tAnd H^t-1Forming a splicing matrix; σ and tanh represent activation functions. b^f，bⁱ，b^oAnd b represents an offset value.

In the fourth step, the specific steps of seq2seqBCL load decomposition are as follows: preparing data: dividing the general table data and the sub table data according to different electrical appliances, and dividing respective training sets and test sets according to the electrical appliances; training a model: training the prepared data on a seq2seqBCL model, and storing the trained model for load identification and prediction; application model: and inputting the total power into the trained seq2seqBCL model aiming at a certain electric appliance to obtain the recognition result. Beneficial effects are that:

aiming at the problem that the decomposition accuracy of the existing non-invasive load decomposition method is low under the low-frequency sampling condition (1Hz and below), the invention provides a non-invasive load decomposition algorithm (seq2seq Based on CNN and LSTM, seq2seq BCL) of seq2seq Based on the combination of a Convolutional Neural Network (CNN) and a long-short term memory network (LSTM). The deep learning model takes the power time series as the input of the network and performs characteristic extraction through CNN. In consideration of the time sequence of the power data, an LSTM layer is added for electrical appliance identification, and compared with a seq2seq model in NILMTK, the number of network layers is reduced, and the network structure is simplified.

Detailed Description

The present invention will be described in detail below.

The invention provides a non-invasive load decomposition method based on seq2seq, which comprises the following steps: the first step is as follows: designing a seq2seq model; the second step is that: function extraction; performing convolution and pooling on the power sequence on a one-dimensional scale by using Conv1D, and extracting power features by means of a plurality of convolution kernels with the same weight; the third step: (3) load identification based on LSTM; the fourth step: seq2seqBCL load decomposition.

The method specifically comprises the following steps: the first step is as follows: designing a seq2seq model; a non-intrusive load decomposition algorithm (seq2seq Based on CNN and LSTM, seq2seq BCL) Based on sequence-to-sequence of CNN and LSTM is designed, firstly, the total power of household electricity is input into a one-dimensional convolutional neural network (Conv1D) for feature self-extraction, the extracted distributed power features are placed in a full connection layer (Dense) with a fixed length for storage, and the features integrated into a sample space are output to the next layer through an activation function, so that the non-linear expression capability of the algorithm is enhanced. And the LSTM is adopted by the lower layer to carry out load identification on the electric appliance, and the value information hidden in the power data in the time relation can be mined.

The second step is that: the Conv1D is used for performing convolution and pooling on the power sequence on a one-dimensional scale, the power features are extracted by means of a plurality of convolution kernels with the same weight, the Cony1D is used for avoiding traditional manual feature extraction, and the structure has strong robustness.

The household electricity consumption data are preprocessed to obtain input vectors, convolution operation is conducted on the input vectors through convolution cores, and then distributed characteristics of the power data are obtained through an activation function. The convolution operation is shown in equation (1).

Represents a convolution operation; wi represents a weight matrix of the ith layer of convolution kernel; bi represents the bias value of the weight matrix in the i-th layer convolution kernel. The distributed features are further pooled and mapped to a full connection layer to obtain a final feature vector, and the mathematical formula of the pooling operation is as follows:

X_i＝Maxpooling(X_i-1)

The third step: load identification based on the LSTM has strong relevance to data before and after a certain time when load decomposition is performed on a power time series. In order to mine valuable information hidden in the relevance, LSTM is adopted to carry out load identification processing, and the problems of gradient disappearance and gradient explosion generated when long sequence power data are trained are solved.

In contrast to normal RNNs, LSTM has two transitive states: the method comprises a cell state (Ct) and a hidden state (Ht), wherein the Ct changes slowly in the transfer process, and the Ht can be greatly different at different layer nodes, which is a key point for solving the problems of gradient disappearance and gradient explosion when a long power sequence is trained.

X^tRepresenting the total power vector input at the time t; y is^tRepresenting the load output at the time t to identify an electric appliance vector; h^tAnd H^t-1Respectively representing the hidden state at the time t and the hidden state at the previous time; c^tAnd C^t-1Respectively representing the cell state at time t and the cell state at the previous time. At time t, the calculation formula for each state is as follows:

C^t＝Z^f⊙C^t-1+Zⁱ⊙Z

H^t＝Z^o⊙tanh(C^t)

Y^t＝σ(W′H^t)

Z^f，Zⁱ，Z^ois three gates and Z is a new candidate vector. The mathematical formula is as follows:

Z^f＝σ(W^f⊙[X^t.H^t-1]+b^f)

Zⁱ＝σ(Wⁱ⊙[X^t，H^t-1]+bⁱ)

Z^o＝σ(W^o⊙[X^t，H^t-1]+b^o)

Z＝tanh(W⊙[X^t，H^t-1]+b)

wherein W^f，Wⁱ，W^oW represents a weight matrix; an operation representing multiplication of two matrices; [ x ] of^t，H^t-1]Represents X^tAnd H^t-1Forming a splicing matrix; σ and tanh represent activation functions. b^f，bⁱ，b^oAnd b represents an offset value.

The fourth step: the seq2seqBCL load decomposition comprises the following specific steps: data preparation. And dividing the general table data and the sub table data according to different electrical appliances, and dividing respective training sets and test sets according to the electrical appliances.

And secondly, training the model. And training the prepared data on a seq2seqBCL model, and storing the trained model for load identification and prediction.

And thirdly, applying the model. And inputting the total power into the trained seq2seqBCL model aiming at a certain electric appliance to obtain the recognition result.

Claims

1. A seq2 seq-based non-intrusive load decomposition method comprises the following steps: the first step is as follows: designing a seq2seq model; the second step is that: function extraction; performing convolution and pooling on the power sequence on a one-dimensional scale by using Conv1D, and extracting power features by means of a plurality of convolution kernels with the same weight; the third step: (3) load identification based on LSTM; the fourth step: seq2seqBCL load decomposition.

2. The method of claim 1, wherein: the method for designing the seq2seq model in the first step comprises the following steps: firstly, inputting the total power of the household power into a one-dimensional convolutional neural network (Conv1D) for feature self-extraction, placing the extracted distributed power features in a full connection layer with a fixed length for storage, and outputting the features integrated into a sample space to the next layer through an activation function, wherein the next layer is used for carrying out load identification on an electric appliance.

3. The method of claim 1, wherein: in the second step, the convolution operation is represented by the following formula,

wherein Xi represents the input vector of the ith layer; f represents an activation function, the introduction of the activation function can enable the model to have nonlinear processing, enhance the expression capability of the model,

represents a convolution operation;wi represents a weight matrix of the ith layer of convolution kernel; bi represents the offset value of the weight matrix in the convolution kernel of the ith layer, the distributed features are further pooled and mapped to the full-connected layer to obtain the final feature vector, and the mathematical formula of the pooling operation is as follows:

X_i＝Maxpooling (X_i-1)

4. The method of claim 1, wherein: in the third step, the LSTM has two transfer states: cell State (C)^t) And hidden state (H)^t) The calculation formula for each state is as follows:

C^t＝Z^f⊙C^t-1+Zⁱ⊙Z

H^t＝Z^o⊙tanh(C^t)

Y^t＝σ(W′H^t)，X^trepresenting the total power vector input at the time t; y is^tRepresenting the load output at the time t to identify an electric appliance vector; h^tAnd H^t-1Respectively representing the hidden state at the time t and the hidden state at the previous time; c^tAnd C^t-1Respectively representing the cell state at time t and the cell state at the previous time, Z^f，Zⁱ，Z^oIs three gates and Z is a new candidate vector.

5. The method of claim 4, wherein: z^f，Zⁱ，Z^oThe mathematical formula for Z is: z^f＝σ(W^f⊙[X^t，H^t-1]+b^f)

Zⁱ＝σ(Wⁱ⊙[X^t，H^t-1]+bⁱ)

Z^o＝σ(W^o⊙[X^t，H^t-1]+b^o)

Z＝tanh(W⊙[X^t，H^t-1]+ b) wherein W^f，Wⁱ，W^oW represents a weight matrix; an operation representing multiplication of two matrices; [ X ]^t，H^t-1]Represents X^tAnd H^t-1Forming a splicing matrix; σ and tanh represent activation functions, b^f，bⁱ，b^oAnd b represents an offset value.

6. The method of any one of claims 1 to 5, wherein: in the fourth step, the specific steps of seq2seqBCL load decomposition are as follows: preparing data: dividing the general table data and the sub table data according to different electrical appliances, and dividing respective training sets and test sets according to the electrical appliances; training a model: training the prepared data on a seq2seqBCL model, and storing the trained model for load identification and prediction; application model: and inputting the total power into the trained seq2seqBCL model aiming at a certain electric appliance to obtain the recognition result.