CN116306864A - Method for training deep learning model of power system - Google Patents

Method for training deep learning model of power system Download PDF

Info

Publication number
CN116306864A
CN116306864A (application CN202310033698.6A)
Authority
CN
China
Prior art keywords
time
gru
architecture
encoder
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310033698.6A
Other languages
Chinese (zh)
Inventor
杨楠
叶迪
贾俊杰
黄悦华
邾玢鑫
李振华
张涛
张磊
王灿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Three Gorges University CTGU
Original Assignee
China Three Gorges University CTGU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Three Gorges University CTGU filed Critical China Three Gorges University CTGU
Priority to CN202310033698.6A priority Critical patent/CN116306864A/en
Publication of CN116306864A publication Critical patent/CN116306864A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • General Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Primary Health Care (AREA)
  • Computational Linguistics (AREA)
  • Development Economics (AREA)
  • Public Health (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Water Supply & Treatment (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Feedback Control In General (AREA)
  • Machine Translation (AREA)

Abstract

A method of training a deep learning model of a power system, comprising the steps of: step 1: constructing a loss function based on the mean absolute error (MAE) through an Encoder-Decoder architecture; step 2: using the Adam algorithm as the update algorithm for the neuron weights, so as to train each parameter of the GRU neurons in the Encoder-Decoder architecture; step 3: adaptively finding the learning rate of each parameter with the Adam algorithm. The invention addresses the technical problem that, because the unit start-stop state matrix and the output state matrix of an actual power system are high-dimensional sample matrices, training efficiency is low when a deep learning model based on the Seq2Seq technique is used to train differentiated sample data.

Description

Method for training deep learning model of power system
Technical Field
The invention belongs to the field of electric power systems and automation, and in particular to the research of unit combination decision methods based on deep learning intelligent algorithms. It is a divisional application of the data-driven unit combination intelligent decision method based on GRU and Seq2Seq technologies (application number 2019108724540).
Background
An open and mature power market usually requires an independent system operator (ISO) to maintain its safe and reliable operation, and the security-constrained unit commitment (SCUC) problem, i.e. the unit combination problem considering security constraints, is an important theoretical basis for ISO decisions. In recent years the power market has developed rapidly worldwide, which on the one hand requires the ISO to have powerful computing tools in order to maintain market operation and produce intelligent, refined day-ahead generation plans; on the other hand, with the large-scale application of new energy technologies such as electric vehicles, intermittent energy sources and demand-side management, the challenges facing ISO decision-making keep emerging. Therefore, research on SCUC decision theory with high adaptability and high precision has important theoretical and engineering significance for the development and marketization of the electric power industry.
Depending on the factors considered, current unit combination research can be broadly classified into multi-objective unit combination, uncertainty unit combination, unit combination considering diversified constraints and decision variables, and so on. Although the emphasis differs, current research on the unit combination problem generally first abstracts a mathematical model from the actual engineering problem on the basis of mechanism analysis, and then studies a corresponding mathematical method to solve it. This line of research rests on strict logical deduction and mechanism analysis and is driven by mathematical models and algorithms, so it can be called a unit combination decision method driven by physical models. Because the model itself must be modified and reconstructed whenever new problems appear, this research approach may lack applicability against a background in which the energy landscape changes daily and theoretical challenges keep arising.
In engineering practice, once a unit combination decision method is put into use, a large amount of structured historical data is accumulated, and over the long term unit combination decisions also show a certain repetitiveness. If a data-driven unit combination decision method could be provided that does not study the underlying mechanism but instead directly constructs the mapping relation between known inputs and decision results with deep learning, trains this mapping relation on massive historical decision data, and continuously corrects the model as historical data accumulate, unit combination decision-making would be endowed with self-evolution and self-learning capabilities. Such a data-driven decision method can not only greatly simplify the modelling and solving process of the unit combination problem and reduce its complexity, but can also cope with constantly emerging theoretical problems and challenges through self-learning; research in this direction, however, is still relatively rare. Addressing the time-series character of unit combination sample data, the paper "Research on a data-driven intelligent unit combination decision method with self-learning capability" adopted, for the first time, a recurrent neural network, the long short-term memory (LSTM) network, as the core training tool, successfully constructed a data-driven unit combination decision model, and verified the self-evolution characteristics of the method and its adaptability to different unit combination problems. The method in that work still has the following problems:
1) Because the LSTM model is overly complex, it not only requires large computing resources when processing high-dimensional training samples but is also prone to overfitting. In contrast, the latest improvement of the recurrent neural network, the gated recurrent unit (GRU) network, merges the input gate and the forget gate on the basis of LSTM and simplifies the memory cell, which can effectively reduce the complexity of the model;
2) If a single recurrent neural network architecture is used directly for offline training, then when it faces historical sample data of the power system with large differences (for example, greatly different load characteristics in different seasons), a single compromise mapping model is inevitably generated, so the accuracy of online decisions is difficult to guarantee. In this regard, the above work proposed the idea of clustered training: the historical scheduling data are first clustered, each class of samples is then trained separately to obtain several mapping models, and at decision time the class of the input data is identified first and the mapping model of the corresponding class is used for the online decision. Although this solves, to some extent, the fitting-accuracy problem of a single recurrent neural network architecture facing differentiated samples, it requires training several deep learning models, which greatly reduces training and decision efficiency.
As an effective means of solving sequence-type problems, the sequence-to-sequence (Seq2Seq) technique has been widely used in recent years for machine translation, intelligent question answering and the like. Unlike the conventional single recurrent neural network architecture, which reads all inputs with a single neuron and outputs the result, the Seq2Seq technique uses two recurrent neural networks to build an Encoder-Decoder architecture. The Encoder reads the input sequence step by step according to the time steps and then outputs the intermediate state C of the whole sequence. Since a recurrent neural network can record the process information of every training step, the intermediate state C can in theory represent the information of the entire input sequence. In the Decoder, the other recurrent neural network performs the inverse operation of the Encoder and decodes the intermediate state C step by step to form the final output sequence. Because the intermediate state C can fully preserve the category information of the input and output sequences and the probabilities relating them, the Seq2Seq technique is, in theory, a feasible way to solve the problem that a single recurrent neural network model cannot accurately train differentiated sample data. However, since the dimension of the unit start-stop matrix is proportional to the number of units in the system, the unit start-stop state matrix and the output state matrix of an actual power system are high-dimensional sample matrices, and directly training such data with a deep learning model based on the Seq2Seq technique leads to low training efficiency. Therefore, while introducing the Seq2Seq technique, it is also necessary to study a dimension-reduction strategy for the unit start-stop sample data so as to further improve training efficiency.
Disclosure of Invention
The invention addresses the technical problem that, because the unit start-stop state matrix and the output state matrix of an actual power system are high-dimensional sample matrices, training efficiency is low when a deep learning model based on the Seq2Seq technique is used to train differentiated sample data.
The technical scheme adopted by the invention is as follows:
The data-driven unit combination intelligent decision method based on GRU and Seq2Seq technology comprises the following steps:
1. for the high-dimensional unit combination training sample matrix, compressing the dimension of the unit combination historical decision data with a sample encoding technique;
2. introducing the Seq2Seq technique on the basis of a gated recurrent unit network, and establishing a composite neural network architecture oriented to unit combination decisions;
3. on this basis, building a unit combination deep learning model, and training on historical data a mapping model between the daily system load and the unit start-stop scheme;
4. making unit combination decisions with the generated mapping model, obtaining the unit start-stop states and unit outputs under an optimal power flow model, taking the obtained unit combination decision results as new historical sample data, and training the deep learning model with them, thereby continuously correcting the model.
In step 1, when processing the high-dimensional unit combination training sample matrix, the unit combination start-stop state vector of each period is encoded so that identical start-stop states always receive identical vector codes.
The unit combination start-stop state vector of each period is converted into its corresponding decimal code, so that the dimension of the sample matrix is compressed.
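A minimal sketch of this encoding idea is given below, assuming the start-stop state of one period is available as a 0/1 vector over the units; the function names and the example data are illustrative and are not taken from the patent.

```python
# Illustrative sketch: each period's binary unit start-stop vector is read as a
# binary number and stored as one decimal code, so identical start-stop states
# always map to the same code (e.g. a 24x54 start-stop matrix becomes 24 codes).
from typing import List

def encode_period(startstop: List[int]) -> int:
    """Map a 0/1 start-stop vector of one period to its decimal code."""
    code = 0
    for bit in startstop:          # most significant unit first
        code = code * 2 + bit
    return code

def decode_period(code: int, n_units: int) -> List[int]:
    """Inverse mapping, used when a decision result must be expanded back."""
    return [(code >> i) & 1 for i in reversed(range(n_units))]

if __name__ == "__main__":
    # hypothetical period with 6 units, units 1, 2 and 5 online
    period_state = [1, 1, 0, 0, 1, 0]
    c = encode_period(period_state)
    print(c, decode_period(c, len(period_state)))   # 50 [1, 1, 0, 0, 1, 0]
```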
In step 2, an Encoder-Decoder composite neural network architecture is constructed based on GRU and Seq2Seq technology, specifically adopting the following steps:
1) A historical mapping sample (P_L, U_G) is substituted into the Encoder-Decoder architecture, where P_L is the daily load data and U_G is the corresponding unit start-stop scheme. The Encoder architecture reads P_L step by step; the GRU neuron hidden-layer state at time t is jointly determined by the GRU neuron hidden-layer state at time t-1 and the daily load at time t, as follows:
h_t = f(h_{t-1}, P_{Lt})  (1)
wherein: h_t denotes the GRU neuron hidden-layer state at time t; h_{t-1} denotes the GRU neuron hidden-layer state at time t-1; P_{Lt} denotes the daily load input at time t;
2) In the Encoder architecture, the GRU neuron hidden-layer state h_t at time t is the same as the intermediate state of the Encoder architecture; in the Decoder architecture, the GRU neuron hidden-layer state h_k at time k is the same as the intermediate state of the Decoder architecture, as follows:
C_t = h_t,  C_k = h_k  (2)
wherein: C_t denotes the intermediate state of the Encoder architecture at time t; C_k denotes the intermediate state of the Decoder architecture at time k;
3) The intermediate state output by the Encoder architecture at time T is the intermediate state C of the input sequence, with value C_T, representing the complete information of the input sequence:
C = C_T  (3);
4) The sequence intermediate state C is input into the Decoder architecture, where the initial value C_0 of the Decoder intermediate state equals the sequence intermediate state C. After C_0 is input, the GRU neuron hidden-layer state h_k at time k is obtained; it is jointly determined by the GRU neuron hidden-layer state at time k-1 and the GRU neuron input at time k, as follows:
h_k = f(h_{k-1}, x_k)  (4)
wherein: h_{k-1} denotes the GRU neuron hidden-layer state at time k-1; x_k denotes the GRU neuron input at time k;
5) The Decoder architecture output at time k-1 is used as the GRU neuron input at time k, as follows:
x_k = U_{G,k-1}  (5)
wherein: U_{G,k-1} denotes the Decoder architecture output at time k-1;
6) Substituting equation (5) into equation (4), the Decoder architecture performs the inverse operation of the Encoder and decodes the input sequence intermediate state C step by step according to the time steps to form the final output sequence. At time k-1 the Decoder intermediate state C_{k-1} equals h_{k-1}, so the Decoder architecture output at time k is jointly determined by h_{k-1}, U_{G,k-1} and h_k, as follows:
p(U_{Gk} | U_{G1}, …, U_{G,k-1}, C) = g(h_{k-1}, U_{G,k-1}, h_k)  (6)
wherein: U_{Gk} denotes the Decoder architecture output at time k; p denotes probability; g denotes the softmax function; f denotes a conversion function;
7) With the GRU neuron input x_k at time k and the Decoder architecture intermediate state C_{k-1} at time k-1 as variables, the update gate z_k, the reset gate r_k and the pending output value h̃_k of the GRU neuron are constructed as follows:
z_k = α(W_z · [C_{k-1}, x_k])
r_k = α(W_r · [C_{k-1}, x_k])
h̃_k = tanh(W_h · [r_k ⊙ C_{k-1}, x_k])  (7)
wherein: W_r denotes the weight coefficient between x_k and r_k; W_z denotes the weight coefficient between x_k and z_k; W_h denotes the weight coefficient between x_k and h̃_k; α denotes the sigmoid activation function of the neural network;
8) Combining z_k, r_k and h̃_k gives the GRU neuron hidden-layer output h_k, as follows:
h_k = (1 - z_k) ⊙ h_{k-1} + z_k ⊙ h̃_k  (8)
wherein: h_{k-1} denotes the GRU neuron hidden-layer output at time k-1;
Through the above steps, the Encoder-Decoder composite neural network architecture is constructed.
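To make equations (7) and (8) concrete, the following is a tiny NumPy trace of one GRU step with made-up dimensions and random weights. It is an illustrative sketch, not the patent's code; in particular, using tanh for the pending output value is the standard GRU choice and is an assumption here, since the text only names the sigmoid activation α.

```python
import numpy as np

rng = np.random.default_rng(1)
n_in, n_hidden = 3, 4                           # hypothetical sizes

x_k = rng.uniform(size=n_in)                    # GRU input at time k (previous Decoder output)
C_prev = rng.uniform(size=n_hidden)             # Decoder intermediate state C_{k-1} (= h_{k-1})

W_z = rng.normal(0, 0.5, (n_hidden, n_in + n_hidden))
W_r = rng.normal(0, 0.5, (n_hidden, n_in + n_hidden))
W_h = rng.normal(0, 0.5, (n_hidden, n_in + n_hidden))

sigmoid = lambda a: 1.0 / (1.0 + np.exp(-a))

z_k = sigmoid(W_z @ np.concatenate([C_prev, x_k]))            # update gate, eq (7)
r_k = sigmoid(W_r @ np.concatenate([C_prev, x_k]))            # reset gate, eq (7)
h_tilde = np.tanh(W_h @ np.concatenate([r_k * C_prev, x_k]))  # pending output value
h_k = (1 - z_k) * C_prev + z_k * h_tilde                      # hidden-layer output, eq (8)

print(z_k.round(3), r_k.round(3), h_tilde.round(3), h_k.round(3), sep="\n")
```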
In step 3, the daily load data P_L of typical days and the corresponding unit start-stop schemes U_G are taken as historical mapping samples. In a historical mapping sample, the relation between the unit start-stop scheme U_G and the daily load P_L is described by U_G = F(p(P_L)), where p denotes the probability between the daily load and the corresponding unit start-stop scheme and F denotes a conversion function.
By accumulating historical data and performing offline training of the deep learning model based on Seq2Seq and GRU, a mapping model describing the probabilistic relation between U_G and P_L is obtained.
Training a deep learning model by adopting an Adam algorithm, and specifically adopting the following steps:
1) A loss function is constructed for the Encoder-Decoder architecture on the basis of the mean absolute error (MAE). Let the output of the Encoder-Decoder architecture at time k be U_{Gok} and the target value be U_{Gdk}; the total error E of a sample during training is:
E = (1/T) Σ_{k=1}^{T} |U_{Gok} - U_{Gdk}|  (9)
where T is the number of time steps of the output sequence;
2) The Adam algorithm is used as the update algorithm for the neuron weights so as to train each parameter of the GRU neurons in the Encoder-Decoder architecture; the basic formula is:
θ_k = θ_{k-1} - δ · m̂_k / (√(v̂_k) + ε)  (10)
wherein: θ_k is the parameter variable to be updated at step k, δ is the learning rate, ε is a small constant preventing division by zero, and m̂_k and v̂_k are the bias-corrected gradient weighted average and gradient weighted (biased) variance, calculated as:
m_k = β_1 m_{k-1} + (1 - β_1) g_k,  v_k = β_2 v_{k-1} + (1 - β_2) g_k²
m̂_k = m_k / (1 - β_1^k),  v̂_k = v_k / (1 - β_2^k)  (11)
where g_k is the gradient of the total error E with respect to the parameter, and β_1 and β_2 are exponential decay coefficients;
3) Substituting equation (11) into equation (10), the Adam algorithm adaptively finds the learning rate of each parameter, so that the three weight coefficients W_r, W_z and W_h of the GRU neurons in the Encoder-Decoder architecture are corrected as follows:
W_{r,k} = W_{r,k-1} - δ · m̂_{r,k} / (√(v̂_{r,k}) + ε)
W_{z,k} = W_{z,k-1} - δ · m̂_{z,k} / (√(v̂_{z,k}) + ε)
W_{h,k} = W_{h,k-1} - δ · m̂_{h,k} / (√(v̂_{h,k}) + ε)  (12)
Training of the Encoder-Decoder architecture is achieved through equation (12) on the basis of the continual correction of each weight coefficient.
The composite neural network architecture for unit combination decision is constructed based on GRU and Seq2Seq technology, and concretely comprises the following steps:
1) A historical mapping sample (P_L, U_G) is substituted into the Encoder-Decoder architecture. The Encoder architecture reads the daily load sequence P_L step by step; the GRU neuron hidden-layer state at time t is jointly determined by the GRU neuron hidden-layer state at time t-1 and the daily load at time t, as follows:
h_t = f(h_{t-1}, P_{Lt})  (1)
wherein: h_t denotes the GRU neuron hidden-layer state at time t; h_{t-1} denotes the GRU neuron hidden-layer state at time t-1; P_{Lt} denotes the daily load input at time t;
2) In the Encoder architecture, the GRU neuron hidden-layer state h_t at time t is the same as the intermediate state of the Encoder architecture; in the Decoder architecture, the GRU neuron hidden-layer state h_k at time k is the same as the intermediate state of the Decoder architecture, as follows:
C_t = h_t,  C_k = h_k  (2)
wherein: C_t denotes the intermediate state of the Encoder architecture at time t; C_k denotes the intermediate state of the Decoder architecture at time k;
3) The intermediate state output by the Encoder architecture at time T is the intermediate state C of the input sequence, with value C_T, representing the complete information of the input sequence:
C = C_T  (3);
4) The sequence intermediate state C is input into the Decoder architecture, where the initial value C_0 of the Decoder intermediate state equals the sequence intermediate state C. After C_0 is input, the GRU neuron hidden-layer state h_k at time k is obtained; it is jointly determined by the GRU neuron hidden-layer state at time k-1 and the GRU neuron input at time k, as follows:
h_k = f(h_{k-1}, x_k)  (4)
wherein: h_{k-1} denotes the GRU neuron hidden-layer state at time k-1; x_k denotes the GRU neuron input at time k;
5) The Decoder architecture output at time k-1 is used as the GRU neuron input at time k, as follows:
x_k = U_{G,k-1}  (5)
wherein: U_{G,k-1} denotes the Decoder architecture output at time k-1;
6) Substituting equation (5) into equation (4), the Decoder architecture performs the inverse operation of the Encoder and decodes the input sequence intermediate state C step by step according to the time steps to form the final output sequence. At time k-1 the Decoder intermediate state C_{k-1} equals h_{k-1}, so the Decoder architecture output at time k is jointly determined by h_{k-1}, U_{G,k-1} and h_k, as follows:
p(U_{Gk} | U_{G1}, …, U_{G,k-1}, C) = g(h_{k-1}, U_{G,k-1}, h_k)  (6)
wherein: U_{Gk} denotes the Decoder architecture output at time k; p denotes probability; g denotes the softmax function; f denotes a conversion function;
7) With the GRU neuron input x_k at time k and the Decoder architecture intermediate state C_{k-1} at time k-1 as variables, the update gate z_k, the reset gate r_k and the pending output value h̃_k of the GRU neuron are constructed as follows:
z_k = α(W_z · [C_{k-1}, x_k])
r_k = α(W_r · [C_{k-1}, x_k])
h̃_k = tanh(W_h · [r_k ⊙ C_{k-1}, x_k])  (7)
wherein: W_r denotes the weight coefficient between x_k and r_k; W_z denotes the weight coefficient between x_k and z_k; W_h denotes the weight coefficient between x_k and h̃_k; α denotes the sigmoid activation function of the neural network;
8) Combining z_k, r_k and h̃_k gives the GRU neuron hidden-layer output h_k, as follows:
h_k = (1 - z_k) ⊙ h_{k-1} + z_k ⊙ h̃_k  (8)
wherein: h_{k-1} denotes the GRU neuron hidden-layer output at time k-1;
Through the above steps, the Encoder-Decoder composite neural network architecture is constructed.
A method for training a deep learning model of an electric power system adopts an Adam algorithm to train the deep learning model, and specifically adopts the following steps:
1) A loss function is constructed for the Encoder-Decoder architecture on the basis of the mean absolute error (MAE). Let the output of the Encoder-Decoder architecture at time k be U_{Gok} and the target value be U_{Gdk}; the total error E of a sample during training is:
E = (1/T) Σ_{k=1}^{T} |U_{Gok} - U_{Gdk}|  (9)
where T is the number of time steps of the output sequence;
2) The Adam algorithm is used as the update algorithm for the neuron weights so as to train each parameter of the GRU neurons in the Encoder-Decoder architecture; the basic formula is:
θ_k = θ_{k-1} - δ · m̂_k / (√(v̂_k) + ε)  (10)
wherein: θ_k is the parameter variable to be updated at step k, δ is the learning rate, ε is a small constant preventing division by zero, and m̂_k and v̂_k are the bias-corrected gradient weighted average and gradient weighted (biased) variance, calculated as:
m_k = β_1 m_{k-1} + (1 - β_1) g_k,  v_k = β_2 v_{k-1} + (1 - β_2) g_k²
m̂_k = m_k / (1 - β_1^k),  v̂_k = v_k / (1 - β_2^k)  (11)
where g_k is the gradient of the total error E with respect to the parameter, and β_1 and β_2 are exponential decay coefficients;
3) Substituting equation (11) into equation (10), the Adam algorithm adaptively finds the learning rate of each parameter, so that the three weight coefficients W_r, W_z and W_h of the GRU neurons in the Encoder-Decoder architecture are corrected as follows:
W_{r,k} = W_{r,k-1} - δ · m̂_{r,k} / (√(v̂_{r,k}) + ε)
W_{z,k} = W_{z,k-1} - δ · m̂_{z,k} / (√(v̂_{z,k}) + ε)
W_{h,k} = W_{h,k-1} - δ · m̂_{h,k} / (√(v̂_{h,k}) + ε)  (12)
Training of the Encoder-Decoder architecture is achieved through equation (12) on the basis of the continual correction of each weight coefficient.
Compared with the prior art, the data-driven unit combination decision method provided by the invention has the following advantages and beneficial effects:
1) The invention constructs a GRU-based unit combination decision deep learning model, and compared with the LSTM model used in the existing literature, the training efficiency is higher;
2) The invention introduces the Seq2Seq technique on the basis of the GRU and provides an Encoder-Decoder composite neural network architecture for unit combination decisions. Compared with the prior-art approach of clustering the samples, the proposed method does not require clustering pre-processing of the sample data, and a single deep learning model can directly complete the training of all differentiated sample data, so training and decision efficiency is higher;
3) The invention provides a sample coding technology for a high-dimensional unit combination sample matrix, which effectively compresses the dimension of unit combination sample data and further improves the training efficiency of a unit combination deep learning model.
Drawings
FIG. 1 is the framework of the data-driven unit combination decision method based on the composite neural network architecture.
FIG. 2 is a schematic diagram of the sample encoding technique.
FIG. 3 is the mapping model between daily load and unit start-stop scheme.
FIG. 4 is the Encoder-Decoder composite neural network architecture.
FIG. 5 is a diagram of the internal structure of a GRU neuron.
FIG. 6 compares the training error curves of the model based on the GRU alone and of the model based on the Seq2Seq technique and the GRU.
Detailed Description
As shown in fig. 1, the intelligent decision method for the data driving unit combination based on the GRU and Seq2Seq technology is characterized by comprising the following steps:
1. for the high-dimensional unit combination training sample matrix, compressing the dimension of the unit combination historical decision data with a sample encoding technique;
2. introducing the Seq2Seq technique on the basis of a gated recurrent unit network, and establishing a composite neural network architecture oriented to unit combination decisions;
3. on this basis, building a unit combination deep learning model, and training on historical data a mapping model between the daily system load and the unit start-stop scheme;
4. making unit combination decisions with the generated mapping model, obtaining the unit start-stop states and unit outputs under an optimal power flow model, taking the obtained unit combination decision results as new historical sample data, and training the deep learning model with them, thereby continuously correcting the model.
As shown in FIG. 3, the daily load data P_L of typical days and the corresponding unit start-stop schemes U_G are taken as historical mapping samples. In a mapping sample, the relation between the unit start-stop scheme U_G and the daily load P_L can be described by U_G = F(p(P_L)); this mapping relation is illustrated in FIG. 3.
In FIG. 3, p represents the probability between the daily load and the corresponding unit start-stop scheme, and F represents a conversion function. For a daily load P_L, the mapping model is built by accumulating a large amount of historical data and performing offline training of the deep learning model based on Seq2Seq and GRU, thereby obtaining a mapping model describing the probabilistic relation between U_G and P_L.
As shown in FIG. 2, the unit combination start-stop state vector of each period is encoded, and identical start-stop states receive identical vector codes. The main purpose of sample encoding is to convert the unit combination start-stop state vector of each period into its corresponding decimal code, thereby compressing the dimension of the sample matrix and ultimately improving the training efficiency of the deep learning model; the principle is shown in FIG. 2.
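Under the assumption that a daily schedule is stored as a 24-period × n-unit 0/1 matrix (the example later in the text uses 24×54), the NumPy sketch below applies the same encoding to a whole matrix at once, collapsing it to one decimal code per period; the data are randomly generated for illustration only.

```python
import numpy as np

def encode_schedule(U: np.ndarray) -> np.ndarray:
    """Collapse a (periods x units) 0/1 start-stop matrix into one decimal code per period."""
    n_units = U.shape[1]
    weights = 2 ** np.arange(n_units - 1, -1, -1, dtype=np.int64)  # most significant unit first
    return U.astype(np.int64) @ weights                            # shape: (periods,)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    U = rng.integers(0, 2, size=(24, 54))        # hypothetical 24x54 start-stop matrix
    codes = encode_schedule(U)
    print(U.shape, "->", codes.shape)            # (24, 54) -> (24,)
```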
An Encoder-Decoder composite neural network architecture is constructed based on GRU and Seq2Seq techniques; the specific architecture is shown in FIG. 4.
The structure of the GRU neurons is shown in FIG. 5.
For the specific construction, a historical mapping sample (P_L, U_G) is substituted into the Encoder-Decoder architecture. The Encoder architecture reads the daily load sequence P_L step by step; the GRU neuron hidden-layer state at time t is jointly determined by the GRU neuron hidden-layer state at time t-1 and the daily load at time t, as follows:
h_t = f(h_{t-1}, P_{Lt})  (1)
wherein: h_t denotes the GRU neuron hidden-layer state at time t; h_{t-1} denotes the GRU neuron hidden-layer state at time t-1; P_{Lt} denotes the daily load input at time t.
According to the characteristics of the GRU model, in the Encoder architecture the GRU neuron hidden-layer state h_t at time t is the same as the intermediate state of the Encoder architecture, and in the Decoder architecture the GRU neuron hidden-layer state h_k at time k is the same as the intermediate state of the Decoder architecture, as follows:
C_t = h_t,  C_k = h_k  (2)
wherein: C_t denotes the intermediate state of the Encoder architecture at time t; C_k denotes the intermediate state of the Decoder architecture at time k.
According to the characteristics of the Encoder architecture, the intermediate state output by the Encoder architecture at time T is the intermediate state C of the input sequence, with value C_T, representing the complete information of the input sequence:
C = C_T  (3)
inputting the sequence intermediate state C into a Decoder architecture, wherein the initial value C of the Decoder intermediate state 0 The same as the intermediate state C of the sequence. C is C 0 After input, the hidden layer state h of GRU neuron at k moment can be obtained k The hidden layer state of the GRU neuron at the moment k-1 and the input of the GRU neuron at the moment k are determined together, and the specific formula is as follows:
h k =f(h k-1 ,x k ) (4)
wherein: h is a k-1 The hidden layer state of GRU neurons at the moment k-1 is represented; x is x k The GRU neuron input at time k is represented.
According to the Decoder architecture characteristics, the k-1 time Decoder architecture output is used as the input of the k time GRU neuron, and the following specific formula is as follows:
x k =U Gk-1 (5)
wherein: u (U) Gk-1 Representing the Decoder architecture output at time k-1.
Substituting equation (5) into equation (4), the Decoder architecture performs the inverse operation of the Encoder and decodes the input sequence intermediate state C step by step according to the time steps to form the final output sequence. At time k-1 the Decoder intermediate state C_{k-1} equals h_{k-1}. Therefore, the Decoder architecture output at time k is jointly determined by h_{k-1}, U_{G,k-1} and h_k, as follows:
p(U_{Gk} | U_{G1}, …, U_{G,k-1}, C) = g(h_{k-1}, U_{G,k-1}, h_k)  (6)
wherein: U_{Gk} denotes the Decoder architecture output at time k; p denotes probability; g denotes the softmax function; f denotes a conversion function.
As can be seen from FIG. 4, with the GRU neuron input x_k at time k and the Decoder architecture intermediate state C_{k-1} at time k-1 as variables, the update gate z_k, the reset gate r_k and the pending output value h̃_k of the GRU neuron are constructed as follows:
z_k = α(W_z · [C_{k-1}, x_k])
r_k = α(W_r · [C_{k-1}, x_k])
h̃_k = tanh(W_h · [r_k ⊙ C_{k-1}, x_k])  (7)
wherein: W_r denotes the weight coefficient between x_k and r_k; W_z denotes the weight coefficient between x_k and z_k; W_h denotes the weight coefficient between x_k and h̃_k; α denotes the sigmoid activation function of the neural network.
Combining z_k, r_k and h̃_k gives the GRU neuron hidden-layer output h_k, as follows:
h_k = (1 - z_k) ⊙ h_{k-1} + z_k ⊙ h̃_k  (8)
wherein: h_{k-1} denotes the GRU neuron hidden-layer output at time k-1.
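Putting equations (1)-(8) together, the sketch below runs one forward pass of such an Encoder-Decoder built from GRU cells in NumPy. It is an illustrative re-implementation under simplifying assumptions (single-layer cells, random untrained weights, a toy softmax output layer W_out, one-hot feedback of the previous output); the dimensions and names are hypothetical and this is not the patent's own code.

```python
import numpy as np

rng = np.random.default_rng(42)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GRUCell:
    """Minimal GRU cell following eqs (7)-(8): update gate, reset gate,
    pending output value, then the blended hidden-layer output."""
    def __init__(self, n_in, n_hidden):
        self.Wz = rng.normal(0, 0.1, (n_hidden, n_in + n_hidden))
        self.Wr = rng.normal(0, 0.1, (n_hidden, n_in + n_hidden))
        self.Wh = rng.normal(0, 0.1, (n_hidden, n_in + n_hidden))

    def step(self, x, h_prev):
        xh = np.concatenate([h_prev, x])
        z = sigmoid(self.Wz @ xh)                                     # update gate
        r = sigmoid(self.Wr @ xh)                                     # reset gate
        h_tilde = np.tanh(self.Wh @ np.concatenate([r * h_prev, x]))  # pending output
        return (1.0 - z) * h_prev + z * h_tilde                       # eq (8)

def encoder_decoder_forward(P_L, n_hidden=16, n_codes=8, T_out=None):
    """Encode the daily load sequence P_L into C, then decode step by step,
    feeding each output back as the next Decoder input (eqs (1)-(6))."""
    T_out = T_out or len(P_L)
    enc, dec = GRUCell(1, n_hidden), GRUCell(n_codes, n_hidden)
    W_out = rng.normal(0, 0.1, (n_codes, n_hidden))   # toy output layer before softmax

    h = np.zeros(n_hidden)
    for p in P_L:                                     # Encoder, eq (1)
        h = enc.step(np.array([p]), h)
    C = h                                             # intermediate state, eq (3)

    h, y_prev, outputs = C, np.zeros(n_codes), []     # Decoder starts from C_0 = C
    for _ in range(T_out):
        h = dec.step(y_prev, h)                       # eqs (4)-(5)
        logits = W_out @ h
        prob = np.exp(logits - logits.max()); prob /= prob.sum()   # softmax g, eq (6)
        k = int(prob.argmax())
        y_prev = np.eye(n_codes)[k]                   # previous output fed back one-hot
        outputs.append(k)
    return outputs

if __name__ == "__main__":
    daily_load = rng.uniform(0.4, 1.0, 24)            # hypothetical normalised 24-point load
    print(encoder_decoder_forward(daily_load))        # 24 decoded start-stop codes
```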
In step 4, training the deep learning model by adopting an Adam algorithm.
A loss function is constructed for the Encoder-Decoder architecture on the basis of the mean absolute error (MAE). Let the output of the Encoder-Decoder architecture at time k be U_{Gok} and the target value be U_{Gdk}; the total error E of a sample during training is:
E = (1/T) Σ_{k=1}^{T} |U_{Gok} - U_{Gdk}|  (9)
where T is the number of time steps of the output sequence.
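As a small illustration of equation (9), the sketch below computes the total training error E as the mean absolute difference between the Decoder outputs and the target start-stop codes; the array values are made up for the example.

```python
import numpy as np

def mae_loss(U_Go: np.ndarray, U_Gd: np.ndarray) -> float:
    """Mean absolute error between Decoder outputs U_Go and targets U_Gd, eq (9)."""
    return float(np.mean(np.abs(U_Go - U_Gd)))

if __name__ == "__main__":
    U_Gd = np.array([3.0, 7.0, 7.0, 2.0])   # hypothetical target codes over 4 periods
    U_Go = np.array([3.2, 6.5, 7.4, 1.8])   # hypothetical Decoder outputs
    print(mae_loss(U_Go, U_Gd))             # 0.325
```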
The Adam algorithm is used as the update algorithm for the neuron weights so as to train each parameter of the GRU neurons in the Encoder-Decoder architecture; the basic formula is:
θ_k = θ_{k-1} - δ · m̂_k / (√(v̂_k) + ε)  (10)
wherein: θ_k is the parameter variable to be updated at step k, δ is the learning rate, ε is a small constant preventing division by zero, and m̂_k and v̂_k are the bias-corrected gradient weighted average and gradient weighted (biased) variance, calculated as:
m_k = β_1 m_{k-1} + (1 - β_1) g_k,  v_k = β_2 v_{k-1} + (1 - β_2) g_k²
m̂_k = m_k / (1 - β_1^k),  v̂_k = v_k / (1 - β_2^k)  (11)
where g_k is the gradient of the total error E with respect to the parameter, and β_1 and β_2 are exponential decay coefficients.
Substituting equation (11) into equation (10), the Adam algorithm adaptively finds the learning rate of each parameter, so that the three weight coefficients W_r, W_z and W_h of the GRU neurons in the Encoder-Decoder architecture are corrected as follows:
W_{r,k} = W_{r,k-1} - δ · m̂_{r,k} / (√(v̂_{r,k}) + ε)
W_{z,k} = W_{z,k-1} - δ · m̂_{z,k} / (√(v̂_{z,k}) + ε)
W_{h,k} = W_{h,k-1} - δ · m̂_{h,k} / (√(v̂_{h,k}) + ε)  (12)
Training of the Encoder-Decoder architecture is achieved through equation (12) on the basis of the continual correction of each weight coefficient.
Examples:
In order to verify the validity and correctness of the invention, simulation tests were carried out on the IEEE 118-bus standard test system and on actual data from the Hunan power grid. In the IEEE 118-bus example, load samples for 93 typical days applicable to the IEEE 118-bus system were constructed on the basis of the daily load characteristic curves of the Hunan power grid. Among these daily load samples, samples 1-90 are used as training samples and samples 91-93 as test samples. For convenience of subsequent calculation and analysis, samples 1-90 are clustered into three clustered sample sets by the method of the literature "Research on a data-driven intelligent unit combination decision method with self-learning capability": samples 1-30 form clustered sample set 1, to which test sample 91 also belongs; samples 31-60 form clustered sample set 2, to which test sample 92 also belongs; and samples 61-90 form clustered sample set 3, to which test sample 93 also belongs.
All the unit combination deep learning models of the invention were trained and tested on the TensorFlow 1.6.0 platform. The relevant simulation calculations were carried out on a computer with an Intel Core i5-4460 processor (3.20 GHz) and 8 GB of memory.
In order to verify the correctness of the method of the invention, four methods are compared: method 1 is a unit combination decision method based on the LSTM model; method 2 is a unit combination decision method based on the GRU model; method 3 is a unit combination decision method based on the Seq2Seq technique and the GRU model; method 4 is a unit combination decision method based on the Seq2Seq technique and the GRU model, with the sample coding technique integrated.
1) Procedural simulation and correctness verification of the method
Firstly, samples 1-90 are trained with the method of the invention, and the mapping model obtained from training is used to make unit combination decisions for test samples 91-93. The resulting unit start-stop schemes are compared with those of the physical-model-driven unit combination decision method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach"; the results for test sample 91 are shown in Table 1.
Table 1. Unit start-stop schemes for test sample 91 obtained by the method of the invention and by the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach"
As can be seen from Table 1, the unit start-stop scheme obtained by the method of the invention is the same as that obtained by the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach", which shows that the invention can fully learn the mapping relation between the daily load and the unit start-stop scheme, and that the mapping model obtained from training can make correct unit start-stop decisions for any input daily load data.
For test samples 91-93, the invention solves the optimal power flow model on the basis of the obtained start-stop schemes to obtain the unit combination decision results. The unit combination decision results of the method of the invention and of the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach" are compared, and the total costs are shown in Table 2.
Table 2. Comparison of the total cost of the method of the invention and the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach"
As can be seen from Table 2, the unit output scheme and total cost obtained by the method of the invention are the same as those of the literature method. This shows that, after the unit start-stop scheme is obtained, solving the optimal power flow model yields the same unit output scheme as the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach".
This is because the unit start-stop scheme U_G is a decision variable in both the method of the invention and the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach", and the same optimal power flow model is used by both when solving for the unit output scheme. Therefore, for the same unit start-stop scheme U_G, the two methods obtain the same unit output scheme P_G. The self-learning and self-evolution capability of the data-driven unit combination intelligent decision method and its applicability to different types of unit combination problems have already been verified in the literature "Research on a data-driven intelligent unit combination decision method with self-learning capability" and are not repeated here. The decision accuracy in the subsequent examples always denotes the coincidence between the unit start-stop scheme obtained by the method of the invention and the unit start-stop scheme determined by the method of the literature "Network-Constrained AC Unit Commitment Under Uncertainty: A Benders' Decomposition Approach".
2) The method introduces the validity verification of the GRU model
In order to verify the effectiveness of introducing the GRU model in the invention, methods 1 and 2 are trained with the training samples after clustering pre-processing, with the number of training iterations set to 500, and the two methods are then used to solve test samples 91-93; the results are shown in Table 3.
Table 3 decision accuracy versus training time for methods 1 and 2
As can be seen from Table 3, in terms of decision accuracy, method 2 reaches 100% on all three test samples, whereas the decision accuracy of method 1 on sample 93 is below 100% and its total cost is higher than that of method 2. This shows that after 500 training iterations method 2 can already generate an accurate mapping model for every clustered training sample set, while method 1 cannot yet generate an accurate mapping model for clustered sample set 3 and needs more training iterations. In terms of training time, method 2 is faster than method 1 by 77 s, 91 s and 82 s on the three clustered sample sets respectively, indicating that for the same number of training iterations method 2 requires less training time.
The main reason is that the GRU merges the input gate and the forget gate of the LSTM into an update gate, adds a reset gate, and simplifies the memory cell so that the output can be computed directly; the overall structure of the model is therefore simpler, and higher training and decision accuracy is obtained under the same training parameters. It follows that constructing the unit combination decision deep learning model with GRUs instead of LSTMs is correct and effective.
3) The method introduces the validity verification of the Seq2Seq technology
In order to verify the effectiveness of introducing the Seq2Seq technique in the method of the invention, the samples with clustering pre-processing and the samples without clustering pre-processing are respectively used as training samples for method 2 and method 3, and the two methods are used to solve test samples 91-93; the results are shown in Table 4.
Table 4 decision accuracy and training time comparison for methods 2 and 3
As can be seen from Table 4, in terms of decision accuracy, if method 2 is trained directly without clustering pre-processing of the unit combination training samples, its decision accuracy after training is generally below 90%, whereas training method 2 with clustering-pre-processed samples raises its decision accuracy to 100%. Method 3 behaves differently: even when trained with samples that have not undergone clustering pre-processing, it still reaches 100% decision accuracy. This indicates that, once the Seq2Seq technique is introduced, a single deep learning model can complete the training of all the differentiated sample data. The reason is that a single deep learning model built directly from the GRU can hardly avoid producing a single compromise mapping model when faced with training sample data that differ greatly, so the accuracy of online decisions is difficult to guarantee; to ensure online decision accuracy, the training samples must first be clustered and each class trained with its own deep learning model. If instead the Seq2Seq technique is introduced to build a GRU-based Encoder-Decoder composite neural network architecture, the intermediate state C can fully preserve the category information of the input and output sequences and the probabilities relating them, so accurate training of all the differentiated samples can be achieved with a single deep learning model.
In terms of training time, method 2 trained with non-clustered samples requires the least total time. Method 3 trained with non-clustered samples requires 110.02 s more total time than the former, but 179.23 s less than method 2 trained with clustered samples. The reason is that when method 2 is trained with non-clustered samples it cannot converge to the most accurate mapping model and the training process terminates prematurely, so its overall training time is short; when method 2 is trained with clustered samples, several deep learning models must be trained and the clustering pre-processing itself also takes time, so the total training time in that case is the longest.
To further analyse the source of the time difference between methods 2 and 3, the actual convergence curves of the two methods when trained with the non-clustered samples are shown in FIG. 6.
As can be seen from FIG. 6, the total training error of method 2 essentially converges to about 0.09 once the number of training iterations exceeds 100 and cannot be reduced further, so its training process ends early, whereas the error of method 3 finally converges to about 0.0002 after more than 100 iterations. It follows that method 3 needs longer to train the non-clustered samples because it performs more training iterations, while method 2, although its training time is shorter, cannot guarantee the training accuracy of the model.
In summary, if method 2 is trained directly with training samples that have not undergone clustering pre-processing, the decision accuracy of the model is difficult to guarantee. Introducing a clustering pre-processing strategy solves the training-accuracy problem of method 2 in the face of differentiated training samples, but greatly increases the complexity of its offline training and hence the required offline training time and total time. Method 3, by introducing the Seq2Seq technique, can accurately train the differentiated samples with only a single deep learning model; the training process is simpler, and the training and decision efficiency of the deep learning model is improved while the training accuracy is guaranteed.
4) The method introduces the validity verification of the sample coding technology
To verify the effect of the sample coding technique proposed by the invention on the training efficiency of the deep learning model, method 3 and method 4 are each trained with the non-clustered training samples; the training and test results are shown in Table 5.
Table 5 decision accuracy and training time comparison for methods 3 and 4
As can be seen from Table 5, after training, methods 3 and 4 have the same decision accuracy, reaching 100% on all three test samples, but the training time of method 4 is 351 s shorter than that of method 3. This is because the sample coding technique proposed by the invention directly compresses the data dimension of the training samples, reducing the unit start-stop state matrix of one training sample from 24×54 to 24×1, which directly reduces the number of variables to be computed during training of the deep learning model and hence its training time. Therefore, although the sample coding process itself takes some time, introducing the sample coding technique effectively reduces the overall training time of the deep learning model.
In summary, the sample coding technique proposed by the invention effectively compresses the data dimension of the unit combination training samples and directly reduces the number of variables to be computed during training, so the training time of the deep learning model is effectively reduced while its training accuracy is maintained. Compared with the LSTM neural network adopted in the literature ("Research on a data-driven intelligent unit combination decision method with self-learning capability"), the GRU model introduced by the invention achieves higher training and decision accuracy under the same training parameters. By introducing the Seq2Seq technique, the invention constructs an Encoder-Decoder composite neural network architecture with GRU neurons, so that accurate training of the differentiated samples can be achieved with only a single deep learning model; the training process is simpler, and the training and decision efficiency of the method is improved while its training accuracy is guaranteed.

Claims (4)

1. A method for training a deep learning model of an electric power system, characterized in that the deep learning model is trained with the Adam algorithm, specifically comprising the following steps:
step 1: constructing a loss function based on an average absolute error MAE by using an Encoder-Decoder architecture, and setting the output of the Encoder-Decoder architecture at the k moment as U Gok The target value is U Gdk The total error E of the sample during training is shown as follows:
Figure QLYQS_1
step 2: the Adam algorithm is used as an updating algorithm of the neuron weight to realize training of each parameter of GRU neurons in an Encoder-Decoder architecture, and the basic formula is shown as follows;
Figure QLYQS_2
wherein: θ k The parameter variable to be updated at the moment k, delta is the learning rate,
Figure QLYQS_3
and->
Figure QLYQS_4
The specific calculation formula of the gradient weighted average value after error correction and the gradient weighted biased variance is shown as follows:
Figure QLYQS_5
step 3: substituting the formula (11) into the formula (10), and adaptively searching the learning rate of each parameter by utilizing an Adam algorithm to realize W in GRU neurons in an Encoder-Decoder architecture r 、W z W is provided h The correction of the three weight coefficients is specifically as follows:
Figure QLYQS_6
training of the Encoder-Decoder architecture is achieved by equation (12) based on the constant correction of each weight coefficient.
2. The method of claim 1, wherein the deep learning model is obtained on the basis of a composite neural network architecture oriented to unit combination decisions, the architecture being an Encoder-Decoder composite neural network architecture constructed based on the GRU and Seq2Seq techniques.
3. The method according to claim 1, wherein the construction of the composite neural network architecture oriented to unit combination decisions comprises the following steps:
1) a historical mapping sample (P_L, U_G) is substituted into the Encoder-Decoder architecture; the Encoder architecture reads the daily load sequence P_L step by step, and the GRU neuron hidden-layer state at time t is jointly determined by the GRU neuron hidden-layer state at time t-1 and the daily load at time t, as follows:
h_t = f(h_{t-1}, P_{Lt})  (1)
wherein: h_t denotes the GRU neuron hidden-layer state at time t; h_{t-1} denotes the GRU neuron hidden-layer state at time t-1; P_{Lt} denotes the daily load input at time t;
2) in the Encoder architecture, the GRU neuron hidden-layer state h_t at time t is the same as the intermediate state of the Encoder architecture; in the Decoder architecture, the GRU neuron hidden-layer state h_k at time k is the same as the intermediate state of the Decoder architecture, as follows:
C_t = h_t,  C_k = h_k  (2)
wherein: C_t denotes the intermediate state of the Encoder architecture at time t; C_k denotes the intermediate state of the Decoder architecture at time k;
3) the intermediate state output by the Encoder architecture at time T is the intermediate state C of the input sequence, with value C_T, representing the complete information of the input sequence:
C = C_T  (3)
4) the sequence intermediate state C is input into the Decoder architecture, where the initial value C_0 of the Decoder intermediate state equals the sequence intermediate state C; after C_0 is input, the GRU neuron hidden-layer state h_k at time k is obtained, jointly determined by the GRU neuron hidden-layer state at time k-1 and the GRU neuron input at time k, as follows:
h_k = f(h_{k-1}, x_k)  (4)
wherein: h_{k-1} denotes the GRU neuron hidden-layer state at time k-1; x_k denotes the GRU neuron input at time k;
5) the Decoder architecture output at time k-1 is used as the GRU neuron input at time k, as follows:
x_k = U_{G,k-1}  (5)
wherein: U_{G,k-1} denotes the Decoder architecture output at time k-1;
6) substituting equation (5) into equation (4), the Decoder architecture performs the inverse operation of the Encoder and decodes the input sequence intermediate state C step by step according to the time steps to form the final output sequence; at time k-1 the Decoder intermediate state C_{k-1} equals h_{k-1}, so the Decoder architecture output at time k is jointly determined by h_{k-1}, U_{G,k-1} and h_k, as follows:
p(U_{Gk} | U_{G1}, …, U_{G,k-1}, C) = g(h_{k-1}, U_{G,k-1}, h_k)  (6)
wherein: U_{Gk} denotes the Decoder architecture output at time k; p denotes probability; g denotes the softmax function; f denotes a conversion function;
7) with the GRU neuron input x_k at time k and the Decoder architecture intermediate state C_{k-1} at time k-1 as variables, the update gate z_k, the reset gate r_k and the pending output value h̃_k of the GRU neuron are constructed as follows:
z_k = α(W_z · [C_{k-1}, x_k])
r_k = α(W_r · [C_{k-1}, x_k])
h̃_k = tanh(W_h · [r_k ⊙ C_{k-1}, x_k])  (7)
wherein: W_r denotes the weight coefficient between x_k and r_k; W_z denotes the weight coefficient between x_k and z_k; W_h denotes the weight coefficient between x_k and h̃_k; α denotes the sigmoid activation function of the neural network;
8) Combining z_k, r_k and \tilde{h}_k yields the GRU neuron hidden-layer output h_k, as given by:
h_k = (1 - z_k) \odot h_{k-1} + z_k \odot \tilde{h}_k    (8)
wherein: h_{k-1} represents the GRU neuron hidden-layer output at time k-1.
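The eight steps above can be illustrated with a minimal, self-contained NumPy sketch of the GRU-based Encoder-Decoder forward pass. All dimensions, the random weight initialisation, the zero-valued first decoder input and every identifier name (GRUCell, Seq2SeqUC, and so on) are assumptions introduced here for illustration; they are not part of the claimed method, and the sketch omits training entirely.

import numpy as np

# Illustrative (assumed) dimensions: scalar daily load in, 3 unit states out.
INPUT_DIM, HIDDEN_DIM, OUTPUT_DIM = 1, 8, 3
rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GRUCell:
    # Gates built from the previous state and the current input, as in eq. (7)-(8).
    def __init__(self, input_dim, hidden_dim):
        d = input_dim + hidden_dim
        self.W_z = rng.normal(scale=0.1, size=(hidden_dim, d))  # update-gate weights
        self.W_r = rng.normal(scale=0.1, size=(hidden_dim, d))  # reset-gate weights
        self.W_h = rng.normal(scale=0.1, size=(hidden_dim, d))  # candidate weights

    def step(self, h_prev, x):
        v = np.concatenate([h_prev, x])
        z = sigmoid(self.W_z @ v)                                # update gate z_k
        r = sigmoid(self.W_r @ v)                                # reset gate r_k
        h_tilde = np.tanh(self.W_h @ np.concatenate([r * h_prev, x]))  # pending output value
        return (1.0 - z) * h_prev + z * h_tilde                  # combination of eq. (8)

class Seq2SeqUC:
    # Encoder reads the load sequence (eq. 1); its final state serves as C = C_T (eq. 3);
    # the decoder unrolls from C, feeding its previous output back as x_k (eq. 4-6).
    def __init__(self):
        self.encoder = GRUCell(INPUT_DIM, HIDDEN_DIM)
        self.decoder = GRUCell(OUTPUT_DIM, HIDDEN_DIM)
        self.W_out = rng.normal(scale=0.1, size=(OUTPUT_DIM, HIDDEN_DIM))

    def forward(self, load_seq, horizon):
        h = np.zeros(HIDDEN_DIM)
        for p_lt in load_seq:                                    # eq. (1): h_t = f(h_{t-1}, P_Lt)
            h = self.encoder.step(h, np.atleast_1d(p_lt))
        c = h                                                    # eq. (3): C = C_T
        u_prev = np.zeros(OUTPUT_DIM)                            # assumed all-zero first decoder input
        outputs = []
        for _ in range(horizon):                                 # eq. (5): x_k = U_{G,k-1}
            c = self.decoder.step(c, u_prev)                     # C_k = h_k, as in eq. (2)
            scores = self.W_out @ c
            u_prev = np.exp(scores) / np.exp(scores).sum()       # softmax g of eq. (6)
            outputs.append(u_prev)
        return np.stack(outputs)

# Toy usage: a 24-point daily load curve in, 24 decision steps out.
model = Seq2SeqUC()
print(model.forward(np.sin(np.linspace(0.0, 2.0 * np.pi, 24)), horizon=24).shape)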
4. The composite neural network architecture for unit combination decision is characterized by being an Encoder-Decoder composite neural network architecture constructed based on GRU and Seq2Seq technology, and specifically comprises the following steps:
step S1: a history map sample (P L ,U G ) Substituting into the Encoder-Decode architecture, the Encoder architecture loads the sequence of daily loads P L The GRU neuron hidden layer state at the time t is jointly determined by the GRU neuron hidden layer state at the time t-1 and the daily load at the time t by steps, and the specific formula is as follows:
h t =f(h t-1 ,P Lt ) (1)
wherein: h is a t The hidden layer state of the GRU neuron at the moment t is represented; h is a t-1 The hidden layer state of GRU neurons at the time t-1 is represented; p (P) Lt The daily load input at the time t is represented;
step S2: Making the GRU neuron hidden-layer state h_t at time t in the Encoder architecture equal to the intermediate state of the Encoder architecture, and likewise making the GRU neuron hidden-layer state h_k at time k in the Decoder architecture equal to the intermediate state of the Decoder architecture, as given by:
C_t = h_t ,  C_k = h_k    (2)
wherein: C_t represents the intermediate state of the Encoder architecture at time t; C_k represents the intermediate state of the Decoder architecture at time k;
step S3: the intermediate state of the output of the Encoder framework at the moment T is the intermediate state C of the input sequence, and the value is C T Representing the complete information of the input sequence, specifically the following formula:
C=C T (3)
step S4: inputting the sequence intermediate state C into a Decoder architecture, wherein the initial value C of the Decoder intermediate state 0 Like the intermediate state C of the sequence, will C 0 After input, the hidden layer state h of GRU neuron at k moment can be obtained k The hidden layer state of the GRU neuron at the moment k-1 and the input of the GRU neuron at the moment k are determined together, and the specific formula is as follows:
h k =f(h k-1 ,x k ) (4)
wherein: h is a k-1 The hidden layer state of GRU neurons at the moment k-1 is represented; x is x k A GRU neuron input at time k is represented;
step S5: the k-1 time Decoder architecture output will be used as the k time GRU neuron input, specifically as follows:
x k =U Gk-1 (5)
wherein: u (U) Gk-1 The Decoder architecture output at time k-1 is represented;
step S6: substituting the formula (5) into the formula (4), and simultaneously, executing the operation opposite to the operation of the Encoder by the Decoder framework, and performing step-by-step decoding on the input sequence intermediate state C according to the time step to form a final output sequence, wherein the time of k-1 is the intermediate state C of the Decoder framework k-1 And h k-1 Equal, the Decoder architecture output at time k is defined by h k-1 、U Gk-1 H k The common decision is specifically described as follows:
Figure QLYQS_15
wherein: u (U) Gk The Decoder architecture output at the time k is represented; p represents probability; g represents a softmax function; f represents a conversion function;
step S7: input x with GRU neuron at time k k And k-1 time Decoder architecture intermediate state C k-1 Construction of update gate z in GRU neurons for variables k Reset gate r k Pending output value
Figure QLYQS_16
The concrete model of the three is as follows:
Figure QLYQS_17
wherein: w (W) r Represents x k And r k Weight coefficient between; w (W) z Represents x k And z k Weight coefficient between; w (W) h Represents x k And
Figure QLYQS_18
weight coefficient between; alpha represents an activation function sigmoid in the neural network;
step S8: will z k 、r k and
Figure QLYQS_19
The three are combined to obtain the output h of the hidden layer of the GRU neuron k The specific formula is as follows:
Figure QLYQS_20
wherein: h is a k-1 The GRU neuron hidden layer output at the moment k-1 is represented;
through the steps, an Encoder-Decode composite neural network architecture is constructed.
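Equivalently, and only as a hedged sketch under the same assumptions, an architecture of the kind described in claim 4 could be assembled from library GRU modules. The PyTorch module below is illustrative: the class name EncoderDecoderUC, the hidden size, the number of units, and the per-unit sigmoid output head (a deliberate substitution for the softmax g of equation (6), so that each unit's on/off state receives its own probability) are assumptions, not the claimed architecture.

import torch
import torch.nn as nn

class EncoderDecoderUC(nn.Module):
    # Illustrative Encoder-Decoder: the encoder GRU reads the daily load sequence,
    # its final hidden state plays the role of C = C_T, and the decoder GRU unrolls
    # from it, feeding each output back as the next input.
    def __init__(self, n_units=3, hidden=32):
        super().__init__()
        self.encoder = nn.GRU(input_size=1, hidden_size=hidden, batch_first=True)
        self.decoder = nn.GRU(input_size=n_units, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_units)  # maps h_k to per-unit commitment scores

    def forward(self, load_seq, horizon):
        # load_seq: (batch, T, 1) daily load P_L; returns (batch, horizon, n_units).
        _, c = self.encoder(load_seq)                                  # final encoder state = C
        x = torch.zeros(load_seq.size(0), 1, self.head.out_features)  # assumed first decoder input
        outs = []
        for _ in range(horizon):                                       # x_k = U_{G,k-1}
            y, c = self.decoder(x, c)
            u = torch.sigmoid(self.head(y))                            # per-unit probability
            outs.append(u)
            x = u
        return torch.cat(outs, dim=1)

# Toy usage: two sample days of 24 load points each, 24 decision steps out.
model = EncoderDecoderUC()
print(model(torch.randn(2, 24, 1), horizon=24).shape)  # torch.Size([2, 24, 3])

In practice such a module would be trained on the historical (P_L, U_G) mapping samples referred to in the claims; the loss function and training loop are outside the scope of this sketch.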
CN202310033698.6A 2019-09-16 2019-09-16 Method for training deep learning model of power system Pending CN116306864A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310033698.6A CN116306864A (en) 2019-09-16 2019-09-16 Method for training deep learning model of power system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910872454.0A CN110674459B (en) 2019-09-16 2019-09-16 Data driving type unit combination intelligent decision-making method based on GRU and Seq2Seq technology
CN202310033698.6A CN116306864A (en) 2019-09-16 2019-09-16 Method for training deep learning model of power system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201910872454.0A Division CN110674459B (en) 2019-09-16 2019-09-16 Data driving type unit combination intelligent decision-making method based on GRU and Seq2Seq technology

Publications (1)

Publication Number Publication Date
CN116306864A true CN116306864A (en) 2023-06-23

Family

ID=69077953

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202310033698.6A Pending CN116306864A (en) 2019-09-16 2019-09-16 Method for training deep learning model of power system
CN201910872454.0A Active CN110674459B (en) 2019-09-16 2019-09-16 Data driving type unit combination intelligent decision-making method based on GRU and Seq2Seq technology

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201910872454.0A Active CN110674459B (en) 2019-09-16 2019-09-16 Data driving type unit combination intelligent decision-making method based on GRU and Seq2Seq technology

Country Status (1)

Country Link
CN (2) CN116306864A (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111426933A (en) * 2020-05-19 2020-07-17 浙江巨磁智能技术有限公司 Safety type power electronic module and safety detection method thereof
CN113393119B (en) * 2021-06-11 2022-08-30 河海大学 Stepped hydropower short-term scheduling decision method based on scene reduction-deep learning
CN113420508B (en) * 2021-07-07 2024-02-27 华北电力大学 Unit combination calculation method based on LSTM
CN113408648B (en) * 2021-07-07 2024-08-23 华北电力大学 Unit combination calculation method combined with deep learning
CN117291109B (en) * 2023-11-24 2024-04-09 中汽研汽车检验中心(广州)有限公司 Modelica fluid model intelligent prediction method
CN117439146B (en) * 2023-12-06 2024-03-19 广东车卫士信息科技有限公司 Data analysis control method and system for charging pile

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11009536B2 (en) * 2016-10-05 2021-05-18 Telecom Italia S.P.A. Method and system for estimating energy generation based on solar irradiance forecasting
CN110070224A (en) * 2019-04-20 2019-07-30 北京工业大学 A kind of Air Quality Forecast method based on multi-step recursive prediction

Also Published As

Publication number Publication date
CN110674459A (en) 2020-01-10
CN110674459B (en) 2023-03-10

Similar Documents

Publication Publication Date Title
CN116306864A (en) Method for training deep learning model of power system
US11409270B1 (en) Optimization decision-making method of industrial process fusing domain knowledge and multi-source data
CN111832825B (en) Wind power prediction method and system integrating long-term memory network and extreme learning machine
CN111159638A (en) Power distribution network load missing data recovery method based on approximate low-rank matrix completion
CN111191856A (en) Regional comprehensive energy system multi-energy load prediction method considering time sequence dynamic characteristics and coupling characteristics
CN113988449A (en) Wind power prediction method based on Transformer model
CN115481788B (en) Phase change energy storage system load prediction method and system
CN116957698A (en) Electricity price prediction method based on improved time sequence mode attention mechanism
CN111198550A (en) Cloud intelligent production optimization scheduling on-line decision method and system based on case reasoning
CN106453294A (en) Security situation prediction method based on niche technology with fuzzy elimination mechanism
CN115409258A (en) Hybrid deep learning short-term irradiance prediction method
CN116384572A (en) Sequence-to-sequence power load prediction method based on multidimensional gating circulating unit
CN114817773A (en) Time sequence prediction system and method based on multi-stage decomposition and fusion
CN108694480A (en) Finance data prediction technique based on improved length memory network in short-term
CN117709540A (en) Short-term bus load prediction method and system for identifying abnormal weather
CN116843057A (en) Wind power ultra-short-term prediction method based on LSTM-ViT
Guo et al. Short-term EV charging load forecasting based on GA-GRU model
CN113762591B (en) Short-term electric quantity prediction method and system based on GRU and multi-core SVM countermeasure learning
CN113112089A (en) Power consumption prediction method and prediction system for cement raw material grinding system
Cai et al. Short-term forecasting of user power load in China based on XGBoost
CN110674460B (en) E-Seq2Seq technology-based data driving type unit combination intelligent decision method
CN117556949A (en) Traffic prediction method based on continuous evolution graph nerve controlled differential equation
CN116613740A (en) Intelligent load prediction method based on transform and TCN combined model
Li et al. A Novel Short-term Load Forecasting Model by TCN-LSTM Structure with Attention Mechanism
CN110909254A (en) Method and system for predicting question popularity of question-answering community based on deep learning model

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination